RU2806550C1 - Strains and methods of obtaining protein of single-cellular organisms or biomass - Google Patents
Strains and methods of obtaining protein of single-cellular organisms or biomass Download PDFInfo
- Publication number
- RU2806550C1 RU2806550C1 RU2022110524A RU2022110524A RU2806550C1 RU 2806550 C1 RU2806550 C1 RU 2806550C1 RU 2022110524 A RU2022110524 A RU 2022110524A RU 2022110524 A RU2022110524 A RU 2022110524A RU 2806550 C1 RU2806550 C1 RU 2806550C1
- Authority
- RU
- Russia
- Prior art keywords
- ala
- leu
- gly
- val
- glu
- Prior art date
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 144
- 238000000034 method Methods 0.000 title claims abstract description 114
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 81
- 239000002028 Biomass Substances 0.000 title claims abstract description 42
- 230000001580 bacterial effect Effects 0.000 claims abstract description 76
- 235000013305 food Nutrition 0.000 claims abstract description 22
- 241000589506 Xanthobacter Species 0.000 claims abstract description 13
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 claims description 38
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 claims description 36
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 33
- 229910052799 carbon Inorganic materials 0.000 claims description 33
- 108010020056 Hydrogenase Proteins 0.000 claims description 23
- 229910052739 hydrogen Inorganic materials 0.000 claims description 23
- 239000001257 hydrogen Substances 0.000 claims description 22
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 claims description 20
- BAWFJGJZGIEFAR-NNYOXOHSSA-O NAD(+) Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-O 0.000 claims description 19
- 229910002092 carbon dioxide Inorganic materials 0.000 claims description 19
- 239000001569 carbon dioxide Substances 0.000 claims description 18
- 108010020943 Nitrogenase Proteins 0.000 claims description 17
- 238000004519 manufacturing process Methods 0.000 claims description 17
- 108010038629 Molybdoferredoxin Proteins 0.000 claims description 16
- 239000011734 sodium Substances 0.000 claims description 16
- 229910052757 nitrogen Inorganic materials 0.000 claims description 15
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 claims description 14
- 229910052500 inorganic mineral Inorganic materials 0.000 claims description 14
- 239000002609 medium Substances 0.000 claims description 14
- 239000011707 mineral Substances 0.000 claims description 14
- HBELESVMOSDEOV-UHFFFAOYSA-N [Fe].[Mo] Chemical compound [Fe].[Mo] HBELESVMOSDEOV-UHFFFAOYSA-N 0.000 claims description 13
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 claims description 12
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 claims description 11
- 229910052760 oxygen Inorganic materials 0.000 claims description 11
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 claims description 10
- 239000001301 oxygen Substances 0.000 claims description 10
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 claims description 9
- 150000003841 chloride salts Chemical class 0.000 claims description 9
- 108020004465 16S ribosomal RNA Proteins 0.000 claims description 7
- 229910052742 iron Inorganic materials 0.000 claims description 7
- YAHZABJORDUQGO-NQXXGFSBSA-N D-ribulose 1,5-bisphosphate Chemical compound OP(=O)(O)OC[C@@H](O)[C@@H](O)C(=O)COP(O)(O)=O YAHZABJORDUQGO-NQXXGFSBSA-N 0.000 claims description 6
- 229910021529 ammonia Inorganic materials 0.000 claims description 6
- 239000004202 carbamide Substances 0.000 claims description 6
- 229910001873 dinitrogen Inorganic materials 0.000 claims description 6
- 108010083084 nickel-iron-selenium hydrogenase Proteins 0.000 claims description 6
- 108090000417 Oxygenases Proteins 0.000 claims description 5
- 102000004020 Oxygenases Human genes 0.000 claims description 5
- -1 manganese, boron salts Chemical class 0.000 claims description 5
- 229940088594 vitamin Drugs 0.000 claims description 5
- 229930003231 vitamin Natural products 0.000 claims description 5
- 235000013343 vitamin Nutrition 0.000 claims description 5
- 239000011782 vitamin Substances 0.000 claims description 5
- 108050003295 ATP synthase protein I Proteins 0.000 claims description 4
- 229910002651 NO3 Inorganic materials 0.000 claims description 4
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 claims description 4
- NHNBFGGVMKEFGY-UHFFFAOYSA-N Nitrate Chemical compound [O-][N+]([O-])=O NHNBFGGVMKEFGY-UHFFFAOYSA-N 0.000 claims description 4
- FKNQFGJONOIPTF-UHFFFAOYSA-N Sodium cation Chemical compound [Na+] FKNQFGJONOIPTF-UHFFFAOYSA-N 0.000 claims description 4
- 101150026213 atpB gene Proteins 0.000 claims description 4
- 101150099875 atpE gene Proteins 0.000 claims description 4
- 101150018639 atpFH gene Proteins 0.000 claims description 4
- 101150048329 atpH gene Proteins 0.000 claims description 4
- 101150105046 atpI gene Proteins 0.000 claims description 4
- 239000007788 liquid Substances 0.000 claims description 4
- 108010015857 nitrogenase reductase Proteins 0.000 claims description 4
- 229910001415 sodium ion Inorganic materials 0.000 claims description 4
- 101710201407 ATP synthase F(0) complex subunit B1, mitochondrial Proteins 0.000 claims description 3
- 102100023619 ATP synthase F(0) complex subunit B1, mitochondrial Human genes 0.000 claims description 3
- 101710115740 ATP synthase subunit 4, mitochondrial Proteins 0.000 claims description 3
- 101710114068 ATP synthase subunit b Proteins 0.000 claims description 3
- 101710193701 ATP synthase subunit delta Proteins 0.000 claims description 3
- 229910019142 PO4 Inorganic materials 0.000 claims description 3
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 claims description 3
- 150000003863 ammonium salts Chemical class 0.000 claims description 3
- 238000001035 drying Methods 0.000 claims description 3
- 239000001963 growth medium Substances 0.000 claims description 3
- 235000021317 phosphate Nutrition 0.000 claims description 3
- 239000011591 potassium Substances 0.000 claims description 3
- 229910052700 potassium Inorganic materials 0.000 claims description 3
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 claims description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 claims description 2
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 claims description 2
- FYYHWMGAXLPEAU-UHFFFAOYSA-N Magnesium Chemical compound [Mg] FYYHWMGAXLPEAU-UHFFFAOYSA-N 0.000 claims description 2
- ZOKXTWBITQBERF-UHFFFAOYSA-N Molybdenum Chemical compound [Mo] ZOKXTWBITQBERF-UHFFFAOYSA-N 0.000 claims description 2
- BUGBHKTXTAQXES-UHFFFAOYSA-N Selenium Chemical compound [Se] BUGBHKTXTAQXES-UHFFFAOYSA-N 0.000 claims description 2
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 claims description 2
- 229910052791 calcium Inorganic materials 0.000 claims description 2
- 239000011575 calcium Substances 0.000 claims description 2
- 239000010941 cobalt Substances 0.000 claims description 2
- 229910017052 cobalt Inorganic materials 0.000 claims description 2
- GUTLYIVDDKVIGB-UHFFFAOYSA-N cobalt atom Chemical compound [Co] GUTLYIVDDKVIGB-UHFFFAOYSA-N 0.000 claims description 2
- 229910052802 copper Inorganic materials 0.000 claims description 2
- 239000010949 copper Substances 0.000 claims description 2
- PNDPGZBMCMUPRI-UHFFFAOYSA-N iodine Chemical compound II PNDPGZBMCMUPRI-UHFFFAOYSA-N 0.000 claims description 2
- 239000011777 magnesium Substances 0.000 claims description 2
- 229910052749 magnesium Inorganic materials 0.000 claims description 2
- 229910052750 molybdenum Inorganic materials 0.000 claims description 2
- 239000011733 molybdenum Substances 0.000 claims description 2
- 229910052759 nickel Inorganic materials 0.000 claims description 2
- 229910052711 selenium Inorganic materials 0.000 claims description 2
- 239000011669 selenium Substances 0.000 claims description 2
- 229910052708 sodium Inorganic materials 0.000 claims description 2
- 229910052720 vanadium Inorganic materials 0.000 claims description 2
- LEONUFNNVUYDNQ-UHFFFAOYSA-N vanadium atom Chemical compound [V] LEONUFNNVUYDNQ-UHFFFAOYSA-N 0.000 claims description 2
- 239000011701 zinc Substances 0.000 claims description 2
- 229910052725 zinc Inorganic materials 0.000 claims description 2
- 150000003013 phosphoric acid derivatives Chemical class 0.000 claims 1
- 150000003467 sulfuric acid derivatives Chemical class 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract description 2
- 239000000126 substance Substances 0.000 abstract description 2
- 235000018102 proteins Nutrition 0.000 description 77
- 150000001413 amino acids Chemical class 0.000 description 35
- 239000002773 nucleotide Substances 0.000 description 35
- 125000003729 nucleotide group Chemical group 0.000 description 35
- 210000004027 cell Anatomy 0.000 description 26
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 26
- 108010005233 alanylglutamic acid Proteins 0.000 description 25
- 108010050848 glycylleucine Proteins 0.000 description 23
- 239000007789 gas Substances 0.000 description 20
- 239000000047 product Substances 0.000 description 20
- 108010047495 alanylglycine Proteins 0.000 description 19
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 17
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 17
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 17
- 238000004255 ion exchange chromatography Methods 0.000 description 17
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 15
- 108010049041 glutamylalanine Proteins 0.000 description 15
- 108010061238 threonyl-glycine Proteins 0.000 description 15
- 108010079364 N-glycylalanine Proteins 0.000 description 14
- 235000010755 mineral Nutrition 0.000 description 13
- 241000894006 Bacteria Species 0.000 description 12
- 108010044940 alanylglutamine Proteins 0.000 description 12
- 238000004364 calculation method Methods 0.000 description 12
- 239000000203 mixture Substances 0.000 description 12
- 108010073969 valyllysine Proteins 0.000 description 11
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 10
- 108010038633 aspartylglutamate Proteins 0.000 description 10
- 108010089804 glycyl-threonine Proteins 0.000 description 10
- 241000880493 Leptailurus serval Species 0.000 description 9
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 9
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 9
- 239000000843 powder Substances 0.000 description 9
- 239000000725 suspension Substances 0.000 description 9
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 8
- MYSWGUAQZAJSOK-UHFFFAOYSA-N ciprofloxacin Chemical compound C12=CC(N3CCNCC3)=C(F)C=C2C(=O)C(C(=O)O)=CN1C1CC1 MYSWGUAQZAJSOK-UHFFFAOYSA-N 0.000 description 8
- 238000004128 high performance liquid chromatography Methods 0.000 description 8
- 108010040030 histidinoalanine Proteins 0.000 description 8
- 244000005700 microbiome Species 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 8
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 7
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 7
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 7
- 108010087924 alanylproline Proteins 0.000 description 7
- 108010047857 aspartylglycine Proteins 0.000 description 7
- 108010034529 leucyl-lysine Proteins 0.000 description 7
- 108010064235 lysylglycine Proteins 0.000 description 7
- 230000003287 optical effect Effects 0.000 description 7
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 6
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 6
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 6
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 6
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 6
- 235000001014 amino acid Nutrition 0.000 description 6
- 229940024606 amino acid Drugs 0.000 description 6
- 108010093581 aspartyl-proline Proteins 0.000 description 6
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 6
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 6
- 108010025306 histidylleucine Proteins 0.000 description 6
- 150000002431 hydrogen Chemical class 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- 108010056582 methionylglutamic acid Proteins 0.000 description 6
- 150000002894 organic compounds Chemical class 0.000 description 6
- 235000004252 protein component Nutrition 0.000 description 6
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 6
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 5
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 5
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 5
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 5
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 5
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 5
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 5
- 241000080590 Niso Species 0.000 description 5
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 5
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 5
- 108010013835 arginine glutamate Proteins 0.000 description 5
- 108010008355 arginyl-glutamine Proteins 0.000 description 5
- 108010060035 arginylproline Proteins 0.000 description 5
- 230000001413 cellular effect Effects 0.000 description 5
- 238000000105 evaporative light scattering detection Methods 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 5
- 108010010147 glycylglutamine Proteins 0.000 description 5
- 108010015792 glycyllysine Proteins 0.000 description 5
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 5
- 239000007003 mineral medium Substances 0.000 description 5
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 5
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 5
- FGIUAXJPYTZDNR-UHFFFAOYSA-N potassium nitrate Chemical compound [K+].[O-][N+]([O-])=O FGIUAXJPYTZDNR-UHFFFAOYSA-N 0.000 description 5
- 108010031719 prolyl-serine Proteins 0.000 description 5
- 108010029020 prolylglycine Proteins 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- 229920001817 Agar Polymers 0.000 description 4
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 4
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 4
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 4
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 4
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 4
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 4
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonia chloride Chemical compound [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 description 4
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 4
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 4
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 4
- 108010078777 Colistin Proteins 0.000 description 4
- 229930182566 Gentamicin Natural products 0.000 description 4
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 4
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 4
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 4
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 4
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 4
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 4
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 4
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 4
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- 239000004098 Tetracycline Substances 0.000 description 4
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 4
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 4
- 241001495153 Xanthobacter sp. Species 0.000 description 4
- 241000865160 Xanthobacter tagetidis Species 0.000 description 4
- 238000002835 absorbance Methods 0.000 description 4
- 239000008272 agar Substances 0.000 description 4
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- 229960000723 ampicillin Drugs 0.000 description 4
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 4
- 239000003242 anti bacterial agent Substances 0.000 description 4
- 229940088710 antibiotic agent Drugs 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 150000001720 carbohydrates Chemical class 0.000 description 4
- 235000014633 carbohydrates Nutrition 0.000 description 4
- 229960003405 ciprofloxacin Drugs 0.000 description 4
- 229960003346 colistin Drugs 0.000 description 4
- 108010060199 cysteinylproline Proteins 0.000 description 4
- 238000001914 filtration Methods 0.000 description 4
- 229960000308 fosfomycin Drugs 0.000 description 4
- YMDXZJFXQJVXBF-STHAYSLISA-N fosfomycin Chemical compound C[C@@H]1O[C@@H]1P(O)(O)=O YMDXZJFXQJVXBF-STHAYSLISA-N 0.000 description 4
- 229960002518 gentamicin Drugs 0.000 description 4
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 4
- 108010079547 glutamylmethionine Proteins 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 4
- 108010020688 glycylhistidine Proteins 0.000 description 4
- 108010077515 glycylproline Proteins 0.000 description 4
- IPCSVZSSVZVIGE-UHFFFAOYSA-N hexadecanoic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O IPCSVZSSVZVIGE-UHFFFAOYSA-N 0.000 description 4
- 108010028295 histidylhistidine Proteins 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 229930027917 kanamycin Natural products 0.000 description 4
- 229960000318 kanamycin Drugs 0.000 description 4
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 4
- 229930182823 kanamycin A Natural products 0.000 description 4
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- JORAUNFTUVJTNG-BSTBCYLQSA-N n-[(2s)-4-amino-1-[[(2s,3r)-1-[[(2s)-4-amino-1-oxo-1-[[(3s,6s,9s,12s,15r,18s,21s)-6,9,18-tris(2-aminoethyl)-3-[(1r)-1-hydroxyethyl]-12,15-bis(2-methylpropyl)-2,5,8,11,14,17,20-heptaoxo-1,4,7,10,13,16,19-heptazacyclotricos-21-yl]amino]butan-2-yl]amino]-3-h Chemical compound CC(C)CCCCC(=O)N[C@@H](CCN)C(=O)N[C@H]([C@@H](C)O)CN[C@@H](CCN)C(=O)N[C@H]1CCNC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCN)NC(=O)[C@H](CCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](CC(C)C)NC(=O)[C@H](CCN)NC1=O.CCC(C)CCCCC(=O)N[C@@H](CCN)C(=O)N[C@H]([C@@H](C)O)CN[C@@H](CCN)C(=O)N[C@H]1CCNC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCN)NC(=O)[C@H](CCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](CC(C)C)NC(=O)[C@H](CCN)NC1=O JORAUNFTUVJTNG-BSTBCYLQSA-N 0.000 description 4
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 description 4
- 108010018625 phenylalanylarginine Proteins 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- XDJYMJULXQKGMM-UHFFFAOYSA-N polymyxin E1 Natural products CCC(C)CCCCC(=O)NC(CCN)C(=O)NC(C(C)O)C(=O)NC(CCN)C(=O)NC1CCNC(=O)C(C(C)O)NC(=O)C(CCN)NC(=O)C(CCN)NC(=O)C(CC(C)C)NC(=O)C(CC(C)C)NC(=O)C(CCN)NC1=O XDJYMJULXQKGMM-UHFFFAOYSA-N 0.000 description 4
- KNIWPHSUTGNZST-UHFFFAOYSA-N polymyxin E2 Natural products CC(C)CCCCC(=O)NC(CCN)C(=O)NC(C(C)O)C(=O)NC(CCN)C(=O)NC1CCNC(=O)C(C(C)O)NC(=O)C(CCN)NC(=O)C(CCN)NC(=O)C(CC(C)C)NC(=O)C(CC(C)C)NC(=O)C(CCN)NC1=O KNIWPHSUTGNZST-UHFFFAOYSA-N 0.000 description 4
- IOLCXVTUBQKXJR-UHFFFAOYSA-M potassium bromide Chemical compound [K+].[Br-] IOLCXVTUBQKXJR-UHFFFAOYSA-M 0.000 description 4
- NLKNQRATVPKPDG-UHFFFAOYSA-M potassium iodide Chemical compound [K+].[I-] NLKNQRATVPKPDG-UHFFFAOYSA-M 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 108010015796 prolylisoleucine Proteins 0.000 description 4
- 108010090894 prolylleucine Proteins 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- LXNHXLLTXMVWPM-UHFFFAOYSA-N pyridoxine Chemical compound CC1=NC=C(CO)C(CO)=C1O LXNHXLLTXMVWPM-UHFFFAOYSA-N 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 235000002639 sodium chloride Nutrition 0.000 description 4
- 239000002689 soil Substances 0.000 description 4
- 229960005322 streptomycin Drugs 0.000 description 4
- VLYWMPOKSSWJAL-UHFFFAOYSA-N sulfamethoxypyridazine Chemical compound N1=NC(OC)=CC=C1NS(=O)(=O)C1=CC=C(N)C=C1 VLYWMPOKSSWJAL-UHFFFAOYSA-N 0.000 description 4
- 229960002180 tetracycline Drugs 0.000 description 4
- 229930101283 tetracycline Natural products 0.000 description 4
- 235000019364 tetracycline Nutrition 0.000 description 4
- 150000003522 tetracyclines Chemical class 0.000 description 4
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 3
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 3
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 3
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 3
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 3
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 3
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 3
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 3
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 3
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 3
- VHEVVUZDDUCAKU-FXQIFTODSA-N Ala-Met-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O VHEVVUZDDUCAKU-FXQIFTODSA-N 0.000 description 3
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 3
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 3
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 3
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 3
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 3
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 3
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 3
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 3
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 3
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 3
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 3
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 3
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 3
- 241001360526 Escherichia coli ATCC 25922 Species 0.000 description 3
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 3
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 3
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 3
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 3
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 3
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 3
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 3
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 3
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 3
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 3
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 3
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 3
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 3
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 3
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 3
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 3
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 3
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 3
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 3
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 3
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 3
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 3
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 3
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 3
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 3
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 3
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 3
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 3
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 3
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 3
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 3
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 3
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 3
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 3
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 3
- AUNGANRZJHBGPY-SCRDCRAPSA-N Riboflavin Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-SCRDCRAPSA-N 0.000 description 3
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 3
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 3
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 3
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 3
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 3
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 3
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 3
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 3
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 3
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 3
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 3
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 3
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 3
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 3
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 3
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 3
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 3
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 3
- 108010041407 alanylaspartic acid Proteins 0.000 description 3
- 239000000908 ammonium hydroxide Substances 0.000 description 3
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 3
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 3
- 235000011130 ammonium sulphate Nutrition 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 3
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 230000003115 biocidal effect Effects 0.000 description 3
- 239000006285 cell suspension Substances 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 3
- 108010054812 diprotin A Proteins 0.000 description 3
- 239000002158 endotoxin Substances 0.000 description 3
- 235000012041 food component Nutrition 0.000 description 3
- 239000005417 food ingredient Substances 0.000 description 3
- 239000012737 fresh medium Substances 0.000 description 3
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 3
- 238000012239 gene modification Methods 0.000 description 3
- 230000005017 genetic modification Effects 0.000 description 3
- 235000013617 genetically modified food Nutrition 0.000 description 3
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 3
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 108010036413 histidylglycine Proteins 0.000 description 3
- 108010027338 isoleucylcysteine Proteins 0.000 description 3
- 108010078274 isoleucylvaline Proteins 0.000 description 3
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 3
- 150000002632 lipids Chemical class 0.000 description 3
- 229920006008 lipopolysaccharide Polymers 0.000 description 3
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 238000012423 maintenance Methods 0.000 description 3
- 230000002906 microbiologic effect Effects 0.000 description 3
- 235000015097 nutrients Nutrition 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 3
- 108010079317 prolyl-tyrosine Proteins 0.000 description 3
- 238000000164 protein isolation Methods 0.000 description 3
- 235000003441 saturated fatty acids Nutrition 0.000 description 3
- 150000004671 saturated fatty acids Chemical class 0.000 description 3
- 108010048818 seryl-histidine Proteins 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 239000008399 tap water Substances 0.000 description 3
- 235000020679 tap water Nutrition 0.000 description 3
- 108010020532 tyrosyl-proline Proteins 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- 108010027345 wheylin-1 peptide Proteins 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 2
- GHOKWGTUZJEAQD-ZETCQYMHSA-N (D)-(+)-Pantothenic acid Chemical compound OCC(C)(C)[C@@H](O)C(=O)NCCC(O)=O GHOKWGTUZJEAQD-ZETCQYMHSA-N 0.000 description 2
- GVJHHUAWPYXKBD-UHFFFAOYSA-N (±)-α-Tocopherol Chemical compound OC1=C(C)C(C)=C2OC(CCCC(C)CCCC(C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-UHFFFAOYSA-N 0.000 description 2
- WOJJIRYPFAZEPF-YFKPBYRVSA-N 2-[[(2s)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoyl]amino]acetate Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)CN WOJJIRYPFAZEPF-YFKPBYRVSA-N 0.000 description 2
- 102100021921 ATP synthase subunit a Human genes 0.000 description 2
- 101710114070 ATP synthase subunit a Proteins 0.000 description 2
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 2
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 2
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 2
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 2
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 2
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 2
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 2
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 2
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 2
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 2
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 2
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 2
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 2
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 2
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 2
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 2
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 2
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 2
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 2
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-O Ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 2
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 2
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 2
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 2
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 2
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 2
- PPPXVIBMLFWNSK-BQBZGAKWSA-N Arg-Gly-Cys Chemical compound C(C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N PPPXVIBMLFWNSK-BQBZGAKWSA-N 0.000 description 2
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 2
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 2
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 2
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 2
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 2
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 2
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 2
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 2
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 2
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 2
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 2
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 2
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 2
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 2
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 2
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 2
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 2
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 2
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 2
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 2
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 2
- VTYYLEPIZMXCLO-UHFFFAOYSA-L Calcium carbonate Chemical compound [Ca+2].[O-]C([O-])=O VTYYLEPIZMXCLO-UHFFFAOYSA-L 0.000 description 2
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 2
- AUNGANRZJHBGPY-UHFFFAOYSA-N D-Lyxoflavin Natural products OCC(O)C(O)C(O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-UHFFFAOYSA-N 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 2
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 2
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 2
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 2
- HVQCEQTUSWWFOS-WDSKDSINSA-N Gln-Gly-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N HVQCEQTUSWWFOS-WDSKDSINSA-N 0.000 description 2
- LUGUNEGJNDEBLU-DCAQKATOSA-N Gln-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LUGUNEGJNDEBLU-DCAQKATOSA-N 0.000 description 2
- BJPPYOMRAVLXBY-YUMQZZPRSA-N Gln-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N BJPPYOMRAVLXBY-YUMQZZPRSA-N 0.000 description 2
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 2
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 2
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 2
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 2
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- RTOOAKXIJADOLL-GUBZILKMSA-N Glu-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N RTOOAKXIJADOLL-GUBZILKMSA-N 0.000 description 2
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 2
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 2
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 2
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 2
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 2
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 2
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 2
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 2
- ZMVCLTGPGWJAEE-JYJNAYRXSA-N Glu-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)O ZMVCLTGPGWJAEE-JYJNAYRXSA-N 0.000 description 2
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 2
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 2
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 2
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 2
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 2
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 2
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 2
- PMSDOVISAARGAV-FHWLQOOXSA-N Glu-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 PMSDOVISAARGAV-FHWLQOOXSA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 2
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 2
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 2
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 2
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 2
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 2
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 2
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 2
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 2
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 2
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 2
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- NIOPEYHPOBWLQO-KBPBESRZSA-N Gly-Trp-Glu Chemical compound NCC(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOPEYHPOBWLQO-KBPBESRZSA-N 0.000 description 2
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 2
- ZYDYEPDFFVCUBI-SRVKXCTJSA-N His-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZYDYEPDFFVCUBI-SRVKXCTJSA-N 0.000 description 2
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 2
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 2
- JBSLJUPMTYLLFH-MELADBBJSA-N His-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O JBSLJUPMTYLLFH-MELADBBJSA-N 0.000 description 2
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 2
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 2
- CISBRYJZMFWOHJ-JBDRJPRFSA-N Ile-Ala-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N CISBRYJZMFWOHJ-JBDRJPRFSA-N 0.000 description 2
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 2
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 2
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 2
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 2
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 2
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 2
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 2
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 2
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 2
- REXAUQBGSGDEJY-IGISWZIWSA-N Ile-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N REXAUQBGSGDEJY-IGISWZIWSA-N 0.000 description 2
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 2
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 2
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 2
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 2
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 2
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 2
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 2
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 2
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 2
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 2
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 2
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 2
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 2
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 2
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 2
- CNXOBMMOYZPPGS-NUTKFTJISA-N Lys-Trp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O CNXOBMMOYZPPGS-NUTKFTJISA-N 0.000 description 2
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 2
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 2
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 2
- GVIVXNFKJQFTCE-YUMQZZPRSA-N Met-Gly-Gln Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O GVIVXNFKJQFTCE-YUMQZZPRSA-N 0.000 description 2
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 2
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 2
- IILAGWCGKJSBGB-IHRRRGAJSA-N Met-Phe-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IILAGWCGKJSBGB-IHRRRGAJSA-N 0.000 description 2
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 2
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 2
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 2
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 2
- MSFSPUZXLOGKHJ-UHFFFAOYSA-N Muraminsaeure Natural products OC(=O)C(C)OC1C(N)C(O)OC(CO)C1O MSFSPUZXLOGKHJ-UHFFFAOYSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- 108010065395 Neuropep-1 Proteins 0.000 description 2
- PVNIIMVLHYAWGP-UHFFFAOYSA-N Niacin Chemical compound OC(=O)C1=CC=CN=C1 PVNIIMVLHYAWGP-UHFFFAOYSA-N 0.000 description 2
- 108010013639 Peptidoglycan Proteins 0.000 description 2
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 2
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 2
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 2
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 2
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 2
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 2
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 2
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 2
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 2
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 2
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 2
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 2
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 2
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 2
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 2
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 2
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 2
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 2
- KLOQCCRTPHPIFN-DCAQKATOSA-N Pro-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 KLOQCCRTPHPIFN-DCAQKATOSA-N 0.000 description 2
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 2
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 2
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 2
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 2
- GBUNEGKQPSAMNK-QTKMDUPCSA-N Pro-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2)O GBUNEGKQPSAMNK-QTKMDUPCSA-N 0.000 description 2
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 2
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 2
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 2
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 2
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 2
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 2
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 2
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 2
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 2
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 2
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 2
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 2
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 2
- UIIMBOGNXHQVGW-UHFFFAOYSA-M Sodium bicarbonate Chemical compound [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 2
- 108010073771 Soybean Proteins Proteins 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 2
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 2
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 2
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 2
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 2
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 2
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 2
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 2
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 2
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 2
- KHUVIWRRFMPVHD-JYJNAYRXSA-N Tyr-Met-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O KHUVIWRRFMPVHD-JYJNAYRXSA-N 0.000 description 2
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 2
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 2
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 2
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 2
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 2
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 2
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 2
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- 241000368136 Xanthobacteraceae Species 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- FPIPGXGPPPQFEQ-OVSJKPMPSA-N all-trans-retinol Chemical compound OC\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C FPIPGXGPPPQFEQ-OVSJKPMPSA-N 0.000 description 2
- DTOSIQBPPRVQHS-PDBXOOCHSA-N alpha-linolenic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCC(O)=O DTOSIQBPPRVQHS-PDBXOOCHSA-N 0.000 description 2
- VSCWAEJMTAWNJL-UHFFFAOYSA-K aluminium trichloride Chemical compound Cl[Al](Cl)Cl VSCWAEJMTAWNJL-UHFFFAOYSA-K 0.000 description 2
- 235000019270 ammonium chloride Nutrition 0.000 description 2
- 230000000845 anti-microbial effect Effects 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- OSGAYBCDTDRGGQ-UHFFFAOYSA-L calcium sulfate Chemical compound [Ca+2].[O-]S([O-])(=O)=O OSGAYBCDTDRGGQ-UHFFFAOYSA-L 0.000 description 2
- 230000003013 cytotoxicity Effects 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 235000014113 dietary fatty acids Nutrition 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 2
- 229910000397 disodium phosphate Inorganic materials 0.000 description 2
- 235000019800 disodium phosphate Nutrition 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 235000020776 essential amino acid Nutrition 0.000 description 2
- 239000003797 essential amino acid Substances 0.000 description 2
- 229930195729 fatty acid Natural products 0.000 description 2
- 239000000194 fatty acid Substances 0.000 description 2
- 150000004665 fatty acids Chemical class 0.000 description 2
- 238000005189 flocculation Methods 0.000 description 2
- 230000016615 flocculation Effects 0.000 description 2
- OVBPIULPVIDEAO-LBPRGKRZSA-N folic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-LBPRGKRZSA-N 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 2
- 108010084264 glycyl-glycyl-cysteine Proteins 0.000 description 2
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 2
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000000265 homogenisation Methods 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 229920002521 macromolecule Polymers 0.000 description 2
- 230000004060 metabolic process Effects 0.000 description 2
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 229960003512 nicotinic acid Drugs 0.000 description 2
- 231100001084 no genetic toxicology Toxicity 0.000 description 2
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 2
- SECPZKHBENQXJG-FPLPWBNLSA-N palmitoleic acid Chemical compound CCCCCC\C=C/CCCCCCCC(O)=O SECPZKHBENQXJG-FPLPWBNLSA-N 0.000 description 2
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 2
- 108010025488 pinealon Proteins 0.000 description 2
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 229960002477 riboflavin Drugs 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N serine Chemical compound OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 239000001488 sodium phosphate Substances 0.000 description 2
- 235000019710 soybean protein Nutrition 0.000 description 2
- 229960003495 thiamine Drugs 0.000 description 2
- DPJRMOMPQZCRJU-UHFFFAOYSA-M thiamine hydrochloride Chemical compound Cl.[Cl-].CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N DPJRMOMPQZCRJU-UHFFFAOYSA-M 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 2
- 239000011573 trace mineral Substances 0.000 description 2
- 235000013619 trace mineral Nutrition 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 235000021122 unsaturated fatty acids Nutrition 0.000 description 2
- 150000004670 unsaturated fatty acids Chemical class 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 235000019156 vitamin B Nutrition 0.000 description 2
- 239000011720 vitamin B Substances 0.000 description 2
- 229940011671 vitamin b6 Drugs 0.000 description 2
- 150000003722 vitamin derivatives Chemical class 0.000 description 2
- JIAARYAFYJHUJI-UHFFFAOYSA-L zinc dichloride Chemical compound [Cl-].[Cl-].[Zn+2] JIAARYAFYJHUJI-UHFFFAOYSA-L 0.000 description 2
- NWONKYPBYAMBJT-UHFFFAOYSA-L zinc sulfate Chemical compound [Zn+2].[O-]S([O-])(=O)=O NWONKYPBYAMBJT-UHFFFAOYSA-L 0.000 description 2
- 229910000368 zinc sulfate Inorganic materials 0.000 description 2
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 1
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- PKOHVHWNGUHYRE-ZFWWWQNUSA-N (2s)-1-[2-[[(2s)-2-amino-3-(1h-indol-3-yl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)NCC(=O)N1CCC[C@H]1C(O)=O PKOHVHWNGUHYRE-ZFWWWQNUSA-N 0.000 description 1
- VWWKKDNCCLAGRM-GVXVVHGQSA-N (2s)-2-[[2-[[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]propanoyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VWWKKDNCCLAGRM-GVXVVHGQSA-N 0.000 description 1
- SUQWGICKJIJKNO-IHRRRGAJSA-N (2s)-2-[[2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]acetyl]amino]pentanedioic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O SUQWGICKJIJKNO-IHRRRGAJSA-N 0.000 description 1
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- FPIPGXGPPPQFEQ-UHFFFAOYSA-N 13-cis retinol Natural products OCC=C(C)C=CC=C(C)C=CC1=C(C)CCCC1(C)C FPIPGXGPPPQFEQ-UHFFFAOYSA-N 0.000 description 1
- XWTNPSHCJMZAHQ-QMMMGPOBSA-N 2-[[2-[[2-[[(2s)-2-amino-4-methylpentanoyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(=O)NCC(O)=O XWTNPSHCJMZAHQ-QMMMGPOBSA-N 0.000 description 1
- 239000001763 2-hydroxyethyl(trimethyl)azanium Substances 0.000 description 1
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 1
- PMYDPQQPEAYXKD-UHFFFAOYSA-N 3-hydroxy-n-naphthalen-2-ylnaphthalene-2-carboxamide Chemical compound C1=CC=CC2=CC(NC(=O)C3=CC4=CC=CC=C4C=C3O)=CC=C21 PMYDPQQPEAYXKD-UHFFFAOYSA-N 0.000 description 1
- OSJPPGNTCRNQQC-UHFFFAOYSA-L 3-phosphoglycerate(2-) Chemical compound [O-]C(=O)C(O)COP(O)([O-])=O OSJPPGNTCRNQQC-UHFFFAOYSA-L 0.000 description 1
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 1
- 108010044087 AS-I toxin Proteins 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- YBPLKDWJFYCZSV-ZLUOBGJFSA-N Ala-Asn-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N YBPLKDWJFYCZSV-ZLUOBGJFSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 1
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- OQCPATDFWYYDDX-HGNGGELXSA-N Ala-Gln-His Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OQCPATDFWYYDDX-HGNGGELXSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- 108010076441 Ala-His-His Proteins 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- QPBSRMDNJOTFAL-AICCOOGYSA-N Ala-Leu-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QPBSRMDNJOTFAL-AICCOOGYSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- FCXAUASCMJOFEY-NDKCEZKHSA-N Ala-Leu-Thr-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O FCXAUASCMJOFEY-NDKCEZKHSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- OQWQTGBOFPJOIF-DLOVCJGASA-N Ala-Lys-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N OQWQTGBOFPJOIF-DLOVCJGASA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 1
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 1
- VEAPAYQQLSEKEM-GUBZILKMSA-N Ala-Met-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VEAPAYQQLSEKEM-GUBZILKMSA-N 0.000 description 1
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 1
- FVNAUOZKIPAYNA-BPNCWPANSA-N Ala-Met-Tyr Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FVNAUOZKIPAYNA-BPNCWPANSA-N 0.000 description 1
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 1
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- XKXAZPSREVUCRT-BPNCWPANSA-N Ala-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=C(O)C=C1 XKXAZPSREVUCRT-BPNCWPANSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-UHFFFAOYSA-N Alanine Chemical compound CC([NH3+])C([O-])=O QNAYBMKLOCPYGJ-UHFFFAOYSA-N 0.000 description 1
- 241001135756 Alphaproteobacteria Species 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 1
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 1
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 1
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 1
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 1
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Natural products CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 1
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- YUGFLWBWAJFGKY-BQBZGAKWSA-N Arg-Cys-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O YUGFLWBWAJFGKY-BQBZGAKWSA-N 0.000 description 1
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- MMGCRPZQZWTZTA-IHRRRGAJSA-N Arg-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N MMGCRPZQZWTZTA-IHRRRGAJSA-N 0.000 description 1
- GFMWTFHOZGLTLC-AVGNSLFASA-N Arg-His-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O GFMWTFHOZGLTLC-AVGNSLFASA-N 0.000 description 1
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 1
- RTDZQOFEGPWSJD-AVGNSLFASA-N Arg-Leu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O RTDZQOFEGPWSJD-AVGNSLFASA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- RIIVUOJDDQXHRV-SRVKXCTJSA-N Arg-Lys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O RIIVUOJDDQXHRV-SRVKXCTJSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- OMKZPCPZEFMBIT-SRVKXCTJSA-N Arg-Met-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OMKZPCPZEFMBIT-SRVKXCTJSA-N 0.000 description 1
- HIMXTOIXVXWHTB-DCAQKATOSA-N Arg-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HIMXTOIXVXWHTB-DCAQKATOSA-N 0.000 description 1
- VIINVRPKMUZYOI-DCAQKATOSA-N Arg-Met-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIINVRPKMUZYOI-DCAQKATOSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- FIQKRDXFTANIEJ-ULQDDVLXSA-N Arg-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FIQKRDXFTANIEJ-ULQDDVLXSA-N 0.000 description 1
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- JJIBHAOBNIFUEL-SRVKXCTJSA-N Arg-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCN=C(N)N)N JJIBHAOBNIFUEL-SRVKXCTJSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- LFAUVOXPCGJKTB-DCAQKATOSA-N Arg-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N LFAUVOXPCGJKTB-DCAQKATOSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- JQHASVQBAKRJKD-GUBZILKMSA-N Arg-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JQHASVQBAKRJKD-GUBZILKMSA-N 0.000 description 1
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- CGXQUULXFWRJOI-SRVKXCTJSA-N Arg-Val-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O CGXQUULXFWRJOI-SRVKXCTJSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- TWVTVZUGEDBAJF-ACZMJKKPSA-N Asn-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N TWVTVZUGEDBAJF-ACZMJKKPSA-N 0.000 description 1
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 1
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 1
- QEQVUHQQYDZUEN-GUBZILKMSA-N Asn-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N QEQVUHQQYDZUEN-GUBZILKMSA-N 0.000 description 1
- JGIAYNNXZKKKOW-KKUMJFAQSA-N Asn-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N JGIAYNNXZKKKOW-KKUMJFAQSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 1
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 1
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 1
- CXBOKJPLEYUPGB-FXQIFTODSA-N Asp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N CXBOKJPLEYUPGB-FXQIFTODSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- MFMJRYHVLLEMQM-DCAQKATOSA-N Asp-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N MFMJRYHVLLEMQM-DCAQKATOSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- RWHHSFSWKFBTCF-KKUMJFAQSA-N Asp-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N RWHHSFSWKFBTCF-KKUMJFAQSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- LDLZOAJRXXBVGF-GMOBBJLQSA-N Asp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N LDLZOAJRXXBVGF-GMOBBJLQSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- YQKYLDVPCOGIRB-SEKJGCFDSA-N Asp-Leu-Thr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YQKYLDVPCOGIRB-SEKJGCFDSA-N 0.000 description 1
- WNGZKSVJFDZICU-XIRDDKMYSA-N Asp-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N WNGZKSVJFDZICU-XIRDDKMYSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- WWOYXVBGHAHQBG-FXQIFTODSA-N Asp-Met-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O WWOYXVBGHAHQBG-FXQIFTODSA-N 0.000 description 1
- VWWAFGHMPWBKEP-GMOBBJLQSA-N Asp-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)N VWWAFGHMPWBKEP-GMOBBJLQSA-N 0.000 description 1
- SHBKFJNZNSGHDS-FGPLHTHASA-N Asp-Met-Thr-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O SHBKFJNZNSGHDS-FGPLHTHASA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- YRZIYQGXTSBRLT-AVGNSLFASA-N Asp-Phe-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YRZIYQGXTSBRLT-AVGNSLFASA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- DWOSGXZMLQNDBN-FXQIFTODSA-N Asp-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CS)C(=O)O DWOSGXZMLQNDBN-FXQIFTODSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- KCOPOPKJRHVGPE-AQZXSJQPSA-N Asp-Thr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O KCOPOPKJRHVGPE-AQZXSJQPSA-N 0.000 description 1
- XOASPVGNFAMYBD-WFBYXXMGSA-N Asp-Trp-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O XOASPVGNFAMYBD-WFBYXXMGSA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- RCGVPVZHKAXDPA-NYVOZVTQSA-N Asp-Trp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)NC(=O)[C@H](CC(=O)O)N RCGVPVZHKAXDPA-NYVOZVTQSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- ZOXJGFHDIHLPTG-UHFFFAOYSA-N Boron Chemical compound [B] ZOXJGFHDIHLPTG-UHFFFAOYSA-N 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- UGFAIRIUMAVXCW-UHFFFAOYSA-N Carbon monoxide Chemical compound [O+]#[C-] UGFAIRIUMAVXCW-UHFFFAOYSA-N 0.000 description 1
- GHOKWGTUZJEAQD-UHFFFAOYSA-N Chick antidermatitis factor Natural products OCC(C)(C)C(O)C(=O)NCCC(O)=O GHOKWGTUZJEAQD-UHFFFAOYSA-N 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-M Chloride anion Chemical compound [Cl-] VEXZGXHMUGYJMC-UHFFFAOYSA-M 0.000 description 1
- 235000019743 Choline chloride Nutrition 0.000 description 1
- 229910021591 Copper(I) chloride Inorganic materials 0.000 description 1
- 241000252867 Cupriavidus metallidurans Species 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 1
- GRNOCLDFUNCIDW-ACZMJKKPSA-N Cys-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N GRNOCLDFUNCIDW-ACZMJKKPSA-N 0.000 description 1
- DCJNIJAWIRPPBB-CIUDSAMLSA-N Cys-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N DCJNIJAWIRPPBB-CIUDSAMLSA-N 0.000 description 1
- ZOLXQKZHYOHHMD-DLOVCJGASA-N Cys-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N ZOLXQKZHYOHHMD-DLOVCJGASA-N 0.000 description 1
- YFXFOZPXVFPBDH-VZFHVOOUSA-N Cys-Ala-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CS)C(O)=O YFXFOZPXVFPBDH-VZFHVOOUSA-N 0.000 description 1
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 1
- CLDCTNHPILWQCW-CIUDSAMLSA-N Cys-Arg-Glu Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N CLDCTNHPILWQCW-CIUDSAMLSA-N 0.000 description 1
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 1
- HRJLVSQKBLZHSR-ZLUOBGJFSA-N Cys-Asn-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O HRJLVSQKBLZHSR-ZLUOBGJFSA-N 0.000 description 1
- JIVJXVJMOBVCJF-ZLUOBGJFSA-N Cys-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)C(=O)N JIVJXVJMOBVCJF-ZLUOBGJFSA-N 0.000 description 1
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 1
- KIHRUISMQZVCNO-ZLUOBGJFSA-N Cys-Asp-Asp Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KIHRUISMQZVCNO-ZLUOBGJFSA-N 0.000 description 1
- SDWZYDDNSMPBRM-AVGNSLFASA-N Cys-Gln-Phe Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SDWZYDDNSMPBRM-AVGNSLFASA-N 0.000 description 1
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 1
- UXUSHQYYQCZWET-WDSKDSINSA-N Cys-Glu-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O UXUSHQYYQCZWET-WDSKDSINSA-N 0.000 description 1
- KABHAOSDMIYXTR-GUBZILKMSA-N Cys-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N KABHAOSDMIYXTR-GUBZILKMSA-N 0.000 description 1
- VCIIDXDOPGHMDQ-WDSKDSINSA-N Cys-Gly-Gln Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VCIIDXDOPGHMDQ-WDSKDSINSA-N 0.000 description 1
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 1
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 1
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 1
- WPXPYZPGSGWQSC-DCAQKATOSA-N Cys-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N WPXPYZPGSGWQSC-DCAQKATOSA-N 0.000 description 1
- LYSHSHHDBVKJRN-JBDRJPRFSA-N Cys-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N LYSHSHHDBVKJRN-JBDRJPRFSA-N 0.000 description 1
- OXFOKRAFNYSREH-BJDJZHNGSA-N Cys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N OXFOKRAFNYSREH-BJDJZHNGSA-N 0.000 description 1
- VFGADOJXRLWTBU-JBDRJPRFSA-N Cys-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N VFGADOJXRLWTBU-JBDRJPRFSA-N 0.000 description 1
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 1
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 1
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 1
- AFYGNOJUTMXQIG-FXQIFTODSA-N Cys-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N AFYGNOJUTMXQIG-FXQIFTODSA-N 0.000 description 1
- MTNUYDIILCWPEP-GUBZILKMSA-N Cys-Met-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CS MTNUYDIILCWPEP-GUBZILKMSA-N 0.000 description 1
- HEPLXMBVMCXTBP-QWRGUYRKSA-N Cys-Phe-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O HEPLXMBVMCXTBP-QWRGUYRKSA-N 0.000 description 1
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- FTTZLFIEUQHLHH-BWBBJGPYSA-N Cys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O FTTZLFIEUQHLHH-BWBBJGPYSA-N 0.000 description 1
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 1
- MHYHLWUGWUBUHF-GUBZILKMSA-N Cys-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N MHYHLWUGWUBUHF-GUBZILKMSA-N 0.000 description 1
- ZXGDAZLSOSYSBA-IHRRRGAJSA-N Cys-Val-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZXGDAZLSOSYSBA-IHRRRGAJSA-N 0.000 description 1
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 1
- ZZZCUOFIHGPKAK-UHFFFAOYSA-N D-erythro-ascorbic acid Natural products OCC1OC(=O)C(O)=C1O ZZZCUOFIHGPKAK-UHFFFAOYSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- MYMOFIZGZYHOMD-UHFFFAOYSA-N Dioxygen Chemical compound O=O MYMOFIZGZYHOMD-UHFFFAOYSA-N 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 229930091371 Fructose Natural products 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- 108010092526 GKPV peptide Proteins 0.000 description 1
- FAQVCWVVIYYWRR-WHFBIAKZSA-N Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O FAQVCWVVIYYWRR-WHFBIAKZSA-N 0.000 description 1
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 1
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- ZDJZEGYVKANKED-NRPADANISA-N Gln-Cys-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O ZDJZEGYVKANKED-NRPADANISA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 1
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- LVSYIKGMLRHKME-IUCAKERBSA-N Gln-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N LVSYIKGMLRHKME-IUCAKERBSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- NXPXQIZKDOXIHH-JSGCOSHPSA-N Gln-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N NXPXQIZKDOXIHH-JSGCOSHPSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- BVELAHPZLYLZDJ-HGNGGELXSA-N Gln-His-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O BVELAHPZLYLZDJ-HGNGGELXSA-N 0.000 description 1
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 1
- SBHVGKBYOQKAEA-SDDRHHMPSA-N Gln-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SBHVGKBYOQKAEA-SDDRHHMPSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 1
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- KSKFIECUYMYWNS-AVGNSLFASA-N Gln-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N KSKFIECUYMYWNS-AVGNSLFASA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 1
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 1
- MSHXWFKYXJTLEZ-CIUDSAMLSA-N Gln-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MSHXWFKYXJTLEZ-CIUDSAMLSA-N 0.000 description 1
- XUZQMPGBGFQJMY-SRVKXCTJSA-N Gln-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XUZQMPGBGFQJMY-SRVKXCTJSA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- SWDSRANUCKNBLA-AVGNSLFASA-N Gln-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SWDSRANUCKNBLA-AVGNSLFASA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 1
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 1
- OKQLXOYFUPVEHI-CIUDSAMLSA-N Gln-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N OKQLXOYFUPVEHI-CIUDSAMLSA-N 0.000 description 1
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- XIYWAJQIWLXXAF-XKBZYTNZSA-N Gln-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XIYWAJQIWLXXAF-XKBZYTNZSA-N 0.000 description 1
- YMCPEHDGTRUOHO-SXNHZJKMSA-N Gln-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N YMCPEHDGTRUOHO-SXNHZJKMSA-N 0.000 description 1
- CVRUVYDNRPSKBM-QEJZJMRPSA-N Gln-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N CVRUVYDNRPSKBM-QEJZJMRPSA-N 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 1
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- APHGWLWMOXGZRL-DCAQKATOSA-N Glu-Glu-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O APHGWLWMOXGZRL-DCAQKATOSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 1
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 1
- WDTAKCUOIKHCTB-NKIYYHGXSA-N Glu-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N)O WDTAKCUOIKHCTB-NKIYYHGXSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- AOCARQDSFTWWFT-DCAQKATOSA-N Glu-Met-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AOCARQDSFTWWFT-DCAQKATOSA-N 0.000 description 1
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 1
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
- MCGNJCNXIMQCMN-DCAQKATOSA-N Glu-Met-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCC(O)=O MCGNJCNXIMQCMN-DCAQKATOSA-N 0.000 description 1
- HOIPREWORBVRLD-XIRDDKMYSA-N Glu-Met-Trp Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O HOIPREWORBVRLD-XIRDDKMYSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 1
- ZQNCUVODKOBSSO-XEGUGMAKSA-N Glu-Trp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZQNCUVODKOBSSO-XEGUGMAKSA-N 0.000 description 1
- SFKMXFWWDUGXRT-NWLDYVSISA-N Glu-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N)O SFKMXFWWDUGXRT-NWLDYVSISA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- XZRZILPOZBVTDB-GJZGRUSLSA-N Gly-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)CN)C(O)=O)=CNC2=C1 XZRZILPOZBVTDB-GJZGRUSLSA-N 0.000 description 1
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- GZBZACMXFIPIDX-WHFBIAKZSA-N Gly-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)C(=O)O GZBZACMXFIPIDX-WHFBIAKZSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- YZPVGIVFMZLQMM-YUMQZZPRSA-N Gly-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN YZPVGIVFMZLQMM-YUMQZZPRSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- JUBDONGMHASUCN-IUCAKERBSA-N Gly-Glu-His Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O JUBDONGMHASUCN-IUCAKERBSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 1
- HPAIKDPJURGQLN-KBPBESRZSA-N Gly-His-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 HPAIKDPJURGQLN-KBPBESRZSA-N 0.000 description 1
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- MDKCBHZLQJZOCJ-STQMWFEESA-N Gly-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)CN MDKCBHZLQJZOCJ-STQMWFEESA-N 0.000 description 1
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 1
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- RZEDHGORCKRINR-STQMWFEESA-N Gly-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN RZEDHGORCKRINR-STQMWFEESA-N 0.000 description 1
- UMRIXLHPZZIOML-OALUTQOASA-N Gly-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN UMRIXLHPZZIOML-OALUTQOASA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- MBSSHYPAEHPSGY-LSJOCFKGSA-N His-Ala-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O MBSSHYPAEHPSGY-LSJOCFKGSA-N 0.000 description 1
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 1
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- YPLYIXGKCRQZGW-SRVKXCTJSA-N His-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YPLYIXGKCRQZGW-SRVKXCTJSA-N 0.000 description 1
- IDNNYVGVSZMQTK-IHRRRGAJSA-N His-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N IDNNYVGVSZMQTK-IHRRRGAJSA-N 0.000 description 1
- QZAFGJNKLMNDEM-DCAQKATOSA-N His-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 QZAFGJNKLMNDEM-DCAQKATOSA-N 0.000 description 1
- OMNVOTCFQQLEQU-CIUDSAMLSA-N His-Asn-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMNVOTCFQQLEQU-CIUDSAMLSA-N 0.000 description 1
- WOAMZMXCLBBQKW-KKUMJFAQSA-N His-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC2=CN=CN2)N)O WOAMZMXCLBBQKW-KKUMJFAQSA-N 0.000 description 1
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 1
- FLYSHWAAHYNKRT-JYJNAYRXSA-N His-Gln-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLYSHWAAHYNKRT-JYJNAYRXSA-N 0.000 description 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 1
- TVRMJKNELJKNRS-GUBZILKMSA-N His-Glu-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N TVRMJKNELJKNRS-GUBZILKMSA-N 0.000 description 1
- TXLQHACKRLWYCM-DCAQKATOSA-N His-Glu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O TXLQHACKRLWYCM-DCAQKATOSA-N 0.000 description 1
- OSZUPUINVNPCOE-SDDRHHMPSA-N His-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OSZUPUINVNPCOE-SDDRHHMPSA-N 0.000 description 1
- PQKCQZHAGILVIM-NKIYYHGXSA-N His-Glu-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O PQKCQZHAGILVIM-NKIYYHGXSA-N 0.000 description 1
- CZXKZMQKXQZDEX-YUMQZZPRSA-N His-Gly-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N CZXKZMQKXQZDEX-YUMQZZPRSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 1
- ZUPVLBAXUUGKKN-VHSXEESVSA-N His-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CN=CN2)N)C(=O)O ZUPVLBAXUUGKKN-VHSXEESVSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 1
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 1
- QPSCMXDWVKWVOW-BZSNNMDCSA-N His-His-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QPSCMXDWVKWVOW-BZSNNMDCSA-N 0.000 description 1
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 1
- DYKZGTLPSNOFHU-DEQVHRJGSA-N His-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DYKZGTLPSNOFHU-DEQVHRJGSA-N 0.000 description 1
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 1
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 1
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 1
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 1
- ZFDKSLBEWYCOCS-BZSNNMDCSA-N His-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CC=CC=C1 ZFDKSLBEWYCOCS-BZSNNMDCSA-N 0.000 description 1
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 1
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 1
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 1
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 1
- OWYIDJCNRWRSJY-QTKMDUPCSA-N His-Pro-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O OWYIDJCNRWRSJY-QTKMDUPCSA-N 0.000 description 1
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 1
- FBVHRDXSCYELMI-PBCZWWQYSA-N His-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O FBVHRDXSCYELMI-PBCZWWQYSA-N 0.000 description 1
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 1
- UIRUVUUGUYCMBY-KCTSRDHCSA-N His-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N UIRUVUUGUYCMBY-KCTSRDHCSA-N 0.000 description 1
- FWWJVUFXUQOEDM-WDSOQIARSA-N His-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N FWWJVUFXUQOEDM-WDSOQIARSA-N 0.000 description 1
- BCSGDNGNHKBRRJ-ULQDDVLXSA-N His-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N BCSGDNGNHKBRRJ-ULQDDVLXSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- JXUGDUWBMKIJDC-NAKRPEOUSA-N Ile-Ala-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JXUGDUWBMKIJDC-NAKRPEOUSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- RSDHVTMRXSABSV-GHCJXIJMSA-N Ile-Asn-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N RSDHVTMRXSABSV-GHCJXIJMSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- RWYCOSAAAJBJQL-KCTSRDHCSA-N Ile-Gly-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RWYCOSAAAJBJQL-KCTSRDHCSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 1
- AMSYMDIIIRJRKZ-HJPIBITLSA-N Ile-His-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AMSYMDIIIRJRKZ-HJPIBITLSA-N 0.000 description 1
- KOPIAUWNLKKELG-SIGLWIIPSA-N Ile-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N KOPIAUWNLKKELG-SIGLWIIPSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 1
- NNVXABCGXOLIEB-PYJNHQTQSA-N Ile-Met-His Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NNVXABCGXOLIEB-PYJNHQTQSA-N 0.000 description 1
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 1
- JDCQDJVYUXNCGF-SPOWBLRKSA-N Ile-Ser-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JDCQDJVYUXNCGF-SPOWBLRKSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 1
- BLFXHAFTNYZEQE-VKOGCVSHSA-N Ile-Trp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BLFXHAFTNYZEQE-VKOGCVSHSA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 229910021578 Iron(III) chloride Inorganic materials 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- LEVWYRKDKASIDU-IMJSIDKUSA-N L-cystine Chemical compound [O-]C(=O)[C@@H]([NH3+])CSSC[C@H]([NH3+])C([O-])=O LEVWYRKDKASIDU-IMJSIDKUSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 1
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 1
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 1
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 1
- MPSBSKHOWJQHBS-IHRRRGAJSA-N Leu-His-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N MPSBSKHOWJQHBS-IHRRRGAJSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 1
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- POMXSEDNUXYPGK-IHRRRGAJSA-N Leu-Met-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N POMXSEDNUXYPGK-IHRRRGAJSA-N 0.000 description 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- JLYUZRKPDKHUTC-WDSOQIARSA-N Leu-Pro-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JLYUZRKPDKHUTC-WDSOQIARSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- URJUVJDTPXCQFL-IHPCNDPISA-N Leu-Trp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N URJUVJDTPXCQFL-IHPCNDPISA-N 0.000 description 1
- SXOFUVGLPHCPRQ-KKUMJFAQSA-N Leu-Tyr-Cys Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(O)=O SXOFUVGLPHCPRQ-KKUMJFAQSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- OYHQOLUKZRVURQ-HZJYTTRNSA-N Linoleic acid Chemical compound CCCCC\C=C/C\C=C/CCCCCCCC(O)=O OYHQOLUKZRVURQ-HZJYTTRNSA-N 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- SSJBMGCZZXCGJJ-DCAQKATOSA-N Lys-Asp-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SSJBMGCZZXCGJJ-DCAQKATOSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- ZAWOJFFMBANLGE-CIUDSAMLSA-N Lys-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N ZAWOJFFMBANLGE-CIUDSAMLSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- KNKJPYAZQUFLQK-IHRRRGAJSA-N Lys-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N KNKJPYAZQUFLQK-IHRRRGAJSA-N 0.000 description 1
- GTAXSKOXPIISBW-AVGNSLFASA-N Lys-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GTAXSKOXPIISBW-AVGNSLFASA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- CBNMHRCLYBJIIZ-XUXIUFHCSA-N Lys-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N CBNMHRCLYBJIIZ-XUXIUFHCSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 description 1
- WWEWGPOLIJXGNX-XUXIUFHCSA-N Lys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N WWEWGPOLIJXGNX-XUXIUFHCSA-N 0.000 description 1
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- UIJVKVHLCQSPOJ-XIRDDKMYSA-N Lys-Ser-Trp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O UIJVKVHLCQSPOJ-XIRDDKMYSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- YUTZYVTZDVZBJJ-IHPCNDPISA-N Lys-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YUTZYVTZDVZBJJ-IHPCNDPISA-N 0.000 description 1
- ZVZRQKJOQQAFCF-ULQDDVLXSA-N Lys-Tyr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZVZRQKJOQQAFCF-ULQDDVLXSA-N 0.000 description 1
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 229910021380 Manganese Chloride Inorganic materials 0.000 description 1
- GLFNIEUTAYBVOC-UHFFFAOYSA-L Manganese chloride Chemical compound Cl[Mn]Cl GLFNIEUTAYBVOC-UHFFFAOYSA-L 0.000 description 1
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- IIPHCNKHEZYSNE-DCAQKATOSA-N Met-Arg-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O IIPHCNKHEZYSNE-DCAQKATOSA-N 0.000 description 1
- ZEDVFJPQNNBMST-CYDGBPFRSA-N Met-Arg-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZEDVFJPQNNBMST-CYDGBPFRSA-N 0.000 description 1
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 1
- PJWDQHNOJIBMRY-JYJNAYRXSA-N Met-Arg-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PJWDQHNOJIBMRY-JYJNAYRXSA-N 0.000 description 1
- IYXDSYWCVVXSKB-CIUDSAMLSA-N Met-Asn-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IYXDSYWCVVXSKB-CIUDSAMLSA-N 0.000 description 1
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 1
- SQUTUWHAAWJYES-GUBZILKMSA-N Met-Asp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SQUTUWHAAWJYES-GUBZILKMSA-N 0.000 description 1
- FBQMBZLJHOQAIH-GUBZILKMSA-N Met-Asp-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O FBQMBZLJHOQAIH-GUBZILKMSA-N 0.000 description 1
- WVTYEEPGEUSFGQ-LPEHRKFASA-N Met-Cys-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N WVTYEEPGEUSFGQ-LPEHRKFASA-N 0.000 description 1
- AWOMRHGUWFBDNU-ZPFDUUQYSA-N Met-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N AWOMRHGUWFBDNU-ZPFDUUQYSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- YORIKIDJCPKBON-YUMQZZPRSA-N Met-Glu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YORIKIDJCPKBON-YUMQZZPRSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- CFRRIZLGFGJEDB-SRVKXCTJSA-N Met-His-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CFRRIZLGFGJEDB-SRVKXCTJSA-N 0.000 description 1
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 1
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 1
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 1
- ODFBIJXEWPWSAN-CYDGBPFRSA-N Met-Ile-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O ODFBIJXEWPWSAN-CYDGBPFRSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 1
- LLKWSEXLNFBKIF-CYDGBPFRSA-N Met-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCSC LLKWSEXLNFBKIF-CYDGBPFRSA-N 0.000 description 1
- OBPCXINRFKHSRY-SDDRHHMPSA-N Met-Met-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N OBPCXINRFKHSRY-SDDRHHMPSA-N 0.000 description 1
- FBLBCGLSRXBANI-KKUMJFAQSA-N Met-Phe-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FBLBCGLSRXBANI-KKUMJFAQSA-N 0.000 description 1
- LQTGGXSOMDSWTQ-UNQGMJICSA-N Met-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCSC)N)O LQTGGXSOMDSWTQ-UNQGMJICSA-N 0.000 description 1
- WYDFQSJOARJAMM-GUBZILKMSA-N Met-Pro-Asp Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WYDFQSJOARJAMM-GUBZILKMSA-N 0.000 description 1
- QEDGNYFHLXXIDC-DCAQKATOSA-N Met-Pro-Gln Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O QEDGNYFHLXXIDC-DCAQKATOSA-N 0.000 description 1
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 1
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 1
- ANCPZNHGZUCSSC-ULQDDVLXSA-N Met-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 ANCPZNHGZUCSSC-ULQDDVLXSA-N 0.000 description 1
- RUTZUJXAVNWLQP-BVSLBCMMSA-N Met-Tyr-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 RUTZUJXAVNWLQP-BVSLBCMMSA-N 0.000 description 1
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 1
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 1
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 1
- LPNWWHBFXPNHJG-AVGNSLFASA-N Met-Val-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN LPNWWHBFXPNHJG-AVGNSLFASA-N 0.000 description 1
- 108700005443 Microbial Genes Proteins 0.000 description 1
- 108010085220 Multiprotein Complexes Proteins 0.000 description 1
- 102000007474 Multiprotein Complexes Human genes 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- OVBPIULPVIDEAO-UHFFFAOYSA-N N-Pteroyl-L-glutaminsaeure Natural products C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)NC(CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 229910021586 Nickel(II) chloride Inorganic materials 0.000 description 1
- 239000005642 Oleic acid Substances 0.000 description 1
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 1
- 235000021314 Palmitic acid Nutrition 0.000 description 1
- 235000021319 Palmitoleic acid Nutrition 0.000 description 1
- JVTMTFMMMHAPCR-UBHSHLNASA-N Phe-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JVTMTFMMMHAPCR-UBHSHLNASA-N 0.000 description 1
- MDHZEOMXGNBSIL-DLOVCJGASA-N Phe-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MDHZEOMXGNBSIL-DLOVCJGASA-N 0.000 description 1
- YRKFKTQRVBJYLT-CQDKDKBSSA-N Phe-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 YRKFKTQRVBJYLT-CQDKDKBSSA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- JNRFYJZCMHHGMH-UBHSHLNASA-N Phe-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JNRFYJZCMHHGMH-UBHSHLNASA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- PLNHHOXNVSYKOB-JYJNAYRXSA-N Phe-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N PLNHHOXNVSYKOB-JYJNAYRXSA-N 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- DHZOGDVYRQOGAC-BZSNNMDCSA-N Phe-Cys-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DHZOGDVYRQOGAC-BZSNNMDCSA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 1
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- SZYBZVANEAOIPE-UBHSHLNASA-N Phe-Met-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SZYBZVANEAOIPE-UBHSHLNASA-N 0.000 description 1
- OHIYMVFLQXTZAW-UFYCRDLUSA-N Phe-Met-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OHIYMVFLQXTZAW-UFYCRDLUSA-N 0.000 description 1
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 1
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 1
- GZGPMBKUJDRICD-ULQDDVLXSA-N Phe-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O GZGPMBKUJDRICD-ULQDDVLXSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- KDIIENQUNVNWHR-JYJNAYRXSA-N Pro-Arg-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KDIIENQUNVNWHR-JYJNAYRXSA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- RETPETNFPLNLRV-JYJNAYRXSA-N Pro-Asn-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O RETPETNFPLNLRV-JYJNAYRXSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 1
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 1
- SOACYAXADBWDDT-CYDGBPFRSA-N Pro-Ile-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SOACYAXADBWDDT-CYDGBPFRSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 1
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- DSGSTPRKNYHGCL-JYJNAYRXSA-N Pro-Phe-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DSGSTPRKNYHGCL-JYJNAYRXSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- JRBWMRUPXWPEID-JYJNAYRXSA-N Pro-Trp-Cys Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CS)C(=O)O)C(=O)[C@@H]1CCCN1 JRBWMRUPXWPEID-JYJNAYRXSA-N 0.000 description 1
- FYXCBXDAMPEHIQ-FHWLQOOXSA-N Pro-Trp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCCCN)C(=O)O FYXCBXDAMPEHIQ-FHWLQOOXSA-N 0.000 description 1
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- SHTKRJHDMNSKRM-ULQDDVLXSA-N Pro-Tyr-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O SHTKRJHDMNSKRM-ULQDDVLXSA-N 0.000 description 1
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 241000192142 Proteobacteria Species 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 241000589157 Rhizobiales Species 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- CTLVSHXLRVEILB-UBHSHLNASA-N Ser-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N CTLVSHXLRVEILB-UBHSHLNASA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 1
- GYXVUTAOICLGKJ-ACZMJKKPSA-N Ser-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N GYXVUTAOICLGKJ-ACZMJKKPSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 1
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- STIAINRLUUKYKM-WFBYXXMGSA-N Ser-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 STIAINRLUUKYKM-WFBYXXMGSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- HAUVENOGHPECML-BPUTZDHNSA-N Ser-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 HAUVENOGHPECML-BPUTZDHNSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 241001522306 Serinus serinus Species 0.000 description 1
- 235000021355 Stearic acid Nutrition 0.000 description 1
- 229910000831 Steel Inorganic materials 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- JZRWCGZRTZMZEH-UHFFFAOYSA-N Thiamine Natural products CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N JZRWCGZRTZMZEH-UHFFFAOYSA-N 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- APIQKJYZDWVOCE-VEVYYDQMSA-N Thr-Asp-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O APIQKJYZDWVOCE-VEVYYDQMSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 1
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 1
- UDNVOQMPQBEITB-MEYUZBJRSA-N Thr-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UDNVOQMPQBEITB-MEYUZBJRSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 1
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 1
- PUEWAXRPXOEQOW-HJGDQZAQSA-N Thr-Met-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O PUEWAXRPXOEQOW-HJGDQZAQSA-N 0.000 description 1
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 1
- QFEYTTHKPSOFLV-OSUNSFLBSA-N Thr-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H]([C@@H](C)O)N QFEYTTHKPSOFLV-OSUNSFLBSA-N 0.000 description 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 1
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 1
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- DKNYWNPPSZCWCJ-GBALPHGKSA-N Thr-Trp-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N)O DKNYWNPPSZCWCJ-GBALPHGKSA-N 0.000 description 1
- XEVHXNLPUBVQEX-DVJZZOLTSA-N Thr-Trp-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N)O XEVHXNLPUBVQEX-DVJZZOLTSA-N 0.000 description 1
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 1
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- UTQBQJNSNXJNIH-IHPCNDPISA-N Trp-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N UTQBQJNSNXJNIH-IHPCNDPISA-N 0.000 description 1
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 description 1
- LHHDBONOFZDWMW-AAEUAGOBSA-N Trp-Asp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LHHDBONOFZDWMW-AAEUAGOBSA-N 0.000 description 1
- JISIQDCOHJOOPU-WFBYXXMGSA-N Trp-Cys-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O JISIQDCOHJOOPU-WFBYXXMGSA-N 0.000 description 1
- VTHNLRXALGUDBS-BPUTZDHNSA-N Trp-Gln-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VTHNLRXALGUDBS-BPUTZDHNSA-N 0.000 description 1
- JLTQXEOXIJMCLZ-ZVZYQTTQSA-N Trp-Gln-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 JLTQXEOXIJMCLZ-ZVZYQTTQSA-N 0.000 description 1
- OENGVSDBQHHGBU-QEJZJMRPSA-N Trp-Glu-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OENGVSDBQHHGBU-QEJZJMRPSA-N 0.000 description 1
- PHNBFZBKLWEBJN-BPUTZDHNSA-N Trp-Glu-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PHNBFZBKLWEBJN-BPUTZDHNSA-N 0.000 description 1
- KDWZQYUTMJSYRJ-BHYGNILZSA-N Trp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O KDWZQYUTMJSYRJ-BHYGNILZSA-N 0.000 description 1
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 1
- WVHUFSCKCBQKJW-HKUYNNGSSA-N Trp-Gly-Tyr Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 WVHUFSCKCBQKJW-HKUYNNGSSA-N 0.000 description 1
- YTCNLMSUXPCFBW-SXNHZJKMSA-N Trp-Ile-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O YTCNLMSUXPCFBW-SXNHZJKMSA-N 0.000 description 1
- WKCFCVBOFKEVKY-HSCHXYMDSA-N Trp-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WKCFCVBOFKEVKY-HSCHXYMDSA-N 0.000 description 1
- FXHOCONKLLUOCF-WDSOQIARSA-N Trp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FXHOCONKLLUOCF-WDSOQIARSA-N 0.000 description 1
- NFVQCNMGJILYMI-SZMVWBNQSA-N Trp-Met-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NFVQCNMGJILYMI-SZMVWBNQSA-N 0.000 description 1
- BGWSLEYVITZIQP-DCPHZVHLSA-N Trp-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O BGWSLEYVITZIQP-DCPHZVHLSA-N 0.000 description 1
- WHJVRIBYQWHRQA-NQCBNZPSSA-N Trp-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 WHJVRIBYQWHRQA-NQCBNZPSSA-N 0.000 description 1
- XDQGKIMTRSVSBC-WDSOQIARSA-N Trp-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CNC2=CC=CC=C12 XDQGKIMTRSVSBC-WDSOQIARSA-N 0.000 description 1
- STKZKWFOKOCSLW-UMPQAUOISA-N Trp-Thr-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 STKZKWFOKOCSLW-UMPQAUOISA-N 0.000 description 1
- UOXPLPBMEPLZBW-WDSOQIARSA-N Trp-Val-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 UOXPLPBMEPLZBW-WDSOQIARSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 1
- XGEUYEOEZYFHRL-KKXDTOCCSA-N Tyr-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XGEUYEOEZYFHRL-KKXDTOCCSA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 1
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 1
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- RGYDQHBLMMAYNZ-IHRRRGAJSA-N Tyr-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N RGYDQHBLMMAYNZ-IHRRRGAJSA-N 0.000 description 1
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 1
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 1
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 1
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 1
- GZOCMHSZGGJBCX-ULQDDVLXSA-N Tyr-Lys-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O GZOCMHSZGGJBCX-ULQDDVLXSA-N 0.000 description 1
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 1
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- BCOBSVIZMQXKFY-KKUMJFAQSA-N Tyr-Ser-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BCOBSVIZMQXKFY-KKUMJFAQSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 1
- YMZYSCDRTXEOKD-IHPCNDPISA-N Tyr-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N YMZYSCDRTXEOKD-IHPCNDPISA-N 0.000 description 1
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 1
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 1
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 1
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 1
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- PFMAFMPJJSHNDW-ZKWXMUAHSA-N Val-Cys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PFMAFMPJJSHNDW-ZKWXMUAHSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 1
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- JPPXDMBGXJBTIB-ULQDDVLXSA-N Val-His-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N JPPXDMBGXJBTIB-ULQDDVLXSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- OEVFFOBAXHBXKM-HSHDSVGOSA-N Val-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N)O OEVFFOBAXHBXKM-HSHDSVGOSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- FPIPGXGPPPQFEQ-BOOMUCAASA-N Vitamin A Natural products OC/C=C(/C)\C=C\C=C(\C)/C=C/C1=C(C)CCCC1(C)C FPIPGXGPPPQFEQ-BOOMUCAASA-N 0.000 description 1
- 229930003451 Vitamin B1 Natural products 0.000 description 1
- 229930003779 Vitamin B12 Natural products 0.000 description 1
- 229930003471 Vitamin B2 Natural products 0.000 description 1
- 229930003537 Vitamin B3 Natural products 0.000 description 1
- 229930003268 Vitamin C Natural products 0.000 description 1
- MECHNRXZTMCUDQ-UHFFFAOYSA-N Vitamin D2 Natural products C1CCC2(C)C(C(C)C=CC(C)C(C)C)CCC2C1=CC=C1CC(O)CCC1=C MECHNRXZTMCUDQ-UHFFFAOYSA-N 0.000 description 1
- QYSXJUFSXHHAJI-XFEUOLMDSA-N Vitamin D3 Natural products C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@H](C)CCCC(C)C)=C/C=C1\C[C@@H](O)CCC1=C QYSXJUFSXHHAJI-XFEUOLMDSA-N 0.000 description 1
- 229930003427 Vitamin E Natural products 0.000 description 1
- 241000216454 Xanthobacter agilis Species 0.000 description 1
- 241001114033 Xanthobacter aminoxidans Species 0.000 description 1
- 241000589494 Xanthobacter autotrophicus Species 0.000 description 1
- 241001311561 Xanthobacter autotrophicus Py2 Species 0.000 description 1
- 241000589495 Xanthobacter flavus Species 0.000 description 1
- 241000589951 Xanthobacter viscosus Species 0.000 description 1
- XJLXINKUBYWONI-DQQFMEOOSA-N [[(2r,3r,4r,5r)-5-(6-aminopurin-9-yl)-3-hydroxy-4-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2s,3r,4s,5s)-5-(3-carbamoylpyridin-1-ium-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl phosphate Chemical compound NC(=O)C1=CC=C[N+]([C@@H]2[C@H]([C@@H](O)[C@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-DQQFMEOOSA-N 0.000 description 1
- 108010081404 acein-2 Proteins 0.000 description 1
- 238000002814 agar dilution Methods 0.000 description 1
- 108010028939 alanyl-alanyl-lysyl-alanine Proteins 0.000 description 1
- 108010069490 alanyl-glycyl-seryl-glutamic acid Proteins 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 230000007815 allergy Effects 0.000 description 1
- 235000020661 alpha-linolenic acid Nutrition 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 239000011609 ammonium molybdate Substances 0.000 description 1
- 235000018660 ammonium molybdate Nutrition 0.000 description 1
- APUPEJJSWDHEBO-UHFFFAOYSA-P ammonium molybdate Chemical compound [NH4+].[NH4+].[O-][Mo]([O-])(=O)=O APUPEJJSWDHEBO-UHFFFAOYSA-P 0.000 description 1
- 229940010552 ammonium molybdate Drugs 0.000 description 1
- 239000003674 animal food additive Substances 0.000 description 1
- 238000009635 antibiotic susceptibility testing Methods 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 1
- 238000004380 ashing Methods 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 230000001651 autotrophic effect Effects 0.000 description 1
- 230000000721 bacterilogical effect Effects 0.000 description 1
- 238000000498 ball milling Methods 0.000 description 1
- WDIHJSXYQDMJHN-UHFFFAOYSA-L barium chloride Chemical compound [Cl-].[Cl-].[Ba+2] WDIHJSXYQDMJHN-UHFFFAOYSA-L 0.000 description 1
- 229910001626 barium chloride Inorganic materials 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 108010062635 beta-lactotensin Proteins 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- KGBXLFKZBHKPEV-UHFFFAOYSA-N boric acid Chemical compound OB(O)O KGBXLFKZBHKPEV-UHFFFAOYSA-N 0.000 description 1
- 229910052796 boron Inorganic materials 0.000 description 1
- 238000002815 broth microdilution Methods 0.000 description 1
- 229910000019 calcium carbonate Inorganic materials 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 229910002091 carbon monoxide Inorganic materials 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- SGMZJAMFUVOLNK-UHFFFAOYSA-M choline chloride Chemical compound [Cl-].C[N+](C)(C)CCO SGMZJAMFUVOLNK-UHFFFAOYSA-M 0.000 description 1
- 229960003178 choline chloride Drugs 0.000 description 1
- 229910052804 chromium Inorganic materials 0.000 description 1
- 239000011651 chromium Substances 0.000 description 1
- SECPZKHBENQXJG-UHFFFAOYSA-N cis-palmitoleic acid Natural products CCCCCCC=CCCCCCCCC(O)=O SECPZKHBENQXJG-UHFFFAOYSA-N 0.000 description 1
- GVPFVAHMJGGAJG-UHFFFAOYSA-L cobalt dichloride Chemical compound [Cl-].[Cl-].[Co+2] GVPFVAHMJGGAJG-UHFFFAOYSA-L 0.000 description 1
- KTVIXTQDYHMGHF-UHFFFAOYSA-L cobalt(2+) sulfate Chemical compound [Co+2].[O-]S([O-])(=O)=O KTVIXTQDYHMGHF-UHFFFAOYSA-L 0.000 description 1
- AGVAZMGAQJOSFJ-WZHZPDAFSA-M cobalt(2+);[(2r,3s,4r,5s)-5-(5,6-dimethylbenzimidazol-1-yl)-4-hydroxy-2-(hydroxymethyl)oxolan-3-yl] [(2r)-1-[3-[(1r,2r,3r,4z,7s,9z,12s,13s,14z,17s,18s,19r)-2,13,18-tris(2-amino-2-oxoethyl)-7,12,17-tris(3-amino-3-oxopropyl)-3,5,8,8,13,15,18,19-octamethyl-2 Chemical compound [Co+2].N#[C-].[N-]([C@@H]1[C@H](CC(N)=O)[C@@]2(C)CCC(=O)NC[C@@H](C)OP(O)(=O)O[C@H]3[C@H]([C@H](O[C@@H]3CO)N3C4=CC(C)=C(C)C=C4N=C3)O)\C2=C(C)/C([C@H](C\2(C)C)CCC(N)=O)=N/C/2=C\C([C@H]([C@@]/2(CC(N)=O)C)CCC(N)=O)=N\C\2=C(C)/C2=N[C@]1(C)[C@@](C)(CC(N)=O)[C@@H]2CCC(N)=O AGVAZMGAQJOSFJ-WZHZPDAFSA-M 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 238000010411 cooking Methods 0.000 description 1
- 229910000365 copper sulfate Inorganic materials 0.000 description 1
- OXBLHERUFWYNTN-UHFFFAOYSA-M copper(I) chloride Chemical compound [Cu]Cl OXBLHERUFWYNTN-UHFFFAOYSA-M 0.000 description 1
- ORTQZVOHEJQUHG-UHFFFAOYSA-L copper(II) chloride Chemical compound Cl[Cu]Cl ORTQZVOHEJQUHG-UHFFFAOYSA-L 0.000 description 1
- ARUVKPQLZAKDPS-UHFFFAOYSA-L copper(II) sulfate Chemical compound [Cu+2].[O-][S+2]([O-])([O-])[O-] ARUVKPQLZAKDPS-UHFFFAOYSA-L 0.000 description 1
- RGZGHMSJVAQDQO-UHFFFAOYSA-L copper;selenate Chemical compound [Cu+2].[O-][Se]([O-])(=O)=O RGZGHMSJVAQDQO-UHFFFAOYSA-L 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 229960003067 cystine Drugs 0.000 description 1
- 231100000135 cytotoxicity Toxicity 0.000 description 1
- 238000002784 cytotoxicity assay Methods 0.000 description 1
- 231100000263 cytotoxicity test Toxicity 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000003113 dilution method Methods 0.000 description 1
- 229910001882 dioxygen Inorganic materials 0.000 description 1
- ZPWVASYFFYYZEW-UHFFFAOYSA-L dipotassium hydrogen phosphate Chemical compound [K+].[K+].OP([O-])([O-])=O ZPWVASYFFYYZEW-UHFFFAOYSA-L 0.000 description 1
- 229910000396 dipotassium phosphate Inorganic materials 0.000 description 1
- 235000019797 dipotassium phosphate Nutrition 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- BVTBRVFYZUCAKH-UHFFFAOYSA-L disodium selenite Chemical compound [Na+].[Na+].[O-][Se]([O-])=O BVTBRVFYZUCAKH-UHFFFAOYSA-L 0.000 description 1
- 238000002036 drum drying Methods 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 238000005868 electrolysis reaction Methods 0.000 description 1
- 229960002061 ergocalciferol Drugs 0.000 description 1
- 238000002481 ethanol extraction Methods 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 230000008020 evaporation Effects 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 235000019197 fats Nutrition 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 229960002089 ferrous chloride Drugs 0.000 description 1
- 239000011790 ferrous sulphate Substances 0.000 description 1
- 235000003891 ferrous sulphate Nutrition 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 235000013312 flour Nutrition 0.000 description 1
- 229960000304 folic acid Drugs 0.000 description 1
- 235000019152 folic acid Nutrition 0.000 description 1
- 239000011724 folic acid Substances 0.000 description 1
- 239000002778 food additive Substances 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- WIGCFUFOHFEKBI-UHFFFAOYSA-N gamma-tocopherol Natural products CC(C)CCCC(C)CCCC(C)CCCC1CCC2C(C)C(O)C(C)C(C)C2O1 WIGCFUFOHFEKBI-UHFFFAOYSA-N 0.000 description 1
- 230000008570 general process Effects 0.000 description 1
- 231100000097 genotoxicity assay Toxicity 0.000 description 1
- 238000002746 genotoxicity assay Methods 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 230000005484 gravity Effects 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- NMCUIPGRVMDVDB-UHFFFAOYSA-L iron dichloride Chemical compound Cl[Fe]Cl NMCUIPGRVMDVDB-UHFFFAOYSA-L 0.000 description 1
- RBTARNINKXHZNM-UHFFFAOYSA-K iron trichloride Chemical compound Cl[Fe](Cl)Cl RBTARNINKXHZNM-UHFFFAOYSA-K 0.000 description 1
- BAUYGSIQEAFULO-UHFFFAOYSA-L iron(2+) sulfate (anhydrous) Chemical compound [Fe+2].[O-]S([O-])(=O)=O BAUYGSIQEAFULO-UHFFFAOYSA-L 0.000 description 1
- 229910000359 iron(II) sulfate Inorganic materials 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 description 1
- 108010043612 kentsin Proteins 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 239000004310 lactic acid Substances 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 235000020778 linoleic acid Nutrition 0.000 description 1
- OYHQOLUKZRVURQ-IXWMQOLASA-N linoleic acid Natural products CCCCC\C=C/C\C=C\CCCCCCCC(O)=O OYHQOLUKZRVURQ-IXWMQOLASA-N 0.000 description 1
- 229960004488 linolenic acid Drugs 0.000 description 1
- KWGKDLIKAYFUFQ-UHFFFAOYSA-M lithium chloride Chemical compound [Li+].[Cl-] KWGKDLIKAYFUFQ-UHFFFAOYSA-M 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 239000011565 manganese chloride Substances 0.000 description 1
- 235000002867 manganese chloride Nutrition 0.000 description 1
- 229940099607 manganese chloride Drugs 0.000 description 1
- 229940099596 manganese sulfate Drugs 0.000 description 1
- 239000011702 manganese sulphate Substances 0.000 description 1
- 235000007079 manganese sulphate Nutrition 0.000 description 1
- WPBNNNQJVZRUHP-UHFFFAOYSA-L manganese(2+);methyl n-[[2-(methoxycarbonylcarbamothioylamino)phenyl]carbamothioyl]carbamate;n-[2-(sulfidocarbothioylamino)ethyl]carbamodithioate Chemical compound [Mn+2].[S-]C(=S)NCCNC([S-])=S.COC(=O)NC(=S)NC1=CC=CC=C1NC(=S)NC(=O)OC WPBNNNQJVZRUHP-UHFFFAOYSA-L 0.000 description 1
- SQQMAOCOWKFBNP-UHFFFAOYSA-L manganese(II) sulfate Chemical compound [Mn+2].[O-]S([O-])(=O)=O SQQMAOCOWKFBNP-UHFFFAOYSA-L 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 235000013372 meat Nutrition 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 235000021281 monounsaturated fatty acids Nutrition 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- WQEPLUUGTLDZJY-UHFFFAOYSA-N n-Pentadecanoic acid Natural products CCCCCCCCCCCCCCC(O)=O WQEPLUUGTLDZJY-UHFFFAOYSA-N 0.000 description 1
- QMMRZOWCJAIUJA-UHFFFAOYSA-L nickel dichloride Chemical compound Cl[Ni]Cl QMMRZOWCJAIUJA-UHFFFAOYSA-L 0.000 description 1
- LGQLOGILCSXPEA-UHFFFAOYSA-L nickel sulfate Chemical compound [Ni+2].[O-]S([O-])(=O)=O LGQLOGILCSXPEA-UHFFFAOYSA-L 0.000 description 1
- 229910000363 nickel(II) sulfate Inorganic materials 0.000 description 1
- 235000001968 nicotinic acid Nutrition 0.000 description 1
- 239000011664 nicotinic acid Substances 0.000 description 1
- DFPAKSUCGFBDDF-UHFFFAOYSA-N nicotinic acid amide Natural products NC(=O)C1=CC=CN=C1 DFPAKSUCGFBDDF-UHFFFAOYSA-N 0.000 description 1
- 150000002823 nitrates Chemical class 0.000 description 1
- 231100001083 no cytotoxicity Toxicity 0.000 description 1
- 239000002417 nutraceutical Substances 0.000 description 1
- 235000021436 nutraceutical agent Nutrition 0.000 description 1
- OQCDKBAXFALNLD-UHFFFAOYSA-N octadecanoic acid Natural products CCCCCCCC(C)CCCCCCCCC(O)=O OQCDKBAXFALNLD-UHFFFAOYSA-N 0.000 description 1
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 1
- 235000020660 omega-3 fatty acid Nutrition 0.000 description 1
- 229940012843 omega-3 fatty acid Drugs 0.000 description 1
- 239000006014 omega-3 oil Substances 0.000 description 1
- 235000020665 omega-6 fatty acid Nutrition 0.000 description 1
- 229940033080 omega-6 fatty acid Drugs 0.000 description 1
- 125000001477 organic nitrogen group Chemical group 0.000 description 1
- 238000000643 oven drying Methods 0.000 description 1
- 230000010627 oxidative phosphorylation Effects 0.000 description 1
- 229940055726 pantothenic acid Drugs 0.000 description 1
- 235000019161 pantothenic acid Nutrition 0.000 description 1
- 239000011713 pantothenic acid Substances 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 235000020777 polyunsaturated fatty acids Nutrition 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- 239000004323 potassium nitrate Substances 0.000 description 1
- 235000010333 potassium nitrate Nutrition 0.000 description 1
- 229910000160 potassium phosphate Inorganic materials 0.000 description 1
- 235000011009 potassium phosphates Nutrition 0.000 description 1
- 235000013406 prebiotics Nutrition 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 239000006041 probiotic Substances 0.000 description 1
- 235000018291 probiotics Nutrition 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- RADKZDMFGJYCBB-UHFFFAOYSA-N pyridoxal hydrochloride Natural products CC1=NC=C(CO)C(C=O)=C1O RADKZDMFGJYCBB-UHFFFAOYSA-N 0.000 description 1
- 235000008160 pyridoxine Nutrition 0.000 description 1
- 239000011677 pyridoxine Substances 0.000 description 1
- ZUFQODAHGAHPFQ-UHFFFAOYSA-N pyridoxine hydrochloride Chemical compound Cl.CC1=NC=C(CO)C(CO)=C1O ZUFQODAHGAHPFQ-UHFFFAOYSA-N 0.000 description 1
- 235000019171 pyridoxine hydrochloride Nutrition 0.000 description 1
- 239000011764 pyridoxine hydrochloride Substances 0.000 description 1
- 229960004172 pyridoxine hydrochloride Drugs 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 101150079601 recA gene Proteins 0.000 description 1
- 238000006479 redox reaction Methods 0.000 description 1
- 235000021067 refined food Nutrition 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 235000019192 riboflavin Nutrition 0.000 description 1
- 239000002151 riboflavin Substances 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 239000013535 sea water Substances 0.000 description 1
- 238000004062 sedimentation Methods 0.000 description 1
- 238000011218 seed culture Methods 0.000 description 1
- 229940091258 selenium supplement Drugs 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 235000017557 sodium bicarbonate Nutrition 0.000 description 1
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 1
- 229910000029 sodium carbonate Inorganic materials 0.000 description 1
- CMZUMMUJMWNLFH-UHFFFAOYSA-N sodium metavanadate Chemical compound [Na+].[O-][V](=O)=O CMZUMMUJMWNLFH-UHFFFAOYSA-N 0.000 description 1
- 239000011684 sodium molybdate Substances 0.000 description 1
- 235000015393 sodium molybdate Nutrition 0.000 description 1
- TVXXNOYZHKPKGW-UHFFFAOYSA-N sodium molybdate (anhydrous) Chemical compound [Na+].[Na+].[O-][Mo]([O-])(=O)=O TVXXNOYZHKPKGW-UHFFFAOYSA-N 0.000 description 1
- 239000011655 sodium selenate Substances 0.000 description 1
- 235000018716 sodium selenate Nutrition 0.000 description 1
- 229960001881 sodium selenate Drugs 0.000 description 1
- 239000011781 sodium selenite Substances 0.000 description 1
- 235000015921 sodium selenite Nutrition 0.000 description 1
- 229960001471 sodium selenite Drugs 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 238000001694 spray drying Methods 0.000 description 1
- 239000008117 stearic acid Substances 0.000 description 1
- 239000010959 steel Substances 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 235000019157 thiamine Nutrition 0.000 description 1
- KYMBYSLLVAOCFI-UHFFFAOYSA-N thiamine Chemical compound CC1=C(CCO)SCN1CC1=CN=C(C)N=C1N KYMBYSLLVAOCFI-UHFFFAOYSA-N 0.000 description 1
- 239000011721 thiamine Substances 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- HPGGPRDJHPYFRM-UHFFFAOYSA-J tin(iv) chloride Chemical compound Cl[Sn](Cl)(Cl)Cl HPGGPRDJHPYFRM-UHFFFAOYSA-J 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 238000002723 toxicity assay Methods 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- MCYXWOCTIYEQRK-UHFFFAOYSA-K tripotassium dioxido-sulfanylidene-sulfido-lambda5-phosphane Chemical compound [K+].[K+].[K+].[O-]P([O-])([S-])=S MCYXWOCTIYEQRK-UHFFFAOYSA-K 0.000 description 1
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010032276 tyrosyl-glutamyl-tyrosyl-glutamic acid Proteins 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 101150005573 uvrA gene Proteins 0.000 description 1
- 238000001291 vacuum drying Methods 0.000 description 1
- 238000003828 vacuum filtration Methods 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 235000019155 vitamin A Nutrition 0.000 description 1
- 239000011719 vitamin A Substances 0.000 description 1
- 235000010374 vitamin B1 Nutrition 0.000 description 1
- 239000011691 vitamin B1 Substances 0.000 description 1
- 235000019163 vitamin B12 Nutrition 0.000 description 1
- 239000011715 vitamin B12 Substances 0.000 description 1
- 235000019164 vitamin B2 Nutrition 0.000 description 1
- 239000011716 vitamin B2 Substances 0.000 description 1
- 235000019160 vitamin B3 Nutrition 0.000 description 1
- 239000011708 vitamin B3 Substances 0.000 description 1
- 235000019158 vitamin B6 Nutrition 0.000 description 1
- 239000011726 vitamin B6 Substances 0.000 description 1
- 235000019154 vitamin C Nutrition 0.000 description 1
- 239000011718 vitamin C Substances 0.000 description 1
- 235000001892 vitamin D2 Nutrition 0.000 description 1
- 239000011653 vitamin D2 Substances 0.000 description 1
- MECHNRXZTMCUDQ-RKHKHRCZSA-N vitamin D2 Chemical compound C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@H](C)/C=C/[C@H](C)C(C)C)=C\C=C1\C[C@@H](O)CCC1=C MECHNRXZTMCUDQ-RKHKHRCZSA-N 0.000 description 1
- 235000005282 vitamin D3 Nutrition 0.000 description 1
- 239000011647 vitamin D3 Substances 0.000 description 1
- QYSXJUFSXHHAJI-YRZJJWOYSA-N vitamin D3 Chemical compound C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@H](C)CCCC(C)C)=C\C=C1\C[C@@H](O)CCC1=C QYSXJUFSXHHAJI-YRZJJWOYSA-N 0.000 description 1
- 235000019165 vitamin E Nutrition 0.000 description 1
- 239000011709 vitamin E Substances 0.000 description 1
- 229940046009 vitamin E Drugs 0.000 description 1
- 229940045997 vitamin a Drugs 0.000 description 1
- 229940021056 vitamin d3 Drugs 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
- 239000011592 zinc chloride Substances 0.000 description 1
- 235000005074 zinc chloride Nutrition 0.000 description 1
- 229960001763 zinc sulfate Drugs 0.000 description 1
- 239000011686 zinc sulphate Substances 0.000 description 1
- 235000009529 zinc sulphate Nutrition 0.000 description 1
Abstract
Description
ОБЛАСТЬ ТЕХНИКИ, К КОТОРОЙ ОТНОСИТСЯ ИЗОБРЕТЕНИЕFIELD OF TECHNOLOGY TO WHICH THE INVENTION RELATES
Настоящее изобретение относится к получению белка и/или других макромолекул с использованием микроорганизмов. В частности, настоящее изобретение относится к новым бактериальным штаммам и способам непрерывного культивирования для получения белка или биомассы с использованием бактерий, при которых в клетки поступают газы и минеральные вещества. Настоящее изобретение также относится к продуктам этих способов и применению этих продуктов, например, в пищевых продуктах или кормах.The present invention relates to the production of protein and/or other macromolecules using microorganisms. In particular, the present invention relates to new bacterial strains and methods of continuous cultivation for the production of protein or biomass using bacteria, in which gases and minerals are introduced into the cells. The present invention also relates to products of these methods and the use of these products, for example in food or feed.
УРОВЕНЬ ТЕХНИКИ ИЗОБРЕТЕНИЯBACKGROUND OF THE INVENTION
Растущее население мира, изменение климата и нехватка воды все чаще представляют угрозу для традиционного сельского хозяйства и, следовательно, для достаточного снабжения продовольствием и кормами. Поэтому исследуются альтернативные источники органических молекул, таких как белки. Потенциальной альтернативой является получение единичных клеток, то есть получение белка и/или других макромолекул с использованием микроорганизмов.A growing world population, climate change and water scarcity increasingly pose a threat to traditional agriculture and therefore to an adequate supply of food and feed. Therefore, alternative sources of organic molecules such as proteins are being explored. A potential alternative is the production of single cells, that is, the production of protein and/or other macromolecules using microorganisms.
Описаны хемоавтотрофные микроорганизмы, способные расти на минимальной минеральной среде с газообразным водородом в качестве источника энергии и углекислым газом в качестве единственного источника углерода. Для обзора этих микроорганизмов см., например, Shively et al. (1998) Annu Rev Microbiol 52:191. В Международной патентной заявке WO2018144965 описаны различные микроорганизмы и биопроцессы для преобразования газообразных субстратов в высокобелковую биомассу. В работе Andersen et al. (1979) Biochim Biophys Acta 585:1-11 описаны мутантные штаммы Alcaligenes eutrophus, водородной бактерии, которая легко растет в гетеротрофных и автотрофных условиях. Были охарактеризованы мутанты с измененной активностью рибулозо-1,5-бисфосфаткарбоксилазы/оксигеназы (rubisco). В работе Ohmiya et al. (2003) J. Biosci. Bioeng. 95:549-561 описано применение микробных генов для утилизации неразлагаемой биомассы. В работе Yu Jian et al. (2013) Int J Hydrogen Ener 38:8683-8690 описана фиксация диоксида углерода бактериальным изолятом, окисляющим водород. Высокая энергетическая эффективность 50% была измерена при умеренной концентрации кислорода (10 мол.%).Chemoautotrophic microorganisms are described that are capable of growing on a minimal mineral medium with hydrogen gas as an energy source and carbon dioxide as the only carbon source. For a review of these microorganisms, see, for example, Shively et al. (1998) Annu Rev Microbiol 52:191. International Patent Application WO2018144965 describes various microorganisms and bioprocesses for converting gaseous substrates into high-protein biomass. The work of Andersen et al. (1979) Biochim Biophys Acta 585:1-11 describe mutant strains of Alcaligenes eutrophus , a hydrogen bacterium that grows readily in heterotrophic and autotrophic conditions. Mutants with altered ribulose-1,5-bisphosphate carboxylase/oxygenase (rubisco) activity have been characterized. The work of Ohmiya et al. (2003) J. Biosci. Bioeng. 95:549-561 describes the use of microbial genes for the utilization of non-degradable biomass. In the work of Yu Jian et al. (2013) Int J Hydrogen Ener 38:8683–8690 described carbon dioxide fixation by a hydrogen-oxidizing bacterial isolate. A high energy efficiency of 50% was measured at a moderate oxygen concentration (10 mol%).
Однако различные хемоавтотрофные микроорганизмы обладают различными свойствами с точки зрения скорости роста, выхода, состава биомассы, а также свойств, связанных с использованием в качестве пищевого ингредиента, таких как безопасность для потребления человеком, вкус, запах, ощущение во рту, технические и функциональные свойства в приготовление пищи и т.д. Не каждый хемоавтотрофный микроорганизм имеет достаточную скорость роста и обеспечивает достаточный выход, и не каждый способ можно реально масштабировать до экономически жизнеспособного крупномасштабного процесса. Чтобы иметь достаточный выход функционального белка, т.е. для пищевых продуктов или кормов важно найти подходящий продуцирующий организм и подходящий способ, который можно было бы осуществлять в больших масштабах. Эта потребность решается настоящим изобретением.However, different chemoautotrophic microorganisms have different properties in terms of growth rate, yield, biomass composition, as well as properties associated with use as a food ingredient, such as safety for human consumption, taste, smell, mouthfeel, technical and functional properties in cooking, etc. Not every chemoautotrophic microorganism has a sufficient growth rate and provides sufficient yield, and not every method can be realistically scaled up to an economically viable large-scale process. To have sufficient yield of functional protein, i.e. for food or feed, it is important to find a suitable production organism and a suitable process that can be carried out on a large scale. This need is addressed by the present invention.
СУЩНОСТЬ ИЗОБРЕТЕНИЯSUMMARY OF THE INVENTION
В первом основном аспекте настоящее изобретение относится к выделенному бактериальному штамму VTT-E-193585 или его производному.In a first main aspect, the present invention relates to the isolated bacterial strain VTT-E-193585 or a derivative thereof.
В дополнительных аспектах настоящее изобретение относится к культуре, содержащей бактериальный штамм настоящего изобретения или его производное. Кроме того, настоящее изобретение относится к способу получения биомассы и/или белка, где указанный способ включает в себя культивирование бактериального штамма настоящего изобретения или его производного.In additional aspects, the present invention relates to a culture containing a bacterial strain of the present invention or a derivative thereof. In addition, the present invention relates to a method for producing biomass and/or protein, where the method includes cultivating a bacterial strain of the present invention or a derivative thereof.
В ещё одном аспекте настоящее изобретение относится к способу получения биомассы и/или белка, где указанный способ включает в себя культивирование бактериального штамма рода Xanthobacter в непрерывной культуре с водородом в качестве источника энергии и источником неорганического углерода, где источник неорганического углерода включает в себя двуокись углерода.In yet another aspect, the present invention relates to a method for producing biomass and/or protein, wherein the method includes cultivating a bacterial strain of the genus Xanthobacter in a continuous culture with hydrogen as an energy source and a source of inorganic carbon, wherein the inorganic carbon source includes carbon dioxide .
В дополнительных основных аспектах настоящее изобретение относится к белку в виде большого количества, биомассе или небелковым клеточным или химическим компонентам, полученным или получаемым способом настоящего изобретению, и к пищевому продукту или кормовому продукту, полученному или получаемому способом настоящего изобретения.In additional basic aspects, the present invention relates to bulk protein, biomass, or non-protein cellular or chemical components produced or produced by the process of the present invention, and to a food or feed product produced or produced by the method of the present invention.
КРАТКОЕ ОПИСАНИЕ ЧЕРТЕЖЕЙBRIEF DESCRIPTION OF THE DRAWINGS
Фиг. 1. Оптическая плотность, измеренная при длине волны 600 нм (черные кружки), и показания датчика оптической плотности во время хемоавтотрофного культивирования в объёме 200 литров выделенного бактериального штамма, депонированного как VTT-E-193585. Fig. 1 . Optical density measured at a wavelength of 600 nm (black circles) and optical density sensor readings during chemoautotrophic cultivation in a volume of 200 liters of the isolated bacterial strain, deposited as VTT-E-193585.
Фиг. 2. Оптическая плотность, измеренная при длине волны 600 нм во время параллельного хемоавтотрофного культивирования в объёме 200 мл выделенного бактериального штамма, депонированного как VTT-E-193585, на различных источниках азота. Fig. 2 . Optical density measured at a wavelength of 600 nm during parallel chemoautotrophic cultivation in a volume of 200 ml of the isolated bacterial strain deposited as VTT-E-193585 on various nitrogen sources.
ПОДРОБНОЕ ОПИСАНИЕ ИЗОБРЕТЕНИЯDETAILED DESCRIPTION OF THE INVENTION
ОпределенияDefinitions
Используемый здесь термин «выделенный», например, в контексте штамма означает выделенный из его естественной среды. Предпочтительно выделенный штамм является чистым, т.е. свободным от других штаммов.As used herein, the term “isolated,” for example, in the context of a strain means isolated from its natural environment. Preferably, the isolated strain is pure, i.e. free from other strains.
Термин «производный», используемый здесь в контексте штамма, относится к штамму, который получен из эталонного штамма, т.е. создан с использованием эталонного штамма в качестве отправной точки. Например, генетически модифицированный или иным образом мутировавший или генетически модифицированный штамм представляет собой осуществление такого производного. Генетические модификации включают в себя точечные мутации, а также вставки или делеции, включая вставки или делеции целых локусов или их фрагментов. Производное предпочтительно имеет менее 10 генетических модификаций, т.е. менее 5, например 4, 3, 2 или 1 генетические модификации по сравнению с эталонным штаммом.The term "derivative" as used herein in the context of a strain refers to a strain that is derived from a reference strain, i.e. created using a reference strain as a starting point. For example, a genetically modified or otherwise mutated or genetically modified strain is an embodiment of such a derivative. Genetic modifications include point mutations and insertions or deletions, including insertions or deletions of entire loci or fragments thereof. The derivative preferably has less than 10 genetic modifications, i.e. less than 5, for example 4, 3, 2 or 1 genetic modifications compared to the reference strain.
Используемый здесь термин «культура» относится к суспензии жизнеспособных клеток в жидкой среде.As used herein, the term “culture” refers to a suspension of viable cells in a liquid medium.
Термин «биомасса» имеет свое обычное значение в области бактериальной ферментации и относится к клеточному материалу.The term "biomass" has its usual meaning in the field of bacterial fermentation and refers to cellular material.
Используемый здесь термин «непрерывное культивирование» относится к процессу культивирования, при котором свежая среда непрерывно добавляется к культуре, а среда с бактериальной культурой непрерывно удаляется практически с одинаковой скоростью.As used herein, the term "continuous culture" refers to a culture process in which fresh media is continuously added to the culture and bacterial culture media is continuously removed at substantially the same rate.
Аспекты и варианты осуществления настоящего изобретенияAspects and Embodiments of the Present Invention
В первом основном аспекте настоящее изобретение относится к выделенному бактериальному штамму VTT-E-193585 или его производному.In a first main aspect, the present invention relates to the isolated bacterial strain VTT-E-193585 or a derivative thereof.
Штамм VTT-E-193585 был выделен с побережья Балтийского моря в Наантали, Финляндия. Этот организм способен расти в подходящих условиях биореактора с минимальной минеральной средой с водородом в качестве источника энергии и углекислым газом в качестве источника углерода в условиях ограниченного содержания кислорода. Секвенирование 16S и метагеномное секвенирование Illumina показали, что этот штамм, скорее всего, относится к роду Xanthobacter, но не является известным видом. Бактериальный штамм очень подходит для пищевых продуктов и кормов, поскольку порошок высушенных клеток имеет высокое содержание белка и содержит все незаменимые аминокислоты. Он также содержит больше ненасыщенных жирных кислот, чем насыщенных, и высокий уровень витаминов группы В. Он имеет низкие уровни пептидогликанов и липополисахаридов, которые могут вызывать аллергию или токсичность. Был проведен анализ токсичности, и для штамма не наблюдалось генотоксичности или цитотоксичности. Кроме того, штамм в целом чувствителен к антибиотикам.Strain VTT-E-193585 was isolated from the Baltic Sea coast in Naantali, Finland. This organism is capable of growing in suitable bioreactor conditions with a minimal mineral environment with hydrogen as an energy source and carbon dioxide as a carbon source under oxygen-limited conditions. 16S sequencing and Illumina metagenomic sequencing indicated that the strain most likely belongs to the genus Xanthobacter , but is not a known species. The bacterial strain is very suitable for food and feed because the dried cell powder has a high protein content and contains all the essential amino acids. It also contains more unsaturated fatty acids than saturated fatty acids and high levels of B vitamins. It has low levels of peptidoglycans and lipopolysaccharides, which can cause allergies or toxicity. A toxicity assay was performed and no genotoxicity or cytotoxicity was observed for the strain. In addition, the strain is generally sensitive to antibiotics.
Штамм VTT-E-193585 (SoF1) был депонирован 11 июня 2019 г. в Коллекции культур VTT в Центре технических исследований VTT в Финляндии, P.O. Box 1000, FI-02044 VTT, Финляндия, Международный депозитарный орган в соответствии с Будапештским договором. Дополнительная информация о характеристиках штамма и способах культивирования штамма представлена в приведенных здесь Примерах.Strain VTT-E-193585 (SoF1) was deposited on 11 June 2019 in the VTT Culture Collection at the VTT Technical Research Center in Finland, P.O. Box 1000, FI-02044 VTT, Finland, International Depository Authority under the Treaty of Budapest. Additional information regarding the characteristics of the strain and methods for culturing the strain are provided in the Examples provided herein.
В предпочтительном варианте осуществления изобретения, если штамм является производным штамма VTT-E-193585, то это производное сохраняет способность расти с использованием газообразного водорода в качестве источника энергии и диоксида углерода в качестве единственного источника углерода.In a preferred embodiment, if the strain is a derivative of strain VTT-E-193585, the derivative retains the ability to grow using hydrogen gas as an energy source and carbon dioxide as the sole carbon source.
В одном варианте осуществления изобретения, если штамм является производным штамма VTT-E-193585, то это производное содержит 16S рибосомальную РНК, указанную в SEQ ID NO:1, или 16S рибосомальную РНК, имеющую до 20 нуклеотидных различий с SEQ ID NO:1, например от 1 до 10, например от 1 до 5, например одно, два или три нуклеотидных отличия с SEQ ID NO:1.In one embodiment of the invention, if the strain is a derivative of strain VTT-E-193585, then this derivative contains 16S ribosomal RNA specified in SEQ ID NO:1, or 16S ribosomal RNA having up to 20 nucleotide differences from SEQ ID NO:1, eg 1 to 10, eg 1 to 5, eg one, two or three nucleotide differences of SEQ ID NO:1.
SEQ ID NO:1. Последовательность 16S рибосомальной РНК штамма VTT-E-193585:SEQ ID NO:1. Sequence of 16S ribosomal RNA of strain VTT-E-193585:
CTTGAGAGTTTGATCCTGGCTCAGAGCGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAGCGCCCAGCAATGGGAGCGGCAGACGGGTGAGTAACGCGTGGGGATGTGCCCAATGGTACGGAATAACCCAGGGAAACTTGGACTAATACCGTATGAGCCCTTCGGGGGAAAGATTTATCGCCATTGGATCAACCCGCGTCTGATTAGCTAGTTGGTGGGGTAACGGCCCACCAAGGCGACGATCAGTAGCTGGTCTGAGAGGATGATCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATATTGGACAATGGGCGCAAGCCTGATCCAGCCATGCCGCGTGTGTGATGAAGGCCTTAGGGTTGTAAAGCACTTTCGCCGGTGAAGATAATGACGGTAACCGGAGAAGAAGCCCCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGGGCTAGCGTTGCTCGGAATCACTGGGCGTAAAGCGCACGTAGGCGGATCGTTAAGTCAGGGGTGAAATCCTGGAGCTCAACTCCAGAACTGCCCTTGATACTGGCGACCTTGAGTTCGAGAGAGGTTGGTGGAACTGCGAGTGTAGAGGTGAAATTCGTAGATATTCGCAAGAACACCAGTGGCGAAGGCGGCCAACTGGCTCGATACTGACGCTGAGGTGCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGGATGCTAGCCGTTGGGCAGCTTGCTGTTCAGTGGCGCAGCTAACGCATTAAGCATCCCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTGGTTTAATTCGAAGCAACGCGCAGAACCTTACCAGCCTTTGACATGGCAGGACGATTTCCAGAGATGGATCTCTTCCAGCAATGGACCTGCACACAGGTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGCCTCTAGTTGCCAGCATTCAGTTGGGCACTCTAGAGGGACTGCCGGTGATAAGCCGAGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTACGGGCTGGGCTACACACGTGCTACAATGGTGGTGACAGTGGGATGCGAAAGGGCGACCTCTAGCAAATCTCCAAAAGCCATCTCAGTTCGGATTGTACTCTGCAACTCGAGTGCATGAAGTTGGAATCGCTAGTAATCGTGGATCAGCATGCCACGGTGAATACGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGGAGTTGGCTTTACCCGAAGGCGCTGCGCTAACCCGCAAGGGAGGCAGGCGACCACGGTAGGGTCAGCGACTGGGGTGAAGTCGTAACAAGGTAGCCGTAGGGGAACCTGCGGCTGGATCACCTCCTTTCTTGAGAGTTTGATCCTGGCTCAGAGCGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAGCGCCCAGCAATGGGAGCGGCAGACGGGTGAGTAACGCGTGGGGATGTGCCCAATGGTACGGAATAACCCAGGGAAACTTGGACTAATACCGTATGAGCCCTTCGGGGGAAAGATTTATCGCCATTGGATCAACCCGCGTCTGATTAGCTAGTTGGTGGGGTAACGGCCCACCAAGGCG ACGATCAGTAGCTGGTCTGAGAGGATGATCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATATTGGACAATGGGCGCAAGCCTGATCCAGCCATGCCGCGTGTGTGATGAAGGCCTTAGGGTTGTAAAGCACTTTCGCCGGTGAAGATAATGACGGTAACCGGAGAAGAAGCCCCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGGGCT AGCGTTGCTCGGAATCACTGGGCGTAAAGCGCACGTAGGCGGATCGTTAAGTCAGGGGTGAAATCCTGGAGCTCAACTCCAGAACTGCCCTTGATACTGGCGACCTTGAGTTCGAGAGAGGTTGGTGGAACTGCGAGTGTAGAGGTGAAATTCGTAGATATTCGCAAGAACACCAGTGGCGAAGGCGGCCAACTGGCTCGATACTGACGCTGAGGTGCGAAAGCGTGGGGAGCAAACAGGATTAG ATACCCTGGTAGTCCACGCCGTAAACGATGGATGCTAGCCGTTGGGCAGCTTGCTGTTCAGTGGCGCAGCTAACGCATTAAGCATCCCGCCTGGGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTGGTTTAATTCGAAGCAACGCGCAGAACCTTACCAGCCTTTGACATGGCAGGACGATTTCCAGAGATGGATCTCTTCCAGCAAT GGACCTGCACACAGGTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGCCTCTAGTTGCCAGCATTCAGTTGGGCACTCTAGAGGGACTGCCGGTGATAAGCCGAGAGGAAGGTGGGGATGACGTCAAGTCCTCATGGCCCTTACGGGCTGGGCTACACACGTGCTACAATGGTGGTGACAGTGGGATGCGAAAGGGCGACC TCTAGCAAATCTCCAAAAGCCATCTCAGTTCGGATTGTACTCTGCAACTCGAGTGCATGAAGTTGGAATCGCTAGTAATCGTGGATCAGCATGCCACGGTGAATACGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGGAGTTGGCTTTACCCGAAGGCGCTGCGCTAACCCGCAAGGGAGGCAGGCGACCACGGTAGGGTCAGCGACTGGGGTGAAGTCGTAACAAGGTAGCCGTA GGGGAACCTGCGGCTGGATCACCTCCTTT
В ещё одном аспекте настоящее изобретение относится к культуре, содержащей бактериальный штамм настоящего изобретения или его производное. В предпочтительном варианте осуществления изобретения объём культуры составляет 100 мл или более, например 1 л или более, например 10 л или более, например 1000 л или более, например 10 000 л или более, например 50 000 л или более, например 100 000 л или более, например 200 000 л или более.In another aspect, the present invention relates to a culture containing a bacterial strain of the present invention or a derivative thereof. In a preferred embodiment of the invention, the culture volume is 100 ml or more, for example 1 L or more, for example 10 L or more, for example 1000 L or more, for example 10,000 L or more, for example 50,000 L or more, for example 100,000 L or more, for example 200,000 l or more.
В ещё одном аспекте настоящее изобретение относится к способу получения биомассы и/или белка, где указанный способ включает в себя культивирование бактериального штамма настоящего изобретения или его производного. В одном варианте осуществления изобретения способ предназначен для производства биомассы. В другом варианте осуществления изобретения способ предназначен для получения белка. В одном варианте осуществления изобретения способ включает в себя культивирование штамма в непрерывной культуре с водородом в качестве источника энергии и источником неорганического углерода, где источник неорганического углерода включает в себя диоксид углерода. В дополнительном варианте осуществления изобретения способ предназначен для производства биомассы и включает в себя культивирование штамма в непрерывной культуре с водородом в качестве источника энергии и источником неорганического углерода, где источник неорганического углерода включает в себя диоксид углерода. Ниже описаны различные дополнительные варианты осуществления способа.In yet another aspect, the present invention relates to a method for producing biomass and/or protein, wherein the method includes cultivating a bacterial strain of the present invention or a derivative thereof. In one embodiment of the invention, the method is for the production of biomass. In another embodiment of the invention, the method is for producing protein. In one embodiment of the invention, the method includes cultivating the strain in a continuous culture with hydrogen as an energy source and an inorganic carbon source, wherein the inorganic carbon source includes carbon dioxide. In a further embodiment of the invention, the method is for producing biomass and includes cultivating the strain in a continuous culture with hydrogen as an energy source and an inorganic carbon source, wherein the inorganic carbon source includes carbon dioxide. Various additional embodiments of the method are described below.
В ещё одном основном аспекте настоящее изобретение относится к способу получения биомассы и/или белка, где указанный способ включает в себя культивирование бактериального штамма рода Xanthobacter в непрерывной культуре с водородом в качестве источника энергии и источником неорганического углерода, где источник неорганического углерода включает в себя двуокись углерода. В одном варианте осуществления изобретения способ предназначен для производства биомассы. В другом варианте осуществления изобретения способ предназначен для получения белка. Ниже описаны различные дополнительные варианты осуществления способа.In another basic aspect, the present invention relates to a method for producing biomass and/or protein, wherein the method comprises cultivating a bacterial strain of the genus Xanthobacter in a continuous culture with hydrogen as an energy source and a source of inorganic carbon, wherein the inorganic carbon source includes dioxide carbon. In one embodiment of the invention, the method is for the production of biomass. In another embodiment of the invention, the method is for producing protein. Various additional embodiments of the method are described below.
Судя по последовательности генома, штамм, депонированный под номером VTT-E-193585, скорее всего, использует цикл Кальвина-Бенсона-Бэсшема для фиксации углерода, в котором молекула углекислого газа соединяется с 5-углеродной цепью рибулозо-1,5-бисфосфата, образуя две молекулы глицерат-3-фосфата. Это позволяет штамму синтезировать все другие органические молекулы, необходимые для роста. Энергия водорода поступает в клетку, скорее всего, через NAD+-восстанавливающие гидрогеназы и/или NiFeSe-гидрогеназы. По сути, это окислительно-восстановительная реакция, в которой водород (H2) окисляется до H+, а NAD+ восстанавливается до NADH. Помимо АТФ, NADH является одним из основных переносчиков энергии внутри живых организмов. В качестве альтернативы, некоторый другой энергетический эквивалент восстанавливается другим ферментом гидрогеназы с использованием H2. Цикл Кальвина-Бенсона-Бэсшема требует энергии в виде АТФ и NADH/NADPH для связывания СО2. Штамм, скорее всего, генерирует АТФ посредством окислительного фосфорилирования, которое состоит из четырех белковых комплексов, создающих протонный градиент через мембрану. Протонный градиент генерируется в основном за счет энергии NADH. Протонный градиент управляет комплексом АТФ-синтазы, генерирующим АТФ. Судя по последовательности генома, штамм имеет бактериальную АТФ-синтазу F-типа.Based on the genome sequence, the strain deposited as VTT-E-193585 most likely uses the Calvin-Benson-Bassham cycle for carbon fixation, in which a carbon dioxide molecule combines with a 5-carbon chain of ribulose-1,5-bisphosphate to form two molecules of glycerate-3-phosphate. This allows the strain to synthesize all the other organic molecules needed for growth. Hydrogen energy enters the cell most likely through NAD + -reducing hydrogenases and/or NiFeSe hydrogenases. Essentially, it is a redox reaction in which hydrogen (H 2 ) is oxidized to H + and NAD + is reduced to NADH. In addition to ATP, NADH is one of the main energy carriers within living organisms. Alternatively, some other energy equivalent is reduced by another hydrogenase enzyme using H 2 . The Calvin-Benson-Bassham cycle requires energy in the form of ATP and NADH/NADPH to bind CO 2 . The strain most likely generates ATP through oxidative phosphorylation, which consists of four protein complexes that create a proton gradient across the membrane. The proton gradient is generated mainly by the energy of NADH. The proton gradient drives the ATP synthase complex that generates ATP. Judging by the genome sequence, the strain has an F-type bacterial ATP synthase.
Следует понимать, что когда указано, что способ включает в себя культивирование штамма с источником неорганического углерода, то источник неорганического углерода является основным источником углерода в культуре. Таким образом, в культуре могут присутствовать незначительные количества источников органического углерода, но основной метаболизм и рост культуры основаны на использовании источника неорганического углерода, предпочтительно двуокиси углерода, в качестве источника углерода. Предпочтительно доля углерода, подаваемого в культуру, которая является органической, составляет менее 5%, например менее 1%, например менее 0,1% всего углерода, поступающего в культуру в процессе реализации способа. Предпочтительно в процессе реализации способа не подаются никакие источники органического углерода.It should be understood that when a method is stated to involve cultivating a strain with an inorganic carbon source, the inorganic carbon source is the primary carbon source in the culture. Thus, trace amounts of organic carbon sources may be present in the crop, but the major metabolism and growth of the crop is based on the use of an inorganic carbon source, preferably carbon dioxide, as the carbon source. Preferably, the proportion of carbon supplied to the crop that is organic is less than 5%, for example less than 1%, for example less than 0.1% of the total carbon supplied to the crop during the process. Preferably, no sources of organic carbon are supplied during the implementation of the method.
Точно так же следует понимать, что когда указано, что способ включает в себя культивирование штамма с водородом (H2) в качестве источника энергии, то водород является основным источником энергии в культуре. Таким образом, в культуре могут присутствовать другие второстепенные источники энергии, такие как аммиак, который может являться источником азота, или небольшие количества органических соединений, но основной метаболизм и рост культуры основаны на использовании водорода в качестве источник энергии. В общем процессе водород предпочтительно получают электролизом воды; т.е. путем разделения воды электричеством на газообразный водород и кислород. Таким образом, газообразные водород и кислород подаются в биореактор из расположенного поблизости электролизёра. Альтернативно, электроды могут быть помещены внутрь биореактора для производства водорода и кислорода в биореакторе, а не в отдельном электролизёре.Likewise, it should be understood that when a method is stated to involve cultivating a strain with hydrogen (H 2 ) as an energy source, then hydrogen is the primary energy source in the culture. Thus, other minor energy sources may be present in the crop, such as ammonia, which can be a source of nitrogen, or small amounts of organic compounds, but the major metabolism and growth of the crop is based on the use of hydrogen as an energy source. In the general process, hydrogen is preferably produced by electrolysis of water; those. by splitting water into hydrogen and oxygen gases with electricity. Thus, hydrogen and oxygen gases are supplied to the bioreactor from a nearby electrolyzer. Alternatively, electrodes can be placed inside a bioreactor to produce hydrogen and oxygen in the bioreactor rather than in a separate electrolyser.
Источник неорганического углерода, содержащий диоксид углерода, может включать в себя другие источники неорганического углерода, такие как, например, монооксид углерода. В одном варианте осуществления изобретения в культуру подают только источники углерода в газообразной форме. В предпочтительном варианте осуществления изобретения диоксид углерода является единственным источником неорганического углерода и, действительно, единственным источником углерода, подаваемым в культуру. В одном варианте осуществления изобретения в культуру подаются только газы и минералы, а уровень диоксида углерода в подаваемом газе составляет от 10% до 50%, т.е. от 15% до 45%, например от 20% до 40%, например от 25% до 35%, например, от 26% до 30%.The inorganic carbon source containing carbon dioxide may include other inorganic carbon sources such as, for example, carbon monoxide. In one embodiment of the invention, only carbon sources in gaseous form are supplied to the crop. In a preferred embodiment of the invention, carbon dioxide is the only source of inorganic carbon and, indeed, the only source of carbon supplied to the crop. In one embodiment of the invention, only gases and minerals are supplied to the crop, and the level of carbon dioxide in the supplied gas is from 10% to 50%, i.e. from 15% to 45%, for example from 20% to 40%, for example from 25% to 35%, for example from 26% to 30%.
В другом варианте осуществления изобретения в культуру подают газы и минералы, а уровень водорода (H2) в подаваемом газе составляет от 30% до 80%, например от 35% до 75%, например от 40% до 70%, например от 45% до 65%, например от 50% до 60%.In another embodiment of the invention, gases and minerals are supplied to the crop and the level of hydrogen (H 2 ) in the supplied gas is from 30% to 80%, for example from 35% to 75%, for example from 40% to 70%, for example from 45% up to 65%, for example from 50% to 60%.
В другом варианте осуществления изобретения в культуру подают газы и минералы, а уровень кислорода (O2) в подаваемом газе составляет от 10% до 25%, например от 15% до 20%, например, от 16% до 18%. В другом варианте осуществления изобретения уровень подаваемого кислорода таков, что уровень растворенного кислорода в культуре поддерживается на уровне от 5% до 10%.In another embodiment of the invention, gases and minerals are supplied to the crop, and the level of oxygen (O 2 ) in the supplied gas is from 10% to 25%, for example from 15% to 20%, for example from 16% to 18%. In another embodiment of the invention, the level of oxygen supplied is such that the dissolved oxygen level in the culture is maintained at 5% to 10%.
В предпочтительном варианте осуществления изобретения в культуру подают только газы и минералы, и подаваемый газ содержит H2, CO2 и O2, где процентное содержание H2 составляет от 40% до 70%, процентное содержание CO2 составляет от 18% до 28% и процентное содержание O2 составляет от 12% до 22%.In a preferred embodiment of the invention, only gases and minerals are supplied to the crop, and the supplied gas contains H 2 , CO 2 and O 2 , where the percentage of H 2 is from 40% to 70%, the percentage of CO 2 is from 18% to 28% and the percentage of O 2 is from 12% to 22%.
Как правило, способ настоящего изобретения включает в себя добавление источника азота. Источник азота может быть, например, представлен в виде гидроксида аммония, соли аммония, такой как сульфат аммония или хлорид аммония, аммиака, мочевины или нитрата, например нитрата калия. В других вариантах осуществления изобретения в качестве источника азота используют газообразный азот (N2). В предпочтительном варианте осуществления изобретения источник азота представляет собой гидроксид аммония или соль аммония, такую как сульфат аммония.Typically, the method of the present invention involves adding a nitrogen source. The nitrogen source may, for example, be in the form of ammonium hydroxide, an ammonium salt such as ammonium sulfate or ammonium chloride, ammonia, urea or a nitrate such as potassium nitrate. In other embodiments of the invention, nitrogen gas (N 2 ) is used as the nitrogen source. In a preferred embodiment of the invention, the nitrogen source is ammonium hydroxide or an ammonium salt such as ammonium sulfate.
В одном варианте осуществления изобретения подаваемый источник азота представляет собой гидроксид аммония в концентрации от 100 мг/л до 10 г/л, например, от 250 мг/л до 4 г/л, например от 0,5 г/л до 2 г/л, например от 0,75 г/л до 1,5 г/л.In one embodiment of the invention, the supplied nitrogen source is ammonium hydroxide at a concentration of from 100 mg/L to 10 g/L, for example from 250 mg/L to 4 g/L, for example from 0.5 g/L to 2 g/ l, for example from 0.75 g/l to 1.5 g/l.
Как правило, способ настоящего изобретения включает в себя добавление минералов, таких как минералы, содержащие аммоний, фосфат, калий, натрий, ванадий, железо, сульфат, магний, кальций, молибден, марганец, бор, цинк, кобальт, селен, йод, медь и/или никель. Подходящие минеральные среды хорошо известны в данной области техники и, например, имеют, были описаны в Thermophilic Bacteria, CRC Press, Boca Raton, FL, Jacob K. Kristjansson, ed., 1992, например, на странице 87, в таблице 4.Typically, the method of the present invention involves the addition of minerals such as minerals containing ammonium, phosphate, potassium, sodium, vanadium, iron, sulfate, magnesium, calcium, molybdenum, manganese, boron, zinc, cobalt, selenium, iodine, copper and/or nickel. Suitable mineral media are well known in the art and, for example, have been described in Thermophilic Bacteria, CRC Press, Boca Raton, FL, Jacob K. Kristjansson, ed., 1992, for example, page 87, table 4.
В одном варианте осуществления изобретения добавленные минералы включают в себя один или несколько из следующих минералов: аммиак, аммоний (например, хлорид аммония (NH4Cl), сульфат аммония ((NH4)2SO4)), нитрат (например, нитрат калия (KNO3)), мочевина или источник органического азота; фосфат (например, динатрийфосфат (Na2HPO4), фосфат калия (KH2PO4), фосфорная кислота (H3PO4), дитиофосфат калия (K3PS2O2), ортофосфат калия (K3PO4), динатрийфосфат (Na2HPO4•2H2O), дикалийфосфат (K2HPO4) или монокалийфосфат (KH2PO4); сульфат; дрожжевой экстракт; хелатное железо (например, с ЭДТА или лимонной кислотой); калий (например, фосфат калия (KH2PO4), нитрат калия (KNO3), йодид калия (KI), бромид калия (KBr)); и другие неорганические соли, минералы и микроэлементы (например, хлорид натрия (NaCl), сульфат магния (MgSO4•7H2O) или хлорид магния (MgCl2), хлорид кальция (CaCl2), сульфат кальция (CaSO4) или карбонат кальция (CaCO3), сульфат марганца (MnSO4•7H2O) или хлорид марганца (MnCl2), хлорид железа (FeCl2), сульфат железа (FeSO4•7H2O) или хлорид железа (FeCl2 •4H2O), гидрокарбонат натрия (NaHCO3) или карбонат натрия (Na2CO3), сульфат цинка (ZnSO4) или хлорид цинка (ZnCl2), молибдат аммония (NH4MoO4) или молибдат натрия (Na2MoO4•2H2O), сульфат меди (CuSO4) или хлорид меди (CuCl2•2H2O), хлорид кобальта (CoCl2•6H2O) или сульфат кобальта (CoSO4), хлорид алюминия (AlCl3•6H2O), хлорид лития (LiCl), борная кислота (H3BO3), хлорид никеля (NiCl2•6H2O) или сульфат никеля (NiSO4), хлорид олова (SnCl2•H2O), хлорид бария (BaCl2•2H2O), селенат меди (CuSeO4·5H2O), селенат натрия (Na2SeO4) или селенит натрия (Na2SeO3), метаванадат натрия (NaVO3), соли хрома).In one embodiment of the invention, the added minerals include one or more of the following minerals: ammonia, ammonium (eg, ammonium chloride (NH 4 Cl), ammonium sulfate ((NH 4 ) 2 SO 4 )), nitrate (eg, potassium nitrate (KNO 3 )), urea or organic nitrogen source; phosphate (e.g. disodium phosphate (Na 2 HPO 4 ), potassium phosphate (KH 2 PO 4 ), phosphoric acid (H 3 PO 4 ), potassium dithiophosphate (K 3 PS 2 O 2 ), potassium orthophosphate (K 3 PO 4 ), disodium phosphate (Na 2 HPO 4 •2H 2 O), dipotassium phosphate (K 2 HPO 4 ) or monopotassium phosphate (KH 2 PO 4 ); sulfate; yeast extract; chelated iron (for example, with EDTA or citric acid); potassium (for example, phosphate potassium (KH 2 PO 4 ), potassium nitrate (KNO 3 ), potassium iodide (KI), potassium bromide (KBr)); and other inorganic salts, minerals and trace elements (for example, sodium chloride (NaCl), magnesium sulfate (MgSO 4 •7H 2 O) or magnesium chloride (MgCl 2 ), calcium chloride (CaCl 2 ), calcium sulfate (CaSO 4 ) or calcium carbonate (CaCO 3 ), manganese sulfate (MnSO 4 •7H 2 O) or manganese chloride (MnCl 2 ), ferrous chloride (FeCl 2 ), ferrous sulfate (FeSO 4 •7H 2 O) or ferric chloride (FeCl 2 •4H 2 O), sodium bicarbonate (NaHCO 3 ) or sodium carbonate (Na 2 CO 3 ), zinc sulfate ( ZnSO 4 ) or zinc chloride (ZnCl 2 ), ammonium molybdate (NH 4 MoO 4 ) or sodium molybdate (Na 2 MoO 4 •2H 2 O), copper sulfate (CuSO 4 ) or copper chloride (CuCl 2 •2H 2 O) , cobalt chloride (CoCl 2 •6H 2 O) or cobalt sulfate (CoSO 4 ), aluminum chloride (AlCl 3 •6H 2 O), lithium chloride (LiCl), boric acid (H 3 BO 3 ), nickel chloride (NiCl 2 •6H 2 O) or nickel sulfate (NiSO 4 ), tin chloride (SnCl 2 •H 2 O), barium chloride (BaCl 2 •2H 2 O), copper selenate (CuSeO 4 5H 2 O), sodium selenate (Na 2 SeO 4 ) or sodium selenite (Na 2 SeO 3 ), sodium metavanadate (NaVO 3 ), chromium salts).
В предпочтительном варианте осуществления изобретения способ настоящего изобретения включает в себя добавление одного, нескольких или всех из: NH4OH, KH2PO4, Na2HPO4•2H2O, NaVO3•H2O, FeSO4×7H2O, MgSO4•7H2O, CaSO4, Na2MoO4•2H2O, MnSO4•7H2O, ZnSO4. •7H2O, H3BO3, CoSO4, CuSO4, NiSO4.In a preferred embodiment, the method of the present invention includes adding one, more or all of: NH 4 OH, KH 2 PO 4 , Na 2 HPO 4 •2H 2 O, NaVO 3 •H 2 O, FeSO 4 × 7H 2 O , MgSO 4 •7H 2 O, CaSO 4 , Na 2 MoO 4 •2H 2 O, MnSO 4 •7H 2 O, ZnSO 4 . •7H 2 O, H 3 BO 3 , CoSO 4 , CuSO 4 , NiSO 4 .
В одном варианте осуществления изобретения среда, подаваемая клеткам, содержит менее 1 г/л хлоридных солей, например менее 0,25 г/л хлоридных солей, например менее 0,1 г/л хлоридных солей, например менее 0,025 г/л хлоридных солей, например менее 0,01 г/л хлорида. В одном варианте осуществления изобретения в культуру не добавляют хлоридные соли.In one embodiment of the invention, the medium supplied to the cells contains less than 1 g/L chloride salts, such as less than 0.25 g/L chloride salts, such as less than 0.1 g/L chloride salts, such as less than 0.025 g/L chloride salts, for example less than 0.01 g/l chloride. In one embodiment of the invention, no chloride salts are added to the culture.
В другом варианте осуществления изобретения во время осуществления способа не добавляют витамины, т.е. среда, подаваемая в культуру, не содержит витаминов.In another embodiment of the invention, no vitamins are added during the method, i.e. the medium supplied to the culture does not contain vitamins.
В другом варианте осуществления изобретения во время осуществления способа не добавляют аминокислоты, т.е. среда, подаваемая в культуру, не содержит аминокислот.In another embodiment of the invention, no amino acids are added during the method, i.e. the medium supplied to the culture does not contain amino acids.
В другом варианте осуществления изобретения во время осуществления способа не добавляют никаких органических соединений, т.е. среда, подаваемая в культуру, не содержит никаких органических соединений.In another embodiment of the invention, no organic compounds are added during the process, i.e. the medium supplied to the culture does not contain any organic compounds.
В некоторых вариантах осуществления изобретения значение pH бактериальной культуры регулируется на определенном уровне. В некоторых вариантах осуществления изобретения значение рН регулируют в пределах оптимального диапазона для поддержания и/или роста бактерий, и/или образования органических соединений. В одном варианте осуществления изобретения значение pH в культуре поддерживается в диапазоне от 5,5 до 8,0, например, от 6,5 до 7,0, например, 6,8.In some embodiments of the invention, the pH of the bacterial culture is adjusted to a certain level. In some embodiments of the invention, the pH value is adjusted within the optimal range for the maintenance and/or growth of bacteria and/or the formation of organic compounds. In one embodiment of the invention, the pH of the culture is maintained in the range of 5.5 to 8.0, for example 6.5 to 7.0, for example 6.8.
В некоторых вариантах осуществления изобретения контролируют температуру бактериальной культуры. В некоторых вариантах осуществления изобретения температуру регулируют в пределах оптимального диапазона для поддержания и/или роста бактерий, и/или образования органических соединений. В одном варианте осуществления изобретения культуру выращивают при температуре от 25°С до 40°С, например, от 28°С до 32°С, например, при 30°С.In some embodiments, the temperature of the bacterial culture is controlled. In some embodiments of the invention, the temperature is controlled within an optimal range for the maintenance and/or growth of bacteria and/or the formation of organic compounds. In one embodiment of the invention, the crop is grown at a temperature of from 25°C to 40°C, for example from 28°C to 32°C, for example at 30°C.
Как правило, способ изобретения осуществляют в биореакторе. Биореактор используется для культивирования клеток, которые могут поддерживаться на определенных фазах кривой их роста. Использование биореакторов выгодно во многих отношениях для культивирования в условиях хемоавтотрофного роста. Как правило, в биореакторе облегчается контроль условий роста, включая контроль растворенного диоксида углерода, кислорода и других газов, таких как водород, а также других растворенных питательных веществ, микроэлементов, температуры и рН. Питательные среды, а также газы можно добавлять в биореактор или в виде разового добавления, или периодически, или в ответ на обнаруженное истощение или в заданный запрограммированный момент времени, либо непрерывно в течение периода выращивания и/или поддержания культуры. В непрерывном процессе культивирования в биореактор непрерывно добавляют питательные среды, а также газы. Кроме того, из биореактора постоянно удаляют среду, содержащую бактерии.Typically, the method of the invention is carried out in a bioreactor. A bioreactor is used to culture cells that can be maintained at specific phases of their growth curve. The use of bioreactors is beneficial in many respects for cultivation under chemoautotrophic growth conditions. Typically, a bioreactor facilitates control of growth conditions, including control of dissolved carbon dioxide, oxygen and other gases such as hydrogen, as well as other dissolved nutrients, trace elements, temperature and pH. Nutrient media, as well as gases, can be added to the bioreactor either as a one-time addition, or periodically, or in response to detected depletion, or at a given programmed point in time, or continuously during the growth and/or maintenance period of the culture. In a continuous cultivation process, nutrient media as well as gases are continuously added to the bioreactor. In addition, media containing bacteria are continuously removed from the bioreactor.
В предпочтительном варианте осуществления изобретения объём бактериальной культуры составляет 100 мл или более, например 1 л или более, например 10 л или более, например 100 л или более, например 1000 л или более, например 10000 л или более, например 50000 л или более, например, 100000 л или более, например 200000 л или более.In a preferred embodiment of the invention, the volume of the bacterial culture is 100 ml or more, for example 1 L or more, for example 10 L or more, for example 100 L or more, for example 1000 L or more, for example 10000 L or more, for example 50000 L or more, for example, 100,000 l or more, for example, 200,000 l or more.
В одном варианте осуществления изобретения продуктивность культуры составляет более 0,1 г сухой массы клеток на литр в час, например более 0,2, например более 0,3, например более 0,4, например более 0,5, например более 0,6, например более 0,7, например более 0,8, например более 0,9, например более 1 г на литр в час.In one embodiment of the invention, the culture productivity is more than 0.1 g dry cell mass per liter per hour, for example more than 0.2, for example more than 0.3, for example more than 0.4, for example more than 0.5, for example more than 0.6 , for example more than 0.7, for example more than 0.8, for example more than 0.9, for example more than 1 g per liter per hour.
Бактерии можно инокулировать непосредственно из банка клеток или с помощью посевной культуры в меньшем масштабе. Желательно, чтобы подача свежей среды к культуре и удаление израсходованной среды с бактериями происходили с одинаковой скоростью, чтобы объём в биореакторе оставался неизменным.Bacteria can be inoculated directly from a cell bank or by seed culture on a smaller scale. It is desirable that the supply of fresh medium to the culture and the removal of spent medium with bacteria occur at the same rate, so that the volume in the bioreactor remains unchanged.
В одном варианте осуществления изобретения после начальной фазы достижения подходящей плотности клеток бактерии растут в стационарном или псевдостационарном состоянии, постоянно оставаясь в своей логарифмической фазе при ОП600 выше 5, например выше 10, например выше 20, например от 50 до 200, например от 50 до 100.In one embodiment of the invention, after an initial phase of reaching a suitable cell density, the bacteria grow in a stationary or pseudo-stationary state, constantly remaining in their logarithmic phase at an OD600 above 5, for example above 10, for example above 20, for example from 50 to 200, for example from 50 to 100 .
В одном варианте осуществления способа настоящего изобретения бактериальный штамм имеет скорость роста 0,04-0,12 ч-1.In one embodiment of the method of the present invention, the bacterial strain has a growth rate of 0.04-0.12 h -1 .
В другом варианте осуществления способа настоящего изобретения скорость подачи жидкости в непрерывной фазе составляет 50-80% от скорости роста.In another embodiment of the method of the present invention, the liquid feed rate in the continuous phase is 50-80% of the growth rate.
Xanthobacter представляет собой род грамотрицательных бактерий из семейства Xanthobacteraceae.В одном варианте осуществления изобретения штамм Xanthobacter, используемый в способе по изобретению, представляет собой штамм, который использует путь Кальвина-Бенсона-Бэсшема для превращения диоксида углерода в органические соединения, например, глюкозу, необходимые для живых организмов. Xanthobacter is a genus of Gram-negative bacteria in the family Xanthobacteraceae . In one embodiment, the Xanthobacter strain used in the method of the invention is a strain that uses the Calvin-Benson-Bassham pathway to convert carbon dioxide into organic compounds, such as glucose, needed for living organisms.
В одном варианте осуществления изобретения штамм Xanthobacter, используемый в способе настоящего изобретения, представляет собой штамм, который использует гидрогеназы NiFeSe для превращения водорода (H2) в эквиваленты клеточной энергии.In one embodiment, the Xanthobacter strain used in the method of the present invention is a strain that uses NiFeSe hydrogenases to convert hydrogen (H 2 ) into cellular energy equivalents.
В одном варианте осуществления изобретения штамм Xanthobacter, используемый в способе настоящего изобретения, представляет собой штамм, который использует NAD+-восстанавливающие гидрогеназы для превращения водорода (H2) в эквиваленты клеточной энергии.In one embodiment, the Xanthobacter strain used in the method of the present invention is a strain that uses NAD + -reducing hydrogenases to convert hydrogen (H 2 ) into cellular energy equivalents.
В одном варианте осуществления изобретения штамм Xanthobacter, используемый в способе настоящего изобретения, способен к фиксации азота.In one embodiment of the invention, the Xanthobacter strain used in the method of the present invention is capable of nitrogen fixation.
В одном варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, выбран из группы, состоящей из: X.agilis, X.aminoxidans, X.autotrophicus, X.flavus, X.tagetidis, X.viscosus, Xanthobacter sp. 126, Xanthobacter sp. 91 и штамма VTT-E-193585.In one embodiment, the bacterial strain used in the method of the present invention is selected from the group consisting of: X. agilis , X. aminooxidans , X. autotrophicus , X. flavus , X. tagetidis , X. viscosus , Xanthobacter sp. 126, Xanthobacter sp. 91 and strain VTT-E-193585.
В предпочтительном варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, представляет собой VTT-E-193585 или X. tagetidis. Наиболее предпочтительно штамм, используемый в способе настоящего изобретения, представляет собой штамм VTT-E-193585.In a preferred embodiment, the bacterial strain used in the method of the present invention is VTT-E-193585 or X. tagetidis . Most preferably, the strain used in the method of the present invention is strain VTT-E-193585.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит 16S рибосомальную РНК, указанную в SEQ ID NO:1, или 16S рибосомальную РНК, имеющую до 20 нуклеотидных различий с SEQ ID NO:1, например от 1 до 10, например от 1 до 5, например одно, два или три нуклеотидных отличия с SEQ ID NO:1.In another embodiment, the bacterial strain used in the method of the present invention contains 16S ribosomal RNA specified in SEQ ID NO:1, or 16S ribosomal RNA having up to 20 nucleotide differences from SEQ ID NO:1, for example from 1 to 10, for example from 1 to 5, for example one, two or three nucleotide differences from SEQ ID NO:1.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий большую цепь рибулозо-1,5-бисфосфаткарбоксилазы/оксигеназы (rubisco), имеющую последовательность, указанную в SEQ ID NO:3, или последовательность, имеющую более 93% идентичности, например более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98% идентичности, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:3.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding a large chain ribulose-1,5-bisphosphate carboxylase/oxygenase (rubisco) having the sequence specified in SEQ ID NO:3, or a sequence having more than 93 % identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98% identity, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:3.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий малую цепь рибулозо-1,5-бисфосфаткарбоксилазы/оксигеназы (rubisco), имеющую последовательность, указанную в SEQ ID NO:5, или последовательность, имеющую более 83% идентичность по последовательности, например более 86% идентичности, например более 90% идентичности, например более 95% идентичности, например, более 96% идентичности, например более 97% идентичности, например, более 98% идентичности, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:5.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding a small chain ribulose-1,5-bisphosphate carboxylase/oxygenase (rubisco) having the sequence specified in SEQ ID NO:5, or a sequence having more than 83 % sequence identity, for example more than 86% identity, for example more than 90% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98% identity, for example more than 99% sequence identity with the sequence shown in SEQ ID NO:5.
SEQ ID NO:2:SEQ ID NO:2:
Нуклеотидная последовательность большой цепи рибулозобисфосфаткарбоксилазы:Nucleotide sequence of large chain ribulose bisphosphate carboxylase:
ATGGGTGCCGAAGCAACCGTCGGGCAGATCACGGACGCCAAGAAGAGATACGCCGCCGGCGTGCTGAAGTACGCCCAGATGGGCTACTGGAACGGCGACTACGTTCCCAAGGACACCGACCTCCTGGCGGTGTTCCGCATCACCCCCCAGGCGGGCGTGGACCCGGTGGAAGCCGCCGCGGCGGTCGCCGGCGAAAGCTCCACCGCTACCTGGACCGTGGTGTGGACCGACCGGCTCACCGCCGCCGACGTCTACCGCGCCAAGGCCTACAAGGTGGAGCCGGTGCCGGGCCAGGAAGGCCAGTATTTCTGCTACATCGCCTATGATCTCGATTTGTTCGAGGAAGGCTCCATCGCCAACCTCACGGCGTCGATCATCGGCAACGTCTTCTCCTTCAAGCCGCTGAAGGCGGCGCGGCTGGAGGACATGCGGCTTCCCGTCGCCTATGTGAAGACCTTCCGCGGCCCGCCCACCGGCATCGTGGTCGAGCGCGAGCGCCTGGACAAGTTCGGCCGCCCCCTTCTGGGCGCCACCACCAAGCCGAAGCTTGGCCTCTCGGGCAAGAATTACGGCCGCGTGGTCTATGAGGCCCTCAAGGGCGGCCTCGACTTCGTGAAGGACGACGAGAACATCAACTCGCAGCCCTTCATGCACTGGCGCGATCGCTTCCTCTATTGCATGGAGGCCGTCAACAAGGCCCAGGCCGAGACCGGCGAGGTGAAGGGGCACTATCTCAACATCACCGCCGGGACCATGGAGGAGATGTACCGCCGCGCCGAGTTCGCCAAGGAACTGGGCTCCGTGGTGGTGATGGTGGATCTCATCATCGGCTGGACCGCCATCCAGTCCATGTCCAACTGGTGCCGCGAGAACGACATGATCCTGCACATGCACCGTGCGGGCCATGGCACCTACACGCGCCAGAAGAGCCACGGCGTCTCCTTCCGCGTCATCGCCAAGTGGCTGCGGCTCGCCGGCGTCGACCACCTGCACACCGGCACCGCCGTGGGCAAGCTGGAAGGCGACCCCATGACCGTGCAGGGCTTCTACAATGTCTGCCGCGAGACGACGACGCAGCAGGACCTCACCCGCGGCCTGTTCTTCGAGCAGGACTGGGGCGGCATCCGCAAGGTGATGCCGGTGGCCTCCGGCGGCATCCATGCGGGCCAGATGCACCAGCTCATCGACCTGTTCGGCGAGGACGTGGTGCTCCAGTTCGGCGGCGGCACCATCGGCCACCCGGACGGCATCCAGGCCGGCGCCACCGCCAACCGCGTGGCGCTGGAAACCATGATCCTCGCCCGCAACGAGGGCCGCGACATCAGGAACGAGGGCCCGGAAATCCTGGTGGAAGCCGCCAAATGGTGCCGTCCGCTGCGCGCGGCGCTCGATACCTGGGGCGAGGTGACCTTCAACTACGCCTCCACCGACACGTCCGATTACGTGCCCACCGCGTCCGTCGCCTGAATGGGTGCCGAAGCAACCGTCGGGCAGATCACGGACGCCAAGAAGAGATACGCCGCCGGCGTGCTGAAGTACGCCCAGATGGGCTACTGGAACGGCGACTACGTTCCCAAGGACACCGACCTCCTGGCGGTGTTCCGCATCACCCCCCAGGCGGGCGTGGACCCGGTGGAAGCCGCCGCGGCGGTCGCCGGCGAAAGCTCCACCGCTACCTGGACCGTGGTGTGGACCGACCGGCTCA CCGCCGCCGACGTCTACCGCGCCAAGGCCTACAAGGTGGAGCCGGTGCCGGGCCAGGAAGGCCAGTATTTCTGCTACATCGCCTATGATCTCGATTTGTTCGAGGAAGGCTCCATCGCCAACCTCACGGCGTCGATCATCGGCAACGTCTTCTCCTTCAAGCCGCTGAAGGCGGCGCGGCTGGAGGACATGCGGCTTCCCGTCGCCTATGTGAAGACCTTCCGCGGCCCGCCCACCGGCATCGTG GTCGAGCGCGAGCGCCTGGACAAGTTCGGCCGCCCCCTTCTGGGCGCCACCACCAAGCCGAAGCTTGGCCTCTCGGGCAAGAATTACGGCCGCGTGGTCTATGAGGCCCTCAAGGGCGGCCTCGACTTCGTGAAGGACGACGAGAACATCAACTCGCAGCCCTTCATGCACTGGCGCGATCGCTTCCTCTATTGCATGGAGGCCGTCAACAAGGCCCAGGCCGAGACCGGCGAGGTGAAGGGGCACT ATCTCAACATCACCGCCGGGACCATGGAGGAGATGTACCGCCGCGCCGAGTTCGCCAAGGAACTGGGCTCCGTGGTGGTGATGGTGGATCTCATCATCGGCTGGACCGCCATCCAGTCCATGTCCAACTGGTGCCGCGAGAACGACATGATCCTGCACATGCACCGTGCGGGCCATGGCACCTACACGCGCCAGAAGAGCCACGGCGTCTCCTTCCGCGTCATCGCCAAGTGGCTGCGGCTCGCC GGCGTCGACCACCTGCACACCGGCACCGCCGTGGGCAAGCTGGAAGGCGACCCCATGACCGTGCAGGGCTTCTACAATGTCTGCCGCGAGACGACGACGCAGCAGGACCTCACCCGCGGCCTGTTCTTCGAGCAGGACTGGGGGCGGCATCCGCAAGGTGATGCCGGTGGCCTCCGGCGGCATCCATGCGGGCCAGATGCACCAGCTCATCGACCTGTTCGGCGAGGACGTGGTGCTCCAGTT CGGCGGCGGCACCATCGGCCACCCGGACGGCATCCAGGCCGGCGCCACCGCCAACCGCGTGGCGCTGGAAACCATGATCCTCGCCCGCAACGAGGGCCGCGACATCAGGAACGAGGGCCCGGAAATCCTGGTGGAAGCCGCCAAATGGTGCCGTCCGCTGCGCGCGGCGCTCGATACCTGGGGCGAGGTGACCTTCAACTACGCCTCCACCGACACGTCCGATTACGTGCCCACCGCGTCCGTC GCCTGA
SEQ ID NO: 3:SEQ ID NO: 3:
Аминокислотная последовательность большой цепи рибулозобисфосфаткарбоксилазыAmino acid sequence of large chain ribulose bisphosphate carboxylase
MGAEATVGQITDAKKRYAAGVLKYAQMGYWNGDYVPKDTDLLAVFRITPQAGVDPVEAAAAVAGESSTATWTVVWTDRLTAADVYRAKAYKVEPVPGQEGQYFCYIAYDLDLFEEGSIANLTASIIGNVFSFKPLKAARLEDMRLPVAYVKTFRGPPTGIVVERERLDKFGRPLLGATTKPKLGLSGKNYGRVVYEALKGGLDFVKDDENINSQPFMHWRDRFLYCMEAVNKAQAETGEVKGHYLNITAGTMEEMYRRAEFAKELGSVVVMVDLIIGWTAIQSMSNWCRENDMILHMHRAGHGTYTRQKSHGVSFRVIAKWLRLAGVDHLHTGTAVGKLEGDPMTVQGFYNVCRETTTQQDLTRGLFFEQDWGGIRKVMPVASGGIHAGQMHQLIDLFGEDVVLQFGGGTIGHPDGIQAGATANRVALETMILARNEGRDIRNEGPEILVEAAKWCRPLRAALDTWGEVTFNYASTDTSDYVPTASVAMGAEATVGQITDAKKRYAAGVLKYAQMGYWNGDYVPKDTDLLAVFRITPQAGVDPVEAAAAVAGESSTATWTVVWTDRLTAADVYRAKAYKVEPVPGQEGQYFCYIAYDLDLFEEGSIANLTASIIGNVFSFKPLKAARLEDMRLPVAYVKTFRGPPTGIVVERERLDKFGRPLLGATTKPKLGLSGKNYGRVVYEALK GGLDFVKDDENINSQPFMHWRDRFLYCMEAVNKAQAETGEVKGHYLNITAGTMEEMYRRAEFAKELGSVVVMVDLIIGWTAIQSMSNWCRENDMILHMHRAGHGTYTRQKSHGVSFRVIAKWLRLAGVDHLHTGTAVGKLEGDPMTVQGFYNVCRETTTQQDLTRGLFFEQDWGGIRKVMPVASGGIHAGQMH QLIDLFGEDVVLQFGGGTIGHPDGIQAGATANRVALETMILARNEGRDIRNEGPEILVEAAKWCRPLRAALDTWGEVTFNYASTDTSDYVPTASVA
SEQ ID NO: 4:SEQ ID NO: 4:
Нуклеотидная последовательность малой цепи рибулозобисфосфаткарбоксилазы:Nucleotide sequence of small chain ribulose bisphosphate carboxylase:
ATGCGCATCACCCAAGGCTCCTTCTCCTTCCTGCCGGACCTCACCGACACGCAGATCAAGGCCCAGGTGCAATATTGCCTGGACCAGGGCTGGGCGGTCTCGGTGGAGCACACCGACGATCCCCACCCGCGCAACACCTATTGGGAGATGTGGGGCCCGCCCATGTTCGATCTGCGCGACGCGGCCGGCGTCTTCGGCGAGATCGAAGCCTGCCGGGCCGCCAATCCCGAGCATTATGTGCGGGTGAACGCCTTCGATTCCAGCCGCGGATGGGAGACGATCCGCCTGTCCTTCATCGTTCAGCGGCCCACCGTGGAAGAGGGCTTCCGCCTCGACCGCACCGAAGGCAAGGGCCGCAACCAGAGCTACGCCATGCGCTACCGGGCGCAGTTCGCGCCGCGCTGAATGCGCATCACCCAAGGCTCCTTCTCCTTCCTGCCGGACCTCACCGACACGCAGATCAAGGCCCAGGTGCAATATTGCCTGGACCAGGGCTGGGCGGTCTCGGTGGAGCACACCGACGATCCCCACCCGCGCAACACCTATTGGGAGATGTGGGGCCCGCCCATGTTCGATCTGCGCGACGCGGCCCGGCGTCTTCGGCGAGATCGAAGCCTGCCGGGCCGCCAATCCCGAGCATTATGTG CGGGTGAACGCCTTCGATTCCAGCCGCGGATGGGAGACGATCCGCCTGTCCTTCATCGTTCAGCGGCCCACCGTGGAAGAGGGCTTCCGCCTCGACCGCACCGAAGGCAAGGGCCGCAACCAGAGCTACGCCATGCGCTACCGGGCGCAGTTCGCGCCGCGCTGA
SEQ ID NO: 5:SEQ ID NO: 5:
Аминокислотная последовательность малой цепи рибулозобисфосфаткарбоксилазы:Amino acid sequence of small chain ribulose bisphosphate carboxylase:
MRITQGSFSFLPDLTDTQIKAQVQYCLDQGWAVSVEHTDDPHPRNTYWEMWGPPMFDLRDAAGVFGEIEACRAANPEHYVRVNAFDSSRGWETIRLSFIVQRPTVEEGFRLDRTEGKGRNQSYAMRYRAQFAPRMRITQGSFSFLPDLTDTQIKAQVQYCLDQGWAVSVEHTDDPHPRNTYWEMWGPPMFDLRDAAGVFGEIEACRAANPEHYVRVNAFDSSRGWETIRLSFIVQRPTVEEGFRLDRTEGKGRNQSYAMRYRAQFAPR
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий альфа-субъединицу NAD+-восстанавливающей гидрогеназы HoxS, имеющую последовательность, указанную в SEQ ID NO:7, или последовательность, имеющую более 70% идентичности по последовательности, например более 80% идентичности, например более 90% идентичности, например более 95% идентичности, например более 96% идентичности, например, более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:7.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the alpha subunit of NAD + -reducing hydrogenase HoxS having the sequence specified in SEQ ID NO: 7, or a sequence having more than 70% sequence identity, for example more than 80% identity, for example more than 90% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:7.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий бета-субъединицу NAD+-восстанавливающей гидрогеназы HoxS, имеющую последовательность, указанную в SEQ ID NO:9, или последовательность, имеющую более 77% идентичности по последовательности, например более 80% идентичности, например более 90% идентичности, например, более 95% идентичности, например более 96% идентичности, например, более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:9.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the beta subunit of NAD + -reducing hydrogenase HoxS having the sequence specified in SEQ ID NO:9, or a sequence having more than 77% sequence identity, for example, more than 80% identity, for example more than 90% identity, for example, more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:9.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий гамма-субъединицу NAD+-восстанавливающей гидрогеназы HoxS, имеющую последовательность, указанную в SEQ ID NO:11, или последовательность, имеющую более 70% идентичности по последовательности, например более 80% идентичности, например более 90% идентичности, например более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:11.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the gamma subunit of NAD + -reducing hydrogenase HoxS having the sequence specified in SEQ ID NO:11, or a sequence having more than 70% sequence identity, for example, more than 80% identity, for example more than 90% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO :eleven.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий дельта-субъединицу NAD+-восстанавливающей гидрогеназы HoxS, имеющую последовательность, указанную в SEQ ID NO:13, или последовательность, имеющую более 79% идентичности по последовательности, например более 80% идентичности, например более 90% идентичности, например, более 95% идентичности, например более 96% идентичности, например, более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:13.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the delta subunit of the NAD+ reducing hydrogenase HoxS having the sequence set forth in SEQ ID NO:13, or a sequence having greater than 79% sequence identity, for example more than 80% identity, for example more than 90% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:13.
SEQ ID NO:6:SEQ ID NO:6:
Нуклеотидная последовательность альфа-субъединицы NAD+-восстанавливающей гидрогеназы HoxS:Nucleotide sequence of the alpha subunit of NAD+ - reducing hydrogenase HoxS:
ATGATGCCATCTGAGCCGCACGGCGCGGGCATGCCGCCCCCACGGGAAGCGGCCGCGGTTCCCACCCCCCAGGAGGTGAGCGCGGTGGTGGCCGAGGTGGTCGCGGATGCCGTGGCATCGGTGGGCGGCGCACGCACCCGGCTCATGGACATCGTCCAGCTGGCCCAGCAGCGTCTCGGCCATCTCTCCGAAGAGACCATGGCGGCCATTGCCGCGCGGCTCGCCATTCCGCCGGTGGAAGTGGCGGACATGGTGTCCTTCTACGCCTTCCTGAACCGCGCGCCCAAGGGCCGCTACCACATCCGCCTGTCGCGCAGCCCCATCTCGCTGATGAAGGGCGCCGAGGCGGTGGCTGCCGCCTTCTGCCAGATCCTCGGCATCGCCATGGGCGAGACCTCGCAGGATGGCGACTTCACCCTGGAATGGACCAACGACATCGGCATGGCCGACCAGGAGCCGGCCGCCCTCGTCAACGGCACGGTGATGACGCAGCTCGCGCCCGGCGATGCGGCCATCATCGTCGGCCGGCTGCGGGCCCATCACGCGCCCAATGCCCTGCCGCTGTTCCCTGGAGCCGGCGTGGCCGGCTCCGGCCTGCCCCATGCCCGGATCCGCCCCAGCCTGGTGATGCCGGGACAGCTTCTGTTCCGCGAGGACCACACGACGCCGGGCGCCGGCATCAAGGCGGCACTCGCCCTCACCCCGGACGAAGTGGTGCAGAAGGTCTCCGCCGCGCGCCTGCGCGGGCGGGGTGGCGCCGGCTTTCCCACCGGTCTCAAATGGAAGCTCTGCCGCCAGTCGCCCGCCACCACCCGCCATGTGATCTGCAATGCGGACGAGGGCGAGCCCGGCACCTTCAAGGATCGCGTGCTGCTCACGCAGGCGCCGCACCTCATGTTCGACGGCATGACCATCGCCGGCTACGCCTTGGGGGCGCGGGAGGGCGTGGTCTATCTGCGCGGCGAGTACGCCTATCTGTGGGAGCCTCTGCATGCGGTCCTGCGCGAGCGCTATGGGCTCGGGCTCGCCGGCGCGAACATCCTGGGACACGCGGGCTTCGACTTCGACATCCGCATCCAGCTGGGCGCCGGCGCCTATATCTGCGGCGAGGAATCCGCGCTGGTGGAATCGCTGGAAGGCAAGCGCGGCTCGCCCCGCGACCGCCCCCCCTTCCCCACCGTGCGCGGCCATCTCCAGCAGCCCACCGCCGTGGACAATGTGGAGACCTTCGCCTGCGCCGCCCGCATCCTGGAGGATGGCGTGGAGGCGTTCGCGGGCATCGGCACGCCCGAATCCGCCGGCACGAAGCTCCTCTCGGTGTCGGGCGATTGCCCGCGCCCCGGCGTGTATGAGGTGCCCTTCGGCCTCACGGTGAACGCGCTGCTCGACCTTGTCGGCGCGCCGGACGCCGCCTTCGTGCAGATGGGTGGGCCGTCCGGCCAATGCGTGGCGCCGAAGGATTACGGCCGCCGCATCGCCTTCGAGGACCTGCCCACCGGCGGCTCGGTGATGGTGTTCGGCCCGGGGCGCGACGTGCTCGCCATGGTGCGCGAGTTCGCGGATTTCTTCGCCGGCGAATCCTGCGGCTGGTGCACGCCCTGCCGGGTGGGCACCACCTTGCTCAAGGAAGAGCTGGACAAGCTCCTCGCCAACCGCGCCACCCTCGCCGACATCCGCGCGCTGGAGACCCTGGCCACGACCGTCTCCCGCACCAGCCGCTGCGGCCTCGGCCAGACGGCGCCCAACCCCATCCTTTCCACCATGCGCAACCTGCCGGAAGCCTATGAGGCGAGGCTGAGGCCCGAAGACTTCCTGCCCTGGGCCTCGCTCGACGAGGCGCTGAAGCCCGCCATCGTCATCCAGGGCCGCGCGCCCGTGCCGGAGGAAGAGGCATGAATGATGCCATCTGAGCCGCACGGCGCGGGCATGCCGCCCCCACGGGAAGCGGCCGCGGTTCCCACCCCCCAGGAGGTGAGCGCGGTGGTGGCCGAGGTGGTCGCGGATGCCGTGGCATCGGTGGGCGGCGCACGCACCCGGCTCATGGACATCGTCCAGCTGGCCCAGCAGCGTCTCGGCCATCTCTCCGAAGACCATGGCGGCCATTGCCGCGCGGCTCGCCATTCCGCCGGTGGAAG TGGCGGACATGGTGTCCTTCTACGCCTTCCTGAACCGCGCGCCCAAGGGCCGCTACCACATCCGCCTGTCGCGCAGCCCCATCTCGCTGATGAAGGGCGCCGAGGCGGTGGCTGCCGCCTTCTGCCAGATCCTCGGCATCGCCATGGGCGAGACCTCGCAGGATGGCGACTTCACCCTGGAATGGACCAACGACATCGGCATGGCCGACCAGGAGCCGGCCGCCCTCGTCAACGGCACGGTGATGACG CAGCTCGCGCCCGGCGATGCGGCCATCATCGTCGGCCGGCTGCGGGCCCATCACGCGCCCAATGCCCTGCCGCTGTTCCCTGGAGCCGGCGTGGCCGGCTCCGGCCTGCCCCATGCCCGGATCCGCCCCAGCCTGGTGATGCCGGGACAGCTTCTGTTCCGCGAGGACCACACGACGCCGGGCGCCGGCATCAAGGCGGCACTCGCCCTCACCCCGGACGAAGTGGTGCAGAAGGTCTCCGCCGCGC GCCTGCGCGGGCGGGGTGGCGCCGGCTTTCCCACCGGTCTCAAATGGAAGCTCTGCCGCCAGTCGCCCGCCACCACCCGCCATGTGATCTGCAATGCGGACGAGGGCGAGCCCGGCACCTTCAAGGATCGCGTGCTGCTCACGCAGGCGCCGCACCTCATGTTCGACGGCATGACCATCGCCGGCTACGCCTTGGGGCCGGGAGGGCGTGGTCTATCTGCGCGGCGAGTACGCCTATCTGTG GGAGCCTCTGCATGCGGTCCTGCGCGAGCGCTATGGGCTCGGGCTCGCCGGCGCGAACATCCTGGGACACGCGGGCTTCGACTTCGACATCCGCATCCAGCTGGGCGCCGGCGCCTATATCTGCGGCGAGGAATCCGCGCTGGTGGAATCGCTGGAAGGCAAGCGCGGCTCGCCCCGCGACCGCCCCCCCTTCCCCACCGTGCGCGGCCATCTCCAGCAGCCCACCGCCGTGGACAATGTGGAGACCTT CGCCTGCGCCGCCCGCATCCTGGAGGATGGCGTGGAGGCGTTCGCGGGCATCGGCACGCCCGAATCCGCCGGCACGAAGCTCCTCTCGGTGTCGGGCGATTGCCCGCGCCCCGGCGTGTATGAGGTGCCCTTCGGCCTCACGGTGAACGCGCTGCTCGACCTTGTCGGCGCGCCGGACGCCGCCTTCGTGCAGATGGGTGGGCCGTCCGGCCAATGCGTGGCGCCGAAGGATTACGGCCGC CGCATCGCCTTCGAGGACCTGCCCACCGGCGGCTCGGTGATGGTGTTCGGCCCGGGGGCGCGACGTGCTCGCCATGGTGCGCGAGTTCGCGGATTTCTTCGCCGGCGAATCCTGCGGCTGGTGCACGCCCTGCCGGGTGGGCACCACCTTGCTCAAGGAAGAGCTGGACAAGCTCCTCGCCAACCGCGCCACCCTCGCCGACATCCGCGCGCTGGAGACCCTGGCCACGACCGTCTCCCGCACCAGC CGCTGCGGCCTCGGCCAGACGGCGCCCAACCCCATCCTTTCCACCATGCGCAACCTGCCGGAAGCCTATGAGGCGAGGCTGAGGCCCGAAGACTTCCTGCCCTGGGCCTCGCTCGACGAGGCGCTGAAGCCCGCCATCGTCATCCAGGGCCGCGCGCCCGTGCCGGAGGAAGAGGCATGA
SEQ ID NO: 7:SEQ ID NO: 7:
Аминокислотная последовательность альфа-субъединицы NAD+-восстанавливающей гидрогеназы HoxS:Amino acid sequence of the alpha subunit of NAD+ - reducing hydrogenase HoxS:
MMPSEPHGAGMPPPREAAAVPTPQEVSAVVAEVVADAVASVGGARTRLMDIVQLAQQRLGHLSEETMAAIAARLAIPPVEVADMVSFYAFLNRAPKGRYHIRLSRSPISLMKGAEAVAAAFCQILGIAMGETSQDGDFTLEWTNDIGMADQEPAALVNGTVMTQLAPGDAAIIVGRLRAHHAPNALPLFPGAGVAGSGLPHARIRPSLVMPGQLLFREDHTTPGAGIKAALALTPDEVVQKVSAARLRGRGGAGFPTGLKWKLCRQSPATTRHVICNADEGEPGTFKDRVLLTQAPHLMFDGMTIAGYALGAREGVVYLRGEYAYLWEPLHAVLRERYGLGLAGANILGHAGFDFDIRIQLGAGAYICGEESALVESLEGKRGSPRDRPPFPTVRGHLQQPTAVDNVETFACAARILEDGVEAFAGIGTPESAGTKLLSVSGDCPRPGVYEVPFGLTVNALLDLVGAPDAAFVQMGGPSGQCVAPKDYGRRIAFEDLPTGGSVMVFGPGRDVLAMVREFADFFAGESCGWCTPCRVGTTLLKEELDKLLANRATLADIRALETLATTVSRTSRCGLGQTAPNPILSTMRNLPEAYEARLRPEDFLPWASLDEALKPAIVIQGRAPVPEEEAMMPSEPHGAGMPPPREAAAVPTPQEVSAVVAEVVADAVASVGGARTRLMDIVQLAQQRLGHLSEETMAAIAARLAIPPVEVADMVSFYAFLNRAPKGRYHIRLSRSPISLMKGAEAVAAAFCQILGIAMGETSQDGDFTLEWTNDIGMADQEPAALVNGTVMTQLAPGDAAIIVGRLRAHHAPNALPLFPGAGVAGSGLPHARIRPSLVMPGQLLFREDHTTPGA GIKAALALTPDEVVQKVSAARLRGRGGAGFPTGLKWKLCRQSPATTRHVICNADEGEPGTFKDRVLLTQAPHLMFDGMTIAGYALGAREGVVYLRGEYAYLWEPLHAVLRERYGLGLAGANILGHAGFDFDIRIQLGAGAYICGEESALVESLEGKRGSPRDRPPFPTVRGHLQQPTAVDNVETFACAARILEDGVEAFAGIGTPESAG TKLLSVSGDCPRPGVYEVPFGLTVNALLDLVGAPDAAFVQMGGPSGQCVAPKDYGRRIAFEDLPTGGSVMVFGPGRDVLAMVREFADFFAGESCGWCTPCRVGTTLLKEELDKLLANRATLADIRALETLATTVSRTSRCGLGQTAPNPILSTMRNLPEAYEARLRPEDFLPWASLDEALKPAIVIQGRAPVPEEEA
SEQ ID NO: 8:SEQ ID NO:8:
Нуклеотидная последовательность бета-субъединицы NAD+-восстанавливающей гидрогеназы HoxS:Nucleotide sequence of the beta subunit of NAD+ - reducing hydrogenase HoxS:
ATGAGCCGGGGATCCCCCGATGCCGGGAAAGACCGCACCATGAGCGCCACCGACGGCACCACCGCCCCCCGCAAGATCGTCATCGATCCGGTGACCCGCGTGGAGGGCCACGGCAAGGTCACCATCCGCCTGGATGAAGCCGGCGCGGTGGAGGATGCGCGTTTCCACATCGTGGAGTTCCGCGGCTTCGAGCGGTTCATCCAGGGCCGGATGTACTGGGAAGTGCCCCTTATCATCCAGCGGCTGTGCGGCATCTGCCCGGTGAGCCACCATCTGGCGGCGGCGAAAGCCATGGACCAGGTGGCGGGCGTGGACCGCGTACCGCCCACCGCCGAGAAACTGCGCCGGCTGATGCATTATGGGCAGGTGCTGCAATCCAACGCTTTGCACATCTTCCACCTCGCCTCGCCCGACCTCCTGTTCGGCTTCGACGCGCCGGCCGAGCAGCGCAACATCATCGCCGTGCTCCAGCGTTATCCGGAGATCGGCAAATGGGCGATCTTCATCAGGAAGTTCGGCCAGGAGGTCATCAAGGCCACCGGCGGGCGCAAGATCCATCCCACCAGCGCCATTCCCGGCGGGGTCAACCAGAACCTCGCCGTGGAGGACCGCGACGCCCTGCGCGCCAAGGTGGGCGAGATCATCAGCTGGTGCATGGCGGCGCTGGACCATCACAAGGCCTATGTGGCGGAAAACCGGGCGCTGCATGACAGCTTCGCCGCCTTCCCCTCCGCCTTCATGAGCCTCGTGGGGCCGGATGGCGGCATGGACCTTTATGACGGCACCCTGCGGGTGATCGATGCCGAGGGCGCCCCCCTCATCGAAGGCGCGCCGCCCGCCTCCTACCGCGACCACCTCATCGAGGAGGTGCGGCCCTGGAGCTATCTGAAATTCCCCCATCTGCGCGCCTTCGGCCGCGACGATGGCTGGTATCGGGTCGGCCCCCTCGCCCAGGTCAATTGCGCCGCGTCCATCGACACGCCCCGCGCCGAGGCGGCCCGGCGGGACTTCATGGCCGAGGGCGGCGGCAAGCCGGTGCATGCCACCCTCGCTTATCACTGGGCGCGGCTCATCGTGCTGGTCCATTGCGCGGAGAAGATCGAACAGCTGCTGTTCGACGACGACCTGCAAGGCTGCGATCTGCGTGCGGAGGGCACCCGGCGCGGGGAAGGCGTCGCCTGGATCGAGGCGCCGCGCGGCACCCTCATCCACCATTACGAGGTGGACGAGAACGACCAGGTGCGCCGCGCCAACCTCATCGTCTCCACCACCCACAATAACGAGGCCATGAACCGCGCCGTGCGGCAGGTGGCGAAGACGGACCTTTCCGGTCGCGAGATCACCGAAGGGCTGCTGAACCATATCGAGGTGGCCATCCGCGCCTTCGACCCCTGCCTGTCCTGCGCCACCCATGCGCTGGGCCAGATGCCGCTGATCGTGACGCTTGAAGATGCCTCCGGCGCAGAGATCGCCCGCGGAGTGAAGGAATGAATGAGCCGGGGATCCCCCGATGCCGGGAAAGACCGCACCATGAGCGCCACCGACGGCACCACCGCCCCCCGCAAGATCGTCATCGATCCGGTGACCCGCGTGGAGGGCCACGGCAAGGTCACCATCCGCCTGGATGAAGCCGGCGCGGTGGAGGATGCGCGTTTCCACATCGTGGAGTTCCGCGGCTTCGAGCGGTTCATCCAGGGCCGGATGTACTGGGAAGTGCCCCTTATCATCCAGCGGC TGTGCGGCATCTGCCCGGTGAGCCACCATCTGGCGGCGGCGAAAGCCATGGACCAGGTGGCGGGCGTGGACCGCGTACCGCCCACCGCCGAGAAACTGCGCCGGCTGATGCATTATGGGCAGGTGCTGCAATCCAACGCTTTGCACATCTTCCACCTCGCCTCGCCCGACCTCCTGTTCGGCTTCGACGCGCCGGCCGAGCAGCGCAACATCATCGCCGTGCTCCAGCGTTATCCGGAGATCGGCAAATG GGCGATCTTCATCAGGAAGTTCGGCCAGGAGGTCATCAAGGCCACCGGCGGGCGCAAGATCCATCCCACCAGCGCCATTCCCGGCGGGGTCAACCAGAACCTCGCCGTGGAGGACCGCGACGCCCTGCGCGCCAAGGTGGGCGAGATCATCAGCTGGTGCATGGCGGCGCTGGACCATCACAAGGCCTATGTGGCGGAAAACCGGGCGCTGCATGACAGCTTCGCCGCCTTCCCCTCCGCCTT CATGAGCCTCGTGGGGCCGGATGGCGGCATGGACCTTTATGACGGCACCCTGCGGGTGATCGATGCCGAGGGCGCCCCCCTCATCGAAGGCGCGCCGCCCGCCTCCTACCGCGACCACCTCATCGAGGAGGTGCGGCCCTGGAGCTATCTGAAATTCCCCCATCTGCGCGCCTTCGGCCGCGACGATGGCTGGTATCGGGTCGGCCCCCTCGCCCAGGTCAATTGCGCCGCGTCCATCGACACGCCCCGCG CCGAGGCGGCCCGGCGGGACTTCATGGCCGAGGGCGGCGGCAAGCCGGTGCATGCCACCCTCGCTTATCACTGGGCGCGGCTCATCGTGCTGGTCCATTGCGCGGAGAAGATCGAACAGCTGCTGTTCGACGACGACCTGCAAGGCTGCGATCTGCGTGCGGAGGGCACCCGGCGCGGGGAAGGCGTCGCCTGGATCGAGGCGCCGCGCGGCACCCTCATCCACCATTACGAGGTGGACGAGAACGAC CAGGTGCGCCGCGCCAACCTCATCGTCTCCACCACCCACAATAACGAGGCCATGAACCGCGCCGTGCGGCAGGTGGCGAAGACGGACCTTTCCGGTCGCGAGATCACCGAAGGGCTGCTGAACCATATCGAGGTGGCCATCCGCGCCTTCGACCCCTGCCTGTCCTGCGCCACCCATGCGCTGGGCCAGATGCCGCTGATCGTGACGCTTGAAGATGCCTCCGGCGCAGAGATCGCCCGCGGAGT GAAGGAATGA
SEQ ID NO: 9:SEQ ID NO:9:
Аминокислотная последовательность бета-субъединицы NAD+-восстанавливающей гидрогеназы HoxS:Amino acid sequence of the beta subunit of NAD+ - reducing hydrogenase HoxS:
MSRGSPDAGKDRTMSATDGTTAPRKIVIDPVTRVEGHGKVTIRLDEAGAVEDARFHIVEFRGFERFIQGRMYWEVPLIIQRLCGICPVSHHLAAAKAMDQVAGVDRVPPTAEKLRRLMHYGQVLQSNALHIFHLASPDLLFGFDAPAEQRNIIAVLQRYPEIGKWAIFIRKFGQEVIKATGGRKIHPTSAIPGGVNQNLAVEDRDALRAKVGEIISWCMAALDHHKAYVAENRALHDSFAAFPSAFMSLVGPDGGMDLYDGTLRVIDAEGAPLIEGAPPASYRDHLIEEVRPWSYLKFPHLRAFGRDDGWYRVGPLAQVNCAASIDTPRAEAARRDFMAEGGGKPVHATLAYHWARLIVLVHCAEKIEQLLFDDDLQGCDLRAEGTRRGEGVAWIEAPRGTLIHHYEVDENDQVRRANLIVSTTHNNEAMNRAVRQVAKTDLSGREITEGLLNHIEVAIRAFDPCLSCATHALGQMPLIVTLEDASGAEIARGVKEMSRGSPDAGKDRTMSATDGTTAPRKIVIDPVTRVEGHGKVTIRLDEAGAVEDARFHIVEFRGFERFIQGRMYWEVPLIIQRLCGICPVSHHLAAAKAMDQVAGVDRVPPTAEKLRRLMHYGQVLQSNALHIFHLASPDLLFGFDAPAEQRNIIAVLQRYPEIGKWAIFIRKFGQEVIKATGGRKIHPTSAIPGGVNQNLAVEDRD ALRAKVGEIISWCMAALDHHKAYVAENRALHDSFAAFPSAFMSLVGPDGGMDLYDGTLRVIDAEGAPLIEGAPPASYRDHLIEEVRPWSYLKFPHLRAFGRDDGWYRVGPLAQVNCAASIDTPRAEAARRDFMAEGGGKPVHATLAYHWARLIVLVHCAEKIEQLLFDDDLQGCDLRAEGTRRGEGVAWIEAPRGTLIHHYEVDENDQ VRRANLIVSTTHNNEAMNRAVRQVAKTDLSGREITEGLLNHIEVAIRAFDPCLSCATHALGQMPLIVTLEDASGAEIARGVKE
SEQ ID NO: 10:SEQ ID NO: 10:
Нуклеотидная последовательность гамма-субъединицы NAD+-восстанавливающей гидрогеназы HoxS:Nucleotide sequence of the gamma subunit of NAD+ - reducing hydrogenase HoxS:
ATGAGCGAGACCCCCTTCACCTTTACCGTGGACGGCATCGCGGTCCCGGCCACCCCCGGCCAGAGCGTCATCGAGGCGTGCGATGCGGCGGGCATCTATATCCCGCGCCTGTGCCACCACCCGGACCTGCCGCCGGCGGGCCATTGCCGGGTGTGCACCTGCATCATCGACGGGCGGCCGGCCAGCGCCTGCACCATGCCCGCCGCCAGGGGCATGGTGGTGGAGAACGAGACGCCCGCTTTGCTGGCGGAGCGGCGCACGCTGATCGAGATGCTGTTCGCGGAAGGCAACCATTTCTGCCAGTTCTGCGAGGCGAGCGGCGATTGCGAATTGCAGGCGCTGGGCTACCTGTTCGGCATGGTGGCCCCGCCCTTCCCCCATCTGTGGCCGAAGCGGCCGGTGGATGCCAGCCATCCGGATATCTATATCGACCACAATCGCTGCATCCTGTGCTCGCGCTGCGTGCGCGCCTCGCGCACCCTGGACGGCAAGTCCGTGTTCGGCTTCGAGGGGCGCGGCATCGAGATGCATCTGGCGGTGACCGGCGGGCACCTGGACGACAGCGCCATCGCCGCCGCCGACAGGGCGGTTGAGATGTGCCCGGTGGGCTGCATCGTCCTCAAGCGCACCGGCTACCGCACGCCCTATGGCCGGCGGCGCTACGACGCCGCGCCCATCGGCTCCGACATCACCGCCCGGCGCGGCGGCGCGAAGGACTGAATGAGCGAGACCCCCTTCACCTTTACCGTGGACGGCATCGCGGTCCCGGCCACCCCCGGCCAGAGCGTCATCGAGGCGTGCGATGCGGCGGGCATCTATATCCCGCGCCTGTGCCACCACCCGGACCTGCCGCCGGCGGGCCATTGCCGGGTGTGCACCTGCATCATCGACGGGCGGCCGGCCAGCGCCTGCACCATGCCCGCCGCCAGGGGCATGGTGGTGGAGAACGAGACGCCCGCTTTGC TGGCGGAGCGGCGCACGCTGATCGAGATGCTGTTCGCGGAAGGCAACCATTTCTGCCAGTTCTGCGAGGCGAGCGGCGATTGCGAATTGCAGGCGCTGGGCTACCTGTTCGGCATGGTGGCCCCGCCCTTCCCCCATCTGTGGCCGAAGCGGCCGGTGGATGCCAGCCATCCGGATATCTATATCGACCACAATCGCTGCATCCTGTGCTCGCGCTGCGTGCGCGCCTCGCGCCACCCTGGACGG CAAGTCCGTGTTCGGCTTCGAGGGGCGCGGCATCGGATGCATCTGGCGGTGACCGGCGGGCACCTGGACGACAGCGCCATCGCCGCCGCCGACAGGGCGGTTGAGATGTGCCCGGTGGGCTGCATCGTCCTCAAGCGCACCGGCTACCGCACGCCCTATGGCCGGCGGCGCTACGACGCCGCGCCCATCGGCTCCGACATCACCGCCCGGCGCGGCGGCGCGAAGGACTGA
SEQ ID NO: 11:SEQ ID NO: 11:
Аминокислотная последовательность гамма-субъединицы NAD+-восстанавливающей гидрогеназы HoxS:Amino acid sequence of the gamma subunit of NAD+ - reducing hydrogenase HoxS:
MSETPFTFTVDGIAVPATPGQSVIEACDAAGIYIPRLCHHPDLPPAGHCRVCTCIIDGRPASACTMPAARGMVVENETPALLAERRTLIEMLFAEGNHFCQFCEASGDCELQALGYLFGMVAPPFPHLWPKRPVDASHPDIYIDHNRCILCSRCVRASRTLDGKSVFGFEGRGIEMHLAVTGGHLDDSAIAAADRAVEMCPVGCIVLKRTGYRTPYGRRRYDAAPIGSDITARRGGAKDMSETPFTFTVDGIAVPATPGQSVIEACDAAGIYIPRLCHHPDLPPAGHCRVCTCIIDGRPASACTMPAARGMVVENETPALLAERRTLIEMLFAEGNHFCQFCEASGDCELQALGYLFGMVAPPFPHLWPKRPVDASHPDIYIDHNRCILCSRCVRASRTLDGKSVFGFEGRGIEMHLAVTGGHLDDSAIAAADRAVEMCPVGCIVLKRTGYRTPY GRRRYDAAPIGSDITARRGGAKD
SEQ ID NO: 12:SEQ ID NO: 12:
Нуклеотидная последовательность дельта-субъединицы NAD+-восстанавливающей гидрогеназы HoxS:Nucleotide sequence of the delta subunit of NAD+ - reducing hydrogenase HoxS:
ATGGCCAAGCCCAAACTCGCCACCTGCGCGCTGGCCGGCTGCTTCGGCTGCCACATGTCCTTCCTGGACATGGACGAGCGCATCGTCGAGCTCATCGACCTGGTGGACCTCGACGTCTCGCCCCTCGACGACAAGAAAAACTTCACCGGCATGGTGGAAATCGGCCTGGTGGAAGGCGGCTGCGCCGACGAGCGCCATGTGAAGGTGCTGCGCGAGTTCCGCGAGAAATCCCGCATCCTGGTGGCGGTGGGCGCCTGCGCCATCACCGGCGGCATCCCGGCATTGCGCAACCTCGCCGGCCTCGACGAATGCCTGAGGGAAGCCTACCTCACCGGCCCCACGGTGGAAGGCGGCGGGCTCATTCCCAACGACCCGGAGCTGCCGCTGCTGCTGGACAAGGTCTATCCGGTGCAGGACTTCGTGAAGATCGACCATTTCCTGCCCGGCTGCCCGCCCTCGGCCGACGCCATCTGGGCGGCTCTGAAGGCGCTGCTGACCGGCACCGAGCCGCATCTGCCCTACCCGCTTTTCAAGTACGAATGAATGGCCAAGCCCAAACTCGCCACCTGCGCGCTGGCCGGCTGCTTCGGCTGCCACATGTCCTTCCTGGACATGGACGAGCGCATCGTCGAGCTCATCGACCTGGTGGACCTCGACGTCTCGCCCCTCGACGACAAGAAAAACTTCACCGGCATGGTGGAAATCGGCCTGGTGGAAGGCGGCTGCGCCGACGAGCGCCATGTGAAGGTGCTGCGCGAGTTCCGCGAGAAATCCCGCATCCTGGTG GCGGTGGGCGCCTGCGCCATCACCGGCGGCATCCCGGCATTGCGCAACCTCGCCGGCCTCGACGAATGCCTGAGGGAAGCCTACCTCACCGGCCCCACGGTGGAAGGCGGCGGGCTCATTCCCAACGACCCGGAGCTGCCGCTGCTGCTGGACAAGGTCTATCCGGTGCAGGACTTCGTGAAGATCGACCATTTCCTGCCCGGCTGCCCGCCCTCGGCCGACGCCATCTGGGCGGCTCTGAAGGCGC TGCTGACCGGCACCGAGCCGCATCTGCCCTACCCGCTTTTCAAGTACGAATGA
SEQ ID NO: 13:SEQ ID NO: 13:
Аминокислотная последовательность дельта-субъединицы NAD+-восстанавливающей гидрогеназы HoxS:Amino acid sequence of the delta subunit of NAD+ - reducing hydrogenase HoxS:
MAKPKLATCALAGCFGCHMSFLDMDERIVELIDLVDLDVSPLDDKKNFTGMVEIGLVEGGCADERHVKVLREFREKSRILVAVGACAITGGIPALRNLAGLDECLREAYLTGPTVEGGGLIPNDPELPLLLDKVYPVQDFVKIDHFLPGCPPSADAIWAALKALLTGTEPHLPYPLFKYEMAKPKLATCALAGCFGCHMSFLDMDERIVELIDLVDLDVSPLDDKKNFTGMVEIGLVEGGCADERHVKVLREFREKSRILVAVGACAITGGIPALRNLAGLDECLREAYLTGPTVEGGGLIPNDPELPLLLDKVYPVQDFVKIDHFLPGCPPSADAIWAALKALLTGTEPHLPYPLFKYE
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий большую субъединицу гидрогеназы NiFeSe, имеющую последовательность, указанную в SEQ ID NO:15, или последовательность, имеющую более 84% идентичности по последовательности, например более 90% идентичности, например, более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98%, например, более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:15.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding a large subunit of NiFeSe hydrogenase having the sequence set forth in SEQ ID NO:15, or a sequence having greater than 84% sequence identity, such as greater than 90% sequence identity , for example, more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:15.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий малую субъединицу гидрогеназы NiFeSe, имеющую последовательность, указанную в SEQ ID NO:17, или последовательность, имеющую более 90% идентичности, например более 95% идентичности, например более 96% идентичности, например, более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:17.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding a small subunit of NiFeSe hydrogenase having the sequence specified in SEQ ID NO:17, or a sequence having more than 90% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:17.
SEQ ID NO: 14:SEQ ID NO: 14:
Нуклеотидная последовательность большой субъединицы периплазматической гидрогеназы [NiFeSe]:Nucleotide sequence of the large subunit of periplasmic hydrogenase [NiFeSe]:
TCCAGACCCGGGCAACATTGCTCCATGTGCTGGGCACCCTGGCCGGCCGCTGGCCCCATACCCTCGCGCTCCAGCCCGGCGGGGTGACCCGAAGCGCCGACCAGCACGACCGCATGCGCCTGCTCGCGACGCTGAAGGCGGTGCGGGCGGCGCTGGAAGAGACCTTGTTCGGCGCGCCTTTGGAAGAGGTGGCGGCCCTGGACGGCGCCGCCGCCGTGGAGGCCTGGCGCGCCAACGGCCCGGAAGGGGATTTCCGCCTGTTCCTGGAGATCGCCGCCGACCTGGAGCTGGACCGGCTCGGCCGCGCGCACGACCGCTTTCTCTCCTTCGGCGCCTACGCCCAGGACGAGGGGCGCCTTTATGGCGCCGGCACCTTCGAGGCCGGGACGGCGGGAGGGCTCGATCCCAACGCCATCACCGAGGACCACGCCTTCGCCCGCATGGAGGACCGCGCGGCGCCCCATGCGCCCTTTGACGGCTCCACCTTCCCCGATGCCGACGACACCGAGGGCTACACCTGGTGCAAGGCGCCGCGCCTTGCCGGCCTGCCCTTCGAGACCGGCGCCTTCGCCCGGCAGGTGGTGGCGGGCCATCCGCTCGCCCGGGACCTCGTGACGCGGGAAGGCGGCACTGTGCGCAGCCGCGTGGTCGGCCGGCTGCTGGAAACCGCGCGCACCCTGATCGCCATGGAGGGCTGGGTGAAGGAACTGCGGCCCGAAGGGCCCTGGTGCGCCCAGGGCCACCTGCCCCAGGAAGGCCGCGCCTTCGGCCTCACCGAGGCGGCGCGCGGGGCGCTCGGCCACTGGATGGTGGTGGAGAAGGGCCGCATTGCCCGCTACCAGATCATCGCCCCCACCACCTGGAACTTCTCCCCCCGCGACGGCGCGGGCCTGCCCGGCCCGCTGGAGACGGCCCTGGTGGGCGCGCCCGTGCGGCAGGGAGAGACGACGCCCGTGAGCGTGCAGCACATCGTGCGCTCCTTCGACCCGTGCATGGTCTGCACTGTGCATTGATCCAGACCCGGGCAACATTGCTCCATGTGCTGGGCACCCTGGCCGGCCGCTGGCCCCATACCCTCGCGCTCCAGCCCGGCGGGGTGACCCGAAGCGCCGACCAGCACGACCGCATGCGCCTGCTCGCGACGCTGAAGGCGGTGCGGGCGGCGCTGGAAGAGACCTTGTTCGGCGCGCCTTTGGAAGAGGTGGCGGCCCTGGACGGCGCCGCCGCCGTGGAGGCCTGGCGCGCCAACGGCCCGGA AGGGGATTTCCGCCTGTTCCTGGAGATCGCCGCCGACCTGGAGCTGGACCGGCTCGGCCGCGCGCACGACCGCTTTCTCTCCTTCGGCGCCTACGCCCAGGACGAGGGGCGCCTTTATGGCGCCGGCACCTTCGAGGCCGGGACGGCGGGAGGGCTCGATCCCAACGCCATCACCGAGGACCACGCCTTCGCCCGCATGGAGGACCGCGCGCGCCCCATGCGCCCTTTGACGGCTCCACCTTCCCC GATGCCGACGACACCGAGGGCTACACCTGGTGCAAGGCGCCGCGCCTTGCCGGCCTGCCCTTCGAGACCGGCGCCTTCGCCCGGCAGGTGGTGGCGGGCCATCCGCTCGCCCGGGACCTCGTGACGCGGGAAGGCGGCACTGTGCGCAGCCGCGTGGTCGGCCGGCTGCTGGAAACCGCGCGCCACCCTGATCGCCATGGAGGGCTGGGTGAAGGAACTGCGGCCCGAAGGGCCCTGGTGCG CCCAGGGCCACCTGCCCCAGGAAGGCCGCGCCTTCGGCCTCACCGAGGCGGCGCGGGGCGCTCGGCCACTGGATGGTGGTGGAGAAGGGCCGCATTGCCCGCTACCAGATCATCGCCCCCACCACCTGGAACTTCTCCCCCCGCGACGGCGCGGGCCTGCCCGGCCCGCTGGAGACGGCCCTGGTGGGCGCGCCCGTGCGGCAGGGAGAGACGACGCCCGTGAGCGTGCAGCACATCGTGC GCTCCTTCGACCCGTGCATGGTCTGCACTGTGCATTGA
SEQ ID NO: 15:SEQ ID NO: 15:
Аминокислотная последовательность большой субъединицы периплазматической гидрогеназы [NiFeSe]:Amino acid sequence of the large subunit of periplasmic hydrogenase [NiFeSe]:
MSAETRRLVVGPFNRVEGDLEVRLDVQDGRVQQAFVSSPLFRGFERILEGRDPRDALVIAPRICGICSVSQSHAAALALAGLQGIAPTHDGRIATNLIVAAENVADHLTHFHVFFMPDFARAVYEDRPWFAQAARRFKANQGVSVRRALQTRATLLHVLGTLAGRWPHTLALQPGGVTRSADQHDRMRLLATLKAVRAALEETLFGAPLEEVAALDGAAAVEAWRANGPEGDFRLFLEIAADLELDRLGRAHDRFLSFGAYAQDEGRLYGAGTFEAGTAGGLDPNAITEDHAFARMEDRAAPHAPFDGSTFPDADDTEGYTWCKAPRLAGLPFETGAFARQVVAGHPLARDLVTREGGTVRSRVVGRLLETARTLIAMEGWVKELRPEGPWCAQGHLPQEGRAFGLTEAARGALGHWMVVEKGRIARYQIIAPTTWNFSPRDGAGLPGPLETALVGAPVRQGETTPVSVQHIVRSFDPCMVCTVHMSAETRRLVVGPFNRVEGDLEVRLDVQDGRVQQAFVSSPLFRGFERILEGRDPRDALVIAPRICGICSVSQSHAAALALAGLQGIAPTHDGRIATNLIVAAENVADHLTHFHVFFMPDFARAVYEDRPWFAQAARRFKANQGVSVRRALQTRATLLHVLGTLAGRWPHTLALQPGGVTRSADQHDRMRLLATLKAVRAALEETLFGAPL EEVAALDGAAAVEAWRANGPEGDFRLFLEIAADLELDRLGRAHDRFLSFGAYAQDEGRLYGAGTFEAGTAGGLDPNAITEDHAFARMEDRAAPHAPFDGSTFPDADDTEGYTWCKAPRLAGLPFETGAFARQVVAGHPLARDLVTREGGTVRSRVVGRLLETARTLIAMEGWVKELRPEGPWCAQGHLPQEGRAFGLTEAARGALGHWMVVEKGRIARYQIIAP TTWNFSPRDGAGLPGPLETALVGAPVRQGETTPVSVQHIVRSFDPCMVCTVH
SEQ ID NO: 16:SEQ ID NO: 16:
Нуклеотидная последовательность малой субъединицы периплазматической гидрогеназы [NiFeSe]Nucleotide sequence of the small subunit of periplasmic hydrogenase [NiFeSe]
ACGGGGGAGGAAGCCCGCGCCATCTTCGACGCCATCCTTGCCGGCGTTATCGTCCTCGACGCCCTGTGCGTGGAAGGCGCGCTGCTGCGCGGGCCGAACGGCACCGGGCGCTTCCATGTGCTGGCGGGCACGGACACCCCCACCATCGACTGGGCGCGGCAGCTCGCCGGCATGGCGCGCCACGTGGTGGCGGTGGGCACCTGCGCCGCCTATGGGGGCGTGACGGCGGCGGGCATCAACCCCACCGATGCCTGCGGCCTCCAGTTCGACGGACGCCGGAAGGGTGGGGCGCTGGGGGCGGACTTCCGCTCCCGCTCGGGGCTTCCGGTCATCAATGTGGCCGGCTGCCCCACCCATCCCAACTGGGTGACGGAAACCCTGATGCTGCTCGCCTGCGGCCTGCTGGGCGAGGCCGACCTCGACGTCTATGGCCGCCCGCGCTTCTATGCGGACCTGCTGGTGCATCACGGCTGCCCGCGCAACGAATACTATGAATACAAGGCGAGCGCCGAGAAGATGAGCGACCTCGGCTGCATGATGGAGCATCTGGGCTGCCTCGGCACCCAGGCCCACGCCGACTGCAACACGCGCCTTTGGAATGGCGAGGGCTCGTGCACCCGCGGCGGCTATGCCTGCATCAACTGCACGGCGCCGGAATTCGAGGAGCCGGGCCACGCCTTCCTGGAGACGCCCAAGATCGGCGGCATCCCCATCGGCCTGCCCACCGACATGCCCAAGGCCTGGTTCATCGCCTTGTCCTCCCTCGCCAAGGCGGCGACGCCGGAGCGGCTGCGCAAGAACGCGGTGTCCGACCATGTGGTCACGCCGCCCGCCGTCAAGGACATCAAGCGGCGATGAACGGGGGAGGAAGCCCGCGCCATCTTCGACGCCATCCTTGCCGGCGTTATCGTCCTCGACGCCCTGTGCGTGGAAGGCGCGCTGCTGCGCGGGCCGAACGGCACCGGGCGCTTCCATGTGCTGGCGGGCACGGACACCCCCACCATCGACTGGGCGCGGCAGCTCGCCGGCATGGCGCGCCACGTGGTGGCGGTGGGCACCTGCGCCGCCTATGGGGGCGTGACGGCGGCGGGCATCAACCC CACCGATGCCTGCGGCCTCCAGTTCGACGGACGCCGGAAGGGTGGGGCGCTGGGGGCGGACTTCCGCTCCCGCTCGGGGCTTCCGGTCATCAATGTGGCCGGCTGCCCCACCCATCCCAACTGGGTGACGGAAACCCTGATGCTGCTCGCCTGCGGCCTGCTGGGCGAGGCCGACCTCGACGTCTATGGCCGCCCGCTTCTATGCGGACCTGCTGGTGCATCACGGCTGCCCGCGCAACGAATACT ATGAATACAAGGCGAGCGCCGAGAAGATGAGCGACCTCGGCTGCATGATGGAGCATCTGGGCTGCCTCGGCACCCAGGCCCACGCCGACTGCAACACGCGCCTTTGGAATGGCGAGGGCTCGTGCACCCGCGGCGGCTATGCCTGCATCAACTGCACGGCGCCGGAATTCGAGGAGCCGGGCCACGCCTTCCTGGAGACGCCCAAGATCGGCGGCATCCCCATCGGCCTGCCCACCGACATGCCCAAG GCCTGGTTCATCGCCTTGTCCTCCCTCGCCAAGGCGGCGACGCCGGAGCGGCTGCGCAAGAACGCGGTGTCCGACCATGTGGTCACGCCGCCCGCCGTCAAGGACATCAAGCGGCGATGA
SEQ ID NO: 17:SEQ ID NO: 17:
Аминокислотная последовательность малой субъединицы периплазматической гидрогеназы [NiFeSe]Amino acid sequence of the small subunit of periplasmic hydrogenase [NiFeSe]
MSTPFSVLWLQSGGCGGCTMSLLCAEAPDLATTLDAAGIGFLWHPALSEETGEEARAIFDAILAGVIVLDALCVEGALLRGPNGTGRFHVLAGTDTPTIDWARQLAGMARHVVAVGTCAAYGGVTAAGINPTDACGLQFDGRRKGGALGADFRSRSGLPVINVAGCPTHPNWVTETLMLLACGLLGEADLDVYGRPRFYADLLVHHGCPRNEYYEYKASAEKMSDLGCMMEHLGCLGTQAHADCNTRLWNGEGSCTRGGYACINCTAPEFEEPGHAFLETPKIGGIPIGLPTDMPKAWFIALSSLAKAATPERLRKNAVSDHVVTPPAVKDIKRRMSTPFSVLWLQSGGCGGCTMSLLCAEAPDLATTLDAAGIGFLWHPALSEETGEEARAIFDAILAGVIVLDALCVEGALLRGPNGTGRFHVLAGTDTPTIDWARQLAGMARHVVAVGTCAAYGGVTAAGINPTDACGLQFDGRRKGGALGADFRSRSGLPVINVAGCPTHPNWVTETLMLLACGLLGEADLDVYGRPRFYADRNLLVHHGCPEY YEYKASAEKMSDLGCMMEHLGCLGTQAHADCNTRLWNGEGSCTRGGYACINCTAPEFEEPGHAFLETPKIGGIPIGLPTDMPKAWFIALSSLAKAATPERLRKNAVSDHVVTPPAVKDIKRR
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий гамма-цепь АТФ-синтазы atpG_1, имеющую последовательность, указанную в SEQ ID NO:19, или последовательность, имеющую более 70% идентичности, например более 80% идентичности, например более 90% идентичности, например, более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:19.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the gamma chain of ATP synthase atpG_1 having the sequence specified in SEQ ID NO:19, or a sequence having more than 70% identity, for example more than 80% identity, for example more than 90% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:19.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий альфа-субъединицу АТФ-синтазы atpA_1, имеющую последовательность, указанную в SEQ ID NO:21, или последовательность, имеющую более 78% идентичности, например более 80% идентичности, например более 90% идентичности, например, более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:21.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the alpha subunit of ATP synthase atpA_1 having the sequence specified in SEQ ID NO:21, or a sequence having greater than 78% identity, for example greater than 80% identity, for example more than 90% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:21.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретению, содержит ген, кодирующий субъединицу b АТФ-синтазы atpF_1, имеющую последовательность, указанную в SEQ ID NO:23, или последовательность, имеющую более 62% идентичности, например более 70% идентичности, например, более 80% идентичности, например более 90% идентичности, например более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:23.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the b subunit of ATP synthase atpF_1 having the sequence specified in SEQ ID NO:23, or a sequence having more than 62% identity, for example more than 70% identity , for example, more than 80% identity, for example more than 90% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:23.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий субъединицу c специфичной к ионам натрия АТФ-синтазы atpE_1, имеющую последовательность, указанную в SEQ ID NO:25, или последовательность, имеющую 90% идентичности, например более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:25.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the c subunit of the sodium ion-specific ATP synthase atpE_1 having the sequence specified in SEQ ID NO:25, or a sequence having 90% identity, for example more 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:25.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий субъединицу a АТФ-синтазы atpB_1, имеющую последовательность, указанную в SEQ ID NO:27, или последовательность, имеющую более 80% идентичности, например более 90% идентичности, например более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:27.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the ATP synthase subunit a atpB_1 having the sequence set forth in SEQ ID NO:27, or a sequence having greater than 80% identity, such as greater than 90% identity , for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:27.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий эпсилон-цепь АТФ-синтазы atpC_1, имеющую последовательность, указанную в SEQ ID NO: 29, или последовательность, имеющую более 71% идентичности, например более 80% идентичности, например более 90% идентичности, например более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:29.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the epsilon chain of the ATP synthase atpC_1 having the sequence set forth in SEQ ID NO: 29, or a sequence having greater than 71% identity, such as greater than 80% identity. identity, for example more than 90% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:29.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий бета-субъединицу АТФ-синтазы atpD_1, имеющую последовательность, указанную в SEQ ID NO:31, или последовательность, имеющую более 84% идентичности, например более 90% идентичности, например, более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:31.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the beta subunit of ATP synthase atpD_1 having the sequence specified in SEQ ID NO:31, or a sequence having greater than 84% identity, for example greater than 90% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:31.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий бета-субъединицу АТФ-синтазы atpD_2, имеющую последовательность, указанную в SEQ ID NO:33, или последовательность, имеющую более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:33.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the beta subunit of ATP synthase atpD_2 having the sequence specified in SEQ ID NO:33, or a sequence having greater than 97% identity, for example greater than 98% , for example, more than 99% sequence identity with the sequence presented in SEQ ID NO:33.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретению, содержит ген, кодирующий гамма-цепь АТФ-синтазы atpG_2, имеющую последовательность, указанную в SEQ ID NO:35, или последовательность, имеющую более 86% идентичности, например более 90% идентичности, например, более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:35.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the gamma chain of ATP synthase atpG_2 having the sequence specified in SEQ ID NO:35, or a sequence having greater than 86% identity, for example greater than 90% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:35.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий субъединицу альфа АТФ-синтазы atpA_2, имеющую последовательность, указанную в SEQ ID NO: 37, или последовательность, имеющую более 98%, например более 99% идентичности по последовательности с последовательностью, указанной в SEQ ID NO:37.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding an ATP synthase alpha subunit atpA_2 having the sequence set forth in SEQ ID NO: 37, or a sequence having greater than 98%, for example greater than 99%, identity to sequence with the sequence specified in SEQ ID NO:37.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий дельта-субъединицу АТФ-синтазы atpH, имеющую последовательность, указанную в SEQ ID NO:39, или последовательность, имеющую более 85% идентичности, например более 90% идентичности, например, более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:39.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the delta subunit of ATP synthase atpH having the sequence specified in SEQ ID NO:39, or a sequence having greater than 85% identity, for example greater than 90% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:39.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий субъединицу b АТФ-синтазы atpF_2, имеющую последовательность, указанную в SEQ ID NO:41, или последовательность, имеющую более 87% идентичности, например более 90% идентичности, например более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:41.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding ATP synthase subunit b atpF_2 having the sequence set forth in SEQ ID NO:41, or a sequence having greater than 87% identity, such as greater than 90% identity , for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:41.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий субъединицу b' АТФ-синтазы atpG_3, имеющую последовательность, указанную в SEQ ID NO:43, или последовательность, имеющую более 81% идентичности, например более 90% идентичности, например более 95% идентичностиь, например более 96% идентичности, например более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:43.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the b' subunit of ATP synthase atpG_3 having the sequence specified in SEQ ID NO:43, or a sequence having greater than 81% identity, for example greater than 90% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:43.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий субъединицу c АТФ-синтазы atpE_2, имеющую последовательность, указанную в SEQ ID NO:45, или последовательность, имеющую более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:45.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the c subunit of ATP synthase atpE_2 having the sequence specified in SEQ ID NO:45, or a sequence having greater than 98%, for example greater than 99%, identity to sequence with the sequence presented in SEQ ID NO:45.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий субъединицу a АТФ-синтазы atpB_2, имеющую последовательность, указанную в SEQ ID NO:47, или последовательность, имеющую более 92% идентичности, например более 95% идентичности, например более 96% идентичности, например, более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:47.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the ATP synthase subunit a atpB_2 having the sequence set forth in SEQ ID NO:47, or a sequence having greater than 92% identity, such as greater than 95% identity , for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:47.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий белок I АТФ-синтазы atpI, имеющий последовательность, указанную в SEQ ID NO: 49, или последовательность, имеющую более 60% идентичности, например более 70% идентичности, например, более 80% идентичности, например более 90% идентичности, например, более 95% идентичности, например более 96% идентичности, например, более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:49.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding ATP synthase protein I atpI having the sequence set forth in SEQ ID NO: 49, or a sequence having greater than 60% identity, such as greater than 70% identity , for example, more than 80% identity, for example more than 90% identity, for example, more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:49.
SEQ ID NO: 18:SEQ ID NO: 18:
Нуклеотидная последовательность гамма-цепи АТФ-синтазы atpG_1Nucleotide sequence of the gamma chain of ATP synthase atpG_1
GTGACCGAGCGCCTGTCCGACGTCAACGCCCGCATCGCCTCGGTGCGGCAGCTCTCATCGGTCATCACGGCCATGCGGGGCATTGCGGCGGCGCGGGCGCGGGAGGCGCGGGGTCGGCTCGACGGCATCCGCGCCTATGCGCAGACCATCGCCGAGGCCATCGGCCATGTGCTCGCCGTGCTGCCCGAGGAGGCCCGCGCCCGGTCCTCCGGGCACCGGCATCGGGGCCATGCGGTCATCGCCCTGTGCGCGGAGCAGGGCTTTGCCGGCGTCTTCAACGAGCGGGTGCTGGACGAGGCCGCCCGGCTGCTGACCGGCGGGGCGGGGCCGGCCGAGCTGCTGCTGGTGGGCGACCGGGGCCTGATGGTGGCCCGCGAGCGGGGGCTCGATGTCTCCTGGTCGGTGCCCATGGTGGCCCATGCGGGCCAGGCCTCGGCGCTGGCGGACCGCATCAGCGAGGAGCTCTACCGGCGGATCGATGCGGGACGGGTGACGCGGGTGTCGGTGGTGCACGCCGAGCCCGCCGCGTCCGCCGCCATCGAGACGGTGGTGAAAGTGCTGGTGCCGTTCGACTTCGCCCGCTTCCCCCTGGCGCGGGTGGCATCCGCCCCGCTCATGACCATGCCGCCGCCGCGGCTGCTGGCCCAGCTGTCGGAGGAATATGTGTTCGCCGAGCTGTGCGAGGCGCTCACCTTGTCCTTCGCGGCGGAGAACGAGGCCCGCATGCGGGCCATGATCGCCGCCCGCGCCAATGTGGCCGATACCCTGGAGGGCCTCGTCGGCCGCGCCCGGCAGATGCGCCAGGAGGAGATCACCAACGAGATCATCGAGCTGGAAGGCGGCGCCGGCAGCGCCCGGCATGCGGATTGAGTGACCGAGCGCCTGTCCGACGTCAACGCCCGCATCGCCTCGGTGCGGCAGCTCTCATCGGTCATCACGGCCATGCGGGGCATTGCGGCGGCGCGGGCGCGGGAGGCGCGGGGTCGGCTCGACGGCATCCGCGCCTATGCGCAGACCATCGCCGAGGCCATCGGCCATGTGCTCGCCGTGCTGCCCGAGGAGGCCCGCGCCCGGTCCTCCGGGCACCGGCATCGGGGCCATGCGGTCATCGCCCTG TGCGCGGAGCAGGGCTTTGCCGGCGTCTTCAACGAGCGGGTGCTGGACGAGGCCGCCCGGCTGCTGACCGGCGGGGCGGGGCCGGCCGAGCTGCTGCTGGTGGGCGACCGGGGCCTGATGGTGGCCCGCGAGCGGGGGCTCGATGTCTCCTGGTCGGTGCCCATGGTGGCCCATGCGGGCCAGGCCTCGGCGCTGGCGGACCGCATCAGCGAGGAGCTCTACCGGCGGATCGATGCGGGACGGG TGACGCGGGTGTCGGTGGTGCACGCCGAGCCCGCCGCGTCCGCCGCCATCGAGACGGTGGTGAAAGTGCTGGTGCCGTTCGACTTCGCCCGCTTCCCCCTGGCGCGGGTGGCATCCGCCCCGCTCATGACCATGCCGCCGCCGCGGCTGCTGGCCCAGCTGTCGGAGGAATATGTGTTCGCCGAGCTGTGCGAGGCGCTCACCTTGTCCTTCGCGGCGGAGAACGAGGCCCGCATGCGGGCCATGATC GCCGCCCGCGCCAATGTGGCCGATACCCTGGAGGGCCTCGTCGGCCGCGCCCGGCAGATGCGCCAGGAGGAGATCACCAACGAGATCATCGAGCTGGAAGGCGGCGCCGGCAGCGCCCGGCATGCGGATTGA
SEQ ID NO: 19:SEQ ID NO: 19:
Аминокислотная последовательность гамма-цепи АТФ-синтазы atpG_1Amino acid sequence of the gamma chain of ATP synthase atpG_1
MTERLSDVNARIASVRQLSSVITAMRGIAAARAREARGRLDGIRAYAQTIAEAIGHVLAVLPEEARARSSGHRHRGHAVIALCAEQGFAGVFNERVLDEAARLLTGGAGPAELLLVGDRGLMVARERGLDVSWSVPMVAHAGQASALADRISEELYRRIDAGRVTRVSVVHAEPAASAAIETVVKVLVPFDFARFPLARVASAPLMTMPPPRLLAQLSEEYVFAELCEALTLSFAAENEARMRAMIAARANVADTLEGLVGRARQMRQEEITNEIIELEGGAGSARHADMTERLSDVNARIASVRQLSSVITAMRGIAAARAREARGRLDGIRAYAQTIAEAIGHVLAVLPEARARSSGHRHRGHAVIALCAEQGFAGVFNERVLDEAARLLTGGAGPAELLLVGDRGLMVARERGLDVSWSVPMVAHAGQASALADRISEELYRRIDAGRVTRVSVVHAEPAASAAIETVVKVLVPFDFARFPLARVASAPLMTMPPPRLLAQLS EEYVFAELCEALTLSFAAENEARMRAMIAARANVADTLEGLVGRARQMRQEEITNEIIELEGGAGSARHAD
SEQ ID NO: 20:SEQ ID NO: 20:
Нуклеотидная последовательность альфа-субъединицы АТФ-синтазы atpA_1Nucleotide sequence of the alpha subunit of ATP synthase atpA_1
ATGAGCACGGGCGCGCAAGCGAGCGAGGATTGGCTCACCCGGAGCCGGGCGGCCCTGGCCGGGACGCGCCTTTCCCAGCAATCCCAATCGGTGGGCCGGGTGGAGGAGATGGCCGACGGCATCGCCCGCGTCTCCGGCCTGCCGGATGTGCGGCTCGACGAGCTTCTCACCTTCGAGGGCGGCCAGACCGGCTATGCCCTCACCCTCGATCGCACCGAGATCGCCGTGGTGCTGCTGGATGACGCCTCCGGCGTGGAGGCGGGCGCCCGGGTGTTCGGCACCGGCGAGGTGGTGAAGGTGCCGGTGGGGCCGGGGCTGCTGGGCCGCATCGTCGACCCCCTCGGCCGGCCCATGGACCGCTCCGAGCCGGTGGTGGCGCAGGCGCACCATCCCATCGAGCGGCCGGCGCCGGCCATCATCGCCCGCGACCTGGTCTCGCAGCCGGTTCAGACCGGCACGCTGGTGGTGGATGCGCTGTTCTCCCTCGGCCGGGGCCAGCGCGAGCTCATCATCGGCGACCGGGCTACCGGCAAGACCGCCATCGCGGTGGACACCATCATCAGCCAGAAGCATTCGGACATCGTGTGCATCTACGTGGCGGTGGGCCAGCGCGCCGCCGCCGTGGAGCGGGTGGTGGAGGCGGTGCGCGCCCACGGGGCGATCGAGCGCTGCATCTTCGTGGTCGCCTCGGCCGCCGCCTCGCCAGGGCTGCAATGGATCGCGCCGTTCGCCGGCATGACCATGGCGGAATATTTCCGCGACAACGGCCAGCATGCGCTCATCATCATCGATGATCTCACCAAGCATGCGGCCACCCATCGCGAGCTGGCGCTGCTCACCCACGAGCCGCCGGGCCGCGAGGCCTATCCCGGCGACATCTTCTATGTGCACGCCCGCCTTCTGGAGCGGGCCGCCAAGCTCTCCGCCGAGCTGGGCGGTGGCTCGCTCACGGCCCTGCCCATCGCGGAGACGGACGCGGGAAACCTCTCCGCCTATATCCCCACCAACCTCATCTCCATCACCGATGGGCAGATCGTGCTGGATTCGCGGCTGTTCGCGGCCAACCAGCGCCCGGCGGTGGATGTGGGCCTCTCCGTGAGCCGGGTGGGCGGCAAGGCGCAGCATCCCGCGCTTCGGGCCGTGTCCGGGCGCATCCGGCTCGATTATTCCCAGTTCCTGGAGCTGGAAATGTTCACCCGCTTCGGCGGCATCACCGATACCCGCGTGAAGGCGCAGATCACCCGGGGCGAGCGCATCCGCGCGCTGCTCACCCAGCCGCGCTTTTCCACCCTGCGCCTTCAGGACGAGGTGGCGCTGCTGGCCGCGCTGGCGGAGGGGGTGTTCGACACTTTGGCCCCGGGGCTGATGGGCGCCGTGCGTGCCCGCATTCCGGCCCAGCTGGATGCGCAGGTGAAGGACGTGGCCTCGGCCCTCGCCGAGGGCAAGGTGCTGGAGGAGGGCTTGCACGCCCGTCTCGTGGCGGCCGTGCGGGCCGTCGCGGCGGACGTGGCCGCGACCGCGAAGGCCGGGCCGTGAATGAGCACGGGCGCGCAAGCGAGCGAGGATTGGCTCACCCGGAGCCGGGCGGCCCTGGCCGGGACGCGCCTTTCCCAGCAATCCCAATCGGTGGGCCGGGTGGAGGAGATGGCCGACGGCATCGCCCGCGTCTCCGGCCTGCCGGATGTGCGGCTCGACGAGCTTCTCACCTTCGAGGGCGGCCAGACCGGCTATGCCCTCACCCTCGATCGCACCGAGATCGCCGTGGTGCTGCTGGATG ACGCCTCCGGCGTGGAGGCGGGCGCCCGGGTGTTCGGCACCGGCGAGGTGGTGAAGGTGCCGGTGGGGCCGGGGCTGCTGGGCCGCATCGTCGACCCCCTCGGCCGGCCCATGGACCGCTCCGAGCCGGTGGTGGCGCAGGCGCACCATCCCATCGAGCGGCCGGCGCCGGCCATCATCGCCCGCGACCTGGTCTCGCAGCCGGTTCAGACCGGCACGCTGGTGGTGGATGCGCTGTTCCTCCCTC GGCCGGGGCCAGCGCGAGCTCATCATCGGCGACCGGGCTACCGGCAAGACCGCCATCGCGGTGGACACCATCATCAGCCAGAAGCATTCGGACATCGTGTGCATCTACGTGGCGGTGGGCCAGCGCGCCGCCGCCGTGGAGCGGGTGGTGGAGGCGGTGCGCGCCCACGGGGCGATCGAGCGCTGCATCTTCGTGGTCGCCTCGGCCGCCGCCTCGCCAGGGCTGCAATGGATCGCGCCGTTCG CCGGCATGACCATGGCGGAATATTTCCGCGACAACGGCCAGCATGCGCTCATCATCATCGATGATCTCACCAAGCATGCGGCCACCCATCGCGAGCTGGCGCTGCTCACCCACGAGCCGCCGGGCCGCGAGGCCTATCCCGGCGACATCTTCTATGTGCACGCCCGCCTTCTGGAGCGGGCCGCCAAGCTCTCCGCCGAGCTGGGCGGTGGCTCGCTCACGGCCCTGCCCATCGCGGAGACGGACGCGGGAAA CCTCTCCGCCTATATCCCCACCAACCTCATCTCCATCACCGATGGGCAGATCGTGCTGGATTCGCGGCTGTTCGCGGCCAACCAGCGCCCGGCGGTGGATGTGGGCCTCTCCGTGAGCCGGGTGGGCGGCAAGGCGCAGCATCCCGCGCTTCGGGCCGTGTCCGGGCGCATCCGGCTCGATTATTCCCAGTTCCTGGAGCTGGAAATGTTCACCCGCTTCGGCGGCATCACCGATACCCGCG TGAAGGCGCAGATCACCCGGGGCGAGCGCATCCGCGCGCTGCTCACCCAGCCGCGCTTTTCCACCCTGCGCCTTCAGGACGAGGTGGCGCTGCTGGCCGCGCTGGCGGAGGGGGTGTTCGACACTTTGGCCCCGGGGCTGATGGGCGCCGTGCGTGCCCGCATTCCGGCCCAGCTGGATGCGCAGGTGAAGGACGTGGCCTCGGCCCTCGCCGAGGGCAAGGTGCTGGAGGAGGGCTTGCACGC CCGTCTCGTGGCGGCCGTGCGGGCCGTCGCGGCGGACGTGGCCGCGACCGCGAAGGCCGGGCCGTGA
SEQ ID NO: 21:SEQ ID NO: 21:
Аминокислотная последовательность альфа-субъединицы АТФ-синтазы atpA_1Amino acid sequence of the alpha subunit of ATP synthase atpA_1
MSTGAQASEDWLTRSRAALAGTRLSQQSQSVGRVEEMADGIARVSGLPDVRLDELLTFEGGQTGYALTLDRTEIAVVLLDDASGVEAGARVFGTGEVVKVPVGPGLLGRIVDPLGRPMDRSEPVVAQAHHPIERPAPAIIARDLVSQPVQTGTLVVDALFSLGRGQRELIIGDRATGKTAIAVDTIISQKHSDIVCIYVAVGQRAAAVERVVEAVRAHGAIERCIFVVASAAASPGLQWIAPFAGMTMAEYFRDNGQHALIIIDDLTKHAATHRELALLTHEPPGREAYPGDIFYVHARLLERAAKLSAELGGGSLTALPIAETDAGNLSAYIPTNLISITDGQIVLDSRLFAANQRPAVDVGLSVSRVGGKAQHPALRAVSGRIRLDYSQFLELEMFTRFGGITDTRVKAQITRGERIRALLTQPRFSTLRLQDEVALLAALAEGVFDTLAPGLMGAVRARIPAQLDAQVKDVASALAEGKVLEEGLHARLVAAVRAVAADVAATAKAGPMSTGAQASEDWLTRSRAALAGTRLSQQSQSVGRVEEMADGIARVSGLPDVRLDELLTFEGGQTGYALTLDRTEIAVVLLDDASGVEAGARVFGTGEVVKVPVGPGLLGRIVDPLGRPMDRSEPVVAQAHHPIERPAPAIIARDLVSQPVQTGTLVVDALFSLGRGQRELIIGDRATGKTAIAVDTIISQKHSDIVCIYVAVGQ RAAAVERVVEAVRAHGAIERCIFVVASAAASPGLQWIAPFAGMTMAEYFRDNGQHALIIIDDLTKHAATHRELALLTHEPPGREAYPGDIFYVHARLLERAAKLSAELGGGSLTALPIAETDAGNLSAYIPTNLISITDGQIVLDSRLFAANQRPAVDVGLSVSRVGGKAQHPALRAVSGRIRLDYSQFLELEMFTRFGGITDTRVKAQITRGERIR ALLTQPRFSTLRLQDEVALLAALAEGVFDTLAPGLMGAVRARIPAQLDAQVKDVASALAEGKVLEEGLHARLVAAVRAVAADVAATAKAGP
SEQ ID NO: 22:SEQ ID NO: 22:
Нуклеотидная последовательность субъединицы b АТФ-синтазы atpF_1Nucleotide sequence of subunit b of ATP synthase atpF_1
ATGCAGATCGACTGGTGGACGCTGGGCCTGCAGACGGTCAACGTCCTCGTTCTCATCTGGCTCCTGAGCCGCTTCCTGTTCAAGCCGGTGGCGCAGGTCATCGCGCAGCGCCGTGCCGAGATCGAGAAGCTGGTGGAGGATGCGCGCGCCGCCAAGGCCGCCGCCGAGGCCGAGCGGGACACGGCGAAGGCGGAGGAGGCGCGCCTTGCCGCCGAGCGCGGCGCCCGCATGGCGGCGGTCGCCAAGGAGGCGGAGGCGCAGAAGGCGGCATTGCTGGCCGCCGCCAAGACCGAGGCCGAGGCCCTGCACGCGGCCGCGGAAGCGGCCATCGTCCGGGCGCGGGCGAGCGAGGAGGAAGCCGCCGCCGACCGCGCCAGCCGCCTTGCCGTGGACATCGCCGCCAAGCTGCTGGACCGGCTGCCCGACGACGCCCGGGTCGCGGGCTTCATCGATGGCCTCGCCGAGGGGCTTGAAGCCCTGCCCGAGGCGAGCCGGGCGGTGATCGGCGTCGACGGCGCGCCAGTGCGCGTGACGGCCGCGCGCGCCCTTATGCCGGCGGAGGAGGAGGCCTGCCGCACGCGGCTCTCCCAGGCGCTGGGCCGTCCGGTGACGCTGGCCGTGACCATCGACCCCGCCCTCATCGCCGGCCTGGAGATGGAGACGCCCCACGCGGTGGTGCGCAATTCCTTCAAGGCCGATCTCGACCGCGTCACCGCGGCGCTCACCCATCATGGGACCTGAATGCAGATCGACTGGTGGACGCTGGGCCTGCAGACGGTCAACGTCCTCGTTCTCATCTGGCTCCTGAGCCGCTTCCTGTTCAAGCCGGTGGCGCCAGGTCATCGCGCAGCGCCGTGCCGAGATCGAGAAGCTGGTGGAGGATGCGCGCGCCGCCAAGGCCGCCGCCGAGGCCGAGCGGGACACGGCGAAGGCGGAGGAGGCGCGCCTTGCCGCCGAGCGCGGCGCCCGCATGGCGGCGGTCGCCAAG GAGGCGGAGGCGCAGAAGGCGGCATTGCTGGCCGCCGCCAAGACCGAGGCCGAGGCCCTGCACGCGGCCGCGGAAGCGGCCATCGTCCGGGCGCGGGCGAGCGAGGAGGAAGCCGCCGCCGACCGCGCCAGCCGCCTTGCCGTGGACATCGCCGCCAAGCTGCTGGACCGGCTGCCCGACGACGCCCGGGTCGCGGGCTTCATCGATGGCCTCGCCGAGGGGCTTGAAGCCCTGCCCGAGGCGAGCC GGGCGGTGATCGGCGTCGACGGCGCGCAGTGCGCGTGACGGCCGCGCGCCCCTTATGCCGGCGGAGGAGGAGGCCTGCCGCACGCGGCTCTCCCAGGCGCTGGGCCGTCCGGTGACGCTGGCCGTGACCATCGACCCCGCCCTCATCGCCGGCCTGGAGATGGAGACGCCCCACGCGGTGGTGCGCAATTCCTTCAAGGCCGATCTCGACCGCGTCACCGCGGCGCTCACCCATCATGGGA CCTGA
SEQ ID NO: 23:SEQ ID NO: 23:
Аминокислотная последовательность субъединицы b АТФ-синтазы atpF_1Amino acid sequence of ATP synthase subunit b atpF_1
MQIDWWTLGLQTVNVLVLIWLLSRFLFKPVAQVIAQRRAEIEKLVEDARAAKAAAEAERDTAKAEEARLAAERGARMAAVAKEAEAQKAALLAAAKTEAEALHAAAEAAIVRARASEEEAAADRASRLAVDIAAKLLDRLPDDARVAGFIDGLAEGLEALPEASRAVIGVDGAPVRVTAARALMPAEEEACRTRLSQALGRPVTLAVTIDPALIAGLEMETPHAVVRNSFKADLDRVTAALTHHGTMQIDWWTLGLQTVNVLVLIWLLSRFLFKPVAQVIAQRRAEIEKLVEDARAAKAAAEAERDTAKAEEARLAAERGARMAAVAKEAEAQKAALLAAAKTEAEALHAAAEAAIVRARASEEEAAADRASRLAVDIAAKLLDRLPDDARVAGFIDGLAEGLEALPEASRAVIGVDGAPVRVTAARALMPAEEEACRTRLSQALGRPVTLAVTIDPALIAGLEM ETPHAVVRNSFKADLDRVTAALTHHGT
SEQ ID NO: 24:SEQ ID NO: 24:
Нуклеотидная последовательность субъединицы c специфичной к ионам натрия АТФ-синтазы atpE_1Nucleotide sequence of subunit c of the sodium ion-specific ATP synthase atpE_1
ATGACTGTCGAGATGGTCAGCATCTTCGCGGCGGCGCTCGCCGTCTCCTTCGGCGCCATCGGGCCGGCCCTGGGCGAGGGCCGGGCGGTGGCCGCGGCCATGGACGCCATCGCCCGCCAGCCGGAGGCGGCCGGAACCTTGTCGCGCACGCTCTTCGTCGGCCTCGCCATGATCGAGACCATGGCGATCTACTGCCTGGTGATCGCGCTCCTGGTGCTCTTCGCCAATCCGTTCGTGAAGTGAATGACTGTCGAGATGGTCAGCATCTTCGCGGCGGCGCTCGCCGTCTCCTTCGGCGCCATCGGGCCGGCCCTGGGCGAGGGCCGGGCGGTGGCCGCGGCCATGGACGCCATCGCCCGCCAGCCGGAGGCGGCCGGAACCTTGTCGCGCACGCTCTTCGTCGGCCTCGCCATGATCGAGACCATGGCGATCTACTGCCTGGTGATCGCGCTCCTGGTGCTCTTCGCCAATCCGTTCGTGAAGT GA
SEQ ID NO: 25:SEQ ID NO: 25:
Аминокислотная последовательность субъединицы c специфичной к ионам натрия АТФ-синтазы atpE_1 Amino acid sequence of subunit c of the sodium ion-specific ATP synthase atpE_1
MTVEMVSIFAAALAVSFGAIGPALGEGRAVAAAMDAIARQPEAAGTLSRTLFVGLAMIETMAIYCLVIALLVLFANPFVKMTVEMVSIFAAALAVSFGAIGPALGEGRAVAAAMDAIARQPEAAGTLSRTLFVGLAMIETMAIYCLVIALLVLFANPFVK
SEQ ID NO: 26:SEQ ID NO: 26:
Нуклеотидная последовательность субъединицы a АТФ-синтазы atpB_1Nucleotide sequence of subunit a of ATP synthase atpB_1
ATGGGCTCGCCGCTGATCCTCGAACCCCTGTTCCATATCGGGCCCGTGCCCATCACCGCGCCGGTGGTGGTCACCTGGCTCATCATGGCCGCCTTCATTGGGCTGGCGCGGCTCATCACCCGGAAGCTTTCCACCGATCCCACCCGGACCCAGGCGGCGGTGGAAACGGTGCTGACCGCCATCGATTCCCAGATCGCCGACACCATGCAGGCCGATCCCGCGCCTTATCGCGCGCTCATCGGCACCATCTTCCTTTATGTGCTGGTGGCCAACTGGTCCTCGCTCATCCCGGGCATCGAGCCGCCCACGGCGCATATCGAGACCGATGCGGCGCTCGCTTTCATCGTGTTCGCCGCCACCATCGGGTTCGGGTTGAAGACAAGGGGTGTGAAGGGCTATCTCGCCACCTTCGCCGAACCCTCCTGGGTGATGATCCCGCTCAATGTGGTGGAGCAGATCACCCGGACCTTCTCGCTCATCGTGCGCCTGTTCGGCAACATCATGAGCGGGGTGTTCGTGGTCGGCATCATCCTGTCCCTCGCCGGGCTGCTGGTGCCCATCCCCCTCATGGCGCTCGATCTCCTGACCGGCGCCGTGCAGGCCTACATCTTCGCGGTGCTGGCCTGCGTGTTCATCGGCGCGGCCATTGGCGAGGCGCCGGCAAAGCCCCAATCGAAGGAGCCAGGGAAAACATCATGAATGGGCTCGCCGCTGATCCTCGAACCCCTGTTCCATATCGGGCCCGTGCCCATCACCGCGCCGGTGGTGGTCACCTGGCTCATCATGGCCGCCTTCATTGGGCTGGCGCGGCTCATCACCCGGAAGCTTTCCACCGATCCCACCCGGACCCAGGCGGCGGTGGAAACGGTGCTGACCGCCATCGATTCCCAGATCGCCGACACCATGCAGGCCGATCCCGCGCCTTATCGCGCTCATCGGCAC CATCTTCCTTTATGTGCTGGTGGCCAACTGGTCCTCGCTCATCCCGGGCATCGAGCCGCCCACGGCGCATATCGAGACCGATGCGGCGCTCGCTTTCATCGTGTTCGCCGCCACCATCGGGTTCGGGTTGAAGACAAGGGGTGTGAAGGGCTATCTCGCCACCTTCGCCGAACCCTCCTGGGTGATGATCCCGCTCAATGTGGTGGAGCAGATCACCCGGACCTTCTCGCTCATCGTGCGCCTGTTCGGC AACATCATGAGCGGGGTGTTCGTGGTCGGCATCCTGTCCCTCGCCGGGCTGCTGGTGCCCATCCCCCTCATGGCGCTCGATCTCCTGACCGGCGCCGTGCAGGCCTACATCTTCGCGGTGCTGGCCTGCGTGTTCATCGGCGCGGCCATTGGCGAGGCGCCGGCAAAGCCCCAATCGAAGGAGCCAGGGAAAACATCATGA
SEQ ID NO: 27:SEQ ID NO: 27:
Аминокислотная последовательность субъединицы a АТФ-синтазы atpB_1Amino acid sequence of subunit a of ATP synthase atpB_1
MGSPLILEPLFHIGPVPITAPVVVTWLIMAAFIGLARLITRKLSTDPTRTQAAVETVLTAIDSQIADTMQADPAPYRALIGTIFLYVLVANWSSLIPGIEPPTAHIETDAALAFIVFAATIGFGLKTRGVKGYLATFAEPSWVMIPLNVVEQITRTFSLIVRLFGNIMSGVFVVGIILSLAGLLVPIPLMALDLLTGAVQAYIFAVLACVFIGAAIGEAPAKPQSKEPGKTSMGSPLILEPLFHIGPVPITAPVVVTWLIMAAFIGLARLITRKLSTDPTRTQAAVETVLTAIDSQIADTMQADPAPYRALIGTIFLYVLVANWSSLIPGIEPPTAHIETDAALAFIVFAATIGFGLKTRGVKGYLATFAEPSWVMIPLNVVEQITRTFSLIVRLFGNIMSGVFVVGIILSLAGLLVPIPLMALDLLTGAVQAYIFAVLACV FIGAAIGEAPAKPQSKEPGKTS
SEQ ID NO: 28:SEQ ID NO: 28:
Нуклеотидная последовательность эпсилон-цепи АТФ-синтазы atpC_1Nucleotide sequence of the epsilon chain of ATP synthase atpC_1
GTGAGCGCGCCGCTGCACCTCACCATCACCACGCCGGCCGCCGTTCTGGTGGACCGTGCCGACATCGTGGCCCTGCGTGCCGAGGACGAGAGCGGCAGCTTCGGCATCCTGCCCGGCCATGCGGATTTCCTGACCGTTCTGGAGGCCTGCGTGGTGCGCTTCAAGGATGGGGCCGACGGCGTGCATTATTGTGCTCTCAGTGGTGGCGTGCTGTCGGTCGAGGAGGGCCGGCGCATCGCCATCGCCTGCCGTCAGGGCACGGTGAGCGACGACCTGGTCGCCCTGGAAGGGGCGGTGGACGCCATGCGTTCGGCGGAGAGCGATGCCGACAAGCGGGCCCGGGTGGAGCAGATGCGCCTTCATGCCCACGCCGTGCGCCAGCTCCTGCACTATCTGCGGCCCGGCCGGGCCGGCGGCGTGGCGCCGGCCGCCGCGCCGGAGGAGGGGCCGTCATGAGTGAGCGCGCCGCTGCACCTCACCATCACCACGCCGGCCGCCGTTCTGGTGGACCGTGCCGACATCGTGGCCCTGCGTGCCGAGGACGAGAGCGGCAGCTTCGGCATCCTGCCCGGCCATGCGGATTTCCTGACCGTTCTGGAGGCCTGCGTGGTGCGCTTCAAGGATGGGGCCGACGGCGTGCATTATTGTGCTCTCAGTGGTGGCGTGCTGTCGGTCGAGGAGGGCCGGCGCATC GCCATCGCCTGCCGTCAGGGCACGTGAGCGACGACCTGGTCGCCCTGGAAGGGGCGGTGGACGCCATGCGTTCGGCGGAGAGCGATGCCGACAAGCGGGCCCGGGTGGAGCAGATGCGCCTTCATGCCCACGCCGTGCGCCAGCTCTGCACTATCTGCGGCCCGGCCGGGCCGGCGGCGTGGCGCCGGCCGCCGCGCCGGAGGAGGGGCCGTCATGA
SEQ ID NO: 29:SEQ ID NO: 29:
Аминокислотная последовательность эпсилон-цепи АТФ-синтазы atpC_1Amino acid sequence of the epsilon chain of ATP synthase atpC_1
MSAPLHLTITTPAAVLVDRADIVALRAEDESGSFGILPGHADFLTVLEACVVRFKDGADGVHYCALSGGVLSVEEGRRIAIACRQGTVSDDLVALEGAVDAMRSAESDADKRARVEQMRLHAHAVRQLLHYLRPGRAGGVAPAAAPEEGPSMSAPLHLTITTPAAVLVDRADIVALRAEDESGSFGILPGHADFLTVLEACVVRFKDGADGVHYCALSGGVLSVEEGRRIAIACRQGTVSDDLVALEGAVDAMRSAESDADKRARVEQMRLHAHAVRQLLHYLRPGRAGGVAPAAAPEEGPS
SEQ ID NO: 30:SEQ ID NO: 30:
Нуклеотидная последовательность бета-субъединицы АТФ-синтазы atpD_1Nucleotide sequence of the beta subunit of ATP synthase atpD_1
ATGGCAGCGGCAGATGAGGAGGCGCAATCGGCCGCCGGCCCCGCCTCGGGCCGGGTGGTGGCCGTGCGCGGCGCGGTGATCGACATCGCCTTTGCCCAGCCTCCGCTGCCGCCGCTGGACGACGCCCTTCTCATCACCGACGGCCGGGGCGGCACGGTGCTGGTGGAGGTGCAGAGCCATATGGATCGGCACACGGTGCGCGCCATCGCCCTTCAGGCCACCACCGGCCTCAGCCGGGGGCTGGAGGCGGCGCGGGTGGGCGGGCCGGTGAAGGTGCCGGTGGGAGACCATGTGCTCGGCCGCCTCCTGGATGTCACCGGCGCCATCGGCGACAAGGGCGGGCCGCTGCCGGCCGACGTGCCCACGCGGCCGATCCACCACGCGCCGCCATCCTTCGCCGCGCAGGGCGGCACGTCCGATCTGTTTCGCACCGGCATCAAGGTCATCGACCTCCTGGCGCCCCTCGCCCAGGGCGGCAAGGCGGCCATGTTCGGCGGGGCCGGCGTGGGCAAGACCGTGCTGGTGATGGAGCTGATCCACGCCATGGTGGCGAGCTACAAGGGCATCTCGGTGTTTGCCGGCGTGGGGGAGCGCTCCCGCGAGGGCCACGAGATGCTGCTGGACATGACCGATTCCGGCGTGCTCGACCGCACCGTTCTGGTCTATGGCCAGATGAACGAGCCCCCCGGGGCCCGCTGGCGGGTGCCCATGACGGCGCTGACCATCGCCGAATATTTCCGCGACGAGAAGCACCAGAACGTCCTGCTGCTGATGGACAACATCTTCCGCTTCGTCCAGGCGGGGGCGGAGGTCTCCGGCCTTTTGGGCCGTCCGCCCTCCCGGGTGGGATACCAGCCGACGCTGGCGAGCGAGGTGGCGGCGCTCCAGGAACGCATCACCTCCGTGGGCGAGGCCTCGGTGACCGCCATCGAGGCGGTCTACGTGCCGGCGGATGACTTCACCGATCCCGCCGTGACCACCATCGCCGCCCACGTGGATTCCATGGTGGTGCTCTCCCGCGCCATGGCGGCGGAGGGCATGTATCCGGCGGTGGACCCCATCTCCTCCTCGTCGGTGCTGCTCGACCCGCTCATCGTGGGGGACGAGCATGCGCGCGTCGCCAACGAGGTGCGCCGGACCATCGAGCATTATCGCGAGCTTCAGGATGTGATCTCGCTGCTGGGCATGGAGGAATTGGGCACCGAGGATCGCCGCATCGTGGAGCGGGCGCGCCGGCTCCAGCGCTTCCTCACCCAGCCCTTCACGGTCACCGAGGCCTTCACCGGCGTGCCCGGCCGCTCGGTGGCCATCGCCGACACCATCGCCGGCTGCAGGATGATCCTGTCCGGCGCCTGCGACGACTGGCAGGAAAGCGCCCTCTACATGGTGGGCACCATCGACGAGGCCCGCCAGAAGGAGGAGGCCGCTCGCGCCAAGGCGGGGCAGGGCGCCCCGGCCGGGACGGCAGCCGAGACGGCGGAGGCCGCCCCGTGAATGGCAGCGGCAGATGAGGAGGCGCAATCGGCCGCCGGCCCCGCCTCGGGCCGGGTGGTGGCCGTGCGCGGCGCGGTGATCGACATCGCCTTTGCCCAGCCTCCGCTGCCGCCGCTGGACGACGCCCTTCTCATCACCGACGGCCCGGGGCGGCACGGTGCTGGTGGAGGTGCAGAGCCATATGGATCGGCACACGGTGCGCGCCATCGCCCTTCAGGCCACCACCGGCCTCAGCCGGGGGCT GGAGGCGGCGCGGGTGGGCGGGCCGGTGAAGGTGCCGGTGGGAGACCATGTGCTCGGCCGCCTCCTGGATGTCACCGGCGCCATCGGCGACAAGGGCGGGCCGCTGCCGGCCGACGTGCCCACGCGGCCGATCCACCACGCGCCGCCATCCTTCGCCGCGCAGGGCGGCACGTCCGATCTGTTTCGCACCGGCATCAAGGTCATCGACCTCCTGGCGCCCCTCGCCCAGGGCGGCAAGGCGGC CATGTTCGGCGGGGCCGGCGTGGGCAAGACCGTGCTGGTGATGGAGCTGATCCACGCCATGGTGGCGAGCTACAAGGGCATCTCGGTGTTTGCCGGCGTGGGGGAGCGCTCCCGCGAGGGCCACGAGATGCTGCTGGACATGACCGATTCCGGCGTGCTCGACCGCACCGTTCTGGTCTATGGCCAGATGAACGAGCCCCCCGGGGCCCGCTGGCGGGTGCCCATGACGGCGCTGACCATCGCCGAATA TTTCCGCGACGAGAAGCACCAGAACGTCCTGCTGCTGATGGACAACATCTTCCGCTTCGTCCAGGCGGGGGCGGAGGTCTCCGGCCTTTTGGGCCGTCCGCCCTCCCGGGTGGGATACCAGCCGACGCTGGCGAGCGAGGTGGCGGCGCTCCAGGAACGCATCACCTCCGTGGGCGAGGCCTCGGTGACCGCCATCGAGGCGGTCTACGTGCCGGCGGATGACTTCACCGATCCCGCCGTGA CCACCATCGCCGCCCACGTGGATTCCATGGTGGTGCTCTCCCGCGCCATGGCGGCGGAGGGCATGTATCCGGCGGTGGACCCCATCTCCTCCTCGTCGGTGCTGCTCGACCCGCTCATCGTGGGGGACGAGCATGCGCGCGTCGCCAACGAGGTGCGCCGGACCATCGAGCATTATCGCGAGCTTCAGGATGTGATCTCGCTGCTGGGCATGGAGGAATTGGGCACCGAGGATCGCCGCATCGTGGA GCGGGCGCGCCGGCTCCAGCGCTTCCTCACCCAGCCCTTCACGGTCACCGAGGCCTTCACCGGCGTGCCCGGCCGCTCGGTGGCCATCGCCGACACCATCGCCGGCTGCAGGATGATCCTGTCCGGCGCCTGCGACGACTGGCAGGAAAGCGCCCTCTACATGGTGGGCACCATCGACGAGGCCCGCCAGAAGGAGGAGGCCGCTCGCGCCAAGGCGGGGCAGGGCGCCCCGGCCGGGACGGCAGCCG AGACGGCGGAGGCCGCCCCGTGA
SEQ ID NO: 31:SEQ ID NO: 31:
Аминокислотная последовательность бета-субъединицы АТФ-синтазы atpD_1Amino acid sequence of the beta subunit of ATP synthase atpD_1
MAAADEEAQSAAGPASGRVVAVRGAVIDIAFAQPPLPPLDDALLITDGRGGTVLVEVQSHMDRHTVRAIALQATTGLSRGLEAARVGGPVKVPVGDHVLGRLLDVTGAIGDKGGPLPADVPTRPIHHAPPSFAAQGGTSDLFRTGIKVIDLLAPLAQGGKAAMFGGAGVGKTVLVMELIHAMVASYKGISVFAGVGERSREGHEMLLDMTDSGVLDRTVLVYGQMNEPPGARWRVPMTALTIAEYFRDEKHQNVLLLMDNIFRFVQAGAEVSGLLGRPPSRVGYQPTLASEVAALQERITSVGEASVTAIEAVYVPADDFTDPAVTTIAAHVDSMVVLSRAMAAEGMYPAVDPISSSSVLLDPLIVGDEHARVANEVRRTIEHYRELQDVISLLGMEELGTEDRRIVERARRLQRFLTQPFTVTEAFTGVPGRSVAIADTIAGCRMILSGACDDWQESALYMVGTIDEARQKEEAARAKAGQGAPAGTAAETAEAAPMAAADEEAQSAAGPASGRVVAVRGAVIDIAFAQPPLPPLDDALLITDGRGGTVLVEVQSHMDRHTVRAIALQATTGLSRGLEAARVGGPVKVPVGDHVLGRLLDVTGAIGDKGGPLPADVPTRPIHHAPPSFAAQGGTSDLFRTGIKVIDLLAPLAQGGKAAMFGGAGVGKTVLVMELIHAMVASYKGISVFAGVGERSREGHEMLLDMTD SGVLDRTVLVYGQMNEPPGARWRVPMTALTIAEYFRDEKHQNVLLLMDNIFRFVQAGAEVSGLLGRPPSRVGYQPTLASEVAALQERITSVGEASVTAIEAVYVPADDFTDPAVTTIAAHVDSMVVLSRAMAAEGMYPAVDPISSSSVLLDPLIVGDEHARVANEVRRTIEHYRELQDVISLLGMEELGTEDRRIVERARRLQRFLTQPFTVTE AFTGVPGRSVAIADTIAGCRMILSGACDDWQESALYMVGTIDEARQKEEAARAKAGQGAPAGTAAETAEAAP
SEQ ID NO: 32:SEQ ID NO: 32:
Нуклеотидная последовательность бета-субъединицы АТФ-синтазы atpD_2Nucleotide sequence of the beta subunit of ATP synthase atpD_2
ATGGCGAACAAGGTCGGACGCATCACCCAGATCATCGGCGCCGTCGTCGACGTGCAGTTCGACGGGCATCTGCCGGCGATTCTCAACGCGATCGAGACCACCAACCAGGGCAACCGGCTGGTGCTCGAAGTGGCTCAGCATCTCGGCGAGAACACCGTGCGCTGCATCGCCATGGATGCCACTGAAGGCCTGGTGCGTGGCCAGGAGGTGGCCGACACCGATGCGCCCATCCAGGTGCCCGTGGGCGCCGCCACCCTCGGCCGCATCATGAACGTGATCGGCGAGCCGGTGGACGAGCTGGGCCCCATCGAGGGCGAAGCGCTGCGCGGCATCCATCAGCCGGCCCCCTCCTATGCGGAGCAGGCCACGGAAGCTGAGATCCTCGTCACCGGCATCAAGGTGGTGGATCTGCTGGCGCCCTATTCCAAGGGCGGCAAGGTGGGCCTGTTCGGCGGCGCCGGCGTGGGCAAGACCGTGCTCATCATGGAGCTGATCAACAACGTGGCCAAGGCGCACGGCGGCTATTCCGTGTTCGCCGGCGTGGGTGAGCGCACCCGCGAGGGCAACGACCTCTACCACGAGATGATCGAGTCCAACGTGAACAAGGACCCGCACGAGAACAATGGCTCGGCGGCCGGTTCCAAGTGCGCCCTGGTCTATGGCCAGATGAACGAGCCGCCCGGCGCCCGCGCCCGCGTGGCCCTCACCGGCCTCACCGTCGCCGAGCATTTCCGCGACCAGGGCCAGGACGTGCTGTTCTTCGTGGACAACATCTTCCGCTTCACCCAGGCGGGCTCCGAGGTGTCGGCGCTTCTCGGCCGCATCCCCTCGGCGGTGGGCTACCAGCCGACGCTGGCCACCGACATGGGCCAGCTGCAGGAGCGCATCACCACCACCACCAAGGGCTCCATCACCTCGGTGCAGGCCATCTACGTGCCGGCGGACGATCTGACCGATCCGGCGCCGGCCGCCTCCTTCGCCCATCTGGACGCCACCACGGTGCTGTCGCGCTCCATCGCGGAGAAGGGCATCTACCCGGCGGTGGATCCGCTGGACTCCACCTCGCGCATGCTGTCTCCCGCCATCCTCGGCGACGAGCACTACAACACCGCGCGCCAGGTGCAGCAGACCCTGCAGCGCTACAAGGCGCTCCAGGACATCATCGCCATCCTGGGCATGGACGAACTCTCCGAAGAGGACAAGCTCACCGTGGCCCGCGCCCGCAAGATCGAGCGCTTCCTCTCCCAGCCCTTCCACGTGGCCGAGGTGTTCACCGGTTCGCCCGGCAAGCTGGTCGACCTCGCCGACACCATCAAGGGCTTCAAGGGCCTGGTGGACGGCAAGTACGACTACCTGCCCGAGCAGGCCTTCTACATGGTGGGCACCATCGAAGAAGCCATCGAGAAGGGCAAGAAGCTGGCGGCCGAGGCGGCCTGAATGGCGAACAAGGTCGGACGCATCACCCAGATCATCGGCGCCGTCGTCGACGTGCAGTTCGACGGGCATCTGCCGGCGATTCTCAACGCGATCGAGACCACCAACCAGGGCAACCGGCTGGTGCTCGAAGTGGCTCAGCATCTCGGCGAGAACACCGTGCGCTGCATCGCCATGGATGCCACTGAAGGCCTGGTGCGTGGCCAGGAGGTGGCCGACACCGATGCGCCCATCCAGGTGCC CGTGGGCGCCGCCACCCTCGGCCGCATCATGAACGTGATCGGCGAGCCGGTGGACGAGCTGGGCCCCATCGAGGGCGAAGCGCTGCGCGGCATCCATCAGCCGGCCCCCTCCTATGCGGAGCAGGCCACGGAAGCTGAGATCCTCGTCACCGGCATCAAGGTGGTGGATCTGCTGGCGCCTATCCAAGGGCGGCAAGGTGGGCCTGTTCGGCGGCGCCGGCGTGGGCAAGACCGTGCTCATCATCAT GGAGCTGATCAACAACGTGGCCAAGGCGCCACGGCGGCTATTCCGTGTTCGCCGGCGTGGGTGAGCGCACCCGCGAGGGCAACGACCTCTACCACGAGATGATCGAGTCCAACGTGAACAAGGACCCGCACGAGAACAATGGCTCGGCGGCCGGTTCCAAGTGCGCCCTGGTCTATGGCCAGATGAACGAGCCGCCCGGCGCCCGCGCCCGCGTGGCCCTCACCGGCCTCACCGTCGCCGAG CATTTCCGCGACCAGGGCCAGGACGTGCTGTTCTTCGTGGACAACATCTTCCGCTTCACCCAGGCGGGCTCCGAGGTGTCGGCGCTTCTCGGCCGCATCCCCTCGGCGGTGGGCTACCAGCCGACGCTGGCCACCGACATGGGCCAGCTGCAGGAGCGCATCACCACCACCACCAAGGGCTCCATCACCTCGGTGCAGGCCATCTACGTGCCGGCGGACGATCTGACCGATCCGGCGCCGGCC GCCTCCTTCGCCCATCTGGACGCCACCACGGTGCTGTCGCGCTCCATCGCGGAGAAGGGCATCTACCCGGCGGTGGATCCGCTGGACTCCACCTCGCGCATGCTGTCTCCCGCCATCCTCGGCGACGAGCACTACAACACCGCGCGCCAGGTGCAGCAGACCCTGCAGCGCTACAAGGCGCTCCAGGACATCATCGCCATCCTGGGCATGGACGAACTCTCCGAAGAGGACAAGCTCACCGTGGCCCGCGCCC GCAAGATCGAGCGCTTCCTCTCCCAGCCCTTCCACGTGGCCGAGGTGTTCACCGGTTCGCCCGGCAAGCTGGTCGACCTCGCCGACACCATCAAGGGCTTCAAGGGCCTGGTGGACGGCAAGTACGACTACCTGCCCGAGCAGGCCTTCTACATGGTGGGCACCATCGAAGAAGCCATCGAGAAGGGCAAGAAGCTGGCGGCCGAGGCGGCCTGA
SEQ ID NO: 33:SEQ ID NO: 33:
Аминокислотная последовательность бета-субъединицы АТФ-синтазы atpD_2Amino acid sequence of the beta subunit of ATP synthase atpD_2
MANKVGRITQIIGAVVDVQFDGHLPAILNAIETTNQGNRLVLEVAQHLGENTVRCIAMDATEGLVRGQEVADTDAPIQVPVGAATLGRIMNVIGEPVDELGPIEGEALRGIHQPAPSYAEQATEAEILVTGIKVVDLLAPYSKGGKVGLFGGAGVGKTVLIMELINNVAKAHGGYSVFAGVGERTREGNDLYHEMIESNVNKDPHENNGSAAGSKCALVYGQMNEPPGARARVALTGLTVAEHFRDQGQDVLFFVDNIFRFTQAGSEVSALLGRIPSAVGYQPTLATDMGQLQERITTTTKGSITSVQAIYVPADDLTDPAPAASFAHLDATTVLSRSIAEKGIYPAVDPLDSTSRMLSPAILGDEHYNTARQVQQTLQRYKALQDIIAILGMDELSEEDKLTVARARKIERFLSQPFHVAEVFTGSPGKLVDLADTIKGFKGLVDGKYDYLPEQAFYMVGTIEEAIEKGKKLAAEAAMANKVGRITQIIGAVVDVQFDGHLPAILNAIETTNQGNRLVLEVAQHLGENTVRCIAMDATEGLVRGQEVADTDAPIQVPVGAATLGRIMNVIGEPVDELGPIEGEALRGIHQPAPSYAEQATEAEILVTGIKVVDLLAPYSKGGKVGLFGGAGVGKTVLIMELINNVAKAHGGYSVFAGVGERTREGNDLYHEMIESNVNK DPHENNGSAAGSKCALVYGQMNEPPGARARVALTGLTVAEHFRDQGQDVLFFVDNIFRFTQAGSEVSALLGRIPSAVGYQPTLATDMGQLQERITTTTKGSITSVQAIYVPADDLTDPAPAASFAHLDATTVLSRSIAEKGIYPAVDPLDSTSRMLSPAILGDEHYNTARQVQQTLQRYKALQDIIAILGMDELSEEDKLTVAR ARKIERFLSQPFHVAEVFTGSPGKLVDLADTIKGFKGLVDGKYDYLPEQAFYMVGTIEEAIEKGKKLAAEAA
SEQ ID NO: 34:SEQ ID NO: 34:
Нуклеотидная последовательность гамма-цепи АТФ-синтазы atpG_2Nucleotide sequence of the gamma chain of ATP synthase atpG_2
ATGGCGAGTCTGAAGGACCTGAGAAACCGCATTGCCTCGGTGAAGGCGACGCAGAAGATCACCAAGGCGATGCAGATGGTCGCCGCGGCGAAGCTGCGTCGCGCCCAGGCGGCGGCTGAAGCGGCCCGTCCCTATGCGGAACGCATGGAGACGGTGCTCGGAAATCTTGCCTCCGGCATGGTGGTGGGCGCGCAGGCGCCTGTTCTCATGACCGGGACGGGCAAGAGCGACACCCACCTGCTGCTGGTGTGCACCGGCGAGCGCGGCCTGTGCGGCGCCTTCAACTCGTCCATCGTGCGCTTCGCCCGCGAGCGGGCGCAGCTGCTGCTGGCCGAGGGCAAGAAGGTGAAAATCCTGTGCGTGGGCCGCAAGGGCCACGAGCAGCTGCGCCGCATCTACCCGGACAACATCATCGACGTGGTGGACCTGCGCGCGGTGCGCAACATCGGCTTCAAGGAGGCCGACGCCATCGCCCGCAAGGTGCTGGCCCTGCTCGATGAAGGCGCATTCGACGTCTGCACGCTCTTCTACTCCCACTTCAGGAGCGTGATCGCCCAGGTGCCGACGGCCCAGCAGCTCATTCCGGCCACCTTCGACGAGCGGCCGGCCGTCGCCGATGCGCCGGTCTATGAATATGAGCCGGAGGAGGAGGAGATCCTCGCCGAGCTGCTGCCGCGCAACGTGGCGGTGCAGATCTTCAAGGCCCTCCTCGAGAACCAGGCTTCTTTCTATGGCTCCCAGATGAGCGCCATGGACAACGCCACGCGCAATGCGGGCGAGATGATCAAGAAGCAGACGCTCACCTACAACCGTACCCGCCAGGCCATGATCACGAAGGAACTCATCGAGATCATCTCCGGCGCCGAGGCCGTCTGAATGGCGAGTCTGAAGGACCTGAGAAACCGCATTGCCTCGGTGAAGGCGACGCAGAAGATCACCAAGGCGATGCAGATGGTCGCCGCGGCGAAGCTGCGTCGCGCCCAGGCGGCGGCTGAAGCGGCCCGTCCCTATGCGGAACGCATGGAGACGGTGCTCGGAAATCTTGCCTCCGGCATGGTGGTGGGGCCGCAGGCGCCTGTTCTCATGACCGGGACGGGCAAGAGCGACACCCACCTGCTG CTGGTGTGCACCGGCGAGCGCGGCCTGTGCGGCGCCTTCAACTCGTCCATCGTGCGCTTCGCCCGCGAGCGGGCGCAGCTGCTGCTGGCCGAGGGCAAGAAGGTGAAAATCCTGTGCGTGGGCCGCAAGGGCCACGAGCAGCTGCGCCGCATCTACCCGGACAACATCATCGACGTGGTGGACCTGCGCGCGGTGCGCAACATCGGCTTCAAGGAGGCCGACGCCATCGCCCGCAAGGTGCTGG CCCTGCTCGATGAAGGCGCATTCGACGTCTGCACGCTCTTCTACTCCCACTTCAGGAGCGTGATCGCCCAGGTGCCGACGGCCCAGCAGCTCATTCCGGCCACCTTCGACGAGCGGCCGGCCGTCGCCGATGCGCCGGTCTATGAATATGAGCCGGAGGAGGAGGAGATCCTCGCCGAGCTGCTGCCGCGCAACGTGGCGGTGCAGATCTTCAAGGCCCTCCTCGAGAACCAGGCTTCTTTC TATGGCTCCCAGATGAGCGCCATGGACAACGCCACGCGCAATGCGGGCGAGATGATCAAGAAGCAGACGCTCACCTACAACCGTACCCGCCAGGCCATGATCACGAAGGAACTCATCGAGATCATCTCCGGCGCCGAGGCCGTCTGA
SEQ ID NO: 35:SEQ ID NO: 35:
Аминокислотная последовательность гамма-цепи АТФ-синтазы atpG_2Amino acid sequence of the gamma chain of ATP synthase atpG_2
MASLKDLRNRIASVKATQKITKAMQMVAAAKLRRAQAAAEAARPYAERMETVLGNLASGMVVGAQAPVLMTGTGKSDTHLLLVCTGERGLCGAFNSSIVRFARERAQLLLAEGKKVKILCVGRKGHEQLRRIYPDNIIDVVDLRAVRNIGFKEADAIARKVLALLDEGAFDVCTLFYSHFRSVIAQVPTAQQLIPATFDERPAVADAPVYEYEPEEEEILAELLPRNVAVQIFKALLENQASFYGSQMSAMDNATRNAGEMIKKQTLTYNRTRQAMITKELIEIISGAEAVMASLKDLRNRIASVKATQKITKAMQMVAAAKLRRAQAAAEAARPYAERMETVLGNLASGMVVGAQAPVLMTGTGKSDTHLLLVCTGERGLCGAFNSSIVRFARERAQLLLAEGKKVKILCVGRKGHEQLRRIYPDNIIDVVDLRAVRNIGFKEADAIARKVLALLDEGAFDVCTLFYSHFRSVIAQVPTAQQLIPATFDERPAVADAPV YEYEPEEEEILAELLPRNVAVQIFKALLENQASFYGSQMSAMDNATRNAGEMIKKQTLTYNRTRQAMITKELIEIISGAEAV
SEQ ID NO: 36:SEQ ID NO: 36:
Нуклеотидная последовательность альфа-субъединицы АТФ-синтазы atpA_2Nucleotide sequence of the alpha subunit of ATP synthase atpA_2
ATGGACATTCGAGCCGCTGAAATCTCTGCCATCCTGAAAGAGCAGATCCAGAATTTCGGCCAGGAGGCGGAAGTCTCCGAGGTGGGTCAGGTTCTGTCCGTGGGTGACGGCATCGCGCGCGTCTACGGCCTCGACAACGTCCAGGCGGGCGAGATGGTCGAGTTCGAGAACGGCACGCGCGGCATGGCGCTGAACCTCGAGCTCGACAATGTCGGCATCGTGATCTTCGGTTCCGACCGCGAGATCAAGGAAGGCCAGACCGTCAAGCGGACCGGCGCCATCGTGGACGCCCCCGTCGGCAAGGGCCTGCTCGGCCGCGTCGTGGACGCTCTCGGCAACCCGATCGACGGCAAGGGCCCGATCATGTTCACCGAGCGTCGCCGGGTCGACGTGAAGGCGCCGGGCATCATCCCGCGCAAGTCGGTGCACGAGCCCATGCAGACCGGCCTGAAGGCCATCGATGCGCTCATCCCCATCGGCCGCGGCCAGCGCGAGCTCATCATCGGCGACCGCCAGACCGGCAAGACCGCCGTGGCGCTCGACTCGATCCTGAACCAGAAGCCCATCAACCAGGGCGACGACGAGAAGGCCAAGCTCTACTGCGTCTATGTCGCGGTGGGCCAGAAGCGTTCCACTGTCGCGCAGTTCGTGAAGGTGCTCGAGGAGCACGGCGCGCTGGAATATTCCATCGTCGTCGCCGCCACCGCCTCGGACGCGGCCCCCATGCAGTTCCTGGCGCCGTTCACCGGCACCGCCATGGGCGAGTATTTCCGCGACAACGGCATGCACGCCCTCATCATCCATGATGACCTGTCCAAGCAGGCCGTGGCCTACCGCCAGATGTCGCTGCTGCTGCGCCGCCCGCCGGGCCGCGAGGCCTATCCCGGCGATGTGTTCTACCTGCACTCCCGCCTCTTGGAGCGCGCCGCCAAGCTCAATGACGAGCACGGCGCCGGCTCGCTGACCGCCCTGCCGGTGATCGAGACCCAGGCCAACGACGTGTCGGCCTACATCCCGACCAACGTGATCTCCATCACCGACGGTCAGATCTTCCTTGAATCCGATCTGTTCTACCAGGGCATCCGCCCGGCGGTGAACGTGGGCCTGTCGGTGTCGCGCGTGGGCTCTTCGGCCCAGATCAAGGCGATGAAGCAGGTGGCCGGCAAGATCAAGGGCGAGCTCGCCCAGTATCGCGAGCTGGCGGCCTTCGCCCAGTTCGGTTCGGACCTGGACGCGGCCACCCAGAAGCTGCTGAACCGCGGCGCCCGCCTCACCGAGCTGCTGAAGCAGAGCCAGTTCTCGCCCCTCAAGGTGGAGGAGCAGGTGGCGGTGATCTATGCCGGCACCAATGGCTATCTCGATCCGCTGCCGGTCTCCAAGGTGCGCGAGTTCGAGCAGGGTCTGCTCCTGTCGCTGCGCTCGCAGCATCCGGAGATCCTGGACGCCATCCGCACGTCCAAGGAGCTTTCCAAGGACACCGCCGAGAAGCTGACGAAGGCCATCGACGCCTTCGCCAAGAGCTTCTCCTGAATGGACATTCGAGCCGCTGAAAATCTCTGCCATCCTGAAAGAGCAGATCCAGAATTTCGGCCAGGAGGCGGAAGTCTCCGAGGTGGGTCAGGTTCTGTCCGTGGGTGACGGCATCGCGCGTCTACGGCCTCGACAACGTCCAGGCGGGCGAGATGGTCGAGTTCGAGAACGGCACGCGCGGCATGGCGCTGAACCTCGAGCTCGACAATGTCGGCATCGTGATCTTCGGTTCCGACCGCGAGA TCAAGGAAGGCCAGACCGTCAAGCGGACCGGCGCCATCGTGGACGCCCCCGTCGGCAAGGGCCTGCTCGGCCGTCGTGGACGCTCTCGGCAACCCGATCGACGGCAAGGGCCCGATCATGTTCACCGAGCGTCGCCGGGTCGACGTGAAGGCGCCGGGCATCATCCCGCGCAAGTCGGTGCACGAGCCCATGCAGACCGGCCTGAAGGCCATCGATGCGCTCATCCCCCATCGCCGCGGCCAGCG CGAGCTCATCATCGGCGACCGCCAGACCGGCAAGACCGCCGTGGCGCTCGACTCGATCCTGAACCAGAAGCCCATCAACCAGGGCGACGACGAGAAGGCCAAGCTCTACTGCGTCTATGTCGCGGTGGGCCAGAAGCGTTCCACTGTCGCGCAGTTCGTGAAGGTGCTCGAGGAGCACGGCGCGCTGGAATATTCCATCGTCGTCGCCGCCACCGCCTCGGACGCGGCCCCCATGCAGTTCCTGGC GCCGTTCACCGGCACCGCCATGGGCGAGTATTTCGCGCGACAACGGCATGCACGCCCTCATCATCCATGATGACCTGTCCAAGCAGGCCGTGGCCTACCGCCAGATGTCGCTGCTGCTGCGCCGCCCGCCGGGCCGCGAGGCCTATCCCGGCGATGTGTTCTACCTGCACTCCCGCCTCTTGGAGCGCGCCGCCAAGCTCAATGACGAGCACGGCGCGGCTCGCTGACCGCCCTGCCGGTGAT CGAGACCCAGGCCAACGACGTGTCGGCCTACATCCCGACCAACGTGATCTCCATCACCGACGGTCAGATCTTCCTTGAATCCGATCTGTTCTACCAGGGCATCCGCCCGGCGGTGAACGTGGGCCTGTCGGTGTCGCGCGTGGGCTCTTCGGCCCAGATCAAGGCGATGAAGCAGGTGGCCGGCAAGATCAAGGGCGAGCTCGCCCAGTATCGCGAGCTGGCGGCCTTCGCCCAGTTCGG TTCGGACCTGGACGCGGCCACCCAGAAGCTGCTGAACCGCGGCGCCCGCCTCACCGAGCTGCTGAAGCAGAGCCAGTTCTCGCCCCTCAAGGTGGAGGAGCAGGTGGCGGTGATCTATGCCGGCACCAATGGCTATCTCGATCCGCTGCCGGTCTCCAAGGTGCGCGAGTTCGAGCAGGGTCTGCTCCTGTCGCTGCGCTCGCAGCATCCGGAGATCCTGGACGCCATCCGCACGTCCAAGGA GCTTTCCAAGGACACCGCCGAGAAGCTGACGAAGGCCATCGACGCCTTCGCCAAGAGCTTCTCCTGA
SEQ ID NO: 37:SEQ ID NO: 37:
Аминокислотная последовательность альфа-субъединицы АТФ-синтазы atpA_2Amino acid sequence of the alpha subunit of ATP synthase atpA_2
MDIRAAEISAILKEQIQNFGQEAEVSEVGQVLSVGDGIARVYGLDNVQAGEMVEFENGTRGMALNLELDNVGIVIFGSDREIKEGQTVKRTGAIVDAPVGKGLLGRVVDALGNPIDGKGPIMFTERRRVDVKAPGIIPRKSVHEPMQTGLKAIDALIPIGRGQRELIIGDRQTGKTAVALDSILNQKPINQGDDEKAKLYCVYVAVGQKRSTVAQFVKVLEEHGALEYSIVVAATASDAAPMQFLAPFTGTAMGEYFRDNGMHALIIHDDLSKQAVAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERAAKLNDEHGAGSLTALPVIETQANDVSAYIPTNVISITDGQIFLESDLFYQGIRPAVNVGLSVSRVGSSAQIKAMKQVAGKIKGELAQYRELAAFAQFGSDLDAATQKLLNRGARLTELLKQSQFSPLKVEEQVAVIYAGTNGYLDPLPVSKVREFEQGLLLSLRSQHPEILDAIRTSKELSKDTAEKLTKAIDAFAKSFSMDIRAAEISAILKEQIQNFGQEAEVSEVGQVLSVGDGIARVYGLDNVQAGEMVEFENGTRGMALNLELDNVGIVIFGSDREIKEGQTVKRTGAIVDAPVGKGLLGRVVDALGNPIDGKGPIMFTERRRVDVKAPGIIPRKSVHEPMQTGLKAIDALIPIGRGQRELIIGDRQTGKTAVALDSILNQKPINQGDDEKA KLYCVYVAVGQKRSTVAQFVKVLEEHGALEYSIVVAATASDAAPMQFLAPFTGTAMGEYFRDNGMHALIIHDDLSKQAVAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERAAKLNDEHGAGSLTALPVIETQANDVSAYIPTNVISITDGQIFLESDLFYQGIRPAVNVGLSVSRVGSSAQIKAMKQVAGKIKGELA QYRELAAFAQFGSDLDAATQKLLNRGARLTELLKQSQFSPLKVEEQVAVIYAGTNGYLDPLPVSKVREFEQGLLLSLRSQHPEILDAIRTSKELSKDTAEKLTKAIDAFAKSFS
SEQ ID NO: 38:SEQ ID NO: 38:
Нуклеотидная последовательность дельта-субъединицы АТФ-синтазы atpHNucleotide sequence of the delta subunit of ATP synthase atpH
GTGGCGGAAACGATCGTGTCAGGCATGGCGGGACGCTATGCGACCGCGCTGTTCGAGCTGGCGGACGAAGCCGGTGCCATCGATTCCGTCCAGGCGGATCTTGATCGCCTGTCCGGCCTTCTGGCCGAGAGCGCGGATCTGGCGCGGCTGGTCAAGAGCCCGGTCTTCACCGCCGAGCAGCAGCTCGGCGCGATGGCGGCCATTCTCGATCAAGCAGGCATTTCCGGCCTTGCGGGCAAATTCGTGAAGCTGGTGGCGCAGAACCGCCGCCTGTTCGCACTGCCGCGCATGATTGCCGAATACGCCGTCCTGGTGGCCCGGAAGAAGGGCGAGACCTCGGCGAGCGTGACCGTTGCCACCCCCCTGAGCGATGAGCATCTGGCCACGCTCAAGGCGGCCCTGGCTGAAAAGACCGGCAAGGACGTGAAGCTCGACGTCACCGTCGATCCGTCCATCCTCGGTGGTCTCATCGTGAAGCTCGGCTCGCGCATGGTCGATGCTTCCCTGAAGACCAAACTCAATTCTATCCGGCATGCGATGAAAGAGGTCCGCTGAGTGGCGGAAACGATCGTGTCAGGCATGGCGGGACGCTATGCGACCGCGCTGTTCGAGCTGGCGGACGAAGCCGGTGCCATCGATTCCGTCCAGGCGGATCTTGATCGCCTGTCCGGCCTTCTGGCCGAGAGCGCGGATCTGGCGCGGCTGGTCAAGAGCCCGGTCTTCACCGCCGAGCAGCAGCTCGGCGCGATGGCGGCCATTCTCGATCAAGCAGGCATTTCCGGCCTTGCGGGCAAATT CGTGAAGCTGGTGGCGCAGAACCGCCGCCTGTTCGCACTGCCGCGCATGATTGCCGAATACGCCGTCCTGGTGGCCCGGAAGAAGGGCGAGACCTCGGCGAGCGTGACCGTTGCCACCCCCCTGAGCGATGAGCATCTGGCCACGCTCAAGGCGGCCCTGGCTGAAAAGACCGGCAAGGACGTGAAGCTCGACGTCACCGTCGATCCGTCCATCCTCGGTGGTCTCATCGTGAAGCTCGGCTCG CGCATGGTCGATGCTTCCCTGAAGACCAAACTCAATTCTATCCGCATGCGATGAAAGAGGTCCGCTGA
SEQ ID NO: 39:SEQ ID NO: 39:
Аминокислотная последовательность дельта-субъединицы АТФ-синтазы atpHAmino acid sequence of the delta subunit of ATP synthase atpH
MAETIVSGMAGRYATALFELADEAGAIDSVQADLDRLSGLLAESADLARLVKSPVFTAEQQLGAMAAILDQAGISGLAGKFVKLVAQNRRLFALPRMIAEYAVLVARKKGETSASVTVATPLSDEHLATLKAALAEKTGKDVKLDVTVDPSILGGLIVKLGSRMVDASLKTKLNSIRHAMKEVRMAETIVSGMAGRYATALFELADEAGAIDSVQADLDRLSGLLAESADLARLVKSPVFTAEQQLGAMAAILDQAGISGLAGKFVKLVAQNRRLFALPRMIAEYAVLVARKKGETSASVTVATPLSDEHLATLKAALAEKTGKDVKLDVTVDPSILGGLIVKLGSRMVDASLKTKLNSIRHAMKEVR
SEQ ID NO: 40:SEQ ID NO: 40:
Нуклеотидная последовательность субъединицы b АТФ-синтазы atpF_2Nucleotide sequence of subunit b of ATP synthase atpF_2
ATGACCGAAATGGAACTGGCTGAGCTCTGGGTCGCCATCGCCTTCCTGGTTTTCGTAGGCCTCCTGATCTATGCGGGCGCCCACCGCGCCATCGTCTCCGCCCTGGATTCCCGCGGCTCGCGCATCGCCTCGGAACTGGAGGAGGCCCGTCGGCTCAAGGAAGAGGCCCAGAAGCTGGTGGCCGAATTCAAGCGCAAGCAGCGCGAGGCCGAGGCCGAGGCCGAATCCATCGTCACCGGCGCCAAGGCCGAGGCCGAGCGCCTCGCCGCCGAGGCCAAGGCGAAGATCGAGGATTTCGTCACCCGCCGCACCAAGATGGCCGAGGACAAGATCGCCCAGGCCGAGCATCAGGCTCTGGCGGACGTGAAGTCCATCGCCGCCGAGGCGGCGGCCAAGGCGGCCGAGGTGATCCTCGGCGCCCAGGCCACCGGCGCGGTGGCGGAGCGTCTGCTGTCGGGCGCCATCTCCGAGGTCAAGACCAAGCTCAACTGAATGACCGAAATGGAACTGGCTGAGCTCTGGGTCGCCATCGCCTTCCTGGTTTTCGTAGGCCTCCTGATCTATGCGGGCGCCCACCGCGCCATCGTCTCCGCCCTGGATTCCCGCGGCTCGCGCATCGCCTCGGAACTGGAGGAGGCCCGTCGGCTCAAGGAAGAGGCCCAGAAGCTGGTGGCCGAATTCAAGCGCAAGCAGCGCGAGGCCGAGGCCGAGGCCGAATCCATCGTCACCGGCGCCAAGGCCGAGGCCGAGCGCCTCGCCGCCGAGGCCAAGGCGAAGATCGAGGATTTCGTCACCCGCCGCACCAAGATGGCCGAGGACAAGATCGCCCAGGCCGAGCATCAGGCTCTGGCGGACGTGAAGTCCATCGCCGCCGAGGCGGCGGCCAAGGCGGCCGAGGTGATCCTCGGCGCCCAGGCCACCGGCGCGGTGGCGGAGCGTCTGCTGTCGGGCGCCATCTCCGAGGTCAAGACCAAGCTCAACTGA
SEQ ID NO: 41:SEQ ID NO: 41:
Аминокислотная последовательность субъединицы b АТФ-синтазы atpF_2Amino acid sequence of subunit b of ATP synthase atpF_2
MTEMELAELWVAIAFLVFVGLLIYAGAHRAIVSALDSRGSRIASELEEARRLKEEAQKLVAEFKRKQREAEAEAESIVTGAKAEAERLAAEAKAKIEDFVTRRTKMAEDKIAQAEHQALADVKSIAAEAAAKAAEVILGAQATGAVAERLLSGAISEVKTKLNMTEMELAELWVAIAFLVFVGLLIYAGAHRAIVSALDSRGSRIASELEEARRLKEEAQKLVAEFKRKQREAEAEAESIVTGAKAEAERLAAEAKAKIEDFVTRRTKMAEDKIAQAEHQALADVKSIAAEAAAKAAEVILGAQATGAVAERLLSGAISEVKTKLN
SEQ ID NO: 42:SEQ ID NO:42:
Нуклеотидная последовательность субъединицы b' АТФ-синтазы atpG_3Nucleotide sequence of subunit b' of ATP synthase atpG_3
ATGATGATTGCATGGAAGCGGACCTTCGCAGTCGTGACCTTCGGGGCCGCCCTGATGGCCATGCCCGTCGCGGGCGTGGTCGCAGCTGAGACTTCTCCCGCTCCGGCGGCAGTGGCGCAGGCCGATCATGCGGTGCCCACCGAGGCGGCCGGCCAGGGCACCGCCGATGCGGCCCATGCCGCCGCGCCGGGCGAGGCCGCCCATGGTGGCGCGGCCAAGCACGAAACCCATTTCCCGCCCTTCGACGGCACCACCTTCGCCTCCCAGTTGCTGTGGCTCGCCGTCACCTTCGGCCTGCTTTACTACCTCATGAGCAAGGTCACGCTGCCGCGCATCGGCCGCATCCTGGAAGAGCGCCACGACCGCATCGCCGATGATCTGGAGGAAGCCTCCAAGCATCGCGCCGAGAGCGAGGCCGCCCAGCGGGCCTATGAGAAGGCGCTGAGCGAGGCCCGCGCGAAGGCCCATTCCATCGCCGCGGAAACCCGCGACCGCCTTGCCGCCCACGCCGACACCAACCGCAAGGCGCTGGAGAGCGAGCTCACCGCCAAGCTGCAGGCGGCCGAGGAGCGCATCGCCACCACCAAGAGCGAAGCCCTCACCCATGTGCGCGGCATCGCGGTGGACGCCACCCAATCCATCGTCTCCACCCTCATCGGTGTCGCGCCCGCGGCGGCCGACGTGGAAAAAGCGGTGGACGGCGCCCTGTCCCAGCACGGCCAGGCCTGAATGATGATTGCATGGAAGCGGACCTTCGCAGTCGTGACCTTCGGGGCCGCCCTGATGGCCATGCCCGTCGCGGGCGTGGTCGCAGCTGAGACTTCTCCCGCTCCGGCGGCAGTGGCGCAGGCCGATCATGCGGTGCCCACCGAGGCGGCCGGCCAGGGCACCGCCGATGCGGCCCATGCCGCCGCGCCGGGCGAGGCCGCCCATGGTGGCGCGCCAAGCACGAAACCCATTTCCCGCCCTTC GACGGCACCACCTTCGCCTCCCAGTTGCTGTGGCTCGCCGTCACCTTCGGCCTGCTTTACTACCTCATGAGCAAGGTCACGCTGCCGCGCATCGGCCGCATCCTGGAAGAGCGCCACGACCGCATCGCCGATGATCTGGAGGAAGCCTCCAAGCATCGCGCCGAGAGCGAGGCCGCCCAGCGGGCCTATGAGAAGGCGCTGAGCGAGGCCCGCGCGAAGGCCCATTCCATCGCCGCGGAAACCCGCGACCGCC TTGCCGCCCACGCCGACACCAACCGCAAGGCGCTGGAGAGCGAGCTCACCGCCAAGCTGCAGGCGGCCGAGGAGCGCATCGCCACCACCAAGAGCGAAGCCCTCACCCATGTGCGCGGCATCGCGGTGGACGCCACCCAATCCATCGTCTCCACCCTCATCGGTGTCGCGCCCGCGGCGGCCGACGTGGAAAAAGCGGTGGACGGCGCCCTGTCCCAGCACGGCCCAGGCCTGA
SEQ ID NO: 43:SEQ ID NO:43:
Аминокислотная последовательность субъединицы b' АТФ-синтазы atpG_3Amino acid sequence of subunit b' of ATP synthase atpG_3
MMIAWKRTFAVVTFGAALMAMPVAGVVAAETSPAPAAVAQADHAVPTEAAGQGTADAAHAAAPGEAAHGGAAKHETHFPPFDGTTFASQLLWLAVTFGLLYYLMSKVTLPRIGRILEERHDRIADDLEEASKHRAESEAAQRAYEKALSEARAKAHSIAAETRDRLAAHADTNRKALESELTAKLQAAEERIATTKSEALTHVRGIAVDATQSIVSTLIGVAPAAADVEKAVDGALSQHGQAMMIAWKRTFAVVTFGAALMAMPVAGVVAAETSPAPAAVAQADHAVPTEAAGQGTADAAHAAAPGEAAHGGAAKHETHFPPFDGTTFASQLLWLAVTFGLLYYLMSKVTLPRIGRILEERHDRIADDLEEASKHRAESEAAQRAYEKALSEARAKAHSIAAETRDRLAAHADTNRKALESELTAKLQAAEERIATTKSEALTHVRGIAVDATQ SIVSTLIGVAPAAADVEKAVDGALSQHGQA
SEQ ID NO: 44:SEQ ID NO:44:
Нуклеотидная последовательность субъединицы c АТФ-синтазы atpE_2Nucleotide sequence of subunit c of ATP synthase atpE_2
ATGGAAGCGGAAGCTGGAAAGTTCATCGGTGCCGGCCTCGCCTGCCTCGGCATGGGTCTCGCTGGCGTCGGCGTCGGTAACATCTTCGGTAACTTCCTCTCCGGCGCCCTGCGCAACCCGTCCGCTGCCGACGGCCAGTTCGCCCGCGCCTTCATCGGCGCCGCCCTCGCGGAAGGTCTCGGCATCTTCTCGCTGGTCGTTGCGCTCGTCCTGCTGTTCGTGGCCTGAATGGAAGCGGAAGCTGGAAAGTTCATCGGTGCCGGCCTCGCCTGCCTCGGCATGGGTCTCGCTGGCGTCGGCGTCGGTAACATCTTCGGTAACTTCCTCTCCGGCGCCCTGCGCAACCCGTCCGCTGCCGACGGCCAGTTCGCCCGCGCCTTCATCGGCGCCGCCCTCGCGGAAGGTCTCGGCATCTTCTCGCTGGTCGTTGCGCTCGTCCTGCTGTTCGTGGCCTGA
SEQ ID NO: 45:SEQ ID NO: 45:
Аминокислотная последовательность субъединицы c АТФ-синтазы c atpE_2Amino acid sequence of subunit c of ATP synthase c atpE_2
MEAEAGKFIGAGLACLGMGLAGVGVGNIFGNFLSGALRNPSAADGQFARAFIGAALAEGLGIFSLVVALVLLFVAMEAEAGKFIGAGLACLGMGLAGVGVGNIFGNFLSGALRNPSAADGQFARAFIGAALAEGLGIFSLVVALVLLFVA
SEQ ID NO: 46:SEQ ID NO:46:
Нуклеотидная последовательность субъединицы a АТФ-синтазы atpB_2Nucleotide sequence of subunit a of ATP synthase atpB_2
ATGACCGTCGATCCGATCCACCAGTTCGAGATCAAGCGCTACGTGGATCTGCTGAACGTCGGCGGTGTCCAGTTCTCCTTCACCAACGCAACGGTGTTCATGATTGGCATCGTCCTGGTGATTTTCTTCTTCCTGACTTTCGCGACACGCGGTCGCACCCTTGTGCCGGGCCGGATGCAGTCGGCGGCGGAGCTGAGCTACGAGTTCATCGCCAAGATGGTGCGCGACGCGGCCGGCAGCGAGGGAATGGTGTTCTTTCCCTTCGTCTTCTCGCTCTTCATGTTCGTGCTGGTGGCGAACGTATTGGGGCTCATCCCCTACACCTTCACGGTGACCGCCCACCTCATCGTCACCGCCGCCCTGGCGGCGACGGTGATCCTCACCGTCATCATCTACGGCTTCGTGCGGCACGGCACCCACTTCCTGCACCTGTTCGTGCCGTCGGGCGTGCCGGGCTTCCTCCTGCCCTTCCTCGTGGTGATCGAGGTGGTGTCGTTCCTGTCGCGGCCCATCAGCCTCTCGCTGCGTCTGTTCGCCAACATGCTGGCGGGCCACATCGCCCTCAAGGTGTTCGCCTTCTTCGTCGTGGGACTGGCCTCGGCCGGCGCGATCGGCTGGTTCGGCGCCACCCTGCCCTTCTTCATGATCGTGGCGCTCACCGCGCTGGAGCTGCTGGTGGCGGTGCTGCAGGCCTACGTGTTCGCGGTGCTGACCTCGATCTACCTCAACGACGCCATCCATCCCGGCCACTGAATGACCGTCGATCCGATCCACCAGTTCGAGATCAAGCGCTACGTGGATCTGCTGAACGTCGGCGGTGTCCAGTTCTCCTTCACCAACGCAACGGTGTTCATGATTGGCATCGTCCTGGTGATTTTCTTCTTCCTGACTTTCGCGACACGCGGTCGCACCCTTGTGCCGGGCCGGATGCAGTCGGCGGCGGAGCTGAGCTACGAGTTCATCGCCAAGATGGTGCGCGACGCGGCCGGCA GCGAGGGAATGGTGTTCTTTCCCTTCGTCTTTCTCGCTCTTCATGTTCGTGCTGGTGGCGAACGTATTGGGGCTCATCCCCTACACCTTCACGGTGACCGCCCACCTCATCGTCACCGCCGCCCTGGCGGCGACGGTGATCCTCACCGTCATCATCTACGGCTTCGTGCGGCACGGCACCCACTTCCTGCACCTGTTCGTGCCGTCGGGCGTGCCGGGCTTCCTCCTGCCCTTCCTCGTG GTGATCGAGGTGGTGTCGTTCCTGTCGCGGCCCATCAGCCTCTCGCTGCGTCTGTTCGCCAACATGCTGGCGGGCCACATCGCCCTCAAGGTGTTCGCCTTCTTCGTCGTGGGACTGGCCTCGGCCGGCGCGATCGGCTGGTTCGGCGCCACCCTGCCCTTCTTCATGATCGTGGCGCTCACCGCGCTGGAGCTGCTGGTGGCGGTGCTGCAGGCCTACGTGTTCGCGGTGCTGACC TCGATCTACCTCAACGACGCCATCCATCCCGGCCACTGA
SEQ ID NO: 47:SEQ ID NO:47:
Аминокислотная последовательность субъединицы a АТФ-синтазы atpB_2Amino acid sequence of subunit a of ATP synthase atpB_2
MTVDPIHQFEIKRYVDLLNVGGVQFSFTNATVFMIGIVLVIFFFLTFATRGRTLVPGRMQSAAELSYEFIAKMVRDAAGSEGMVFFPFVFSLFMFVLVANVLGLIPYTFTVTAHLIVTAALAATVILTVIIYGFVRHGTHFLHLFVPSGVPGFLLPFLVVIEVVSFLSRPISLSLRLFANMLAGHIALKVFAFFVVGLASAGAIGWFGATLPFFMIVALTALELLVAVLQAYVFAVLTSIYLNDAIHPGHMTVDPIHQFEIKRYVDLLNVGGVQFSFTNATVFMIGIVLVIFFFLTFATRGRTLVPGRMQSAAELSYEFIAKMVRDAAGSEGMVFFPFVFSLFMFVLVANVLGLIPYTFTVTAHLIVTAALAATVILTVIIYGFVRHGTHFLHLFVPSGVPGFLLPFLVVIEVVSFLSRPISLSLRLFANMLAGHIALKVFAFFVVGLASAGAIGWFGATLP FFMIVALTALELLVAVLQAYVFAVLTSIYLNDAIHPGH
SEQ ID NO: 48:SEQ ID NO:48:
Нуклеотидная последовательность белка I АТФ-синтазы atpINucleotide sequence of ATP synthase protein I atpI
ATGTCCGAGCCGAATGATCCATCCCGCAGGGACGGTGCGAAGGCGAAAGACGAGACGCAGGACTCCCGGCCCGGTGAGGCGGATCTTGCTCGGCGCCTCGATGCGCTCGGCACCTCCATCGGTCAGGTCAAGTCCAGAAGCGGGGAGCCCGCGGCGACGCCGCGCAAGGACACCTCCTCGGCCTCCGGCGCGGCCCTGGCGTTTCGGCTGGGCGCCGAGTTTGTTTCAGGCGTGCTGGTGGGCTCGCTCATCGGCTACGGGTTGGATTATGCGTTTGCGATTTCGCCCTGGGGGCTGATCGCCTTCACGCTGATCGGCTTTGCCGCCGGCGTCCTGAACATGCTGCGCGTGGCGAACAGCGATGCCAAGCGCCACAGCGCGGACAGGTGAATGTCCGAGCCGAATGATCCATCCCGCAGGGACGGTGCGAAGGCGAAAGACGAGACGCAGGACTCCCGGCCCGGTGAGGCGGATCTTGCTCGGCGCCTCGATGCGCTCGGCACCTCCATCGGTCAGGTCAAGTCCAGAAGCGGGGAGCCCGCGGCGACGCCGCGCAAGGACACCTCCTCGGCCTCCGGCGCGCGGCCCTGGCGTTTCGGCTGGGCGCCGAGTTTGTTTCAGGCGTGCTGGTG GGCTCGCTCATCGGCTACGGGTTGGATTATGCGTTTGCGATTTCGCCCTGGGGGCTGATCGCCTTCACGCTGATCGGCTTTGCCGCCGGCGTCCTGAACATGCTGCGCGTGGCGAACAGCGATGCCAAGCGCCACAGCGCGGACAGGTGA
SEQ ID NO: 49:SEQ ID NO:49:
Аминокислотная последовательность белка I АТФ-синтазы atpIAmino acid sequence of ATP synthase protein I atpI
MSEPNDPSRRDGAKAKDETQDSRPGEADLARRLDALGTSIGQVKSRSGEPAATPRKDTSSASGAALAFRLGAEFVSGVLVGSLIGYGLDYAFAISPWGLIAFTLIGFAAGVLNMLRVANSDAKRHSADRMSEPNDPSRRDGAKAKDETQDSRPGEADLARRLDALGTSIGQVKSRSGEPAATPRKDTSSASGAALAFRLGAEFVSGVLVGSLIGYGLDYAFAISPWGLIAFTLIGFAAGVLNMLRVANSDAKRHSADR
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий альфа-цепь молибден-железо белка нитрогеназы nifD_1, имеющую последовательность, указанную в SEQ ID NO:51, или последовательность, имеющую более 60% идентичности, например более 70% идентичности, например более 92% идентичности, например более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:51.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the molybdenum-iron alpha chain of the nitrogenase protein nifD_1 having the sequence set forth in SEQ ID NO:51, or a sequence having greater than 60% identity, e.g. 70% identity, for example more than 92% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:51 .
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий альфа-цепь молибден-железо белка нитрогеназы nifD_2, имеющую последовательность, указанную в SEQ ID NO:53, или последовательность, имеющую более 60% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:53.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the molybdenum-iron alpha chain of the nitrogenase protein nifD_2 having the sequence specified in SEQ ID NO:53, or a sequence having more than 60% identity, for example more 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:53.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий бета-цепь молибден-железо белка нитрогеназы nifK_1, имеющую последовательность, указанную в SEQ ID NO:55, или последовательность, имеющую более 87% идентичности, например более 90% идентичности, например, более 95% идентичности, например более 96% идентичности, например, более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:55.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the beta chain of the molybdenum-iron nitrogenase protein nifK_1 having the sequence specified in SEQ ID NO:55, or a sequence having more than 87% identity, for example more 90% identity, for example more than 95% identity, for example more than 96% identity, for example more than 97% identity, for example more than 98%, for example more than 99% sequence identity with the sequence presented in SEQ ID NO:55.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий бета-цепь молибден-железо белка нитрогеназы nifK_2, имеющую последовательность, указанную в SEQ ID NO: 57, или последовательность, имеющую более 95% идентичности, например более 96% идентичности, например более 97% идентичности, например более 98%, например более 99% идентичности по последовательности с последовательностью, представленной в SEQ ID NO:57.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the molybdenum-iron beta chain of the nifK_2 nitrogenase protein having the sequence set forth in SEQ ID NO: 57, or a sequence having greater than 95% identity, e.g. 96% identity, eg greater than 97% identity, eg greater than 98%, eg greater than 99% sequence identity to the sequence shown in SEQ ID NO:57.
В другом варианте осуществления изобретения бактериальный штамм, используемый в способе настоящего изобретения, содержит ген, кодирующий железо белок нитрогеназы nifH, имеющий последовательность, указанную в SEQ ID NO: 59, или последовательность, имеющую более 98,5% идентичности по последовательности с последовательностью, указанной в SEQ ID NO: 59.In another embodiment, the bacterial strain used in the method of the present invention contains a gene encoding the iron nitrogenase protein nifH having the sequence set forth in SEQ ID NO: 59, or a sequence having greater than 98.5% sequence identity to the sequence set forth in SEQ ID NO: 59.
SEQ ID NO: 50:SEQ ID NO: 50:
Нуклеотидная последовательность альфа-цепи молибден-железо белка нитрогеназы nifD_1Nucleotide sequence of the molybdenum-iron alpha chain of the nitrogenase protein nifD_1
ATGAGTTCGCTCTCCGCCACTATTCAACAGGTCTTCAACGAGCCGGGCTGCGCGAAGAACCAGAATAAGTCCGAGGCGGAGAAGAAGAAGGGCTGCACCAAGCAGCTGCAACCCGGCGGAGCGGCCGGCGGCTGCGCGTTCGACGGCGCGAAGATCGCGCTCCAGCCCTTGACCGACGTCGCCCACCTGGTGCACGGCCCCATCGCCTGCGAAGGCAATTCCTGGGACAATCGTGGCGCCAAGTCCTCCGGCTCGAACATCTGGCGCACCGGCTTCACCACGGACATCAACGAAACCGACGTGGTGTTCGGCGGCGAGAAGCGTCTGTTCAAGTCCATCAAGGAAATCATCGAGAAGTACGACCCGCCGGCCGTCTTCGTCTATCAGACCTGCGTCCCCGCCATGATCGGCGACGACATCGACGCGGTGTGCAAGGCGGCCAGGGAGAAGTTCGGAAAGCCGGTGATCCCGATCAATTCCCCCGGCTTCGTGGGGCCGAAGAATCTCGGCAACAAGCTCGCCGGCGAGGCGCTCCTCGACCATGTGATCGGCACCGAGGAGCCCGATTACACGACGGCCTACGACATCAACATCATCGGCGAATACAATCTCTCCGGCGAGTTGTGGCAGGTGAAGCCGCTGCTGGACGAGCTGGGCATCCGCATCCTCGCCTGCATCTCCGGCGACGGGAAGTACAAGGATGTGGCGTCCTCCCACCGCGCCAAGGCGGCGATGATGGTGTGCTCCAAGGCCATGATCAACGTGGCCCGCAAGATGGAGGAGCGCTACGACATCCCCTTCTTCGAAGGCTCCTTCTACGGCATCGAGGATAGCTCCGATTCCCTGCGCGAGATTGCGCGCATGCTCATCGAGAAGGGCGCCGATCCGGAGCTGATGGACCGCACCGAGGCGCTGATTGAGCGGGAAGAGAAGAAGGCGTGGGACGCCATCGCCGCCTACAAGCCCCGCTTCAAGGACAAGAAGGTGCTGCTCATCACCGGCGGCGTGAAATCCTGGTCGGTGGTGGCAGCGCTCCAGGAAGCCGGCCTCGAACTGGTGGGCACCTCGGTGAAGAAGTCCACCAAGGAGGACAAGGAGCGCATCAAGGAACTGATGGGCCAGGACGCCCACATGATCGACGACATGACGCCCCGCGAAATGTACAAGATGCTGAAGGACGCCAAGGCGGACATCATGCTCTCGGGCGGGCGCTCGCAATTCATCGCGCTCAAGGCCGCCATGCCCTGGCTCGACATCAACCAGGAGCGCCACCACGCCTATATGGGCTATGTGGGCATGGTGAAGCTGGTCGAGGAGATCGACAAGGCGCTCTACAATCCCGTGTGGGAACAGGTGCGCAAGCCCGCCCCGTGGGAAAATCCGGAAGACACCTGGCAGGCCCGTGCGCTCGCCGAAATGGAGGCGGAGGCCGCCGCGCTCGCCGCCGATCCGGTGCGCGCGGAAGAGGTGCGCCGGTCCAAGAAGATCTGCAATTGCAAGAGCGTCGACCTCGGAACCATTGAGGACGCCATCAAGGCTCACGCGCTGACCACCGTGGAGGGTGTGCGAGAGCACACCAATGCCTCGGGAGGCTGCGGAGCCTGCAGCGGGCGGATCGAGGAGATCTTCGAGGCCGTGGGCGTTGTCGCCGCCCCGCCTCCCGCGGAGGCCGCCCCGTCTCCGCAGGAGATCGCGCCCGATCCGCTCGCTGCGGAGGAAAAGCGCCGCGCCAAGAAGGCCTGCGGCTGCAAGGAGGTAGCGGTCGGCACCATTGAGGATGCCATCCGCGCCAAGGGTCTGCGAAACATCGCGGAGGTGCGTGCGGCCACCGATGCCAACACCGGCTGCGGCAATTGCCAGGAGCGGGTGGAGGGCATCCTCGACCGGGTTCTCGCCGAGGCGGCCTCAGAACTCCAGGCGGCGGAATAGATGAGTTCGCTCTCCGCCACTATTCAACAGGTCTTCAACGAGCCGGGCTGCGCGAAGAACCAGAATAAGTCCGAGGCGGAGAAGAAGAAGGGCTGCACCAAGCAGCTGCAACCCGGCGGAGCGGCCGGCGGCTGCGCGTTCGACGGCGCGAAGATCGCGCTCCAGCCCTTGACCGACGTCGCCCACCTGGTGCACGGCCCCATCGCCTGCGAAGGCAATTCCTGGGACAATCGTGGCGCCAAG TCCTCCGGCTCGAACATCTGGCGCACCGGCTTCACCACGGACATCAACGAAACCGACGTGGTGTTCGGCGGCGAGAAGCGTCTGTTCAAGTCCATCAAGGAAATCATCGAGAAGTACGACCCGCCGGCCGTCTTCGTCTATCAGACCTGCGTCCCCCGCCATGATCGGCGACGACATCGACGCGGTGTGCAAGGCGGCCAGGAGAAGTTCGGAAAGCCGGTGATCCCGATCAATTCCCCCGGCT TCGTGGGGCCGAAGAATCTCGGCAACAAGCTCGCCGGCGAGGCGCTCCTCGACCATGTGATCGGCACCGAGGAGCCCGATTACACGACGGCCTACGACATCAACATCATCGGCGAATACAATCTCTCCGGCGAGTTGTGGCAGGTGAAGCCGCTGCTGGACGAGCTGGGCATCCGCATCCTCGCCTGCATCTCCGGCGACGGGAAGTACAAGGATGTGGCGTCCTCCCACCGCGCCAAGGCGGC GATGATGGTGTGCTCCAAGGCCATGATCAACGTGGCCCGCAAGATGGAGGAGCGCTACGACATCCCCTTCTTCGAAGGCTCCTTCTACGGCATCGAGGATAGCTCCGATTCCCTGCGCGAGATTGCGCGCATGCTCATCGAGAAGGGCGCCGATCCGGAGCTGATGGACCGCACCGAGGCGCTGATTGAGCGGGAAGAGAAGAAGGCGTGGGACGCCATCGCCGCCTACAAGCCCCGCTTCAAGGACA AGAAGGTGCTGCTCATCACCGGCGGCGTGAAATCCTGGTCGGTGGTGGCAGCGCTCCAGGAAGCCGGCCTCGAACTGGTGGGCACCTCGGTGAAGAAGTCCACCAAGGAGGACAAGGAGCGCATCAAGGAACTGATGGGCCAGGACGCCCACATGATCGACGACATGACGCCCCGCGAAATGTACAAGATGCTGAAGGACGCCAAGGCGGACATCATGCTCGGGCGGGCGCTCGCAATTCATCGCGCT CAAGGCCGCCATGCCCTGGCTCGACATCAACCAGGAGCGCCACCACGCCTATATGGGCTATGTGGGCATGGTGAAGCTGGTCGAGGAGATCGACAAGGCGCTCTACAATCCCGTGTGGGAACAGGTGCGCAAGCCCGCCCCGTGGGAAAATCCGGAAGACACCTGGCAGGCCCGTGCGCTCGCCGAAATGGAGGCGGAGGCCGCCGCGCTCGCCGCCGATCCGGTGCGCGCGGAAGAGGTGCG CCGGTCCAAGAAGATCTGCAATTGCAAGAGCGTCGACCTCGGAACCATTGAGGACGCCATCAAGGCTCACGCGCTGACCACCGTGGAGGGTGTGCGAGAGCACACCAATGCCTCGGGAGGCTGCGGAGCCTGCAGCGGGCGGATCGAGGAGATCTTCGAGGCCGTGGGCGTTGTCGCCGCCCCGCCTCCCGCGGAGGCCGCCCCGTCTCCGCAGGAGATCGCGCCCGATCCGCTCGCTGCGGA GGAAAAGCGCCGCCAAGAAGGCCTGCGGCTGCAAGGAGGTAGCGGTCGGCACCATTGAGGATGCCATCCGCGCCAAGGGTCTGCGAAACATCGCGGAGGTGCGTGCGGCCACCGATGCCAACACCGGCTGCGGCAATTGCCAGGAGCGGGTGGAGGGCATCCTCGACCGGGTTCTCGCCGAGGCGGCCTCAGAACTCCAGGCGGCGGAATAG
SEQ ID NO: 51:SEQ ID NO:51:
Аминокислотная последовательность альфа-цепи молибден-железо белка нитрогеназы nifD_1Amino acid sequence of the molybdenum-iron alpha chain of the nitrogenase protein nifD_1
MSSLSATIQQVFNEPGCAKNQNKSEAEKKKGCTKQLQPGGAAGGCAFDGAKIALQPLTDVAHLVHGPIACEGNSWDNRGAKSSGSNIWRTGFTTDINETDVVFGGEKRLFKSIKEIIEKYDPPAVFVYQTCVPAMIGDDIDAVCKAAREKFGKPVIPINSPGFVGPKNLGNKLAGEALLDHVIGTEEPDYTTAYDINIIGEYNLSGELWQVKPLLDELGIRILACISGDGKYKDVASSHRAKAAMMVCSKAMINVARKMEERYDIPFFEGSFYGIEDSSDSLREIARMLIEKGADPELMDRTEALIEREEKKAWDAIAAYKPRFKDKKVLLITGGVKSWSVVAALQEAGLELVGTSVKKSTKEDKERIKELMGQDAHMIDDMTPREMYKMLKDAKADIMLSGGRSQFIALKAAMPWLDINQERHHAYMGYVGMVKLVEEIDKALYNPVWEQVRKPAPWENPEDTWQARALAEMEAEAAALAADPVRAEEVRRSKKICNCKSVDLGTIEDAIKAHALTTVEGVREHTNASGGCGACSGRIEEIFEAVGVVAAPPPAEAAPSPQEIAPDPLAAEEKRRAKKACGCKEVAVGTIEDAIRAKGLRNIAEVRAATDANTGCGNCQERVEGILDRVLAEAASELQAAEMSSLSATIQQVFNEPGCAKNQNKSEAEKKKGCTKQLQPGGAAGGCAFDGAKIALQPLTDVAHLVHGPIACEGNSWDNRGAKSSGSNIWRTGFTTDINETDVVFGGEKRLFKSIKEIIEKYDPPAVFVYQTCVPAMIGDDIDAVCKAAREKFGKPVIPINSPGFVGPKNLGNKLAGEALLDHVIGTEEPDYTTAYDINIIG EYNLSGELWQVKPLLDELGIRILACISGDGKYKDVASSHRAKAAMMVCSKAMINVARKMEERYDIPFFEGSFYGIEDSSDSLREIARMLIEKGADPELMDRTEALIEREEKKAWDAIAAYKPRFKDKKVLLITGGVKSWSVVAALQEAGLELVGTSVKKSTKEDKERIKELMGQDAHMIDDMTPREMYKMLKDAKADIMLSGGRSQ FIALKAAMPWLDINQERHHAYMGYVGMVKLVEEIDKALYNPVWEQVRKPAPWENPEDTWQARALAEMEAEAAALAADPVRAEEVRRSKKICNCKSVDLGTIEDAIKAHALTTVEGVREHTNASGGCGACSGRIEEIFEAVGVVAAPPPAEAAPSPQEIAPDPLAAEEKRRAKKACGCKEVAVGTIEDAIRAKGLRNIAEVRAATDANTGCG NCQERVEGILDRVLAEAASELQAAE
SEQ ID NO: 52:SEQ ID NO:52:
Нуклеотидная последовательность альфа-цепи молибден-железо белка нитрогеназы nifD_2Nucleotide sequence of the molybdenum-iron alpha chain of the nitrogenase protein nifD_2
ATGAGTGTCGCACAGTCCCAGAGCGTCGCCGAGATCAAGGCGCGCAACAAGGAACTCATCGAAGAGGTCCTCAAGGTCTATCCCGAGAAGACCGCCAAGCGCCGCGCCAAGCACCTGAACGTCCACGAAGCCGGCAAGTCCGACTGCGGCGTGAAGTCCAACATCAAGTCCATCCCGGGCGTGATGACCATCCGCGGTTGCGCTTATGCCGGCTCCAAGGGTGTGGTGTGGGGTCCCATCAAGGACATGATCCACATCTCCCACGGCCCGGTGGGCTGCGGCCAGTATAGCTGGGCCGCCCGCCGCAACTACTATATCGGCACGACCGGCATCGACACCTTCGTGACGATGCAGTTCACCTCCGACTTCCAGGAGAAGGACATCGTCTTCGGCGGCGACAAGAAGCTCGCCAAGATCATGGACGAGATCCAGGAGCTGTTCCCGCTGAACAACGGCATCACCGTTCAGTCCGAGTGCCCCATCGGCCTCATCGGCGACGACATCGAGGCCGTCTCCAAGCAGAAGTCCAAGGAGTATGAGGGCAAGACCATCGTGCCGGTGCGCTGCGAGGGCTTCCGCGGCGTGTCCCAGTCCCTGGGCCACCACATCGCCAACGACGCCATCCGCGATTGGGTGTTCGACAAGATCGCGCCCGACGCCGAGCCGCGCTTTGAGCCGACCCCGTACGACGTCGCCATCATCGGCGACTACAATATCGGTGGTGACGCCTGGTCGTCCCGTATCCTCCTGGAGGAGATGGGCCTGCGCGTGATCGCCCAGTGGTCCGGCGACGGTTCGCTCGCTGAGCTGGAGGCCACCCCGAAGGCCAAGCTCAACGTGCTGCACTGCTACCGCTCCATGAACTACATCTCGCGCCACATGGAAGAGAAGTACGGTATCCCGTGGTGCGAGTACAACTTCTTCGGTCCTTCCAAGATCGCCGAGTCCCTGCGCAAGATCGCCAGCTACTTCGACGACAAGATCAAGGAAGGCGCGGAGCGCGTCATCGCCAAGTATCAGCCGCTCATGGATGCGGTGATCGCGAAGTATCGTCCCCGCCTCGAGGGCAAGACCGTGATGCTGTACGTGGGCGGCCTGCGTCCCCGTCACGTCATCGGCGCCTACGAGGACCTGGGCATGGAAGTGGTCGGCACGGGCTACGAGTTCGCCCATAACGACGACTACCAGCGCACCGCCCAGCACTACGTCAAGGATGGCACCATCATCTATGACGACGTGACCGGCTACGAGTTCGAGAAGTTCGTCGAGAAGATCCAGCCGGACCTGGTCGGTTCGGGCATCAAGGAAAAGTACGTCTTCCAGAAGATGGGCGTGCCGTTCCGCCAGATGCACTCCTGGGACTACTCGGGCCCGTACCACGGCTATGACGGCTTCGCGATCTTCGCGCGCGACATGGACATGGCCATCAACAGCCCCGTGTGGAAGATGACCCAGGCTCCGTGGAAGAGCGTCCCCAAGCCGACGATGCTCGCGGCTGAATGAATGAGTGTCGCACAGTCCCAGAGCGTCGCCGAGATCAAGGCGCGCAACAAGGAACTCATCGAAGAGGTCCTCAAGGTCTATCCCGAGAAGACCGCCAAGCGCCGCGCCAAGCACCTGAACGTCCACGAAGCCGGCAAGTCCGACTGCGGCGTGAAGTCCAACATCAAGTCCATCCCGGGCGTGATGACCATCCGCGGTTGCGCTTATGCCGGCTCCAAGGGTGTGGTGTGGGGTCCCATCA AGGACATGATCCACATCTCCCACGGCCCGGTGGGCTGCGGCCAGTATAGCTGGGCCGCCCGCCGCAACTATATCGGCACGACCGGCATCGACACCTTCGTGACGATGCAGTTCACCTCCGACTTCCAGGAGAAGGACATCGTCTTCGGCGGCGACAAGAAGCTCGCCAAGATCATGGACGAGATCCAGGAGCTGTTCCCGCTGAACAACGGCATCACCGTTCAGTCCGAGTGCCCCATCGGCCT CATCGGCGACGACATCGAGGCCGTCTCCAAGCAGAAGTCCAAGGAGTATGAGGGCAAGACCATCGTGCCGGTGCGCTGCGAGGGCTTCCGCGGCGTGTCCCAGTCCCTGGGCCACCACATCGCCAACGACGCCATCCGCGATTGGGTGTTCGACAAGATCGCGCCCGACGCCGAGCCGCGCTTTGAGCCGACCCCGTACGACGTCGCCATCATCGGCGACTACAATATCGGTGGTGACGCCTGGTCG TCCCGTATCCTCCTGGAGGAGATGGGCCTGCGCGTGATCGCCCAGTGGTCCGGCGACGGTTCGCTCGCTGAGCTGGAGGCCACCCCGAAGGCCAAGCTCAACGTGCTGCACTGCTACCGCTCCATGAACTACATCTCGCGCCACATGGAAGAGAAGTACGGTATCCCGTGGTGCGAGTACAACTTCTTCGGTCCTTCCAAGATCGCCGAGTCCCTGCGCAAGATCGCCAGCTACTTCGAC GACAAGATCAAGGAAGGCGCGGAGCGCGTCATCGCCAAGTATCAGCCGCTCATGGATGCGGTGATCGCGAAGTATCGTCCCCGCCTCGAGGGCAAGACCGTGATGCTGTACGTGGGCGGCCTGCGTCCCCGTCACGTCATCGGCGCCTACGAGGACCTGGGCATGGAAGTGGTCGGCACGGGCTACGAGTTCGCCCATAACGACGACTACCAGCGCACCGCCCAGCACTACGTCAAGG ATGGCACCATCATCTATGAACGACGTGACCGGCTACGAGTTCGAGAAGTTCGTCGAGAAGATCCAGCCGGACCTGGTCGGTTCGGGCATCAAGGAAAAGTACGTCTTCCAGAAGATGGGCGTGCCGTTCCGCCAGATGCACTCCTGGGACTACTCGGGCCCGTACCACGGCTATGACGGCTTCGCGATCTTCGCGCGCGACATGGACATGGCCATCAACAGCCCCGTGTGGAAGATGACCCAGGCTCC GTGGAAGAGCGTCCCCAAGCCGACGATGCTCGCGGCTGAATGA
SEQ ID NO: 53:SEQ ID NO:53:
Аминокислотная последовательность альфа-цепи молибден-железо белка нитрогеназы nifD_2Amino acid sequence of the molybdenum-iron alpha chain of the nitrogenase protein nifD_2
MSVAQSQSVAEIKARNKELIEEVLKVYPEKTAKRRAKHLNVHEAGKSDCGVKSNIKSIPGVMTIRGCAYAGSKGVVWGPIKDMIHISHGPVGCGQYSWAARRNYYIGTTGIDTFVTMQFTSDFQEKDIVFGGDKKLAKIMDEIQELFPLNNGITVQSECPIGLIGDDIEAVSKQKSKEYEGKTIVPVRCEGFRGVSQSLGHHIANDAIRDWVFDKIAPDAEPRFEPTPYDVAIIGDYNIGGDAWSSRILLEEMGLRVIAQWSGDGSLAELEATPKAKLNVLHCYRSMNYISRHMEEKYGIPWCEYNFFGPSKIAESLRKIASYFDDKIKEGAERVIAKYQPLMDAVIAKYRPRLEGKTVMLYVGGLRPRHVIGAYEDLGMEVVGTGYEFAHNDDYQRTAQHYVKDGTIIYDDVTGYEFEKFVEKIQPDLVGSGIKEKYVFQKMGVPFRQMHSWDYSGPYHGYDGFAIFARDMDMAINSPVWKMTQAPWKSVPKPTMLAAEMSVAQSQSVAEIKARNKELIEEVLKVYPEKTAKRRAKHLNVHEAGKSDCGVKSNIKSIPGVMTIRGCAYAGSKGVVWGPIKDMIHISHGPVGCGQYSWAARRNYYIGTTGIDTFVTMQFTSDFQEKDIVFGGDKKLAKIMDEIQELFPLNNGITVQSECPIGLIGDDIEAVSKQKSKEYEGKTIVPVR CEGFRGVSQSLGHHIANDAIRDWVFDKIAPDAEPRFEPTPYDVAIIGDYNIGGDAWSSRILLEEMGLRVIAQWSGDGSLAELEATPKAKLNVLHCYRSMNYISRHMEEKYGIPWCEYNFFGPSKIAESLRKIASYFDDKIKEGAERVIAKYQPLMDAVIAKYRPRLEGKTVMLYVGGLRPRHVIGAYEDLGMEVVGTGYEFAHND DYQRTAQHYVKDGTIIYDDVTGYEFEKFVEKIQPDLVGSGIKEKYVFQKMGVPFRQMHSWDYSGPYHGYDGFAIFARDMDMAINSPVWKMTQAPWKSVPKPTMLAAE
SEQ ID NO: 54:SEQ ID NO:54:
Нуклеотидная последовательность бета-цепи молибден-железо белка нитрогеназы nifK_1Nucleotide sequence of the molybdenum-iron beta chain of the nitrogenase protein nifK_1
ATGGCCACCGTTTCCGTCTCCAAGAAGGCCTGCGCGGTCAACCCCCTCAAGATGAGCCAGCCGGTGGGCGGCGCGCTCGCCTTCATGGGCGTGCGCAAGGCCATGCCGCTGCTGCACGGCTCGCAGGGCTGCACCTCCTTCGGCCTGGTGCTGTTCGTGCGCCACTTCAAGGAAGCCATCCCCATGCAGACCACCGCCATGAGCGAGGTGGCGACGGTTCTGGGCGGCCTTGAGAATGTGGAGCAGGCCATTCTCAACATCTACAATCGCACCAAGCCGGAGATCATCGGCATCTGCTCCACCGGCGTCACCGAGACCAAGGGCGATGATGTCGACGGCTACATCAAGCTGATCCGGGACAAGTATCCCCAGCTGGCCGACTTCCCGCTGGTCTATGTCTCCACCCCCGATTTCAAGGACGCCTTCCAGGACGGTTGGGAGAAGACCGTGGCGAAGATGGTGGAGGCGCTGGTGAAGCCCGCCGCCGACAAGCAGAAGGACAAGACCCGCGTCAACGTCCTGCCCGGCTGCCACCTCACGCCCGGCGATCTGGATGAGATGCGGACCATCTTCGAGGATTTCGGGCTCACACCCTATTTCCTGCCGGATCTGGCCGGCTCGCTGGATGGGCATATCCCCGAGGACTTCTCGCCCACCACCATCGGCGGCATCGGCATCGATGAGATCGCCACCATGGGCGAGGCGGCCCACACCATCTGCATCGGCGCGCAGATGCGCCGGGCGGGCGAGGCCATGGAGAAGAAGACCGGCATTCCCTTCAAGCTGTTCGAGCGCCTGTGCGGCCTGGAGGCGAACGACGCCTTCATCATGCACCTGTCGCAGATCTCCGGCCGGCCGGTGCCGGTGAAGTATCGCCGGCAGCGGGGCCAGCTGGTGGATGCCATGCTGGACGGCCACTTCCATCTGGGCGGTCGCAAGGTGGCCATGGGGGCGGAGCCGGACCTGCTCTACGACGTGGGCTCCTTCCTGCACGAGATGGGCGCCCACATCCTTTCCGCGGTCACCACCACCCAGTCGCCGGTGCTGGCGCGCCTGCCTGCCGAGGAGGTGCTTATCGGCGACCTGGAGGATCTGGAGACCCAGGCGAAGGCGCGCGGATGCGATCTCCTGCTCACCCATTCCCATGGGCGCCAGGCGGCGGAGCGCCTCCACATCCCCTTCTACCGGATCGGCATTCCCATGTTTGACCGGCTGGGGGCGGGGCATCTGTTGTCGGTGGGCTATCGCGGCACCCGCGACCTCATCTTCCATCTCGCCAACCTTGTGATCGCCGACCACGAGGAAAATCACGAGCCGACGCCCGACACCTGGGCCACCGGCCATGGCGAGCATGCCGCCGCCCCCACTTCCCATTGAATGGCCACCGTTTCCGTCTCCAAGAAGGCCTGCGCGGTCAACCCCCTCAAGATGAGCCAGCCGGTGGGCGGCGCGCTCGCCTTCATGGGCGTGCGCAAGGCCATGCCGCTGCTGCACGGCTCGCAGGGCTGCACCTCCTTCGGCCTGGTGCTGTTCGTGCGCCACTTCAAGGAAGCCATCCCCCATGCAGACCACCGCCATGAGCGAGGTGGCGACGGTTCTGGGCGGCCTTGAGAATGTGGA GCAGGCCATTCTCAACATCTACAATCGCACCAAGCCGGAGATCATCGGCATCTGCTCCACCGGCGTCACCGAGACCAAGGGCGATGATGTCGACGGCTACATCAAGCTGATCCGGGACAAGTATCCCCAGCTGGCCGACTTCCCGCTGGTCTATGTCTCCACCCCCGATTTCAAGGACGCCTTCCAGGACGGTTGGGAGAAGACCGTGGCGAAGATGGTGGAGGCGCTGGTGAAGCCCGCCGCCGACA AGCAGAAGGACAAGACCCGCGTCAACGTCCTGCCCGGCTGCCACCTCACGCCCGGCGATCTGGATGAGATGCGGACCATCTTCGAGGATTTCGGGCTCACACCCTATTTCCTGCCGGATCTGGCCGGCTCGCTGGATGGGCATATCCCCGAGGACTTCTCGCCCACCACCATCGGCGGCATCGGCATCGATGAGATCGCCACCATGGGCGAGGCGGCCCACCATCTGCATCGGCGCGCAGATGCGC CGGGCGGGCGAGGCCATGGAGAAGAAGACCGGCATTCCCTTCAAGCTGTTCGAGCGCCTGTGCGGCCTGGAGGCGAACGACGCCTTCATCATGCACCTGTCGCAGATCTCCGGCCGGCCGGTGCCGGTGAAGTATCGCCGGCAGCGGGGCCAGCTGGTGGATGCCATGCTGGACGGCCACTTCCATCTGGGCGGTCGCAAGGTGGCCATGGGGGCGGAGCCGGACCTGCTCTACGACGTGGGCTC CTTCCTGCACGAGATGGGCGCCCACATCCTTTCCGCGGTCACCACCACCCAGTCGCCGGTGCTGGCGCGCCTGCCTGCCGAGGAGGTGCTTATCGGCGACCTGGAGGATCTGGAGACCCAGGCGAAGGCGCGCGGATGCGATCTCCTGCTCACCCATTCCCATGGGCGCCAGGCGGCGGAGCGCCTCCACATCCCCTTCTACCGGATCGGCATTCCCATGTTTGACCGGCTGGGGGCGGGGCATCT GTTGTCGGTGGGCTATCGCGGCACCCGCGACCTCATCTTCCATCTCGCCAACCTTGTGATCGCCGACCACGAGGAAAATCACGAGCCGACGCCCGACACCTGGGCCACCGGCCATGGCGAGCATGCCGCCGCCCCCACTTCCCATTGA
SEQ ID NO: 55:SEQ ID NO: 55:
Аминокислотная последовательность бета-цепи молибден-железо белка нитрогеназы nifK_1Amino acid sequence of the molybdenum-iron beta chain of the nitrogenase protein nifK_1
MATVSVSKKACAVNPLKMSQPVGGALAFMGVRKAMPLLHGSQGCTSFGLVLFVRHFKEAIPMQTTAMSEVATVLGGLENVEQAILNIYNRTKPEIIGICSTGVTETKGDDVDGYIKLIRDKYPQLADFPLVYVSTPDFKDAFQDGWEKTVAKMVEALVKPAADKQKDKTRVNVLPGCHLTPGDLDEMRTIFEDFGLTPYFLPDLAGSLDGHIPEDFSPTTIGGIGIDEIATMGEAAHTICIGAQMRRAGEAMEKKTGIPFKLFERLCGLEANDAFIMHLSQISGRPVPVKYRRQRGQLVDAMLDGHFHLGGRKVAMGAEPDLLYDVGSFLHEMGAHILSAVTTTQSPVLARLPAEEVLIGDLEDLETQAKARGCDLLLTHSHGRQAAERLHIPFYRIGIPMFDRLGAGHLLSVGYRGTRDLIFHLANLVIADHEENHEPTPDTWATGHGEHAAAPTSHMATVSVSKKACAVNPLKMSQPVGGALAFMGVRKAMPLLHGSQGCTSFGLVLFVRHFKEAIPMQTTAMSEVATVLGGLENVEQAILNIYNRTKPEIIGICSTGVTETKGDDVDGYIKLIRDKYPQLADFPLVYVSTPDFKDAFQDGWEKTVAKMVEALVKPAADKQKDKTRVNVLPGCHLTPGDLDEMRTIFE DFGLTPYFLPDLAGSLDGHIPEDFSPTTIGGIGIDEIATMGEAAHTICIGAQMRRAGEAMEKKTGIPFKLFERLCGLEANDAFIMHLSQISGRPVPVKYRRQRGQLVDAMLDGHFHLGGRKVAMGAEPDLLYDVGSFLHEMGAHILSAVTTTQSPVLARLPAEEVLIGDLEDLETQAKARGCDLLLTHSHGRQAAERLHIPFYRIGIPMFDRLGAGHLL SVGYRGTRDLIFHLANLVIADHEENHEPTPDTWATGHGEHAAAPTSH
SEQ ID NO: 56:SEQ ID NO:56:
Нуклеотидная последовательность бета-цепи молибден-железо белка нитрогеназы nifK_2Nucleotide sequence of the molybdenum-iron beta chain of the nitrogenase protein nifK_2
ATGCCACAAAATGCTGACAATGTGCTCGATCACTTCGAGCTCTTCCGTGGTCCCGAATACCAGCAGATGCTGGCCAATAAGAAAAAGATGTTCGAGAACCCCCGCGATCCGGCCGAAGTCGAGCGCGTGCGGGAATGGGCGAAGACTCCTGAATACAAGGAGCTGAACTTCGCCCGCGAGGCGCTCACCGTGAATCCGGCCAAGGCTTGTCAGCCGCTGGGCGCGGTGTTCGTCGCCGTCGGCTTCGAGAGCACGATCCCCTTCGTGCACGGCTCGCAGGGTTGCGTCGCGTATTACCGCTCGCACCTCTCCCGCCACTTCAAGGAGCCGTCCTCCTGCGTCTCCTCGTCCATGACCGAGGATGCGGCGGTGTTCGGCGGCCTCAACAACATGATTGACGGCCTCGCCAACACCTACAACATGTACAAGCCGAAGATGATCGCCGTCTCCACCACCTGCATGGCGGAAGTCATCGGCGACGATCTGAACGCCTTCATCAAGACCGCGAAGGAAAAGGGCTCGGTTCCGGCCGAATACGACGTGCCCTTCGCCCACACCCCGGCGTTCGTCGGCAGCCATGTCACCGGCTACGACAATGCGCTCAAGGGCATCCTCGAGCACTTCTGGGACGGCAAGGCCGGCACCGCGCCGAAGCTGGAGCGCGTTCCCAACGAGAAGATCAACTTCATCGGCGGCTTCGACGGCTACACCGTCGGCAACACTCGCGAAGTGAAGCGCATCTTCGAGGCGTTCGGCGCCGATTACACCATCCTCGCCGACAATTCCGAAGTGTTCGACACCCCGACCGACGGCGAGTTCCGCATGTATGACGGCGGCACGACCCTGGAGGACGCGGCGAACGCGGTGCACGCCAAGGCCACCATCTCCATGCAGGAATACTGCACGGAGAAGACCCTGCCCATGATCGCCGGTCATGGCCAGGACGTGGTCGCCCTCAACCACCCCGTGGGCGTGGGCGGCACCGACAAGTTCCTCATGGAGATCGCCCGCCTCACCGGCAAGGAGATCCCCGAGGAGCTGACCCGCGAGCGCGGCCGTCTCGTGGACGCTATCGCGGACTCTTCCGCGCACATCCACGGCAAGAAGTTCGCCATCTACGGCGATCCGGATCTGTGCCTGGGCCTCGCCGCGTTCCTGCTGGAGCTGGGCGCCGAGCCGACCCATGTGCTGGCCACCAACGGCACCAAGAAGTGGGCCGAGAAGGTTCAGGAACTGTTCGACTCTTCGCCGTTCGGCGCCAACTGCAAGGTCTATCCCGGCAAGGACCTGTGGCACATGCGCTCGCTCCTGTTCGTGGAGCCGGTGGATTTCATCATCGGCAACACCTACGGCAAGTATCTCGAGCGCGACACGGGCACCCCGCTGATCCGTATCGGCTTCCCGGTGTTCGACCGTCACCACCACCACCGCCGTCCGGTGTGGGGCTATCAGGGCGGCATGAACGTCCTGATCACGATCCTCGACAAGATCTTTGACGAGATCGACCGCAACACCAACGTGCCGGCCAAGACCGACTACTCGTTCGACATCATTCGTTGAATGCCACAAAATGCTGACAATGTGCTCGATCACTTCGAGCTCTTCCGTGGTCCCGAATACCAGCAGATGCTGGCCAATAAGAAAAAGATGTTCGAGAACCCCCGCGATCCGGCCGAAGTCGAGCGCGTGCGGGAATGGGCGAAGACTCCTGAATACAAGGAGCTGAACTTCGCCCGCGAGGCGCTCACCGTGAATCCGGCCAAGGCTTGTCAGCCGCTGGGCGCGGTGTTCGTCGCCGTC GGCTTCGAGAGCACGATCCCCTTCGTGCACGGCTCGCAGGGTTGCGTCGCGTATTACCGCTCGCACCTCTCCCGCCACTTCAAGGAGCCGTCCTCCTGCGTCTCCTCGTCCATGACCGAGGATGCGGCGGTGTTCGGCGCCTCAACAACATGATTGACGGCCTCGCCAACACCTACAACATGTACAAGCCGAAGATGATCGCCGTCTCCACCACCTGCATGGCGGAAGTCATCGGCGACG ATCTGAACGCCTTCATCAAGACCGCGAAGGAAAAGGGCTCGGTTCCGGCCGAATACGACGTGCCCTTCGCCCACACCCCGGCGTTCGTCGGCAGCCATGTCACCGGCTACGACAATGCGCTCAAGGGCATCCTCGAGCACTTCTGGGACGGCAAGGCCGGCACCGCGCCGAAGCTGGAGCGCGTTCCCAACGAGAAGATCAACTTCATCGGCGGCTTCGACGGCTACACCGTCGGCAACACTCGCGA AGTGAAGCGCATCTTCGAGGCGTTCGGCGCCGATTACACCATCCTCGCCGACAATTCCGAAGTGTTCGACACCCCGACCGACGGCGAGTTCCGCATGTATGACGGCGGCACGACCCTGGAGGACGCGGCGAACGCGGTGCACGCCAAGGCCACCATCTCCATGCAGGAATACTGCACGGAGAAGACCCTGCCCATGATCGCCGGTCATGGCCAGGACGTGGTCGCCCTCAACCACCCCGTGGGCG TGGGCGGCACCGACAAGTTCCTCATGGAGATCGCCCGCCTCACCGGCAAGGAGATCCCCGAGGAGCTGACCCGCGAGCGCGGCCGTCTCGTGGACGCTATCGCGGACTCTTCCGCGCACATCCACGGCAAGAAGTTCGCCATCTACGGCGATCCGGATCTGTGCCTGGGCCTCGCCGCGTTCCTGCTGGAGCTGGGCGCCGAGCCGACCCATGTGCTGGCCACCAACGGCACCAAGAAGTGGG CCGAGAAGGTTCAGGAACTGTTCGACTCTTCGCCGTTCGGCGCCAACTGCAAGGTCTATCCCGGCAAGGACCTGTGGCACATGCGCTCGCTCCTGTTCGTGGAGCCGGTGGATTTCATCATCGGCAACACCTACGGCAAGTATCTCGAGCGCGACACGGGCACCCCGCTGATCCGTATCGGCTTCCCGGTGTTCGACCGTCACCACCACCACCGCCGTCCGGTGTGGGGCTATCAGGGCGG CATGAACGTCCTGATCACGATCCTCGACAAGATCTTTGACGAGATCGACCGCAACACCAACGTGCCGGCCAAGACCGACTACTCGTTCGACATCATTCGTTGA
SEQ ID NO: 57:SEQ ID NO:57:
Аминокислотная последовательность бета-цепи молибден-железо белка нитрогеназы nifK_2Amino acid sequence of the molybdenum-iron beta chain of the nitrogenase protein nifK_2
MPQNADNVLDHFELFRGPEYQQMLANKKKMFENPRDPAEVERVREWAKTPEYKELNFAREALTVNPAKACQPLGAVFVAVGFESTIPFVHGSQGCVAYYRSHLSRHFKEPSSCVSSSMTEDAAVFGGLNNMIDGLANTYNMYKPKMIAVSTTCMAEVIGDDLNAFIKTAKEKGSVPAEYDVPFAHTPAFVGSHVTGYDNALKGILEHFWDGKAGTAPKLERVPNEKINFIGGFDGYTVGNTREVKRIFEAFGADYTILADNSEVFDTPTDGEFRMYDGGTTLEDAANAVHAKATISMQEYCTEKTLPMIAGHGQDVVALNHPVGVGGTDKFLMEIARLTGKEIPEELTRERGRLVDAIADSSAHIHGKKFAIYGDPDLCLGLAAFLLELGAEPTHVLATNGTKKWAEKVQELFDSSPFGANCKVYPGKDLWHMRSLLFVEPVDFIIGNTYGKYLERDTGTPLIRIGFPVFDRHHHHRRPVWGYQGGMNVLITILDKIFDEIDRNTNVPAKTDYSFDIIRMPQNADNVLDHFELFRGPEYQQMLANKKKMFENPRDPAEVERVREWAKTPEYKELNFAREALTVNPAKACQPLGAVFVAVGFESTIPFVHGSQGCVAYYRSHLSRHFKEPSSCVSSSMTEDAAVFGGLNNMIDGLANTYNMYKPKMIAVSTTCMAEVIGDDLNAFIKTAKEKGSVPAEYDVPFAHTPAFVGSHVTGYDNALKGILEHF WDGKAGTAPKLERVPNEKINFIGGFDGYTVGNTREVKRIFEAFGADYTILADNSEVFDTPTDGEFRMYDGGTTLEDAANAVHAKATISMQEYCTEKTLPMIAGHGQDVVALNHPVGVGGTDKFLMEIARLTGKEIPEELTRERGRLVDAIADSSAHIHGKKFAIYGDPDLCLGLAAFLLELGAEPTHVLATNGTKKWAEKV QELFDSSPFGANCKVYPGKDLWHMRSLLFVEPVDFIIGNTYGKYLERDTGTPLIRIGFPVFDRHHHHRRPVWGYQGGMNVLITILDKIFDEIDRNTNVPAKTDYSFDIIR
SEQ ID NO: 58:SEQ ID NO:58:
Нуклеотидная последовательность железо белка нитрогеназы nifHNucleotide sequence of the iron nitrogenase protein nifH
GTGGAGTCCGGTGGTCCTGAGCCGGGCGTGGGCTGCGCCGGCCGCGGCGTGATCACCTCCATCAACTTCCTGGAGGAGAACGGCGCCTACGAGGACATCGACTATGTGTCCTACGACGTGCTGGGCGACGTGGTGTGCGGCGGCTTCGCCATGCCCATCCGCGAGAACAAGGCGCAGGAAATCTACATCGTGATGTCCGGCGAGATGATGGCCATGTATGCGGCCAACAACATCTCCAAGGGCATCCTGAAGTATGCCAATTCCGGCGGCGTGCGCCTGGGCGGGCTGGTCTGCAACGAGCGCCAGACCGACAAGGAGCTGGAGCTGGCGGAGGCTCTGGCGAAGAAGCTCGGCACCGAGCTGATCTACTTCGTGCCGCGCGACAACATCGTGCAGCATGCCGAGCTGCGCCGCATGACAGTGATCGAGTATGCGCCCGATTCCGCCCAGGCCCAGCACTACCGGAACCTGGCCGAGAAGGTGCACGCCAACAAGGGCAACGGCATCATCCCGACCCCGATCACCATGGACGAGCTGGAAGACATGCTCATGGAGCACGGCATCATGAAGGCCGTGGACGAGAGCCAGATCGGCAAGACCGCCGCCGAGCTCGCCGTCTGAGTGGAGTCCGGTGGTCCTGAGCCGGGCGTGGGCTGCGCCGGCCGCGGCGTGATCACCTCCATCAACTTCCTGGAGGAGAACGGCGCCTACGAGGACATCGACTATGTGTCCTACGACGTGCTGGGCGACGTGGTGTGCGGCGGCTTCGCCATGCCCATCCGCGAGAACAAGGCGCAGGAAATCTACATCGTGATGTCCGGCGAGATGATGGCCATGTATGCGGCCAACAACATCTCAAGGGCAT CCTGAAGTATGCCAATTCCGGCGGCGTGCGCCTGGGCGGGCTGGTCTGCAACGAGCGCCAGACCGACAAGGAGCTGGAGCTGGCGGAGGCTCTGGCGAAGAAGCTCGGCACCGAGCTGATCTACTTCGTGCCGCGCGACAACATCGTGCAGCATGCCGAGCTGCGCCGCATGACAGTGATCGAGTATGCGCCCGATTCCGCCCAGGCCCAGCACTACCGGAACCTGGCCGAGAAGGTGCACGCC AACAAGGGCAACGGCATCATCCCGACCCCGATCACCATGGACGAGCTGGAAGACATGCTCATGGAGCACGGCATCATGAAGGCCGTGGACGAGAGCCAGATCGGCAAGACCGCCGCCGAGCTCGCCGTCTGA
SEQ ID NO: 59:SEQ ID NO:59:
Аминокислотная последовательность железо белка нитрогеназы nifHAmino acid sequence of iron nitrogenase protein nifH
MESGGPEPGVGCAGRGVITSINFLEENGAYEDIDYVSYDVLGDVVCGGFAMPIRENKAQEIYIVMSGEMMAMYAANNISKGILKYANSGGVRLGGLVCNERQTDKELELAEALAKKLGTELIYFVPRDNIVQHAELRRMTVIEYAPDSAQAQHYRNLAEKVHANKGNGIIPTPITMDELEDMLMEHGIMKAVDESQIGKTAAELAVMESGGPEPGVGCAGRGVITSINFLEENGAYEDIDYVSYDVLGDVVCGGFAMPIRENKAQEIYIVMSGEMMAMYAANNISKGILKYANSGGVRLGGLVCNERQTDKELELAEALAKKLGTELIYFVPRDNIVQHAELRRMTVIEYAPDSAQAQHYRNLAEKVHANKGNGIIPTPITMDELEDMLMEHGIMKAVDESQIGKTAAEL AV
Дальнейшая обработкаFurther processing
В одном варианте осуществления изобретения способ настоящего изобретения включает в себя дополнительный этап сбора биомассы, полученной во время культивирования. Биомасса может, например, быть собрана путем осаждения (осаждения под действием силы тяжести), фильтрации, центрифугирования или флокуляции. Для флокуляции может потребоваться добавление флокулянта. Центрифугирование можно проводить, например, с помощью центрифуги непрерывного действия.In one embodiment, the method of the present invention includes the additional step of collecting biomass obtained during cultivation. Biomass can, for example, be collected by sedimentation (settling by gravity), filtration, centrifugation or flocculation. Flocculation may require the addition of a flocculant. Centrifugation can be carried out, for example, using a continuous centrifuge.
В одном варианте осуществления изобретения собранную биомассу затем сушат. Сушка может быть выполнена, например, с использованием хорошо известных методов, включая центрифугирование, барабанную сушку, выпаривание, сушку вымораживанием, нагревание, распылительную сушку, вакуумную сушку и/или вакуумную фильтрацию. Высушенная биомасса впоследствии может быть использована в продукте, например, в виде пищевого или кормового продукта или кормового или пищевого ингредиента.In one embodiment of the invention, the collected biomass is then dried. Drying can be accomplished, for example, using well known methods including centrifugation, drum drying, evaporation, freeze drying, heat, spray drying, vacuum drying and/or vacuum filtration. The dried biomass can subsequently be used in a product, for example as a food or feed product or a feed or food ingredient.
В другом варианте осуществления изобретения клетки собранной биомассы лизируют. В некоторых вариантах осуществления изобретения лизат может быть разделен на нерастворимую и растворимую фракции, каждая из которых или обе могут быть впоследствии сконцентрированы или высушены, а затем использованы в продукте, например, в пищевом или кормовом продукте.In another embodiment of the invention, the cells of the collected biomass are lysed. In some embodiments, the lysate may be separated into insoluble and soluble fractions, each or both of which may subsequently be concentrated or dried and then used in a product, such as a food or feed product.
В одном варианте осуществления изобретения биомассу собирают и из указанной биомассы выделяют белки, в результате чего получают белковую фракцию и фракцию, содержащую небелковые компоненты. Таким образом, в одном варианте осуществления изобретения способ предназначен для получения белка и включает в себя этап культивирования штамма VTT-E-193585 или его производного с последующим этапом сбора биомассы и дополнительным этапом выделения белков из указанной биомассы. В другом варианте осуществления изобретения способ предназначен для получения белка и включает в себя культивирование бактериального штамма рода Xanthobacter в непрерывной культуре с водородом в качестве источника энергии и источником неорганического углерода, где источник неорганического углерода включает в себя диоксид углерода, с последующим этапом сбора биомассы, и дальнейший этап выделения белков из указанной биомассы. В зависимости от способа выделения белков, полученные фракции могут быть более или менее чистыми. Таким образом, термин «белковая фракция» означает фракцию, обогащенную белками. Белковая фракция может по-прежнему содержать значительные количества других компонентов, а также значительные количества белка могут оказаться во «фракции, содержащей небелковые компоненты».In one embodiment of the invention, biomass is collected and proteins are separated from said biomass, resulting in a protein fraction and a fraction containing non-protein components. Thus, in one embodiment of the invention, the method is for the production of protein and includes the step of cultivating strain VTT-E-193585 or a derivative thereof, followed by the step of collecting biomass and the additional step of isolating proteins from said biomass. In another embodiment of the invention, the method is for producing protein and includes cultivating a bacterial strain of the genus Xanthobacter in a continuous culture with hydrogen as an energy source and an inorganic carbon source, where the inorganic carbon source includes carbon dioxide, followed by the step of collecting biomass, and further stage of protein isolation from the specified biomass. Depending on the method of protein isolation, the resulting fractions may be more or less pure. Thus, the term "protein fraction" means a fraction enriched in proteins. The protein fraction may still contain significant amounts of other components, and significant amounts of protein may also end up in the “non-protein component containing fraction.”
Выделение белков можно проводить с использованием любого подходящего способа. Например, в одном варианте осуществления изобретения белки выделяют путем механического разрушения клеток и отделения белка от клеточного дебриса с помощью одного или нескольких этапов фильтрации, например путем последовательной фильтрации через несколько фильтров с уменьшающимся размером пор. Механическое разрушение может быть осуществлено с использованием любого подходящего способа, например при помощи шаровой мельницы, обработки ультразвуком, гомогенизации, гомогенизации под высоким давлением, механической резки и т. д. Полученная отфильтрованная белковая фракция будет обогащена белками, но также будет содержать другие более мелкие компоненты. Белок может быть необязательно дополнительно очищен от этой фракции с использованием любого подходящего способа.Protein isolation can be carried out using any suitable method. For example, in one embodiment of the invention, proteins are isolated by mechanically disrupting cells and separating the protein from cellular debris using one or more filtration steps, such as sequential filtration through multiple filters of decreasing pore size. Mechanical disruption can be carried out using any suitable method, such as ball milling, sonication, homogenization, high pressure homogenization, mechanical cutting, etc. The resulting filtered protein fraction will be enriched in proteins, but will also contain other smaller components . The protein may optionally be further purified from this fraction using any suitable method.
В другом варианте осуществления изобретения белковую фракцию выделяют, проводя экстракцию этанолом с последующей одной или несколькими этапами фильтрации. Такие способы известно, например, для получения соевых белков (см., например, главу 5 «Soybean Protein Concentrates» в «Technology of production of edible flours and protein products from soybeans» by Berk FAO Agricultural Services Bulletin No. 97 (1992)). Полученная белковая фракция будет обогащена белками, но также содержать другие компоненты. Белок может быть необязательно дополнительно очищен от этой фракции с использованием любого подходящего способа.In another embodiment of the invention, the protein fraction is isolated by ethanol extraction followed by one or more filtration steps. Such methods are known, for example, for the production of soybean proteins (see, for example, Chapter 5 “Soybean Protein Concentrates” in “Technology of production of edible flours and protein products from soybeans” by Berk FAO Agricultural Services Bulletin No. 97 (1992)) . The resulting protein fraction will be enriched with proteins, but also contain other components. The protein may optionally be further purified from this fraction using any suitable method.
В одном варианте осуществления изобретения способ настоящего изобретения включает в себя дополнительный этап гидролиза белковой фракции, полученной способом настоящего изобретения, с получением аминокислот и малых пептидов.In one embodiment, the method of the present invention includes the additional step of hydrolyzing the protein fraction obtained by the method of the present invention to produce amino acids and small peptides.
В одном варианте осуществления способа настоящего изобретения способ включает в себя дополнительный этап получения пищевого продукта или кормового продукта из указанной биомассы, из указанной белковой фракции или из указанной фракции, содержащей небелковые компоненты. Указанный дополнительный этап может просто включать в себя введение указанной биомассы, белковой фракции или фракции, содержащей небелковые компоненты, в пищевой продукт или кормовой продукт путем их добавления во время производства пищевого продукта или кормового продукта. В других вариантах осуществления изобретения дальнейшую очистку или модификацию биомассы или ее фракции осуществляют в процессе ее включения в пищевой или кормовой продукт.In one embodiment of the method of the present invention, the method includes the additional step of producing a food product or feed product from said biomass, from said protein fraction, or from said fraction containing non-protein components. Said additional step may simply include introducing said biomass, protein fraction or fraction containing non-protein components into the food or feed product by adding them during production of the food or feed product. In other embodiments of the invention, further purification or modification of the biomass or its fraction is carried out during its incorporation into a food or feed product.
В ещё одном аспекте настоящее изобретение относится к продукту, такому как биомасса, белку или небелковым компонентам, полученным или получаемым способом настоящего изобретения.In yet another aspect, the present invention relates to a product, such as biomass, protein or non-protein components, obtained or obtained by the method of the present invention.
В одном варианте осуществления изобретения продукт, полученный способом настоящего изобретения, содержит более 40% белка, например, от 40% до 99% белка, например от 40% до 90% белка, например, от 40% до 60% белка. В конкретном варианте осуществления изобретения продукт содержит от 25% до 75% белка, от 0% до 20% липидов и от 5% до 40% углеводов. В другом варианте осуществления изобретения продукт содержит от 40 до 60% белка, от 0 до 15% липидов и от 10 до 25% углеводов. В ещё одном варианте осуществления изобретения продукт, полученный способом настоящего изобретения, содержит от 45% до 55% белка, от 5% до 10% липидов и от 10% до 20% углеводов.In one embodiment of the invention, the product obtained by the method of the present invention contains more than 40% protein, for example, from 40% to 99% protein, for example, from 40% to 90% protein, for example, from 40% to 60% protein. In a specific embodiment of the invention, the product contains from 25% to 75% protein, from 0% to 20% lipids and from 5% to 40% carbohydrates. In another embodiment of the invention, the product contains from 40 to 60% protein, from 0 to 15% lipids and from 10 to 25% carbohydrates. In yet another embodiment of the invention, the product obtained by the method of the present invention contains from 45% to 55% protein, from 5% to 10% lipids and from 10% to 20% carbohydrates.
Как описано выше, в дополнительном аспекте настоящее изобретение относится к пищевому продукту или кормовому продукту, полученному или получаемому способом настоящего изобретения. Подразумевается, что используемые здесь термины «пищевой продукт» и «корм» включают в себя не только обычные пищевые продукты и кормовые продукты, такие как переработанные пищевые продукты, но и сопутствующие продукты, такие как пищевые и кормовые добавки, например, протеиновые батончики, порошки или коктейли, заменители мяса, пищевые ингредиенты, пробиотики, пребиотики, нутрицевтики и тому подобное. В некоторых вариантах осуществления изобретения указанную биомассу, указанную белковую фракцию или указанную фракцию, содержащую небелковые компоненты, используют при производстве вегетарианского или веганского пищевого продукта.As described above, in a further aspect, the present invention relates to a food product or feed product produced or produced by the method of the present invention. The terms "food" and "feed" as used herein are intended to include not only conventional food and feed products such as processed foods, but also related products such as food and feed additives, e.g. protein bars, powders or shakes, meat substitutes, food ingredients, probiotics, prebiotics, nutraceuticals and the like. In some embodiments, said biomass, said protein fraction, or said fraction containing non-protein components is used in the production of a vegetarian or vegan food product.
Изобретение дополнительно проиллюстрировано следующими неограничивающими примерами:The invention is further illustrated by the following non-limiting examples:
ПРИМЕРЫEXAMPLES
Пример 1. Выделение бактериального штамма, способного к хемоавтотрофному росту Example 1. Isolation of a bacterial strain capable of chemoautotrophic growth
Образец объёмом 50 мл, содержащий почву и морскую воду, был собран в стерильную пробирку Falcon с берега Балтийского моря в Наантали в Финляндии. Часть образца почвы смешивали с 10 мл минеральной среды в стерильной колбе Эрленмейера. Среда содержала 1 г/л NH4OH, 0,23 г/л KH2PO4, 0,29 г/л Na2HPO4 • 2H2O, 0,005 г/л NaVO3 • H2O, 0,2 г/л FeSO4 • 7H2O, 0,5 г/л MgSO4 • 7 H2O, 0,01 г/л CaSO4, 0,00015 г/л Na2MoO4 • 2H2O, 0,005 г/л MnSO4, 0,0005 г/л ZnSO4 • 7H2O, 0,0015 г/л H3BO3, 0,001 г/л CoSO4, 0,00005 г/л CuSO4 и 0,0001 г/л NiSO4, и была приготовлена на водопроводной воде. Суспензию почвы и среды инкубировали в инкубаторе-качалке при температуре +30°С в герметичном стальном боксе, непрерывно продуваемом газовой смесью: 150 мл/мин N2, 18 мл/мин H2, 3 мл/мин O2 и 6 мл/мин CO2. Культивирование возобновляли с семидневными интервалами путем взятия 1 мл суспензии, которую в стерильных условиях добавляли к 9 мл среды в колбе Эрленмейера, а затем помещали обратно в инкубационный бокс. После четвертого разведения в суспензии не осталось заметного загрязнения почвой. Объём клеточной суспензии увеличивали до 100 мл с целью наращивания биомассы для культивирования в биореакторе. Оптическая плотность (ОП600) суспензии составила 1,53 при её инокуляции в 190 мл минеральной среды в системе параллельного биореактора объёмом 200 мл с 15 сосудами (Medicel Explorer, Medicel Oy, Финляндия). Условиями культивирования были перемешивание со скоростью 800 об/мин, температура +30°C, и значение рН доводили до 6,8, регулируя его с помощью 1 М NaOH. Газ подавали через барботер в виде газовой смеси, состоящей из 14 мл/мин Н2, 3 мл/мин О2 и 6 мл/мин СО2. Головное пространство реактора продували воздухом со скоростью 300 мл/мин. Непрерывное культивирование подпитывали минеральной средой со скоростью 6 мл/ч, клеточную суспензию отбирали из реактора через капилляр, поддерживая постоянный объём на уровне 200 мл. Суспензию клеток, извлеченную из реактора, хранили при температуре +4°С. Образец отбирали из биореактора автоматически каждый день и измеряли оптическую плотность при длине волны 600 нм для отслеживания роста. После 498 часов культивирования в биореакторе в асептических условиях отбирали образцы, суспензию разбавляли и высевали на чашки с агаризованной минеральной средой, содержащей вышеуказанные минералы и 2% бактериологического агара. Чашки инкубировали в тех же условиях, что были описаны выше для колб Эрленмейера. Затем колонии собирали с чашек с агаром и высевали штрихами на новые чашки с агаром, чтобы выделить один микроорганизм в одной колонии. Эту операцию повторяли дважды. Отдельные колонии собирали и суспендировали в 200 мкл среды в 96-луночном титрационном микропланшете. Суспензию инкубировали при температуре +30°C и встряхивали со скоростью 625 об/мин в газонепроницаемом боксе EnzyScreen, который непрерывно продували 150 мл/мин N2, 18 мл/мин H2, 3 мл/мин O2 и 6 мл/мин СО2. Суспензию из одной лунки переносили в колбу Эрленмейера и дополняли свежей средой. Объём увеличивали до тех пор, пока не было достаточно биомассы для проведения культивирования в биореакторе. Организм депонировали в коллекции культур VTT как VTT-E-193585.A 50 ml sample containing soil and seawater was collected in a sterile Falcon tube from the Baltic Sea shore at Naantali in Finland. A portion of the soil sample was mixed with 10 ml of mineral medium in a sterile Erlenmeyer flask. The medium contained 1 g/l NH 4 OH, 0.23 g/l KH 2 PO 4 , 0.29 g/l Na 2 HPO 4 • 2H 2 O, 0.005 g/l NaVO 3 • H2O, 0.2 g/ l FeSO 4 • 7H 2 O, 0.5 g/l MgSO 4 • 7 H 2 O, 0.01 g/l CaSO 4 , 0.00015 g/l Na 2 MoO 4 • 2H 2 O, 0.005 g/l MnSO 4 , 0.0005 g/l ZnSO4 • 7H 2 O, 0.0015 g/l H 3 BO 3 , 0.001 g/l CoSO 4 , 0.00005 g/l CuSO 4 and 0.0001 g/l NiSO 4 , and was prepared with tap water. The soil and medium suspension was incubated in a rocking incubator at a temperature of +30°C in a sealed steel box, continuously purged with a gas mixture: 150 ml/min N2 , 18 ml/min H2 , 3 ml/min O2 and 6 ml/min CO2 . Culture was resumed at seven-day intervals by removing 1 ml of the suspension, which was added under sterile conditions to 9 ml of medium in the Erlenmeyer flask and then placed back into the incubation box. After the fourth dilution, no noticeable soil contamination remained in the suspension. The volume of the cell suspension was increased to 100 ml in order to increase the biomass for cultivation in the bioreactor. The optical density (OD600) of the suspension was 1.53 when inoculated into 190 ml of mineral medium in a 200 ml parallel bioreactor system with 15 vessels (Medicel Explorer, Medicel Oy, Finland). The culture conditions were stirring at 800 rpm, temperature +30°C, and the pH was adjusted to 6.8 with 1 M NaOH. Gas was supplied through a bubbler in the form of a gas mixture consisting of 14 ml/min H 2 , 3 ml/min O 2 and 6 ml/min CO 2 . The head space of the reactor was purged with air at a speed of 300 ml/min. Continuous cultivation was fed with a mineral medium at a rate of 6 ml/h, the cell suspension was taken from the reactor through a capillary, maintaining a constant volume at 200 ml. The cell suspension removed from the reactor was stored at +4°C. A sample was collected from the bioreactor automatically every day and absorbance was measured at 600 nm to monitor growth. After 498 hours of cultivation in a bioreactor under aseptic conditions, samples were taken, the suspension was diluted and sown on plates with an agar mineral medium containing the above minerals and 2% bacteriological agar. The dishes were incubated under the same conditions as described above for Erlenmeyer flasks. Colonies were then picked from the agar plates and streaked onto new agar plates to isolate one microorganism per colony. This operation was repeated twice. Individual colonies were collected and suspended in 200 μl of medium in a 96-well microtiter plate. The suspension was incubated at +30°C and shaken at 625 rpm in a gas-tight EnzyScreen box, which was continuously purged with 150 ml/min N2, 18 ml/min H 2 , 3 ml/min O 2 and 6 ml/min CO 2 . The suspension from one well was transferred to an Erlenmeyer flask and supplemented with fresh medium. The volume was increased until there was enough biomass to carry out cultivation in the bioreactor. The organism was deposited in the VTT culture collection as VTT-E-193585.
Секвенирование 16S рРНК образца показало, что образец содержал только один организм. Тот же образец был использован для секвенирования Illumina NextSeq, что позволило получить метагеномные последовательности методом дробовика размером 1x150 п.н. С помощью программы Unicycler (Wick et al., 2017 PLoS computer biology 13:e1005595) была создана сборка de novo для метагеномных последовательностей, состоящих из 101 контига. Общая длина генома составила 4846739 п.н., а содержание GC — 67,9 %. Предсказания генов и функциональные аннотации были выполнены с использованием программы Prokka (Seemann, 2014 Bioinformatics 30:2068). Аннотация генома выявила 4429 генов. Пангеномное выравнивание Roary (Page et al., 2015 Bioinformatics 31:3691) сгруппировало VTT-E-193585 среди видов Xanthobacter. Таким образом, штамм был идентифицирован как Xanthobacter sp., ближайшим геномом для которого является Xanthobacter tagetidis. Основанный на выравнивании расчёт средней идентичности нуклеотидов, учитывающий только ортологичные фрагменты (OrthoANI) (Lee et al., 2016 Int J Syst Evol Microbiol 66:1100), дал наилучшее совпадение 80,4% с Xanthobacter tagetidis (ATCC 700314; GCF_003667445.1), тогда как предлагаемая граница видов составляет 95-96% (см., например, Chun et al., 2018 Int J Syst Evol Microbiol, 68: 461-466). Xanthobacter autotrophicus Py2 дал совпадение на 79,6%, в то время как совпадение для Xanthobacter sp. 91 было 79,0%. Таким образом, можно сделать вывод, что выделенный бактериальный штамм, депонированный как VTT-E-193585, принадлежит к филуму: Proteobacteria; к классу: Альфа-протеобактерии; и к порядку: Rhizobiales. Наиболее вероятным семейством является Xanthobacteraceae и род Xanthobacter. Бактериальный штамм VTT-E-193585 не может быть однозначно отнесен к какому-либо известному виду.Sequencing of the sample's 16S rRNA revealed that the sample contained only one organism. The same sample was used for Illumina NextSeq sequencing, producing 1x150-bp shotgun metagenomic sequences. Using the Unicycler program (Wick et al., 2017 PLoS computer biology 13:e1005595), a de novo assembly was generated for metagenomic sequences consisting of 101 contigs. The total genome length was 4846739 bp, and the GC content was 67.9%. Gene predictions and functional annotations were performed using Prokka (Seemann, 2014 Bioinformatics 30:2068). Genome annotation identified 4429 genes. The Roary pangenomic alignment (Page et al., 2015 Bioinformatics 31:3691) grouped VTT-E-193585 among Xanthobacter species. Thus, the strain was identified as a Xanthobacter sp., the closest genome for which is Xanthobacter tagetidis . An alignment-based calculation of average nucleotide identity considering only orthologous fragments (OrthoANI) (Lee et al., 2016 Int J Syst Evol Microbiol 66:1100) gave the best match of 80.4% to Xanthobacter tagetidis (ATCC 700314; GCF_003667445.1) , whereas the proposed species boundary is 95-96% (see, e.g., Chun et al., 2018 Int J Syst Evol Microbiol, 68: 461-466). Xanthobacter autotrophicus Py2 gave a 79.6% match, while the match for Xanthobacter sp. 91 was 79.0%. Thus, we can conclude that the isolated bacterial strain, deposited as VTT-E-193585, belongs to the phylum: Proteobacteria; to class: Alpha-proteobacteria; and to order: Rhizobiales. The most likely family is Xanthobacteraceae and the genus Xanthobacter . The bacterial strain VTT-E-193585 cannot be unambiguously assigned to any known species.
Проводили поиск предполагаемых генов устойчивости к противомикробным препаратам. Инструмент ABRicate (https://github.com/tseemann/abricate) использовали для геномного поиска в базах данных Arg-Annot, NCBI, ResFinder, ecOH, Megares и VFDB с использованием blastn или blastp. Порог 50% был установлен как для идентичности, так и для охвата как на уровне нуклеотидов, так и на уровне белка. Были идентифицированы только два предполагаемых гена устойчивости к противомикробным препаратам. Эти два гена не содержат аминокислотных изменений, связанных с устойчивостью к антибиотикам, и, таким образом, резистентный фенотип не ожидается.A search for putative antimicrobial resistance genes was performed. The ABRicate tool ( https://github.com/tseemann/abricate ) was used for genomic searches of Arg-Annot, NCBI, ResFinder, ecOH, Megares, and VFDB databases using blastn or blastp. A threshold of 50% was set for both identity and coverage at both the nucleotide and protein levels. Only two putative antimicrobial resistance genes have been identified. These two genes do not contain amino acid changes associated with antibiotic resistance, and thus a resistant phenotype is not expected.
Пример 2. Пилотное культивирование и анализ выделенного бактериального штаммаExample 2: Pilot cultivation and analysis of an isolated bacterial strain
Выделенный бактериальный штамм, депонированный как VTT-E-193585, культивировали в обычном 200-литровом биореакторе с мешалкой (MPF-U, Marubishi Ltd, Япония). Перемешивание осуществляли крыльчатками Раштона, вращающимися со скоростью 400 об/мин. Температуру при культивировании поддерживали на уровне +30°С. Значение рН поддерживали на уровне 6,8±0,2 путем добавления 8 М NaOH или 3,6 М H3PO4 с программным управлением. Среда культивирования содержала 1 г/л NH4OH, 0,23 г/л KH2PO4, 0,29 г/л Na2HPO4 • 2H2O, 0,005 г/л NaVO3 • H2O, 0,2 г/л FeSO4 • 7H2O, 0,5 г/л MgSO4 • 7H2O, 0,01 г/л CaSO4, 0,00015 г/л Na2MoO4 • 2H2O, 0,005 г/л MnSO4, 0,0005 г/л ZnSO4 • 7H2O, 0,0015 г/л H3BO3, 0,001 г/л CoSO4, 0,00005 г/л CuSO4 и 0,0001 г/л NiSO4, и была приготовлена на водопроводной воде. В качестве основного источника энергии и углерода постоянно подавали смесь, содержащую 1,8-10,5 л/мин газообразного водорода, 0,6-2,5 л/мин газообразного кислорода и 1,8-5 л/мин газообразного диоксида углерода. Уровень растворенного кислорода поддерживали на уровне 7,2±0,5% путем регулирования состава газовой смеси. Инокулят для культивирования готовили, как описано в Примере 1. Рост контролировали, отбирая образцы вручную и анализируя плотность клеток как оптическую плотность путем измерения поглощения при длине волны 600 нм (УФ/видимый свет спектрофотометр Ultrospec 2100 pro, Biochrom Ltd., Англия) и путем измерения сухой массы клеток (CDW) путем сушки в печи в течение ночи при температуре 105°C. Оптическую плотность также контролировали с помощью датчика поглощения in situ (Trucell 2, Finesse Ltd, США). Кривая роста культивирования представлена на Фигуре 1. Максимальная скорость роста в периодической фазе составила 0,06 ч-1. Максимальная плотность клеток составила 4,5 г_CDW/л через 92 часа. После 92 ч культивирования начинали подачу свежей культуральной среды, как описано выше, со скоростью разбавления 0,01 ч-1. Во время непрерывной подпитки плотность клеток составляла в среднем 2,9 г_CDW/л. Культуральную жидкость постоянно собирали в охлаждаемый (+10°С) резервуар, откуда порциями по 300 л подавали в центробежный сепаратор непрерывного действия (BTPX-205, Alfa-Laval AB, Швеция). Собранную из сепаратора суспензию, содержащую клетки, подавали в атмосферную двухбарабанную сушилку (Buflovak 6x8 ADDD, Hebeler Process Solutions Llc., США), нагреваемую паром под давлением 4 бар и барабанами, вращающимися со скоростью 3,5 об/мин. Это привело к получению высушенного клеточного порошка с содержанием сухого вещества приблизительно 96%. Результаты анализа высушенного клеточного порошка представлены в Таблице 1 для приблизительного состава, в Таблице 2 для аминокислотного состава, в Таблице 3 для жирнокислотного состава и в Таблице 4 для содержания витаминов. Анализы показали, что высушенный клеточный порошок имеет высокое содержание белка со всеми незаменимыми аминокислотами. Он также содержит больше ненасыщенных жирных кислот, чем насыщенных, и много витаминов группы В. Содержание пептидогликана составляет всего 0,002 мг/г_CDW, а содержание липополисахарида составляет 0,01 мг/г_CDW. Желательно, чтобы эти концентрации были как можно меньше. Для сравнения, в коммерческом препарате молочнокислых бактерий, проанализированном в то же время, содержание пептидогликана составило 0,244 мг/г_DW, а содержание липополисахарида составило 0,015 мг/г_DW. Анализы цитотоксичности и генотоксичности проводили с использованием надосадочных образцов культивирования. Никакой цитотоксичности в отношении клеточных линий человека HepG2 или HeLa229 не наблюдалось. Генотоксичности в отношении штаммов Escherichia coli WP2 trp- или CM871 uvrA recA lexA не наблюдалось.The isolated bacterial strain, deposited as VTT-E-193585, was cultured in a conventional 200-L stirred tank bioreactor (MPF-U, Marubishi Ltd, Japan). Mixing was carried out with Rushton impellers rotating at a speed of 400 rpm. The temperature during cultivation was maintained at +30°C. The pH value was maintained at 6.8 ± 0.2 by adding 8 M NaOH or 3.6 M H 3 PO 4 under program control. The cultivation medium contained 1 g/l NH 4 OH, 0.23 g/l KH 2 PO 4 , 0.29 g/l Na 2 HPO 4 • 2H 2 O, 0.005 g/l NaVO 3 • H 2 O, 0. 2 g/l FeSO 4 • 7H 2 O, 0.5 g/l MgSO 4 • 7H 2 O, 0.01 g/l CaSO 4 , 0.00015 g/l Na 2 MoO 4 • 2H 2 O, 0.005 g /l MnSO 4 , 0.0005 g/l ZnSO 4 • 7H 2 O, 0.0015 g/l H 3 BO 3 , 0.001 g/l CoSO 4 , 0.00005 g/l CuSO 4 and 0.0001 g/ l NiSO 4 , and was prepared with tap water. As the main source of energy and carbon, a mixture containing 1.8-10.5 l/min of hydrogen gas, 0.6-2.5 l/min of oxygen gas and 1.8-5 l/min of carbon dioxide gas was constantly supplied. The level of dissolved oxygen was maintained at 7.2 ± 0.5% by adjusting the composition of the gas mixture. The culture inoculum was prepared as described in Example 1. Growth was monitored by manually sampling and analyzing cell density as optical density by measuring absorbance at 600 nm (UV/visible light spectrophotometer Ultrospec 2100 pro, Biochrom Ltd., England) and by Measuring cell dry weight (CDW) by oven drying overnight at 105°C. Optical density was also monitored using an in situ absorbance sensor (Trucell 2, Finesse Ltd, USA). The growth curve of the cultivation is presented in Figure 1. The maximum growth rate in the periodic phase was 0.06 h -1 . The maximum cell density was 4.5 g_CDW/L at 92 hours. After 92 hours of cultivation, the supply of fresh culture medium was started, as described above, at a dilution rate of 0.01 h -1 . During continuous feeding, cell density averaged 2.9 g_CDW/L. The culture liquid was constantly collected in a cooled (+10°C) tank, from where it was fed in 300 L portions into a continuous centrifugal separator (BTPX-205, Alfa-Laval AB, Sweden). The suspension containing cells collected from the separator was fed into an atmospheric double-drum dryer (Buflovak 6x8 ADDD, Hebeler Process Solutions Llc., USA), heated by steam at a pressure of 4 bar and drums rotating at a speed of 3.5 rpm. This resulted in a dried cell powder with a dry matter content of approximately 96%. The results of the analysis of the dried cell powder are presented in Table 1 for approximate composition, Table 2 for amino acid composition, Table 3 for fatty acid composition and Table 4 for vitamin content. Tests have shown that the dried cell powder has a high protein content with all essential amino acids. It also contains more unsaturated fatty acids than saturated fatty acids and many B vitamins. The peptidoglycan content is only 0.002 mg/g_CDW and the lipopolysaccharide content is 0.01 mg/g_CDW. It is desirable that these concentrations be as low as possible. In comparison, a commercial lactic acid bacteria preparation analyzed at the same time had a peptidoglycan content of 0.244 mg/g_DW and a lipopolysaccharide content of 0.015 mg/g_DW. Cytotoxicity and genotoxicity assays were performed using culture supernatants. No cytotoxicity was observed against human HepG2 or HeLa229 cell lines. No genotoxicity was observed against Escherichia coli strains WP2 trp- or CM871 uvrA recA lexA .
Таблица 1. Результаты анализа порошка высушенных клеток выделенного бактериального штамма, депонированного как VTT-E-193585Table 1. Results of analysis of dried cell powder of the isolated bacterial strain deposited as VTT-E-193585
Таблица 2. Аминокислотный состав высушенного клеточного порошка выделенного бактериального штамма, депонированного как VTT-E-193585Table 2. Amino acid composition of the dried cell powder of the isolated bacterial strain deposited as VTT-E-193585
Таблица 3. Жирнокислотный состав высушенного клеточного порошка выделенного бактериального штамма, депонированного как VTT-E-193585Table 3. Fatty acid composition of the dried cell powder of the isolated bacterial strain deposited as VTT-E-193585
Таблица 4. Содержание витаминов в высушенном клеточном порошке выделенного бактериального штамма, депонированного как VTT-E-193585Table 4. Vitamin content of dried cell powder of the isolated bacterial strain deposited as VTT-E-193585
Пример 3. Культивирование выделенного бактериального штамма на различных источниках азотаExample 3: Cultivation of an isolated bacterial strain on various nitrogen sources
Выделенный бактериальный штамм, депонированный как VTT-E-193585, культивировали в системе параллельных биореакторов с 15 сосудами объёмом 200 мл (Medicel Explorer, Medicel Oy, Финляндия). Перемешивание осуществляли крыльчатками Раштона, вращающимися со скоростью 800 об/мин. Температуру при культивировании поддерживали на уровне +30°С. Значение pH поддерживали на уровне 6,8 добавлением 1 М NaOH. Среда культивирования содержала 0,23 г/л KH2PO4, 0,29 г/л Na2HPO4 • 2 H2O, 0,005 г/л NaVO3 • H2O, 0,2 г/л FeSO4 • 7H2O, 0,5 г/л MgSO4 • 7H2O, 0,01 г/л CaSO4, 0,00015 г/л Na2MoO4 • 2H2O, 0,005 г/л MnSO4, 0,0005 г/л ZnSO4 • 7H2O, 0,0015 г/л H3BO3, 0,001 г/л CoSO4, 0,00005 г/л CuSO4 и 0,0001 г/л NiSO4, и была приготовлена на водопроводной воде. Кроме того, источник азота при культивировании варьировался так, что четыре культивирования содержали 18,7 мМ NH4OH, четыре культивирования содержали 9,34 мМ мочевины (OC(NH2)2), четыре культивирования содержали 18,7 мМ нитрата калия (KNO3) и три культивирования оставались без источника азота в среде. В качестве основного источника энергии и углерода постоянно подавали смесь, содержащую 22 мл/мин газообразного водорода, 3,2 мл/мин воздуха и 6,4 мл/мин газообразного диоксида углерода. Таким образом, вместе с воздухом все варианты культивирования получали и газообразный азот. Рост контролировали, автоматически отбирая образцы и анализируя плотность клеток как оптическую плотность путем измерения поглощения при длине волны 600 нм (УФ/видимый свет спектрофотометр Ultrospec 2100 pro, Biochrom Ltd., Англия). Кривые роста культивирования представлены на фиг. 2. Рост на аммиаке и мочевине был сопоставим. Рост на нитратах или газообразном азоте был явно медленнее, чем на аммиаке или мочевине. К концу культивирования рост на нитратах был лучше, чем на газообразном азоте как единственном источнике азота. Тем не менее, также наблюдался рост в культурах, в которых газообразный азот был единственным источником азота, демонстрируя, что выделенный бактериальный штамм, депонированный как VTT-E-193585, способен фиксировать азот.The isolated bacterial strain, deposited as VTT-E-193585, was cultivated in a parallel bioreactor system with 15 200 ml vessels (Medicel Explorer, Medicel Oy, Finland). Mixing was carried out with Rushton impellers rotating at a speed of 800 rpm. The temperature during cultivation was maintained at +30°C. The pH was maintained at 6.8 by adding 1 M NaOH. The cultivation medium contained 0.23 g/l KH 2 PO 4 , 0.29 g/l Na 2 HPO 4 • 2 H 2 O, 0.005 g/l NaVO 3 • H 2 O, 0.2 g/l FeSO 4 • 7H2O, 0.5 g/l MgSO 4 • 7H 2 O, 0.01 g/l CaSO 4 , 0.00015 g/l Na 2 MoO 4 • 2H 2 O, 0.005 g/l MnSO 4 , 0.0005 g /l ZnSO 4 • 7H 2 O, 0.0015 g/l H 3 BO 3 , 0.001 g/l CoSO 4 , 0.00005 g/l CuSO 4 and 0.0001 g/l NiSO 4 , and was prepared on tap water. In addition, the nitrogen source of the culture was varied such that four cultures contained 18.7 mM NH 4 OH, four cultures contained 9.34 mM urea (OC(NH 2 ) 2 ), four cultures contained 18.7 mM potassium nitrate (KNO3 ) and three cultivations were left without a source of nitrogen in the medium. As the main source of energy and carbon, a mixture containing 22 ml/min of hydrogen gas, 3.2 ml/min of air and 6.4 ml/min of carbon dioxide gas was continuously supplied. Thus, along with air, all cultivation options also received nitrogen gas. Growth was monitored by automatically sampling and analyzing cell density as optical density by measuring absorbance at 600 nm (UV/visible light spectrophotometer Ultrospec 2100 pro, Biochrom Ltd., England). Cultivation growth curves are shown in Fig. 2. Growth on ammonia and urea was comparable. Growth on nitrate or nitrogen gas was clearly slower than on ammonia or urea. By the end of cultivation, growth on nitrates was better than on nitrogen gas as the only nitrogen source. However, growth was also observed in cultures in which nitrogen gas was the only source of nitrogen, demonstrating that the isolated bacterial strain deposited as VTT-E-193585 is capable of nitrogen fixation.
Пример 4. Характеристика чувствительности к антибиотикамExample 4: Antibiotic Susceptibility Characterization
Чувствительность к антибиотикам гентамицин, канамицин, стрептомицин, тетрациклин, ампициллин, ципрофлоксацин, колистин и фосфомицин для выделенного бактериального штамма, депонированного как VTT-E-193585, анализировали в соответствии со стандартом CLSI M07-A111 (Институт клинических и лабораторных стандартов. Методы разбавления в тестах на чувствительность к противомикробным препаратам для бактерий, которые растут в аэробных условиях, 11-е изд., стандарт CLSI M07, 2018 г.) с приготовленной авторами настоящего изобретения чашкой для микроразведений для ампициллина, ципрофлоксацина и колистина, с чашкой VetMIC Lact-1 (Национальный ветеринарный институт SVA, Уппсала, Швеция) для гентамицина, канамицина, стрептомицина и тетрациклина с использованием метода микроразведения в бульоне и фосфомицина с использованием метода разведения в агаре в аэробных условиях при температуре +35 ± 2°C в течение 48 ± 1 часов с использованием бульона Мюллера-Хинтона с отрегулированным содержанием катионов (LabM, LAB114, катионы Mg2+ и Ca2+ добавляются отдельно). В качестве штамма контроля качества использовали штамм Escherichia coli ATCC 25922, который инкубировали в аэробных условиях при температуре +35 ± 2°C в течение 18 ± 2 часов. Результаты определения чувствительности штаммов к антибиотикам представлены в Таблице 5. Установлено, что выделенный бактериальный штамм в целом чувствителен к антибиотикам. Для гентамицина, канамицина, стрептомицина и тетрациклина значения минимальной ингибирующей концентрации (МИК) для VTT-E-193585 были ниже или сопоставимы с E. coli ATCC 25922, в то время как для ампициллина, ципрофлоксацина, колистина и фосфомицина значения МИК были выше для VTT-E-193585.Susceptibility to the antibiotics gentamicin, kanamycin, streptomycin, tetracycline, ampicillin, ciprofloxacin, colistin and fosfomycin for the isolated bacterial strain deposited as VTT-E-193585 was analyzed in accordance with CLSI standard M07-A111 (Clinical and Laboratory Standards Institute - Dilution methods in Antimicrobial Susceptibility Tests for Bacteria That Grow Under Aerobic Conditions, 11th Ed., CLSI Standard M07, 2018) with the microdilution plate prepared by the present inventors for ampicillin, ciprofloxacin and colistin, with the VetMIC Lact-1 plate (National Veterinary Institute SVA, Uppsala, Sweden) for gentamicin, kanamycin, streptomycin and tetracycline using the broth microdilution method and fosfomycin using the agar dilution method under aerobic conditions at +35 ± 2°C for 48 ± 1 hours s using Mueller-Hinton broth with adjusted cation content (LabM, LAB114, Mg2+ and Ca2+ cations added separately). Escherichia coli strain ATCC 25922 was used as a quality control strain, which was incubated under aerobic conditions at a temperature of +35 ± 2°C for 18 ± 2 hours. The results of determining the sensitivity of strains to antibiotics are presented in Table 5. It was found that the isolated bacterial strain is generally sensitive to antibiotics. For gentamicin, kanamycin, streptomycin, and tetracycline, the minimum inhibitory concentration (MIC) values for VTT-E-193585 were lower than or comparable to E. coli ATCC 25922, while for ampicillin, ciprofloxacin, colistin, and fosfomycin, the MIC values were higher for VTT -E-193585.
Таблица 5. Минимальная ингибирующая концентрация (МИК, мкг/мл) – значения антибиотиков для штамма VTT-E-193585 и Table 5. Minimum inhibitory concentration (MIC, µg/ml) - antibiotic values for strain VTT-E-193585 and Escherichia coliEscherichia coli ATCC 25922 ATCC 25922
48 ч ± 1 ч48 h ± 1 h
18 ч ± 2 ч18 h ± 2 h
--->--->
ПЕРЕЧЕНЬ ПОСЛЕДОВАТЕЛЬНОСТЕЙ LIST OF SEQUENCES
<110> СОЛАР ФУДС ОЙ <110> SOLAR FOODS OH
<120> ШТАММЫ И СПОСОБЫ ДЛЯ ПОЛУЧЕНИЯ БЕЛКА ОДНОКЛЕТОЧНЫХ ОРГАНИЗМОВ ИЛИ БИОМАССЫ <120> STRAINS AND METHODS FOR OBTAINING PROTEIN OF SINGLE-CELLULAR ORGANISMS OR BIOMASS
<130> Solar002<130> Solar002
<160> 59 <160> 59
<170> Патентная версия 3.5<170> Patent version 3.5
<210> 1<210> 1
<211> 1482<211> 1482
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 1<400> 1
cttgagagtt tgatcctggc tcagagcgaa cgctggcggc aggcctaaca catgcaagtc 60cttgagagtt tgatcctggc tcagagcgaa cgctggcggc aggcctaaca catgcaagtc 60
gagcgcccag caatgggagc ggcagacggg tgagtaacgc gtggggatgt gcccaatggt 120gagcgcccag caatgggagc ggcagacggg tgagtaacgc gtggggatgt gcccaatggt 120
acggaataac ccagggaaac ttggactaat accgtatgag cccttcgggg gaaagattta 180acggaataac ccagggaaac ttggactaat accgtatgag cccttcgggg gaaagatta 180
tcgccattgg atcaacccgc gtctgattag ctagttggtg gggtaacggc ccaccaaggc 240tcgccattgg atcaacccgc gtctgattag ctagttggtg gggtaacggc ccaccaaggc 240
gacgatcagt agctggtctg agaggatgat cagccacact gggactgaga cacggcccag 300gacgatcagt agctggtctg agaggatgat cagccacact gggactgaga cacggcccag 300
actcctacgg gaggcagcag tggggaatat tggacaatgg gcgcaagcct gatccagcca 360actcctacgg gaggcagcag tggggaatat tggacaatgg gcgcaagcct gatccagcca 360
tgccgcgtgt gtgatgaagg ccttagggtt gtaaagcact ttcgccggtg aagataatga 420tgccgcgtgt gtgatgaagg ccttagggtt gtaaagcact ttcgccggtg aagataatga 420
cggtaaccgg agaagaagcc ccggctaact tcgtgccagc agccgcggta atacgaaggg 480cggtaaccgg agaagaagcc ccggctaact tcgtgccagc agccgcggta atacgaaggg 480
ggctagcgtt gctcggaatc actgggcgta aagcgcacgt aggcggatcg ttaagtcagg 540ggctagcgtt gctcggaatc actgggcgta aagcgcacgt aggcggatcg ttaagtcagg 540
ggtgaaatcc tggagctcaa ctccagaact gcccttgata ctggcgacct tgagttcgag 600ggtgaaatcc tggagctcaa ctccagaact gcccttgata ctggcgacct tgagttcgag 600
agaggttggt ggaactgcga gtgtagaggt gaaattcgta gatattcgca agaacaccag 660agaggttggt ggaactgcga gtgtagaggt gaaattcgta gatattcgca agaacaccag 660
tggcgaaggc ggccaactgg ctcgatactg acgctgaggt gcgaaagcgt ggggagcaaa 720tggcgaaggc ggccaactgg ctcgatactg acgctgaggt gcgaaagcgt ggggagcaaa 720
caggattaga taccctggta gtccacgccg taaacgatgg atgctagccg ttgggcagct 780caggattaga taccctggta gtccacgccg taaacgatgg atgctagccg ttgggcagct 780
tgctgttcag tggcgcagct aacgcattaa gcatcccgcc tggggagtac ggtcgcaaga 840tgctgttcag tggcgcagct aacgcattaa gcatcccgcc tggggagtac ggtcgcaaga 840
ttaaaactca aaggaattga cgggggcccg cacaagcggt ggagcatgtg gtttaattcg 900ttaaaactca aaggaattga cgggggcccg cacaagcggt ggagcatgtg gtttaattcg 900
aagcaacgcg cagaacctta ccagcctttg acatggcagg acgatttcca gagatggatc 960aagcaacgcg cagaacctta ccagcctttg acatggcagg acgatttcca gagatggatc 960
tcttccagca atggacctgc acacaggtgc tgcatggctg tcgtcagctc gtgtcgtgag 1020tcttccagca atggacctgc acacaggtgc tgcatggctg tcgtcagctc gtgtcgtgag 1020
atgttgggtt aagtcccgca acgagcgcaa ccctcgcctc tagttgccag cattcagttg 1080atgttgggtt aagtcccgca acgagcgcaa ccctcgcctc tagttgccag cattcagttg 1080
ggcactctag agggactgcc ggtgataagc cgagaggaag gtggggatga cgtcaagtcc 1140ggcactctag agggactgcc ggtgataagc cgagaggaag gtggggatga cgtcaagtcc 1140
tcatggccct tacgggctgg gctacacacg tgctacaatg gtggtgacag tgggatgcga 1200tcatggccct tacgggctgg gctacacacg tgctacaatg gtggtgacag tgggatgcga 1200
aagggcgacc tctagcaaat ctccaaaagc catctcagtt cggattgtac tctgcaactc 1260aagggcgacc tctagcaaat ctccaaaagc catctcagtt cggattgtac tctgcaactc 1260
gagtgcatga agttggaatc gctagtaatc gtggatcagc atgccacggt gaatacgttc 1320gagtgcatga agttggaatc gctagtaatc gtggatcagc atgccacggt gaatacgttc 1320
ccgggccttg tacacaccgc ccgtcacacc atgggagttg gctttacccg aaggcgctgc 1380ccgggccttg tacacaccgc ccgtcacacc atgggagttg gctttacccg aaggcgctgc 1380
gctaacccgc aagggaggca ggcgaccacg gtagggtcag cgactggggt gaagtcgtaa 1440gctaacccgc aagggaggca ggcgaccacg gtagggtcag cgactggggt gaagtcgtaa 1440
caaggtagcc gtaggggaac ctgcggctgg atcacctcct tt 1482caaggtagcc gtaggggaac ctgcggctgg atcacctcct tt 1482
<210> 2<210> 2
<211> 1467<211> 1467
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 2<400> 2
atgggtgccg aagcaaccgt cgggcagatc acggacgcca agaagagata cgccgccggc 60atgggtgccg aagcaaccgt cgggcagatc acggacgcca agaagagata cgccgccggc 60
gtgctgaagt acgcccagat gggctactgg aacggcgact acgttcccaa ggacaccgac 120gtgctgaagt acgcccagat gggctactgg aacggcgact acgttcccaa ggacaccgac 120
ctcctggcgg tgttccgcat caccccccag gcgggcgtgg acccggtgga agccgccgcg 180ctcctggcgg tgttccgcat caccccccag gcgggcgtgg acccggtgga agccgccgcg 180
gcggtcgccg gcgaaagctc caccgctacc tggaccgtgg tgtggaccga ccggctcacc 240gcggtcgccg gcgaaagctc caccgctacc tggaccgtgg tgtggaccga ccggctcacc 240
gccgccgacg tctaccgcgc caaggcctac aaggtggagc cggtgccggg ccaggaaggc 300gccgccgacg tctaccgcgc caaggcctac aaggtggagc cggtgccggg ccaggaaggc 300
cagtatttct gctacatcgc ctatgatctc gatttgttcg aggaaggctc catcgccaac 360cagtatttct gctacatcgc ctatgatctc gatttgttcg aggaaggctc catcgccaac 360
ctcacggcgt cgatcatcgg caacgtcttc tccttcaagc cgctgaaggc ggcgcggctg 420ctcacggcgt cgatcatcgg caacgtcttc tccttcaagc cgctgaaggc ggcgcggctg 420
gaggacatgc ggcttcccgt cgcctatgtg aagaccttcc gcggcccgcc caccggcatc 480gaggacatgc ggcttcccgt cgcctatgtg aagaccttcc gcggcccgcc caccggcatc 480
gtggtcgagc gcgagcgcct ggacaagttc ggccgccccc ttctgggcgc caccaccaag 540gtggtcgagc gcgagcgcct ggacaagttc ggccgccccc ttctgggcgc caccaccaag 540
ccgaagcttg gcctctcggg caagaattac ggccgcgtgg tctatgaggc cctcaagggc 600ccgaagcttg gcctctcggg caagaattac ggccgcgtgg tctatgaggc cctcaagggc 600
ggcctcgact tcgtgaagga cgacgagaac atcaactcgc agcccttcat gcactggcgc 660ggcctcgact tcgtgaagga cgacgagaac atcaactcgc agcccttcat gcactggcgc 660
gatcgcttcc tctattgcat ggaggccgtc aacaaggccc aggccgagac cggcgaggtg 720gatcgcttcc tctattgcat ggaggccgtc aacaaggccc aggccgagac cggcgaggtg 720
aaggggcact atctcaacat caccgccggg accatggagg agatgtaccg ccgcgccgag 780aaggggcact atctcaacat caccgccggg accatggagg agatgtaccg ccgcgccgag 780
ttcgccaagg aactgggctc cgtggtggtg atggtggatc tcatcatcgg ctggaccgcc 840ttcgccaagg aactgggctc cgtggtggtg atggtggatc tcatcatcgg ctggaccgcc 840
atccagtcca tgtccaactg gtgccgcgag aacgacatga tcctgcacat gcaccgtgcg 900atccagtcca tgtccaactg gtgccgcgag aacgacatga tcctgcacat gcaccgtgcg 900
ggccatggca cctacacgcg ccagaagagc cacggcgtct ccttccgcgt catcgccaag 960ggccatggca cctacacgcg ccagaagagc cacggcgtct ccttccgcgt catcgccaag 960
tggctgcggc tcgccggcgt cgaccacctg cacaccggca ccgccgtggg caagctggaa 1020tggctgcggc tcgccggcgt cgaccacctg cacaccggca ccgccgtggg caagctggaa 1020
ggcgacccca tgaccgtgca gggcttctac aatgtctgcc gcgagacgac gacgcagcag 1080ggcgacccca tgaccgtgca gggcttctac aatgtctgcc gcgagacgac gacgcagcag 1080
gacctcaccc gcggcctgtt cttcgagcag gactggggcg gcatccgcaa ggtgatgccg 1140gacctcaccc gcggcctgtt cttcgagcag gactggggcg gcatccgcaa ggtgatgccg 1140
gtggcctccg gcggcatcca tgcgggccag atgcaccagc tcatcgacct gttcggcgag 1200gtggcctccg gcggcatcca tgcgggccag atgcaccagc tcatcgacct gttcggcgag 1200
gacgtggtgc tccagttcgg cggcggcacc atcggccacc cggacggcat ccaggccggc 1260gacgtggtgc tccagttcgg cggcggcacc atcggccacc cggacggcat ccaggccggc 1260
gccaccgcca accgcgtggc gctggaaacc atgatcctcg cccgcaacga gggccgcgac 1320gccaccgcca accgcgtggc gctggaaacc atgatcctcg cccgcaacga gggccgcgac 1320
atcaggaacg agggcccgga aatcctggtg gaagccgcca aatggtgccg tccgctgcgc 1380atcaggaacg agggcccgga aatcctggtg gaagccgcca aatggtgccg tccgctgcgc 1380
gcggcgctcg atacctgggg cgaggtgacc ttcaactacg cctccaccga cacgtccgat 1440gcggcgctcg atacctgggg cgaggtgacc ttcaactacg cctccaccga cacgtccgat 1440
tacgtgccca ccgcgtccgt cgcctga 1467tacgtgccca ccgcgtccgt cgcctga 1467
<210> 3<210> 3
<211> 488<211> 488
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 3<400> 3
Met Gly Ala Glu Ala Thr Val Gly Gln Ile Thr Asp Ala Lys Lys Arg Met Gly Ala Glu Ala Thr Val Gly Gln Ile Thr Asp Ala Lys Lys Arg
1 5 10 15 1 5 10 15
Tyr Ala Ala Gly Val Leu Lys Tyr Ala Gln Met Gly Tyr Trp Asn Gly Tyr Ala Ala Gly Val Leu Lys Tyr Ala Gln Met Gly Tyr Trp Asn Gly
20 25 30 20 25 30
Asp Tyr Val Pro Lys Asp Thr Asp Leu Leu Ala Val Phe Arg Ile Thr Asp Tyr Val Pro Lys Asp Thr Asp Leu Leu Ala Val Phe Arg Ile Thr
35 40 45 35 40 45
Pro Gln Ala Gly Val Asp Pro Val Glu Ala Ala Ala Ala Val Ala Gly Pro Gln Ala Gly Val Asp Pro Val Glu Ala Ala Ala Ala Val Ala Gly
50 55 60 50 55 60
Glu Ser Ser Thr Ala Thr Trp Thr Val Val Trp Thr Asp Arg Leu Thr Glu Ser Ser Thr Ala Thr Trp Thr Val Val Trp Thr Asp Arg Leu Thr
65 70 75 80 65 70 75 80
Ala Ala Asp Val Tyr Arg Ala Lys Ala Tyr Lys Val Glu Pro Val Pro Ala Ala Asp Val Tyr Arg Ala Lys Ala Tyr Lys Val Glu Pro Val Pro
85 90 95 85 90 95
Gly Gln Glu Gly Gln Tyr Phe Cys Tyr Ile Ala Tyr Asp Leu Asp Leu Gly Gln Glu Gly Gln Tyr Phe Cys Tyr Ile Ala Tyr Asp Leu Asp Leu
100 105 110 100 105 110
Phe Glu Glu Gly Ser Ile Ala Asn Leu Thr Ala Ser Ile Ile Gly Asn Phe Glu Glu Gly Ser Ile Ala Asn Leu Thr Ala Ser Ile Ile Gly Asn
115 120 125 115 120 125
Val Phe Ser Phe Lys Pro Leu Lys Ala Ala Arg Leu Glu Asp Met Arg Val Phe Ser Phe Lys Pro Leu Lys Ala Ala Arg Leu Glu Asp Met Arg
130 135 140 130 135 140
Leu Pro Val Ala Tyr Val Lys Thr Phe Arg Gly Pro Pro Thr Gly Ile Leu Pro Val Ala Tyr Val Lys Thr Phe Arg Gly Pro Pro Thr Gly Ile
145 150 155 160 145 150 155 160
Val Val Glu Arg Glu Arg Leu Asp Lys Phe Gly Arg Pro Leu Leu Gly Val Val Glu Arg Glu Arg Leu Asp Lys Phe Gly Arg Pro Leu Leu Gly
165 170 175 165 170 175
Ala Thr Thr Lys Pro Lys Leu Gly Leu Ser Gly Lys Asn Tyr Gly Arg Ala Thr Thr Lys Pro Lys Leu Gly Leu Ser Gly Lys Asn Tyr Gly Arg
180 185 190 180 185 190
Val Val Tyr Glu Ala Leu Lys Gly Gly Leu Asp Phe Val Lys Asp Asp Val Val Tyr Glu Ala Leu Lys Gly Gly Leu Asp Phe Val Lys Asp Asp
195 200 205 195 200 205
Glu Asn Ile Asn Ser Gln Pro Phe Met His Trp Arg Asp Arg Phe Leu Glu Asn Ile Asn Ser Gln Pro Phe Met His Trp Arg Asp Arg Phe Leu
210 215 220 210 215 220
Tyr Cys Met Glu Ala Val Asn Lys Ala Gln Ala Glu Thr Gly Glu Val Tyr Cys Met Glu Ala Val Asn Lys Ala Gln Ala Glu Thr Gly Glu Val
225 230 235 240 225 230 235 240
Lys Gly His Tyr Leu Asn Ile Thr Ala Gly Thr Met Glu Glu Met Tyr Lys Gly His Tyr Leu Asn Ile Thr Ala Gly Thr Met Glu Glu Met Tyr
245 250 255 245 250 255
Arg Arg Ala Glu Phe Ala Lys Glu Leu Gly Ser Val Val Val Met Val Arg Arg Ala Glu Phe Ala Lys Glu Leu Gly Ser Val Val Val Met Val
260 265 270 260 265 270
Asp Leu Ile Ile Gly Trp Thr Ala Ile Gln Ser Met Ser Asn Trp Cys Asp Leu Ile Ile Gly Trp Thr Ala Ile Gln Ser Met Ser Asn Trp Cys
275 280 285 275 280 285
Arg Glu Asn Asp Met Ile Leu His Met His Arg Ala Gly His Gly Thr Arg Glu Asn Asp Met Ile Leu His Met His Arg Ala Gly His Gly Thr
290 295 300 290 295 300
Tyr Thr Arg Gln Lys Ser His Gly Val Ser Phe Arg Val Ile Ala Lys Tyr Thr Arg Gln Lys Ser His Gly Val Ser Phe Arg Val Ile Ala Lys
305 310 315 320 305 310 315 320
Trp Leu Arg Leu Ala Gly Val Asp His Leu His Thr Gly Thr Ala Val Trp Leu Arg Leu Ala Gly Val Asp His Leu His Thr Gly Thr Ala Val
325 330 335 325 330 335
Gly Lys Leu Glu Gly Asp Pro Met Thr Val Gln Gly Phe Tyr Asn Val Gly Lys Leu Glu Gly Asp Pro Met Thr Val Gln Gly Phe Tyr Asn Val
340 345 350 340 345 350
Cys Arg Glu Thr Thr Thr Gln Gln Asp Leu Thr Arg Gly Leu Phe Phe Cys Arg Glu Thr Thr Thr Gln Gln Asp Leu Thr Arg Gly Leu Phe Phe
355 360 365 355 360 365
Glu Gln Asp Trp Gly Gly Ile Arg Lys Val Met Pro Val Ala Ser Gly Glu Gln Asp Trp Gly Gly Ile Arg Lys Val Met Pro Val Ala Ser Gly
370 375 380 370 375 380
Gly Ile His Ala Gly Gln Met His Gln Leu Ile Asp Leu Phe Gly Glu Gly Ile His Ala Gly Gln Met His Gln Leu Ile Asp Leu Phe Gly Glu
385 390 395 400 385 390 395 400
Asp Val Val Leu Gln Phe Gly Gly Gly Thr Ile Gly His Pro Asp Gly Asp Val Val Leu Gln Phe Gly Gly Gly Thr Ile Gly His Pro Asp Gly
405 410 415 405 410 415
Ile Gln Ala Gly Ala Thr Ala Asn Arg Val Ala Leu Glu Thr Met Ile Ile Gln Ala Gly Ala Thr Ala Asn Arg Val Ala Leu Glu Thr Met Ile
420 425 430 420 425 430
Leu Ala Arg Asn Glu Gly Arg Asp Ile Arg Asn Glu Gly Pro Glu Ile Leu Ala Arg Asn Glu Gly Arg Asp Ile Arg Asn Glu Gly Pro Glu Ile
435 440 445 435 440 445
Leu Val Glu Ala Ala Lys Trp Cys Arg Pro Leu Arg Ala Ala Leu Asp Leu Val Glu Ala Ala Lys Trp Cys Arg Pro Leu Arg Ala Ala Leu Asp
450 455 460 450 455 460
Thr Trp Gly Glu Val Thr Phe Asn Tyr Ala Ser Thr Asp Thr Ser Asp Thr Trp Gly Glu Val Thr Phe Asn Tyr Ala Ser Thr Asp Thr Ser Asp
465 470 475 480 465 470 475 480
Tyr Val Pro Thr Ala Ser Val Ala Tyr Val Pro Thr Ala Ser Val Ala
485 485
<210> 4<210> 4
<211> 405<211> 405
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 4<400> 4
atgcgcatca cccaaggctc cttctccttc ctgccggacc tcaccgacac gcagatcaag 60atgcgcatca cccaaggctc cttctccttc ctgccggacc tcaccgacac gcagatcaag 60
gcccaggtgc aatattgcct ggaccagggc tgggcggtct cggtggagca caccgacgat 120gcccaggtgc aatattgcct ggaccagggc tgggcggtct cggtggagca caccgacgat 120
ccccacccgc gcaacaccta ttgggagatg tggggcccgc ccatgttcga tctgcgcgac 180ccccacccgc gcaacaccta ttgggagatg tggggcccgc ccatgttcga tctgcgcgac 180
gcggccggcg tcttcggcga gatcgaagcc tgccgggccg ccaatcccga gcattatgtg 240gcggccggcg tcttcggcga gatcgaagcc tgccgggccg ccaatcccga gcattatgtg 240
cgggtgaacg ccttcgattc cagccgcgga tgggagacga tccgcctgtc cttcatcgtt 300cgggtgaacg ccttcgattc cagccgcgga tgggagacga tccgcctgtc cttcatcgtt 300
cagcggccca ccgtggaaga gggcttccgc ctcgaccgca ccgaaggcaa gggccgcaac 360cagcggccca ccgtggaaga gggcttccgc ctcgaccgca ccgaaggcaa gggccgcaac 360
cagagctacg ccatgcgcta ccgggcgcag ttcgcgccgc gctga 405cagagctacg ccatgcgcta ccgggcgcag ttcgcgccgc gctga 405
<210> 5<210> 5
<211> 134<211> 134
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 5<400> 5
Met Arg Ile Thr Gln Gly Ser Phe Ser Phe Leu Pro Asp Leu Thr Asp Met Arg Ile Thr Gln Gly Ser Phe Ser Phe Leu Pro Asp Leu Thr Asp
1 5 10 15 1 5 10 15
Thr Gln Ile Lys Ala Gln Val Gln Tyr Cys Leu Asp Gln Gly Trp Ala Thr Gln Ile Lys Ala Gln Val Gln Tyr Cys Leu Asp Gln Gly Trp Ala
20 25 30 20 25 30
Val Ser Val Glu His Thr Asp Asp Pro His Pro Arg Asn Thr Tyr Trp Val Ser Val Glu His Thr Asp Asp Pro His Pro Arg Asn Thr Tyr Trp
35 40 45 35 40 45
Glu Met Trp Gly Pro Pro Met Phe Asp Leu Arg Asp Ala Ala Gly Val Glu Met Trp Gly Pro Pro Met Phe Asp Leu Arg Asp Ala Ala Gly Val
50 55 60 50 55 60
Phe Gly Glu Ile Glu Ala Cys Arg Ala Ala Asn Pro Glu His Tyr Val Phe Gly Glu Ile Glu Ala Cys Arg Ala Ala Asn Pro Glu His Tyr Val
65 70 75 80 65 70 75 80
Arg Val Asn Ala Phe Asp Ser Ser Arg Gly Trp Glu Thr Ile Arg Leu Arg Val Asn Ala Phe Asp Ser Ser Arg Gly Trp Glu Thr Ile Arg Leu
85 90 95 85 90 95
Ser Phe Ile Val Gln Arg Pro Thr Val Glu Glu Gly Phe Arg Leu Asp Ser Phe Ile Val Gln Arg Pro Thr Val Glu Glu Gly Phe Arg Leu Asp
100 105 110 100 105 110
Arg Thr Glu Gly Lys Gly Arg Asn Gln Ser Tyr Ala Met Arg Tyr Arg Arg Thr Glu Gly Lys Gly Arg Asn Gln Ser Tyr Ala Met Arg Tyr Arg
115 120 125 115 120 125
Ala Gln Phe Ala Pro Arg Ala Gln Phe Ala Pro Arg
130 130
<210> 6<210> 6
<211> 1896<211> 1896
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 6<400> 6
atgatgccat ctgagccgca cggcgcgggc atgccgcccc cacgggaagc ggccgcggtt 60atgatgccat ctgagccgca cggcgcgggc atgccgcccc cacgggaagc ggccgcggtt 60
cccacccccc aggaggtgag cgcggtggtg gccgaggtgg tcgcggatgc cgtggcatcg 120cccacccccc aggaggtgag cgcggtggtg gccgaggtgg tcgcggatgc cgtggcatcg 120
gtgggcggcg cacgcacccg gctcatggac atcgtccagc tggcccagca gcgtctcggc 180gtgggcggcg cacgcacccg gctcatggac atcgtccagc tggcccagca gcgtctcggc 180
catctctccg aagagaccat ggcggccatt gccgcgcggc tcgccattcc gccggtggaa 240catctctccg aagagaccat ggcggccatt gccgcgcggc tcgccattcc gccggtggaa 240
gtggcggaca tggtgtcctt ctacgccttc ctgaaccgcg cgcccaaggg ccgctaccac 300gtggcggaca tggtgtcctt ctacgccttc ctgaaccgcg cgcccaaggg ccgctaccac 300
atccgcctgt cgcgcagccc catctcgctg atgaagggcg ccgaggcggt ggctgccgcc 360atccgcctgt cgcgcagccc catctcgctg atgaagggcg ccgaggcggt ggctgccgcc 360
ttctgccaga tcctcggcat cgccatgggc gagacctcgc aggatggcga cttcaccctg 420ttctgccaga tcctcggcat cgccatgggc gagacctcgc aggatggcga cttcaccctg 420
gaatggacca acgacatcgg catggccgac caggagccgg ccgccctcgt caacggcacg 480gaatggacca acgacatcgg catggccgac caggagccgg ccgccctcgt caacggcacg 480
gtgatgacgc agctcgcgcc cggcgatgcg gccatcatcg tcggccggct gcgggcccat 540gtgatgacgc agctcgcgcc cggcgatgcg gccatcatcg tcggccggct gcgggcccat 540
cacgcgccca atgccctgcc gctgttccct ggagccggcg tggccggctc cggcctgccc 600cacgcgccca atgccctgcc gctgttccct ggagccggcg tggccggctc cggcctgccc 600
catgcccgga tccgccccag cctggtgatg ccgggacagc ttctgttccg cgaggaccac 660catgcccgga tccgccccag cctggtgatg ccgggacagc ttctgttccg cgaggaccac 660
acgacgccgg gcgccggcat caaggcggca ctcgccctca ccccggacga agtggtgcag 720acgacgccgg gcgccggcat caaggcggca ctcgccctca ccccggacga agtggtgcag 720
aaggtctccg ccgcgcgcct gcgcgggcgg ggtggcgccg gctttcccac cggtctcaaa 780aaggtctccg ccgcgcgcct gcgcgggcgg ggtggcgccg gctttcccac cggtctcaaa 780
tggaagctct gccgccagtc gcccgccacc acccgccatg tgatctgcaa tgcggacgag 840tggaagctct gccgccagtc gcccgccacc acccgccatg tgatctgcaa tgcggacgag 840
ggcgagcccg gcaccttcaa ggatcgcgtg ctgctcacgc aggcgccgca cctcatgttc 900ggcgagcccg gcaccttcaa ggatcgcgtg ctgctcacgc aggcgccgca cctcatgttc 900
gacggcatga ccatcgccgg ctacgccttg ggggcgcggg agggcgtggt ctatctgcgc 960gacggcatga ccatcgccgg ctacgccttg ggggcgcggg agggcgtggt ctatctgcgc 960
ggcgagtacg cctatctgtg ggagcctctg catgcggtcc tgcgcgagcg ctatgggctc 1020ggcgagtacg cctatctgtg ggagcctctg catgcggtcc tgcgcgagcg ctatgggctc 1020
gggctcgccg gcgcgaacat cctgggacac gcgggcttcg acttcgacat ccgcatccag 1080gggctcgccg gcgcgaacat cctgggacac gcgggcttcg acttcgacat ccgcatccag 1080
ctgggcgccg gcgcctatat ctgcggcgag gaatccgcgc tggtggaatc gctggaaggc 1140ctgggcgccg gcgcctatat ctgcggcgag gaatccgcgc tggtggaatc gctggaaggc 1140
aagcgcggct cgccccgcga ccgccccccc ttccccaccg tgcgcggcca tctccagcag 1200aagcgcggct cgccccgcga ccgccccccc ttccccaccg tgcgcggcca tctccagcag 1200
cccaccgccg tggacaatgt ggagaccttc gcctgcgccg cccgcatcct ggaggatggc 1260cccaccgccg tggacaatgt ggagaccttc gcctgcgccg cccgcatcct ggagatggc 1260
gtggaggcgt tcgcgggcat cggcacgccc gaatccgccg gcacgaagct cctctcggtg 1320gtggaggcgt tcgcgggcat cggcacgccc gaatccgccg gcacgaagct cctctcggtg 1320
tcgggcgatt gcccgcgccc cggcgtgtat gaggtgccct tcggcctcac ggtgaacgcg 1380tcgggcgatt gcccgcgccc cggcgtgtat gaggtgccct tcggcctcac ggtgaacgcg 1380
ctgctcgacc ttgtcggcgc gccggacgcc gccttcgtgc agatgggtgg gccgtccggc 1440ctgctcgacc ttgtcggcgc gccggacgcc gccttcgtgc agatgggtgg gccgtccggc 1440
caatgcgtgg cgccgaagga ttacggccgc cgcatcgcct tcgaggacct gcccaccggc 1500caatgcgtgg cgccgaagga ttacggccgc cgcatcgcct tcgaggacct gcccaccggc 1500
ggctcggtga tggtgttcgg cccggggcgc gacgtgctcg ccatggtgcg cgagttcgcg 1560ggctcggtga tggtgttcgg cccggggcgc gacgtgctcg ccatggtgcg cgagttcgcg 1560
gatttcttcg ccggcgaatc ctgcggctgg tgcacgccct gccgggtggg caccaccttg 1620gatttcttcg ccggcgaatc ctgcggctgg tgcacgccct gccgggtggg caccaccttg 1620
ctcaaggaag agctggacaa gctcctcgcc aaccgcgcca ccctcgccga catccgcgcg 1680ctcaaggaag agctggacaa gctcctcgcc aaccgcgcca ccctcgccga catccgcgcg 1680
ctggagaccc tggccacgac cgtctcccgc accagccgct gcggcctcgg ccagacggcg 1740ctggagaccc tggccacgac cgtctcccgc accagccgct gcggcctcgg cgacggcg 1740
cccaacccca tcctttccac catgcgcaac ctgccggaag cctatgaggc gaggctgagg 1800cccaacccca tcctttccac catgcgcaac ctgccggaag cctatgaggc gaggctgagg 1800
cccgaagact tcctgccctg ggcctcgctc gacgaggcgc tgaagcccgc catcgtcatc 1860cccgaagact tcctgccctg ggcctcgctc gacgaggcgc tgaagcccgc catcgtcatc 1860
cagggccgcg cgcccgtgcc ggaggaagag gcatga 1896cagggccgcg cgcccgtgcc gggaggaagag gcatga 1896
<210> 7<210> 7
<211> 631<211> 631
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 7<400> 7
Met Met Pro Ser Glu Pro His Gly Ala Gly Met Pro Pro Pro Arg Glu Met Met Pro Ser Glu Pro His Gly Ala Gly Met Pro Pro Pro Arg Glu
1 5 10 15 1 5 10 15
Ala Ala Ala Val Pro Thr Pro Gln Glu Val Ser Ala Val Val Ala Glu Ala Ala Ala Val Pro Thr Pro Gln Glu Val Ser Ala Val Val Ala Glu
20 25 30 20 25 30
Val Val Ala Asp Ala Val Ala Ser Val Gly Gly Ala Arg Thr Arg Leu Val Val Ala Asp Ala Val Ala Ser Val Gly Gly Ala Arg Thr Arg Leu
35 40 45 35 40 45
Met Asp Ile Val Gln Leu Ala Gln Gln Arg Leu Gly His Leu Ser Glu Met Asp Ile Val Gln Leu Ala Gln Gln Arg Leu Gly His Leu Ser Glu
50 55 60 50 55 60
Glu Thr Met Ala Ala Ile Ala Ala Arg Leu Ala Ile Pro Pro Val Glu Glu Thr Met Ala Ala Ile Ala Ala Arg Leu Ala Ile Pro Pro Val Glu
65 70 75 80 65 70 75 80
Val Ala Asp Met Val Ser Phe Tyr Ala Phe Leu Asn Arg Ala Pro Lys Val Ala Asp Met Val Ser Phe Tyr Ala Phe Leu Asn Arg Ala Pro Lys
85 90 95 85 90 95
Gly Arg Tyr His Ile Arg Leu Ser Arg Ser Pro Ile Ser Leu Met Lys Gly Arg Tyr His Ile Arg Leu Ser Arg Ser Pro Ile Ser Leu Met Lys
100 105 110 100 105 110
Gly Ala Glu Ala Val Ala Ala Ala Phe Cys Gln Ile Leu Gly Ile Ala Gly Ala Glu Ala Val Ala Ala Ala Phe Cys Gln Ile Leu Gly Ile Ala
115 120 125 115 120 125
Met Gly Glu Thr Ser Gln Asp Gly Asp Phe Thr Leu Glu Trp Thr Asn Met Gly Glu Thr Ser Gln Asp Gly Asp Phe Thr Leu Glu Trp Thr Asn
130 135 140 130 135 140
Asp Ile Gly Met Ala Asp Gln Glu Pro Ala Ala Leu Val Asn Gly Thr Asp Ile Gly Met Ala Asp Gln Glu Pro Ala Ala Leu Val Asn Gly Thr
145 150 155 160 145 150 155 160
Val Met Thr Gln Leu Ala Pro Gly Asp Ala Ala Ile Ile Val Gly Arg Val Met Thr Gln Leu Ala Pro Gly Asp Ala Ala Ile Ile Val Gly Arg
165 170 175 165 170 175
Leu Arg Ala His His Ala Pro Asn Ala Leu Pro Leu Phe Pro Gly Ala Leu Arg Ala His His Ala Pro Asn Ala Leu Pro Leu Phe Pro Gly Ala
180 185 190 180 185 190
Gly Val Ala Gly Ser Gly Leu Pro His Ala Arg Ile Arg Pro Ser Leu Gly Val Ala Gly Ser Gly Leu Pro His Ala Arg Ile Arg Pro Ser Leu
195 200 205 195 200 205
Val Met Pro Gly Gln Leu Leu Phe Arg Glu Asp His Thr Thr Pro Gly Val Met Pro Gly Gln Leu Leu Phe Arg Glu Asp His Thr Thr Pro Gly
210 215 220 210 215 220
Ala Gly Ile Lys Ala Ala Leu Ala Leu Thr Pro Asp Glu Val Val Gln Ala Gly Ile Lys Ala Ala Leu Ala Leu Thr Pro Asp Glu Val Val Gln
225 230 235 240 225 230 235 240
Lys Val Ser Ala Ala Arg Leu Arg Gly Arg Gly Gly Ala Gly Phe Pro Lys Val Ser Ala Ala Arg Leu Arg Gly Arg Gly Gly Ala Gly Phe Pro
245 250 255 245 250 255
Thr Gly Leu Lys Trp Lys Leu Cys Arg Gln Ser Pro Ala Thr Thr Arg Thr Gly Leu Lys Trp Lys Leu Cys Arg Gln Ser Pro Ala Thr Thr Arg
260 265 270 260 265 270
His Val Ile Cys Asn Ala Asp Glu Gly Glu Pro Gly Thr Phe Lys Asp His Val Ile Cys Asn Ala Asp Glu Gly Glu Pro Gly Thr Phe Lys Asp
275 280 285 275 280 285
Arg Val Leu Leu Thr Gln Ala Pro His Leu Met Phe Asp Gly Met Thr Arg Val Leu Leu Thr Gln Ala Pro His Leu Met Phe Asp Gly Met Thr
290 295 300 290 295 300
Ile Ala Gly Tyr Ala Leu Gly Ala Arg Glu Gly Val Val Tyr Leu Arg Ile Ala Gly Tyr Ala Leu Gly Ala Arg Glu Gly Val Val Tyr Leu Arg
305 310 315 320 305 310 315 320
Gly Glu Tyr Ala Tyr Leu Trp Glu Pro Leu His Ala Val Leu Arg Glu Gly Glu Tyr Ala Tyr Leu Trp Glu Pro Leu His Ala Val Leu Arg Glu
325 330 335 325 330 335
Arg Tyr Gly Leu Gly Leu Ala Gly Ala Asn Ile Leu Gly His Ala Gly Arg Tyr Gly Leu Gly Leu Ala Gly Ala Asn Ile Leu Gly His Ala Gly
340 345 350 340 345 350
Phe Asp Phe Asp Ile Arg Ile Gln Leu Gly Ala Gly Ala Tyr Ile Cys Phe Asp Phe Asp Ile Arg Ile Gln Leu Gly Ala Gly Ala Tyr Ile Cys
355 360 365 355 360 365
Gly Glu Glu Ser Ala Leu Val Glu Ser Leu Glu Gly Lys Arg Gly Ser Gly Glu Glu Ser Ala Leu Val Glu Ser Leu Glu Gly Lys Arg Gly Ser
370 375 380 370 375 380
Pro Arg Asp Arg Pro Pro Phe Pro Thr Val Arg Gly His Leu Gln Gln Pro Arg Asp Arg Pro Pro Phe Pro Thr Val Arg Gly His Leu Gln Gln
385 390 395 400 385 390 395 400
Pro Thr Ala Val Asp Asn Val Glu Thr Phe Ala Cys Ala Ala Arg Ile Pro Thr Ala Val Asp Asn Val Glu Thr Phe Ala Cys Ala Ala Arg Ile
405 410 415 405 410 415
Leu Glu Asp Gly Val Glu Ala Phe Ala Gly Ile Gly Thr Pro Glu Ser Leu Glu Asp Gly Val Glu Ala Phe Ala Gly Ile Gly Thr Pro Glu Ser
420 425 430 420 425 430
Ala Gly Thr Lys Leu Leu Ser Val Ser Gly Asp Cys Pro Arg Pro Gly Ala Gly Thr Lys Leu Leu Ser Val Ser Gly Asp Cys Pro Arg Pro Gly
435 440 445 435 440 445
Val Tyr Glu Val Pro Phe Gly Leu Thr Val Asn Ala Leu Leu Asp Leu Val Tyr Glu Val Pro Phe Gly Leu Thr Val Asn Ala Leu Leu Asp Leu
450 455 460 450 455 460
Val Gly Ala Pro Asp Ala Ala Phe Val Gln Met Gly Gly Pro Ser Gly Val Gly Ala Pro Asp Ala Ala Phe Val Gln Met Gly Gly Pro Ser Gly
465 470 475 480 465 470 475 480
Gln Cys Val Ala Pro Lys Asp Tyr Gly Arg Arg Ile Ala Phe Glu Asp Gln Cys Val Ala Pro Lys Asp Tyr Gly Arg Arg Ile Ala Phe Glu Asp
485 490 495 485 490 495
Leu Pro Thr Gly Gly Ser Val Met Val Phe Gly Pro Gly Arg Asp Val Leu Pro Thr Gly Gly Ser Val Met Val Phe Gly Pro Gly Arg Asp Val
500 505 510 500 505 510
Leu Ala Met Val Arg Glu Phe Ala Asp Phe Phe Ala Gly Glu Ser Cys Leu Ala Met Val Arg Glu Phe Ala Asp Phe Phe Ala Gly Glu Ser Cys
515 520 525 515 520 525
Gly Trp Cys Thr Pro Cys Arg Val Gly Thr Thr Leu Leu Lys Glu Glu Gly Trp Cys Thr Pro Cys Arg Val Gly Thr Thr Leu Leu Lys Glu Glu
530 535 540 530 535 540
Leu Asp Lys Leu Leu Ala Asn Arg Ala Thr Leu Ala Asp Ile Arg Ala Leu Asp Lys Leu Leu Ala Asn Arg Ala Thr Leu Ala Asp Ile Arg Ala
545 550 555 560 545 550 555 560
Leu Glu Thr Leu Ala Thr Thr Val Ser Arg Thr Ser Arg Cys Gly Leu Leu Glu Thr Leu Ala Thr Thr Val Ser Arg Thr Ser Arg Cys Gly Leu
565 570 575 565 570 575
Gly Gln Thr Ala Pro Asn Pro Ile Leu Ser Thr Met Arg Asn Leu Pro Gly Gln Thr Ala Pro Asn Pro Ile Leu Ser Thr Met Arg Asn Leu Pro
580 585 590 580 585 590
Glu Ala Tyr Glu Ala Arg Leu Arg Pro Glu Asp Phe Leu Pro Trp Ala Glu Ala Tyr Glu Ala Arg Leu Arg Pro Glu Asp Phe Leu Pro Trp Ala
595 600 605 595 600 605
Ser Leu Asp Glu Ala Leu Lys Pro Ala Ile Val Ile Gln Gly Arg Ala Ser Leu Asp Glu Ala Leu Lys Pro Ala Ile Val Ile Gln Gly Arg Ala
610 615 620 610 615 620
Pro Val Pro Glu Glu Glu Ala Pro Val Pro Glu Glu Glu Ala
625 630 625 630
<210> 8<210> 8
<211> 1491<211> 1491
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 8<400> 8
atgagccggg gatcccccga tgccgggaaa gaccgcacca tgagcgccac cgacggcacc 60atgagccggg gatcccccga tgccgggaaa gaccgcacca tgagcgccac cgacggcacc 60
accgcccccc gcaagatcgt catcgatccg gtgacccgcg tggagggcca cggcaaggtc 120accgcccccc gcaagatcgt catcgatccg gtgacccgcg tggagggcca cggcaaggtc 120
accatccgcc tggatgaagc cggcgcggtg gaggatgcgc gtttccacat cgtggagttc 180accatccgcc tggatgaagc cggcgcggtg gaggatgcgc gtttccacat cgtggagttc 180
cgcggcttcg agcggttcat ccagggccgg atgtactggg aagtgcccct tatcatccag 240cgcggcttcg agcggttcat ccagggccgg atgtactggg aagtgcccct tatcatccag 240
cggctgtgcg gcatctgccc ggtgagccac catctggcgg cggcgaaagc catggaccag 300cggctgtgcg gcatctgccc ggtgagccac catctggcgg cggcgaaagc catggaccag 300
gtggcgggcg tggaccgcgt accgcccacc gccgagaaac tgcgccggct gatgcattat 360gtggcgggcg tggaccgcgt accgcccacc gccgagaaac tgcgccggct gatgcattat 360
gggcaggtgc tgcaatccaa cgctttgcac atcttccacc tcgcctcgcc cgacctcctg 420gggcaggtgc tgcaatccaa cgctttgcac atcttccacc tcgcctcgcc cgacctcctg 420
ttcggcttcg acgcgccggc cgagcagcgc aacatcatcg ccgtgctcca gcgttatccg 480ttcggcttcg acgcgccggc cgagcagcgc aacatcatcg ccgtgctcca gcgttatccg 480
gagatcggca aatgggcgat cttcatcagg aagttcggcc aggaggtcat caaggccacc 540gagatcggca aatgggcgat cttcatcagg aagttcggcc aggaggtcat caaggccacc 540
ggcgggcgca agatccatcc caccagcgcc attcccggcg gggtcaacca gaacctcgcc 600ggcgggcgca agatccatcc caccagcgcc attcccggcg gggtcaacca gaacctcgcc 600
gtggaggacc gcgacgccct gcgcgccaag gtgggcgaga tcatcagctg gtgcatggcg 660gtggaggacc gcgacgccct gcgcgccaag gtgggcgaga tcatcagctg gtgcatggcg 660
gcgctggacc atcacaaggc ctatgtggcg gaaaaccggg cgctgcatga cagcttcgcc 720gcgctggacc atcacaaggc ctatgtggcg gaaaaccggg cgctgcatga cagcttcgcc 720
gccttcccct ccgccttcat gagcctcgtg gggccggatg gcggcatgga cctttatgac 780gccttcccct ccgccttcat gagcctcgtg gggccggatg gcggcatgga cctttatgac 780
ggcaccctgc gggtgatcga tgccgagggc gcccccctca tcgaaggcgc gccgcccgcc 840ggcaccctgc gggtgatcga tgccgagggc gcccccctca tcgaaggcgc gccgcccgcc 840
tcctaccgcg accacctcat cgaggaggtg cggccctgga gctatctgaa attcccccat 900tcctaccgcg accacctcat cgaggaggtg cggccctgga gctatctgaa attcccccat 900
ctgcgcgcct tcggccgcga cgatggctgg tatcgggtcg gccccctcgc ccaggtcaat 960ctgcgcgcct tcggccgcga cgatggctgg tatcgggtcg gccccctcgc ccaggtcaat 960
tgcgccgcgt ccatcgacac gccccgcgcc gaggcggccc ggcgggactt catggccgag 1020tgcgccgcgt ccatcgacac gccccgcgcc gaggcggccc ggcgggactt catggccgag 1020
ggcggcggca agccggtgca tgccaccctc gcttatcact gggcgcggct catcgtgctg 1080ggcggcggca agccggtgca tgccaccctc gcttatcact gggcgcggct catcgtgctg 1080
gtccattgcg cggagaagat cgaacagctg ctgttcgacg acgacctgca aggctgcgat 1140gtccattgcg cggagaagat cgaacagctg ctgttcgacg acgacctgca aggctgcgat 1140
ctgcgtgcgg agggcacccg gcgcggggaa ggcgtcgcct ggatcgaggc gccgcgcggc 1200ctgcgtgcgg agggcacccg gcgcggggaa ggcgtcgcct ggatcgaggc gccgcgcggc 1200
accctcatcc accattacga ggtggacgag aacgaccagg tgcgccgcgc caacctcatc 1260accctcatcc accattacga ggtggacgag aacgaccagg tgcgccgcgc caacctcatc 1260
gtctccacca cccacaataa cgaggccatg aaccgcgccg tgcggcaggt ggcgaagacg 1320gtctccacca cccacaataa cgaggccatg aaccgcgccg tgcggcaggt ggcgaagacg 1320
gacctttccg gtcgcgagat caccgaaggg ctgctgaacc atatcgaggt ggccatccgc 1380gacctttccg gtcgcgagat caccgaaggg ctgctgaacc atatcgaggt ggccatccgc 1380
gccttcgacc cctgcctgtc ctgcgccacc catgcgctgg gccagatgcc gctgatcgtg 1440gccttcgacc cctgcctgtc ctgcgccacc catgcgctgg gccagatgcc gctgatcgtg 1440
acgcttgaag atgcctccgg cgcagagatc gcccgcggag tgaaggaatg a 1491acgcttgaag atgcctccgg cgcagagatc gcccgcggag tgaaggaatg a 1491
<210> 9<210> 9
<211> 496<211> 496
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 9<400> 9
Met Ser Arg Gly Ser Pro Asp Ala Gly Lys Asp Arg Thr Met Ser Ala Met Ser Arg Gly Ser Pro Asp Ala Gly Lys Asp Arg Thr Met Ser Ala
1 5 10 15 1 5 10 15
Thr Asp Gly Thr Thr Ala Pro Arg Lys Ile Val Ile Asp Pro Val Thr Thr Asp Gly Thr Thr Ala Pro Arg Lys Ile Val Ile Asp Pro Val Thr
20 25 30 20 25 30
Arg Val Glu Gly His Gly Lys Val Thr Ile Arg Leu Asp Glu Ala Gly Arg Val Glu Gly His Gly Lys Val Thr Ile Arg Leu Asp Glu Ala Gly
35 40 45 35 40 45
Ala Val Glu Asp Ala Arg Phe His Ile Val Glu Phe Arg Gly Phe Glu Ala Val Glu Asp Ala Arg Phe His Ile Val Glu Phe Arg Gly Phe Glu
50 55 60 50 55 60
Arg Phe Ile Gln Gly Arg Met Tyr Trp Glu Val Pro Leu Ile Ile Gln Arg Phe Ile Gln Gly Arg Met Tyr Trp Glu Val Pro Leu Ile Ile Gln
65 70 75 80 65 70 75 80
Arg Leu Cys Gly Ile Cys Pro Val Ser His His Leu Ala Ala Ala Lys Arg Leu Cys Gly Ile Cys Pro Val Ser His His Leu Ala Ala Ala Lys
85 90 95 85 90 95
Ala Met Asp Gln Val Ala Gly Val Asp Arg Val Pro Pro Thr Ala Glu Ala Met Asp Gln Val Ala Gly Val Asp Arg Val Pro Pro Thr Ala Glu
100 105 110 100 105 110
Lys Leu Arg Arg Leu Met His Tyr Gly Gln Val Leu Gln Ser Asn Ala Lys Leu Arg Arg Leu Met His Tyr Gly Gln Val Leu Gln Ser Asn Ala
115 120 125 115 120 125
Leu His Ile Phe His Leu Ala Ser Pro Asp Leu Leu Phe Gly Phe Asp Leu His Ile Phe His Leu Ala Ser Pro Asp Leu Leu Phe Gly Phe Asp
130 135 140 130 135 140
Ala Pro Ala Glu Gln Arg Asn Ile Ile Ala Val Leu Gln Arg Tyr Pro Ala Pro Ala Glu Gln Arg Asn Ile Ile Ala Val Leu Gln Arg Tyr Pro
145 150 155 160 145 150 155 160
Glu Ile Gly Lys Trp Ala Ile Phe Ile Arg Lys Phe Gly Gln Glu Val Glu Ile Gly Lys Trp Ala Ile Phe Ile Arg Lys Phe Gly Gln Glu Val
165 170 175 165 170 175
Ile Lys Ala Thr Gly Gly Arg Lys Ile His Pro Thr Ser Ala Ile Pro Ile Lys Ala Thr Gly Gly Arg Lys Ile His Pro Thr Ser Ala Ile Pro
180 185 190 180 185 190
Gly Gly Val Asn Gln Asn Leu Ala Val Glu Asp Arg Asp Ala Leu Arg Gly Gly Val Asn Gln Asn Leu Ala Val Glu Asp Arg Asp Ala Leu Arg
195 200 205 195 200 205
Ala Lys Val Gly Glu Ile Ile Ser Trp Cys Met Ala Ala Leu Asp His Ala Lys Val Gly Glu Ile Ile Ser Trp Cys Met Ala Ala Leu Asp His
210 215 220 210 215 220
His Lys Ala Tyr Val Ala Glu Asn Arg Ala Leu His Asp Ser Phe Ala His Lys Ala Tyr Val Ala Glu Asn Arg Ala Leu His Asp Ser Phe Ala
225 230 235 240 225 230 235 240
Ala Phe Pro Ser Ala Phe Met Ser Leu Val Gly Pro Asp Gly Gly Met Ala Phe Pro Ser Ala Phe Met Ser Leu Val Gly Pro Asp Gly Gly Met
245 250 255 245 250 255
Asp Leu Tyr Asp Gly Thr Leu Arg Val Ile Asp Ala Glu Gly Ala Pro Asp Leu Tyr Asp Gly Thr Leu Arg Val Ile Asp Ala Glu Gly Ala Pro
260 265 270 260 265 270
Leu Ile Glu Gly Ala Pro Pro Ala Ser Tyr Arg Asp His Leu Ile Glu Leu Ile Glu Gly Ala Pro Pro Ala Ser Tyr Arg Asp His Leu Ile Glu
275 280 285 275 280 285
Glu Val Arg Pro Trp Ser Tyr Leu Lys Phe Pro His Leu Arg Ala Phe Glu Val Arg Pro Trp Ser Tyr Leu Lys Phe Pro His Leu Arg Ala Phe
290 295 300 290 295 300
Gly Arg Asp Asp Gly Trp Tyr Arg Val Gly Pro Leu Ala Gln Val Asn Gly Arg Asp Asp Gly Trp Tyr Arg Val Gly Pro Leu Ala Gln Val Asn
305 310 315 320 305 310 315 320
Cys Ala Ala Ser Ile Asp Thr Pro Arg Ala Glu Ala Ala Arg Arg Asp Cys Ala Ala Ser Ile Asp Thr Pro Arg Ala Glu Ala Ala Arg Arg Asp
325 330 335 325 330 335
Phe Met Ala Glu Gly Gly Gly Lys Pro Val His Ala Thr Leu Ala Tyr Phe Met Ala Glu Gly Gly Gly Lys Pro Val His Ala Thr Leu Ala Tyr
340 345 350 340 345 350
His Trp Ala Arg Leu Ile Val Leu Val His Cys Ala Glu Lys Ile Glu His Trp Ala Arg Leu Ile Val Leu Val His Cys Ala Glu Lys Ile Glu
355 360 365 355 360 365
Gln Leu Leu Phe Asp Asp Asp Leu Gln Gly Cys Asp Leu Arg Ala Glu Gln Leu Leu Phe Asp Asp Asp Leu Gln Gly Cys Asp Leu Arg Ala Glu
370 375 380 370 375 380
Gly Thr Arg Arg Gly Glu Gly Val Ala Trp Ile Glu Ala Pro Arg Gly Gly Thr Arg Arg Gly Glu Gly Val Ala Trp Ile Glu Ala Pro Arg Gly
385 390 395 400 385 390 395 400
Thr Leu Ile His His Tyr Glu Val Asp Glu Asn Asp Gln Val Arg Arg Thr Leu Ile His His Tyr Glu Val Asp Glu Asn Asp Gln Val Arg Arg
405 410 415 405 410 415
Ala Asn Leu Ile Val Ser Thr Thr His Asn Asn Glu Ala Met Asn Arg Ala Asn Leu Ile Val Ser Thr Thr His Asn Asn Glu Ala Met Asn Arg
420 425 430 420 425 430
Ala Val Arg Gln Val Ala Lys Thr Asp Leu Ser Gly Arg Glu Ile Thr Ala Val Arg Gln Val Ala Lys Thr Asp Leu Ser Gly Arg Glu Ile Thr
435 440 445 435 440 445
Glu Gly Leu Leu Asn His Ile Glu Val Ala Ile Arg Ala Phe Asp Pro Glu Gly Leu Leu Asn His Ile Glu Val Ala Ile Arg Ala Phe Asp Pro
450 455 460 450 455 460
Cys Leu Ser Cys Ala Thr His Ala Leu Gly Gln Met Pro Leu Ile Val Cys Leu Ser Cys Ala Thr His Ala Leu Gly Gln Met Pro Leu Ile Val
465 470 475 480 465 470 475 480
Thr Leu Glu Asp Ala Ser Gly Ala Glu Ile Ala Arg Gly Val Lys Glu Thr Leu Glu Asp Ala Ser Gly Ala Glu Ile Ala Arg Gly Val Lys Glu
485 490 495 485 490 495
<210> 10<210> 10
<211> 720<211> 720
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 10<400> 10
atgagcgaga cccccttcac ctttaccgtg gacggcatcg cggtcccggc cacccccggc 60atgagcgaga cccccttcac ctttaccgtg gacggcatcg cggtcccggc cacccccggc 60
cagagcgtca tcgaggcgtg cgatgcggcg ggcatctata tcccgcgcct gtgccaccac 120cagagcgtca tcgaggcgtg cgatgcggcg ggcatctata tcccgcgcct gtgccaccac 120
ccggacctgc cgccggcggg ccattgccgg gtgtgcacct gcatcatcga cgggcggccg 180ccggacctgc cgccggcggg ccattgccgg gtgtgcacct gcatcatcga cgggcggccg 180
gccagcgcct gcaccatgcc cgccgccagg ggcatggtgg tggagaacga gacgcccgct 240gccagcgcct gcaccatgcc cgccgccagg ggcatggtgg tggagaacga gacgcccgct 240
ttgctggcgg agcggcgcac gctgatcgag atgctgttcg cggaaggcaa ccatttctgc 300ttgctggcgg agcggcgcac gctgatcgag atgctgttcg cggaaggcaa ccatttctgc 300
cagttctgcg aggcgagcgg cgattgcgaa ttgcaggcgc tgggctacct gttcggcatg 360cagttctgcg aggcgagcgg cgattgcgaa ttgcaggcgc tgggctacct gttcggcatg 360
gtggccccgc ccttccccca tctgtggccg aagcggccgg tggatgccag ccatccggat 420gtggccccgc ccttccccca tctgtggccg aagcggccgg tggatgccag ccatccggat 420
atctatatcg accacaatcg ctgcatcctg tgctcgcgct gcgtgcgcgc ctcgcgcacc 480atctatatcg accacaatcg ctgcatcctg tgctcgcgct gcgtgcgcgc ctcgcgcacc 480
ctggacggca agtccgtgtt cggcttcgag gggcgcggca tcgagatgca tctggcggtg 540ctggacggca agtccgtgtt cggcttcgag gggcgcggca tcgagatgca tctggcggtg 540
accggcgggc acctggacga cagcgccatc gccgccgccg acagggcggt tgagatgtgc 600accggcgggc acctggacga cagcgccatc gccgccgccg acagggcggt tgagatgtgc 600
ccggtgggct gcatcgtcct caagcgcacc ggctaccgca cgccctatgg ccggcggcgc 660ccggtgggct gcatcgtcct caagcgcacc ggctaccgca cgccctatgg ccggcggcgc 660
tacgacgccg cgcccatcgg ctccgacatc accgcccggc gcggcggcgc gaaggactga 720tacgacgccg cgcccatcgg ctccgacatc accgcccggc gcggcggcgc gaaggactga 720
<210> 11<210> 11
<211> 239<211> 239
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 11<400> 11
Met Ser Glu Thr Pro Phe Thr Phe Thr Val Asp Gly Ile Ala Val Pro Met Ser Glu Thr Pro Phe Thr Phe Thr Val Asp Gly Ile Ala Val Pro
1 5 10 15 1 5 10 15
Ala Thr Pro Gly Gln Ser Val Ile Glu Ala Cys Asp Ala Ala Gly Ile Ala Thr Pro Gly Gln Ser Val Ile Glu Ala Cys Asp Ala Ala Gly Ile
20 25 30 20 25 30
Tyr Ile Pro Arg Leu Cys His His Pro Asp Leu Pro Pro Ala Gly His Tyr Ile Pro Arg Leu Cys His His Pro Asp Leu Pro Pro Ala Gly His
35 40 45 35 40 45
Cys Arg Val Cys Thr Cys Ile Ile Asp Gly Arg Pro Ala Ser Ala Cys Cys Arg Val Cys Thr Cys Ile Ile Asp Gly Arg Pro Ala Ser Ala Cys
50 55 60 50 55 60
Thr Met Pro Ala Ala Arg Gly Met Val Val Glu Asn Glu Thr Pro Ala Thr Met Pro Ala Ala Arg Gly Met Val Val Glu Asn Glu Thr Pro Ala
65 70 75 80 65 70 75 80
Leu Leu Ala Glu Arg Arg Thr Leu Ile Glu Met Leu Phe Ala Glu Gly Leu Leu Ala Glu Arg Arg Thr Leu Ile Glu Met Leu Phe Ala Glu Gly
85 90 95 85 90 95
Asn His Phe Cys Gln Phe Cys Glu Ala Ser Gly Asp Cys Glu Leu Gln Asn His Phe Cys Gln Phe Cys Glu Ala Ser Gly Asp Cys Glu Leu Gln
100 105 110 100 105 110
Ala Leu Gly Tyr Leu Phe Gly Met Val Ala Pro Pro Phe Pro His Leu Ala Leu Gly Tyr Leu Phe Gly Met Val Ala Pro Pro Phe Pro His Leu
115 120 125 115 120 125
Trp Pro Lys Arg Pro Val Asp Ala Ser His Pro Asp Ile Tyr Ile Asp Trp Pro Lys Arg Pro Val Asp Ala Ser His Pro Asp Ile Tyr Ile Asp
130 135 140 130 135 140
His Asn Arg Cys Ile Leu Cys Ser Arg Cys Val Arg Ala Ser Arg Thr His Asn Arg Cys Ile Leu Cys Ser Arg Cys Val Arg Ala Ser Arg Thr
145 150 155 160 145 150 155 160
Leu Asp Gly Lys Ser Val Phe Gly Phe Glu Gly Arg Gly Ile Glu Met Leu Asp Gly Lys Ser Val Phe Gly Phe Glu Gly Arg Gly Ile Glu Met
165 170 175 165 170 175
His Leu Ala Val Thr Gly Gly His Leu Asp Asp Ser Ala Ile Ala Ala His Leu Ala Val Thr Gly Gly His Leu Asp Asp Ser Ala Ile Ala Ala
180 185 190 180 185 190
Ala Asp Arg Ala Val Glu Met Cys Pro Val Gly Cys Ile Val Leu Lys Ala Asp Arg Ala Val Glu Met Cys Pro Val Gly Cys Ile Val Leu Lys
195 200 205 195 200 205
Arg Thr Gly Tyr Arg Thr Pro Tyr Gly Arg Arg Arg Tyr Asp Ala Ala Arg Thr Gly Tyr Arg Thr Pro Tyr Gly Arg Arg Arg Tyr Asp Ala Ala
210 215 220 210 215 220
Pro Ile Gly Ser Asp Ile Thr Ala Arg Arg Gly Gly Ala Lys Asp Pro Ile Gly Ser Asp Ile Thr Ala Arg Arg Gly Gly Ala Lys Asp
225 230 235 225 230 235
<210> 12<210> 12
<211> 543<211> 543
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 12<400> 12
atggccaagc ccaaactcgc cacctgcgcg ctggccggct gcttcggctg ccacatgtcc 60atggccaagc ccaaactcgc cacctgcgcg ctggccggct gcttcggctg ccacatgtcc 60
ttcctggaca tggacgagcg catcgtcgag ctcatcgacc tggtggacct cgacgtctcg 120ttcctggaca tggacgagcg catcgtcgag ctcatcgacc tggtggacct cgacgtctcg 120
cccctcgacg acaagaaaaa cttcaccggc atggtggaaa tcggcctggt ggaaggcggc 180cccctcgacg acaagaaaaa cttcaccggc atggtggaaa tcggcctggt ggaaggcggc 180
tgcgccgacg agcgccatgt gaaggtgctg cgcgagttcc gcgagaaatc ccgcatcctg 240tgcgccgacg agcgccatgt gaaggtgctg cgcgagttcc gcgagaaatc ccgcatcctg 240
gtggcggtgg gcgcctgcgc catcaccggc ggcatcccgg cattgcgcaa cctcgccggc 300gtggcggtgg gcgcctgcgc catcaccggc ggcatcccgg cattgcgcaa cctcgccggc 300
ctcgacgaat gcctgaggga agcctacctc accggcccca cggtggaagg cggcgggctc 360ctcgacgaat gcctgaggga agcctacctc accggcccca cggtggaagg cggcgggctc 360
attcccaacg acccggagct gccgctgctg ctggacaagg tctatccggt gcaggacttc 420attcccaacg acccggagct gccgctgctg ctggacaagg tctatccggt gcaggacttc 420
gtgaagatcg accatttcct gcccggctgc ccgccctcgg ccgacgccat ctgggcggct 480gtgaagatcg accatttcct gcccggctgc ccgccctcgg ccgacgccat ctgggcggct 480
ctgaaggcgc tgctgaccgg caccgagccg catctgccct acccgctttt caagtacgaa 540ctgaaggcgc tgctgaccgg caccgagccg catctgccct acccgctttt caagtacgaa 540
tga 543tga 543
<210> 13<210> 13
<211> 180<211> 180
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 13<400> 13
Met Ala Lys Pro Lys Leu Ala Thr Cys Ala Leu Ala Gly Cys Phe Gly Met Ala Lys Pro Lys Leu Ala Thr Cys Ala Leu Ala Gly Cys Phe Gly
1 5 10 15 1 5 10 15
Cys His Met Ser Phe Leu Asp Met Asp Glu Arg Ile Val Glu Leu Ile Cys His Met Ser Phe Leu Asp Met Asp Glu Arg Ile Val Glu Leu Ile
20 25 30 20 25 30
Asp Leu Val Asp Leu Asp Val Ser Pro Leu Asp Asp Lys Lys Asn Phe Asp Leu Val Asp Leu Asp Val Ser Pro Leu Asp Asp Lys Lys Asn Phe
35 40 45 35 40 45
Thr Gly Met Val Glu Ile Gly Leu Val Glu Gly Gly Cys Ala Asp Glu Thr Gly Met Val Glu Ile Gly Leu Val Glu Gly Gly Cys Ala Asp Glu
50 55 60 50 55 60
Arg His Val Lys Val Leu Arg Glu Phe Arg Glu Lys Ser Arg Ile Leu Arg His Val Lys Val Leu Arg Glu Phe Arg Glu Lys Ser Arg Ile Leu
65 70 75 80 65 70 75 80
Val Ala Val Gly Ala Cys Ala Ile Thr Gly Gly Ile Pro Ala Leu Arg Val Ala Val Gly Ala Cys Ala Ile Thr Gly Gly Ile Pro Ala Leu Arg
85 90 95 85 90 95
Asn Leu Ala Gly Leu Asp Glu Cys Leu Arg Glu Ala Tyr Leu Thr Gly Asn Leu Ala Gly Leu Asp Glu Cys Leu Arg Glu Ala Tyr Leu Thr Gly
100 105 110 100 105 110
Pro Thr Val Glu Gly Gly Gly Leu Ile Pro Asn Asp Pro Glu Leu Pro Pro Thr Val Glu Gly Gly Gly Leu Ile Pro Asn Asp Pro Glu Leu Pro
115 120 125 115 120 125
Leu Leu Leu Asp Lys Val Tyr Pro Val Gln Asp Phe Val Lys Ile Asp Leu Leu Leu Asp Lys Val Tyr Pro Val Gln Asp Phe Val Lys Ile Asp
130 135 140 130 135 140
His Phe Leu Pro Gly Cys Pro Pro Ser Ala Asp Ala Ile Trp Ala Ala His Phe Leu Pro Gly Cys Pro Pro Ser Ala Asp Ala Ile Trp Ala Ala
145 150 155 160 145 150 155 160
Leu Lys Ala Leu Leu Thr Gly Thr Glu Pro His Leu Pro Tyr Pro Leu Leu Lys Ala Leu Leu Thr Gly Thr Glu Pro His Leu Pro Tyr Pro Leu
165 170 175 165 170 175
Phe Lys Tyr Glu Phe Lys Tyr Glu
180 180
<210> 14<210> 14
<211> 1013<211> 1013
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 14<400> 14
tccagacccg ggcaacattg ctccatgtgc tgggcaccct ggccggccgc tggccccata 60tccagacccg ggcaacattg ctccatgtgc tgggcaccct ggccggccgc tggccccata 60
ccctcgcgct ccagcccggc ggggtgaccc gaagcgccga ccagcacgac cgcatgcgcc 120ccctcgcgct ccagcccggc ggggtgaccc gaagcgccga ccagcacgac cgcatgcgcc 120
tgctcgcgac gctgaaggcg gtgcgggcgg cgctggaaga gaccttgttc ggcgcgcctt 180tgctcgcgac gctgaaggcg gtgcgggcgg cgctggaaga gaccttgttc ggcgcgcctt 180
tggaagaggt ggcggccctg gacggcgccg ccgccgtgga ggcctggcgc gccaacggcc 240tggaagaggt ggcggccctg gacggcgccg ccgccgtgga ggcctggcgc gccaacggcc 240
cggaagggga tttccgcctg ttcctggaga tcgccgccga cctggagctg gaccggctcg 300cggaagggga tttccgcctg ttcctggaga tcgccgccga cctggagctg gaccggctcg 300
gccgcgcgca cgaccgcttt ctctccttcg gcgcctacgc ccaggacgag gggcgccttt 360gccgcgcgca cgaccgcttt ctctccttcg gcgcctacgc ccaggacgag gggcgccttt 360
atggcgccgg caccttcgag gccgggacgg cgggagggct cgatcccaac gccatcaccg 420atggcgccgg caccttcgag gccgggacgg cgggagggct cgatcccaac gccatcaccg 420
aggaccacgc cttcgcccgc atggaggacc gcgcggcgcc ccatgcgccc tttgacggct 480aggaccacgc cttcgcccgc atggaggacc gcgcggcgcc ccatgcgccc tttgacggct 480
ccaccttccc cgatgccgac gacaccgagg gctacacctg gtgcaaggcg ccgcgccttg 540ccaccttccc cgatgccgac gacaccgagg gctacacctg gtgcaaggcg ccgcgccttg 540
ccggcctgcc cttcgagacc ggcgccttcg cccggcaggt ggtggcgggc catccgctcg 600ccggcctgcc cttcgagacc ggcgccttcg cccggcaggt ggtggcgggc catccgctcg 600
cccgggacct cgtgacgcgg gaaggcggca ctgtgcgcag ccgcgtggtc ggccggctgc 660cccgggacct cgtgacgcgg gaaggcggca ctgtgcgcag ccgcgtggtc ggccggctgc 660
tggaaaccgc gcgcaccctg atcgccatgg agggctgggt gaaggaactg cggcccgaag 720tggaaaccgc gcgcaccctg atcgccatgg agggctgggt gaaggaactg cggcccgaag 720
ggccctggtg cgcccagggc cacctgcccc aggaaggccg cgccttcggc ctcaccgagg 780ggccctggtg cgcccagggc cacctgcccc aggaaggccg cgccttcggc ctcaccgagg 780
cggcgcgcgg ggcgctcggc cactggatgg tggtggagaa gggccgcatt gcccgctacc 840cggcgcgcgg ggcgctcggc cactggatgg tggtggagaa gggccgcatt gcccgctacc 840
agatcatcgc ccccaccacc tggaacttct ccccccgcga cggcgcgggc ctgcccggcc 900agatcatcgc ccccaccacc tggaacttct ccccccgcga cggcgcgggc ctgcccggcc 900
cgctggagac ggccctggtg ggcgcgcccg tgcggcaggg agagacgacg cccgtgagcg 960cgctggagac ggccctggtg ggcgcgcccg tgcggcaggg agagacgacg cccgtgagcg 960
tgcagcacat cgtgcgctcc ttcgacccgt gcatggtctg cactgtgcat tga 1013tgcagcacat cgtgcgctcc ttcgacccgt gcatggtctg cactgtgcat tga 1013
<210> 15<210> 15
<211> 485<211> 485
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 15<400> 15
Met Ser Ala Glu Thr Arg Arg Leu Val Val Gly Pro Phe Asn Arg Val Met Ser Ala Glu Thr Arg Arg Leu Val Val Gly Pro Phe Asn Arg Val
1 5 10 15 1 5 10 15
Glu Gly Asp Leu Glu Val Arg Leu Asp Val Gln Asp Gly Arg Val Gln Glu Gly Asp Leu Glu Val Arg Leu Asp Val Gln Asp Gly Arg Val Gln
20 25 30 20 25 30
Gln Ala Phe Val Ser Ser Pro Leu Phe Arg Gly Phe Glu Arg Ile Leu Gln Ala Phe Val Ser Ser Pro Leu Phe Arg Gly Phe Glu Arg Ile Leu
35 40 45 35 40 45
Glu Gly Arg Asp Pro Arg Asp Ala Leu Val Ile Ala Pro Arg Ile Cys Glu Gly Arg Asp Pro Arg Asp Ala Leu Val Ile Ala Pro Arg Ile Cys
50 55 60 50 55 60
Gly Ile Cys Ser Val Ser Gln Ser His Ala Ala Ala Leu Ala Leu Ala Gly Ile Cys Ser Val Ser Gln Ser His Ala Ala Ala Leu Ala Leu Ala
65 70 75 80 65 70 75 80
Gly Leu Gln Gly Ile Ala Pro Thr His Asp Gly Arg Ile Ala Thr Asn Gly Leu Gln Gly Ile Ala Pro Thr His Asp Gly Arg Ile Ala Thr Asn
85 90 95 85 90 95
Leu Ile Val Ala Ala Glu Asn Val Ala Asp His Leu Thr His Phe His Leu Ile Val Ala Ala Glu Asn Val Ala Asp His Leu Thr His Phe His
100 105 110 100 105 110
Val Phe Phe Met Pro Asp Phe Ala Arg Ala Val Tyr Glu Asp Arg Pro Val Phe Phe Met Pro Asp Phe Ala Arg Ala Val Tyr Glu Asp Arg Pro
115 120 125 115 120 125
Trp Phe Ala Gln Ala Ala Arg Arg Phe Lys Ala Asn Gln Gly Val Ser Trp Phe Ala Gln Ala Ala Arg Arg Phe Lys Ala Asn Gln Gly Val Ser
130 135 140 130 135 140
Val Arg Arg Ala Leu Gln Thr Arg Ala Thr Leu Leu His Val Leu Gly Val Arg Arg Ala Leu Gln Thr Arg Ala Thr Leu Leu His Val Leu Gly
145 150 155 160 145 150 155 160
Thr Leu Ala Gly Arg Trp Pro His Thr Leu Ala Leu Gln Pro Gly Gly Thr Leu Ala Gly Arg Trp Pro His Thr Leu Ala Leu Gln Pro Gly Gly
165 170 175 165 170 175
Val Thr Arg Ser Ala Asp Gln His Asp Arg Met Arg Leu Leu Ala Thr Val Thr Arg Ser Ala Asp Gln His Asp Arg Met Arg Leu Leu Ala Thr
180 185 190 180 185 190
Leu Lys Ala Val Arg Ala Ala Leu Glu Glu Thr Leu Phe Gly Ala Pro Leu Lys Ala Val Arg Ala Ala Leu Glu Glu Thr Leu Phe Gly Ala Pro
195 200 205 195 200 205
Leu Glu Glu Val Ala Ala Leu Asp Gly Ala Ala Ala Val Glu Ala Trp Leu Glu Glu Val Ala Ala Leu Asp Gly Ala Ala Ala Val Glu Ala Trp
210 215 220 210 215 220
Arg Ala Asn Gly Pro Glu Gly Asp Phe Arg Leu Phe Leu Glu Ile Ala Arg Ala Asn Gly Pro Glu Gly Asp Phe Arg Leu Phe Leu Glu Ile Ala
225 230 235 240 225 230 235 240
Ala Asp Leu Glu Leu Asp Arg Leu Gly Arg Ala His Asp Arg Phe Leu Ala Asp Leu Glu Leu Asp Arg Leu Gly Arg Ala His Asp Arg Phe Leu
245 250 255 245 250 255
Ser Phe Gly Ala Tyr Ala Gln Asp Glu Gly Arg Leu Tyr Gly Ala Gly Ser Phe Gly Ala Tyr Ala Gln Asp Glu Gly Arg Leu Tyr Gly Ala Gly
260 265 270 260 265 270
Thr Phe Glu Ala Gly Thr Ala Gly Gly Leu Asp Pro Asn Ala Ile Thr Thr Phe Glu Ala Gly Thr Ala Gly Gly Leu Asp Pro Asn Ala Ile Thr
275 280 285 275 280 285
Glu Asp His Ala Phe Ala Arg Met Glu Asp Arg Ala Ala Pro His Ala Glu Asp His Ala Phe Ala Arg Met Glu Asp Arg Ala Ala Pro His Ala
290 295 300 290 295 300
Pro Phe Asp Gly Ser Thr Phe Pro Asp Ala Asp Asp Thr Glu Gly Tyr Pro Phe Asp Gly Ser Thr Phe Pro Asp Ala Asp Asp Thr Glu Gly Tyr
305 310 315 320 305 310 315 320
Thr Trp Cys Lys Ala Pro Arg Leu Ala Gly Leu Pro Phe Glu Thr Gly Thr Trp Cys Lys Ala Pro Arg Leu Ala Gly Leu Pro Phe Glu Thr Gly
325 330 335 325 330 335
Ala Phe Ala Arg Gln Val Val Ala Gly His Pro Leu Ala Arg Asp Leu Ala Phe Ala Arg Gln Val Val Ala Gly His Pro Leu Ala Arg Asp Leu
340 345 350 340 345 350
Val Thr Arg Glu Gly Gly Thr Val Arg Ser Arg Val Val Gly Arg Leu Val Thr Arg Glu Gly Gly Thr Val Arg Ser Arg Val Val Gly Arg Leu
355 360 365 355 360 365
Leu Glu Thr Ala Arg Thr Leu Ile Ala Met Glu Gly Trp Val Lys Glu Leu Glu Thr Ala Arg Thr Leu Ile Ala Met Glu Gly Trp Val Lys Glu
370 375 380 370 375 380
Leu Arg Pro Glu Gly Pro Trp Cys Ala Gln Gly His Leu Pro Gln Glu Leu Arg Pro Glu Gly Pro Trp Cys Ala Gln Gly His Leu Pro Gln Glu
385 390 395 400 385 390 395 400
Gly Arg Ala Phe Gly Leu Thr Glu Ala Ala Arg Gly Ala Leu Gly His Gly Arg Ala Phe Gly Leu Thr Glu Ala Ala Arg Gly Ala Leu Gly His
405 410 415 405 410 415
Trp Met Val Val Glu Lys Gly Arg Ile Ala Arg Tyr Gln Ile Ile Ala Trp Met Val Val Glu Lys Gly Arg Ile Ala Arg Tyr Gln Ile Ile Ala
420 425 430 420 425 430
Pro Thr Thr Trp Asn Phe Ser Pro Arg Asp Gly Ala Gly Leu Pro Gly Pro Thr Thr Trp Asn Phe Ser Pro Arg Asp Gly Ala Gly Leu Pro Gly
435 440 445 435 440 445
Pro Leu Glu Thr Ala Leu Val Gly Ala Pro Val Arg Gln Gly Glu Thr Pro Leu Glu Thr Ala Leu Val Gly Ala Pro Val Arg Gln Gly Glu Thr
450 455 460 450 455 460
Thr Pro Val Ser Val Gln His Ile Val Arg Ser Phe Asp Pro Cys Met Thr Pro Val Ser Val Gln His Ile Val Arg Ser Phe Asp Pro Cys Met
465 470 475 480 465 470 475 480
Val Cys Thr Val His Val Cys Thr Val His
485 485
<210> 16<210> 16
<211> 858<211> 858
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 16<400> 16
acgggggagg aagcccgcgc catcttcgac gccatccttg ccggcgttat cgtcctcgac 60acgggggagg aagcccgcgc catcttcgac gccatccttg ccggcgttat cgtcctcgac 60
gccctgtgcg tggaaggcgc gctgctgcgc gggccgaacg gcaccgggcg cttccatgtg 120gccctgtgcg tggaaggcgc gctgctgcgc gggccgaacg gcaccgggcg cttccatgtg 120
ctggcgggca cggacacccc caccatcgac tgggcgcggc agctcgccgg catggcgcgc 180ctggcgggca cggacacccc caccatcgac tgggcgcggc agctcgccgg catggcgcgc 180
cacgtggtgg cggtgggcac ctgcgccgcc tatgggggcg tgacggcggc gggcatcaac 240cacgtggtgg cggtgggcac ctgcgccgcc tatgggggcg tgacggcggc gggcatcaac 240
cccaccgatg cctgcggcct ccagttcgac ggacgccgga agggtggggc gctgggggcg 300cccaccgatg cctgcggcct ccagttcgac ggacgccgga agggtggggc gctggggggcg 300
gacttccgct cccgctcggg gcttccggtc atcaatgtgg ccggctgccc cacccatccc 360gacttccgct cccgctcggg gcttccggtc atcaatgtgg ccggctgccc cacccatccc 360
aactgggtga cggaaaccct gatgctgctc gcctgcggcc tgctgggcga ggccgacctc 420aactgggtga cggaaaccct gatgctgctc gcctgcggcc tgctgggcga ggccgacctc 420
gacgtctatg gccgcccgcg cttctatgcg gacctgctgg tgcatcacgg ctgcccgcgc 480gacgtctatg gccgcccgcg cttctatgcg gacctgctgg tgcatcacgg ctgcccgcgc 480
aacgaatact atgaatacaa ggcgagcgcc gagaagatga gcgacctcgg ctgcatgatg 540aacgaatact atgaatacaa ggcgagcgcc gagaagatga gcgacctcgg ctgcatgatg 540
gagcatctgg gctgcctcgg cacccaggcc cacgccgact gcaacacgcg cctttggaat 600gagcatctgg gctgcctcgg cacccaggcc cacgccgact gcaacacgcg cctttggaat 600
ggcgagggct cgtgcacccg cggcggctat gcctgcatca actgcacggc gccggaattc 660ggcgagggct cgtgcacccg cggcggctat gcctgcatca actgcacggc gccggaattc 660
gaggagccgg gccacgcctt cctggagacg cccaagatcg gcggcatccc catcggcctg 720gaggagccgg gccacgcctt cctggagacg cccaagatcg gcggcatccc catcggcctg 720
cccaccgaca tgcccaaggc ctggttcatc gccttgtcct ccctcgccaa ggcggcgacg 780cccaccgaca tgcccaaggc ctggttcatc gccttgtcct ccctcgccaa ggcggcgacg 780
ccggagcggc tgcgcaagaa cgcggtgtcc gaccatgtgg tcacgccgcc cgccgtcaag 840ccggagcggc tgcgcaagaa cgcggtgtcc gaccatgtgg tcacgccgcc cgccgtcaag 840
gacatcaagc ggcgatga 858gacatcaagc ggcgatga 858
<210> 17<210> 17
<211> 335<211> 335
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 17<400> 17
Met Ser Thr Pro Phe Ser Val Leu Trp Leu Gln Ser Gly Gly Cys Gly Met Ser Thr Pro Phe Ser Val Leu Trp Leu Gln Ser Gly Gly Cys Gly
1 5 10 15 1 5 10 15
Gly Cys Thr Met Ser Leu Leu Cys Ala Glu Ala Pro Asp Leu Ala Thr Gly Cys Thr Met Ser Leu Leu Cys Ala Glu Ala Pro Asp Leu Ala Thr
20 25 30 20 25 30
Thr Leu Asp Ala Ala Gly Ile Gly Phe Leu Trp His Pro Ala Leu Ser Thr Leu Asp Ala Ala Gly Ile Gly Phe Leu Trp His Pro Ala Leu Ser
35 40 45 35 40 45
Glu Glu Thr Gly Glu Glu Ala Arg Ala Ile Phe Asp Ala Ile Leu Ala Glu Glu Thr Gly Glu Glu Ala Arg Ala Ile Phe Asp Ala Ile Leu Ala
50 55 60 50 55 60
Gly Val Ile Val Leu Asp Ala Leu Cys Val Glu Gly Ala Leu Leu Arg Gly Val Ile Val Leu Asp Ala Leu Cys Val Glu Gly Ala Leu Leu Arg
65 70 75 80 65 70 75 80
Gly Pro Asn Gly Thr Gly Arg Phe His Val Leu Ala Gly Thr Asp Thr Gly Pro Asn Gly Thr Gly Arg Phe His Val Leu Ala Gly Thr Asp Thr
85 90 95 85 90 95
Pro Thr Ile Asp Trp Ala Arg Gln Leu Ala Gly Met Ala Arg His Val Pro Thr Ile Asp Trp Ala Arg Gln Leu Ala Gly Met Ala Arg His Val
100 105 110 100 105 110
Val Ala Val Gly Thr Cys Ala Ala Tyr Gly Gly Val Thr Ala Ala Gly Val Ala Val Gly Thr Cys Ala Ala Tyr Gly Gly Val Thr Ala Ala Gly
115 120 125 115 120 125
Ile Asn Pro Thr Asp Ala Cys Gly Leu Gln Phe Asp Gly Arg Arg Lys Ile Asn Pro Thr Asp Ala Cys Gly Leu Gln Phe Asp Gly Arg Arg Lys
130 135 140 130 135 140
Gly Gly Ala Leu Gly Ala Asp Phe Arg Ser Arg Ser Gly Leu Pro Val Gly Gly Ala Leu Gly Ala Asp Phe Arg Ser Arg Ser Gly Leu Pro Val
145 150 155 160 145 150 155 160
Ile Asn Val Ala Gly Cys Pro Thr His Pro Asn Trp Val Thr Glu Thr Ile Asn Val Ala Gly Cys Pro Thr His Pro Asn Trp Val Thr Glu Thr
165 170 175 165 170 175
Leu Met Leu Leu Ala Cys Gly Leu Leu Gly Glu Ala Asp Leu Asp Val Leu Met Leu Leu Ala Cys Gly Leu Leu Gly Glu Ala Asp Leu Asp Val
180 185 190 180 185 190
Tyr Gly Arg Pro Arg Phe Tyr Ala Asp Leu Leu Val His His Gly Cys Tyr Gly Arg Pro Arg Phe Tyr Ala Asp Leu Leu Val His His Gly Cys
195 200 205 195 200 205
Pro Arg Asn Glu Tyr Tyr Glu Tyr Lys Ala Ser Ala Glu Lys Met Ser Pro Arg Asn Glu Tyr Tyr Glu Tyr Lys Ala Ser Ala Glu Lys Met Ser
210 215 220 210 215 220
Asp Leu Gly Cys Met Met Glu His Leu Gly Cys Leu Gly Thr Gln Ala Asp Leu Gly Cys Met Met Glu His Leu Gly Cys Leu Gly Thr Gln Ala
225 230 235 240 225 230 235 240
His Ala Asp Cys Asn Thr Arg Leu Trp Asn Gly Glu Gly Ser Cys Thr His Ala Asp Cys Asn Thr Arg Leu Trp Asn Gly Glu Gly Ser Cys Thr
245 250 255 245 250 255
Arg Gly Gly Tyr Ala Cys Ile Asn Cys Thr Ala Pro Glu Phe Glu Glu Arg Gly Gly Tyr Ala Cys Ile Asn Cys Thr Ala Pro Glu Phe Glu Glu
260 265 270 260 265 270
Pro Gly His Ala Phe Leu Glu Thr Pro Lys Ile Gly Gly Ile Pro Ile Pro Gly His Ala Phe Leu Glu Thr Pro Lys Ile Gly Gly Ile Pro Ile
275 280 285 275 280 285
Gly Leu Pro Thr Asp Met Pro Lys Ala Trp Phe Ile Ala Leu Ser Ser Gly Leu Pro Thr Asp Met Pro Lys Ala Trp Phe Ile Ala Leu Ser Ser
290 295 300 290 295 300
Leu Ala Lys Ala Ala Thr Pro Glu Arg Leu Arg Lys Asn Ala Val Ser Leu Ala Lys Ala Ala Thr Pro Glu Arg Leu Arg Lys Asn Ala Val Ser
305 310 315 320 305 310 315 320
Asp His Val Val Thr Pro Pro Ala Val Lys Asp Ile Lys Arg Arg Asp His Val Val Thr Pro Pro Ala Val Lys Asp Ile Lys Arg Arg
325 330 335 325 330 335
<210> 18<210> 18
<211> 870<211> 870
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 18<400> 18
gtgaccgagc gcctgtccga cgtcaacgcc cgcatcgcct cggtgcggca gctctcatcg 60gtgaccgagc gcctgtccga cgtcaacgcc cgcatcgcct cggtgcggca gctctcatcg 60
gtcatcacgg ccatgcgggg cattgcggcg gcgcgggcgc gggaggcgcg gggtcggctc 120gtcatcacgg ccatgcgggg cattgcggcg gcgcgggcgc gggaggcgcg gggtcggctc 120
gacggcatcc gcgcctatgc gcagaccatc gccgaggcca tcggccatgt gctcgccgtg 180gacggcatcc gcgcctatgc gcagaccatc gccgaggcca tcggccatgt gctcgccgtg 180
ctgcccgagg aggcccgcgc ccggtcctcc gggcaccggc atcggggcca tgcggtcatc 240ctgcccgagg aggcccgcgc ccggtcctcc gggcaccggc atcggggcca tgcggtcatc 240
gccctgtgcg cggagcaggg ctttgccggc gtcttcaacg agcgggtgct ggacgaggcc 300gccctgtgcg cggagcaggg ctttgccggc gtcttcaacg agcgggtgct ggacgaggcc 300
gcccggctgc tgaccggcgg ggcggggccg gccgagctgc tgctggtggg cgaccggggc 360gcccggctgc tgaccggcgg ggcggggccg gccgagctgc tgctggtggg cgaccggggc 360
ctgatggtgg cccgcgagcg ggggctcgat gtctcctggt cggtgcccat ggtggcccat 420ctgatggtgg cccgcgagcg ggggctcgat gtctcctggt cggtgcccat ggtggcccat 420
gcgggccagg cctcggcgct ggcggaccgc atcagcgagg agctctaccg gcggatcgat 480gcgggccagg cctcggcgct ggcggaccgc atcagcgagg agctctaccg gcggatcgat 480
gcgggacggg tgacgcgggt gtcggtggtg cacgccgagc ccgccgcgtc cgccgccatc 540gcgggacggg tgacgcgggt gtcggtggtg cacgccgagc ccgccgcgtc cgccgccatc 540
gagacggtgg tgaaagtgct ggtgccgttc gacttcgccc gcttccccct ggcgcgggtg 600gagacggtgg tgaaagtgct ggtgccgttc gacttcgccc gcttccccct ggcgcgggtg 600
gcatccgccc cgctcatgac catgccgccg ccgcggctgc tggcccagct gtcggaggaa 660gcatccgccc cgctcatgac catgccgccg ccgcggctgc tggcccagct gtcggaggaa 660
tatgtgttcg ccgagctgtg cgaggcgctc accttgtcct tcgcggcgga gaacgaggcc 720tatgtgttcg ccgagctgtg cgaggcgctc accttgtcct tcgcggcgga gaacgaggcc 720
cgcatgcggg ccatgatcgc cgcccgcgcc aatgtggccg ataccctgga gggcctcgtc 780cgcatgcggg ccatgatcgc cgcccgcgcc aatgtggccg ataccctgga gggcctcgtc 780
ggccgcgccc ggcagatgcg ccaggaggag atcaccaacg agatcatcga gctggaaggc 840ggccgcgccc ggcagatgcg cccaggagg atcaccaacg agatcatcga gctggaaggc 840
ggcgccggca gcgcccggca tgcggattga 870ggcgccggca gcgcccggca tgcggattga 870
<210> 19<210> 19
<211> 289<211> 289
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 19<400> 19
Met Thr Glu Arg Leu Ser Asp Val Asn Ala Arg Ile Ala Ser Val Arg Met Thr Glu Arg Leu Ser Asp Val Asn Ala Arg Ile Ala Ser Val Arg
1 5 10 15 1 5 10 15
Gln Leu Ser Ser Val Ile Thr Ala Met Arg Gly Ile Ala Ala Ala Arg Gln Leu Ser Ser Val Ile Thr Ala Met Arg Gly Ile Ala Ala Ala Arg
20 25 30 20 25 30
Ala Arg Glu Ala Arg Gly Arg Leu Asp Gly Ile Arg Ala Tyr Ala Gln Ala Arg Glu Ala Arg Gly Arg Leu Asp Gly Ile Arg Ala Tyr Ala Gln
35 40 45 35 40 45
Thr Ile Ala Glu Ala Ile Gly His Val Leu Ala Val Leu Pro Glu Glu Thr Ile Ala Glu Ala Ile Gly His Val Leu Ala Val Leu Pro Glu Glu
50 55 60 50 55 60
Ala Arg Ala Arg Ser Ser Gly His Arg His Arg Gly His Ala Val Ile Ala Arg Ala Arg Ser Ser Gly His Arg His Arg Gly His Ala Val Ile
65 70 75 80 65 70 75 80
Ala Leu Cys Ala Glu Gln Gly Phe Ala Gly Val Phe Asn Glu Arg Val Ala Leu Cys Ala Glu Gln Gly Phe Ala Gly Val Phe Asn Glu Arg Val
85 90 95 85 90 95
Leu Asp Glu Ala Ala Arg Leu Leu Thr Gly Gly Ala Gly Pro Ala Glu Leu Asp Glu Ala Ala Arg Leu Leu Thr Gly Gly Ala Gly Pro Ala Glu
100 105 110 100 105 110
Leu Leu Leu Val Gly Asp Arg Gly Leu Met Val Ala Arg Glu Arg Gly Leu Leu Leu Val Gly Asp Arg Gly Leu Met Val Ala Arg Glu Arg Gly
115 120 125 115 120 125
Leu Asp Val Ser Trp Ser Val Pro Met Val Ala His Ala Gly Gln Ala Leu Asp Val Ser Trp Ser Val Pro Met Val Ala His Ala Gly Gln Ala
130 135 140 130 135 140
Ser Ala Leu Ala Asp Arg Ile Ser Glu Glu Leu Tyr Arg Arg Ile Asp Ser Ala Leu Ala Asp Arg Ile Ser Glu Glu Leu Tyr Arg Arg Ile Asp
145 150 155 160 145 150 155 160
Ala Gly Arg Val Thr Arg Val Ser Val Val His Ala Glu Pro Ala Ala Ala Gly Arg Val Thr Arg Val Ser Val Val His Ala Glu Pro Ala Ala
165 170 175 165 170 175
Ser Ala Ala Ile Glu Thr Val Val Lys Val Leu Val Pro Phe Asp Phe Ser Ala Ala Ile Glu Thr Val Val Lys Val Leu Val Pro Phe Asp Phe
180 185 190 180 185 190
Ala Arg Phe Pro Leu Ala Arg Val Ala Ser Ala Pro Leu Met Thr Met Ala Arg Phe Pro Leu Ala Arg Val Ala Ser Ala Pro Leu Met Thr Met
195 200 205 195 200 205
Pro Pro Pro Arg Leu Leu Ala Gln Leu Ser Glu Glu Tyr Val Phe Ala Pro Pro Pro Arg Leu Leu Ala Gln Leu Ser Glu Glu Tyr Val Phe Ala
210 215 220 210 215 220
Glu Leu Cys Glu Ala Leu Thr Leu Ser Phe Ala Ala Glu Asn Glu Ala Glu Leu Cys Glu Ala Leu Thr Leu Ser Phe Ala Ala Glu Asn Glu Ala
225 230 235 240 225 230 235 240
Arg Met Arg Ala Met Ile Ala Ala Arg Ala Asn Val Ala Asp Thr Leu Arg Met Arg Ala Met Ile Ala Ala Arg Ala Asn Val Ala Asp Thr Leu
245 250 255 245 250 255
Glu Gly Leu Val Gly Arg Ala Arg Gln Met Arg Gln Glu Glu Ile Thr Glu Gly Leu Val Gly Arg Ala Arg Gln Met Arg Gln Glu Glu Ile Thr
260 265 270 260 265 270
Asn Glu Ile Ile Glu Leu Glu Gly Gly Ala Gly Ser Ala Arg His Ala Asn Glu Ile Ile Glu Leu Glu Gly Gly Ala Gly Ser Ala Arg His Ala
275 280 285 275 280 285
Asp Asp
<210> 20<210> 20
<211> 1536<211> 1536
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 20<400> 20
atgagcacgg gcgcgcaagc gagcgaggat tggctcaccc ggagccgggc ggccctggcc 60atgagcacgg gcgcgcaagc gagcgaggat tggctcaccc ggagccgggc ggccctggcc 60
gggacgcgcc tttcccagca atcccaatcg gtgggccggg tggaggagat ggccgacggc 120gggacgcgcc tttcccagca atcccaatcg gtgggccggg tggaggagat ggccgacggc 120
atcgcccgcg tctccggcct gccggatgtg cggctcgacg agcttctcac cttcgagggc 180atcgcccgcg tctccggcct gccggatgtg cggctcgacg agcttctcac cttcgagggc 180
ggccagaccg gctatgccct caccctcgat cgcaccgaga tcgccgtggt gctgctggat 240ggccagaccg gctatgccct caccctcgat cgcaccgaga tcgccgtggt gctgctggat 240
gacgcctccg gcgtggaggc gggcgcccgg gtgttcggca ccggcgaggt ggtgaaggtg 300gacgcctccg gcgtggaggc gggcgcccgg gtgttcggca ccggcgaggt ggtgaaggtg 300
ccggtggggc cggggctgct gggccgcatc gtcgaccccc tcggccggcc catggaccgc 360ccggtggggc cggggctgct gggccgcatc gtcgaccccc tcggccggcc catggaccgc 360
tccgagccgg tggtggcgca ggcgcaccat cccatcgagc ggccggcgcc ggccatcatc 420tccgagccgg tggtggcgca ggcgcaccat cccatcgagc ggccggcgcc ggccatcatc 420
gcccgcgacc tggtctcgca gccggttcag accggcacgc tggtggtgga tgcgctgttc 480gcccgcgacc tggtctcgca gccggttcag accggcacgc tggtggtgga tgcgctgttc 480
tccctcggcc ggggccagcg cgagctcatc atcggcgacc gggctaccgg caagaccgcc 540tccctcggcc ggggccagcg cgagctcatc atcggcgacc gggctaccgg caagaccgcc 540
atcgcggtgg acaccatcat cagccagaag cattcggaca tcgtgtgcat ctacgtggcg 600atcgcggtgg acaccatcat cagccagaag cattcggaca tcgtgtgcat ctacgtggcg 600
gtgggccagc gcgccgccgc cgtggagcgg gtggtggagg cggtgcgcgc ccacggggcg 660gtgggccagc gcgccgccgc cgtggagcgg gtggtggagg cggtgcgcgc ccacggggcg 660
atcgagcgct gcatcttcgt ggtcgcctcg gccgccgcct cgccagggct gcaatggatc 720atcgagcgct gcatcttcgt ggtcgcctcg gccgccgcct cgccagggct gcaatggatc 720
gcgccgttcg ccggcatgac catggcggaa tatttccgcg acaacggcca gcatgcgctc 780gcgccgttcg ccggcatgac catggcggaa tatttccgcg acaacggcca gcatgcgctc 780
atcatcatcg atgatctcac caagcatgcg gccacccatc gcgagctggc gctgctcacc 840atcatcatcg atgatctcac caagcatgcg gccacccatc gcgagctggc gctgctcacc 840
cacgagccgc cgggccgcga ggcctatccc ggcgacatct tctatgtgca cgcccgcctt 900cacgagccgc cgggccgcga ggcctatccc ggcgacatct tctatgtgca cgcccgcctt 900
ctggagcggg ccgccaagct ctccgccgag ctgggcggtg gctcgctcac ggccctgccc 960ctggagcggg ccgccaagct ctccgccgag ctgggcggtg gctcgctcac ggccctgccc 960
atcgcggaga cggacgcggg aaacctctcc gcctatatcc ccaccaacct catctccatc 1020atcgcggaga cggacgcggg aaacctctcc gcctatatcc ccaccaacct catctccatc 1020
accgatgggc agatcgtgct ggattcgcgg ctgttcgcgg ccaaccagcg cccggcggtg 1080accgatgggc agatcgtgct ggattcgcgg ctgttcgcgg ccaaccagcg cccggcggtg 1080
gatgtgggcc tctccgtgag ccgggtgggc ggcaaggcgc agcatcccgc gcttcgggcc 1140gatgtgggcc tctccgtgag ccgggtgggc ggcaaggcgc agcatcccgc gcttcgggcc 1140
gtgtccgggc gcatccggct cgattattcc cagttcctgg agctggaaat gttcacccgc 1200gtgtccgggc gcatccggct cgattattcc cagttcctgg agctggaaat gttcacccgc 1200
ttcggcggca tcaccgatac ccgcgtgaag gcgcagatca cccggggcga gcgcatccgc 1260ttcggcggca tcaccgatac ccgcgtgaag gcgcagatca cccggggcga gcgcatccgc 1260
gcgctgctca cccagccgcg cttttccacc ctgcgccttc aggacgaggt ggcgctgctg 1320gcgctgctca cccagccgcg cttttccacc ctgcgccttc aggacgaggt ggcgctgctg 1320
gccgcgctgg cggagggggt gttcgacact ttggccccgg ggctgatggg cgccgtgcgt 1380gccgcgctgg cggagggggt gttcgacact ttggccccgg ggctgatggg cgccgtgcgt 1380
gcccgcattc cggcccagct ggatgcgcag gtgaaggacg tggcctcggc cctcgccgag 1440gcccgcattc cggcccagct ggatgcgcag gtgaaggacg tggcctcggc cctcgccgag 1440
ggcaaggtgc tggaggaggg cttgcacgcc cgtctcgtgg cggccgtgcg ggccgtcgcg 1500ggcaaggtgc tggagggaggg cttgcacgcc cgtctcgtgg cggccgtgcg ggccgtcgcg 1500
gcggacgtgg ccgcgaccgc gaaggccggg ccgtga 1536gcggacgtgg ccgcgaccgc gaaggccggg ccgtga 1536
<210> 21<210> 21
<211> 511<211> 511
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 21<400> 21
Met Ser Thr Gly Ala Gln Ala Ser Glu Asp Trp Leu Thr Arg Ser Arg Met Ser Thr Gly Ala Gln Ala Ser Glu Asp Trp Leu Thr Arg Ser Arg
1 5 10 15 1 5 10 15
Ala Ala Leu Ala Gly Thr Arg Leu Ser Gln Gln Ser Gln Ser Val Gly Ala Ala Leu Ala Gly Thr Arg Leu Ser Gln Gln Ser Gln Ser Val Gly
20 25 30 20 25 30
Arg Val Glu Glu Met Ala Asp Gly Ile Ala Arg Val Ser Gly Leu Pro Arg Val Glu Glu Met Ala Asp Gly Ile Ala Arg Val Ser Gly Leu Pro
35 40 45 35 40 45
Asp Val Arg Leu Asp Glu Leu Leu Thr Phe Glu Gly Gly Gln Thr Gly Asp Val Arg Leu Asp Glu Leu Leu Thr Phe Glu Gly Gly Gln Thr Gly
50 55 60 50 55 60
Tyr Ala Leu Thr Leu Asp Arg Thr Glu Ile Ala Val Val Leu Leu Asp Tyr Ala Leu Thr Leu Asp Arg Thr Glu Ile Ala Val Val Leu Leu Asp
65 70 75 80 65 70 75 80
Asp Ala Ser Gly Val Glu Ala Gly Ala Arg Val Phe Gly Thr Gly Glu Asp Ala Ser Gly Val Glu Ala Gly Ala Arg Val Phe Gly Thr Gly Glu
85 90 95 85 90 95
Val Val Lys Val Pro Val Gly Pro Gly Leu Leu Gly Arg Ile Val Asp Val Val Lys Val Pro Val Gly Pro Gly Leu Leu Gly Arg Ile Val Asp
100 105 110 100 105 110
Pro Leu Gly Arg Pro Met Asp Arg Ser Glu Pro Val Val Ala Gln Ala Pro Leu Gly Arg Pro Met Asp Arg Ser Glu Pro Val Val Ala Gln Ala
115 120 125 115 120 125
His His Pro Ile Glu Arg Pro Ala Pro Ala Ile Ile Ala Arg Asp Leu His His Pro Ile Glu Arg Pro Ala Pro Ala Ile Ile Ala Arg Asp Leu
130 135 140 130 135 140
Val Ser Gln Pro Val Gln Thr Gly Thr Leu Val Val Asp Ala Leu Phe Val Ser Gln Pro Val Gln Thr Gly Thr Leu Val Val Asp Ala Leu Phe
145 150 155 160 145 150 155 160
Ser Leu Gly Arg Gly Gln Arg Glu Leu Ile Ile Gly Asp Arg Ala Thr Ser Leu Gly Arg Gly Gln Arg Glu Leu Ile Ile Gly Asp Arg Ala Thr
165 170 175 165 170 175
Gly Lys Thr Ala Ile Ala Val Asp Thr Ile Ile Ser Gln Lys His Ser Gly Lys Thr Ala Ile Ala Val Asp Thr Ile Ile Ser Gln Lys His Ser
180 185 190 180 185 190
Asp Ile Val Cys Ile Tyr Val Ala Val Gly Gln Arg Ala Ala Ala Val Asp Ile Val Cys Ile Tyr Val Ala Val Gly Gln Arg Ala Ala Ala Val
195 200 205 195 200 205
Glu Arg Val Val Glu Ala Val Arg Ala His Gly Ala Ile Glu Arg Cys Glu Arg Val Val Glu Ala Val Arg Ala His Gly Ala Ile Glu Arg Cys
210 215 220 210 215 220
Ile Phe Val Val Ala Ser Ala Ala Ala Ser Pro Gly Leu Gln Trp Ile Ile Phe Val Val Ala Ser Ala Ala Ala Ser Pro Gly Leu Gln Trp Ile
225 230 235 240 225 230 235 240
Ala Pro Phe Ala Gly Met Thr Met Ala Glu Tyr Phe Arg Asp Asn Gly Ala Pro Phe Ala Gly Met Thr Met Ala Glu Tyr Phe Arg Asp Asn Gly
245 250 255 245 250 255
Gln His Ala Leu Ile Ile Ile Asp Asp Leu Thr Lys His Ala Ala Thr Gln His Ala Leu Ile Ile Ile Asp Asp Leu Thr Lys His Ala Ala Thr
260 265 270 260 265 270
His Arg Glu Leu Ala Leu Leu Thr His Glu Pro Pro Gly Arg Glu Ala His Arg Glu Leu Ala Leu Leu Thr His Glu Pro Pro Gly Arg Glu Ala
275 280 285 275 280 285
Tyr Pro Gly Asp Ile Phe Tyr Val His Ala Arg Leu Leu Glu Arg Ala Tyr Pro Gly Asp Ile Phe Tyr Val His Ala Arg Leu Leu Glu Arg Ala
290 295 300 290 295 300
Ala Lys Leu Ser Ala Glu Leu Gly Gly Gly Ser Leu Thr Ala Leu Pro Ala Lys Leu Ser Ala Glu Leu Gly Gly Gly Ser Leu Thr Ala Leu Pro
305 310 315 320 305 310 315 320
Ile Ala Glu Thr Asp Ala Gly Asn Leu Ser Ala Tyr Ile Pro Thr Asn Ile Ala Glu Thr Asp Ala Gly Asn Leu Ser Ala Tyr Ile Pro Thr Asn
325 330 335 325 330 335
Leu Ile Ser Ile Thr Asp Gly Gln Ile Val Leu Asp Ser Arg Leu Phe Leu Ile Ser Ile Thr Asp Gly Gln Ile Val Leu Asp Ser Arg Leu Phe
340 345 350 340 345 350
Ala Ala Asn Gln Arg Pro Ala Val Asp Val Gly Leu Ser Val Ser Arg Ala Ala Asn Gln Arg Pro Ala Val Asp Val Gly Leu Ser Val Ser Arg
355 360 365 355 360 365
Val Gly Gly Lys Ala Gln His Pro Ala Leu Arg Ala Val Ser Gly Arg Val Gly Gly Lys Ala Gln His Pro Ala Leu Arg Ala Val Ser Gly Arg
370 375 380 370 375 380
Ile Arg Leu Asp Tyr Ser Gln Phe Leu Glu Leu Glu Met Phe Thr Arg Ile Arg Leu Asp Tyr Ser Gln Phe Leu Glu Leu Glu Met Phe Thr Arg
385 390 395 400 385 390 395 400
Phe Gly Gly Ile Thr Asp Thr Arg Val Lys Ala Gln Ile Thr Arg Gly Phe Gly Gly Ile Thr Asp Thr Arg Val Lys Ala Gln Ile Thr Arg Gly
405 410 415 405 410 415
Glu Arg Ile Arg Ala Leu Leu Thr Gln Pro Arg Phe Ser Thr Leu Arg Glu Arg Ile Arg Ala Leu Leu Thr Gln Pro Arg Phe Ser Thr Leu Arg
420 425 430 420 425 430
Leu Gln Asp Glu Val Ala Leu Leu Ala Ala Leu Ala Glu Gly Val Phe Leu Gln Asp Glu Val Ala Leu Leu Ala Ala Leu Ala Glu Gly Val Phe
435 440 445 435 440 445
Asp Thr Leu Ala Pro Gly Leu Met Gly Ala Val Arg Ala Arg Ile Pro Asp Thr Leu Ala Pro Gly Leu Met Gly Ala Val Arg Ala Arg Ile Pro
450 455 460 450 455 460
Ala Gln Leu Asp Ala Gln Val Lys Asp Val Ala Ser Ala Leu Ala Glu Ala Gln Leu Asp Ala Gln Val Lys Asp Val Ala Ser Ala Leu Ala Glu
465 470 475 480 465 470 475 480
Gly Lys Val Leu Glu Glu Gly Leu His Ala Arg Leu Val Ala Ala Val Gly Lys Val Leu Glu Glu Gly Leu His Ala Arg Leu Val Ala Ala Val
485 490 495 485 490 495
Arg Ala Val Ala Ala Asp Val Ala Ala Thr Ala Lys Ala Gly Pro Arg Ala Val Ala Ala Asp Val Ala Ala Thr Ala Lys Ala Gly Pro
500 505 510 500 505 510
<210> 22<210> 22
<211> 741<211> 741
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 22<400> 22
atgcagatcg actggtggac gctgggcctg cagacggtca acgtcctcgt tctcatctgg 60atgcagatcg actggtggac gctgggcctg cagacggtca acgtcctcgt tctcatctgg 60
ctcctgagcc gcttcctgtt caagccggtg gcgcaggtca tcgcgcagcg ccgtgccgag 120ctcctgagcc gcttcctgtt caagccggtg gcgcaggtca tcgcgcagcg ccgtgccgag 120
atcgagaagc tggtggagga tgcgcgcgcc gccaaggccg ccgccgaggc cgagcgggac 180atcgagaagc tggtggagga tgcgcgcgcc gccaaggccg ccgccgaggc cgagcgggac 180
acggcgaagg cggaggaggc gcgccttgcc gccgagcgcg gcgcccgcat ggcggcggtc 240acggcgaagg cggaggaggc gcgccttgcc gccgagcgcg gcgcccgcat ggcggcggtc 240
gccaaggagg cggaggcgca gaaggcggca ttgctggccg ccgccaagac cgaggccgag 300gccaaggagg cggaggcgca gaaggcggca ttgctggccg ccgccaagac cgaggccgag 300
gccctgcacg cggccgcgga agcggccatc gtccgggcgc gggcgagcga ggaggaagcc 360gccctgcacg cggccgcgga agcggccatc gtccgggcgc gggcgagcga ggaggaagcc 360
gccgccgacc gcgccagccg ccttgccgtg gacatcgccg ccaagctgct ggaccggctg 420gccgccgacc gcgccagccg ccttgccgtg gacatcgccg ccaagctgct ggaccggctg 420
cccgacgacg cccgggtcgc gggcttcatc gatggcctcg ccgaggggct tgaagccctg 480cccgacgacg cccgggtcgc gggcttcatc gatggcctcg ccgaggggct tgaagccctg 480
cccgaggcga gccgggcggt gatcggcgtc gacggcgcgc cagtgcgcgt gacggccgcg 540cccgaggcga gccgggcggt gatcggcgtc gacggcgcgc cagtgcgcgt gacggccgcg 540
cgcgccctta tgccggcgga ggaggaggcc tgccgcacgc ggctctccca ggcgctgggc 600cgcgccctta tgccggcgga ggaggaggcc tgccgcacgc ggctctccca ggcgctgggc 600
cgtccggtga cgctggccgt gaccatcgac cccgccctca tcgccggcct ggagatggag 660cgtccggtga cgctggccgt gaccatcgac cccgccctca tcgccggcct ggagatggag 660
acgccccacg cggtggtgcg caattccttc aaggccgatc tcgaccgcgt caccgcggcg 720acgccccacg cggtggtgcg caattccttc aaggccgatc tcgaccgcgt caccgcggcg 720
ctcacccatc atgggacctg a 741ctcacccatc atgggacctg a 741
<210> 23<210> 23
<211> 246<211> 246
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 23<400> 23
Met Gln Ile Asp Trp Trp Thr Leu Gly Leu Gln Thr Val Asn Val Leu Met Gln Ile Asp Trp Trp Thr Leu Gly Leu Gln Thr Val Asn Val Leu
1 5 10 15 1 5 10 15
Val Leu Ile Trp Leu Leu Ser Arg Phe Leu Phe Lys Pro Val Ala Gln Val Leu Ile Trp Leu Leu Ser Arg Phe Leu Phe Lys Pro Val Ala Gln
20 25 30 20 25 30
Val Ile Ala Gln Arg Arg Ala Glu Ile Glu Lys Leu Val Glu Asp Ala Val Ile Ala Gln Arg Arg Ala Glu Ile Glu Lys Leu Val Glu Asp Ala
35 40 45 35 40 45
Arg Ala Ala Lys Ala Ala Ala Glu Ala Glu Arg Asp Thr Ala Lys Ala Arg Ala Ala Lys Ala Ala Ala Glu Ala Glu Arg Asp Thr Ala Lys Ala
50 55 60 50 55 60
Glu Glu Ala Arg Leu Ala Ala Glu Arg Gly Ala Arg Met Ala Ala Val Glu Glu Ala Arg Leu Ala Ala Glu Arg Gly Ala Arg Met Ala Ala Val
65 70 75 80 65 70 75 80
Ala Lys Glu Ala Glu Ala Gln Lys Ala Ala Leu Leu Ala Ala Ala Lys Ala Lys Glu Ala Glu Ala Gln Lys Ala Ala Leu Leu Ala Ala Ala Lys
85 90 95 85 90 95
Thr Glu Ala Glu Ala Leu His Ala Ala Ala Glu Ala Ala Ile Val Arg Thr Glu Ala Glu Ala Leu His Ala Ala Ala Glu Ala Ala Ile Val Arg
100 105 110 100 105 110
Ala Arg Ala Ser Glu Glu Glu Ala Ala Ala Asp Arg Ala Ser Arg Leu Ala Arg Ala Ser Glu Glu Glu Ala Ala Ala Asp Arg Ala Ser Arg Leu
115 120 125 115 120 125
Ala Val Asp Ile Ala Ala Lys Leu Leu Asp Arg Leu Pro Asp Asp Ala Ala Val Asp Ile Ala Ala Lys Leu Leu Asp Arg Leu Pro Asp Asp Ala
130 135 140 130 135 140
Arg Val Ala Gly Phe Ile Asp Gly Leu Ala Glu Gly Leu Glu Ala Leu Arg Val Ala Gly Phe Ile Asp Gly Leu Ala Glu Gly Leu Glu Ala Leu
145 150 155 160 145 150 155 160
Pro Glu Ala Ser Arg Ala Val Ile Gly Val Asp Gly Ala Pro Val Arg Pro Glu Ala Ser Arg Ala Val Ile Gly Val Asp Gly Ala Pro Val Arg
165 170 175 165 170 175
Val Thr Ala Ala Arg Ala Leu Met Pro Ala Glu Glu Glu Ala Cys Arg Val Thr Ala Ala Arg Ala Leu Met Pro Ala Glu Glu Glu Ala Cys Arg
180 185 190 180 185 190
Thr Arg Leu Ser Gln Ala Leu Gly Arg Pro Val Thr Leu Ala Val Thr Thr Arg Leu Ser Gln Ala Leu Gly Arg Pro Val Thr Leu Ala Val Thr
195 200 205 195 200 205
Ile Asp Pro Ala Leu Ile Ala Gly Leu Glu Met Glu Thr Pro His Ala Ile Asp Pro Ala Leu Ile Ala Gly Leu Glu Met Glu Thr Pro His Ala
210 215 220 210 215 220
Val Val Arg Asn Ser Phe Lys Ala Asp Leu Asp Arg Val Thr Ala Ala Val Val Arg Asn Ser Phe Lys Ala Asp Leu Asp Arg Val Thr Ala Ala
225 230 235 240 225 230 235 240
Leu Thr His His Gly Thr Leu Thr His His Gly Thr
245 245
<210> 24<210> 24
<211> 243<211> 243
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 24<400> 24
atgactgtcg agatggtcag catcttcgcg gcggcgctcg ccgtctcctt cggcgccatc 60atgactgtcg agatggtcag catcttcgcg gcggcgctcg ccgtctcctt cggcgccatc 60
gggccggccc tgggcgaggg ccgggcggtg gccgcggcca tggacgccat cgcccgccag 120gggccggccc tgggcgaggg ccgggcggtg gccgcggcca tggacgccat cgcccgccag 120
ccggaggcgg ccggaacctt gtcgcgcacg ctcttcgtcg gcctcgccat gatcgagacc 180ccggaggcgg ccggaacctt gtcgcgcacg ctcttcgtcg gcctcgccat gatcgagacc 180
atggcgatct actgcctggt gatcgcgctc ctggtgctct tcgccaatcc gttcgtgaag 240atggcgatct actgcctggt gatcgcgctc ctggtgctct tcgccaatcc gttcgtgaag 240
tga 243tga 243
<210> 25<210> 25
<211> 80<211> 80
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 25<400> 25
Met Thr Val Glu Met Val Ser Ile Phe Ala Ala Ala Leu Ala Val Ser Met Thr Val Glu Met Val Ser Ile Phe Ala Ala Ala Leu Ala Val Ser
1 5 10 15 1 5 10 15
Phe Gly Ala Ile Gly Pro Ala Leu Gly Glu Gly Arg Ala Val Ala Ala Phe Gly Ala Ile Gly Pro Ala Leu Gly Glu Gly Arg Ala Val Ala Ala
20 25 30 20 25 30
Ala Met Asp Ala Ile Ala Arg Gln Pro Glu Ala Ala Gly Thr Leu Ser Ala Met Asp Ala Ile Ala Arg Gln Pro Glu Ala Ala Gly Thr Leu Ser
35 40 45 35 40 45
Arg Thr Leu Phe Val Gly Leu Ala Met Ile Glu Thr Met Ala Ile Tyr Arg Thr Leu Phe Val Gly Leu Ala Met Ile Glu Thr Met Ala Ile Tyr
50 55 60 50 55 60
Cys Leu Val Ile Ala Leu Leu Val Leu Phe Ala Asn Pro Phe Val Lys Cys Leu Val Ile Ala Leu Leu Val Leu Phe Ala Asn Pro Phe Val Lys
65 70 75 80 65 70 75 80
<210> 26<210> 26
<211> 699<211> 699
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 26<400> 26
atgggctcgc cgctgatcct cgaacccctg ttccatatcg ggcccgtgcc catcaccgcg 60atgggctcgc cgctgatcct cgaacccctg ttccatatcg ggcccgtgcc catcaccgcg 60
ccggtggtgg tcacctggct catcatggcc gccttcattg ggctggcgcg gctcatcacc 120ccggtggtgg tcacctggct catcatggcc gccttcattg ggctggcgcg gctcatcacc 120
cggaagcttt ccaccgatcc cacccggacc caggcggcgg tggaaacggt gctgaccgcc 180cggaagcttt ccaccgatcc cacccggacc caggcggcgg tggaaacggt gctgaccgcc 180
atcgattccc agatcgccga caccatgcag gccgatcccg cgccttatcg cgcgctcatc 240atcgattccc agatcgccga caccatgcag gccgatcccg cgccttatcg cgcgctcatc 240
ggcaccatct tcctttatgt gctggtggcc aactggtcct cgctcatccc gggcatcgag 300ggcaccatct tcctttatgt gctggtggcc aactggtcct cgctcatccc gggcatcgag 300
ccgcccacgg cgcatatcga gaccgatgcg gcgctcgctt tcatcgtgtt cgccgccacc 360ccgcccacgg cgcatatcga gaccgatgcg gcgctcgctt tcatcgtgtt cgccgccacc 360
atcgggttcg ggttgaagac aaggggtgtg aagggctatc tcgccacctt cgccgaaccc 420atcgggttcg ggttgaagac aaggggtgtg aagggctatc tcgccacctt cgccgaaccc 420
tcctgggtga tgatcccgct caatgtggtg gagcagatca cccggacctt ctcgctcatc 480tcctgggtga tgatcccgct caatgtggtg gagcagatca cccggacctt ctcgctcatc 480
gtgcgcctgt tcggcaacat catgagcggg gtgttcgtgg tcggcatcat cctgtccctc 540gtgcgcctgt tcggcaacat catgagcggg gtgttcgtgg tcggcatcat cctgtccctc 540
gccgggctgc tggtgcccat ccccctcatg gcgctcgatc tcctgaccgg cgccgtgcag 600gccgggctgc tggtgcccat ccccctcatg gcgctcgatc tcctgaccgg cgccgtgcag 600
gcctacatct tcgcggtgct ggcctgcgtg ttcatcggcg cggccattgg cgaggcgccg 660gcctacatct tcgcggtgct ggcctgcgtg ttcatcggcg cggccattgg cgaggcgccg 660
gcaaagcccc aatcgaagga gccagggaaa acatcatga 699gcaaagcccc aatcgaagga gccagggaaa acatcatga 699
<210> 27<210> 27
<211> 232<211> 232
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 27<400> 27
Met Gly Ser Pro Leu Ile Leu Glu Pro Leu Phe His Ile Gly Pro Val Met Gly Ser Pro Leu Ile Leu Glu Pro Leu Phe His Ile Gly Pro Val
1 5 10 15 1 5 10 15
Pro Ile Thr Ala Pro Val Val Val Thr Trp Leu Ile Met Ala Ala Phe Pro Ile Thr Ala Pro Val Val Val Thr Trp Leu Ile Met Ala Ala Phe
20 25 30 20 25 30
Ile Gly Leu Ala Arg Leu Ile Thr Arg Lys Leu Ser Thr Asp Pro Thr Ile Gly Leu Ala Arg Leu Ile Thr Arg Lys Leu Ser Thr Asp Pro Thr
35 40 45 35 40 45
Arg Thr Gln Ala Ala Val Glu Thr Val Leu Thr Ala Ile Asp Ser Gln Arg Thr Gln Ala Ala Val Glu Thr Val Leu Thr Ala Ile Asp Ser Gln
50 55 60 50 55 60
Ile Ala Asp Thr Met Gln Ala Asp Pro Ala Pro Tyr Arg Ala Leu Ile Ile Ala Asp Thr Met Gln Ala Asp Pro Ala Pro Tyr Arg Ala Leu Ile
65 70 75 80 65 70 75 80
Gly Thr Ile Phe Leu Tyr Val Leu Val Ala Asn Trp Ser Ser Leu Ile Gly Thr Ile Phe Leu Tyr Val Leu Val Ala Asn Trp Ser Ser Leu Ile
85 90 95 85 90 95
Pro Gly Ile Glu Pro Pro Thr Ala His Ile Glu Thr Asp Ala Ala Leu Pro Gly Ile Glu Pro Pro Thr Ala His Ile Glu Thr Asp Ala Ala Leu
100 105 110 100 105 110
Ala Phe Ile Val Phe Ala Ala Thr Ile Gly Phe Gly Leu Lys Thr Arg Ala Phe Ile Val Phe Ala Ala Thr Ile Gly Phe Gly Leu Lys Thr Arg
115 120 125 115 120 125
Gly Val Lys Gly Tyr Leu Ala Thr Phe Ala Glu Pro Ser Trp Val Met Gly Val Lys Gly Tyr Leu Ala Thr Phe Ala Glu Pro Ser Trp Val Met
130 135 140 130 135 140
Ile Pro Leu Asn Val Val Glu Gln Ile Thr Arg Thr Phe Ser Leu Ile Ile Pro Leu Asn Val Val Glu Gln Ile Thr Arg Thr Phe Ser Leu Ile
145 150 155 160 145 150 155 160
Val Arg Leu Phe Gly Asn Ile Met Ser Gly Val Phe Val Val Gly Ile Val Arg Leu Phe Gly Asn Ile Met Ser Gly Val Phe Val Val Gly Ile
165 170 175 165 170 175
Ile Leu Ser Leu Ala Gly Leu Leu Val Pro Ile Pro Leu Met Ala Leu Ile Leu Ser Leu Ala Gly Leu Leu Val Pro Ile Pro Leu Met Ala Leu
180 185 190 180 185 190
Asp Leu Leu Thr Gly Ala Val Gln Ala Tyr Ile Phe Ala Val Leu Ala Asp Leu Leu Thr Gly Ala Val Gln Ala Tyr Ile Phe Ala Val Leu Ala
195 200 205 195 200 205
Cys Val Phe Ile Gly Ala Ala Ile Gly Glu Ala Pro Ala Lys Pro Gln Cys Val Phe Ile Gly Ala Ala Ile Gly Glu Ala Pro Ala Lys Pro Gln
210 215 220 210 215 220
Ser Lys Glu Pro Gly Lys Thr Ser Ser Lys Glu Pro Gly Lys Thr Ser
225 230 225 230
<210> 28<210> 28
<211> 456<211> 456
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 28<400> 28
gtgagcgcgc cgctgcacct caccatcacc acgccggccg ccgttctggt ggaccgtgcc 60gtgagcgcgc cgctgcacct caccatcacc acgccggccg ccgttctggt ggaccgtgcc 60
gacatcgtgg ccctgcgtgc cgaggacgag agcggcagct tcggcatcct gcccggccat 120gacatcgtgg ccctgcgtgc cgaggacgag agcggcagct tcggcatcct gcccggccat 120
gcggatttcc tgaccgttct ggaggcctgc gtggtgcgct tcaaggatgg ggccgacggc 180gcggatttcc tgaccgttct ggaggcctgc gtggtgcgct tcaaggatgg ggccgacggc 180
gtgcattatt gtgctctcag tggtggcgtg ctgtcggtcg aggagggccg gcgcatcgcc 240gtgcattatt gtgctctcag tggtggcgtg ctgtcggtcg aggagggccg gcgcatcgcc 240
atcgcctgcc gtcagggcac ggtgagcgac gacctggtcg ccctggaagg ggcggtggac 300atcgcctgcc gtcagggcac ggtgagcgac gacctggtcg ccctggaagg ggcggtggac 300
gccatgcgtt cggcggagag cgatgccgac aagcgggccc gggtggagca gatgcgcctt 360gccatgcgtt cggcggagag cgatgccgac aagcgggccc gggtggagca gatgcgcctt 360
catgcccacg ccgtgcgcca gctcctgcac tatctgcggc ccggccgggc cggcggcgtg 420catgcccacg ccgtgcgcca gctcctgcac tatctgcggc ccggccgggc cggcggcgtg 420
gcgccggccg ccgcgccgga ggaggggccg tcatga 456gcgccggccg ccgcgccgga ggaggggccg tcatga 456
<210> 29<210> 29
<211> 151<211> 151
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 29<400> 29
Met Ser Ala Pro Leu His Leu Thr Ile Thr Thr Pro Ala Ala Val Leu Met Ser Ala Pro Leu His Leu Thr Ile Thr Thr Pro Ala Ala Val Leu
1 5 10 15 1 5 10 15
Val Asp Arg Ala Asp Ile Val Ala Leu Arg Ala Glu Asp Glu Ser Gly Val Asp Arg Ala Asp Ile Val Ala Leu Arg Ala Glu Asp Glu Ser Gly
20 25 30 20 25 30
Ser Phe Gly Ile Leu Pro Gly His Ala Asp Phe Leu Thr Val Leu Glu Ser Phe Gly Ile Leu Pro Gly His Ala Asp Phe Leu Thr Val Leu Glu
35 40 45 35 40 45
Ala Cys Val Val Arg Phe Lys Asp Gly Ala Asp Gly Val His Tyr Cys Ala Cys Val Val Arg Phe Lys Asp Gly Ala Asp Gly Val His Tyr Cys
50 55 60 50 55 60
Ala Leu Ser Gly Gly Val Leu Ser Val Glu Glu Gly Arg Arg Ile Ala Ala Leu Ser Gly Gly Val Leu Ser Val Glu Glu Gly Arg Arg Ile Ala
65 70 75 80 65 70 75 80
Ile Ala Cys Arg Gln Gly Thr Val Ser Asp Asp Leu Val Ala Leu Glu Ile Ala Cys Arg Gln Gly Thr Val Ser Asp Asp Leu Val Ala Leu Glu
85 90 95 85 90 95
Gly Ala Val Asp Ala Met Arg Ser Ala Glu Ser Asp Ala Asp Lys Arg Gly Ala Val Asp Ala Met Arg Ser Ala Glu Ser Asp Ala Asp Lys Arg
100 105 110 100 105 110
Ala Arg Val Glu Gln Met Arg Leu His Ala His Ala Val Arg Gln Leu Ala Arg Val Glu Gln Met Arg Leu His Ala His Ala Val Arg Gln Leu
115 120 125 115 120 125
Leu His Tyr Leu Arg Pro Gly Arg Ala Gly Gly Val Ala Pro Ala Ala Leu His Tyr Leu Arg Pro Gly Arg Ala Gly Gly Val Ala Pro Ala Ala
130 135 140 130 135 140
Ala Pro Glu Glu Gly Pro Ser Ala Pro Glu Glu Gly Pro Ser
145 150 145 150
<210> 30<210> 30
<211> 1494<211> 1494
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 30<400> 30
atggcagcgg cagatgagga ggcgcaatcg gccgccggcc ccgcctcggg ccgggtggtg 60atggcagcgg cagatgagga ggcgcaatcg gccgccggcc ccgcctcggg ccgggtggtg 60
gccgtgcgcg gcgcggtgat cgacatcgcc tttgcccagc ctccgctgcc gccgctggac 120gccgtgcgcg gcgcggtgat cgacatcgcc tttgcccagc ctccgctgcc gccgctggac 120
gacgcccttc tcatcaccga cggccggggc ggcacggtgc tggtggaggt gcagagccat 180gacgcccttc tcatcaccga cggccggggc ggcacggtgc tggtggaggt gcagagccat 180
atggatcggc acacggtgcg cgccatcgcc cttcaggcca ccaccggcct cagccggggg 240atggatcggc acacggtgcg cgccatcgcc cttcaggcca ccaccggcct cagccggggg 240
ctggaggcgg cgcgggtggg cgggccggtg aaggtgccgg tgggagacca tgtgctcggc 300ctggaggcgg cgcgggtggg cgggccggtg aaggtgccgg tggggagacca tgtgctcggc 300
cgcctcctgg atgtcaccgg cgccatcggc gacaagggcg ggccgctgcc ggccgacgtg 360cgcctcctgg atgtcaccgg cgccatcggc gacaagggcg ggccgctgcc ggccgacgtg 360
cccacgcggc cgatccacca cgcgccgcca tccttcgccg cgcagggcgg cacgtccgat 420cccacgcggc cgatccacca cgcgccgcca tccttcgccg cgcagggcgg cacgtccgat 420
ctgtttcgca ccggcatcaa ggtcatcgac ctcctggcgc ccctcgccca gggcggcaag 480ctgtttcgca ccggcatcaa ggtcatcgac ctcctggcgc ccctcgccca gggcggcaag 480
gcggccatgt tcggcggggc cggcgtgggc aagaccgtgc tggtgatgga gctgatccac 540gcggccatgt tcggcggggc cggcgtgggc aagaccgtgc tggtgatgga gctgatccac 540
gccatggtgg cgagctacaa gggcatctcg gtgtttgccg gcgtggggga gcgctcccgc 600gccatggtgg cgagctacaa gggcatctcg gtgtttgccg gcgtggggga gcgctcccgc 600
gagggccacg agatgctgct ggacatgacc gattccggcg tgctcgaccg caccgttctg 660gagggccacg agatgctgct ggacatgacc gattccggcg tgctcgaccg caccgttctg 660
gtctatggcc agatgaacga gccccccggg gcccgctggc gggtgcccat gacggcgctg 720gtctatggcc agatgaacga gccccccggg gcccgctggc gggtgcccat gacggcgctg 720
accatcgccg aatatttccg cgacgagaag caccagaacg tcctgctgct gatggacaac 780accatcgccg aatatttccg cgacgagaag caccagaacg tcctgctgct gatggacaac 780
atcttccgct tcgtccaggc gggggcggag gtctccggcc ttttgggccg tccgccctcc 840atcttccgct tcgtccaggc gggggcggag gtctccggcc ttttgggccg tccgccctcc 840
cgggtgggat accagccgac gctggcgagc gaggtggcgg cgctccagga acgcatcacc 900cgggtgggat accagccgac gctggcgagc gaggtggcgg cgctccagga acgcatcacc 900
tccgtgggcg aggcctcggt gaccgccatc gaggcggtct acgtgccggc ggatgacttc 960tccgtgggcg aggcctcggt gaccgccatc gaggcggtct acgtgccggc ggatgacttc 960
accgatcccg ccgtgaccac catcgccgcc cacgtggatt ccatggtggt gctctcccgc 1020accgatcccg ccgtgaccac catcgccgcc cacgtggatt ccatggtggt gctctcccgc 1020
gccatggcgg cggagggcat gtatccggcg gtggacccca tctcctcctc gtcggtgctg 1080gccatggcgg cggagggcat gtatccggcg gtggacccca tctcctcctc gtcggtgctg 1080
ctcgacccgc tcatcgtggg ggacgagcat gcgcgcgtcg ccaacgaggt gcgccggacc 1140ctcgacccgc tcatcgtggg ggacgagcat gcgcgcgtcg ccaacgaggt gcgccggacc 1140
atcgagcatt atcgcgagct tcaggatgtg atctcgctgc tgggcatgga ggaattgggc 1200atcgagcatt atcgcgagct tcaggatgtg atctcgctgc tgggcatgga ggaattgggc 1200
accgaggatc gccgcatcgt ggagcgggcg cgccggctcc agcgcttcct cacccagccc 1260accgaggatc gccgcatcgt ggagcgggcg cgccggctcc agcgcttcct cacccagccc 1260
ttcacggtca ccgaggcctt caccggcgtg cccggccgct cggtggccat cgccgacacc 1320ttcacggtca ccgaggcctt caccggcgtg cccggccgct cggtggccat cgccgacacc 1320
atcgccggct gcaggatgat cctgtccggc gcctgcgacg actggcagga aagcgccctc 1380atcgccggct gcaggatgat cctgtccggc gcctgcgacg actggcagga aagcgccctc 1380
tacatggtgg gcaccatcga cgaggcccgc cagaaggagg aggccgctcg cgccaaggcg 1440tacatggtgg gcaccatcga cgaggcccgc cagaaggagg aggccgctcg cgccaaggcg 1440
gggcagggcg ccccggccgg gacggcagcc gagacggcgg aggccgcccc gtga 1494gggcagggcg ccccggccgg gacggcagcc gagacggcgg aggccgcccc gtga 1494
<210> 31<210> 31
<211> 497<211> 497
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 31<400> 31
Met Ala Ala Ala Asp Glu Glu Ala Gln Ser Ala Ala Gly Pro Ala Ser Met Ala Ala Ala Asp Glu Glu Ala Gln Ser Ala Ala Gly Pro Ala Ser
1 5 10 15 1 5 10 15
Gly Arg Val Val Ala Val Arg Gly Ala Val Ile Asp Ile Ala Phe Ala Gly Arg Val Val Ala Val Arg Gly Ala Val Ile Asp Ile Ala Phe Ala
20 25 30 20 25 30
Gln Pro Pro Leu Pro Pro Leu Asp Asp Ala Leu Leu Ile Thr Asp Gly Gln Pro Pro Leu Pro Pro Leu Asp Asp Ala Leu Leu Ile Thr Asp Gly
35 40 45 35 40 45
Arg Gly Gly Thr Val Leu Val Glu Val Gln Ser His Met Asp Arg His Arg Gly Gly Thr Val Leu Val Glu Val Gln Ser His Met Asp Arg His
50 55 60 50 55 60
Thr Val Arg Ala Ile Ala Leu Gln Ala Thr Thr Gly Leu Ser Arg Gly Thr Val Arg Ala Ile Ala Leu Gln Ala Thr Thr Gly Leu Ser Arg Gly
65 70 75 80 65 70 75 80
Leu Glu Ala Ala Arg Val Gly Gly Pro Val Lys Val Pro Val Gly Asp Leu Glu Ala Ala Arg Val Gly Gly Pro Val Lys Val Pro Val Gly Asp
85 90 95 85 90 95
His Val Leu Gly Arg Leu Leu Asp Val Thr Gly Ala Ile Gly Asp Lys His Val Leu Gly Arg Leu Leu Asp Val Thr Gly Ala Ile Gly Asp Lys
100 105 110 100 105 110
Gly Gly Pro Leu Pro Ala Asp Val Pro Thr Arg Pro Ile His His Ala Gly Gly Pro Leu Pro Ala Asp Val Pro Thr Arg Pro Ile His His Ala
115 120 125 115 120 125
Pro Pro Ser Phe Ala Ala Gln Gly Gly Thr Ser Asp Leu Phe Arg Thr Pro Pro Ser Phe Ala Ala Gln Gly Gly Thr Ser Asp Leu Phe Arg Thr
130 135 140 130 135 140
Gly Ile Lys Val Ile Asp Leu Leu Ala Pro Leu Ala Gln Gly Gly Lys Gly Ile Lys Val Ile Asp Leu Leu Ala Pro Leu Ala Gln Gly Gly Lys
145 150 155 160 145 150 155 160
Ala Ala Met Phe Gly Gly Ala Gly Val Gly Lys Thr Val Leu Val Met Ala Ala Met Phe Gly Gly Ala Gly Val Gly Lys Thr Val Leu Val Met
165 170 175 165 170 175
Glu Leu Ile His Ala Met Val Ala Ser Tyr Lys Gly Ile Ser Val Phe Glu Leu Ile His Ala Met Val Ala Ser Tyr Lys Gly Ile Ser Val Phe
180 185 190 180 185 190
Ala Gly Val Gly Glu Arg Ser Arg Glu Gly His Glu Met Leu Leu Asp Ala Gly Val Gly Glu Arg Ser Arg Glu Gly His Glu Met Leu Leu Asp
195 200 205 195 200 205
Met Thr Asp Ser Gly Val Leu Asp Arg Thr Val Leu Val Tyr Gly Gln Met Thr Asp Ser Gly Val Leu Asp Arg Thr Val Leu Val Tyr Gly Gln
210 215 220 210 215 220
Met Asn Glu Pro Pro Gly Ala Arg Trp Arg Val Pro Met Thr Ala Leu Met Asn Glu Pro Pro Gly Ala Arg Trp Arg Val Pro Met Thr Ala Leu
225 230 235 240 225 230 235 240
Thr Ile Ala Glu Tyr Phe Arg Asp Glu Lys His Gln Asn Val Leu Leu Thr Ile Ala Glu Tyr Phe Arg Asp Glu Lys His Gln Asn Val Leu Leu
245 250 255 245 250 255
Leu Met Asp Asn Ile Phe Arg Phe Val Gln Ala Gly Ala Glu Val Ser Leu Met Asp Asn Ile Phe Arg Phe Val Gln Ala Gly Ala Glu Val Ser
260 265 270 260 265 270
Gly Leu Leu Gly Arg Pro Pro Ser Arg Val Gly Tyr Gln Pro Thr Leu Gly Leu Leu Gly Arg Pro Pro Ser Arg Val Gly Tyr Gln Pro Thr Leu
275 280 285 275 280 285
Ala Ser Glu Val Ala Ala Leu Gln Glu Arg Ile Thr Ser Val Gly Glu Ala Ser Glu Val Ala Ala Leu Gln Glu Arg Ile Thr Ser Val Gly Glu
290 295 300 290 295 300
Ala Ser Val Thr Ala Ile Glu Ala Val Tyr Val Pro Ala Asp Asp Phe Ala Ser Val Thr Ala Ile Glu Ala Val Tyr Val Pro Ala Asp Asp Phe
305 310 315 320 305 310 315 320
Thr Asp Pro Ala Val Thr Thr Ile Ala Ala His Val Asp Ser Met Val Thr Asp Pro Ala Val Thr Thr Ile Ala Ala His Val Asp Ser Met Val
325 330 335 325 330 335
Val Leu Ser Arg Ala Met Ala Ala Glu Gly Met Tyr Pro Ala Val Asp Val Leu Ser Arg Ala Met Ala Ala Glu Gly Met Tyr Pro Ala Val Asp
340 345 350 340 345 350
Pro Ile Ser Ser Ser Ser Val Leu Leu Asp Pro Leu Ile Val Gly Asp Pro Ile Ser Ser Ser Ser Val Leu Leu Asp Pro Leu Ile Val Gly Asp
355 360 365 355 360 365
Glu His Ala Arg Val Ala Asn Glu Val Arg Arg Thr Ile Glu His Tyr Glu His Ala Arg Val Ala Asn Glu Val Arg Arg Thr Ile Glu His Tyr
370 375 380 370 375 380
Arg Glu Leu Gln Asp Val Ile Ser Leu Leu Gly Met Glu Glu Leu Gly Arg Glu Leu Gln Asp Val Ile Ser Leu Leu Gly Met Glu Glu Leu Gly
385 390 395 400 385 390 395 400
Thr Glu Asp Arg Arg Ile Val Glu Arg Ala Arg Arg Leu Gln Arg Phe Thr Glu Asp Arg Arg Ile Val Glu Arg Ala Arg Arg Leu Gln Arg Phe
405 410 415 405 410 415
Leu Thr Gln Pro Phe Thr Val Thr Glu Ala Phe Thr Gly Val Pro Gly Leu Thr Gln Pro Phe Thr Val Thr Glu Ala Phe Thr Gly Val Pro Gly
420 425 430 420 425 430
Arg Ser Val Ala Ile Ala Asp Thr Ile Ala Gly Cys Arg Met Ile Leu Arg Ser Val Ala Ile Ala Asp Thr Ile Ala Gly Cys Arg Met Ile Leu
435 440 445 435 440 445
Ser Gly Ala Cys Asp Asp Trp Gln Glu Ser Ala Leu Tyr Met Val Gly Ser Gly Ala Cys Asp Asp Trp Gln Glu Ser Ala Leu Tyr Met Val Gly
450 455 460 450 455 460
Thr Ile Asp Glu Ala Arg Gln Lys Glu Glu Ala Ala Arg Ala Lys Ala Thr Ile Asp Glu Ala Arg Gln Lys Glu Glu Ala Ala Arg Ala Lys Ala
465 470 475 480 465 470 475 480
Gly Gln Gly Ala Pro Ala Gly Thr Ala Ala Glu Thr Ala Glu Ala Ala Gly Gln Gly Ala Pro Ala Gly Thr Ala Ala Glu Thr Ala Glu Ala Ala
485 490 495 485 490 495
Pro Pro
<210> 32<210> 32
<211> 1437<211> 1437
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 32<400> 32
atggcgaaca aggtcggacg catcacccag atcatcggcg ccgtcgtcga cgtgcagttc 60atggcgaaca aggtcggacg catcaccag atcatcggcg ccgtcgtcga cgtgcagttc 60
gacgggcatc tgccggcgat tctcaacgcg atcgagacca ccaaccaggg caaccggctg 120gacgggcatc tgccggcgat tctcaacgcg atcgagacca ccaaccaggg caaccggctg 120
gtgctcgaag tggctcagca tctcggcgag aacaccgtgc gctgcatcgc catggatgcc 180gtgctcgaag tggctcagca tctcggcgag aacaccgtgc gctgcatcgc catggatgcc 180
actgaaggcc tggtgcgtgg ccaggaggtg gccgacaccg atgcgcccat ccaggtgccc 240actgaaggcc tggtgcgtgg ccaggaggtg gccgacaccg atgcgcccat ccaggtgccc 240
gtgggcgccg ccaccctcgg ccgcatcatg aacgtgatcg gcgagccggt ggacgagctg 300gtgggcgccg ccaccctcgg ccgcatcatg aacgtgatcg gcgagccggt ggacgagctg 300
ggccccatcg agggcgaagc gctgcgcggc atccatcagc cggccccctc ctatgcggag 360ggccccatcg agggcgaagc gctgcgcggc atccatcagc cggccccctc ctatgcggag 360
caggccacgg aagctgagat cctcgtcacc ggcatcaagg tggtggatct gctggcgccc 420caggccacgg aagctgagat cctcgtcacc ggcatcaagg tggtggatct gctggcgccc 420
tattccaagg gcggcaaggt gggcctgttc ggcggcgccg gcgtgggcaa gaccgtgctc 480tattccaagg gcggcaaggt gggcctgttc ggcggcgccg gcgtgggcaa gaccgtgctc 480
atcatggagc tgatcaacaa cgtggccaag gcgcacggcg gctattccgt gttcgccggc 540atcatggagc tgatcaacaa cgtggccaag gcgcacggcg gctattccgt gttcgccggc 540
gtgggtgagc gcacccgcga gggcaacgac ctctaccacg agatgatcga gtccaacgtg 600gtgggtgagc gcacccgcga gggcaacgac ctctaccacg agatgatcga gtccaacgtg 600
aacaaggacc cgcacgagaa caatggctcg gcggccggtt ccaagtgcgc cctggtctat 660aacaaggacc cgcacgagaa caatggctcg gcggccggtt ccaagtgcgc cctggtctat 660
ggccagatga acgagccgcc cggcgcccgc gcccgcgtgg ccctcaccgg cctcaccgtc 720ggccagatga acgagccgcc cggcgcccgc gcccgcgtgg ccctcaccgg cctcaccgtc 720
gccgagcatt tccgcgacca gggccaggac gtgctgttct tcgtggacaa catcttccgc 780gccgagcatt tccgcgacca gggccaggac gtgctgttct tcgtggacaa catcttccgc 780
ttcacccagg cgggctccga ggtgtcggcg cttctcggcc gcatcccctc ggcggtgggc 840ttcacccagg cgggctccga ggtgtcggcg cttctcggcc gcatcccctc ggcggtgggc 840
taccagccga cgctggccac cgacatgggc cagctgcagg agcgcatcac caccaccacc 900taccagccga cgctggccac cgacatgggc cagctgcagg agcgcatcac caccaccacc 900
aagggctcca tcacctcggt gcaggccatc tacgtgccgg cggacgatct gaccgatccg 960aagggctcca tcacctcggt gcaggccatc tacgtgccgg cggacgatct gaccgatccg 960
gcgccggccg cctccttcgc ccatctggac gccaccacgg tgctgtcgcg ctccatcgcg 1020gcgccggccg cctccttcgc ccatctggac gccaccacgg tgctgtcgcg ctccatcgcg 1020
gagaagggca tctacccggc ggtggatccg ctggactcca cctcgcgcat gctgtctccc 1080gagaagggca tctacccggc ggtggatccg ctggactcca cctcgcgcat gctgtctccc 1080
gccatcctcg gcgacgagca ctacaacacc gcgcgccagg tgcagcagac cctgcagcgc 1140gccatcctcg gcgacgagca ctacaacacc gcgcgccagg tgcagcagac cctgcagcgc 1140
tacaaggcgc tccaggacat catcgccatc ctgggcatgg acgaactctc cgaagaggac 1200tacaaggcgc tccaggacat catcgccatc ctgggcatgg acgaactctc cgaagaggac 1200
aagctcaccg tggcccgcgc ccgcaagatc gagcgcttcc tctcccagcc cttccacgtg 1260aagctcaccg tggcccgcgc ccgcaagatc gagcgcttcc tctcccagcc cttccacgtg 1260
gccgaggtgt tcaccggttc gcccggcaag ctggtcgacc tcgccgacac catcaagggc 1320gccgaggtgt tcaccggttc gcccggcaag ctggtcgacc tcgccgacac catcaagggc 1320
ttcaagggcc tggtggacgg caagtacgac tacctgcccg agcaggcctt ctacatggtg 1380ttcaagggcc tggtggacgg caagtacgac tacctgcccg agcaggcctt ctacatggtg 1380
ggcaccatcg aagaagccat cgagaagggc aagaagctgg cggccgaggc ggcctga 1437ggcaccatcg aagaagccat cgagaagggc aagaagctgg cggccgaggc ggcctga 1437
<210> 33<210> 33
<211> 478<211> 478
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 33<400> 33
Met Ala Asn Lys Val Gly Arg Ile Thr Gln Ile Ile Gly Ala Val Val Met Ala Asn Lys Val Gly Arg Ile Thr Gln Ile Ile Gly Ala Val Val
1 5 10 15 1 5 10 15
Asp Val Gln Phe Asp Gly His Leu Pro Ala Ile Leu Asn Ala Ile Glu Asp Val Gln Phe Asp Gly His Leu Pro Ala Ile Leu Asn Ala Ile Glu
20 25 30 20 25 30
Thr Thr Asn Gln Gly Asn Arg Leu Val Leu Glu Val Ala Gln His Leu Thr Thr Asn Gln Gly Asn Arg Leu Val Leu Glu Val Ala Gln His Leu
35 40 45 35 40 45
Gly Glu Asn Thr Val Arg Cys Ile Ala Met Asp Ala Thr Glu Gly Leu Gly Glu Asn Thr Val Arg Cys Ile Ala Met Asp Ala Thr Glu Gly Leu
50 55 60 50 55 60
Val Arg Gly Gln Glu Val Ala Asp Thr Asp Ala Pro Ile Gln Val Pro Val Arg Gly Gln Glu Val Ala Asp Thr Asp Ala Pro Ile Gln Val Pro
65 70 75 80 65 70 75 80
Val Gly Ala Ala Thr Leu Gly Arg Ile Met Asn Val Ile Gly Glu Pro Val Gly Ala Ala Thr Leu Gly Arg Ile Met Asn Val Ile Gly Glu Pro
85 90 95 85 90 95
Val Asp Glu Leu Gly Pro Ile Glu Gly Glu Ala Leu Arg Gly Ile His Val Asp Glu Leu Gly Pro Ile Glu Gly Glu Ala Leu Arg Gly Ile His
100 105 110 100 105 110
Gln Pro Ala Pro Ser Tyr Ala Glu Gln Ala Thr Glu Ala Glu Ile Leu Gln Pro Ala Pro Ser Tyr Ala Glu Gln Ala Thr Glu Ala Glu Ile Leu
115 120 125 115 120 125
Val Thr Gly Ile Lys Val Val Asp Leu Leu Ala Pro Tyr Ser Lys Gly Val Thr Gly Ile Lys Val Val Asp Leu Leu Ala Pro Tyr Ser Lys Gly
130 135 140 130 135 140
Gly Lys Val Gly Leu Phe Gly Gly Ala Gly Val Gly Lys Thr Val Leu Gly Lys Val Gly Leu Phe Gly Gly Ala Gly Val Gly Lys Thr Val Leu
145 150 155 160 145 150 155 160
Ile Met Glu Leu Ile Asn Asn Val Ala Lys Ala His Gly Gly Tyr Ser Ile Met Glu Leu Ile Asn Asn Val Ala Lys Ala His Gly Gly Tyr Ser
165 170 175 165 170 175
Val Phe Ala Gly Val Gly Glu Arg Thr Arg Glu Gly Asn Asp Leu Tyr Val Phe Ala Gly Val Gly Glu Arg Thr Arg Glu Gly Asn Asp Leu Tyr
180 185 190 180 185 190
His Glu Met Ile Glu Ser Asn Val Asn Lys Asp Pro His Glu Asn Asn His Glu Met Ile Glu Ser Asn Val Asn Lys Asp Pro His Glu Asn Asn
195 200 205 195 200 205
Gly Ser Ala Ala Gly Ser Lys Cys Ala Leu Val Tyr Gly Gln Met Asn Gly Ser Ala Ala Gly Ser Lys Cys Ala Leu Val Tyr Gly Gln Met Asn
210 215 220 210 215 220
Glu Pro Pro Gly Ala Arg Ala Arg Val Ala Leu Thr Gly Leu Thr Val Glu Pro Pro Gly Ala Arg Ala Arg Val Ala Leu Thr Gly Leu Thr Val
225 230 235 240 225 230 235 240
Ala Glu His Phe Arg Asp Gln Gly Gln Asp Val Leu Phe Phe Val Asp Ala Glu His Phe Arg Asp Gln Gly Gln Asp Val Leu Phe Phe Val Asp
245 250 255 245 250 255
Asn Ile Phe Arg Phe Thr Gln Ala Gly Ser Glu Val Ser Ala Leu Leu Asn Ile Phe Arg Phe Thr Gln Ala Gly Ser Glu Val Ser Ala Leu Leu
260 265 270 260 265 270
Gly Arg Ile Pro Ser Ala Val Gly Tyr Gln Pro Thr Leu Ala Thr Asp Gly Arg Ile Pro Ser Ala Val Gly Tyr Gln Pro Thr Leu Ala Thr Asp
275 280 285 275 280 285
Met Gly Gln Leu Gln Glu Arg Ile Thr Thr Thr Thr Lys Gly Ser Ile Met Gly Gln Leu Gln Glu Arg Ile Thr Thr Thr Thr Lys Gly Ser Ile
290 295 300 290 295 300
Thr Ser Val Gln Ala Ile Tyr Val Pro Ala Asp Asp Leu Thr Asp Pro Thr Ser Val Gln Ala Ile Tyr Val Pro Ala Asp Asp Leu Thr Asp Pro
305 310 315 320 305 310 315 320
Ala Pro Ala Ala Ser Phe Ala His Leu Asp Ala Thr Thr Val Leu Ser Ala Pro Ala Ala Ser Phe Ala His Leu Asp Ala Thr Thr Val Leu Ser
325 330 335 325 330 335
Arg Ser Ile Ala Glu Lys Gly Ile Tyr Pro Ala Val Asp Pro Leu Asp Arg Ser Ile Ala Glu Lys Gly Ile Tyr Pro Ala Val Asp Pro Leu Asp
340 345 350 340 345 350
Ser Thr Ser Arg Met Leu Ser Pro Ala Ile Leu Gly Asp Glu His Tyr Ser Thr Ser Arg Met Leu Ser Pro Ala Ile Leu Gly Asp Glu His Tyr
355 360 365 355 360 365
Asn Thr Ala Arg Gln Val Gln Gln Thr Leu Gln Arg Tyr Lys Ala Leu Asn Thr Ala Arg Gln Val Gln Gln Thr Leu Gln Arg Tyr Lys Ala Leu
370 375 380 370 375 380
Gln Asp Ile Ile Ala Ile Leu Gly Met Asp Glu Leu Ser Glu Glu Asp Gln Asp Ile Ile Ala Ile Leu Gly Met Asp Glu Leu Ser Glu Glu Asp
385 390 395 400 385 390 395 400
Lys Leu Thr Val Ala Arg Ala Arg Lys Ile Glu Arg Phe Leu Ser Gln Lys Leu Thr Val Ala Arg Ala Arg Lys Ile Glu Arg Phe Leu Ser Gln
405 410 415 405 410 415
Pro Phe His Val Ala Glu Val Phe Thr Gly Ser Pro Gly Lys Leu Val Pro Phe His Val Ala Glu Val Phe Thr Gly Ser Pro Gly Lys Leu Val
420 425 430 420 425 430
Asp Leu Ala Asp Thr Ile Lys Gly Phe Lys Gly Leu Val Asp Gly Lys Asp Leu Ala Asp Thr Ile Lys Gly Phe Lys Gly Leu Val Asp Gly Lys
435 440 445 435 440 445
Tyr Asp Tyr Leu Pro Glu Gln Ala Phe Tyr Met Val Gly Thr Ile Glu Tyr Asp Tyr Leu Pro Glu Gln Ala Phe Tyr Met Val Gly Thr Ile Glu
450 455 460 450 455 460
Glu Ala Ile Glu Lys Gly Lys Lys Leu Ala Ala Glu Ala Ala Glu Ala Ile Glu Lys Gly Lys Lys Leu Ala Ala Glu Ala Ala
465 470 475 465 470 475
<210> 34<210> 34
<211> 876<211> 876
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 34<400> 34
atggcgagtc tgaaggacct gagaaaccgc attgcctcgg tgaaggcgac gcagaagatc 60atggcgagtc tgaaggacct gagaaaccgc attgcctcgg tgaaggcgac gcagaagatc 60
accaaggcga tgcagatggt cgccgcggcg aagctgcgtc gcgcccaggc ggcggctgaa 120accaaggcga tgcagatggt cgccgcggcg aagctgcgtc gcgcccaggc ggcggctgaa 120
gcggcccgtc cctatgcgga acgcatggag acggtgctcg gaaatcttgc ctccggcatg 180gcggcccgtc cctatgcgga acgcatggag acggtgctcg gaaatcttgc ctccggcatg 180
gtggtgggcg cgcaggcgcc tgttctcatg accgggacgg gcaagagcga cacccacctg 240gtggtgggcg cgcaggcgcc tgttctcatg accgggacgg gcaagagcga cacccacctg 240
ctgctggtgt gcaccggcga gcgcggcctg tgcggcgcct tcaactcgtc catcgtgcgc 300ctgctggtgt gcaccggcga gcgcggcctg tgcggcgcct tcaactcgtc catcgtgcgc 300
ttcgcccgcg agcgggcgca gctgctgctg gccgagggca agaaggtgaa aatcctgtgc 360ttcgcccgcg agcgggcgca gctgctgctg gccgagggca agaaggtgaa aatcctgtgc 360
gtgggccgca agggccacga gcagctgcgc cgcatctacc cggacaacat catcgacgtg 420gtgggccgca agggccacga gcagctgcgc cgcatctacc cggacaacat catcgacgtg 420
gtggacctgc gcgcggtgcg caacatcggc ttcaaggagg ccgacgccat cgcccgcaag 480gtggacctgc gcgcggtgcg caacatcggc ttcaaggagg ccgacgccat cgcccgcaag 480
gtgctggccc tgctcgatga aggcgcattc gacgtctgca cgctcttcta ctcccacttc 540gtgctggccc tgctcgatga aggcgcattc gacgtctgca cgctcttcta ctcccacttc 540
aggagcgtga tcgcccaggt gccgacggcc cagcagctca ttccggccac cttcgacgag 600aggagcgtga tcgcccaggt gccgacggcc cagcagctca ttccggccac cttcgacgag 600
cggccggccg tcgccgatgc gccggtctat gaatatgagc cggaggagga ggagatcctc 660cggccggccg tcgccgatgc gccggtctat gaatatgagc cggagggagga ggagatcctc 660
gccgagctgc tgccgcgcaa cgtggcggtg cagatcttca aggccctcct cgagaaccag 720gccgagctgc tgccgcgcaa cgtggcggtg cagatcttca aggccctcct cgagaaccag 720
gcttctttct atggctccca gatgagcgcc atggacaacg ccacgcgcaa tgcgggcgag 780gcttctttct atggctccca gatgagcgcc atggacaacg ccacgcgcaa tgcgggcgag 780
atgatcaaga agcagacgct cacctacaac cgtacccgcc aggccatgat cacgaaggaa 840atgatcaaga agcagacgct cacctacaac cgtacccgcc aggccatgat cacgaaggaa 840
ctcatcgaga tcatctccgg cgccgaggcc gtctga 876ctcatcgaga tcatctccgg cgccgaggcc gtctga 876
<210> 35<210> 35
<211> 291<211> 291
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 35<400> 35
Met Ala Ser Leu Lys Asp Leu Arg Asn Arg Ile Ala Ser Val Lys Ala Met Ala Ser Leu Lys Asp Leu Arg Asn Arg Ile Ala Ser Val Lys Ala
1 5 10 15 1 5 10 15
Thr Gln Lys Ile Thr Lys Ala Met Gln Met Val Ala Ala Ala Lys Leu Thr Gln Lys Ile Thr Lys Ala Met Gln Met Val Ala Ala Ala Lys Leu
20 25 30 20 25 30
Arg Arg Ala Gln Ala Ala Ala Glu Ala Ala Arg Pro Tyr Ala Glu Arg Arg Arg Ala Gln Ala Ala Ala Glu Ala Ala Arg Pro Tyr Ala Glu Arg
35 40 45 35 40 45
Met Glu Thr Val Leu Gly Asn Leu Ala Ser Gly Met Val Val Gly Ala Met Glu Thr Val Leu Gly Asn Leu Ala Ser Gly Met Val Val Gly Ala
50 55 60 50 55 60
Gln Ala Pro Val Leu Met Thr Gly Thr Gly Lys Ser Asp Thr His Leu Gln Ala Pro Val Leu Met Thr Gly Thr Gly Lys Ser Asp Thr His Leu
65 70 75 80 65 70 75 80
Leu Leu Val Cys Thr Gly Glu Arg Gly Leu Cys Gly Ala Phe Asn Ser Leu Leu Val Cys Thr Gly Glu Arg Gly Leu Cys Gly Ala Phe Asn Ser
85 90 95 85 90 95
Ser Ile Val Arg Phe Ala Arg Glu Arg Ala Gln Leu Leu Leu Ala Glu Ser Ile Val Arg Phe Ala Arg Glu Arg Ala Gln Leu Leu Leu Ala Glu
100 105 110 100 105 110
Gly Lys Lys Val Lys Ile Leu Cys Val Gly Arg Lys Gly His Glu Gln Gly Lys Lys Val Lys Ile Leu Cys Val Gly Arg Lys Gly His Glu Gln
115 120 125 115 120 125
Leu Arg Arg Ile Tyr Pro Asp Asn Ile Ile Asp Val Val Asp Leu Arg Leu Arg Arg Ile Tyr Pro Asp Asn Ile Ile Asp Val Val Asp Leu Arg
130 135 140 130 135 140
Ala Val Arg Asn Ile Gly Phe Lys Glu Ala Asp Ala Ile Ala Arg Lys Ala Val Arg Asn Ile Gly Phe Lys Glu Ala Asp Ala Ile Ala Arg Lys
145 150 155 160 145 150 155 160
Val Leu Ala Leu Leu Asp Glu Gly Ala Phe Asp Val Cys Thr Leu Phe Val Leu Ala Leu Leu Asp Glu Gly Ala Phe Asp Val Cys Thr Leu Phe
165 170 175 165 170 175
Tyr Ser His Phe Arg Ser Val Ile Ala Gln Val Pro Thr Ala Gln Gln Tyr Ser His Phe Arg Ser Val Ile Ala Gln Val Pro Thr Ala Gln Gln
180 185 190 180 185 190
Leu Ile Pro Ala Thr Phe Asp Glu Arg Pro Ala Val Ala Asp Ala Pro Leu Ile Pro Ala Thr Phe Asp Glu Arg Pro Ala Val Ala Asp Ala Pro
195 200 205 195 200 205
Val Tyr Glu Tyr Glu Pro Glu Glu Glu Glu Ile Leu Ala Glu Leu Leu Val Tyr Glu Tyr Glu Pro Glu Glu Glu Glu Ile Leu Ala Glu Leu Leu
210 215 220 210 215 220
Pro Arg Asn Val Ala Val Gln Ile Phe Lys Ala Leu Leu Glu Asn Gln Pro Arg Asn Val Ala Val Gln Ile Phe Lys Ala Leu Leu Glu Asn Gln
225 230 235 240 225 230 235 240
Ala Ser Phe Tyr Gly Ser Gln Met Ser Ala Met Asp Asn Ala Thr Arg Ala Ser Phe Tyr Gly Ser Gln Met Ser Ala Met Asp Asn Ala Thr Arg
245 250 255 245 250 255
Asn Ala Gly Glu Met Ile Lys Lys Gln Thr Leu Thr Tyr Asn Arg Thr Asn Ala Gly Glu Met Ile Lys Lys Gln Thr Leu Thr Tyr Asn Arg Thr
260 265 270 260 265 270
Arg Gln Ala Met Ile Thr Lys Glu Leu Ile Glu Ile Ile Ser Gly Ala Arg Gln Ala Met Ile Thr Lys Glu Leu Ile Glu Ile Ile Ser Gly Ala
275 280 285 275 280 285
Glu Ala Val Glu Ala Val
290 290
<210> 36<210> 36
<211> 1530<211> 1530
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 36<400> 36
atggacattc gagccgctga aatctctgcc atcctgaaag agcagatcca gaatttcggc 60atggacattc gagccgctga aatctctgcc atcctgaaag agcagatcca gaatttcggc 60
caggaggcgg aagtctccga ggtgggtcag gttctgtccg tgggtgacgg catcgcgcgc 120caggaggcgg aagtctccga ggtgggtcag gttctgtccg tgggtgacgg catcgcgcgc 120
gtctacggcc tcgacaacgt ccaggcgggc gagatggtcg agttcgagaa cggcacgcgc 180gtctacggcc tcgacaacgt ccaggcgggc gagatggtcg agttcgagaa cggcacgcgc 180
ggcatggcgc tgaacctcga gctcgacaat gtcggcatcg tgatcttcgg ttccgaccgc 240ggcatggcgc tgaacctcga gctcgacaat gtcggcatcg tgatcttcgg ttccgaccgc 240
gagatcaagg aaggccagac cgtcaagcgg accggcgcca tcgtggacgc ccccgtcggc 300gagatcaagg aaggccagac cgtcaagcgg accggcgcca tcgtggacgc ccccgtcggc 300
aagggcctgc tcggccgcgt cgtggacgct ctcggcaacc cgatcgacgg caagggcccg 360aagggcctgc tcggccgcgt cgtggacgct ctcggcaacc cgatcgacgg caagggcccg 360
atcatgttca ccgagcgtcg ccgggtcgac gtgaaggcgc cgggcatcat cccgcgcaag 420atcatgttca ccgagcgtcg ccgggtcgac gtgaaggcgc cgggcatcat cccgcgcaag 420
tcggtgcacg agcccatgca gaccggcctg aaggccatcg atgcgctcat ccccatcggc 480tcggtgcacg agcccatgca gaccggcctg aaggccatcg atgcgctcat ccccatcggc 480
cgcggccagc gcgagctcat catcggcgac cgccagaccg gcaagaccgc cgtggcgctc 540cgcggccagc gcgagctcat catcggcgac cgccagaccg gcaagaccgc cgtggcgctc 540
gactcgatcc tgaaccagaa gcccatcaac cagggcgacg acgagaaggc caagctctac 600gactcgatcc tgaaccagaa gcccatcaac cagggcgacg acgagaaggc caagctctac 600
tgcgtctatg tcgcggtggg ccagaagcgt tccactgtcg cgcagttcgt gaaggtgctc 660tgcgtctatg tcgcggtggg ccagaagcgt tccactgtcg cgcagttcgt gaaggtgctc 660
gaggagcacg gcgcgctgga atattccatc gtcgtcgccg ccaccgcctc ggacgcggcc 720gaggagcacg gcgcgctgga atattccatc gtcgtcgccg ccaccgcctc ggacgcggcc 720
cccatgcagt tcctggcgcc gttcaccggc accgccatgg gcgagtattt ccgcgacaac 780cccatgcagt tcctggcgcc gttcaccggc accgccatgg gcgagtattt ccgcgacaac 780
ggcatgcacg ccctcatcat ccatgatgac ctgtccaagc aggccgtggc ctaccgccag 840ggcatgcacg ccctcatcat ccatgatgac ctgtccaagc aggccgtggc ctaccgccag 840
atgtcgctgc tgctgcgccg cccgccgggc cgcgaggcct atcccggcga tgtgttctac 900atgtcgctgc tgctgcgccg cccgccgggc cgcgaggcct atcccggcga tgtgttctac 900
ctgcactccc gcctcttgga gcgcgccgcc aagctcaatg acgagcacgg cgccggctcg 960ctgcactccc gcctcttgga gcgcgccgcc aagctcaatg acgagcacgg cgccggctcg 960
ctgaccgccc tgccggtgat cgagacccag gccaacgacg tgtcggccta catcccgacc 1020ctgaccgccc tgccggtgat cgagacccag gccaacgacg tgtcggccta catcccgacc 1020
aacgtgatct ccatcaccga cggtcagatc ttccttgaat ccgatctgtt ctaccagggc 1080aacgtgatct ccatcaccga cggtcagatc ttccttgaat ccgatctgtt ctaccagggc 1080
atccgcccgg cggtgaacgt gggcctgtcg gtgtcgcgcg tgggctcttc ggcccagatc 1140atccgcccgg cggtgaacgt gggcctgtcg gtgtcgcgcg tgggctcttc ggcccagatc 1140
aaggcgatga agcaggtggc cggcaagatc aagggcgagc tcgcccagta tcgcgagctg 1200aaggcgatga agcaggtggc cggcaagatc aagggcgagc tcgcccagta tcgcgagctg 1200
gcggccttcg cccagttcgg ttcggacctg gacgcggcca cccagaagct gctgaaccgc 1260gcggccttcg cccagttcgg ttcggacctg gacgcggcca cccagaagct gctgaaccgc 1260
ggcgcccgcc tcaccgagct gctgaagcag agccagttct cgcccctcaa ggtggaggag 1320ggcgcccgcc tcaccgagct gctgaagcag agccagttct cgcccctcaa ggtggaggag 1320
caggtggcgg tgatctatgc cggcaccaat ggctatctcg atccgctgcc ggtctccaag 1380caggtggcgg tgatctatgc cggcaccaat ggctatctcg atccgctgcc ggtctccaag 1380
gtgcgcgagt tcgagcaggg tctgctcctg tcgctgcgct cgcagcatcc ggagatcctg 1440gtgcgcgagt tcgagcaggg tctgctcctg tcgctgcgct cgcagcatcc ggagatcctg 1440
gacgccatcc gcacgtccaa ggagctttcc aaggacaccg ccgagaagct gacgaaggcc 1500gacgccatcc gcacgtccaa ggagctttcc aaggacaccg ccgagaagct gacgaaggcc 1500
atcgacgcct tcgccaagag cttctcctga 1530atcgacgcct tcgccaagag cttctcctga 1530
<210> 37<210> 37
<211> 509<211> 509
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 37<400> 37
Met Asp Ile Arg Ala Ala Glu Ile Ser Ala Ile Leu Lys Glu Gln Ile Met Asp Ile Arg Ala Ala Glu Ile Ser Ala Ile Leu Lys Glu Gln Ile
1 5 10 15 1 5 10 15
Gln Asn Phe Gly Gln Glu Ala Glu Val Ser Glu Val Gly Gln Val Leu Gln Asn Phe Gly Gln Glu Ala Glu Val Ser Glu Val Gly Gln Val Leu
20 25 30 20 25 30
Ser Val Gly Asp Gly Ile Ala Arg Val Tyr Gly Leu Asp Asn Val Gln Ser Val Gly Asp Gly Ile Ala Arg Val Tyr Gly Leu Asp Asn Val Gln
35 40 45 35 40 45
Ala Gly Glu Met Val Glu Phe Glu Asn Gly Thr Arg Gly Met Ala Leu Ala Gly Glu Met Val Glu Phe Glu Asn Gly Thr Arg Gly Met Ala Leu
50 55 60 50 55 60
Asn Leu Glu Leu Asp Asn Val Gly Ile Val Ile Phe Gly Ser Asp Arg Asn Leu Glu Leu Asp Asn Val Gly Ile Val Ile Phe Gly Ser Asp Arg
65 70 75 80 65 70 75 80
Glu Ile Lys Glu Gly Gln Thr Val Lys Arg Thr Gly Ala Ile Val Asp Glu Ile Lys Glu Gly Gln Thr Val Lys Arg Thr Gly Ala Ile Val Asp
85 90 95 85 90 95
Ala Pro Val Gly Lys Gly Leu Leu Gly Arg Val Val Asp Ala Leu Gly Ala Pro Val Gly Lys Gly Leu Leu Gly Arg Val Val Asp Ala Leu Gly
100 105 110 100 105 110
Asn Pro Ile Asp Gly Lys Gly Pro Ile Met Phe Thr Glu Arg Arg Arg Asn Pro Ile Asp Gly Lys Gly Pro Ile Met Phe Thr Glu Arg Arg Arg
115 120 125 115 120 125
Val Asp Val Lys Ala Pro Gly Ile Ile Pro Arg Lys Ser Val His Glu Val Asp Val Lys Ala Pro Gly Ile Ile Pro Arg Lys Ser Val His Glu
130 135 140 130 135 140
Pro Met Gln Thr Gly Leu Lys Ala Ile Asp Ala Leu Ile Pro Ile Gly Pro Met Gln Thr Gly Leu Lys Ala Ile Asp Ala Leu Ile Pro Ile Gly
145 150 155 160 145 150 155 160
Arg Gly Gln Arg Glu Leu Ile Ile Gly Asp Arg Gln Thr Gly Lys Thr Arg Gly Gln Arg Glu Leu Ile Ile Gly Asp Arg Gln Thr Gly Lys Thr
165 170 175 165 170 175
Ala Val Ala Leu Asp Ser Ile Leu Asn Gln Lys Pro Ile Asn Gln Gly Ala Val Ala Leu Asp Ser Ile Leu Asn Gln Lys Pro Ile Asn Gln Gly
180 185 190 180 185 190
Asp Asp Glu Lys Ala Lys Leu Tyr Cys Val Tyr Val Ala Val Gly Gln Asp Asp Glu Lys Ala Lys Leu Tyr Cys Val Tyr Val Ala Val Gly Gln
195 200 205 195 200 205
Lys Arg Ser Thr Val Ala Gln Phe Val Lys Val Leu Glu Glu His Gly Lys Arg Ser Thr Val Ala Gln Phe Val Lys Val Leu Glu Glu His Gly
210 215 220 210 215 220
Ala Leu Glu Tyr Ser Ile Val Val Ala Ala Thr Ala Ser Asp Ala Ala Ala Leu Glu Tyr Ser Ile Val Val Ala Ala Thr Ala Ser Asp Ala Ala
225 230 235 240 225 230 235 240
Pro Met Gln Phe Leu Ala Pro Phe Thr Gly Thr Ala Met Gly Glu Tyr Pro Met Gln Phe Leu Ala Pro Phe Thr Gly Thr Ala Met Gly Glu Tyr
245 250 255 245 250 255
Phe Arg Asp Asn Gly Met His Ala Leu Ile Ile His Asp Asp Leu Ser Phe Arg Asp Asn Gly Met His Ala Leu Ile Ile His Asp Asp Leu Ser
260 265 270 260 265 270
Lys Gln Ala Val Ala Tyr Arg Gln Met Ser Leu Leu Leu Arg Arg Pro Lys Gln Ala Val Ala Tyr Arg Gln Met Ser Leu Leu Leu Arg Arg Pro
275 280 285 275 280 285
Pro Gly Arg Glu Ala Tyr Pro Gly Asp Val Phe Tyr Leu His Ser Arg Pro Gly Arg Glu Ala Tyr Pro Gly Asp Val Phe Tyr Leu His Ser Arg
290 295 300 290 295 300
Leu Leu Glu Arg Ala Ala Lys Leu Asn Asp Glu His Gly Ala Gly Ser Leu Leu Glu Arg Ala Ala Lys Leu Asn Asp Glu His Gly Ala Gly Ser
305 310 315 320 305 310 315 320
Leu Thr Ala Leu Pro Val Ile Glu Thr Gln Ala Asn Asp Val Ser Ala Leu Thr Ala Leu Pro Val Ile Glu Thr Gln Ala Asn Asp Val Ser Ala
325 330 335 325 330 335
Tyr Ile Pro Thr Asn Val Ile Ser Ile Thr Asp Gly Gln Ile Phe Leu Tyr Ile Pro Thr Asn Val Ile Ser Ile Thr Asp Gly Gln Ile Phe Leu
340 345 350 340 345 350
Glu Ser Asp Leu Phe Tyr Gln Gly Ile Arg Pro Ala Val Asn Val Gly Glu Ser Asp Leu Phe Tyr Gln Gly Ile Arg Pro Ala Val Asn Val Gly
355 360 365 355 360 365
Leu Ser Val Ser Arg Val Gly Ser Ser Ala Gln Ile Lys Ala Met Lys Leu Ser Val Ser Arg Val Gly Ser Ser Ala Gln Ile Lys Ala Met Lys
370 375 380 370 375 380
Gln Val Ala Gly Lys Ile Lys Gly Glu Leu Ala Gln Tyr Arg Glu Leu Gln Val Ala Gly Lys Ile Lys Gly Glu Leu Ala Gln Tyr Arg Glu Leu
385 390 395 400 385 390 395 400
Ala Ala Phe Ala Gln Phe Gly Ser Asp Leu Asp Ala Ala Thr Gln Lys Ala Ala Phe Ala Gln Phe Gly Ser Asp Leu Asp Ala Ala Thr Gln Lys
405 410 415 405 410 415
Leu Leu Asn Arg Gly Ala Arg Leu Thr Glu Leu Leu Lys Gln Ser Gln Leu Leu Asn Arg Gly Ala Arg Leu Thr Glu Leu Leu Lys Gln Ser Gln
420 425 430 420 425 430
Phe Ser Pro Leu Lys Val Glu Glu Gln Val Ala Val Ile Tyr Ala Gly Phe Ser Pro Leu Lys Val Glu Glu Gln Val Ala Val Ile Tyr Ala Gly
435 440 445 435 440 445
Thr Asn Gly Tyr Leu Asp Pro Leu Pro Val Ser Lys Val Arg Glu Phe Thr Asn Gly Tyr Leu Asp Pro Leu Pro Val Ser Lys Val Arg Glu Phe
450 455 460 450 455 460
Glu Gln Gly Leu Leu Leu Ser Leu Arg Ser Gln His Pro Glu Ile Leu Glu Gln Gly Leu Leu Leu Ser Leu Arg Ser Gln His Pro Glu Ile Leu
465 470 475 480 465 470 475 480
Asp Ala Ile Arg Thr Ser Lys Glu Leu Ser Lys Asp Thr Ala Glu Lys Asp Ala Ile Arg Thr Ser Lys Glu Leu Ser Lys Asp Thr Ala Glu Lys
485 490 495 485 490 495
Leu Thr Lys Ala Ile Asp Ala Phe Ala Lys Ser Phe Ser Leu Thr Lys Ala Ile Asp Ala Phe Ala Lys Ser Phe Ser
500 505 500 505
<210> 38<210> 38
<211> 555<211> 555
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 38<400> 38
gtggcggaaa cgatcgtgtc aggcatggcg ggacgctatg cgaccgcgct gttcgagctg 60gtggcggaaa cgatcgtgtc aggcatggcg ggacgctatg cgaccgcgct gttcgagctg 60
gcggacgaag ccggtgccat cgattccgtc caggcggatc ttgatcgcct gtccggcctt 120gcggacgaag ccggtgccat cgattccgtc caggcggatc ttgatcgcct gtccggcctt 120
ctggccgaga gcgcggatct ggcgcggctg gtcaagagcc cggtcttcac cgccgagcag 180ctggccgaga gcgcggatct ggcgcggctg gtcaagagcc cggtcttcac cgccgagcag 180
cagctcggcg cgatggcggc cattctcgat caagcaggca tttccggcct tgcgggcaaa 240cagctcggcg cgatggcggc cattctcgat caagcaggca tttccggcct tgcgggcaaa 240
ttcgtgaagc tggtggcgca gaaccgccgc ctgttcgcac tgccgcgcat gattgccgaa 300ttcgtgaagc tggtggcgca gaaccgccgc ctgttcgcac tgccgcgcat gattgccgaa 300
tacgccgtcc tggtggcccg gaagaagggc gagacctcgg cgagcgtgac cgttgccacc 360tacgccgtcc tggtggcccg gaagaagggc gagacctcgg cgagcgtgac cgttgccacc 360
cccctgagcg atgagcatct ggccacgctc aaggcggccc tggctgaaaa gaccggcaag 420cccctgagcg atgagcatct ggccacgctc aaggcggccc tggctgaaaa gaccggcaag 420
gacgtgaagc tcgacgtcac cgtcgatccg tccatcctcg gtggtctcat cgtgaagctc 480gacgtgaagc tcgacgtcac cgtcgatccg tccatcctcg gtggtctcat cgtgaagctc 480
ggctcgcgca tggtcgatgc ttccctgaag accaaactca attctatccg gcatgcgatg 540ggctcgcgca tggtcgatgc ttccctgaag accaaactca attctatccg gcatgcgatg 540
aaagaggtcc gctga 555aaagaggtcc gctga 555
<210> 39<210> 39
<211> 184<211> 184
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 39<400> 39
Met Ala Glu Thr Ile Val Ser Gly Met Ala Gly Arg Tyr Ala Thr Ala Met Ala Glu Thr Ile Val Ser Gly Met Ala Gly Arg Tyr Ala Thr Ala
1 5 10 15 1 5 10 15
Leu Phe Glu Leu Ala Asp Glu Ala Gly Ala Ile Asp Ser Val Gln Ala Leu Phe Glu Leu Ala Asp Glu Ala Gly Ala Ile Asp Ser Val Gln Ala
20 25 30 20 25 30
Asp Leu Asp Arg Leu Ser Gly Leu Leu Ala Glu Ser Ala Asp Leu Ala Asp Leu Asp Arg Leu Ser Gly Leu Leu Ala Glu Ser Ala Asp Leu Ala
35 40 45 35 40 45
Arg Leu Val Lys Ser Pro Val Phe Thr Ala Glu Gln Gln Leu Gly Ala Arg Leu Val Lys Ser Pro Val Phe Thr Ala Glu Gln Gln Leu Gly Ala
50 55 60 50 55 60
Met Ala Ala Ile Leu Asp Gln Ala Gly Ile Ser Gly Leu Ala Gly Lys Met Ala Ala Ile Leu Asp Gln Ala Gly Ile Ser Gly Leu Ala Gly Lys
65 70 75 80 65 70 75 80
Phe Val Lys Leu Val Ala Gln Asn Arg Arg Leu Phe Ala Leu Pro Arg Phe Val Lys Leu Val Ala Gln Asn Arg Arg Leu Phe Ala Leu Pro Arg
85 90 95 85 90 95
Met Ile Ala Glu Tyr Ala Val Leu Val Ala Arg Lys Lys Gly Glu Thr Met Ile Ala Glu Tyr Ala Val Leu Val Ala Arg Lys Lys Gly Glu Thr
100 105 110 100 105 110
Ser Ala Ser Val Thr Val Ala Thr Pro Leu Ser Asp Glu His Leu Ala Ser Ala Ser Val Thr Val Ala Thr Pro Leu Ser Asp Glu His Leu Ala
115 120 125 115 120 125
Thr Leu Lys Ala Ala Leu Ala Glu Lys Thr Gly Lys Asp Val Lys Leu Thr Leu Lys Ala Ala Leu Ala Glu Lys Thr Gly Lys Asp Val Lys Leu
130 135 140 130 135 140
Asp Val Thr Val Asp Pro Ser Ile Leu Gly Gly Leu Ile Val Lys Leu Asp Val Thr Val Asp Pro Ser Ile Leu Gly Gly Leu Ile Val Lys Leu
145 150 155 160 145 150 155 160
Gly Ser Arg Met Val Asp Ala Ser Leu Lys Thr Lys Leu Asn Ser Ile Gly Ser Arg Met Val Asp Ala Ser Leu Lys Thr Lys Leu Asn Ser Ile
165 170 175 165 170 175
Arg His Ala Met Lys Glu Val Arg Arg His Ala Met Lys Glu Val Arg
180 180
<210> 40<210> 40
<211> 492<211> 492
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 40<400> 40
atgaccgaaa tggaactggc tgagctctgg gtcgccatcg ccttcctggt tttcgtaggc 60atgaccgaaa tggaactggc tgagctctgg gtcgccatcg ccttcctggt tttcgtaggc 60
ctcctgatct atgcgggcgc ccaccgcgcc atcgtctccg ccctggattc ccgcggctcg 120ctcctgatct atgcgggcgc ccaccgcgcc atcgtctccg ccctggattc ccgcggctcg 120
cgcatcgcct cggaactgga ggaggcccgt cggctcaagg aagaggccca gaagctggtg 180cgcatcgcct cggaactgga ggaggcccgt cggctcaagg aagaggccca gaagctggtg 180
gccgaattca agcgcaagca gcgcgaggcc gaggccgagg ccgaatccat cgtcaccggc 240gccgaattca agcgcaagca gcgcgaggcc gaggccgagg ccgaatccat cgtcaccggc 240
gccaaggccg aggccgagcg cctcgccgcc gaggccaagg cgaagatcga ggatttcgtc 300gccaaggccg aggccgagcg cctcgccgcc gaggccaagg cgaagatcga ggatttcgtc 300
acccgccgca ccaagatggc cgaggacaag atcgcccagg ccgagcatca ggctctggcg 360acccgccgca ccaagatggc cgaggacaag atcgcccagg ccgagcatca ggctctggcg 360
gacgtgaagt ccatcgccgc cgaggcggcg gccaaggcgg ccgaggtgat cctcggcgcc 420gacgtgaagt ccatcgccgc cgaggcggcg gccaaggcgg ccgaggtgat cctcggcgcc 420
caggccaccg gcgcggtggc ggagcgtctg ctgtcgggcg ccatctccga ggtcaagacc 480caggccaccg gcgcggtggc ggagcgtctg ctgtcgggcg ccatctccga ggtcaagacc 480
aagctcaact ga 492aagctcaact ga 492
<210> 41<210> 41
<211> 163<211> 163
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 41<400> 41
Met Thr Glu Met Glu Leu Ala Glu Leu Trp Val Ala Ile Ala Phe Leu Met Thr Glu Met Glu Leu Ala Glu Leu Trp Val Ala Ile Ala Phe Leu
1 5 10 15 1 5 10 15
Val Phe Val Gly Leu Leu Ile Tyr Ala Gly Ala His Arg Ala Ile Val Val Phe Val Gly Leu Leu Ile Tyr Ala Gly Ala His Arg Ala Ile Val
20 25 30 20 25 30
Ser Ala Leu Asp Ser Arg Gly Ser Arg Ile Ala Ser Glu Leu Glu Glu Ser Ala Leu Asp Ser Arg Gly Ser Arg Ile Ala Ser Glu Leu Glu Glu
35 40 45 35 40 45
Ala Arg Arg Leu Lys Glu Glu Ala Gln Lys Leu Val Ala Glu Phe Lys Ala Arg Arg Leu Lys Glu Glu Ala Gln Lys Leu Val Ala Glu Phe Lys
50 55 60 50 55 60
Arg Lys Gln Arg Glu Ala Glu Ala Glu Ala Glu Ser Ile Val Thr Gly Arg Lys Gln Arg Glu Ala Glu Ala Glu Ala Glu Ser Ile Val Thr Gly
65 70 75 80 65 70 75 80
Ala Lys Ala Glu Ala Glu Arg Leu Ala Ala Glu Ala Lys Ala Lys Ile Ala Lys Ala Glu Ala Glu Arg Leu Ala Ala Glu Ala Lys Ala Lys Ile
85 90 95 85 90 95
Glu Asp Phe Val Thr Arg Arg Thr Lys Met Ala Glu Asp Lys Ile Ala Glu Asp Phe Val Thr Arg Arg Thr Lys Met Ala Glu Asp Lys Ile Ala
100 105 110 100 105 110
Gln Ala Glu His Gln Ala Leu Ala Asp Val Lys Ser Ile Ala Ala Glu Gln Ala Glu His Gln Ala Leu Ala Asp Val Lys Ser Ile Ala Ala Glu
115 120 125 115 120 125
Ala Ala Ala Lys Ala Ala Glu Val Ile Leu Gly Ala Gln Ala Thr Gly Ala Ala Ala Lys Ala Ala Glu Val Ile Leu Gly Ala Gln Ala Thr Gly
130 135 140 130 135 140
Ala Val Ala Glu Arg Leu Leu Ser Gly Ala Ile Ser Glu Val Lys Thr Ala Val Ala Glu Arg Leu Leu Ser Gly Ala Ile Ser Glu Val Lys Thr
145 150 155 160 145 150 155 160
Lys Leu Asn Lys Leu Asn
<210> 42<210> 42
<211> 729<211> 729
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 42<400> 42
atgatgattg catggaagcg gaccttcgca gtcgtgacct tcggggccgc cctgatggcc 60atgatgattg catggaagcg gaccttcgca gtcgtgacct tcggggccgc cctgatggcc 60
atgcccgtcg cgggcgtggt cgcagctgag acttctcccg ctccggcggc agtggcgcag 120atgcccgtcg cgggcgtggt cgcagctgag acttctcccg ctccggcggc agtggcgcag 120
gccgatcatg cggtgcccac cgaggcggcc ggccagggca ccgccgatgc ggcccatgcc 180gccgatcatg cggtgcccac cgaggcggcc ggccagggca ccgccgatgc ggcccatgcc 180
gccgcgccgg gcgaggccgc ccatggtggc gcggccaagc acgaaaccca tttcccgccc 240gccgcgccgg gcgaggccgc ccatggtggc gcggccaagc acgaaaccca tttcccgccc 240
ttcgacggca ccaccttcgc ctcccagttg ctgtggctcg ccgtcacctt cggcctgctt 300ttcgacggca ccaccttcgc ctcccagttg ctgtggctcg ccgtcacctt cggcctgctt 300
tactacctca tgagcaaggt cacgctgccg cgcatcggcc gcatcctgga agagcgccac 360tactacctca tgagcaaggt cacgctgccg cgcatcggcc gcatcctgga agagcgccac 360
gaccgcatcg ccgatgatct ggaggaagcc tccaagcatc gcgccgagag cgaggccgcc 420gaccgcatcg ccgatgatct ggaggaagcc tccaagcatc gcgccgagag cgaggccgcc 420
cagcgggcct atgagaaggc gctgagcgag gcccgcgcga aggcccattc catcgccgcg 480cagcgggcct atgagaaggc gctgagcgag gcccgcgcga aggcccattc catcgccgcg 480
gaaacccgcg accgccttgc cgcccacgcc gacaccaacc gcaaggcgct ggagagcgag 540gaaacccgcg accgccttgc cgcccacgcc gacaccaacc gcaaggcgct ggagagcgag 540
ctcaccgcca agctgcaggc ggccgaggag cgcatcgcca ccaccaagag cgaagccctc 600ctcaccgcca agctgcaggc ggccgaggag cgcatcgcca ccaccaagag cgaagccctc 600
acccatgtgc gcggcatcgc ggtggacgcc acccaatcca tcgtctccac cctcatcggt 660acccatgtgc gcggcatcgc ggtggacgcc acccaatcca tcgtctccac cctcatcggt 660
gtcgcgcccg cggcggccga cgtggaaaaa gcggtggacg gcgccctgtc ccagcacggc 720gtcgcgcccg cggcggccga cgtggaaaaa gcggtggacg gcgccctgtc ccagcacggc 720
caggcctga 729caggcctga 729
<210> 43<210> 43
<211> 242<211> 242
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 43<400> 43
Met Met Ile Ala Trp Lys Arg Thr Phe Ala Val Val Thr Phe Gly Ala Met Met Ile Ala Trp Lys Arg Thr Phe Ala Val Val Thr Phe Gly Ala
1 5 10 15 1 5 10 15
Ala Leu Met Ala Met Pro Val Ala Gly Val Val Ala Ala Glu Thr Ser Ala Leu Met Ala Met Pro Val Ala Gly Val Val Ala Ala Glu Thr Ser
20 25 30 20 25 30
Pro Ala Pro Ala Ala Val Ala Gln Ala Asp His Ala Val Pro Thr Glu Pro Ala Pro Ala Ala Val Ala Gln Ala Asp His Ala Val Pro Thr Glu
35 40 45 35 40 45
Ala Ala Gly Gln Gly Thr Ala Asp Ala Ala His Ala Ala Ala Pro Gly Ala Ala Gly Gln Gly Thr Ala Asp Ala Ala His Ala Ala Ala Pro Gly
50 55 60 50 55 60
Glu Ala Ala His Gly Gly Ala Ala Lys His Glu Thr His Phe Pro Pro Glu Ala Ala His Gly Gly Ala Ala Lys His Glu Thr His Phe Pro Pro
65 70 75 80 65 70 75 80
Phe Asp Gly Thr Thr Phe Ala Ser Gln Leu Leu Trp Leu Ala Val Thr Phe Asp Gly Thr Thr Phe Ala Ser Gln Leu Leu Trp Leu Ala Val Thr
85 90 95 85 90 95
Phe Gly Leu Leu Tyr Tyr Leu Met Ser Lys Val Thr Leu Pro Arg Ile Phe Gly Leu Leu Tyr Tyr Leu Met Ser Lys Val Thr Leu Pro Arg Ile
100 105 110 100 105 110
Gly Arg Ile Leu Glu Glu Arg His Asp Arg Ile Ala Asp Asp Leu Glu Gly Arg Ile Leu Glu Glu Arg His Asp Arg Ile Ala Asp Asp Leu Glu
115 120 125 115 120 125
Glu Ala Ser Lys His Arg Ala Glu Ser Glu Ala Ala Gln Arg Ala Tyr Glu Ala Ser Lys His Arg Ala Glu Ser Glu Ala Ala Gln Arg Ala Tyr
130 135 140 130 135 140
Glu Lys Ala Leu Ser Glu Ala Arg Ala Lys Ala His Ser Ile Ala Ala Glu Lys Ala Leu Ser Glu Ala Arg Ala Lys Ala His Ser Ile Ala Ala
145 150 155 160 145 150 155 160
Glu Thr Arg Asp Arg Leu Ala Ala His Ala Asp Thr Asn Arg Lys Ala Glu Thr Arg Asp Arg Leu Ala Ala His Ala Asp Thr Asn Arg Lys Ala
165 170 175 165 170 175
Leu Glu Ser Glu Leu Thr Ala Lys Leu Gln Ala Ala Glu Glu Arg Ile Leu Glu Ser Glu Leu Thr Ala Lys Leu Gln Ala Ala Glu Glu Arg Ile
180 185 190 180 185 190
Ala Thr Thr Lys Ser Glu Ala Leu Thr His Val Arg Gly Ile Ala Val Ala Thr Thr Lys Ser Glu Ala Leu Thr His Val Arg Gly Ile Ala Val
195 200 205 195 200 205
Asp Ala Thr Gln Ser Ile Val Ser Thr Leu Ile Gly Val Ala Pro Ala Asp Ala Thr Gln Ser Ile Val Ser Thr Leu Ile Gly Val Ala Pro Ala
210 215 220 210 215 220
Ala Ala Asp Val Glu Lys Ala Val Asp Gly Ala Leu Ser Gln His Gly Ala Ala Asp Val Glu Lys Ala Val Asp Gly Ala Leu Ser Gln His Gly
225 230 235 240 225 230 235 240
Gln Ala Gln Ala
<210> 44<210> 44
<211> 228<211> 228
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 44<400> 44
atggaagcgg aagctggaaa gttcatcggt gccggcctcg cctgcctcgg catgggtctc 60atggaagcgg aagctggaaa gttcatcggt gccggcctcg cctgcctcgg catgggtctc 60
gctggcgtcg gcgtcggtaa catcttcggt aacttcctct ccggcgccct gcgcaacccg 120gctggcgtcg gcgtcggtaa catcttcggt aacttcctct ccggcgccct gcgcaacccg 120
tccgctgccg acggccagtt cgcccgcgcc ttcatcggcg ccgccctcgc ggaaggtctc 180tccgctgccg acggccagtt cgcccgcgcc ttcatcggcg ccgccctcgc ggaaggtctc 180
ggcatcttct cgctggtcgt tgcgctcgtc ctgctgttcg tggcctga 228ggcatcttct cgctggtcgt tgcgctcgtc ctgctgttcg tggcctga 228
<210> 45<210> 45
<211> 75<211> 75
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 45<400> 45
Met Glu Ala Glu Ala Gly Lys Phe Ile Gly Ala Gly Leu Ala Cys Leu Met Glu Ala Glu Ala Gly Lys Phe Ile Gly Ala Gly Leu Ala Cys Leu
1 5 10 15 1 5 10 15
Gly Met Gly Leu Ala Gly Val Gly Val Gly Asn Ile Phe Gly Asn Phe Gly Met Gly Leu Ala Gly Val Gly Val Gly Asn Ile Phe Gly Asn Phe
20 25 30 20 25 30
Leu Ser Gly Ala Leu Arg Asn Pro Ser Ala Ala Asp Gly Gln Phe Ala Leu Ser Gly Ala Leu Arg Asn Pro Ser Ala Ala Asp Gly Gln Phe Ala
35 40 45 35 40 45
Arg Ala Phe Ile Gly Ala Ala Leu Ala Glu Gly Leu Gly Ile Phe Ser Arg Ala Phe Ile Gly Ala Ala Leu Ala Glu Gly Leu Gly Ile Phe Ser
50 55 60 50 55 60
Leu Val Val Ala Leu Val Leu Leu Phe Val Ala Leu Val Val Ala Leu Val Leu Leu Phe Val Ala
65 70 75 65 70 75
<210> 46<210> 46
<211> 753<211> 753
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 46<400> 46
atgaccgtcg atccgatcca ccagttcgag atcaagcgct acgtggatct gctgaacgtc 60atgaccgtcg atccgatcca ccagttcgag atcaagcgct acgtggatct gctgaacgtc 60
ggcggtgtcc agttctcctt caccaacgca acggtgttca tgattggcat cgtcctggtg 120ggcggtgtcc agttctcctt caccaacgca acggtgttca tgattggcat cgtcctggtg 120
attttcttct tcctgacttt cgcgacacgc ggtcgcaccc ttgtgccggg ccggatgcag 180attttcttct tcctgacttt cgcgacacgc ggtcgcaccc ttgtgccggg ccggatgcag 180
tcggcggcgg agctgagcta cgagttcatc gccaagatgg tgcgcgacgc ggccggcagc 240tcggcggcgg agctgagcta cgagttcatc gccaagatgg tgcgcgacgc ggccggcagc 240
gagggaatgg tgttctttcc cttcgtcttc tcgctcttca tgttcgtgct ggtggcgaac 300gagggaatgg tgttctttcc cttcgtcttc tcgctcttca tgttcgtgct ggtggcgaac 300
gtattggggc tcatccccta caccttcacg gtgaccgccc acctcatcgt caccgccgcc 360gtattggggc tcatccccta caccttcacg gtgaccgccc acctcatcgt caccgccgcc 360
ctggcggcga cggtgatcct caccgtcatc atctacggct tcgtgcggca cggcacccac 420ctggcggcga cggtgatcct caccgtcatc atctacggct tcgtgcggca cggcacccac 420
ttcctgcacc tgttcgtgcc gtcgggcgtg ccgggcttcc tcctgccctt cctcgtggtg 480ttcctgcacc tgttcgtgcc gtcgggcgtg ccgggcttcc tcctgccctt cctcgtggtg 480
atcgaggtgg tgtcgttcct gtcgcggccc atcagcctct cgctgcgtct gttcgccaac 540atcgaggtgg tgtcgttcct gtcgcggccc atcagcctct cgctgcgtct gttcgccaac 540
atgctggcgg gccacatcgc cctcaaggtg ttcgccttct tcgtcgtggg actggcctcg 600atgctggcgg gccacatcgc cctcaaggtg ttcgccttct tcgtcgtggg actggcctcg 600
gccggcgcga tcggctggtt cggcgccacc ctgcccttct tcatgatcgt ggcgctcacc 660gccggcgcga tcggctggtt cggcgccacc ctgcccttct tcatgatcgt ggcgctcacc 660
gcgctggagc tgctggtggc ggtgctgcag gcctacgtgt tcgcggtgct gacctcgatc 720gcgctggagc tgctggtggc ggtgctgcag gcctacgtgt tcgcggtgct gacctcgatc 720
tacctcaacg acgccatcca tcccggccac tga 753tacctcaacg acgccatcca tcccggccac tga 753
<210> 47<210> 47
<211> 250<211> 250
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 47<400> 47
Met Thr Val Asp Pro Ile His Gln Phe Glu Ile Lys Arg Tyr Val Asp Met Thr Val Asp Pro Ile His Gln Phe Glu Ile Lys Arg Tyr Val Asp
1 5 10 15 1 5 10 15
Leu Leu Asn Val Gly Gly Val Gln Phe Ser Phe Thr Asn Ala Thr Val Leu Leu Asn Val Gly Gly Val Gln Phe Ser Phe Thr Asn Ala Thr Val
20 25 30 20 25 30
Phe Met Ile Gly Ile Val Leu Val Ile Phe Phe Phe Leu Thr Phe Ala Phe Met Ile Gly Ile Val Leu Val Ile Phe Phe Phe Leu Thr Phe Ala
35 40 45 35 40 45
Thr Arg Gly Arg Thr Leu Val Pro Gly Arg Met Gln Ser Ala Ala Glu Thr Arg Gly Arg Thr Leu Val Pro Gly Arg Met Gln Ser Ala Ala Glu
50 55 60 50 55 60
Leu Ser Tyr Glu Phe Ile Ala Lys Met Val Arg Asp Ala Ala Gly Ser Leu Ser Tyr Glu Phe Ile Ala Lys Met Val Arg Asp Ala Ala Gly Ser
65 70 75 80 65 70 75 80
Glu Gly Met Val Phe Phe Pro Phe Val Phe Ser Leu Phe Met Phe Val Glu Gly Met Val Phe Phe Pro Phe Val Phe Ser Leu Phe Met Phe Val
85 90 95 85 90 95
Leu Val Ala Asn Val Leu Gly Leu Ile Pro Tyr Thr Phe Thr Val Thr Leu Val Ala Asn Val Leu Gly Leu Ile Pro Tyr Thr Phe Thr Val Thr
100 105 110 100 105 110
Ala His Leu Ile Val Thr Ala Ala Leu Ala Ala Thr Val Ile Leu Thr Ala His Leu Ile Val Thr Ala Ala Leu Ala Ala Thr Val Ile Leu Thr
115 120 125 115 120 125
Val Ile Ile Tyr Gly Phe Val Arg His Gly Thr His Phe Leu His Leu Val Ile Ile Tyr Gly Phe Val Arg His Gly Thr His Phe Leu His Leu
130 135 140 130 135 140
Phe Val Pro Ser Gly Val Pro Gly Phe Leu Leu Pro Phe Leu Val Val Phe Val Pro Ser Gly Val Pro Gly Phe Leu Leu Pro Phe Leu Val Val
145 150 155 160 145 150 155 160
Ile Glu Val Val Ser Phe Leu Ser Arg Pro Ile Ser Leu Ser Leu Arg Ile Glu Val Val Ser Phe Leu Ser Arg Pro Ile Ser Leu Ser Leu Arg
165 170 175 165 170 175
Leu Phe Ala Asn Met Leu Ala Gly His Ile Ala Leu Lys Val Phe Ala Leu Phe Ala Asn Met Leu Ala Gly His Ile Ala Leu Lys Val Phe Ala
180 185 190 180 185 190
Phe Phe Val Val Gly Leu Ala Ser Ala Gly Ala Ile Gly Trp Phe Gly Phe Phe Val Val Gly Leu Ala Ser Ala Gly Ala Ile Gly Trp Phe Gly
195 200 205 195 200 205
Ala Thr Leu Pro Phe Phe Met Ile Val Ala Leu Thr Ala Leu Glu Leu Ala Thr Leu Pro Phe Phe Met Ile Val Ala Leu Thr Ala Leu Glu Leu
210 215 220 210 215 220
Leu Val Ala Val Leu Gln Ala Tyr Val Phe Ala Val Leu Thr Ser Ile Leu Val Ala Val Leu Gln Ala Tyr Val Phe Ala Val Leu Thr Ser Ile
225 230 235 240 225 230 235 240
Tyr Leu Asn Asp Ala Ile His Pro Gly His Tyr Leu Asn Asp Ala Ile His Pro Gly His
245 250 245 250
<210> 48<210> 48
<211> 390<211> 390
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 48<400> 48
atgtccgagc cgaatgatcc atcccgcagg gacggtgcga aggcgaaaga cgagacgcag 60atgtccgagc cgaatgatcc atcccgcagg gacggtgcga aggcgaaaga cgagacgcag 60
gactcccggc ccggtgaggc ggatcttgct cggcgcctcg atgcgctcgg cacctccatc 120gactcccggc ccggtgaggc ggatcttgct cggcgcctcg atgcgctcgg cacctccatc 120
ggtcaggtca agtccagaag cggggagccc gcggcgacgc cgcgcaagga cacctcctcg 180ggtcaggtca agtccagaag cggggagccc gcggcgacgc cgcgcaagga cacctcctcg 180
gcctccggcg cggccctggc gtttcggctg ggcgccgagt ttgtttcagg cgtgctggtg 240gcctccggcg cggccctggc gtttcggctg ggcgccgagt ttgtttcagg cgtgctggtg 240
ggctcgctca tcggctacgg gttggattat gcgtttgcga tttcgccctg ggggctgatc 300ggctcgctca tcggctacgg gttggattat gcgtttgcga tttcgccctg ggggctgatc 300
gccttcacgc tgatcggctt tgccgccggc gtcctgaaca tgctgcgcgt ggcgaacagc 360gccttcacgc tgatcggctt tgccgccggc gtcctgaaca tgctgcgcgt ggcgaacagc 360
gatgccaagc gccacagcgc ggacaggtga 390gatgccaagc gccacagcgc ggacaggtga 390
<210> 49<210> 49
<211> 129<211> 129
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 49<400> 49
Met Ser Glu Pro Asn Asp Pro Ser Arg Arg Asp Gly Ala Lys Ala Lys Met Ser Glu Pro Asn Asp Pro Ser Arg Arg Asp Gly Ala Lys Ala Lys
1 5 10 15 1 5 10 15
Asp Glu Thr Gln Asp Ser Arg Pro Gly Glu Ala Asp Leu Ala Arg Arg Asp Glu Thr Gln Asp Ser Arg Pro Gly Glu Ala Asp Leu Ala Arg Arg
20 25 30 20 25 30
Leu Asp Ala Leu Gly Thr Ser Ile Gly Gln Val Lys Ser Arg Ser Gly Leu Asp Ala Leu Gly Thr Ser Ile Gly Gln Val Lys Ser Arg Ser Gly
35 40 45 35 40 45
Glu Pro Ala Ala Thr Pro Arg Lys Asp Thr Ser Ser Ala Ser Gly Ala Glu Pro Ala Ala Thr Pro Arg Lys Asp Thr Ser Ser Ala Ser Gly Ala
50 55 60 50 55 60
Ala Leu Ala Phe Arg Leu Gly Ala Glu Phe Val Ser Gly Val Leu Val Ala Leu Ala Phe Arg Leu Gly Ala Glu Phe Val Ser Gly Val Leu Val
65 70 75 80 65 70 75 80
Gly Ser Leu Ile Gly Tyr Gly Leu Asp Tyr Ala Phe Ala Ile Ser Pro Gly Ser Leu Ile Gly Tyr Gly Leu Asp Tyr Ala Phe Ala Ile Ser Pro
85 90 95 85 90 95
Trp Gly Leu Ile Ala Phe Thr Leu Ile Gly Phe Ala Ala Gly Val Leu Trp Gly Leu Ile Ala Phe Thr Leu Ile Gly Phe Ala Ala Gly Val Leu
100 105 110 100 105 110
Asn Met Leu Arg Val Ala Asn Ser Asp Ala Lys Arg His Ser Ala Asp Asn Met Leu Arg Val Ala Asn Ser Asp Ala Lys Arg His Ser Ala Asp
115 120 125 115 120 125
Arg Arg
<210> 50<210> 50
<211> 1929<211> 1929
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 50<400> 50
atgagttcgc tctccgccac tattcaacag gtcttcaacg agccgggctg cgcgaagaac 60atgagttcgc tctccgccac tattcaacag gtcttcaacg agccgggctg cgcgaagaac 60
cagaataagt ccgaggcgga gaagaagaag ggctgcacca agcagctgca acccggcgga 120cagaataagt ccgaggcgga gaagaagaag ggctgcacca agcagctgca acccggcgga 120
gcggccggcg gctgcgcgtt cgacggcgcg aagatcgcgc tccagccctt gaccgacgtc 180gcggccggcg gctgcgcgtt cgacggcgcg aagatcgcgc tccagccctt gaccgacgtc 180
gcccacctgg tgcacggccc catcgcctgc gaaggcaatt cctgggacaa tcgtggcgcc 240gcccacctgg tgcacggccc catcgcctgc gaaggcaatt cctgggacaa tcgtggcgcc 240
aagtcctccg gctcgaacat ctggcgcacc ggcttcacca cggacatcaa cgaaaccgac 300aagtcctccg gctcgaacat ctggcgcacc ggcttcacca cggacatcaa cgaaaccgac 300
gtggtgttcg gcggcgagaa gcgtctgttc aagtccatca aggaaatcat cgagaagtac 360gtggtgttcg gcggcgagaa gcgtctgttc aagtccatca aggaaatcat cgagaagtac 360
gacccgccgg ccgtcttcgt ctatcagacc tgcgtccccg ccatgatcgg cgacgacatc 420gacccgccgg ccgtcttcgt ctatcagacc tgcgtccccg ccatgatcgg cgacgacatc 420
gacgcggtgt gcaaggcggc cagggagaag ttcggaaagc cggtgatccc gatcaattcc 480gacgcggtgt gcaaggcggc cagggagaag ttcggaaagc cggtgatccc gatcaattcc 480
cccggcttcg tggggccgaa gaatctcggc aacaagctcg ccggcgaggc gctcctcgac 540cccggcttcg tggggccgaa gaatctcggc aacaagctcg ccggcgaggc gctcctcgac 540
catgtgatcg gcaccgagga gcccgattac acgacggcct acgacatcaa catcatcggc 600catgtgatcg gcaccgagga gcccgattac acgacggcct acgacatcaa catcatcggc 600
gaatacaatc tctccggcga gttgtggcag gtgaagccgc tgctggacga gctgggcatc 660gaatacaatc tctccggcga gttgtggcag gtgaagccgc tgctggacga gctgggcatc 660
cgcatcctcg cctgcatctc cggcgacggg aagtacaagg atgtggcgtc ctcccaccgc 720cgcatcctcg cctgcatctc cggcgacggg aagtacaagg atgtggcgtc ctcccaccgc 720
gccaaggcgg cgatgatggt gtgctccaag gccatgatca acgtggcccg caagatggag 780gccaaggcgg cgatgatggt gtgctccaag gccatgatca acgtggcccg caagatggag 780
gagcgctacg acatcccctt cttcgaaggc tccttctacg gcatcgagga tagctccgat 840gagcgctacg acatcccctt cttcgaaggc tccttctacg gcatcgagga tagctccgat 840
tccctgcgcg agattgcgcg catgctcatc gagaagggcg ccgatccgga gctgatggac 900tccctgcgcg agattgcgcg catgctcatc gagaagggcg ccgatccgga gctgatggac 900
cgcaccgagg cgctgattga gcgggaagag aagaaggcgt gggacgccat cgccgcctac 960cgcaccgagg cgctgattga gcgggaagag aagaaggcgt gggacgccat cgccgcctac 960
aagccccgct tcaaggacaa gaaggtgctg ctcatcaccg gcggcgtgaa atcctggtcg 1020aagccccgct tcaaggacaa gaaggtgctg ctcatcaccg gcggcgtgaa atcctggtcg 1020
gtggtggcag cgctccagga agccggcctc gaactggtgg gcacctcggt gaagaagtcc 1080gtggtggcag cgctccagga agccggcctc gaactggtgg gcacctcggt gaagaagtcc 1080
accaaggagg acaaggagcg catcaaggaa ctgatgggcc aggacgccca catgatcgac 1140accaaggagg acaaggagcg catcaaggaa ctgatgggcc aggacgccca catgatcgac 1140
gacatgacgc cccgcgaaat gtacaagatg ctgaaggacg ccaaggcgga catcatgctc 1200gacatgacgc cccgcgaaat gtacaagatg ctgaaggacg ccaaggcgga catcatgctc 1200
tcgggcgggc gctcgcaatt catcgcgctc aaggccgcca tgccctggct cgacatcaac 1260tcgggcgggc gctcgcaatt catcgcgctc aaggccgcca tgccctggct cgacatcaac 1260
caggagcgcc accacgccta tatgggctat gtgggcatgg tgaagctggt cgaggagatc 1320caggagcgcc accacgccta tatgggctat gtgggcatgg tgaagctggt cgaggagatc 1320
gacaaggcgc tctacaatcc cgtgtgggaa caggtgcgca agcccgcccc gtgggaaaat 1380gacaaggcgc tctacaatcc cgtgtgggaa caggtgcgca agcccgcccc gtgggaaaat 1380
ccggaagaca cctggcaggc ccgtgcgctc gccgaaatgg aggcggaggc cgccgcgctc 1440ccggaagaca cctggcaggc ccgtgcgctc gccgaaatgg aggcggaggc cgccgcgctc 1440
gccgccgatc cggtgcgcgc ggaagaggtg cgccggtcca agaagatctg caattgcaag 1500gccgccgatc cggtgcgcgc ggaagaggtg cgccggtcca agaagatctg caattgcaag 1500
agcgtcgacc tcggaaccat tgaggacgcc atcaaggctc acgcgctgac caccgtggag 1560agcgtcgacc tcggaaccat tgaggacgcc atcaaggctc acgcgctgac caccgtggag 1560
ggtgtgcgag agcacaccaa tgcctcggga ggctgcggag cctgcagcgg gcggatcgag 1620ggtgtgcgag agcacaccaa tgcctcggga ggctgcggag cctgcagcgg gcggatcgag 1620
gagatcttcg aggccgtggg cgttgtcgcc gccccgcctc ccgcggaggc cgccccgtct 1680gagatcttcg aggccgtggg cgttgtcgcc gccccgcctc ccgcggaggc cgccccgtct 1680
ccgcaggaga tcgcgcccga tccgctcgct gcggaggaaa agcgccgcgc caagaaggcc 1740ccgcaggaga tcgcgcccga tccgctcgct gcggaggaaa agcgccgcgc caagaaggcc 1740
tgcggctgca aggaggtagc ggtcggcacc attgaggatg ccatccgcgc caagggtctg 1800tgcggctgca aggaggtagc ggtcggcacc attgaggatg ccatccgcgc caagggtctg 1800
cgaaacatcg cggaggtgcg tgcggccacc gatgccaaca ccggctgcgg caattgccag 1860cgaaacatcg cggaggtgcg tgcggccacc gatgccaaca ccggctgcgg caattgccag 1860
gagcgggtgg agggcatcct cgaccgggtt ctcgccgagg cggcctcaga actccaggcg 1920gagcgggtgg agggcatcct cgaccgggtt ctcgccgagg cggcctcaga actccaggcg 1920
gcggaatag 1929gcggaatag 1929
<210> 51<210> 51
<211> 642<211> 642
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 51<400> 51
Met Ser Ser Leu Ser Ala Thr Ile Gln Gln Val Phe Asn Glu Pro Gly Met Ser Ser Leu Ser Ala Thr Ile Gln Gln Val Phe Asn Glu Pro Gly
1 5 10 15 1 5 10 15
Cys Ala Lys Asn Gln Asn Lys Ser Glu Ala Glu Lys Lys Lys Gly Cys Cys Ala Lys Asn Gln Asn Lys Ser Glu Ala Glu Lys Lys Lys Gly Cys
20 25 30 20 25 30
Thr Lys Gln Leu Gln Pro Gly Gly Ala Ala Gly Gly Cys Ala Phe Asp Thr Lys Gln Leu Gln Pro Gly Gly Ala Ala Gly Gly Cys Ala Phe Asp
35 40 45 35 40 45
Gly Ala Lys Ile Ala Leu Gln Pro Leu Thr Asp Val Ala His Leu Val Gly Ala Lys Ile Ala Leu Gln Pro Leu Thr Asp Val Ala His Leu Val
50 55 60 50 55 60
His Gly Pro Ile Ala Cys Glu Gly Asn Ser Trp Asp Asn Arg Gly Ala His Gly Pro Ile Ala Cys Glu Gly Asn Ser Trp Asp Asn Arg Gly Ala
65 70 75 80 65 70 75 80
Lys Ser Ser Gly Ser Asn Ile Trp Arg Thr Gly Phe Thr Thr Asp Ile Lys Ser Ser Gly Ser Asn Ile Trp Arg Thr Gly Phe Thr Thr Asp Ile
85 90 95 85 90 95
Asn Glu Thr Asp Val Val Phe Gly Gly Glu Lys Arg Leu Phe Lys Ser Asn Glu Thr Asp Val Val Phe Gly Gly Glu Lys Arg Leu Phe Lys Ser
100 105 110 100 105 110
Ile Lys Glu Ile Ile Glu Lys Tyr Asp Pro Pro Ala Val Phe Val Tyr Ile Lys Glu Ile Ile Glu Lys Tyr Asp Pro Pro Ala Val Phe Val Tyr
115 120 125 115 120 125
Gln Thr Cys Val Pro Ala Met Ile Gly Asp Asp Ile Asp Ala Val Cys Gln Thr Cys Val Pro Ala Met Ile Gly Asp Asp Ile Asp Ala Val Cys
130 135 140 130 135 140
Lys Ala Ala Arg Glu Lys Phe Gly Lys Pro Val Ile Pro Ile Asn Ser Lys Ala Ala Arg Glu Lys Phe Gly Lys Pro Val Ile Pro Ile Asn Ser
145 150 155 160 145 150 155 160
Pro Gly Phe Val Gly Pro Lys Asn Leu Gly Asn Lys Leu Ala Gly Glu Pro Gly Phe Val Gly Pro Lys Asn Leu Gly Asn Lys Leu Ala Gly Glu
165 170 175 165 170 175
Ala Leu Leu Asp His Val Ile Gly Thr Glu Glu Pro Asp Tyr Thr Thr Ala Leu Leu Asp His Val Ile Gly Thr Glu Glu Pro Asp Tyr Thr Thr
180 185 190 180 185 190
Ala Tyr Asp Ile Asn Ile Ile Gly Glu Tyr Asn Leu Ser Gly Glu Leu Ala Tyr Asp Ile Asn Ile Ile Gly Glu Tyr Asn Leu Ser Gly Glu Leu
195 200 205 195 200 205
Trp Gln Val Lys Pro Leu Leu Asp Glu Leu Gly Ile Arg Ile Leu Ala Trp Gln Val Lys Pro Leu Leu Asp Glu Leu Gly Ile Arg Ile Leu Ala
210 215 220 210 215 220
Cys Ile Ser Gly Asp Gly Lys Tyr Lys Asp Val Ala Ser Ser His Arg Cys Ile Ser Gly Asp Gly Lys Tyr Lys Asp Val Ala Ser Ser His Arg
225 230 235 240 225 230 235 240
Ala Lys Ala Ala Met Met Val Cys Ser Lys Ala Met Ile Asn Val Ala Ala Lys Ala Ala Met Met Val Cys Ser Lys Ala Met Ile Asn Val Ala
245 250 255 245 250 255
Arg Lys Met Glu Glu Arg Tyr Asp Ile Pro Phe Phe Glu Gly Ser Phe Arg Lys Met Glu Glu Arg Tyr Asp Ile Pro Phe Phe Glu Gly Ser Phe
260 265 270 260 265 270
Tyr Gly Ile Glu Asp Ser Ser Asp Ser Leu Arg Glu Ile Ala Arg Met Tyr Gly Ile Glu Asp Ser Ser Asp Ser Leu Arg Glu Ile Ala Arg Met
275 280 285 275 280 285
Leu Ile Glu Lys Gly Ala Asp Pro Glu Leu Met Asp Arg Thr Glu Ala Leu Ile Glu Lys Gly Ala Asp Pro Glu Leu Met Asp Arg Thr Glu Ala
290 295 300 290 295 300
Leu Ile Glu Arg Glu Glu Lys Lys Ala Trp Asp Ala Ile Ala Ala Tyr Leu Ile Glu Arg Glu Glu Lys Lys Ala Trp Asp Ala Ile Ala Ala Tyr
305 310 315 320 305 310 315 320
Lys Pro Arg Phe Lys Asp Lys Lys Val Leu Leu Ile Thr Gly Gly Val Lys Pro Arg Phe Lys Asp Lys Lys Val Leu Leu Ile Thr Gly Gly Val
325 330 335 325 330 335
Lys Ser Trp Ser Val Val Ala Ala Leu Gln Glu Ala Gly Leu Glu Leu Lys Ser Trp Ser Val Val Ala Ala Leu Gln Glu Ala Gly Leu Glu Leu
340 345 350 340 345 350
Val Gly Thr Ser Val Lys Lys Ser Thr Lys Glu Asp Lys Glu Arg Ile Val Gly Thr Ser Val Lys Lys Ser Thr Lys Glu Asp Lys Glu Arg Ile
355 360 365 355 360 365
Lys Glu Leu Met Gly Gln Asp Ala His Met Ile Asp Asp Met Thr Pro Lys Glu Leu Met Gly Gln Asp Ala His Met Ile Asp Asp Met Thr Pro
370 375 380 370 375 380
Arg Glu Met Tyr Lys Met Leu Lys Asp Ala Lys Ala Asp Ile Met Leu Arg Glu Met Tyr Lys Met Leu Lys Asp Ala Lys Ala Asp Ile Met Leu
385 390 395 400 385 390 395 400
Ser Gly Gly Arg Ser Gln Phe Ile Ala Leu Lys Ala Ala Met Pro Trp Ser Gly Gly Arg Ser Gln Phe Ile Ala Leu Lys Ala Ala Met Pro Trp
405 410 415 405 410 415
Leu Asp Ile Asn Gln Glu Arg His His Ala Tyr Met Gly Tyr Val Gly Leu Asp Ile Asn Gln Glu Arg His His Ala Tyr Met Gly Tyr Val Gly
420 425 430 420 425 430
Met Val Lys Leu Val Glu Glu Ile Asp Lys Ala Leu Tyr Asn Pro Val Met Val Lys Leu Val Glu Glu Ile Asp Lys Ala Leu Tyr Asn Pro Val
435 440 445 435 440 445
Trp Glu Gln Val Arg Lys Pro Ala Pro Trp Glu Asn Pro Glu Asp Thr Trp Glu Gln Val Arg Lys Pro Ala Pro Trp Glu Asn Pro Glu Asp Thr
450 455 460 450 455 460
Trp Gln Ala Arg Ala Leu Ala Glu Met Glu Ala Glu Ala Ala Ala Leu Trp Gln Ala Arg Ala Leu Ala Glu Met Glu Ala Glu Ala Ala Ala Leu
465 470 475 480 465 470 475 480
Ala Ala Asp Pro Val Arg Ala Glu Glu Val Arg Arg Ser Lys Lys Ile Ala Ala Asp Pro Val Arg Ala Glu Glu Val Arg Arg Ser Lys Lys Ile
485 490 495 485 490 495
Cys Asn Cys Lys Ser Val Asp Leu Gly Thr Ile Glu Asp Ala Ile Lys Cys Asn Cys Lys Ser Val Asp Leu Gly Thr Ile Glu Asp Ala Ile Lys
500 505 510 500 505 510
Ala His Ala Leu Thr Thr Val Glu Gly Val Arg Glu His Thr Asn Ala Ala His Ala Leu Thr Thr Val Glu Gly Val Arg Glu His Thr Asn Ala
515 520 525 515 520 525
Ser Gly Gly Cys Gly Ala Cys Ser Gly Arg Ile Glu Glu Ile Phe Glu Ser Gly Gly Cys Gly Ala Cys Ser Gly Arg Ile Glu Glu Ile Phe Glu
530 535 540 530 535 540
Ala Val Gly Val Val Ala Ala Pro Pro Pro Ala Glu Ala Ala Pro Ser Ala Val Gly Val Val Ala Ala Pro Pro Pro Ala Glu Ala Ala Pro Ser
545 550 555 560 545 550 555 560
Pro Gln Glu Ile Ala Pro Asp Pro Leu Ala Ala Glu Glu Lys Arg Arg Pro Gln Glu Ile Ala Pro Asp Pro Leu Ala Ala Glu Glu Lys Arg Arg
565 570 575 565 570 575
Ala Lys Lys Ala Cys Gly Cys Lys Glu Val Ala Val Gly Thr Ile Glu Ala Lys Lys Ala Cys Gly Cys Lys Glu Val Ala Val Gly Thr Ile Glu
580 585 590 580 585 590
Asp Ala Ile Arg Ala Lys Gly Leu Arg Asn Ile Ala Glu Val Arg Ala Asp Ala Ile Arg Ala Lys Gly Leu Arg Asn Ile Ala Glu Val Arg Ala
595 600 605 595 600 605
Ala Thr Asp Ala Asn Thr Gly Cys Gly Asn Cys Gln Glu Arg Val Glu Ala Thr Asp Ala Asn Thr Gly Cys Gly Asn Cys Gln Glu Arg Val Glu
610 615 620 610 615 620
Gly Ile Leu Asp Arg Val Leu Ala Glu Ala Ala Ser Glu Leu Gln Ala Gly Ile Leu Asp Arg Val Leu Ala Glu Ala Ala Ser Glu Leu Gln Ala
625 630 635 640 625 630 635 640
Ala Glu Ala Glu
<210> 52<210> 52
<211> 1503<211> 1503
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 52<400> 52
atgagtgtcg cacagtccca gagcgtcgcc gagatcaagg cgcgcaacaa ggaactcatc 60atgagtgtcg cacagtccca gagcgtcgcc gagatcaagg cgcgcaacaa ggaactcatc 60
gaagaggtcc tcaaggtcta tcccgagaag accgccaagc gccgcgccaa gcacctgaac 120gaagaggtcc tcaaggtcta tcccgagaag accgccaagc gccgcgccaa gcacctgaac 120
gtccacgaag ccggcaagtc cgactgcggc gtgaagtcca acatcaagtc catcccgggc 180gtccacgaag ccggcaagtc cgactgcggc gtgaagtcca acatcaagtc catcccgggc 180
gtgatgacca tccgcggttg cgcttatgcc ggctccaagg gtgtggtgtg gggtcccatc 240gtgatgacca tccgcggttg cgcttatgcc ggctccaagg gtgtggtgtg gggtcccatc 240
aaggacatga tccacatctc ccacggcccg gtgggctgcg gccagtatag ctgggccgcc 300aaggacatga tccacatctc ccacggcccg gtgggctgcg gccagtatag ctgggccgcc 300
cgccgcaact actatatcgg cacgaccggc atcgacacct tcgtgacgat gcagttcacc 360cgccgcaact actatatcgg cacgaccggc atcgacacct tcgtgacgat gcagttcacc 360
tccgacttcc aggagaagga catcgtcttc ggcggcgaca agaagctcgc caagatcatg 420tccgacttcc aggagaagga catcgtcttc ggcggcgaca agaagctcgc caagatcatg 420
gacgagatcc aggagctgtt cccgctgaac aacggcatca ccgttcagtc cgagtgcccc 480gacgagatcc aggagctgtt cccgctgaac aacggcatca ccgttcagtc cgagtgcccc 480
atcggcctca tcggcgacga catcgaggcc gtctccaagc agaagtccaa ggagtatgag 540atcggcctca tcggcgacga catcgaggcc gtctccaagc agaagtccaa ggagtatgag 540
ggcaagacca tcgtgccggt gcgctgcgag ggcttccgcg gcgtgtccca gtccctgggc 600ggcaagacca tcgtgccggt gcgctgcgag ggcttccgcg gcgtgtccca gtccctgggc 600
caccacatcg ccaacgacgc catccgcgat tgggtgttcg acaagatcgc gcccgacgcc 660caccacatcg ccaacgacgc catccgcgat tgggtgttcg acaagatcgc gcccgacgcc 660
gagccgcgct ttgagccgac cccgtacgac gtcgccatca tcggcgacta caatatcggt 720gagccgcgct ttgagccgac cccgtacgac gtcgccatca tcggcgacta caatatcggt 720
ggtgacgcct ggtcgtcccg tatcctcctg gaggagatgg gcctgcgcgt gatcgcccag 780ggtgacgcct ggtcgtcccg tatcctcctg gaggagatgg gcctgcgcgt gatcgcccag 780
tggtccggcg acggttcgct cgctgagctg gaggccaccc cgaaggccaa gctcaacgtg 840tggtccggcg acggttcgct cgctgagctg gaggccaccc cgaaggccaa gctcaacgtg 840
ctgcactgct accgctccat gaactacatc tcgcgccaca tggaagagaa gtacggtatc 900ctgcactgct accgctccat gaactacatc tcgcgccaca tggaagagaa gtacggtatc 900
ccgtggtgcg agtacaactt cttcggtcct tccaagatcg ccgagtccct gcgcaagatc 960ccgtggtgcg agtacaactt cttcggtcct tccaagatcg ccgagtccct gcgcaagatc 960
gccagctact tcgacgacaa gatcaaggaa ggcgcggagc gcgtcatcgc caagtatcag 1020gccagctact tcgacgacaa gatcaaggaa ggcgcggagc gcgtcatcgc caagtatcag 1020
ccgctcatgg atgcggtgat cgcgaagtat cgtccccgcc tcgagggcaa gaccgtgatg 1080ccgctcatgg atgcggtgat cgcgaagtat cgtccccgcc tcgagggcaa gaccgtgatg 1080
ctgtacgtgg gcggcctgcg tccccgtcac gtcatcggcg cctacgagga cctgggcatg 1140ctgtacgtgg gcggcctgcg tccccgtcac gtcatcggcg cctacgagga cctgggcatg 1140
gaagtggtcg gcacgggcta cgagttcgcc cataacgacg actaccagcg caccgcccag 1200gaagtggtcg gcacgggcta cgagttcgcc cataacgacg actaccagcg caccgcccag 1200
cactacgtca aggatggcac catcatctat gacgacgtga ccggctacga gttcgagaag 1260cactacgtca aggatggcac catcatctat gacgacgtga ccggctacga gttcgagaag 1260
ttcgtcgaga agatccagcc ggacctggtc ggttcgggca tcaaggaaaa gtacgtcttc 1320ttcgtcgaga agatccagcc ggacctggtc ggttcgggca tcaaggaaaa gtacgtcttc 1320
cagaagatgg gcgtgccgtt ccgccagatg cactcctggg actactcggg cccgtaccac 1380cagaagatgg gcgtgccgtt ccgccagatg cactcctggg actactcggg cccgtaccac 1380
ggctatgacg gcttcgcgat cttcgcgcgc gacatggaca tggccatcaa cagccccgtg 1440ggctatgacg gcttcgcgat cttcgcgcgc gacatggaca tggccatcaa cagccccgtg 1440
tggaagatga cccaggctcc gtggaagagc gtccccaagc cgacgatgct cgcggctgaa 1500tggaagatga cccaggctcc gtggaagagc gtccccaagc cgacgatgct cgcggctgaa 1500
tga 1503tga 1503
<210> 53<210> 53
<211> 500<211> 500
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 53<400> 53
Met Ser Val Ala Gln Ser Gln Ser Val Ala Glu Ile Lys Ala Arg Asn Met Ser Val Ala Gln Ser Gln Ser Val Ala Glu Ile Lys Ala Arg Asn
1 5 10 15 1 5 10 15
Lys Glu Leu Ile Glu Glu Val Leu Lys Val Tyr Pro Glu Lys Thr Ala Lys Glu Leu Ile Glu Glu Val Leu Lys Val Tyr Pro Glu Lys Thr Ala
20 25 30 20 25 30
Lys Arg Arg Ala Lys His Leu Asn Val His Glu Ala Gly Lys Ser Asp Lys Arg Arg Ala Lys His Leu Asn Val His Glu Ala Gly Lys Ser Asp
35 40 45 35 40 45
Cys Gly Val Lys Ser Asn Ile Lys Ser Ile Pro Gly Val Met Thr Ile Cys Gly Val Lys Ser Asn Ile Lys Ser Ile Pro Gly Val Met Thr Ile
50 55 60 50 55 60
Arg Gly Cys Ala Tyr Ala Gly Ser Lys Gly Val Val Trp Gly Pro Ile Arg Gly Cys Ala Tyr Ala Gly Ser Lys Gly Val Val Trp Gly Pro Ile
65 70 75 80 65 70 75 80
Lys Asp Met Ile His Ile Ser His Gly Pro Val Gly Cys Gly Gln Tyr Lys Asp Met Ile His Ile Ser His Gly Pro Val Gly Cys Gly Gln Tyr
85 90 95 85 90 95
Ser Trp Ala Ala Arg Arg Asn Tyr Tyr Ile Gly Thr Thr Gly Ile Asp Ser Trp Ala Ala Arg Arg Asn Tyr Tyr Ile Gly Thr Thr Gly Ile Asp
100 105 110 100 105 110
Thr Phe Val Thr Met Gln Phe Thr Ser Asp Phe Gln Glu Lys Asp Ile Thr Phe Val Thr Met Gln Phe Thr Ser Asp Phe Gln Glu Lys Asp Ile
115 120 125 115 120 125
Val Phe Gly Gly Asp Lys Lys Leu Ala Lys Ile Met Asp Glu Ile Gln Val Phe Gly Gly Asp Lys Lys Leu Ala Lys Ile Met Asp Glu Ile Gln
130 135 140 130 135 140
Glu Leu Phe Pro Leu Asn Asn Gly Ile Thr Val Gln Ser Glu Cys Pro Glu Leu Phe Pro Leu Asn Asn Gly Ile Thr Val Gln Ser Glu Cys Pro
145 150 155 160 145 150 155 160
Ile Gly Leu Ile Gly Asp Asp Ile Glu Ala Val Ser Lys Gln Lys Ser Ile Gly Leu Ile Gly Asp Asp Ile Glu Ala Val Ser Lys Gln Lys Ser
165 170 175 165 170 175
Lys Glu Tyr Glu Gly Lys Thr Ile Val Pro Val Arg Cys Glu Gly Phe Lys Glu Tyr Glu Gly Lys Thr Ile Val Pro Val Arg Cys Glu Gly Phe
180 185 190 180 185 190
Arg Gly Val Ser Gln Ser Leu Gly His His Ile Ala Asn Asp Ala Ile Arg Gly Val Ser Gln Ser Leu Gly His His Ile Ala Asn Asp Ala Ile
195 200 205 195 200 205
Arg Asp Trp Val Phe Asp Lys Ile Ala Pro Asp Ala Glu Pro Arg Phe Arg Asp Trp Val Phe Asp Lys Ile Ala Pro Asp Ala Glu Pro Arg Phe
210 215 220 210 215 220
Glu Pro Thr Pro Tyr Asp Val Ala Ile Ile Gly Asp Tyr Asn Ile Gly Glu Pro Thr Pro Tyr Asp Val Ala Ile Ile Gly Asp Tyr Asn Ile Gly
225 230 235 240 225 230 235 240
Gly Asp Ala Trp Ser Ser Arg Ile Leu Leu Glu Glu Met Gly Leu Arg Gly Asp Ala Trp Ser Ser Arg Ile Leu Leu Glu Glu Met Gly Leu Arg
245 250 255 245 250 255
Val Ile Ala Gln Trp Ser Gly Asp Gly Ser Leu Ala Glu Leu Glu Ala Val Ile Ala Gln Trp Ser Gly Asp Gly Ser Leu Ala Glu Leu Glu Ala
260 265 270 260 265 270
Thr Pro Lys Ala Lys Leu Asn Val Leu His Cys Tyr Arg Ser Met Asn Thr Pro Lys Ala Lys Leu Asn Val Leu His Cys Tyr Arg Ser Met Asn
275 280 285 275 280 285
Tyr Ile Ser Arg His Met Glu Glu Lys Tyr Gly Ile Pro Trp Cys Glu Tyr Ile Ser Arg His Met Glu Glu Lys Tyr Gly Ile Pro Trp Cys Glu
290 295 300 290 295 300
Tyr Asn Phe Phe Gly Pro Ser Lys Ile Ala Glu Ser Leu Arg Lys Ile Tyr Asn Phe Phe Gly Pro Ser Lys Ile Ala Glu Ser Leu Arg Lys Ile
305 310 315 320 305 310 315 320
Ala Ser Tyr Phe Asp Asp Lys Ile Lys Glu Gly Ala Glu Arg Val Ile Ala Ser Tyr Phe Asp Asp Lys Ile Lys Glu Gly Ala Glu Arg Val Ile
325 330 335 325 330 335
Ala Lys Tyr Gln Pro Leu Met Asp Ala Val Ile Ala Lys Tyr Arg Pro Ala Lys Tyr Gln Pro Leu Met Asp Ala Val Ile Ala Lys Tyr Arg Pro
340 345 350 340 345 350
Arg Leu Glu Gly Lys Thr Val Met Leu Tyr Val Gly Gly Leu Arg Pro Arg Leu Glu Gly Lys Thr Val Met Leu Tyr Val Gly Gly Leu Arg Pro
355 360 365 355 360 365
Arg His Val Ile Gly Ala Tyr Glu Asp Leu Gly Met Glu Val Val Gly Arg His Val Ile Gly Ala Tyr Glu Asp Leu Gly Met Glu Val Val Gly
370 375 380 370 375 380
Thr Gly Tyr Glu Phe Ala His Asn Asp Asp Tyr Gln Arg Thr Ala Gln Thr Gly Tyr Glu Phe Ala His Asn Asp Asp Tyr Gln Arg Thr Ala Gln
385 390 395 400 385 390 395 400
His Tyr Val Lys Asp Gly Thr Ile Ile Tyr Asp Asp Val Thr Gly Tyr His Tyr Val Lys Asp Gly Thr Ile Ile Tyr Asp Asp Val Thr Gly Tyr
405 410 415 405 410 415
Glu Phe Glu Lys Phe Val Glu Lys Ile Gln Pro Asp Leu Val Gly Ser Glu Phe Glu Lys Phe Val Glu Lys Ile Gln Pro Asp Leu Val Gly Ser
420 425 430 420 425 430
Gly Ile Lys Glu Lys Tyr Val Phe Gln Lys Met Gly Val Pro Phe Arg Gly Ile Lys Glu Lys Tyr Val Phe Gln Lys Met Gly Val Pro Phe Arg
435 440 445 435 440 445
Gln Met His Ser Trp Asp Tyr Ser Gly Pro Tyr His Gly Tyr Asp Gly Gln Met His Ser Trp Asp Tyr Ser Gly Pro Tyr His Gly Tyr Asp Gly
450 455 460 450 455 460
Phe Ala Ile Phe Ala Arg Asp Met Asp Met Ala Ile Asn Ser Pro Val Phe Ala Ile Phe Ala Arg Asp Met Asp Met Ala Ile Asn Ser Pro Val
465 470 475 480 465 470 475 480
Trp Lys Met Thr Gln Ala Pro Trp Lys Ser Val Pro Lys Pro Thr Met Trp Lys Met Thr Gln Ala Pro Trp Lys Ser Val Pro Lys Pro Thr Met
485 490 495 485 490 495
Leu Ala Ala Glu Leu Ala Ala Glu
500 500
<210> 54<210> 54
<211> 1377<211> 1377
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 54<400> 54
atggccaccg tttccgtctc caagaaggcc tgcgcggtca accccctcaa gatgagccag 60atggccaccg tttccgtctc caagaaggcc tgcgcggtca accccctcaa gatgagccag 60
ccggtgggcg gcgcgctcgc cttcatgggc gtgcgcaagg ccatgccgct gctgcacggc 120ccggtgggcg gcgcgctcgc cttcatgggc gtgcgcaagg ccatgccgct gctgcacggc 120
tcgcagggct gcacctcctt cggcctggtg ctgttcgtgc gccacttcaa ggaagccatc 180tcgcagggct gcacctcctt cggcctggtg ctgttcgtgc gccacttcaa ggaagccatc 180
cccatgcaga ccaccgccat gagcgaggtg gcgacggttc tgggcggcct tgagaatgtg 240cccatgcaga ccaccgccat gagcgaggtg gcgacggttc tgggcggcct tgagaatgtg 240
gagcaggcca ttctcaacat ctacaatcgc accaagccgg agatcatcgg catctgctcc 300gagcaggcca ttctcaacat ctacaatcgc accaagccgg agatcatcgg catctgctcc 300
accggcgtca ccgagaccaa gggcgatgat gtcgacggct acatcaagct gatccgggac 360accggcgtca ccgagaccaa gggcgatgat gtcgacggct acatcaagct gatccgggac 360
aagtatcccc agctggccga cttcccgctg gtctatgtct ccacccccga tttcaaggac 420aagtatcccc agctggccga cttcccgctg gtctatgtct ccacccccga tttcaaggac 420
gccttccagg acggttggga gaagaccgtg gcgaagatgg tggaggcgct ggtgaagccc 480gccttccagg acggttggga gaagaccgtg gcgaagatgg tggaggcgct ggtgaagccc 480
gccgccgaca agcagaagga caagacccgc gtcaacgtcc tgcccggctg ccacctcacg 540gccgccgaca agcagaagga caagacccgc gtcaacgtcc tgcccggctg ccacctcacg 540
cccggcgatc tggatgagat gcggaccatc ttcgaggatt tcgggctcac accctatttc 600cccggcgatc tggatgagat gcggaccatc ttcgaggatt tcgggctcac accctatttc 600
ctgccggatc tggccggctc gctggatggg catatccccg aggacttctc gcccaccacc 660ctgccggatc tggccggctc gctggatggg catatccccg aggacttctc gcccaccacc 660
atcggcggca tcggcatcga tgagatcgcc accatgggcg aggcggccca caccatctgc 720atcggcggca tcggcatcga tgagatcgcc accatgggcg aggcggccca caccatctgc 720
atcggcgcgc agatgcgccg ggcgggcgag gccatggaga agaagaccgg cattcccttc 780atcggcgcgc agatgcgccg ggcgggcgag gccatggaga agaagaccgg cattcccttc 780
aagctgttcg agcgcctgtg cggcctggag gcgaacgacg ccttcatcat gcacctgtcg 840aagctgttcg agcgcctgtg cggcctggag gcgaacgacg ccttcatcat gcacctgtcg 840
cagatctccg gccggccggt gccggtgaag tatcgccggc agcggggcca gctggtggat 900cagatctccg gccggccggt gccggtgaag tatcgccggc agcggggcca gctggtggat 900
gccatgctgg acggccactt ccatctgggc ggtcgcaagg tggccatggg ggcggagccg 960gccatgctgg acggccactt ccatctgggc ggtcgcaagg tggccatggg ggcggagccg 960
gacctgctct acgacgtggg ctccttcctg cacgagatgg gcgcccacat cctttccgcg 1020gacctgctct acgacgtggg ctccttcctg cacgagatgg gcgcccacat cctttccgcg 1020
gtcaccacca cccagtcgcc ggtgctggcg cgcctgcctg ccgaggaggt gcttatcggc 1080gtcaccacca cccagtcgcc ggtgctggcg cgcctgcctg ccgaggaggt gcttatcggc 1080
gacctggagg atctggagac ccaggcgaag gcgcgcggat gcgatctcct gctcacccat 1140gacctggagg atctggagac ccaggcgaag gcgcgcggat gcgatctcct gctcacccat 1140
tcccatgggc gccaggcggc ggagcgcctc cacatcccct tctaccggat cggcattccc 1200tcccatgggc gccaggcggc ggagcgcctc cacatcccct tctaccggat cggcattccc 1200
atgtttgacc ggctgggggc ggggcatctg ttgtcggtgg gctatcgcgg cacccgcgac 1260atgtttgacc ggctgggggc ggggcatctg ttgtcggtgg gctatcgcgg cacccgcgac 1260
ctcatcttcc atctcgccaa ccttgtgatc gccgaccacg aggaaaatca cgagccgacg 1320ctcatcttcc atctcgccaa ccttgtgatc gccgaccacg aggaaaatca cgagccgacg 1320
cccgacacct gggccaccgg ccatggcgag catgccgccg cccccacttc ccattga 1377cccgacacct gggccaccgg ccatggcgag catgccgccg cccccacttc ccattga 1377
<210> 55<210> 55
<211> 458<211> 458
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 55<400> 55
Met Ala Thr Val Ser Val Ser Lys Lys Ala Cys Ala Val Asn Pro Leu Met Ala Thr Val Ser Val Ser Lys Lys Ala Cys Ala Val Asn Pro Leu
1 5 10 15 1 5 10 15
Lys Met Ser Gln Pro Val Gly Gly Ala Leu Ala Phe Met Gly Val Arg Lys Met Ser Gln Pro Val Gly Gly Ala Leu Ala Phe Met Gly Val Arg
20 25 30 20 25 30
Lys Ala Met Pro Leu Leu His Gly Ser Gln Gly Cys Thr Ser Phe Gly Lys Ala Met Pro Leu Leu His Gly Ser Gln Gly Cys Thr Ser Phe Gly
35 40 45 35 40 45
Leu Val Leu Phe Val Arg His Phe Lys Glu Ala Ile Pro Met Gln Thr Leu Val Leu Phe Val Arg His Phe Lys Glu Ala Ile Pro Met Gln Thr
50 55 60 50 55 60
Thr Ala Met Ser Glu Val Ala Thr Val Leu Gly Gly Leu Glu Asn Val Thr Ala Met Ser Glu Val Ala Thr Val Leu Gly Gly Leu Glu Asn Val
65 70 75 80 65 70 75 80
Glu Gln Ala Ile Leu Asn Ile Tyr Asn Arg Thr Lys Pro Glu Ile Ile Glu Gln Ala Ile Leu Asn Ile Tyr Asn Arg Thr Lys Pro Glu Ile Ile
85 90 95 85 90 95
Gly Ile Cys Ser Thr Gly Val Thr Glu Thr Lys Gly Asp Asp Val Asp Gly Ile Cys Ser Thr Gly Val Thr Glu Thr Lys Gly Asp Asp Val Asp
100 105 110 100 105 110
Gly Tyr Ile Lys Leu Ile Arg Asp Lys Tyr Pro Gln Leu Ala Asp Phe Gly Tyr Ile Lys Leu Ile Arg Asp Lys Tyr Pro Gln Leu Ala Asp Phe
115 120 125 115 120 125
Pro Leu Val Tyr Val Ser Thr Pro Asp Phe Lys Asp Ala Phe Gln Asp Pro Leu Val Tyr Val Ser Thr Pro Asp Phe Lys Asp Ala Phe Gln Asp
130 135 140 130 135 140
Gly Trp Glu Lys Thr Val Ala Lys Met Val Glu Ala Leu Val Lys Pro Gly Trp Glu Lys Thr Val Ala Lys Met Val Glu Ala Leu Val Lys Pro
145 150 155 160 145 150 155 160
Ala Ala Asp Lys Gln Lys Asp Lys Thr Arg Val Asn Val Leu Pro Gly Ala Ala Asp Lys Gln Lys Asp Lys Thr Arg Val Asn Val Leu Pro Gly
165 170 175 165 170 175
Cys His Leu Thr Pro Gly Asp Leu Asp Glu Met Arg Thr Ile Phe Glu Cys His Leu Thr Pro Gly Asp Leu Asp Glu Met Arg Thr Ile Phe Glu
180 185 190 180 185 190
Asp Phe Gly Leu Thr Pro Tyr Phe Leu Pro Asp Leu Ala Gly Ser Leu Asp Phe Gly Leu Thr Pro Tyr Phe Leu Pro Asp Leu Ala Gly Ser Leu
195 200 205 195 200 205
Asp Gly His Ile Pro Glu Asp Phe Ser Pro Thr Thr Ile Gly Gly Ile Asp Gly His Ile Pro Glu Asp Phe Ser Pro Thr Thr Ile Gly Gly Ile
210 215 220 210 215 220
Gly Ile Asp Glu Ile Ala Thr Met Gly Glu Ala Ala His Thr Ile Cys Gly Ile Asp Glu Ile Ala Thr Met Gly Glu Ala Ala His Thr Ile Cys
225 230 235 240 225 230 235 240
Ile Gly Ala Gln Met Arg Arg Ala Gly Glu Ala Met Glu Lys Lys Thr Ile Gly Ala Gln Met Arg Arg Ala Gly Glu Ala Met Glu Lys Lys Thr
245 250 255 245 250 255
Gly Ile Pro Phe Lys Leu Phe Glu Arg Leu Cys Gly Leu Glu Ala Asn Gly Ile Pro Phe Lys Leu Phe Glu Arg Leu Cys Gly Leu Glu Ala Asn
260 265 270 260 265 270
Asp Ala Phe Ile Met His Leu Ser Gln Ile Ser Gly Arg Pro Val Pro Asp Ala Phe Ile Met His Leu Ser Gln Ile Ser Gly Arg Pro Val Pro
275 280 285 275 280 285
Val Lys Tyr Arg Arg Gln Arg Gly Gln Leu Val Asp Ala Met Leu Asp Val Lys Tyr Arg Arg Gln Arg Gly Gln Leu Val Asp Ala Met Leu Asp
290 295 300 290 295 300
Gly His Phe His Leu Gly Gly Arg Lys Val Ala Met Gly Ala Glu Pro Gly His Phe His Leu Gly Gly Arg Lys Val Ala Met Gly Ala Glu Pro
305 310 315 320 305 310 315 320
Asp Leu Leu Tyr Asp Val Gly Ser Phe Leu His Glu Met Gly Ala His Asp Leu Leu Tyr Asp Val Gly Ser Phe Leu His Glu Met Gly Ala His
325 330 335 325 330 335
Ile Leu Ser Ala Val Thr Thr Thr Gln Ser Pro Val Leu Ala Arg Leu Ile Leu Ser Ala Val Thr Thr Thr Gln Ser Pro Val Leu Ala Arg Leu
340 345 350 340 345 350
Pro Ala Glu Glu Val Leu Ile Gly Asp Leu Glu Asp Leu Glu Thr Gln Pro Ala Glu Glu Val Leu Ile Gly Asp Leu Glu Asp Leu Glu Thr Gln
355 360 365 355 360 365
Ala Lys Ala Arg Gly Cys Asp Leu Leu Leu Thr His Ser His Gly Arg Ala Lys Ala Arg Gly Cys Asp Leu Leu Leu Thr His Ser His Gly Arg
370 375 380 370 375 380
Gln Ala Ala Glu Arg Leu His Ile Pro Phe Tyr Arg Ile Gly Ile Pro Gln Ala Ala Glu Arg Leu His Ile Pro Phe Tyr Arg Ile Gly Ile Pro
385 390 395 400 385 390 395 400
Met Phe Asp Arg Leu Gly Ala Gly His Leu Leu Ser Val Gly Tyr Arg Met Phe Asp Arg Leu Gly Ala Gly His Leu Leu Ser Val Gly Tyr Arg
405 410 415 405 410 415
Gly Thr Arg Asp Leu Ile Phe His Leu Ala Asn Leu Val Ile Ala Asp Gly Thr Arg Asp Leu Ile Phe His Leu Ala Asn Leu Val Ile Ala Asp
420 425 430 420 425 430
His Glu Glu Asn His Glu Pro Thr Pro Asp Thr Trp Ala Thr Gly His His Glu Glu Asn His Glu Pro Thr Pro Asp Thr Trp Ala Thr Gly His
435 440 445 435 440 445
Gly Glu His Ala Ala Ala Pro Thr Ser His Gly Glu His Ala Ala Ala Pro Thr Ser His
450 455 450 455
<210> 56<210> 56
<211> 1560<211> 1560
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 56<400> 56
atgccacaaa atgctgacaa tgtgctcgat cacttcgagc tcttccgtgg tcccgaatac 60atgccacaaa atgctgacaa tgtgctcgat cacttcgagc tcttccgtgg tcccgaatac 60
cagcagatgc tggccaataa gaaaaagatg ttcgagaacc cccgcgatcc ggccgaagtc 120cagcagatgc tggccaataa gaaaaagatg ttcgagaacc cccgcgatcc ggccgaagtc 120
gagcgcgtgc gggaatgggc gaagactcct gaatacaagg agctgaactt cgcccgcgag 180gagcgcgtgc gggaatgggc gaagactcct gaatacaagg agctgaactt cgcccgcgag 180
gcgctcaccg tgaatccggc caaggcttgt cagccgctgg gcgcggtgtt cgtcgccgtc 240gcgctcaccg tgaatccggc caaggcttgt cagccgctgg gcgcggtgtt cgtcgccgtc 240
ggcttcgaga gcacgatccc cttcgtgcac ggctcgcagg gttgcgtcgc gtattaccgc 300ggcttcgaga gcacgatccc cttcgtgcac ggctcgcagg gttgcgtcgc gtattaccgc 300
tcgcacctct cccgccactt caaggagccg tcctcctgcg tctcctcgtc catgaccgag 360tcgcacctct cccgccactt caaggagccg tcctcctgcg tctcctcgtc catgaccgag 360
gatgcggcgg tgttcggcgg cctcaacaac atgattgacg gcctcgccaa cacctacaac 420gatgcggcgg tgttcggcgg cctcaacaac atgattgacg gcctcgccaa cacctacaac 420
atgtacaagc cgaagatgat cgccgtctcc accacctgca tggcggaagt catcggcgac 480atgtacaagc cgaagatgat cgccgtctcc accacctgca tggcggaagt catcggcgac 480
gatctgaacg ccttcatcaa gaccgcgaag gaaaagggct cggttccggc cgaatacgac 540gatctgaacg ccttcatcaa gaccgcgaag gaaaagggct cggttccggc cgaatacgac 540
gtgcccttcg cccacacccc ggcgttcgtc ggcagccatg tcaccggcta cgacaatgcg 600gtgcccttcg cccacacccc ggcgttcgtc ggcagccatg tcaccggcta cgacaatgcg 600
ctcaagggca tcctcgagca cttctgggac ggcaaggccg gcaccgcgcc gaagctggag 660ctcaagggca tcctcgagca cttctgggac ggcaaggccg gcaccgcgcc gaagctggag 660
cgcgttccca acgagaagat caacttcatc ggcggcttcg acggctacac cgtcggcaac 720cgcgttccca acgagaagat caacttcatc ggcggcttcg acggctacac cgtcggcaac 720
actcgcgaag tgaagcgcat cttcgaggcg ttcggcgccg attacaccat cctcgccgac 780actcgcgaag tgaagcgcat cttcgaggcg ttcggcgccg attacaccat cctcgccgac 780
aattccgaag tgttcgacac cccgaccgac ggcgagttcc gcatgtatga cggcggcacg 840aattccgaag tgttcgacac cccgaccgac ggcgagttcc gcatgtatga cggcggcacg 840
accctggagg acgcggcgaa cgcggtgcac gccaaggcca ccatctccat gcaggaatac 900accctggagg acgcggcgaa cgcggtgcac gccaaggcca ccatctccat gcaggaatac 900
tgcacggaga agaccctgcc catgatcgcc ggtcatggcc aggacgtggt cgccctcaac 960tgcacggaga agaccctgcc catgatcgcc ggtcatggcc aggacgtggt cgccctcaac 960
caccccgtgg gcgtgggcgg caccgacaag ttcctcatgg agatcgcccg cctcaccggc 1020caccccgtgg gcgtgggcgg caccgacaag ttcctcatgg agatcgcccg cctcaccggc 1020
aaggagatcc ccgaggagct gacccgcgag cgcggccgtc tcgtggacgc tatcgcggac 1080aaggagatcc ccgaggagct gacccgcgag cgcggccgtc tcgtggacgc tatcgcggac 1080
tcttccgcgc acatccacgg caagaagttc gccatctacg gcgatccgga tctgtgcctg 1140tcttccgcgc acatccacgg caagaagttc gccatctacg gcgatccgga tctgtgcctg 1140
ggcctcgccg cgttcctgct ggagctgggc gccgagccga cccatgtgct ggccaccaac 1200ggcctcgccg cgttcctgct ggagctgggc gccgagccga cccatgtgct ggccaccaac 1200
ggcaccaaga agtgggccga gaaggttcag gaactgttcg actcttcgcc gttcggcgcc 1260ggcaccaaga agtgggccga gaaggttcag gaactgttcg actcttcgcc gttcggcgcc 1260
aactgcaagg tctatcccgg caaggacctg tggcacatgc gctcgctcct gttcgtggag 1320aactgcaagg tctatcccgg caaggacctg tggcacatgc gctcgctcct gttcgtggag 1320
ccggtggatt tcatcatcgg caacacctac ggcaagtatc tcgagcgcga cacgggcacc 1380ccggtggatt tcatcatcgg caacacctac ggcaagtatc tcgagcgcga cacgggcacc 1380
ccgctgatcc gtatcggctt cccggtgttc gaccgtcacc accaccaccg ccgtccggtg 1440ccgctgatcc gtatcggctt cccggtgttc gaccgtcacc accaccaccg ccgtccggtg 1440
tggggctatc agggcggcat gaacgtcctg atcacgatcc tcgacaagat ctttgacgag 1500tggggctatc agggcggcat gaacgtcctg atcacgatcc tcgacaagat ctttgacgag 1500
atcgaccgca acaccaacgt gccggccaag accgactact cgttcgacat cattcgttga 1560atcgaccgca acaccaacgt gccggccaag accgactact cgttcgacat cattcgttga 1560
<210> 57<210> 57
<211> 519<211> 519
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 57<400> 57
Met Pro Gln Asn Ala Asp Asn Val Leu Asp His Phe Glu Leu Phe Arg Met Pro Gln Asn Ala Asp Asn Val Leu Asp His Phe Glu Leu Phe Arg
1 5 10 15 1 5 10 15
Gly Pro Glu Tyr Gln Gln Met Leu Ala Asn Lys Lys Lys Met Phe Glu Gly Pro Glu Tyr Gln Gln Met Leu Ala Asn Lys Lys Lys Met Phe Glu
20 25 30 20 25 30
Asn Pro Arg Asp Pro Ala Glu Val Glu Arg Val Arg Glu Trp Ala Lys Asn Pro Arg Asp Pro Ala Glu Val Glu Arg Val Arg Glu Trp Ala Lys
35 40 45 35 40 45
Thr Pro Glu Tyr Lys Glu Leu Asn Phe Ala Arg Glu Ala Leu Thr Val Thr Pro Glu Tyr Lys Glu Leu Asn Phe Ala Arg Glu Ala Leu Thr Val
50 55 60 50 55 60
Asn Pro Ala Lys Ala Cys Gln Pro Leu Gly Ala Val Phe Val Ala Val Asn Pro Ala Lys Ala Cys Gln Pro Leu Gly Ala Val Phe Val Ala Val
65 70 75 80 65 70 75 80
Gly Phe Glu Ser Thr Ile Pro Phe Val His Gly Ser Gln Gly Cys Val Gly Phe Glu Ser Thr Ile Pro Phe Val His Gly Ser Gln Gly Cys Val
85 90 95 85 90 95
Ala Tyr Tyr Arg Ser His Leu Ser Arg His Phe Lys Glu Pro Ser Ser Ala Tyr Tyr Arg Ser His Leu Ser Arg His Phe Lys Glu Pro Ser Ser
100 105 110 100 105 110
Cys Val Ser Ser Ser Met Thr Glu Asp Ala Ala Val Phe Gly Gly Leu Cys Val Ser Ser Ser Met Thr Glu Asp Ala Ala Val Phe Gly Gly Leu
115 120 125 115 120 125
Asn Asn Met Ile Asp Gly Leu Ala Asn Thr Tyr Asn Met Tyr Lys Pro Asn Asn Met Ile Asp Gly Leu Ala Asn Thr Tyr Asn Met Tyr Lys Pro
130 135 140 130 135 140
Lys Met Ile Ala Val Ser Thr Thr Cys Met Ala Glu Val Ile Gly Asp Lys Met Ile Ala Val Ser Thr Thr Cys Met Ala Glu Val Ile Gly Asp
145 150 155 160 145 150 155 160
Asp Leu Asn Ala Phe Ile Lys Thr Ala Lys Glu Lys Gly Ser Val Pro Asp Leu Asn Ala Phe Ile Lys Thr Ala Lys Glu Lys Gly Ser Val Pro
165 170 175 165 170 175
Ala Glu Tyr Asp Val Pro Phe Ala His Thr Pro Ala Phe Val Gly Ser Ala Glu Tyr Asp Val Pro Phe Ala His Thr Pro Ala Phe Val Gly Ser
180 185 190 180 185 190
His Val Thr Gly Tyr Asp Asn Ala Leu Lys Gly Ile Leu Glu His Phe His Val Thr Gly Tyr Asp Asn Ala Leu Lys Gly Ile Leu Glu His Phe
195 200 205 195 200 205
Trp Asp Gly Lys Ala Gly Thr Ala Pro Lys Leu Glu Arg Val Pro Asn Trp Asp Gly Lys Ala Gly Thr Ala Pro Lys Leu Glu Arg Val Pro Asn
210 215 220 210 215 220
Glu Lys Ile Asn Phe Ile Gly Gly Phe Asp Gly Tyr Thr Val Gly Asn Glu Lys Ile Asn Phe Ile Gly Gly Phe Asp Gly Tyr Thr Val Gly Asn
225 230 235 240 225 230 235 240
Thr Arg Glu Val Lys Arg Ile Phe Glu Ala Phe Gly Ala Asp Tyr Thr Thr Arg Glu Val Lys Arg Ile Phe Glu Ala Phe Gly Ala Asp Tyr Thr
245 250 255 245 250 255
Ile Leu Ala Asp Asn Ser Glu Val Phe Asp Thr Pro Thr Asp Gly Glu Ile Leu Ala Asp Asn Ser Glu Val Phe Asp Thr Pro Thr Asp Gly Glu
260 265 270 260 265 270
Phe Arg Met Tyr Asp Gly Gly Thr Thr Leu Glu Asp Ala Ala Asn Ala Phe Arg Met Tyr Asp Gly Gly Thr Thr Leu Glu Asp Ala Ala Asn Ala
275 280 285 275 280 285
Val His Ala Lys Ala Thr Ile Ser Met Gln Glu Tyr Cys Thr Glu Lys Val His Ala Lys Ala Thr Ile Ser Met Gln Glu Tyr Cys Thr Glu Lys
290 295 300 290 295 300
Thr Leu Pro Met Ile Ala Gly His Gly Gln Asp Val Val Ala Leu Asn Thr Leu Pro Met Ile Ala Gly His Gly Gln Asp Val Val Ala Leu Asn
305 310 315 320 305 310 315 320
His Pro Val Gly Val Gly Gly Thr Asp Lys Phe Leu Met Glu Ile Ala His Pro Val Gly Val Gly Gly Thr Asp Lys Phe Leu Met Glu Ile Ala
325 330 335 325 330 335
Arg Leu Thr Gly Lys Glu Ile Pro Glu Glu Leu Thr Arg Glu Arg Gly Arg Leu Thr Gly Lys Glu Ile Pro Glu Glu Leu Thr Arg Glu Arg Gly
340 345 350 340 345 350
Arg Leu Val Asp Ala Ile Ala Asp Ser Ser Ala His Ile His Gly Lys Arg Leu Val Asp Ala Ile Ala Asp Ser Ser Ala His Ile His Gly Lys
355 360 365 355 360 365
Lys Phe Ala Ile Tyr Gly Asp Pro Asp Leu Cys Leu Gly Leu Ala Ala Lys Phe Ala Ile Tyr Gly Asp Pro Asp Leu Cys Leu Gly Leu Ala Ala
370 375 380 370 375 380
Phe Leu Leu Glu Leu Gly Ala Glu Pro Thr His Val Leu Ala Thr Asn Phe Leu Leu Glu Leu Gly Ala Glu Pro Thr His Val Leu Ala Thr Asn
385 390 395 400 385 390 395 400
Gly Thr Lys Lys Trp Ala Glu Lys Val Gln Glu Leu Phe Asp Ser Ser Gly Thr Lys Lys Trp Ala Glu Lys Val Gln Glu Leu Phe Asp Ser Ser
405 410 415 405 410 415
Pro Phe Gly Ala Asn Cys Lys Val Tyr Pro Gly Lys Asp Leu Trp His Pro Phe Gly Ala Asn Cys Lys Val Tyr Pro Gly Lys Asp Leu Trp His
420 425 430 420 425 430
Met Arg Ser Leu Leu Phe Val Glu Pro Val Asp Phe Ile Ile Gly Asn Met Arg Ser Leu Leu Phe Val Glu Pro Val Asp Phe Ile Ile Gly Asn
435 440 445 435 440 445
Thr Tyr Gly Lys Tyr Leu Glu Arg Asp Thr Gly Thr Pro Leu Ile Arg Thr Tyr Gly Lys Tyr Leu Glu Arg Asp Thr Gly Thr Pro Leu Ile Arg
450 455 460 450 455 460
Ile Gly Phe Pro Val Phe Asp Arg His His His His Arg Arg Pro Val Ile Gly Phe Pro Val Phe Asp Arg His His His His Arg Arg Pro Val
465 470 475 480 465 470 475 480
Trp Gly Tyr Gln Gly Gly Met Asn Val Leu Ile Thr Ile Leu Asp Lys Trp Gly Tyr Gln Gly Gly Met Asn Val Leu Ile Thr Ile Leu Asp Lys
485 490 495 485 490 495
Ile Phe Asp Glu Ile Asp Arg Asn Thr Asn Val Pro Ala Lys Thr Asp Ile Phe Asp Glu Ile Asp Arg Asn Thr Asn Val Pro Ala Lys Thr Asp
500 505 510 500 505 510
Tyr Ser Phe Asp Ile Ile Arg Tyr Ser Phe Asp Ile Ile Arg
515 515
<210> 58<210> 58
<211> 621<211> 621
<212> ДНК<212> DNA
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 58<400> 58
gtggagtccg gtggtcctga gccgggcgtg ggctgcgccg gccgcggcgt gatcacctcc 60gtggagtccg gtggtcctga gccgggcgtg ggctgcgccg gccgcggcgt gatcacctcc 60
atcaacttcc tggaggagaa cggcgcctac gaggacatcg actatgtgtc ctacgacgtg 120atcaacttcc tggaggagaa cggcgcctac gaggacatcg actatgtgtc ctacgacgtg 120
ctgggcgacg tggtgtgcgg cggcttcgcc atgcccatcc gcgagaacaa ggcgcaggaa 180ctgggcgacg tggtgtgcgg cggcttcgcc atgcccatcc gcgagaacaa ggcgcaggaa 180
atctacatcg tgatgtccgg cgagatgatg gccatgtatg cggccaacaa catctccaag 240atctacatcg tgatgtccgg cgagatgatg gccatgtatg cggccaacaa catctccaag 240
ggcatcctga agtatgccaa ttccggcggc gtgcgcctgg gcgggctggt ctgcaacgag 300ggcatcctga agtatgccaa ttccggcggc gtgcgcctgg gcgggctggt ctgcaacgag 300
cgccagaccg acaaggagct ggagctggcg gaggctctgg cgaagaagct cggcaccgag 360cgccagaccg acaaggagct ggagctggcg gaggctctgg cgaagaagct cggcaccgag 360
ctgatctact tcgtgccgcg cgacaacatc gtgcagcatg ccgagctgcg ccgcatgaca 420ctgatctact tcgtgccgcg cgacaacatc gtgcagcatg ccgagctgcg ccgcatgaca 420
gtgatcgagt atgcgcccga ttccgcccag gcccagcact accggaacct ggccgagaag 480gtgatcgagt atgcgcccga ttccgcccag gcccagcact accggaacct ggccgagaag 480
gtgcacgcca acaagggcaa cggcatcatc ccgaccccga tcaccatgga cgagctggaa 540gtgcacgcca acaagggcaa cggcatcatc ccgaccccga tcaccatgga cgagctggaa 540
gacatgctca tggagcacgg catcatgaag gccgtggacg agagccagat cggcaagacc 600gacatgctca tggagcacgg catcatgaag gccgtggacg agagccagat cggcaagacc 600
gccgccgagc tcgccgtctg a 621gccgccgagc tcgccgtctg a 621
<210> 59<210> 59
<211> 206<211> 206
<212> Белок<212> Protein
<213> Неизвестный<213> Unknown
<220><220>
<223> Новый штамм<223> New strain
<400> 59<400> 59
Met Glu Ser Gly Gly Pro Glu Pro Gly Val Gly Cys Ala Gly Arg Gly Met Glu Ser Gly Gly Pro Glu Pro Gly Val Gly Cys Ala Gly Arg Gly
1 5 10 15 1 5 10 15
Val Ile Thr Ser Ile Asn Phe Leu Glu Glu Asn Gly Ala Tyr Glu Asp Val Ile Thr Ser Ile Asn Phe Leu Glu Glu Asn Gly Ala Tyr Glu Asp
20 25 30 20 25 30
Ile Asp Tyr Val Ser Tyr Asp Val Leu Gly Asp Val Val Cys Gly Gly Ile Asp Tyr Val Ser Tyr Asp Val Leu Gly Asp Val Val Cys Gly Gly
35 40 45 35 40 45
Phe Ala Met Pro Ile Arg Glu Asn Lys Ala Gln Glu Ile Tyr Ile Val Phe Ala Met Pro Ile Arg Glu Asn Lys Ala Gln Glu Ile Tyr Ile Val
50 55 60 50 55 60
Met Ser Gly Glu Met Met Ala Met Tyr Ala Ala Asn Asn Ile Ser Lys Met Ser Gly Glu Met Met Ala Met Tyr Ala Ala Asn Asn Ile Ser Lys
65 70 75 80 65 70 75 80
Gly Ile Leu Lys Tyr Ala Asn Ser Gly Gly Val Arg Leu Gly Gly Leu Gly Ile Leu Lys Tyr Ala Asn Ser Gly Gly Val Arg Leu Gly Gly Leu
85 90 95 85 90 95
Val Cys Asn Glu Arg Gln Thr Asp Lys Glu Leu Glu Leu Ala Glu Ala Val Cys Asn Glu Arg Gln Thr Asp Lys Glu Leu Glu Leu Ala Glu Ala
100 105 110 100 105 110
Leu Ala Lys Lys Leu Gly Thr Glu Leu Ile Tyr Phe Val Pro Arg Asp Leu Ala Lys Lys Leu Gly Thr Glu Leu Ile Tyr Phe Val Pro Arg Asp
115 120 125 115 120 125
Asn Ile Val Gln His Ala Glu Leu Arg Arg Met Thr Val Ile Glu Tyr Asn Ile Val Gln His Ala Glu Leu Arg Arg Met Thr Val Ile Glu Tyr
130 135 140 130 135 140
Ala Pro Asp Ser Ala Gln Ala Gln His Tyr Arg Asn Leu Ala Glu Lys Ala Pro Asp Ser Ala Gln Ala Gln His Tyr Arg Asn Leu Ala Glu Lys
145 150 155 160 145 150 155 160
Val His Ala Asn Lys Gly Asn Gly Ile Ile Pro Thr Pro Ile Thr Met Val His Ala Asn Lys Gly Asn Gly Ile Ile Pro Thr Pro Ile Thr Met
165 170 175 165 170 175
Asp Glu Leu Glu Asp Met Leu Met Glu His Gly Ile Met Lys Ala Val Asp Glu Leu Glu Asp Met Leu Met Glu His Gly Ile Met Lys Ala Val
180 185 190 180 185 190
Asp Glu Ser Gln Ile Gly Lys Thr Ala Ala Glu Leu Ala Val Asp Glu Ser Gln Ile Gly Lys Thr Ala Ala Glu Leu Ala Val
195 200 205 195 200 205
<---<---
Claims (69)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP19205786.7 | 2019-10-29 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| RU2806550C1 true RU2806550C1 (en) | 2023-11-01 |
Family
ID=
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2018144965A1 (en) * | 2017-02-03 | 2018-08-09 | Kiverdi, Inc. | Microbial conversion of co2 and other c1 substrates to vegan nutrients, fertilizers, biostimulants, and systems for accelerated soil carbon sequestration |
| RU2688486C2 (en) * | 2014-01-16 | 2019-05-21 | Калиста, Инк. | Microorganisms for high production of amino acids and related methods |
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| RU2688486C2 (en) * | 2014-01-16 | 2019-05-21 | Калиста, Инк. | Microorganisms for high production of amino acids and related methods |
| WO2018144965A1 (en) * | 2017-02-03 | 2018-08-09 | Kiverdi, Inc. | Microbial conversion of co2 and other c1 substrates to vegan nutrients, fertilizers, biostimulants, and systems for accelerated soil carbon sequestration |
Non-Patent Citations (2)
| Title |
|---|
| OHMIYA K. ET AL. Application of microbial genes to recalcitrant biomass utilization and environmental conservation. J Biosci Bioeng. 2003;95(6):549-61 Найдено онлайн: https://www.sciencedirect.com/science/article/pii/S1389172303801615?via%3Dihub Дата обращения 28.12.2022. JIANYU ET AL. The energy efficiency of carbon dioxide fixation by a hydrogen-oxidizing bacterium. International Journal of Hydrogen Energy, 2013, 38(21):8683-8690 Найдено онлайн: https://www.sciencedirect.com/science/article/pii/S0360319913011312 Дата обращения 28.12.2022. * |
| ВОЛОВА Т.Г., БАРАШКОВ В.А. Характеристика белков, синтезируемых водородокисляющими микроорганизмами. Прикладная биохимия и микробиология. 2010. Т. 46, N 6, С. 624-629. Найдено онлайн: http://www.biotech.sfu-kras.ru/files/200_Volova,_Barashkov_2010.pdf Дата обращения 28.12.2022. * |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR102795704B1 (en) | Strains and methods for single cell protein or biomass production | |
| Schleper et al. | Picrophilus gen. nov., fam. nov.: a novel aerobic, heterotrophic, thermoacidophilic genus and family comprising archaea capable of growth around pH 0 | |
| Makhdoumi-Kakhki et al. | Halovenus aranensis gen. nov., sp. nov., an extremely halophilic archaeon from Aran-Bidgol salt lake | |
| KR20060018868A (en) | Amino acid production method and composition for amino acid production | |
| CN114107159A (en) | High-yield beta-alanine producing strain, construction method and application | |
| JP4262206B2 (en) | Fermentation method for production of coenzyme Q10 with recombinant Agrobacterium tumefaciens | |
| CN110129244B (en) | Streptomyces chassis strains, their construction methods, and their application in heterologous expression research | |
| RU2806550C1 (en) | Strains and methods of obtaining protein of single-cellular organisms or biomass | |
| FI129574B (en) | Variant bacterial strains and processes for protein or biomass production | |
| JP4221476B2 (en) | Plasmid cloned icosapentaenoic acid biosynthesis genes and cyanobacteria producing icosapentaenoic acid | |
| CN110607311A (en) | DSF quorum sensing signal degradation gene and its application | |
| CN117210429A (en) | Histidine trimethylase EgtD mutant and application thereof | |
| KR102754213B1 (en) | Novel Enterobacter cloacae sp. producing lactobionic acid and method for production of lactobionic acid using the Same | |
| CN110551739A (en) | Pyrazolomycin biosynthesis gene cluster, recombinant bacterium and application thereof | |
| US20080261282A1 (en) | Fermentation Process for Preparing Coenzyme Q10 by the Recombinant Agrobacterium tumefaciens | |
| CN114891656A (en) | A kind of Bacillus subtilis and its application in producing adenosine by fermentation | |
| JP2014187886A (en) | METHOD FOR PRODUCING 1,3-βGALACTOSYL-N-ACETYLHEXOSAMINE PHOSPHORYLASE | |
| KR20110049740A (en) | Tacrolimus producing strains and tacrolimus biosynthetic genes obtained therefrom | |
| EA045694B1 (en) | METHODS AND COMPOSITIONS FOR TREATING AND CONTROL OF PLANT DISEASES |