RU2822039C2 - Sialyltransferases and use thereof in producing sialylated oligosaccharides - Google Patents
Sialyltransferases and use thereof in producing sialylated oligosaccharides Download PDFInfo
- Publication number
- RU2822039C2 RU2822039C2 RU2020105821A RU2020105821A RU2822039C2 RU 2822039 C2 RU2822039 C2 RU 2822039C2 RU 2020105821 A RU2020105821 A RU 2020105821A RU 2020105821 A RU2020105821 A RU 2020105821A RU 2822039 C2 RU2822039 C2 RU 2822039C2
- Authority
- RU
- Russia
- Prior art keywords
- leu
- ile
- lys
- glu
- ser
- Prior art date
Links
- 108090000141 Sialyltransferases Proteins 0.000 title claims abstract description 131
- 102000003838 Sialyltransferases Human genes 0.000 title claims abstract description 129
- 229920001542 oligosaccharide Polymers 0.000 title claims abstract description 89
- 150000002482 oligosaccharides Chemical class 0.000 title claims abstract description 89
- 238000000034 method Methods 0.000 claims abstract description 51
- 230000000694 effects Effects 0.000 claims abstract description 47
- 238000004519 manufacturing process Methods 0.000 claims abstract description 25
- DVGKRPYUFRZAQW-UHFFFAOYSA-N 3 prime Natural products CC(=O)NC1OC(CC(O)C1C(O)C(O)CO)(OC2C(O)C(CO)OC(OC3C(O)C(O)C(O)OC3CO)C2O)C(=O)O DVGKRPYUFRZAQW-UHFFFAOYSA-N 0.000 claims abstract description 14
- TYALNJQZQRNQNQ-JLYOMPFMSA-N alpha-Neup5Ac-(2->6)-beta-D-Galp-(1->4)-beta-D-Glcp Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)OC[C@@H]1[C@H](O)[C@H](O)[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)O[C@@H]2CO)O)O1 TYALNJQZQRNQNQ-JLYOMPFMSA-N 0.000 claims abstract description 11
- CILYIEBUXJIHCO-UHFFFAOYSA-N 102778-91-6 Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OC1C(O)C(OC2C(C(O)C(O)OC2CO)O)OC(CO)C1O CILYIEBUXJIHCO-UHFFFAOYSA-N 0.000 claims abstract description 8
- CILYIEBUXJIHCO-UITFWXMXSA-N N-acetyl-alpha-neuraminyl-(2->3)-beta-D-galactosyl-(1->4)-beta-D-glucose Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)O[C@@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)O[C@@H]2CO)O)O[C@H](CO)[C@@H]1O CILYIEBUXJIHCO-UITFWXMXSA-N 0.000 claims abstract description 8
- OIZGSVFYNBZVIK-UHFFFAOYSA-N N-acetylneuraminosyl-D-lactose Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OC1C(O)C(OC(C(O)CO)C(O)C(O)C=O)OC(CO)C1O OIZGSVFYNBZVIK-UHFFFAOYSA-N 0.000 claims abstract description 8
- TYALNJQZQRNQNQ-UHFFFAOYSA-N #alpha;2,6-sialyllactose Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OCC1C(O)C(O)C(O)C(OC2C(C(O)C(O)OC2CO)O)O1 TYALNJQZQRNQNQ-UHFFFAOYSA-N 0.000 claims abstract description 7
- FCIROHDMPFOSFG-LAVSNGQLSA-N disialyllacto-N-tetraose Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)OC[C@@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O[C@]3(O[C@H]([C@H](NC(C)=O)[C@@H](O)C3)[C@H](O)[C@H](O)CO)C(O)=O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](NC(C)=O)[C@H](O[C@@H]2[C@H]([C@H](O[C@H]3[C@@H]([C@@H](O)C(O)O[C@@H]3CO)O)O[C@H](CO)[C@@H]2O)O)O1 FCIROHDMPFOSFG-LAVSNGQLSA-N 0.000 claims abstract description 7
- QUOQJNYANJQSDA-MHQSSNGYSA-N Sialyllacto-N-tetraose a Chemical compound O1C([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)O[C@@H]1[C@@H](O)[C@H](OC2[C@H]([C@H](OC3[C@H]([C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]3O)O)O[C@H](CO)[C@H]2O)NC(C)=O)O[C@H](CO)[C@@H]1O QUOQJNYANJQSDA-MHQSSNGYSA-N 0.000 claims abstract description 5
- SFMRPVLZMVJKGZ-JRZQLMJNSA-N Sialyllacto-N-tetraose b Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)OC[C@@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](NC(C)=O)[C@H](O[C@@H]2[C@H]([C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]2O)O)O1 SFMRPVLZMVJKGZ-JRZQLMJNSA-N 0.000 claims abstract description 5
- SXMGGNXBTZBGLU-UHFFFAOYSA-N sialyllacto-n-tetraose c Chemical compound OCC1OC(OC2C(C(OC(C(O)CO)C(O)C(O)C=O)OC(CO)C2O)O)C(NC(=O)C)C(O)C1OC(C(C(O)C1O)O)OC1COC1(C(O)=O)CC(O)C(NC(C)=O)C(C(O)C(O)CO)O1 SXMGGNXBTZBGLU-UHFFFAOYSA-N 0.000 claims abstract description 3
- 108090000623 proteins and genes Proteins 0.000 claims description 67
- 230000014509 gene expression Effects 0.000 claims description 40
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 39
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 claims description 38
- 239000002773 nucleotide Substances 0.000 claims description 38
- 125000003729 nucleotide group Chemical group 0.000 claims description 38
- 239000008101 lactose Substances 0.000 claims description 35
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 35
- 229920001184 polypeptide Polymers 0.000 claims description 31
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 31
- 125000005629 sialic acid group Chemical group 0.000 claims description 31
- SQVRNKJHWKZAKO-UHFFFAOYSA-N beta-N-Acetyl-D-neuraminic acid Natural products CC(=O)NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO SQVRNKJHWKZAKO-UHFFFAOYSA-N 0.000 claims description 30
- 239000000758 substrate Substances 0.000 claims description 30
- AXQLFFDZXPOFPO-UHFFFAOYSA-N UNPD216 Natural products O1C(CO)C(O)C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(=O)C)C1OC(C1O)C(O)C(CO)OC1OC1C(O)C(O)C(O)OC1CO AXQLFFDZXPOFPO-UHFFFAOYSA-N 0.000 claims description 27
- RJTOFDPWCJDYFZ-SPVZFZGWSA-N Lacto-N-triaose Chemical compound CC(=O)N[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]1O RJTOFDPWCJDYFZ-SPVZFZGWSA-N 0.000 claims description 26
- AXQLFFDZXPOFPO-UNTPKZLMSA-N beta-D-Galp-(1->3)-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-beta-D-Glcp Chemical compound O([C@@H]1O[C@H](CO)[C@H](O)[C@@H]([C@H]1O)O[C@H]1[C@@H]([C@H]([C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O1)O)NC(=O)C)[C@H]1[C@H](O)[C@@H](O)[C@H](O)O[C@@H]1CO AXQLFFDZXPOFPO-UNTPKZLMSA-N 0.000 claims description 26
- USIPEGYTBGEPJN-UHFFFAOYSA-N lacto-N-tetraose Natural products O1C(CO)C(O)C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(=O)C)C1OC1C(O)C(CO)OC(OC(C(O)CO)C(O)C(O)C=O)C1O USIPEGYTBGEPJN-UHFFFAOYSA-N 0.000 claims description 26
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 claims description 21
- 108020004707 nucleic acids Proteins 0.000 claims description 21
- 102000039446 nucleic acids Human genes 0.000 claims description 21
- 150000007523 nucleic acids Chemical class 0.000 claims description 21
- 150000001413 amino acids Chemical group 0.000 claims description 17
- 230000002255 enzymatic effect Effects 0.000 claims description 15
- 238000000855 fermentation Methods 0.000 claims description 14
- 230000004151 fermentation Effects 0.000 claims description 14
- SQVRNKJHWKZAKO-OQPLDHBCSA-N sialic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)OC1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-OQPLDHBCSA-N 0.000 claims description 14
- TXCIAUNLDRJGJZ-UHFFFAOYSA-N CMP-N-acetyl neuraminic acid Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(N=C(N)C=C2)=O)O1 TXCIAUNLDRJGJZ-UHFFFAOYSA-N 0.000 claims description 13
- TXCIAUNLDRJGJZ-BILDWYJOSA-N CMP-N-acetyl-beta-neuraminic acid Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@]1(C(O)=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(N=C(N)C=C2)=O)O1 TXCIAUNLDRJGJZ-BILDWYJOSA-N 0.000 claims description 13
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 claims description 10
- 238000013518 transcription Methods 0.000 claims description 10
- 230000035897 transcription Effects 0.000 claims description 10
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acelyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 claims description 8
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 claims description 8
- 229930182830 galactose Natural products 0.000 claims description 8
- 238000012239 gene modification Methods 0.000 claims description 8
- 230000005017 genetic modification Effects 0.000 claims description 8
- 235000013617 genetically modified food Nutrition 0.000 claims description 8
- 229950006780 n-acetylglucosamine Drugs 0.000 claims description 8
- 235000000346 sugar Nutrition 0.000 claims description 8
- 238000012546 transfer Methods 0.000 claims description 8
- 102100041034 Glucosamine-6-phosphate isomerase 1 Human genes 0.000 claims description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 claims description 6
- 108010069483 N-acetylglucosamine-6-phosphate deacetylase Proteins 0.000 claims description 6
- 102100033341 N-acetylmannosamine kinase Human genes 0.000 claims description 6
- 108010010750 N-acetylmannosamine-6-phosphate epimerase Proteins 0.000 claims description 6
- 108010029147 N-acylmannosamine kinase Proteins 0.000 claims description 6
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 claims description 6
- 108010022717 glucosamine-6-phosphate isomerase Proteins 0.000 claims description 6
- 102100026189 Beta-galactosidase Human genes 0.000 claims description 5
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 5
- 108700023372 Glycosyltransferases Proteins 0.000 claims description 5
- 108090000301 Membrane transport proteins Proteins 0.000 claims description 5
- 108700023220 N-acetylneuraminate lyases Proteins 0.000 claims description 5
- 102000048245 N-acetylneuraminate lyases Human genes 0.000 claims description 5
- 108010005774 beta-Galactosidase Proteins 0.000 claims description 5
- 229910052799 carbon Inorganic materials 0.000 claims description 5
- IEQCXFNWPAHHQR-UHFFFAOYSA-N lacto-N-neotetraose Natural products OCC1OC(OC2C(C(OC3C(OC(O)C(O)C3O)CO)OC(CO)C2O)O)C(NC(=O)C)C(O)C1OC1OC(CO)C(O)C(O)C1O IEQCXFNWPAHHQR-UHFFFAOYSA-N 0.000 claims description 5
- 108010060845 lactose permease Proteins 0.000 claims description 5
- 150000008163 sugars Chemical class 0.000 claims description 5
- GSXOAOHZAIYLCY-UHFFFAOYSA-N D-F6P Natural products OCC(=O)C(O)C(O)C(O)COP(O)(O)=O GSXOAOHZAIYLCY-UHFFFAOYSA-N 0.000 claims description 4
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 claims description 4
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 claims description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 4
- SHZGCJCMOBCMKK-DHVFOXMCSA-N L-fucopyranose Chemical compound C[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O SHZGCJCMOBCMKK-DHVFOXMCSA-N 0.000 claims description 4
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 claims description 4
- 229930182816 L-glutamine Natural products 0.000 claims description 4
- 108091006161 SLC17A5 Proteins 0.000 claims description 4
- LFTYTUAZOPRMMI-CFRASDGPSA-N UDP-N-acetyl-alpha-D-glucosamine Chemical compound O1[C@H](CO)[C@@H](O)[C@H](O)[C@@H](NC(=O)C)[C@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-CFRASDGPSA-N 0.000 claims description 4
- 108010061048 UDPacetylglucosamine pyrophosphorylase Proteins 0.000 claims description 4
- LFTYTUAZOPRMMI-UHFFFAOYSA-N UNPD164450 Natural products O1C(CO)C(O)C(O)C(NC(=O)C)C1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-UHFFFAOYSA-N 0.000 claims description 4
- 230000000295 complement effect Effects 0.000 claims description 4
- 108010084034 glucosamine-1-phosphate acetyltransferase Proteins 0.000 claims description 4
- 239000008103 glucose Substances 0.000 claims description 4
- 229940062780 lacto-n-neotetraose Drugs 0.000 claims description 4
- 108010032867 phosphoglucosamine mutase Proteins 0.000 claims description 4
- 108091000115 phosphomannomutase Proteins 0.000 claims description 4
- 239000011541 reaction mixture Substances 0.000 claims description 4
- 238000013519 translation Methods 0.000 claims description 4
- 108010043841 Glucosamine 6-Phosphate N-Acetyltransferase Proteins 0.000 claims description 3
- 102000002740 Glucosamine 6-Phosphate N-Acetyltransferase Human genes 0.000 claims description 3
- 229930006000 Sucrose Natural products 0.000 claims description 3
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 claims description 3
- 102000003929 Transaminases Human genes 0.000 claims description 3
- 108090000340 Transaminases Proteins 0.000 claims description 3
- 101710091363 UDP-N-acetylglucosamine 2-epimerase Proteins 0.000 claims description 3
- 239000005720 sucrose Substances 0.000 claims description 3
- 108010062110 water dikinase pyruvate Proteins 0.000 claims description 3
- 229930091371 Fructose Natural products 0.000 claims description 2
- 239000005715 Fructose Substances 0.000 claims description 2
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 claims description 2
- 102100024515 GDP-L-fucose synthase Human genes 0.000 claims description 2
- 108030006298 GDP-L-fucose synthases Proteins 0.000 claims description 2
- 108010062427 GDP-mannose 4,6-dehydratase Proteins 0.000 claims description 2
- 102000002312 GDPmannose 4,6-dehydratase Human genes 0.000 claims description 2
- 102100036291 Galactose-1-phosphate uridylyltransferase Human genes 0.000 claims description 2
- 101710090046 Galactose-1-phosphate uridylyltransferase Proteins 0.000 claims description 2
- 102100021700 Glycoprotein-N-acetylgalactosamine 3-beta-galactosyltransferase 1 Human genes 0.000 claims description 2
- 102000051366 Glycosyltransferases Human genes 0.000 claims description 2
- 101000896564 Homo sapiens Glycoprotein-N-acetylgalactosamine 3-beta-galactosyltransferase 1 Proteins 0.000 claims description 2
- 108010046068 N-Acetyllactosamine Synthase Proteins 0.000 claims description 2
- 108010081778 N-acylneuraminate cytidylyltransferase Proteins 0.000 claims description 2
- 102000009569 Phosphoglucomutase Human genes 0.000 claims description 2
- 102000030605 Phosphomannomutase Human genes 0.000 claims description 2
- 101000693115 Sulfurisphaera tokodaii (strain DSM 16993 / JCM 10545 / NBRC 100140 / 7) Sugar-1-phosphate acetyltransferase Proteins 0.000 claims description 2
- 108010075202 UDP-glucose 4-epimerase Proteins 0.000 claims description 2
- 102100021436 UDP-glucose 4-epimerase Human genes 0.000 claims description 2
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 claims description 2
- HXXFSFRBOHSIMQ-RWOPYEJCSA-L alpha-D-mannose 1-phosphate(2-) Chemical compound OC[C@H]1O[C@H](OP([O-])([O-])=O)[C@@H](O)[C@@H](O)[C@@H]1O HXXFSFRBOHSIMQ-RWOPYEJCSA-L 0.000 claims description 2
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 claims description 2
- PTVXQARCLQPGIR-SXUWKVJYSA-N beta-L-fucose 1-phosphate Chemical compound C[C@@H]1O[C@H](OP(O)(O)=O)[C@@H](O)[C@H](O)[C@@H]1O PTVXQARCLQPGIR-SXUWKVJYSA-N 0.000 claims description 2
- RBMYDHMFFAVMMM-PLQWBNBWSA-N neolactotetraose Chemical compound O([C@H]1[C@H](O)[C@H]([C@@H](O[C@@H]1CO)O[C@@H]1[C@H]([C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]1O)O)NC(=O)C)[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O RBMYDHMFFAVMMM-PLQWBNBWSA-N 0.000 claims 3
- 102100031317 Alpha-N-acetylgalactosaminidase Human genes 0.000 claims 2
- BGWGXPAPYGQALX-VRPWFDPXSA-N D-fructofuranose 6-phosphate Chemical compound OCC1(O)O[C@H](COP(O)(O)=O)[C@@H](O)[C@@H]1O BGWGXPAPYGQALX-VRPWFDPXSA-N 0.000 claims 1
- LQEBEXMHBLQMDB-UHFFFAOYSA-N GDP-L-fucose Natural products OC1C(O)C(O)C(C)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C3=C(C(N=C(N)N3)=O)N=C2)O1 LQEBEXMHBLQMDB-UHFFFAOYSA-N 0.000 claims 1
- LQEBEXMHBLQMDB-JGQUBWHWSA-N GDP-beta-L-fucose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C3=C(C(NC(N)=N3)=O)N=C2)O1 LQEBEXMHBLQMDB-JGQUBWHWSA-N 0.000 claims 1
- HSCJRCZFDFQWRP-ABVWGUQPSA-N UDP-alpha-D-galactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-ABVWGUQPSA-N 0.000 claims 1
- AXQLFFDZXPOFPO-FSGZUBPKSA-N beta-D-Gal-(1->3)-beta-D-GlcNAc-(1->3)-beta-D-Gal-(1->4)-D-Glc Chemical compound O([C@@H]1O[C@H](CO)[C@H](O)[C@@H]([C@H]1O)O[C@H]1[C@@H]([C@H]([C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O1)O)NC(=O)C)[C@H]1[C@H](O)[C@@H](O)C(O)O[C@@H]1CO AXQLFFDZXPOFPO-FSGZUBPKSA-N 0.000 claims 1
- 238000012258 culturing Methods 0.000 claims 1
- 239000000126 substance Substances 0.000 abstract description 4
- 108020004414 DNA Proteins 0.000 description 161
- 210000004027 cell Anatomy 0.000 description 94
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 34
- 235000020256 human milk Nutrition 0.000 description 31
- 210000004251 human milk Anatomy 0.000 description 31
- SQVRNKJHWKZAKO-PFQGKNLYSA-N N-acetyl-beta-neuraminic acid Chemical group CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)O[C@H]1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-PFQGKNLYSA-N 0.000 description 26
- 108010034529 leucyl-lysine Proteins 0.000 description 24
- 239000012634 fragment Substances 0.000 description 23
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 22
- 108010009298 lysylglutamic acid Proteins 0.000 description 22
- 108010050848 glycylleucine Proteins 0.000 description 21
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 20
- 108010012581 phenylalanylglutamate Proteins 0.000 description 19
- 230000015572 biosynthetic process Effects 0.000 description 18
- 150000002500 ions Chemical class 0.000 description 18
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 17
- 108010057821 leucylproline Proteins 0.000 description 17
- 108010051110 tyrosyl-lysine Proteins 0.000 description 17
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 15
- 108010089804 glycyl-threonine Proteins 0.000 description 15
- 108010054155 lysyllysine Proteins 0.000 description 15
- 241000588724 Escherichia coli Species 0.000 description 14
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 14
- 108010068265 aspartyltyrosine Proteins 0.000 description 14
- 108010026333 seryl-proline Proteins 0.000 description 14
- 108010061238 threonyl-glycine Proteins 0.000 description 14
- 108010005233 alanylglutamic acid Proteins 0.000 description 13
- 108010092854 aspartyllysine Proteins 0.000 description 13
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 13
- 108010073969 valyllysine Proteins 0.000 description 13
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 12
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 12
- 108010003700 lysyl aspartic acid Proteins 0.000 description 12
- 108010064235 lysylglycine Proteins 0.000 description 12
- 239000000203 mixture Substances 0.000 description 12
- 102000004190 Enzymes Human genes 0.000 description 11
- 108090000790 Enzymes Proteins 0.000 description 11
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 11
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 11
- 108010085325 histidylproline Proteins 0.000 description 11
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 11
- 108010051242 phenylalanylserine Proteins 0.000 description 11
- 239000013612 plasmid Substances 0.000 description 11
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 10
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 10
- 108010038633 aspartylglutamate Proteins 0.000 description 10
- 108010092114 histidylphenylalanine Proteins 0.000 description 10
- 235000016709 nutrition Nutrition 0.000 description 10
- 102000004169 proteins and genes Human genes 0.000 description 10
- 238000003786 synthesis reaction Methods 0.000 description 10
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 9
- 241000589875 Campylobacter jejuni Species 0.000 description 9
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 9
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 9
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 9
- 108010077245 asparaginyl-proline Proteins 0.000 description 9
- 108010047857 aspartylglycine Proteins 0.000 description 9
- 230000001580 bacterial effect Effects 0.000 description 9
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 9
- 108010025306 histidylleucine Proteins 0.000 description 9
- 108010015796 prolylisoleucine Proteins 0.000 description 9
- 108010020532 tyrosyl-proline Proteins 0.000 description 9
- 108010003137 tyrosyltyrosine Proteins 0.000 description 9
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 8
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 8
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 8
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 8
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 8
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 8
- 108010020764 Transposases Proteins 0.000 description 8
- 102000008579 Transposases Human genes 0.000 description 8
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 8
- 230000010354 integration Effects 0.000 description 8
- 108010038320 lysylphenylalanine Proteins 0.000 description 8
- 108010017391 lysylvaline Proteins 0.000 description 8
- OIZGSVFYNBZVIK-FHHHURIISA-N 3'-sialyllactose Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)O[C@@H]1[C@@H](O)[C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]1O OIZGSVFYNBZVIK-FHHHURIISA-N 0.000 description 7
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 7
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 7
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 7
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 7
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 7
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 7
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 7
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 7
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 7
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 7
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 7
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 7
- 108010087924 alanylproline Proteins 0.000 description 7
- 108010013835 arginine glutamate Proteins 0.000 description 7
- 108010008355 arginyl-glutamine Proteins 0.000 description 7
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- 238000011534 incubation Methods 0.000 description 7
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 7
- 108010031719 prolyl-serine Proteins 0.000 description 7
- 238000004809 thin layer chromatography Methods 0.000 description 7
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 6
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 6
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 6
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 6
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 6
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 6
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 6
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 6
- OVRNDRQMDRJTHS-CBQIKETKSA-N N-Acetyl-D-Galactosamine Chemical compound CC(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@H](O)[C@@H]1O OVRNDRQMDRJTHS-CBQIKETKSA-N 0.000 description 6
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 6
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 6
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 6
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 6
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 6
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 6
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 6
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 6
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 6
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 6
- 108010044940 alanylglutamine Proteins 0.000 description 6
- 108010047495 alanylglycine Proteins 0.000 description 6
- 238000005119 centrifugation Methods 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 6
- 235000013350 formula milk Nutrition 0.000 description 6
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 6
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 6
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 6
- 108010012058 leucyltyrosine Proteins 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 150000002772 monosaccharides Chemical group 0.000 description 6
- 101150019075 neuA gene Proteins 0.000 description 6
- 108010084572 phenylalanyl-valine Proteins 0.000 description 6
- 239000002243 precursor Substances 0.000 description 6
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 6
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 6
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 5
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 5
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 5
- 101100245749 Campylobacter jejuni subsp. jejuni serotype O:23/36 (strain 81-176) pseF gene Proteins 0.000 description 5
- 108020004705 Codon Proteins 0.000 description 5
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 5
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 5
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 5
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 5
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 5
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 5
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 5
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 5
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 5
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 5
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 5
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 5
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 5
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 5
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 5
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 5
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 5
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 5
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 5
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 5
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 5
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 5
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 5
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 5
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 5
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 5
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 5
- 108010066427 N-valyltryptophan Proteins 0.000 description 5
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 5
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 5
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 5
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 5
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 5
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 5
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 5
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 5
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 5
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 5
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 5
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 5
- 108010070944 alanylhistidine Proteins 0.000 description 5
- 108010064886 beta-D-galactoside alpha 2-6-sialyltransferase Proteins 0.000 description 5
- 108010016616 cysteinylglycine Proteins 0.000 description 5
- 238000011161 development Methods 0.000 description 5
- 230000018109 developmental process Effects 0.000 description 5
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 5
- 108010081551 glycylphenylalanine Proteins 0.000 description 5
- 108010037850 glycylvaline Proteins 0.000 description 5
- 108010018006 histidylserine Proteins 0.000 description 5
- 238000000338 in vitro Methods 0.000 description 5
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 5
- 238000004949 mass spectrometry Methods 0.000 description 5
- 108010056582 methionylglutamic acid Proteins 0.000 description 5
- 230000002018 overexpression Effects 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 108010090894 prolylleucine Proteins 0.000 description 5
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 5
- 150000004044 tetrasaccharides Chemical class 0.000 description 5
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 5
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 4
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 4
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 4
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 4
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 4
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 4
- 241000099223 Alistipes sp. Species 0.000 description 4
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 4
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 4
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 4
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 4
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 4
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 4
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 4
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 4
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 4
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 4
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 4
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 4
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 4
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 4
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 4
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 4
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 4
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 4
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 4
- CMBXOSFZCFGDLE-IHRRRGAJSA-N Gln-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O CMBXOSFZCFGDLE-IHRRRGAJSA-N 0.000 description 4
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 4
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 4
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 4
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 4
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 4
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 4
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 4
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 4
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 4
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 4
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 4
- 241000606831 Histophilus somni Species 0.000 description 4
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 4
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 4
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 4
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 4
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 4
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 4
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 4
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 4
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 4
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 4
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 4
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 4
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 4
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 4
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 4
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 4
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 4
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 4
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 4
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 4
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 4
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 4
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 4
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 4
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 4
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 4
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 4
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 4
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 4
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 4
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 4
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 4
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 4
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 4
- ILKCLLLOGPDNIP-RCWTZXSCSA-N Met-Met-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ILKCLLLOGPDNIP-RCWTZXSCSA-N 0.000 description 4
- MBLBDJOUHNCFQT-UHFFFAOYSA-N N-acetyl-D-galactosamine Natural products CC(=O)NC(C=O)C(O)C(O)C(O)CO MBLBDJOUHNCFQT-UHFFFAOYSA-N 0.000 description 4
- 102100031324 N-acetylglucosamine-6-phosphate deacetylase Human genes 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 241000606856 Pasteurella multocida Species 0.000 description 4
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 4
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 4
- 241001517016 Photobacterium damselae Species 0.000 description 4
- 241000493790 Photobacterium leiognathi Species 0.000 description 4
- 241000607606 Photobacterium sp. Species 0.000 description 4
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 4
- 108010076504 Protein Sorting Signals Proteins 0.000 description 4
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 4
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 4
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 4
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 4
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 4
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 4
- 239000004098 Tetracycline Substances 0.000 description 4
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 4
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 4
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 4
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 4
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 4
- LRHBBGDMBLFYGL-FHWLQOOXSA-N Tyr-Phe-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LRHBBGDMBLFYGL-FHWLQOOXSA-N 0.000 description 4
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 4
- SNFSYLYCDAVZGP-UHFFFAOYSA-N UNPD26986 Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(OC(O)C(O)C2O)CO)OC(CO)C(O)C1O SNFSYLYCDAVZGP-UHFFFAOYSA-N 0.000 description 4
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 4
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 4
- 241000606834 [Haemophilus] ducreyi Species 0.000 description 4
- 150000001408 amides Chemical class 0.000 description 4
- 229960000723 ampicillin Drugs 0.000 description 4
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- 239000011324 bead Substances 0.000 description 4
- 238000012512 characterization method Methods 0.000 description 4
- 238000012217 deletion Methods 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 239000000284 extract Substances 0.000 description 4
- 239000011521 glass Substances 0.000 description 4
- 101150117187 glmS gene Proteins 0.000 description 4
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 238000000099 in vitro assay Methods 0.000 description 4
- 238000001727 in vivo Methods 0.000 description 4
- 108010000761 leucylarginine Proteins 0.000 description 4
- 108010091871 leucylmethionine Proteins 0.000 description 4
- 230000000813 microbial effect Effects 0.000 description 4
- 238000002552 multiple reaction monitoring Methods 0.000 description 4
- 101150048598 nanT gene Proteins 0.000 description 4
- 229940051027 pasteurella multocida Drugs 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- 230000009450 sialylation Effects 0.000 description 4
- 241000894007 species Species 0.000 description 4
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 4
- 229960002180 tetracycline Drugs 0.000 description 4
- 229930101283 tetracycline Natural products 0.000 description 4
- 235000019364 tetracycline Nutrition 0.000 description 4
- 150000003522 tetracyclines Chemical class 0.000 description 4
- MGSRCZKZVOBKFT-UHFFFAOYSA-N thymol Chemical compound CC(C)C1=CC=C(C)C=C1O MGSRCZKZVOBKFT-UHFFFAOYSA-N 0.000 description 4
- 230000017105 transposition Effects 0.000 description 4
- 239000003643 water by type Substances 0.000 description 4
- SNFSYLYCDAVZGP-OLAZETNGSA-N 2'-fucosyllactose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@@H]2[C@H](OC(O)[C@H](O)[C@H]2O)CO)O[C@H](CO)[C@H](O)[C@@H]1O SNFSYLYCDAVZGP-OLAZETNGSA-N 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 3
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 3
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 3
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 3
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 3
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 3
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 3
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 3
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 3
- NAARDJBSSPUGCF-FXQIFTODSA-N Arg-Cys-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N NAARDJBSSPUGCF-FXQIFTODSA-N 0.000 description 3
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 3
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 3
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 3
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 3
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 3
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 3
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 3
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 3
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 3
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 3
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 3
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 3
- KMCRKVOLRCOMBG-DJFWLOJKSA-N Asn-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KMCRKVOLRCOMBG-DJFWLOJKSA-N 0.000 description 3
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 3
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 3
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 3
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 3
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 3
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 3
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 3
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 3
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 3
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 3
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 3
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 3
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 3
- VWWAFGHMPWBKEP-GMOBBJLQSA-N Asp-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)N VWWAFGHMPWBKEP-GMOBBJLQSA-N 0.000 description 3
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 3
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 3
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 3
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 3
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 3
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 3
- 101100025757 Escherichia coli O6:H1 (strain CFT073 / ATCC 700928 / UPEC) nanT1 gene Proteins 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 3
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 3
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 3
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 3
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 3
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 3
- ZVQZXPADLZIQFF-FHWLQOOXSA-N Gln-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 ZVQZXPADLZIQFF-FHWLQOOXSA-N 0.000 description 3
- GJLXZITZLUUXMJ-NHCYSSNCSA-N Gln-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GJLXZITZLUUXMJ-NHCYSSNCSA-N 0.000 description 3
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 3
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 3
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 3
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 3
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 3
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 3
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 3
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 3
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 3
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 3
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 3
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 3
- ARIORLIIMJACKZ-KKUMJFAQSA-N Glu-Pro-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ARIORLIIMJACKZ-KKUMJFAQSA-N 0.000 description 3
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 3
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 3
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 3
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 3
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 3
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 3
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 3
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 3
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 3
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 3
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 3
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 3
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 3
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 3
- JSQIXEHORHLQEE-MEYUZBJRSA-N His-Phe-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JSQIXEHORHLQEE-MEYUZBJRSA-N 0.000 description 3
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 3
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 3
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 3
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 3
- HIJIJPFILYPTFR-ACRUOGEOSA-N His-Tyr-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HIJIJPFILYPTFR-ACRUOGEOSA-N 0.000 description 3
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 3
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 3
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 3
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 3
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 3
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 3
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 3
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 3
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 3
- IDMNOFVUXYYZPF-DKIMLUQUSA-N Ile-Lys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IDMNOFVUXYYZPF-DKIMLUQUSA-N 0.000 description 3
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 3
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 3
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 3
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 3
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 3
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 3
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 3
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 3
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 description 3
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 3
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 3
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 3
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 3
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 3
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 3
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 3
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 3
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 3
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 3
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 3
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 3
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 3
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 3
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 3
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 3
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 3
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 3
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 3
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 3
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 3
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 3
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 3
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 3
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 3
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 3
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 3
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 3
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 3
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 3
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 3
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 3
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 3
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 3
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 3
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 3
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 3
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 3
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 3
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 3
- CKSBRMUOQDNPKZ-SRVKXCTJSA-N Lys-Gln-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CKSBRMUOQDNPKZ-SRVKXCTJSA-N 0.000 description 3
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 3
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 3
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 3
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 3
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 3
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 3
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 3
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 3
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 3
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 3
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 3
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 3
- HONVOXINDBETTI-KKUMJFAQSA-N Lys-Tyr-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 HONVOXINDBETTI-KKUMJFAQSA-N 0.000 description 3
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 3
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 3
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 3
- 102000003939 Membrane transport proteins Human genes 0.000 description 3
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 3
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 3
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 3
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 3
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 3
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 3
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 3
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 3
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 3
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 3
- FRMKIPSIZSFTTE-HJOGWXRNSA-N Phe-Tyr-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FRMKIPSIZSFTTE-HJOGWXRNSA-N 0.000 description 3
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 3
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 3
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 3
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 3
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 3
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 3
- 108010003201 RGH 0205 Proteins 0.000 description 3
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 3
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 3
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 3
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 3
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 3
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 3
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 3
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 3
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 3
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 3
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 3
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 3
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 3
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 3
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 3
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 3
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 3
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 3
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 3
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 3
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 3
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 3
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 3
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 3
- ABCLYRRGTZNIFU-BWAGICSOSA-N Thr-Tyr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O ABCLYRRGTZNIFU-BWAGICSOSA-N 0.000 description 3
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 3
- OCCYDHCUKXRPSJ-SXNHZJKMSA-N Trp-Ile-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OCCYDHCUKXRPSJ-SXNHZJKMSA-N 0.000 description 3
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 3
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 3
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 3
- IUQDEKCCHWRHRW-IHPCNDPISA-N Tyr-Asn-Trp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IUQDEKCCHWRHRW-IHPCNDPISA-N 0.000 description 3
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 3
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 3
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 3
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 3
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 3
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 3
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 3
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 3
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 3
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 3
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 3
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 3
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 3
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 3
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 3
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 3
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 3
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 3
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 3
- 230000002378 acidificating effect Effects 0.000 description 3
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 3
- 150000001720 carbohydrates Chemical class 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 102000045442 glycosyltransferase activity proteins Human genes 0.000 description 3
- 108700014210 glycosyltransferase activity proteins Proteins 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 3
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- 108010040030 histidinoalanine Proteins 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- GSXOAOHZAIYLCY-HSUXUTPPSA-N keto-D-fructose 6-phosphate Chemical compound OCC(=O)[C@@H](O)[C@H](O)[C@H](O)COP(O)(O)=O GSXOAOHZAIYLCY-HSUXUTPPSA-N 0.000 description 3
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 3
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 3
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 description 3
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 3
- 238000012269 metabolic engineering Methods 0.000 description 3
- 108010085203 methionylmethionine Proteins 0.000 description 3
- 108010068488 methionylphenylalanine Proteins 0.000 description 3
- 230000007935 neutral effect Effects 0.000 description 3
- 230000008520 organization Effects 0.000 description 3
- 108010004914 prolylarginine Proteins 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 230000001681 protective effect Effects 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 108010015840 seryl-prolyl-lysyl-lysine Proteins 0.000 description 3
- 210000000130 stem cell Anatomy 0.000 description 3
- 150000004043 trisaccharides Chemical class 0.000 description 3
- 108010029384 tryptophyl-histidine Proteins 0.000 description 3
- 210000005253 yeast cell Anatomy 0.000 description 3
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 2
- PAHHYDSPOXDASW-VGWMRTNUSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-1-[(2s)-2-amino-3-hydroxypropanoyl]pyrrolidine-2-carbonyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO PAHHYDSPOXDASW-VGWMRTNUSA-N 0.000 description 2
- 241000606730 Actinobacillus capsulatus Species 0.000 description 2
- 241000606731 Actinobacillus suis Species 0.000 description 2
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 2
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 2
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 2
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 2
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 2
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 2
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- 241000030716 Alistipes shahii Species 0.000 description 2
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 2
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 2
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 2
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 2
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 2
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 2
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 2
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 2
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 2
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 2
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 2
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 2
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 2
- XKRFYHLGVUSROY-UHFFFAOYSA-N Argon Chemical compound [Ar] XKRFYHLGVUSROY-UHFFFAOYSA-N 0.000 description 2
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 2
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 2
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 2
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 2
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 2
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 2
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 2
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 2
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 2
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 2
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 2
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 2
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 2
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 2
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 2
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 2
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 2
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 2
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 2
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 2
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 2
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 2
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 2
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 2
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 2
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 2
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 2
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 2
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 2
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 2
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 2
- ATHZHGQSAIJHQU-XIRDDKMYSA-N Asn-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ATHZHGQSAIJHQU-XIRDDKMYSA-N 0.000 description 2
- KSZHWTRZPOTIGY-AVGNSLFASA-N Asn-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KSZHWTRZPOTIGY-AVGNSLFASA-N 0.000 description 2
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 2
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 2
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 2
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 2
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 2
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 2
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 2
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- RWHHSFSWKFBTCF-KKUMJFAQSA-N Asp-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N RWHHSFSWKFBTCF-KKUMJFAQSA-N 0.000 description 2
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 2
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- QNIACYURSSCLRP-GUBZILKMSA-N Asp-Lys-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O QNIACYURSSCLRP-GUBZILKMSA-N 0.000 description 2
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 2
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 2
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 2
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 2
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 2
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 2
- CXEFNHOVIIDHFU-IHPCNDPISA-N Asp-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N CXEFNHOVIIDHFU-IHPCNDPISA-N 0.000 description 2
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 2
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 2
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 2
- 241000606767 Avibacterium paragallinarum Species 0.000 description 2
- 241000218561 Bibersteinia trehalosi Species 0.000 description 2
- 241000589877 Campylobacter coli Species 0.000 description 2
- UISYPAHPLXGLNH-ACZMJKKPSA-N Cys-Asn-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UISYPAHPLXGLNH-ACZMJKKPSA-N 0.000 description 2
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 2
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 2
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 2
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 2
- 241000194033 Enterococcus Species 0.000 description 2
- QGWNDRXFNXRZMB-UUOKFMHZSA-N GDP Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O QGWNDRXFNXRZMB-UUOKFMHZSA-N 0.000 description 2
- 229930182566 Gentamicin Natural products 0.000 description 2
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 2
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 2
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 2
- TWHDOEYLXXQYOZ-FXQIFTODSA-N Gln-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N TWHDOEYLXXQYOZ-FXQIFTODSA-N 0.000 description 2
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 2
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 2
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 2
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 2
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 2
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 2
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 2
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 2
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 2
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 2
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 2
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 2
- WEAVZFWWIPIANL-SRVKXCTJSA-N Gln-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N WEAVZFWWIPIANL-SRVKXCTJSA-N 0.000 description 2
- ROHVCXBMIAAASL-HJGDQZAQSA-N Gln-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)N)N)O ROHVCXBMIAAASL-HJGDQZAQSA-N 0.000 description 2
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 2
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 2
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 2
- XIYWAJQIWLXXAF-XKBZYTNZSA-N Gln-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XIYWAJQIWLXXAF-XKBZYTNZSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 2
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 2
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 2
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 2
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 2
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 2
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 2
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 2
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 2
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 2
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 2
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 2
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 2
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 2
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 2
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 2
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 2
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 2
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 2
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 2
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 2
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 2
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 2
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 2
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 2
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 2
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 2
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 2
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 2
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 2
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 2
- KXRORHJIRAOQPG-SOUVJXGZSA-N Glu-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KXRORHJIRAOQPG-SOUVJXGZSA-N 0.000 description 2
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 2
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 2
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 2
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 2
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 2
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 2
- YZPVGIVFMZLQMM-YUMQZZPRSA-N Gly-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN YZPVGIVFMZLQMM-YUMQZZPRSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 2
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 2
- YNIMVVJTPWCUJH-KBPBESRZSA-N Gly-His-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YNIMVVJTPWCUJH-KBPBESRZSA-N 0.000 description 2
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 2
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 2
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 2
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 2
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 2
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 2
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 2
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- 241000606822 Haemophilus parahaemolyticus Species 0.000 description 2
- VIVSWEBJUHXCDS-DCAQKATOSA-N His-Asn-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O VIVSWEBJUHXCDS-DCAQKATOSA-N 0.000 description 2
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 2
- BDHUXUFYNUOUIT-SRVKXCTJSA-N His-Asp-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BDHUXUFYNUOUIT-SRVKXCTJSA-N 0.000 description 2
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 2
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 2
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 2
- GUXQAPACZVVOKX-AVGNSLFASA-N His-Lys-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GUXQAPACZVVOKX-AVGNSLFASA-N 0.000 description 2
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 2
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 2
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 2
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 2
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 2
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 2
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 2
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 2
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 2
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 2
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 2
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 2
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 2
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 2
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 2
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 2
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 2
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 2
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 2
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 2
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 2
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 2
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 2
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 2
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 2
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 2
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 2
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 2
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 2
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 2
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 2
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 2
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 2
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 2
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 2
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 2
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 2
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 2
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 2
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- 241000186660 Lactobacillus Species 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 2
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 2
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 2
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 2
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 2
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 2
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 2
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 2
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 2
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 2
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 2
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 2
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 2
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 2
- 108010071324 Livagen Proteins 0.000 description 2
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 2
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 2
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 2
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 2
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 2
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 2
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 2
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 2
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 2
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 2
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 2
- CRNNMTHBMRFQNG-GUBZILKMSA-N Lys-Glu-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N CRNNMTHBMRFQNG-GUBZILKMSA-N 0.000 description 2
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 2
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 2
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 2
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 2
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 2
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 2
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 2
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 2
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 2
- 108010003266 Lys-Leu-Tyr-Asp Proteins 0.000 description 2
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 2
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 2
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 2
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 2
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 2
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 2
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 2
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 2
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- RPEPZINUYHUBKG-FXQIFTODSA-N Met-Cys-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O RPEPZINUYHUBKG-FXQIFTODSA-N 0.000 description 2
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 2
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 2
- STLBOMUOQNIALW-BQBZGAKWSA-N Met-Gly-Cys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O STLBOMUOQNIALW-BQBZGAKWSA-N 0.000 description 2
- XPCLRYNQMZOOFB-ULQDDVLXSA-N Met-His-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N XPCLRYNQMZOOFB-ULQDDVLXSA-N 0.000 description 2
- PZUUMQPMHBJJKE-AVGNSLFASA-N Met-Leu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N PZUUMQPMHBJJKE-AVGNSLFASA-N 0.000 description 2
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 2
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 2
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 2
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 2
- DJJBHQHOZLUBCN-WDSOQIARSA-N Met-Lys-Trp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DJJBHQHOZLUBCN-WDSOQIARSA-N 0.000 description 2
- KVNOBVKRBOYSIV-SZMVWBNQSA-N Met-Pro-Trp Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N KVNOBVKRBOYSIV-SZMVWBNQSA-N 0.000 description 2
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 2
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 2
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 2
- 108010035265 N-acetylneuraminate synthase Proteins 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 241000588650 Neisseria meningitidis Species 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 102100035593 POU domain, class 2, transcription factor 1 Human genes 0.000 description 2
- 101710084414 POU domain, class 2, transcription factor 1 Proteins 0.000 description 2
- 241000588912 Pantoea agglomerans Species 0.000 description 2
- 241000606594 Pasteurella dagmatis Species 0.000 description 2
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 2
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 2
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 2
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 2
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 2
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 2
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 2
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 2
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 2
- XXAOSEUPEMQJOF-KKUMJFAQSA-N Phe-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XXAOSEUPEMQJOF-KKUMJFAQSA-N 0.000 description 2
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 2
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 2
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 2
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 2
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 2
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 2
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 2
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 2
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 2
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 2
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 2
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 2
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 2
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 2
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 2
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 2
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 2
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 2
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 2
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 2
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 2
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 2
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 2
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 2
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 2
- 241000607565 Photobacterium phosphoreum Species 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 2
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 2
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 2
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 2
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 2
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 2
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 2
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 2
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 2
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 2
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 2
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 2
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 2
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 2
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 2
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 2
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 2
- 108010079005 RDV peptide Proteins 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 2
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 2
- WXWDPFVKQRVJBJ-CIUDSAMLSA-N Ser-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N WXWDPFVKQRVJBJ-CIUDSAMLSA-N 0.000 description 2
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 2
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 2
- WEQAYODCJHZSJZ-KKUMJFAQSA-N Ser-His-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 WEQAYODCJHZSJZ-KKUMJFAQSA-N 0.000 description 2
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- SRKMDKACHDVPMD-SRVKXCTJSA-N Ser-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N SRKMDKACHDVPMD-SRVKXCTJSA-N 0.000 description 2
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 2
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 2
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 2
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 2
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 2
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 2
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 2
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 2
- 102100029954 Sialic acid synthase Human genes 0.000 description 2
- 241000193985 Streptococcus agalactiae Species 0.000 description 2
- 241000009877 Streptococcus entericus Species 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 2
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 2
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 2
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 2
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 2
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 2
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 2
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 2
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 2
- JRAUIKJSEAKTGD-TUBUOCAGSA-N Thr-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N JRAUIKJSEAKTGD-TUBUOCAGSA-N 0.000 description 2
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 2
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 2
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 2
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 2
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 2
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 2
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 2
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 2
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 2
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 2
- 239000005844 Thymol Substances 0.000 description 2
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 2
- ZHZLQVLQBDBQCQ-WDSOQIARSA-N Trp-Lys-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ZHZLQVLQBDBQCQ-WDSOQIARSA-N 0.000 description 2
- LORJKYIPJIRIRT-BVSLBCMMSA-N Trp-Pro-Tyr Chemical compound C([C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 LORJKYIPJIRIRT-BVSLBCMMSA-N 0.000 description 2
- HIZDHWHVOLUGOX-BPUTZDHNSA-N Trp-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O HIZDHWHVOLUGOX-BPUTZDHNSA-N 0.000 description 2
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 2
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 2
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 2
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 2
- YLRLHDFMMWDYTK-KKUMJFAQSA-N Tyr-Cys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 YLRLHDFMMWDYTK-KKUMJFAQSA-N 0.000 description 2
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 2
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 2
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 2
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 2
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 2
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 2
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 2
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 2
- AVIQBBOOTZENLH-KKUMJFAQSA-N Tyr-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AVIQBBOOTZENLH-KKUMJFAQSA-N 0.000 description 2
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 2
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 2
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 2
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 2
- QMNWABHLJOHGDS-IHRRRGAJSA-N Tyr-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QMNWABHLJOHGDS-IHRRRGAJSA-N 0.000 description 2
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 2
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 2
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 2
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 2
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 2
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 2
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 2
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 2
- AKKYBQGHUAWPJR-MNSWYVGCSA-N Tyr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O AKKYBQGHUAWPJR-MNSWYVGCSA-N 0.000 description 2
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 2
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 2
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 2
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 2
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 2
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 2
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 2
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 2
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 2
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 2
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 2
- FBVUOEYVGNMRMD-NAKRPEOUSA-N Val-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N FBVUOEYVGNMRMD-NAKRPEOUSA-N 0.000 description 2
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 2
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 2
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 2
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 2
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 2
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 2
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 2
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 2
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 2
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 2
- 241000607618 Vibrio harveyi Species 0.000 description 2
- 241000607284 Vibrio sp. Species 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 239000012491 analyte Substances 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- IEQCXFNWPAHHQR-YKLSGRGUSA-N beta-D-Gal-(1->4)-beta-D-GlcNAc-(1->3)-beta-D-Gal-(1->4)-D-Glc Chemical compound O([C@H]1[C@H](O)[C@H]([C@@H](O[C@@H]1CO)O[C@@H]1[C@H]([C@H](O[C@@H]2[C@H](OC(O)[C@H](O)[C@H]2O)CO)O[C@H](CO)[C@@H]1O)O)NC(=O)C)[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O IEQCXFNWPAHHQR-YKLSGRGUSA-N 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 238000013375 chromatographic separation Methods 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 108091008053 gene clusters Proteins 0.000 description 2
- 229960002518 gentamicin Drugs 0.000 description 2
- 101150073660 glmM gene Proteins 0.000 description 2
- 101150111330 glmU gene Proteins 0.000 description 2
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 2
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 2
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- QGWNDRXFNXRZMB-UHFFFAOYSA-N guanidine diphosphate Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(COP(O)(=O)OP(O)(O)=O)C(O)C1O QGWNDRXFNXRZMB-UHFFFAOYSA-N 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 238000012750 in vivo screening Methods 0.000 description 2
- 229910052500 inorganic mineral Inorganic materials 0.000 description 2
- 230000010039 intracellular degradation Effects 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 101150066555 lacZ gene Proteins 0.000 description 2
- 229940039696 lactobacillus Drugs 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 102000016470 mariner transposase Human genes 0.000 description 2
- 108060004631 mariner transposase Proteins 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 239000011707 mineral Substances 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 238000004445 quantitative analysis Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 229960005322 streptomycin Drugs 0.000 description 2
- 230000009897 systematic effect Effects 0.000 description 2
- 229960000790 thymol Drugs 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- IGXNPQWXIRIGBF-KEOOTSPTSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IGXNPQWXIRIGBF-KEOOTSPTSA-N 0.000 description 1
- 229940062827 2'-fucosyllactose Drugs 0.000 description 1
- HWHQUWQCBPAQQH-UHFFFAOYSA-N 2-O-alpha-L-Fucosyl-lactose Natural products OC1C(O)C(O)C(C)OC1OC1C(O)C(O)C(CO)OC1OC(C(O)CO)C(O)C(O)C=O HWHQUWQCBPAQQH-UHFFFAOYSA-N 0.000 description 1
- AUNPEJDACLEKSC-ZAYDSPBTSA-N 3-fucosyllactose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@H](OC(O)[C@H](O)[C@H]2O)CO)O[C@H](CO)[C@@H]1O AUNPEJDACLEKSC-ZAYDSPBTSA-N 0.000 description 1
- WJPIUUDKRHCAEL-UHFFFAOYSA-N 3FL Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(CO)OC(O)C1O WJPIUUDKRHCAEL-UHFFFAOYSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- 108010036211 5-HT-moduline Proteins 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- HGRBNYQIMKTUNT-XVYDVKMFSA-N Ala-Asn-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HGRBNYQIMKTUNT-XVYDVKMFSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- IYCZBJXFSZSHPN-DLOVCJGASA-N Ala-Cys-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IYCZBJXFSZSHPN-DLOVCJGASA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- FDAZDMAFZYTHGS-XVYDVKMFSA-N Ala-His-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FDAZDMAFZYTHGS-XVYDVKMFSA-N 0.000 description 1
- GRPHQEMIFDPKOE-HGNGGELXSA-N Ala-His-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GRPHQEMIFDPKOE-HGNGGELXSA-N 0.000 description 1
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 1
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- VEAPAYQQLSEKEM-GUBZILKMSA-N Ala-Met-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VEAPAYQQLSEKEM-GUBZILKMSA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 1
- JNJHNBXBGNJESC-KKXDTOCCSA-N Ala-Tyr-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JNJHNBXBGNJESC-KKXDTOCCSA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- RRGPUNYIPJXJBU-GUBZILKMSA-N Arg-Asp-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O RRGPUNYIPJXJBU-GUBZILKMSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- CVKOQHYVDVYJSI-QTKMDUPCSA-N Arg-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N)O CVKOQHYVDVYJSI-QTKMDUPCSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- OKKMBOSPBDASEP-CYDGBPFRSA-N Arg-Ile-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O OKKMBOSPBDASEP-CYDGBPFRSA-N 0.000 description 1
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- ZRNWJUAQKFUUKV-SRVKXCTJSA-N Arg-Met-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O ZRNWJUAQKFUUKV-SRVKXCTJSA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 1
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 1
- BFDDUDQCPJWQRQ-IHRRRGAJSA-N Arg-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O BFDDUDQCPJWQRQ-IHRRRGAJSA-N 0.000 description 1
- NMTANZXPDAHUKU-ULQDDVLXSA-N Arg-Tyr-Lys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 NMTANZXPDAHUKU-ULQDDVLXSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- WTUZDHWWGUQEKN-SRVKXCTJSA-N Arg-Val-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O WTUZDHWWGUQEKN-SRVKXCTJSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 1
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 1
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 1
- GMCOADLDNLGOFE-ZLUOBGJFSA-N Asn-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N GMCOADLDNLGOFE-ZLUOBGJFSA-N 0.000 description 1
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 1
- FANGHKQYFPYDNB-UBHSHLNASA-N Asn-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N FANGHKQYFPYDNB-UBHSHLNASA-N 0.000 description 1
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 1
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- QEQVUHQQYDZUEN-GUBZILKMSA-N Asn-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N QEQVUHQQYDZUEN-GUBZILKMSA-N 0.000 description 1
- VXLBDJWTONZHJN-YUMQZZPRSA-N Asn-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N VXLBDJWTONZHJN-YUMQZZPRSA-N 0.000 description 1
- AITGTTNYKAWKDR-CIUDSAMLSA-N Asn-His-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O AITGTTNYKAWKDR-CIUDSAMLSA-N 0.000 description 1
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- VOGCFWDZYYTEOY-DCAQKATOSA-N Asn-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N VOGCFWDZYYTEOY-DCAQKATOSA-N 0.000 description 1
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 1
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- OSZBYGVKAFZWKC-FXQIFTODSA-N Asn-Pro-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O OSZBYGVKAFZWKC-FXQIFTODSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- JPSODRNUDXONAS-XIRDDKMYSA-N Asn-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CC(=O)N)N JPSODRNUDXONAS-XIRDDKMYSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- SKQTXVZTCGSRJS-SRVKXCTJSA-N Asn-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O SKQTXVZTCGSRJS-SRVKXCTJSA-N 0.000 description 1
- KTDWFWNZLLFEFU-KKUMJFAQSA-N Asn-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KTDWFWNZLLFEFU-KKUMJFAQSA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 1
- ICAYWNTWHRRAQP-FXQIFTODSA-N Asp-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N ICAYWNTWHRRAQP-FXQIFTODSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- MFMJRYHVLLEMQM-DCAQKATOSA-N Asp-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N MFMJRYHVLLEMQM-DCAQKATOSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 1
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 1
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 1
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- AKPLMZMNJGNUKT-ZLUOBGJFSA-N Asp-Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O AKPLMZMNJGNUKT-ZLUOBGJFSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- YBMUFUWSMIKJQA-GUBZILKMSA-N Asp-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N YBMUFUWSMIKJQA-GUBZILKMSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 1
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 1
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- SCQIQCWLOMOEFP-DCAQKATOSA-N Asp-Leu-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SCQIQCWLOMOEFP-DCAQKATOSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- NVXLFIPTHPKSKL-UBHSHLNASA-N Asp-Trp-Asn Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 NVXLFIPTHPKSKL-UBHSHLNASA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- BOXNGMVEVOGXOJ-UBHSHLNASA-N Asp-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N BOXNGMVEVOGXOJ-UBHSHLNASA-N 0.000 description 1
- LEYKQPDPZJIRTA-AQZXSJQPSA-N Asp-Trp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LEYKQPDPZJIRTA-AQZXSJQPSA-N 0.000 description 1
- HCOQNGIHSXICCB-IHRRRGAJSA-N Asp-Tyr-Arg Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O HCOQNGIHSXICCB-IHRRRGAJSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 1
- GZYDPEJSZYZWEF-MXAVVETBSA-N Asp-Val-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O GZYDPEJSZYZWEF-MXAVVETBSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 241000193752 Bacillus circulans Species 0.000 description 1
- 241000193749 Bacillus coagulans Species 0.000 description 1
- 241000193422 Bacillus lentus Species 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 241000194107 Bacillus megaterium Species 0.000 description 1
- 241000194106 Bacillus mycoides Species 0.000 description 1
- 241000194103 Bacillus pumilus Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 241000770536 Bacillus thermophilus Species 0.000 description 1
- 101710173142 Beta-fructofuranosidase, cell wall isozyme Proteins 0.000 description 1
- 241000186000 Bifidobacterium Species 0.000 description 1
- 241000186016 Bifidobacterium bifidum Species 0.000 description 1
- 241001608472 Bifidobacterium longum Species 0.000 description 1
- 241000186015 Bifidobacterium longum subsp. infantis Species 0.000 description 1
- 241000193417 Brevibacillus laterosporus Species 0.000 description 1
- 241000589876 Campylobacter Species 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 241000588919 Citrobacter freundii Species 0.000 description 1
- 241000193401 Clostridium acetobutylicum Species 0.000 description 1
- 241001656809 Clostridium autoethanogenum Species 0.000 description 1
- 241000186566 Clostridium ljungdahlii Species 0.000 description 1
- 241000186226 Corynebacterium glutamicum Species 0.000 description 1
- RRIJEABIXPKSGP-FXQIFTODSA-N Cys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CS RRIJEABIXPKSGP-FXQIFTODSA-N 0.000 description 1
- LHLSSZYQFUNWRZ-NAKRPEOUSA-N Cys-Arg-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LHLSSZYQFUNWRZ-NAKRPEOUSA-N 0.000 description 1
- VZKXOWRNJDEGLZ-WHFBIAKZSA-N Cys-Asp-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O VZKXOWRNJDEGLZ-WHFBIAKZSA-N 0.000 description 1
- YMBAVNPKBWHDAW-CIUDSAMLSA-N Cys-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N YMBAVNPKBWHDAW-CIUDSAMLSA-N 0.000 description 1
- ASHTVGGFIMESRD-LKXGYXEUSA-N Cys-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)O ASHTVGGFIMESRD-LKXGYXEUSA-N 0.000 description 1
- VKAWJBQTFCBHQY-GUBZILKMSA-N Cys-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N VKAWJBQTFCBHQY-GUBZILKMSA-N 0.000 description 1
- PFAQXUDMZVMADG-AVGNSLFASA-N Cys-Gln-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PFAQXUDMZVMADG-AVGNSLFASA-N 0.000 description 1
- AOZBJZBKFHOYHL-AVGNSLFASA-N Cys-Glu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O AOZBJZBKFHOYHL-AVGNSLFASA-N 0.000 description 1
- UPURLDIGQGTUPJ-ZKWXMUAHSA-N Cys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N UPURLDIGQGTUPJ-ZKWXMUAHSA-N 0.000 description 1
- VNXXMHTZQGGDSG-CIUDSAMLSA-N Cys-His-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O VNXXMHTZQGGDSG-CIUDSAMLSA-N 0.000 description 1
- XIZWKXATMJODQW-KKUMJFAQSA-N Cys-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N XIZWKXATMJODQW-KKUMJFAQSA-N 0.000 description 1
- UQHYQYXOLIYNSR-CUJWVEQBSA-N Cys-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N)O UQHYQYXOLIYNSR-CUJWVEQBSA-N 0.000 description 1
- LYSHSHHDBVKJRN-JBDRJPRFSA-N Cys-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N LYSHSHHDBVKJRN-JBDRJPRFSA-N 0.000 description 1
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 1
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 1
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 1
- LBSKYJOZIIOZIO-DCAQKATOSA-N Cys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N LBSKYJOZIIOZIO-DCAQKATOSA-N 0.000 description 1
- XMVZMBGFIOQONW-GARJFASQSA-N Cys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)C(=O)O XMVZMBGFIOQONW-GARJFASQSA-N 0.000 description 1
- OZSBRCONEMXYOJ-AVGNSLFASA-N Cys-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N OZSBRCONEMXYOJ-AVGNSLFASA-N 0.000 description 1
- HEPLXMBVMCXTBP-QWRGUYRKSA-N Cys-Phe-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O HEPLXMBVMCXTBP-QWRGUYRKSA-N 0.000 description 1
- MFMDKTLJCUBQIC-MXAVVETBSA-N Cys-Phe-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MFMDKTLJCUBQIC-MXAVVETBSA-N 0.000 description 1
- QQOWCDCBFFBRQH-IXOXFDKPSA-N Cys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N)O QQOWCDCBFFBRQH-IXOXFDKPSA-N 0.000 description 1
- JEKIARHEWURQRJ-BZSNNMDCSA-N Cys-Phe-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CS)N JEKIARHEWURQRJ-BZSNNMDCSA-N 0.000 description 1
- CAXGCBSRJLADPD-FXQIFTODSA-N Cys-Pro-Asn Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CAXGCBSRJLADPD-FXQIFTODSA-N 0.000 description 1
- CNAMJJOZGXPDHW-IHRRRGAJSA-N Cys-Pro-Phe Chemical compound N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O CNAMJJOZGXPDHW-IHRRRGAJSA-N 0.000 description 1
- IQXSTXKVEMRMMB-XAVMHZPKSA-N Cys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N)O IQXSTXKVEMRMMB-XAVMHZPKSA-N 0.000 description 1
- CLEFUAZULXANBU-MELADBBJSA-N Cys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N)C(=O)O CLEFUAZULXANBU-MELADBBJSA-N 0.000 description 1
- VXDXZGYXHIADHF-YJRXYDGGSA-N Cys-Tyr-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VXDXZGYXHIADHF-YJRXYDGGSA-N 0.000 description 1
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 1
- WVWRADGCZPIJJR-IHRRRGAJSA-N Cys-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N WVWRADGCZPIJJR-IHRRRGAJSA-N 0.000 description 1
- 108010084372 D-arabinose isomerase Proteins 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241000194031 Enterococcus faecium Species 0.000 description 1
- 241001646716 Escherichia coli K-12 Species 0.000 description 1
- 241000901842 Escherichia coli W Species 0.000 description 1
- 101100061504 Escherichia coli cscB gene Proteins 0.000 description 1
- 101100309698 Escherichia coli cscK gene Proteins 0.000 description 1
- 101100186924 Escherichia coli neuC gene Proteins 0.000 description 1
- 108010046276 FLP recombinase Proteins 0.000 description 1
- 108090000156 Fructokinases Proteins 0.000 description 1
- 102000001390 Fructose-Bisphosphate Aldolase Human genes 0.000 description 1
- 108010068561 Fructose-Bisphosphate Aldolase Proteins 0.000 description 1
- 108010019236 Fucosyltransferases Proteins 0.000 description 1
- 108060003306 Galactosyltransferase Proteins 0.000 description 1
- 101150102398 Galt gene Proteins 0.000 description 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 1
- KJRXLVZYJJLUCV-DCAQKATOSA-N Gln-Arg-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KJRXLVZYJJLUCV-DCAQKATOSA-N 0.000 description 1
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 1
- DXMPMSWUZVNBSG-QEJZJMRPSA-N Gln-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N DXMPMSWUZVNBSG-QEJZJMRPSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 1
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 1
- GHYJGDCPHMSFEJ-GUBZILKMSA-N Gln-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GHYJGDCPHMSFEJ-GUBZILKMSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- WVUZERSNWGUKJY-BPUTZDHNSA-N Gln-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N WVUZERSNWGUKJY-BPUTZDHNSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- DQPOBSRQNWOBNA-GUBZILKMSA-N Gln-His-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O DQPOBSRQNWOBNA-GUBZILKMSA-N 0.000 description 1
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 1
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 1
- NNXIQPMZGZUFJJ-AVGNSLFASA-N Gln-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N NNXIQPMZGZUFJJ-AVGNSLFASA-N 0.000 description 1
- LKVCNGLNTAPMSZ-JYJNAYRXSA-N Gln-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N LKVCNGLNTAPMSZ-JYJNAYRXSA-N 0.000 description 1
- OOLCSQQPSLIETN-JYJNAYRXSA-N Gln-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)O OOLCSQQPSLIETN-JYJNAYRXSA-N 0.000 description 1
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- KHNJVFYHIKLUPD-SRVKXCTJSA-N Gln-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHNJVFYHIKLUPD-SRVKXCTJSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- XBWGJWXGUNSZAT-CIUDSAMLSA-N Gln-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XBWGJWXGUNSZAT-CIUDSAMLSA-N 0.000 description 1
- NMYFPKCIGUJMIK-GUBZILKMSA-N Gln-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N NMYFPKCIGUJMIK-GUBZILKMSA-N 0.000 description 1
- XUZQMPGBGFQJMY-SRVKXCTJSA-N Gln-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XUZQMPGBGFQJMY-SRVKXCTJSA-N 0.000 description 1
- DOMHVQBSRJNNKD-ZPFDUUQYSA-N Gln-Met-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DOMHVQBSRJNNKD-ZPFDUUQYSA-N 0.000 description 1
- ZXGLLNZQSBLQLT-SRVKXCTJSA-N Gln-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZXGLLNZQSBLQLT-SRVKXCTJSA-N 0.000 description 1
- LVRKAFPPFJRIOF-GARJFASQSA-N Gln-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N LVRKAFPPFJRIOF-GARJFASQSA-N 0.000 description 1
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- WPJDPEOQUIXXOY-AVGNSLFASA-N Gln-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WPJDPEOQUIXXOY-AVGNSLFASA-N 0.000 description 1
- BJVBMSTUUWGZKX-JYJNAYRXSA-N Gln-Tyr-His Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BJVBMSTUUWGZKX-JYJNAYRXSA-N 0.000 description 1
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 1
- VCUNGPMMPNJSGS-JYJNAYRXSA-N Gln-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VCUNGPMMPNJSGS-JYJNAYRXSA-N 0.000 description 1
- HPBKQFJXDUVNQV-FHWLQOOXSA-N Gln-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O HPBKQFJXDUVNQV-FHWLQOOXSA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- KIMXNQXJJWWVIN-AVGNSLFASA-N Glu-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O KIMXNQXJJWWVIN-AVGNSLFASA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- ZJFNRQHUIHKZJF-GUBZILKMSA-N Glu-His-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ZJFNRQHUIHKZJF-GUBZILKMSA-N 0.000 description 1
- QLPYYTDOUQNJGQ-AVGNSLFASA-N Glu-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N QLPYYTDOUQNJGQ-AVGNSLFASA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 1
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 1
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- 108010055629 Glucosyltransferases Proteins 0.000 description 1
- 102000000340 Glucosyltransferases Human genes 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- LPHQAFLNEHWKFF-QXEWZRGKSA-N Gly-Met-Ile Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LPHQAFLNEHWKFF-QXEWZRGKSA-N 0.000 description 1
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 1
- RUDRIZRGOLQSMX-IUCAKERBSA-N Gly-Met-Met Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O RUDRIZRGOLQSMX-IUCAKERBSA-N 0.000 description 1
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 241000589989 Helicobacter Species 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 1
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 1
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 description 1
- VOKCBYNCZVSILJ-KKUMJFAQSA-N His-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O VOKCBYNCZVSILJ-KKUMJFAQSA-N 0.000 description 1
- MVADCDSCFTXCBT-CIUDSAMLSA-N His-Asp-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MVADCDSCFTXCBT-CIUDSAMLSA-N 0.000 description 1
- YJBMLTVVVRJNOK-SRVKXCTJSA-N His-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N YJBMLTVVVRJNOK-SRVKXCTJSA-N 0.000 description 1
- UOAVQQRILDGZEN-SRVKXCTJSA-N His-Asp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UOAVQQRILDGZEN-SRVKXCTJSA-N 0.000 description 1
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 1
- NWGXCPUKPVISSJ-AVGNSLFASA-N His-Gln-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NWGXCPUKPVISSJ-AVGNSLFASA-N 0.000 description 1
- IMCHNUANCIGUKS-SRVKXCTJSA-N His-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IMCHNUANCIGUKS-SRVKXCTJSA-N 0.000 description 1
- TXLQHACKRLWYCM-DCAQKATOSA-N His-Glu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O TXLQHACKRLWYCM-DCAQKATOSA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- WGHJXSONOOTTCZ-JYJNAYRXSA-N His-Glu-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WGHJXSONOOTTCZ-JYJNAYRXSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 1
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 1
- DYKZGTLPSNOFHU-DEQVHRJGSA-N His-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DYKZGTLPSNOFHU-DEQVHRJGSA-N 0.000 description 1
- WTJBVCUCLWFGAH-JUKXBJQTSA-N His-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WTJBVCUCLWFGAH-JUKXBJQTSA-N 0.000 description 1
- LJUIEESLIAZSFR-SRVKXCTJSA-N His-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LJUIEESLIAZSFR-SRVKXCTJSA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- VGYOLSOFODKLSP-IHPCNDPISA-N His-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CN=CN1 VGYOLSOFODKLSP-IHPCNDPISA-N 0.000 description 1
- RNAYRCNHRYEBTH-IHRRRGAJSA-N His-Met-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RNAYRCNHRYEBTH-IHRRRGAJSA-N 0.000 description 1
- KQJBFMJFUXAYPK-AVGNSLFASA-N His-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KQJBFMJFUXAYPK-AVGNSLFASA-N 0.000 description 1
- WYSJPCTWSBJFCO-AVGNSLFASA-N His-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N WYSJPCTWSBJFCO-AVGNSLFASA-N 0.000 description 1
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 1
- YAEKRYQASVCDLK-JYJNAYRXSA-N His-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N YAEKRYQASVCDLK-JYJNAYRXSA-N 0.000 description 1
- YXXKBPJEIYFGOD-MGHWNKPDSA-N His-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N YXXKBPJEIYFGOD-MGHWNKPDSA-N 0.000 description 1
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 1
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 1
- WHKLDLQHSYAVGU-ACRUOGEOSA-N His-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WHKLDLQHSYAVGU-ACRUOGEOSA-N 0.000 description 1
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 1
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 1
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 1
- RXKFKJVJVHLRIE-XIRDDKMYSA-N His-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CN=CN3)N RXKFKJVJVHLRIE-XIRDDKMYSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 1
- SWBUZLFWGJETAO-KKUMJFAQSA-N His-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O SWBUZLFWGJETAO-KKUMJFAQSA-N 0.000 description 1
- RNVUQLOKVIPNEM-BZSNNMDCSA-N His-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O RNVUQLOKVIPNEM-BZSNNMDCSA-N 0.000 description 1
- JATYGDHMDRAISQ-KKUMJFAQSA-N His-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O JATYGDHMDRAISQ-KKUMJFAQSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- UNDGQKWQNSTPPW-CYDGBPFRSA-N Ile-Arg-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N UNDGQKWQNSTPPW-CYDGBPFRSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 description 1
- ZGGWRNBSBOHIGH-HVTMNAMFSA-N Ile-Gln-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZGGWRNBSBOHIGH-HVTMNAMFSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- ZXIGYKICRDFISM-DJFWLOJKSA-N Ile-His-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZXIGYKICRDFISM-DJFWLOJKSA-N 0.000 description 1
- KOPIAUWNLKKELG-SIGLWIIPSA-N Ile-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N KOPIAUWNLKKELG-SIGLWIIPSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- LNJLOZYNZFGJMM-DEQVHRJGSA-N Ile-His-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N LNJLOZYNZFGJMM-DEQVHRJGSA-N 0.000 description 1
- AFERFBZLVUFWRA-HTFCKZLJSA-N Ile-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)O)N AFERFBZLVUFWRA-HTFCKZLJSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 1
- VUPHVQCDULLACF-NAKRPEOUSA-N Ile-Met-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N VUPHVQCDULLACF-NAKRPEOUSA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- FHPZJWJWTWZKNA-LLLHUVSDSA-N Ile-Phe-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N FHPZJWJWTWZKNA-LLLHUVSDSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- JDCQDJVYUXNCGF-SPOWBLRKSA-N Ile-Ser-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JDCQDJVYUXNCGF-SPOWBLRKSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- 238000012404 In vitro experiment Methods 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 241000170280 Kluyveromyces sp. Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- 101710186049 L-fuculokinase Proteins 0.000 description 1
- 240000001046 Lactobacillus acidophilus Species 0.000 description 1
- 235000013956 Lactobacillus acidophilus Nutrition 0.000 description 1
- 244000199885 Lactobacillus bulgaricus Species 0.000 description 1
- 235000013960 Lactobacillus bulgaricus Nutrition 0.000 description 1
- 244000199866 Lactobacillus casei Species 0.000 description 1
- 235000013958 Lactobacillus casei Nutrition 0.000 description 1
- 241000218492 Lactobacillus crispatus Species 0.000 description 1
- 241000186673 Lactobacillus delbrueckii Species 0.000 description 1
- 241000186606 Lactobacillus gasseri Species 0.000 description 1
- 240000002605 Lactobacillus helveticus Species 0.000 description 1
- 235000013967 Lactobacillus helveticus Nutrition 0.000 description 1
- 241001561398 Lactobacillus jensenii Species 0.000 description 1
- 241000186604 Lactobacillus reuteri Species 0.000 description 1
- 241000218588 Lactobacillus rhamnosus Species 0.000 description 1
- 241000186869 Lactobacillus salivarius Species 0.000 description 1
- 241000194036 Lactococcus Species 0.000 description 1
- 101100186921 Legionella pneumophila subsp. pneumophila (strain Philadelphia 1 / ATCC 33152 / DSM 7513) neuB gene Proteins 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 1
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 1
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 1
- FZMNAYBEFGZEIF-AVGNSLFASA-N Leu-Met-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N FZMNAYBEFGZEIF-AVGNSLFASA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- YWFZWQKWNDOWPA-XIRDDKMYSA-N Leu-Trp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O YWFZWQKWNDOWPA-XIRDDKMYSA-N 0.000 description 1
- SNOUHRPNNCAOPI-SZMVWBNQSA-N Leu-Trp-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SNOUHRPNNCAOPI-SZMVWBNQSA-N 0.000 description 1
- WBRJVRXEGQIDRK-XIRDDKMYSA-N Leu-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 WBRJVRXEGQIDRK-XIRDDKMYSA-N 0.000 description 1
- SXOFUVGLPHCPRQ-KKUMJFAQSA-N Leu-Tyr-Cys Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(O)=O SXOFUVGLPHCPRQ-KKUMJFAQSA-N 0.000 description 1
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- ZAENPHCEQXALHO-GUBZILKMSA-N Lys-Cys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZAENPHCEQXALHO-GUBZILKMSA-N 0.000 description 1
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 1
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- SQJSXOQXJYAVRV-SRVKXCTJSA-N Lys-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N SQJSXOQXJYAVRV-SRVKXCTJSA-N 0.000 description 1
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- CBNMHRCLYBJIIZ-XUXIUFHCSA-N Lys-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N CBNMHRCLYBJIIZ-XUXIUFHCSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- WKUXWMWQTOYTFI-SRVKXCTJSA-N Lys-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N WKUXWMWQTOYTFI-SRVKXCTJSA-N 0.000 description 1
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 1
- IPSDPDAOSAEWCN-RHYQMDGZSA-N Lys-Met-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IPSDPDAOSAEWCN-RHYQMDGZSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- NQSFIPWBPXNJII-PMVMPFDFSA-N Lys-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 NQSFIPWBPXNJII-PMVMPFDFSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- ZFNYWKHYUMEZDZ-WDSOQIARSA-N Lys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N ZFNYWKHYUMEZDZ-WDSOQIARSA-N 0.000 description 1
- ZVZRQKJOQQAFCF-ULQDDVLXSA-N Lys-Tyr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZVZRQKJOQQAFCF-ULQDDVLXSA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- VWJFOUBDZIUXGA-AVGNSLFASA-N Lys-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N VWJFOUBDZIUXGA-AVGNSLFASA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- OLWAOWXIADGIJG-AVGNSLFASA-N Met-Arg-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(O)=O OLWAOWXIADGIJG-AVGNSLFASA-N 0.000 description 1
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 1
- FRWZTWWOORIIBA-FXQIFTODSA-N Met-Asn-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FRWZTWWOORIIBA-FXQIFTODSA-N 0.000 description 1
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 1
- TUSOIZOVPJCMFC-FXQIFTODSA-N Met-Asp-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O TUSOIZOVPJCMFC-FXQIFTODSA-N 0.000 description 1
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 1
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- SDTSLIMYROCDNS-FXQIFTODSA-N Met-Cys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O SDTSLIMYROCDNS-FXQIFTODSA-N 0.000 description 1
- RMHHNLKYPOOKQN-FXQIFTODSA-N Met-Cys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O RMHHNLKYPOOKQN-FXQIFTODSA-N 0.000 description 1
- CEGVMWAVGBRVFS-XGEHTFHBSA-N Met-Cys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CEGVMWAVGBRVFS-XGEHTFHBSA-N 0.000 description 1
- MYKLINMAGAIRPJ-CIUDSAMLSA-N Met-Gln-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MYKLINMAGAIRPJ-CIUDSAMLSA-N 0.000 description 1
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- RNAGAJXCSPDPRK-KKUMJFAQSA-N Met-Glu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 RNAGAJXCSPDPRK-KKUMJFAQSA-N 0.000 description 1
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- OBCRZLRPJFNLAN-DCAQKATOSA-N Met-His-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OBCRZLRPJFNLAN-DCAQKATOSA-N 0.000 description 1
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 1
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 1
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 1
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 1
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 1
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 1
- AOFZWWDTTJLHOU-ULQDDVLXSA-N Met-Lys-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AOFZWWDTTJLHOU-ULQDDVLXSA-N 0.000 description 1
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 1
- LNXGEYIEEUZGGH-JYJNAYRXSA-N Met-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=CC=C1 LNXGEYIEEUZGGH-JYJNAYRXSA-N 0.000 description 1
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 1
- WYDFQSJOARJAMM-GUBZILKMSA-N Met-Pro-Asp Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WYDFQSJOARJAMM-GUBZILKMSA-N 0.000 description 1
- LUYURUYVNYGKGM-RCWTZXSCSA-N Met-Pro-Thr Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUYURUYVNYGKGM-RCWTZXSCSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 1
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 1
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- WXJLBSXNUHIGSS-OSUNSFLBSA-N Met-Thr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WXJLBSXNUHIGSS-OSUNSFLBSA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 1
- HMEVNCOJHJTLNB-BVSLBCMMSA-N Met-Trp-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N HMEVNCOJHJTLNB-BVSLBCMMSA-N 0.000 description 1
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 1
- 241000192041 Micrococcus Species 0.000 description 1
- 108010093077 N-Acetylglucosaminyltransferases Proteins 0.000 description 1
- 102000002493 N-Acetylglucosaminyltransferases Human genes 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- OVRNDRQMDRJTHS-RTRLPJTCSA-N N-acetyl-D-glucosamine Chemical group CC(=O)N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-RTRLPJTCSA-N 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- 206010051606 Necrotising colitis Diseases 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 241000529648 Neisseria meningitidis MC58 Species 0.000 description 1
- 101001121800 Neisseria meningitidis Polysialic acid O-acetyltransferase Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 241000606860 Pasteurella Species 0.000 description 1
- 241000588701 Pectobacterium carotovorum Species 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 1
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 1
- OXUMFAOVGFODPN-KKUMJFAQSA-N Phe-Asn-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OXUMFAOVGFODPN-KKUMJFAQSA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 1
- UEEVBGHEGJMDDV-AVGNSLFASA-N Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEEVBGHEGJMDDV-AVGNSLFASA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- CPTJPDZTFNKFOU-MXAVVETBSA-N Phe-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N CPTJPDZTFNKFOU-MXAVVETBSA-N 0.000 description 1
- HPECNYCQLSVCHH-BZSNNMDCSA-N Phe-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N HPECNYCQLSVCHH-BZSNNMDCSA-N 0.000 description 1
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 1
- IILUKIJNFMUBNF-IHRRRGAJSA-N Phe-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O IILUKIJNFMUBNF-IHRRRGAJSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 1
- UEADQPLTYBWWTG-AVGNSLFASA-N Phe-Glu-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEADQPLTYBWWTG-AVGNSLFASA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- ISYSEOWLRQKQEQ-JYJNAYRXSA-N Phe-His-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISYSEOWLRQKQEQ-JYJNAYRXSA-N 0.000 description 1
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 1
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- BVHFFNYBKRTSIU-MEYUZBJRSA-N Phe-His-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BVHFFNYBKRTSIU-MEYUZBJRSA-N 0.000 description 1
- DZVXMMSUWWUIQE-ACRUOGEOSA-N Phe-His-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N DZVXMMSUWWUIQE-ACRUOGEOSA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- WURZLPSMYZLEGH-UNQGMJICSA-N Phe-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N)O WURZLPSMYZLEGH-UNQGMJICSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 1
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- JLDZQPPLTJTJLE-IHPCNDPISA-N Phe-Trp-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JLDZQPPLTJTJLE-IHPCNDPISA-N 0.000 description 1
- ZVJGAXNBBKPYOE-HKUYNNGSSA-N Phe-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 ZVJGAXNBBKPYOE-HKUYNNGSSA-N 0.000 description 1
- WDOCBGZHAQQIBL-IHPCNDPISA-N Phe-Trp-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 WDOCBGZHAQQIBL-IHPCNDPISA-N 0.000 description 1
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- QUUCAHIYARMNBL-FHWLQOOXSA-N Phe-Tyr-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N QUUCAHIYARMNBL-FHWLQOOXSA-N 0.000 description 1
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 1
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 1
- 241000607568 Photobacterium Species 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 241000235061 Pichia sp. Species 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 1
- WECYCNFPGZLOOU-FXQIFTODSA-N Pro-Asn-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O WECYCNFPGZLOOU-FXQIFTODSA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- NGNNPLJHUFCOMZ-FXQIFTODSA-N Pro-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 NGNNPLJHUFCOMZ-FXQIFTODSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 1
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- XQHGISDMVBTGAL-ULQDDVLXSA-N Pro-His-Phe Chemical compound C([C@@H](C(=O)[O-])NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1[NH2+]CCC1)C1=CC=CC=C1 XQHGISDMVBTGAL-ULQDDVLXSA-N 0.000 description 1
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 1
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 1
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 1
- XFFIGWGYMUFCCQ-ULQDDVLXSA-N Pro-His-Tyr Chemical compound C1=CC(O)=CC=C1C[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)[C@H]1[NH2+]CCC1)CC1=CN=CN1 XFFIGWGYMUFCCQ-ULQDDVLXSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 1
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- 241000169446 Promethis Species 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 1
- 241000589540 Pseudomonas fluorescens Species 0.000 description 1
- 108010054530 RGDN peptide Proteins 0.000 description 1
- 108010025216 RVF peptide Proteins 0.000 description 1
- 241000316848 Rhodococcus <scale insect> Species 0.000 description 1
- 241001030146 Rhodotorula sp. Species 0.000 description 1
- 241000235088 Saccharomyces sp. Species 0.000 description 1
- 241001360381 Saccharomycopsis sp. Species 0.000 description 1
- 241000720795 Schizosaccharomyces sp. Species 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 1
- YMAWDPHQVABADW-CIUDSAMLSA-N Ser-Gln-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YMAWDPHQVABADW-CIUDSAMLSA-N 0.000 description 1
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 1
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 1
- XPVIVVLLLOFBRH-XIRDDKMYSA-N Ser-Trp-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)CO)C(O)=O XPVIVVLLLOFBRH-XIRDDKMYSA-N 0.000 description 1
- HXPNJVLVHKABMJ-KKUMJFAQSA-N Ser-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N)O HXPNJVLVHKABMJ-KKUMJFAQSA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- 102100023230 Serine/threonine-protein kinase MAK Human genes 0.000 description 1
- 101710161071 Sialic acid transporter NanT Proteins 0.000 description 1
- 241000204117 Sporolactobacillus Species 0.000 description 1
- 244000057717 Streptococcus lactis Species 0.000 description 1
- 235000014897 Streptococcus lactis Nutrition 0.000 description 1
- 101710117283 Sucrose permease Proteins 0.000 description 1
- 241000192581 Synechocystis sp. Species 0.000 description 1
- 241000520244 Tatumella citrea Species 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- XVNZSJIKGJLQLH-RCWTZXSCSA-N Thr-Arg-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N)O XVNZSJIKGJLQLH-RCWTZXSCSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 1
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 1
- RJBFAHKSFNNHAI-XKBZYTNZSA-N Thr-Gln-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O RJBFAHKSFNNHAI-XKBZYTNZSA-N 0.000 description 1
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- RCEHMXVEMNXRIW-IRIUXVKKSA-N Thr-Gln-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O RCEHMXVEMNXRIW-IRIUXVKKSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- UDNVOQMPQBEITB-MEYUZBJRSA-N Thr-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UDNVOQMPQBEITB-MEYUZBJRSA-N 0.000 description 1
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 1
- YDWLCDQXLCILCZ-BWAGICSOSA-N Thr-His-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YDWLCDQXLCILCZ-BWAGICSOSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 1
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 1
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- FBQHKSPOIAFUEI-OWLDWWDNSA-N Thr-Trp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O FBQHKSPOIAFUEI-OWLDWWDNSA-N 0.000 description 1
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 1
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 1
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 1
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 1
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 1
- DXDMNBJJEXYMLA-UBHSHLNASA-N Trp-Asn-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 DXDMNBJJEXYMLA-UBHSHLNASA-N 0.000 description 1
- LTLBNCDNXQCOLB-UBHSHLNASA-N Trp-Asp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 LTLBNCDNXQCOLB-UBHSHLNASA-N 0.000 description 1
- NKUIXQOJUAEIET-AQZXSJQPSA-N Trp-Asp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 NKUIXQOJUAEIET-AQZXSJQPSA-N 0.000 description 1
- OFCKFBGRYHOKFP-IHPCNDPISA-N Trp-Asp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N OFCKFBGRYHOKFP-IHPCNDPISA-N 0.000 description 1
- SMDQRGAERNMJJF-JQWIXIFHSA-N Trp-Cys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 SMDQRGAERNMJJF-JQWIXIFHSA-N 0.000 description 1
- OFSLQLHHDQOWDB-QEJZJMRPSA-N Trp-Cys-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 OFSLQLHHDQOWDB-QEJZJMRPSA-N 0.000 description 1
- PTAWAMWPRFTACW-SZMVWBNQSA-N Trp-Gln-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PTAWAMWPRFTACW-SZMVWBNQSA-N 0.000 description 1
- OENGVSDBQHHGBU-QEJZJMRPSA-N Trp-Glu-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OENGVSDBQHHGBU-QEJZJMRPSA-N 0.000 description 1
- HQJOVVWAPQPYDS-ZFWWWQNUSA-N Trp-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQJOVVWAPQPYDS-ZFWWWQNUSA-N 0.000 description 1
- OZUJUVFWMHTWCZ-HOCLYGCPSA-N Trp-Gly-His Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OZUJUVFWMHTWCZ-HOCLYGCPSA-N 0.000 description 1
- PVRRBEROBJQPJX-SZMVWBNQSA-N Trp-His-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PVRRBEROBJQPJX-SZMVWBNQSA-N 0.000 description 1
- ILDJYIDXESUBOE-HSCHXYMDSA-N Trp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ILDJYIDXESUBOE-HSCHXYMDSA-N 0.000 description 1
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 1
- KRCPXGSWDOGHAM-XIRDDKMYSA-N Trp-Lys-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O KRCPXGSWDOGHAM-XIRDDKMYSA-N 0.000 description 1
- SLOYNOMYOAOUCX-BVSLBCMMSA-N Trp-Phe-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SLOYNOMYOAOUCX-BVSLBCMMSA-N 0.000 description 1
- LFMLXCJYCFZBKE-IHPCNDPISA-N Trp-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N LFMLXCJYCFZBKE-IHPCNDPISA-N 0.000 description 1
- UQHPXCFAHVTWFU-BVSLBCMMSA-N Trp-Phe-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UQHPXCFAHVTWFU-BVSLBCMMSA-N 0.000 description 1
- ADMHZNPMMVKGJW-BPUTZDHNSA-N Trp-Ser-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ADMHZNPMMVKGJW-BPUTZDHNSA-N 0.000 description 1
- UMIACFRBELJMGT-GQGQLFGLSA-N Trp-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UMIACFRBELJMGT-GQGQLFGLSA-N 0.000 description 1
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 1
- XGEUYEOEZYFHRL-KKXDTOCCSA-N Tyr-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XGEUYEOEZYFHRL-KKXDTOCCSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 1
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 1
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- QNJYPWZACBACER-KKUMJFAQSA-N Tyr-Asp-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O QNJYPWZACBACER-KKUMJFAQSA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- BVOCLAPFOBSJHR-KKUMJFAQSA-N Tyr-Cys-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BVOCLAPFOBSJHR-KKUMJFAQSA-N 0.000 description 1
- BVDHHLMIZFCAAU-BZSNNMDCSA-N Tyr-Cys-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BVDHHLMIZFCAAU-BZSNNMDCSA-N 0.000 description 1
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 1
- RIJPHPUJRLEOAK-JYJNAYRXSA-N Tyr-Gln-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O RIJPHPUJRLEOAK-JYJNAYRXSA-N 0.000 description 1
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 1
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 1
- PDKILSUYSUGCAO-JBACZVJFSA-N Tyr-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PDKILSUYSUGCAO-JBACZVJFSA-N 0.000 description 1
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 1
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- ARSHSYUZHSIYKR-ACRUOGEOSA-N Tyr-His-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ARSHSYUZHSIYKR-ACRUOGEOSA-N 0.000 description 1
- CVXURBLRELTJKO-BWAGICSOSA-N Tyr-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O CVXURBLRELTJKO-BWAGICSOSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 1
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 1
- DZKFGCNKEVMXFA-JUKXBJQTSA-N Tyr-Ile-His Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O DZKFGCNKEVMXFA-JUKXBJQTSA-N 0.000 description 1
- HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 1
- DAOREBHZAKCOEN-ULQDDVLXSA-N Tyr-Leu-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O DAOREBHZAKCOEN-ULQDDVLXSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 1
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- XDGPTBVOSHKDFT-KKUMJFAQSA-N Tyr-Met-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XDGPTBVOSHKDFT-KKUMJFAQSA-N 0.000 description 1
- GYBVHTWOQJMYAM-HRCADAONSA-N Tyr-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N GYBVHTWOQJMYAM-HRCADAONSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 1
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 1
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 1
- PLXQRTXVLZUNMU-RNXOBYDBSA-N Tyr-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC4=CC=C(C=C4)O)N PLXQRTXVLZUNMU-RNXOBYDBSA-N 0.000 description 1
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 1
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 1
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 1
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- PLVVHGFEMSDRET-IHPCNDPISA-N Tyr-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PLVVHGFEMSDRET-IHPCNDPISA-N 0.000 description 1
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- NAHUCETZGZZSEX-IHPCNDPISA-N Tyr-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NAHUCETZGZZSEX-IHPCNDPISA-N 0.000 description 1
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 1
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 1
- TYGHOWWWMTWVKM-HJOGWXRNSA-N Tyr-Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 TYGHOWWWMTWVKM-HJOGWXRNSA-N 0.000 description 1
- KSGKJSFPWSMJHK-JNPHEJMOSA-N Tyr-Tyr-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSGKJSFPWSMJHK-JNPHEJMOSA-N 0.000 description 1
- BQASAMYRHNCKQE-IHRRRGAJSA-N Tyr-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BQASAMYRHNCKQE-IHRRRGAJSA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- 101710196080 UDP-glucose:undecaprenyl-phosphate glucose-1-phosphate transferase Proteins 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- RKIGNDAHUOOIMJ-BQFCYCMXSA-N Val-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 RKIGNDAHUOOIMJ-BQFCYCMXSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- IEBGHUMBJXIXHM-AVGNSLFASA-N Val-Lys-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N IEBGHUMBJXIXHM-AVGNSLFASA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 1
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- NGXQOQNXSGOYOI-BQFCYCMXSA-N Val-Trp-Gln Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 NGXQOQNXSGOYOI-BQFCYCMXSA-N 0.000 description 1
- SVLAAUGFIHSJPK-JYJNAYRXSA-N Val-Trp-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SVLAAUGFIHSJPK-JYJNAYRXSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 1
- PDASTHRLDFOZMG-JYJNAYRXSA-N Val-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 PDASTHRLDFOZMG-JYJNAYRXSA-N 0.000 description 1
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 241000589636 Xanthomonas campestris Species 0.000 description 1
- 241000490645 Yarrowia sp. Species 0.000 description 1
- 108010084455 Zeocin Proteins 0.000 description 1
- 241000193453 [Clostridium] cellulolyticum Species 0.000 description 1
- USAZACJQJDHAJH-KDEXOMDGSA-N [[(2r,3s,4r,5s)-5-(2,4-dioxo-1h-pyrimidin-6-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2r,3r,4s,5r,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl] hydrogen phosphate Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](C=2NC(=O)NC(=O)C=2)O1 USAZACJQJDHAJH-KDEXOMDGSA-N 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 108010066829 alanyl-glutamyl-aspartylprolyine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- HXXFSFRBOHSIMQ-FPRJBGLDSA-N alpha-D-galactose 1-phosphate Chemical compound OC[C@H]1O[C@H](OP(O)(O)=O)[C@H](O)[C@@H](O)[C@H]1O HXXFSFRBOHSIMQ-FPRJBGLDSA-N 0.000 description 1
- PHTAQVMXYWFMHF-GJGMMKECSA-N alpha-L-Fucp-(1->2)-beta-D-Galp-(1->4)-D-GlcpNAc Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@@H]2[C@H](OC(O)[C@H](NC(C)=O)[C@H]2O)CO)O[C@H](CO)[C@H](O)[C@@H]1O PHTAQVMXYWFMHF-GJGMMKECSA-N 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 239000000908 ammonium hydroxide Substances 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 101150035354 araA gene Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 229910052786 argon Inorganic materials 0.000 description 1
- 229940054340 bacillus coagulans Drugs 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 229940002008 bifidobacterium bifidum Drugs 0.000 description 1
- 229940004120 bifidobacterium infantis Drugs 0.000 description 1
- 229940009291 bifidobacterium longum Drugs 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 238000010352 biotechnological method Methods 0.000 description 1
- 230000004641 brain development Effects 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000005277 cation exchange chromatography Methods 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- RPKLZQLYODPWTM-KBMWBBLPSA-N cholanoic acid Chemical compound C1CC2CCCC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@@H](CCC(O)=O)C)[C@@]1(C)CC2 RPKLZQLYODPWTM-KBMWBBLPSA-N 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 230000003930 cognitive ability Effects 0.000 description 1
- 238000001360 collision-induced dissociation Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 239000000287 crude extract Substances 0.000 description 1
- 101150018392 cscA gene Proteins 0.000 description 1
- 101150091121 cscR gene Proteins 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- IERHLVCPSMICTF-UHFFFAOYSA-N cytidine monophosphate Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(COP(O)(O)=O)O1 IERHLVCPSMICTF-UHFFFAOYSA-N 0.000 description 1
- IERHLVCPSMICTF-ZAKLUEHWSA-N cytidine-5'-monophosphate Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](COP(O)(O)=O)O1 IERHLVCPSMICTF-ZAKLUEHWSA-N 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011026 diafiltration Methods 0.000 description 1
- 235000015872 dietary supplement Nutrition 0.000 description 1
- IJKVHSBPTUYDLN-UHFFFAOYSA-N dihydroxy(oxo)silane Chemical compound O[Si](O)=O IJKVHSBPTUYDLN-UHFFFAOYSA-N 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000000909 electrodialysis Methods 0.000 description 1
- 238000000132 electrospray ionisation Methods 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 101150025078 fucK gene Proteins 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 235000003869 genetically modified organism Nutrition 0.000 description 1
- XHMJOUIAFHJHBW-VFUOTHLCSA-N glucosamine 6-phosphate Chemical compound N[C@H]1[C@H](O)O[C@H](COP(O)(O)=O)[C@H](O)[C@@H]1O XHMJOUIAFHJHBW-VFUOTHLCSA-N 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 1
- 101150100121 gna1 gene Proteins 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
- 244000005709 gut microbiome Species 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000010189 intracellular transport Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 229940039695 lactobacillus acidophilus Drugs 0.000 description 1
- 229940004208 lactobacillus bulgaricus Drugs 0.000 description 1
- 229940017800 lactobacillus casei Drugs 0.000 description 1
- 229940054346 lactobacillus helveticus Drugs 0.000 description 1
- 229940001882 lactobacillus reuteri Drugs 0.000 description 1
- 108010044538 lactostatin Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010010679 lysyl-valyl-leucyl-aspartic acid Proteins 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000007721 medicinal effect Effects 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 238000001471 micro-filtration Methods 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 101150070589 nagB gene Proteins 0.000 description 1
- 101150076570 nanK gene Proteins 0.000 description 1
- 208000004995 necrotizing enterocolitis Diseases 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000012261 overproduction Methods 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 201000006195 perinatal necrotizing enterocolitis Diseases 0.000 description 1
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- CWCMIVBLVUHDHK-ZSNHEYEWSA-N phleomycin D1 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC[C@@H](N=1)C=1SC=C(N=1)C(=O)NCCCCNC(N)=N)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C CWCMIVBLVUHDHK-ZSNHEYEWSA-N 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 101150067185 ppsA gene Proteins 0.000 description 1
- 108010065320 prolyl-lysyl-glutamyl-lysine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 238000001223 reverse osmosis Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010029599 tyrosyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 108010000998 wheylin-2 peptide Proteins 0.000 description 1
Abstract
Description
Настоящее изобретение относится к сиалилтрансферазам, их применению в получении сиалированных олигосахаридов и к применению указанных сиалированных олигосахаридов в предоставлении питательных композиций.The present invention relates to sialyltransferases, their use in the preparation of sialylated oligosaccharides and the use of said sialylated oligosaccharides in the provision of nutritional compositions.
Предшествующий уровень техникиPrior Art
На сегодняшний день идентифицировано более 150 структурно отличающихся олигосахаридов грудного молока (ОГМ). Несмотря на то, что ОГМ представляют только незначительное количество суммарных питательных веществ грудного молока, их высоко полезное действие на развитие детей, вскармливаемых грудью, стало очевидным за последние десятилетия.To date, more than 150 structurally distinct human milk oligosaccharides (HMOs) have been identified. Although HMOs represent only a small amount of the total nutrients of breast milk, their highly beneficial effects on the development of breastfed infants have become evident over recent decades.
Вплоть до 20% общего содержания ОГМ в грудном молоке является кислотным. Таким образом, данные молекулы ОГМ обладают по меньшей мере одной группировкой сиаловой кислоты. В то время как только 3% сиаловой кислоты, содержащейся в грудном молоке, доступно в свободной форме, 23% и 74% связаны с (глико-)протеинами и олигосахаридами, соответственно. Наиболее распространенным членом семейства сиаловых кислот является N-ацетилнейраминовая кислота (Neu5Ac). В качестве составной части олигомерного сахарида N-ацетилнейраминовая кислота часто обуславливает биологическую активность сахарида.Up to 20% of the total HGM content in breast milk is acidic. Thus, these OGM molecules possess at least one sialic acid moiety. While only 3% of sialic acid found in breast milk is available in free form, 23% and 74% are bound to (glyco-)proteins and oligosaccharides, respectively. The most common member of the sialic acid family is N-acetylneuraminic acid (Neu5Ac). As a constituent of an oligomeric saccharide, N-acetylneuraminic acid often mediates the biological activity of the saccharide.
Наблюдалось, что сиалированные ОГМ (СОГМ) поддерживают устойчивость к патогенам, а также кишечной микрофлоре. Интересно, что недавние исследования дополнительно продемонстрировали защитное действие длинноцепочечных СОГМ от некротизирующего энтероколита, который является одним из наиболее распространенных и смертельных заболеваний у недоношенных младенцев. Кроме того, полагают, что СОГМ поддерживает развитие мозга младенца и его когнитивные способности.Sialylated OGMs (SOGMs) have been observed to support resistance to pathogens as well as intestinal microflora. Interestingly, recent studies have further demonstrated the protective effect of long-chain SOGM against necrotizing enterocolitis, which is one of the most common and fatal diseases in preterm infants. In addition, SOGM is believed to support infant brain development and cognitive abilities.
Несмотря на то, что значительные варьирования в профиле ОГМ среди разных доноров препятствуют абсолютной количественной оценке кислотных олигосахаридов, особенно влияя на структурные изомеры сиалиллакто-N-тетраозы, наиболее распространенными кислотными ОГМ являются 3'-сиалиллактоза (3'-SL- от англ. sialyllactose), 6'-сиалиллактоза (6'-SL), сиалиллакто-N-тетраоза a (LST-a - от англ. sialyllacto-N-tetraose), сиалиллакто-N-тетраоза b (LST-b), сиалиллакто-N-тетраоза с (LST-c) и дисиалиллакто-N-тетраоза (DSLNT - от англ. disialyllacto-N-tetraose).Although significant variations in the OGM profile among different donors preclude absolute quantification of acid oligosaccharides, particularly affecting the structural isomers of sialyllacto-N-tetraose, the most abundant acidic OHMs are 3'-sialyllactose (3'-SL- from sialyllactose ), 6'-sialyllactose (6'-SL), sialyllacto-N-tetraose a (LST-a - from English sialyllacto-N-tetraose), sialyllacto-N-tetraose b (LST-b), sialyllacto-N- tetraose c (LST-c) and disialyllacto-N-tetraose (DSLNT - from the English disialyllacto-N-tetraose).
В отношении сложности структуры сиалированных ОГМ (Фиг. 1), их химические или химико-ферментативные синтезы являются проблематичными и ассоциированы с огромными сложностями, например, контролем стереохимии, образованием специфичных связей, доступностью сырья. Наконец, несмотря на то, что совокупность таких способов синтеза была успешной для некоторых СОГМ, их дороговизна и неудовлетворительные выходы ограничивают рентабельное получение сиалированных ОГМ в коммерческих целях.Regarding the complexity of the structure of sialylated HMOs (Figure 1), their chemical or chemoenzymatic syntheses are problematic and are associated with enormous difficulties, for example, control of stereochemistry, formation of specific bonds, and availability of raw materials. Finally, although a combination of such synthetic routes have been successful for some SOHMs, their high cost and poor yields limit the cost-effective preparation of sialylated OHMs for commercial purposes.
В общем, метаболическое конструирование микроорганизмов с получением ОГМ представляет самый многообещающий подход к получению ОГМ в промышленном масштабе и оно уже было разработано для 2'-фукозиллактозы, 3-фукозиллактозы и 3'-сиалил-лактозы.In general, metabolic engineering of microorganisms to produce HGM represents the most promising approach to obtain HGM on an industrial scale and has already been developed for 2'-fucosyllactose, 3-fucosyllactose and 3'-sialyl-lactose.
Тем не менее, конструирование путей биосинтеза для получения ОГМ часто ограничено специфичностью и активностью гликозилтрансфераз, которые вовлечены в биосинтез желательного ОГМ, например, фукозил-, галактозил-, N-ацетил-глюкозаминил- или сиалилтрансфераз, особенно в пределах гетерологичной экспрессионной системы, такой как рекомбинантная бактериальная клетка.However, the design of biosynthetic pathways to produce OGM is often limited by the specificity and activity of the glycosyltransferases that are involved in the biosynthesis of the desired OGM, such as fucosyl-, galactosyl-, N-acetyl-glucosaminyl- or sialyltransferases, especially within a heterologous expression system such as recombinant bacterial cell.
К сожалению, гены, кодирующие человеческие сиалилтрансферазы, едва экспрессируются в прокариотических микроорганизмах. Таким образом, данные гены и ферменты неприменимы в биотехнологических способах, использующих генетически модифицированные бактериальные штаммы, как например, Escherichia coli, для получения сиалированных ОГМ.Unfortunately, the genes encoding human sialyltransferases are barely expressed in prokaryotic microorganisms. Thus, these genes and enzymes are not applicable in biotechnological methods that use genetically modified bacterial strains, such as Escherichia coli, to produce sialylated HMOs.
На настоящий момент идентифицировано и охарактеризовано несколько сиалилтрансфераз (SiaT) из видов бактерий, например, из Neisseria, Campylobacter, Pasteurella, Helicobacter и Photobacterium, а также из млекопитающих и вирусов. Сиалилтрансферазы обычно подразделяют на шесть семейств гликозилтрансфераз (GT - от англ. glycosyltransferase), в зависимости от сходства белковых последовательностей. При этом, все эукариотические и вирусные сиалилтрансферазы сгруппированы в семейство GT 29, тогда как бактериальные SiaT содержатся в группах GT4, GT38, GT42, GT52 или GT80. Кроме того, сиалилтрансферазы и полисиалилтрансферазы могут быть подразделены, благодаря связям, которые они образуют, например, на α-2,3-, α-2,6- и α-2,8-сиалилтрансферазы. Все данные сиалилтрансферазы переносят остаток сиаловой кислоты от цитидин 5'-монофосфат сиаловой кислоты (например, CMP-Neu5Ac) к множеству акцепторных молекул, обычно группировкам галактозы (Gal), N-ацетилгалактозамина (GalNAc) или N-ацетилглюкозамина (GlcNAc) или группировкам других сиаловых кислот (Sia).To date, several sialyltransferases (SiaTs) have been identified and characterized from bacterial species, such as Neisseria, Campylobacter, Pasteurella, Helicobacter and Photobacterium, as well as from mammals and viruses. Sialyltransferases are usually divided into six families of glycosyltransferases (GT - from English glycosyltransferase), depending on the similarity of protein sequences. Moreover, all eukaryotic and viral sialyltransferases are grouped into the GT 29 family, while bacterial SiaTs are contained in the groups GT4, GT38, GT42, GT52 or GT80. In addition, sialyltransferases and polysialyltransferases can be subdivided, due to the bonds they form, into, for example, α-2,3-, α-2,6- and α-2,8-sialyltransferases. All of these sialyltransferases transfer a sialic acid residue from a sialic acid cytidine 5'-monophosphate (e.g., CMP-Neu5Ac) to a variety of acceptor molecules, typically galactose (Gal), N-acetylgalactosamine (GalNAc) or N-acetylglucosamine (GlcNAc) moieties or other moieties sialic acids (Sia).
Несколько бактериальных сиалилтрансфераз хорошо охарактеризованы ранее и, как уже доказано, подходят для получения 3'-SL или 6'-SL. Смогла быть получена ничтожно малая сумма знаний о сиалилтрансферазах, делающих возможным синтез сиалированных пента- или гексасахаридов, таких как LST-a, LST-b или DSLNT, ограничивая, таким образом, разработку способа получения для какого-либо из данных СОГМ. Вследствие этого, недоступность количеств данных желательных олигосахаридов высокой чистоты препятствует широкой научной оценке их лечебных свойств.Several bacterial sialyltransferases have been well characterized previously and have been shown to be suitable for producing 3'-SL or 6'-SL. A negligible amount of knowledge has been able to be gained about the sialyltransferases that enable the synthesis of sialylated penta- or hexasaccharides such as LST-a, LST-b or DSLNT, thus limiting the development of a production route for any of these SOGMs. Consequently, the unavailability of high purity quantities of these desired oligosaccharides has prevented widespread scientific evaluation of their medicinal properties.
Таким образом, существует необходимость в экономически эффективных способах получения одного или боле СОГМ, особенно тетрасахаридов, пентасахаридов и гексасахаридов, обладающих одним или двумя остатками сиаловой кислоты, в больших количествах и высокой чистоты.Thus, there is a need for cost-effective methods for producing one or more SOGMs, especially tetrasaccharides, pentasaccharides and hexasaccharides having one or two sialic acid units, in large quantities and in high purity.
Краткое изложение сущности изобретенияSummary of the invention
Данная задача решается, помимо прочего, посредством идентификации и характеристики новых сиалилтрансфераз и их применения в получении сиалированных олигосахаридов грудного молока посредством цельноклеточной ферментации или биокатализа.This problem is addressed, among other things, through the identification and characterization of new sialyltransferases and their application in the production of sialylated human milk oligosaccharides through whole-cell fermentation or biocatalysis.
Согласно первому аспекту предложен способ получения сиалированных олигосахаридов, в котором для получения указанного сиалированного олигосахарида используется генетически модифицированная клетка. Указанная генетически модифицированная клетка содержит по меньшей мере одну гетерологичную сиалилтрансферазу, которая способна переносить остаток сиаловой кислоты от донорного субстрата к акцепторной молекуле, где указанная акцепторная молекула выбрана из группы, состоящей из лактозы, LNT-II и олигосахаридов грудного молока.According to a first aspect, a method for producing sialylated oligosaccharides is provided, in which a genetically modified cell is used to produce said sialylated oligosaccharide. Said genetically modified cell contains at least one heterologous sialyltransferase that is capable of transferring a sialic acid residue from a donor substrate to an acceptor molecule, where said acceptor molecule is selected from the group consisting of lactose, LNT-II and human milk oligosaccharides.
Согласно второму аспекту предложена генетически модифицированная клетка для применения в способе получения сиалированных олигосахаридов, в котором указанная генетически модифицированная клетка была генетически модифицирована для экспрессии гетерологичной сиалилтрансферазы, которая способна переносить остаток сиаловой кислоты от донорного субстрата к акцепторной молекуле, где указанная акцепторная молекула выбрана из группы, состоящей из лактозы, LNT-II и олигосахаридов грудного молока.According to a second aspect, a genetically modified cell is provided for use in a method for producing sialylated oligosaccharides, wherein said genetically modified cell has been genetically modified to express a heterologous sialyltransferase that is capable of transferring a sialic acid residue from a donor substrate to an acceptor molecule, wherein said acceptor molecule is selected from the group, consisting of lactose, LNT-II and breast milk oligosaccharides.
Согласно третьему аспекту предложена молекула рекомбинантной нуклеиновой кислоты для экспрессии гетерологичной сиалилтрансферазы при накоплении в клетке, где указанная сиалилтрансфераза способна переносить остаток сиаловой кислоты от донорного субстрата к акцепторной молекуле, где указанная акцепторная молекула выбрана из группы, состоящей из лактозы, LNT-II и олигосахаридов грудного молока.According to a third aspect, a recombinant nucleic acid molecule is provided for expressing a heterologous sialyltransferase when accumulated in a cell, wherein said sialyltransferase is capable of transferring a sialic acid residue from a donor substrate to an acceptor molecule, wherein said acceptor molecule is selected from the group consisting of lactose, LNT-II and breast oligosaccharides milk.
Согласно четвертому аспекту предложены сиалилтрансферазы, способные переносить остаток сиаловой кислоты от донорного субстрата к акцепторной молекуле, где указанная акцепторная молекула выбрана из группы, состоящей из лактозы, LNT-II и олигосахаридов грудного молока.According to a fourth aspect, sialyltransferases are provided that are capable of transferring a sialic acid residue from a donor substrate to an acceptor molecule, wherein said acceptor molecule is selected from the group consisting of lactose, LNT-II and human milk oligosaccharides.
Согласно пятому аспекту предложено применение сиалилтрансферазы, способной переносить остаток сиаловой кислоты от донорного субстрата к акцепторной молекуле, где указанная акцепторная молекула выбрана из группы, состоящей из лактозы, LNT-II и олигосахаридов грудного молока, для получения сиалированных олигосахаридов.A fifth aspect provides the use of a sialyltransferase capable of transferring a sialic acid residue from a donor substrate to an acceptor molecule, wherein said acceptor molecule is selected from the group consisting of lactose, LNT-II and human milk oligosaccharides, to produce sialylated oligosaccharides.
Согласно шестому аспекту предложен способ получения сиалированных олигосахаридов посредством биокатализа in vitro, в котором используют сиалилтрансферазу, причем указанная сиалилтрансфераза способна переносить остаток сиаловой кислоты от донорного субстрата к акцепторной молекуле, где указанная акцепторная молекула выбрана из группы, состоящей из лактозы, LNT-II и олигосахаридов грудного молока.According to a sixth aspect, a method is provided for the production of sialylated oligosaccharides by in vitro biocatalysis, which uses a sialyltransferase, wherein said sialyltransferase is capable of transferring a sialic acid residue from a donor substrate to an acceptor molecule, wherein said acceptor molecule is selected from the group consisting of lactose, LNT-II and oligosaccharides breast milk.
Согласно седьмому аспекту предложены сиалированные олигосахариды, получаемые способом согласно первому аспекту или способом согласно шестому аспекту.According to a seventh aspect, there are provided sialylated oligosaccharides produced by a method according to the first aspect or a method according to the sixth aspect.
Согласно восьмому аспекту предложено применение сиалированных олигосахаридов согласно седьмому аспекту для изготовления питательной композиции.According to the eighth aspect, there is provided the use of sialylated oligosaccharides according to the seventh aspect for the manufacture of a nutritional composition.
Согласно девятому аспекту предложена питательная композиция, содержащая по меньшей мере один сиалированный олигосахарид согласно седьмому аспекту.According to a ninth aspect, there is provided a nutritional composition comprising at least one sialylated oligosaccharide according to the seventh aspect.
Согласно десятому аспекту предложена детская смесь, содержащая по меньшей мере один сиалированный олигосахарид грудного молока.According to a tenth aspect, an infant formula is provided comprising at least one sialylated human milk oligosaccharide.
Краткое описание графических материаловBrief description of graphic materials
На Фиг. 1 изображены химические структуры наиболее распространенных кислотных олигосахаридов в грудном молоке: 3'-SL (А), 6'-SL (В), LST-a (С), LST-b (D), LST-c (Е) и DSLNT (F).In FIG. Figure 1 shows the chemical structures of the most common acid oligosaccharides in breast milk: 3'-SL (A), 6'-SL (B), LST-a (C), LST-b (D), LST-c (E) and DSLNT (F).
На Фиг. 2 показаны карты плазмид pDEST14 и рЕТ11, сверхэкспрессирующих гены сиалилтрансфераз.In FIG. Figure 2 shows maps of plasmids pDEST14 and pET11, overexpressing sialyltransferase genes.
На Фиг. 3 показана продукция in vivo сиалиллакто-N-тетраозы-а и сиалиллакто-N-тетраозы-b (выделена стрелками) благодаря сверхэкспрессии подходящих сиалилтрансфераз, где изображено разделение внутриклеточной фракции siaT9- или siaT19-сверхэкспрессирующих клеток Е. coli BL21(DE3) #2130 посредством тонкослойной хроматографии.In FIG. Figure 3 shows the in vivo production of sialyllacto-N-tetraose-a and sialyllacto-N-tetraose-b (highlighted by arrows) due to overexpression of the appropriate sialyltransferases, which depicts the separation of the intracellular fraction of siaT9- or siaT19-overexpressing E. coli BL21(DE3) #2130 cells through thin layer chromatography.
Подробное описаниеDetailed description
В попытке идентифицировать сиалилтрансферазы, которые подходят для применения в способе изготовления сиалированного ОГМ, исследовали базы данных нуклеиновых кислот и базы данных белков. Сто предположительных сиалилтрансфераз идентифицировали посредством сходства последовательностей с известными гликозилтрансферазами. Указанные предположительные сиалилтрансферазы оценивали в отношении активности сиалилтрансфераз.In an attempt to identify sialyltransferases that are suitable for use in the process for making sialylated OGM, nucleic acid databases and protein databases were searched. One hundred putative sialyltransferases were identified by sequence similarity to known glycosyltransferases. These putative sialyltransferases were evaluated for sialyltransferase activity.
Согласно первому аспекту предложен способ получения сиалированного олигосахарида, включающий следующие стадии:According to a first aspect, there is provided a method for producing a sialylated oligosaccharide, comprising the following steps:
a) предоставление по меньшей мере одной генетически модифицированной клетки, которая содержит гетерологичную сиалилтрансферазу, причем указанная гетерологичная сиалилтрансфераза способна обладать α-2,3-сиалилтрансферазной активностью и/или α-2,6-сиалилтрансферазной активностью для переноса остатка сиаловой кислоты от СМР-активированной формы в качестве донорного субстрата к акцепторной молекуле, выбранной из группы, состоящей из лактозы, LNT-II и олигосахаридов грудного молока;a) providing at least one genetically modified cell that contains a heterologous sialyltransferase, wherein said heterologous sialyltransferase is capable of having α-2,3-sialyltransferase activity and/or α-2,6-sialyltransferase activity to transfer a sialic acid residue from the CMP-activated forms as a donor substrate to an acceptor molecule selected from the group consisting of lactose, LNT-II and human milk oligosaccharides;
b) культивирование по меньшей мере одной генетически модифицированной клетки в ферментационном бульоне и в условиях, пермиссивных для продукции указанного сиалированного олигосахарида; иb) cultivating at least one genetically modified cell in a fermentation broth and under conditions permissive for the production of said sialylated oligosaccharide; And
c) выделение указанного сиалированного олигосахарида.c) isolating said sialylated oligosaccharide.
Способ представляет собой способ получения сиалированного олигосахарида.The method is a method for producing sialylated oligosaccharide.
Термин «олигосахарид», в том виде, в котором он используется в данном документе, относится к полимерам из моносахаридных остатков, где указанные полимеры содержат по меньшей мере три моносахаридных остатка, но не больше чем 10 моносахаридных остатков, предпочтительно не больше чем 7 моносахаридных остатков. Олигосахариды представляют собой или линейную цепь моносахаридов или являются разветвленными. Кроме того, моносахаридные остатки олигосахаридов могут характеризоваться целым рядом химических модификаций. Соответственно, олигосахариды могу содержать одну или более несахаридных группировок.The term "oligosaccharide", as used herein, refers to polymers of monosaccharide residues, wherein said polymers contain at least three monosaccharide residues, but no more than 10 monosaccharide residues, preferably no more than 7 monosaccharide residues . Oligosaccharides are either a linear chain of monosaccharides or are branched. In addition, the monosaccharide residues of oligosaccharides can be characterized by a number of chemical modifications. Accordingly, oligosaccharides may contain one or more non-saccharide moieties.
Термин «сиалированный олигосахарид», в том виде, в котором он используется в данном документе, относится к олигосахаридам, содержащим один или более остатков сиаловой кислоты. В предпочтительном воплощении остаток сиаловой кислоты представляет собой остаток N-ацетилнейраминовой кислоты (Neu5Ac). Остаток N-ацетилнейраминовой кислоты обычно переносится с СМР-Neu5Ac в качестве донорного субстрата на акцепторную молекулу.The term “sialylated oligosaccharide”, as used herein, refers to oligosaccharides containing one or more sialic acid residues. In a preferred embodiment, the sialic acid residue is an N-acetylneuraminic acid residue (Neu5Ac). The N-acetylneuraminic acid residue is usually transferred from CMP-Neu5Ac as a donor substrate to an acceptor molecule.
Способ получения сиалированного олигосахарида включает стадию предоставления генетически модифицированной клетки, содержащей гетерологичную сиалилтрансферазу, которая способна обладать α-2,3-сиалилтрансферазной активностью и/или α-2,6-сиалилтрансферазной активностью.A method for producing a sialylated oligosaccharide includes the step of providing a genetically modified cell containing a heterologous sialyltransferase that is capable of having α-2,3-sialyltransferase activity and/or α-2,6-sialyltransferase activity.
Генетически модифицированная клетка представляет собой прокариотическую клетку или эукариотическую клетку. Предпочтительно, генетически модифицированная клетка представляет собой микробную клетку. Соответствующие микробные клетки включают дрожжевые клетки, бактериальные клетки, клетки архебактерий, клетки водорослей и клетки грибов.A genetically modified cell is a prokaryotic cell or a eukaryotic cell. Preferably, the genetically modified cell is a microbial cell. Suitable microbial cells include yeast cells, bacterial cells, archaebacterial cells, algal cells and fungal cells.
В дополнительном и/или альтернативном воплощении микробная клетка представляет собой прокариотическую клетку, предпочтительно бактериальную клетку, более предпочтительно бактериальную клетку, выбранную из группы, состоящей из Bacillus, Lactobacillus, Lactococcus, Enterococcus, Bifidobacterium, Sporolactobacillus spp., Micromomospora spp., Micrococcus spp., Rhodococcus spp., and Pseudomonas. Подходящие виды бактерий представляют собой Bacillus subtilis, Bacillus licheniformis, Bacillus coagulans, Bacillus thermophilus, Bacillus laterosporus, Bacillus megaterium, Bacillus mycoides, Bacillus pumilus, Bacillus lentus, Bacillus cereus, Bacillus circulans, Bifidobacterium longum, Bifidobacterium infantis, Bifidobacterium bifidum, Citrobacter freundii, Clostridium cellulolyticum, Clostridium ljungdahlii, Clostridium autoethanogenum, Clostridium acetobutylicum, Corynebacterium glutamicum, Enterococcus faecium, Enterococcus thermophiles, Escherichia coli, Erwinia herbicola (Pantoea agglomerans), Lactobacillus acidophilus, Lactobacillus salivarius, Lactobacillus plantarum, Lactobacillus helveticus, Lactobacillus delbrueckii, Lactobacillus rhamnosus, Lactobacillus bulgaricus, Lactobacillus crispatus, Lactobacillus gasseri, Lactobacillus casei, Lactobacillus reuteri, Lactobacillus jensenii, Lactococcus lactis, Pantoea citrea, Pectobacterium carotovorum, Proprionibacterium freudenreichii, Pseudomonas fluorescens, Pseudomonas aeruginosa, Streptococcus thermophiles и Xanthomonas campestris.In a further and/or alternative embodiment, the microbial cell is a prokaryotic cell, preferably a bacterial cell, more preferably a bacterial cell selected from the group consisting of Bacillus, Lactobacillus, Lactococcus, Enterococcus, Bifidobacterium, Sporolactobacillus spp., Micromomospora spp., Micrococcus spp. , Rhodococcus spp., and Pseudomonas. Suitable bacterial species include Bacillus subtilis, Bacillus licheniformis, Bacillus coagulans, Bacillus thermophilus, Bacillus laterosporus, Bacillus megaterium, Bacillus mycoides, Bacillus pumilus, Bacillus lentus, Bacillus cereus, Bacillus circulans, Bifidobacterium longum, Bifidobacterium infantis, Bifidobacterium bifidum, Citrobacter fre undii Clostridium cellulolyticum, Clostridium ljungdahlii, Clostridium autoethanogenum, Clostridium acetobutylicum, Corynebacterium glutamicum, Enterococcus faecium, Enterococcus thermophiles, Escherichia coli, Erwinia herbicola (Pantoea agglomerans), Lactobacillus acidophilus, Lactobacillus salivarius, Lactobacillus plantar um, Lactobacillus helveticus, Lactobacillus delbrueckii, Lactobacillus rhamnosus, Lactobacillus bulgaricus, Lactobacillus crispatus, Lactobacillus gasseri, Lactobacillus casei, Lactobacillus reuteri, Lactobacillus jensenii, Lactococcus lactis, Pantoea citrea, Pectobacterium carotovorum, Proprionibacterium freudenreichii, Pseudomonas fluorescens, Pseudomonas aeruginosa, ococcus thermophiles and Xanthomonas campestris.
В альтернативном воплощении эукариотическая клетка представляет собой дрожжевую клетку, клетку насекомого, растительную клетку или клетку млекопитающего. Дрожжевая клетка предпочтительно выбрана из группы, состоящей из Saccharomyces sp., в частности, Saccharomyces cerevisiae, Saccharomycopsis sp., Pichia sp., в частности Pichia pastoris, Hansenula sp., Kluyveromyces sp., Yarrowia sp., Rhodotorula sp.и Schizosaccharomyces sp.In an alternative embodiment, the eukaryotic cell is a yeast cell, an insect cell, a plant cell, or a mammalian cell. The yeast cell is preferably selected from the group consisting of Saccharomyces sp., in particular Saccharomyces cerevisiae, Saccharomycopsis sp., Pichia sp., in particular Pichia pastoris, Hansenula sp., Kluyveromyces sp., Yarrowia sp., Rhodotorula sp. and Schizosaccharomyces sp. .
Генетически модифицированная клетка была генетически модифицирована с возможностью содержать гетерологичную сиалилтрансферазу.The genetically modified cell has been genetically modified to contain a heterologous sialyltransferase.
Термин «генетически модифицированный», в том виде, в котором он используется в данном документе, относится к модификации организации генома клетки с использованием биологических способов. Модификация организации генома клетки может включать перенос генов в пределах и/или через видовые границы, осуществление вставки, осуществление делеции, осуществление замены и/или осуществление модификации нуклеотидов, триплетов, генов, открытых рамок считывания, промоторов, энхансеров, терминаторов и других нуклеотидных последовательностей, опосредующих и/или контролирующих экспрессию генов. Модификацая организации генома клетки нацелена на создание генетически модифицированного организма, обладающего конкретными, желательными свойствами. Генетически модифицированные клетки могут содержать один или более генов, которые отсутствуют в нативной (генетически немодифицированной) форме клетки. Методики введения экзогенных молекул нуклеиновых кислот и/или осуществления вставки экзогенных молекул нуклеиновых кислот (рекомбинантных. гетерологичных) в наследственную информацию клетки для осуществления вставки, осуществления делеции или изменения нуклеотидной последовательности генетической информации клетки хорошо известны специалисту в данной области. Генетически модифицированные клетки могут содержать один или более генов, которые находятся в нативной форме клетки, где указанные гены модифицируют и повторно вводят в клетку посредством искусственных средств. Термин «генетически модифицированный» также охватывает клетки, которые содержат молекулу нуклеиновой кислоты, являющуюся эндогенной в отношении данной клетки и которая была модифицирована без удаления молекулы нуклеиновой кислоты из данной клетки. Такие модификации включают модификации, полученные посредством замены гена, сайт-специфичных мутаций и родственных методик.The term “genetically modified”, as used herein, refers to modification of the genome organization of a cell using biological means. Modification of the genome organization of a cell may include the transfer of genes within and/or across species boundaries, insertion, deletion, substitution and/or modification of nucleotides, triplets, genes, open reading frames, promoters, enhancers, terminators and other nucleotide sequences, mediating and/or controlling gene expression. Modifying the organization of a cell's genome is aimed at creating a genetically modified organism that has specific, desirable properties. Genetically modified cells may contain one or more genes that are not present in the native (non-genetically modified) form of the cell. Methods for introducing exogenous nucleic acid molecules and/or inserting exogenous nucleic acid molecules (recombinant, heterologous) into the hereditary information of a cell to insert, perform a deletion or change the nucleotide sequence of the genetic information of a cell are well known to a person skilled in the art. Genetically modified cells may contain one or more genes that are found in the native form of the cell, wherein said genes are modified and reintroduced into the cell through artificial means. The term “genetically modified” also includes cells that contain a nucleic acid molecule that is endogenous to the cell and that has been modified without removing the nucleic acid molecule from the cell. Such modifications include modifications obtained through gene replacement, site-specific mutations and related techniques.
Генетически модифицированная клетка содержит гетерологичную сиалилтрансферазу.A genetically modified cell contains a heterologous sialyltransferase.
Термин «сиалилтрансфераза», в том виде, в котором он используется в данном документе, относится к полипептидам, способным обладать сиалилтрансферазной активностью. «Сиалилтрансферазная активность» означает перенос остатка сиаловой кислоты, предпочтительно остатка N-ацетилнейраминовой кислоты (Neu5Ac), от донорного субстрата к акцепторной молекуле. Термин «сиалилтрансфераза» включает функциональные фрагменты сиалилтрансфераз, описанных в данном документе, функциональные варианты сиалилтрансфераз, описанных в данном документе, и функциональные фрагменты функциональных вариантов. «Функциональный» в данном отношении означает, что фрагменты и/или варианты обладают сиалилтрансферазной активностью. Функциональные фрагменты сиалилтрансферазы охватывают усеченные версии сиалилтрансферазы, как кодируется ее встречающимся в природе геном, усеченная версия которой способна обладать сиалилтрансферазной активностью. Примеры усеченных версий представляют собой сиалилтрансферазы, которые не содержат так называемой лидерной последовательности, которая обычно направляет полипептид в конкретную внутриклеточную локализацию. Обычно, такие лидерные последовательности удаляются из полипептида во время его внутриклеточной транспортировки и также отсутствуют в встречающейся в природе зрелой сиалилтрансферазе.The term “sialyltransferase”, as used herein, refers to polypeptides capable of having sialyltransferase activity. "Sialyltransferase activity" means the transfer of a sialic acid residue, preferably an N-acetylneuraminic acid residue (Neu5Ac), from a donor substrate to an acceptor molecule. The term “sialyltransferase” includes functional fragments of the sialyltransferases described herein, functional variants of the sialyltransferases described herein, and functional fragments of functional variants. "Functional" in this regard means that the fragments and/or variants have sialyltransferase activity. Functional sialyltransferase fragments comprise truncated versions of the sialyltransferase as encoded by its naturally occurring gene, the truncated version of which is capable of possessing sialyltransferase activity. Examples of truncated versions are sialyltransferases that do not contain the so-called leader sequence that typically directs the polypeptide to a specific subcellular location. Typically, such leader sequences are removed from the polypeptide during intracellular transport and are also absent from the naturally occurring mature sialyltransferase.
Гетерологичная сиалилтрансфераза способна переносить остаток сиаловой кислоты от донорного субстрата к акцепторной молекуле. Термин «способный к» в отношении гетерологичной сиалилтрансферазы относится к сиалилтрансферазной активности гетерологичной сиалилтрансферазы и условию, что подходящие условия реакции необходимы для того, чтобы гетерологичная сиалилтрансфераза обладала своей ферментативной активностью. При отсутствии подходящих условий реакции гетерологичная сиалилтрансфераза не обладает своей ферментативной активностью, а сохраняет свою ферментативную активность и обладает своей ферментативной активностью, когда подходящие условия реакции восстанавливаются. Подходящие условия реакции включают наличие подходящего донорного субстрата, наличие подходящей акцепторной молекулы, наличие важнейших кофакторов, таких как, например, одновалентные или двухвалентные ионы, значение рН в соответствующем диапазоне, подходящую температуру и тому подобное. Необязательно, чтобы были удовлетворены оптимальные значения для абсолютно всех факторов, воздействующих на ферментативную реакцию гетерологичной сиалилтрансферазы, но условия реакции должны быть такими, чтобы гетерологичная сиалилтрансфераза осуществляла свою ферментативную активность. Соответственно, термин «способный к» исключает какие-либо условия, при которых ферментативная активность данной гетерологичной сиалилтрансферазы была бы необратимо нарушена, и также исключал воздействие гетерологичной сиалилтрансферазы на любое такое условие. Напротив, термин «способный к» означает, что сиалилтрансфераза является ферментативно активной, то есть обладает своей сиалилтрансферазной активностью, если для сиалилтрансферазы обеспечиваются пермиссивные условия реакции (где все требования являются необходимыми для осуществления сиалилтрансферазой своей ферментативной активности).A heterologous sialyltransferase is capable of transferring a sialic acid residue from a donor substrate to an acceptor molecule. The term “capable of” in relation to a heterologous sialyltransferase refers to the sialyltransferase activity of the heterologous sialyltransferase and the condition that suitable reaction conditions are necessary for the heterologous sialyltransferase to possess its enzymatic activity. In the absence of suitable reaction conditions, the heterologous sialyltransferase does not have its enzymatic activity, but retains its enzymatic activity and has its enzymatic activity when suitable reaction conditions are restored. Suitable reaction conditions include the presence of a suitable donor substrate, the presence of a suitable acceptor molecule, the presence of essential cofactors such as, for example, monovalent or divalent ions, a pH value in the appropriate range, a suitable temperature, and the like. It is not necessary that the optimum values for absolutely all factors affecting the enzymatic reaction of the heterologous sialyltransferase be satisfied, but the reaction conditions must be such that the heterologous sialyltransferase carries out its enzymatic activity. Accordingly, the term “capable of” excludes any conditions under which the enzymatic activity of a given heterologous sialyltransferase would be irreversibly impaired, and also excludes the effect of the heterologous sialyltransferase on any such condition. In contrast, the term “capable of” means that the sialyltransferase is enzymatically active, that is, it possesses its sialyltransferase activity if permissive reaction conditions are provided for the sialyltransferase (where all requirements are necessary for the sialyltransferase to perform its enzymatic activity).
Сиалилтрансферазы могут различаться по типу связи с сахаром, которую они образуют.В том виде, в котором они используются в данном документе, термины «α-2,3-сиалилтрансфераза» и «α-2,3-сиалилтрансферазная активность» относятся к полипептидам и их ферментативной активности, которые добавляют остаток сиаловой кислоты с альфα-2,3 связью к галактозе или остатку галактозы акцепторной молекулы. Аналогичным образом, термины «α-2,6-сиалилтрансфераза» и «α-2,6-сиалилтрансферазная активность» относятся к полипептидам и их ферментативной активности, которые добавляют остаток сиаловой кислоты с альфα-2,6 связью к галактозе, N-ацетилгалактозамину и/или N-ацетилглюкозамину, остатку галактозы или остатку N-ацетилгалактозамина и/или остатку N-ацетилгалактозамина и/или остатку N-ацетилглюкозамина акцепторной молекулы. Аналогичным образом, термины «α-2,8-сиалилтрансфераза» и «α-2,8-сиалилтрансферазная активность» относятся к полипептидам и их ферментативной активности, которые добавляют остаток сиаловой кислоты с альфа-2,8 связью к галактозе, N-ацетилгалактозамину и/или N-ацетилглюкозамину, остатку галактозы или остатку N-ацетилгалактозамина и/или остатку N-ацетилглюкозамина акцепторной молекулы.Sialyltransferases can differ in the type of sugar bond they form. As used herein, the terms "α-2,3-sialyltransferase" and "α-2,3-sialyltransferase activity" refer to polypeptides and their enzymatic activities that add a sialic acid residue with an alpha-2,3 linkage to galactose or a galactose residue of the acceptor molecule. Likewise, the terms "α-2,6-sialyltransferase" and "α-2,6-sialyltransferase activity" refer to polypeptides and their enzymatic activities that add a sialic acid residue with an α-2,6 linkage to galactose, N-acetylgalactosamine and/or N-acetylglucosamine, a galactose residue or an N-acetylgalactosamine residue and/or an N-acetylgalactosamine residue and/or an N-acetylglucosamine residue of the acceptor molecule. Likewise, the terms "α-2,8-sialyltransferase" and "α-2,8-sialyltransferase activity" refer to polypeptides and their enzymatic activities that add an alpha-2,8-linked sialic acid residue to galactose, N-acetylgalactosamine and/or N-acetylglucosamine, a galactose residue or an N-acetylgalactosamine residue and/or an N-acetylglucosamine residue of the acceptor molecule.
Термин «гетерологичный», в том виде, в котором он используется в данном документе, относится к полипептиду, аминокислотной последовательности, молекуле нуклеиновой кислоты или нуклеотидной последовательности, которая является чужеродной для клетки или организма, то есть к полипептиду, аминокислотной последовательности, молекуле нуклеиновой кислоты или нуклеотидной последовательности, которая в природе не встречается в указанной клетке или организме. Термин «гетерологичная последовательность» или «гетерологичная нуклеиновая кислота» или «гетерологичный пептид», в том виде, в котором он используется в данном документе, представляет собой последовательность или нуклеиновую кислоту или пептид, который происходит из источника, чужеродного для конкретной клетки - хозяина (например, из другого вида), или, если из того же источника, является модифицированным, по сравнению с его исходной формой. Таким образом, гетерологичная нуклеиновая кислота, функционально связанная с промотором, происходит из источника, отличного от источника, из которого происходил промотор, или, если из того же источника, является модифицированной, по сравнению со своей исходной формой. Гетерологичная последовательность может стабильно вводиться, например, посредством трансфекции, трансформации, конъюгации или трансдукции, в геном микробной клетки-хозяина, таким образом, представляя генетически модифицированную клетку-хозяина. Можно применять методики, которые будут зависеть от клетки-хозяина, последовательности, которая подлежит вставке. Специалисту в данной области известны разные методики, и они раскрыты, например, в Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). Соответственно, «гетерологичный пептид» представляет собой пептид, который в природе не встречается в клетке, и «гетерологичная сиалилтрансфераза» представляет собой сиалилтрансферазу, которая в природе не встречается в данной клетке.The term "heterologous", as used herein, refers to a polypeptide, amino acid sequence, nucleic acid molecule or nucleotide sequence that is foreign to a cell or organism, that is, a polypeptide, amino acid sequence, nucleic acid molecule or a nucleotide sequence that does not naturally occur in the specified cell or organism. The term "heterologous sequence" or "heterologous nucleic acid" or "heterologous peptide", as used herein, is a sequence or nucleic acid or peptide that is derived from a source foreign to a particular host cell ( for example, from another species), or, if from the same source, is modified from its original form. Thus, the heterologous nucleic acid operably linked to a promoter comes from a source different from that of the promoter or, if from the same source, is modified from its original form. The heterologous sequence can be stably introduced, for example, by transfection, transformation, conjugation or transduction, into the genome of a microbial host cell, thereby representing a genetically modified host cell. Techniques can be used that will depend on the host cell, the sequence to be inserted. Various techniques are known to one skilled in the art and are disclosed, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). Accordingly, a “heterologous peptide” is a peptide that does not naturally occur in a cell, and a “heterologous sialyltransferase” is a sialyltransferase that does not naturally occur in a given cell.
Гетерологичная сиалилтрансфераза способна переносить остаток сиаловой кислоты, например, остаток N-ацетилнейраминовой кислоты (Neu5Ac), от донорного субстрата, например, CMP-Neu5Ac, к акцепторной молекуле. Акцепторная молекула представляет собой лактозу, лакто-N-триозу II (LNT-II) или олигосахарид, выбранный из группы, состоящей из олигосахаридов грудного молока.A heterologous sialyltransferase is capable of transferring a sialic acid residue, such as an N-acetylneuraminic acid residue (Neu5Ac), from a donor substrate, such as CMP-Neu5Ac, to an acceptor molecule. The acceptor molecule is lactose, lacto-N-triose II (LNT-II) or an oligosaccharide selected from the group consisting of human milk oligosaccharides.
В дополнительном и/или альтернативном воплощении акцепторная молекула представляет собой олигосахарид грудного молока, выбранный из группы, состоящей из трисахаридов, тетрасахаридов и пентасахаридов.In a further and/or alternative embodiment, the acceptor molecule is a human milk oligosaccharide selected from the group consisting of trisaccharides, tetrasaccharides and pentasaccharides.
В дополнительном и/или альтернативном воплощении акцепторная молекула представляет собой олигосахарид грудного молока, выбранный из группы, состоящей из лакто-N-тетраозы, лакто-N-неотетраозы, LST-a и LST-b.In an additional and/or alternative embodiment, the acceptor molecule is a human milk oligosaccharide selected from the group consisting of lacto-N-tetraose, lacto-N-neotetraose, LST-a and LST-b.
В одном воплощении гетерологичная сиалилтрансфераза выбрана из группы, состоящей из:In one embodiment, the heterologous sialyltransferase is selected from the group consisting of:
I. полипептидов, содержащих или состоящих из аминокислотной последовательности, как представлено любой из SEQ ID NO: 1-33;I. polypeptides containing or consisting of an amino acid sequence as represented by any of SEQ ID NO: 1-33;
II. полипептидов, содержащих или состоящих из аминокислотной последовательности, имеющей сходство последовательности, составляющее по меньшей мере 80%, с любой из аминокислотных последовательностей, как представлено любой из SEQ ID NO: 1-33; иII. polypeptides containing or consisting of an amino acid sequence having at least 80% sequence similarity to any of the amino acid sequences as represented by any of SEQ ID NO: 1-33; And
III. фрагментов любого из полипептидов I. и II.III. fragments of any of polypeptides I. and II.
В дополнительном и/или альтернативном воплощении генетически модифицированная клетка трансформирована с возможностью содержать молекулу нуклеиновой кислоты, которая содержит нуклеотидную последовательность, кодирующую гетерологичную сиалилтрансферазу. Предпочтительно, нуклеотидная последовательность выбрана из группы, состоящей из:In a further and/or alternative embodiment, the genetically modified cell is transformed to contain a nucleic acid molecule that contains a nucleotide sequence encoding a heterologous sialyltransferase. Preferably, the nucleotide sequence is selected from the group consisting of:
i. нуклеотидных последовательностей, кодирующих полипептид, как представлено любой из SEQ ID NO: 1-33;i. nucleotide sequences encoding a polypeptide as represented by any of SEQ ID NO: 1-33;
ii. нуклеотидных последовательностей, как представлено любой из SEQ ID NO: 34-66;ii. nucleotide sequences as represented by any of SEQ ID NO: 34-66;
iii. нуклеотидных последовательностей, имеющих по меньшей мере 80%-ое сходство последовательностей с одной из нуклеотидных последовательностей, кодирующих полипептид, как представлено любой из SEQ ID NO: 1-33;iii. nucleotide sequences having at least 80% sequence similarity to one of the nucleotide sequences encoding a polypeptide as represented by any of SEQ ID NO: 1-33;
iv. нуклеотидных последовательностей, имеющих сходство последовательностей по меньшей мере 80% с любой из нуклеотидных последовательностей, представленных SEQ ID NO: 34-66;iv. nucleotide sequences having sequence similarity of at least 80% with any of the nucleotide sequences represented by SEQ ID NO: 34-66;
v. нуклеотидных последовательностей, которые комплементарны любой из нуклеотидных последовательностей i., ii., iii. и iv; иv. nucleotide sequences that are complementary to any of the nucleotide sequences i., ii., iii. and iv; And
vi. фрагментов любой из нуклеотидных последовательностей i., ii., iii., iv. и v. Выражение «любая из SEQ ID NO: 1-33» относится к любой из группы,vi. fragments of any of the nucleotide sequences i., ii., iii., iv. and v. The expression "any of SEQ ID NO: 1-33" refers to any of the group
состоящей из SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13. SEQ ID NO: 14. SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32 и SEQ ID NO: 33. Такой же принцип относится к выражению «любая из SEQ ID NO: 34-66». В общем, выражение «любая из SEQ ID NO: X - Z», где «X» и «Z» представляют натуральное число, относится ко всем последовательностям (нуклеотидным последовательностям или аминокислотным последовательностям), представленным любой из «SEQ ID NO», содержащим идентификационный номер от X до Z.consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13. SEQ ID NO: 14. SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32 and SEQ ID NO: 33. The same principle applies to the expression “any of SEQ ID NO: 34-66”. In general, the expression "any of SEQ ID NO: X - Z", where "X" and "Z" represent a natural number, refers to all sequences (nucleotide sequences or amino acid sequences) represented by any of the "SEQ ID NO" containing identification number from X to Z.
Кроме того, генетически модифицированную клетку генетически модифицировали для экспрессии нуклеотидной последовательности, кодирующей гетерологичную сиалилтрансферазу. В связи с этим, нуклеотидная последовательность, кодирующая гетерологичную сиалилтрансферазу, функционально связана с по меньшей мере одним контролем экспрессии, влияющим на транскрипцию и/или трансляцию указанной нуклеотидной последовательности, кодирующей гетерологичную сиалилтрансферазу, в генетически модифицированной клетке.In addition, the genetically modified cell was genetically modified to express a nucleotide sequence encoding a heterologous sialyltransferase. In this regard, the nucleotide sequence encoding the heterologous sialyltransferase is operably linked to at least one expression control affecting the transcription and/or translation of the nucleotide sequence encoding the heterologous sialyltransferase in the genetically modified cell.
Термин «функционально связанный», в том виде, в котором он используется в данном документе, относится к функциональной связи между нуклеотидной последовательностью, кодирующей гетерологичную сиалилтрансферазу, и второй нуклеотидной последовательностью, нуклеотидной последовательностью контроля экспрессии (такой как промотор, оператор, энхансер, регулятор, целый ряд сайтов связывания факторов транскрипции, терминатор транскрипции, сайт связывания рибосомы), где последовательность контроля экспрессии влияет на транскрипцию и/или трансляцию нуклеиновой кислоты, соответствующей нуклеотидной последовательности, кодирующей гетерологичную сиалилтрансферазу. Соответственно, термин «промотор» обозначает последовательности ДНК, которые обычно «предшествуют» гену в полимере ДНК и предоставляют сайт для инициации транскрипции в мРНК. «Регуляторные» последовательности ДНК, также обычно «расположенные до» (то есть, предшествующие) гена в данном полимере ДНК, связывают белки, которые определяют частоту (или скорость) инициации транскрипции. Совместно называемые «промоторными/регуляторными» или «контрольными» последовательностями ДНК, данные последовательности, которые предшествуют выбранному гену (или серии генов) в функциональном полимере ДНК, содействуют определению того, будет ли происходить транскрипция (или возможная транскрипция) гена. Последовательности ДНК, которые «следуют за» геном в ДНК-полимере и обеспечивают сигнал для терминации транскрипции в мРНК, называются последовательностями, «терминирующими»транскрипцию.The term "operably linked", as used herein, refers to a functional relationship between a nucleotide sequence encoding a heterologous sialyltransferase and a second nucleotide sequence, an expression control nucleotide sequence (such as a promoter, operator, enhancer, regulator, a number of transcription factor binding sites, a transcription terminator, a ribosome binding site), where the expression control sequence influences the transcription and/or translation of a nucleic acid corresponding to the nucleotide sequence encoding a heterologous sialyltransferase. Accordingly, the term “promoter” refers to DNA sequences that typically “precede” a gene in a DNA polymer and provide a site for initiation of transcription into mRNA. "Regulatory" DNA sequences, also usually "upstream" (that is, upstream) of a gene in a given DNA polymer, bind proteins that determine the frequency (or rate) of transcription initiation. Collectively referred to as “promoter/regulatory” or “control” DNA sequences, these sequences that precede a selected gene (or series of genes) in a functional DNA polymer help determine whether transcription (or possible transcription) of the gene will occur. DNA sequences that “follow” a gene in a DNA polymer and provide a signal to terminate transcription in mRNA are called transcription termination sequences.
В одном воплощении генетически модифицированная клетка содержит гетерологичную сиалилтрансферазу, способную обладать α-2,3-сиалилтрансферазной активностью, и олигосахарид грудного молока представляет собой LNT. Полученный таким образом сиалированный олигосахарид представляет собой LST-a.In one embodiment, the genetically modified cell contains a heterologous sialyltransferase capable of having α-2,3-sialyltransferase activity, and the human milk oligosaccharide is LNT. The sialylated oligosaccharide thus obtained is LST-a.
В дополнительном и/или альтернативном воплощении гетерологичная сиалилтрансфераза, способная обладать α-2,3-сиалилтрансферазной активностью, выбрана из группы, состоящей из:In a further and/or alternative embodiment, the heterologous sialyltransferase capable of having α-2,3-sialyltransferase activity is selected from the group consisting of:
I. полипептидов, содержащих или состоящих из аминокислотной последовательности, как представлено любой из SEQ ID NO: 1-27;I. polypeptides containing or consisting of an amino acid sequence as represented by any of SEQ ID NO: 1-27;
II. полипептидов, содержащих или состоящих из аминокислотной последовательности, имеющей сходство по меньшей мере 80%, с любой из аминокислотных последовательностей, как представлено любой из SEQ ID NO: 1-27; иII. polypeptides containing or consisting of an amino acid sequence having at least 80% similarity to any of the amino acid sequences as represented by any of SEQ ID NO: 1-27; And
III. фрагментов любого из полипептидов I. и II.III. fragments of any of polypeptides I. and II.
В дополнительном и/или альтернативном воплощении генетически модифицированная клетка содержит молекулу рекомбинантной или синтетической нуклеиновой кислоты, которая содержит по меньшей мере одну нуклеотидную последовательность, кодирующую указанную гетерологичную сиалилтрансферазу, способную обладать α-2,3-сиалилтрансферазной активностью, где указанная по меньшей мере одна нуклеотидная последовательность выбрана из группы, состоящей из:In an additional and/or alternative embodiment, the genetically modified cell contains a recombinant or synthetic nucleic acid molecule that contains at least one nucleotide sequence encoding said heterologous sialyltransferase capable of having α-2,3-sialyltransferase activity, wherein said at least one nucleotide sequence the sequence is selected from the group consisting of:
i. нуклеотидных последовательностей, кодирующих полипептид, как представлено любой из SEQ ID NO: 1-27;i. nucleotide sequences encoding a polypeptide as represented by any of SEQ ID NO: 1-27;
ii. нуклеотидных последовательностей, как представлено любой из SEQ ID NO: 34-60;ii. nucleotide sequences as represented by any of SEQ ID NO: 34-60;
iii. нуклеотидных последовательностей, имеющих по меньшей мере 80%-ое сходство последовательностей с одной из нуклеотидных последовательностей, кодирующих полипептид, как представлено любой из SEQ ID NO: 1-27;iii. nucleotide sequences having at least 80% sequence similarity to one of the nucleotide sequences encoding a polypeptide as represented by any of SEQ ID NO: 1-27;
iv. нуклеотидных последовательностей, имеющих сходство последовательностей, составляющее по меньшей мере 80%, с любой из нуклеотидных последовательностей, представленных SEQ ID NO: 34-60;iv. nucleotide sequences having sequence similarity of at least 80% to any of the nucleotide sequences represented by SEQ ID NOs: 34-60;
v. нуклеотидных последовательностей, которые комплементарны любой из нуклеотидных последовательностей i., ii., iii. и iv; иv. nucleotide sequences that are complementary to any of the nucleotide sequences i., ii., iii. and iv; And
vi. фрагментов любой из нуклеотидных последовательностей i, ii, iii, iv и v.vi. fragments of any of the nucleotide sequences i, ii, iii, iv and v.
В дополнительном и/или альтернативном воплощении гетерологичная сиалилтрансфераза, способная обладать α-2,3-сиалилтрансферазной активностью, обладает относительной эффективностью по меньшей мере 100-кратной, по меньшей мере 200-кратной, по меньшей мере 300-кратной, по меньшей мере 1000-кратной, по меньшей мере 10000-кратной, по сравнению с относительной эффективностью SiaT16, как представлено SEQ ID NO: 27, посредством количественного анализа сиалирования LNT с использованием ЖХ-МС/МС (жидкостная хроматография/масс-спектрометрия) в соответствии со способом, как описано в примере 5.In a further and/or alternative embodiment, a heterologous sialyltransferase capable of having α-2,3-sialyltransferase activity has a relative potency of at least 100-fold, at least 200-fold, at least 300-fold, at least 1000-fold. fold, at least 10,000-fold, compared to the relative efficiency of SiaT16, as represented by SEQ ID NO: 27, by quantitative analysis of LNT sialylation using LC-MS/MS (liquid chromatography/mass spectrometry) in accordance with the method as described in example 5.
В другом воплощении гетерологичная сиалилтрансфераза может обладать α-2,6-сиалилтрансферазной активностью, и олигосахарид грудного молока представляет собой LNT. Полученный таким образом сиалированный олигосахарид представляет собой LST-b.In another embodiment, the heterologous sialyltransferase may have α-2,6-sialyltransferase activity and the human milk oligosaccharide is LNT. The sialylated oligosaccharide thus obtained is LST-b.
В дополнительном воплощении гетерологичная сиалилтрансфераза, способная обладать α-2,6-сиалилтрансферазной активностью, выбрана из группы, состоящей из:In a further embodiment, the heterologous sialyltransferase capable of having α-2,6-sialyltransferase activity is selected from the group consisting of:
I. полипептидов, содержащих или состоящих из аминокислотной последовательности, как представлено любой из SEQ ID NO: 28-33;I. polypeptides containing or consisting of an amino acid sequence as represented by any of SEQ ID NO: 28-33;
II. полипептидов, содержащих или состоящих из аминокислотной последовательности, имеющей сходство по меньшей 80%, с любой из аминокислотных последовательностей, как представлено любой из SEQ ID NO: 28-33; иII. polypeptides containing or consisting of an amino acid sequence having at least 80% similarity to any of the amino acid sequences set forth in any of SEQ ID NOs: 28-33; And
III. фрагментов любой из полипептидов I. и II.III. fragments of any of polypeptides I. and II.
В дополнительном и/или альтернативном воплощении генетически модифицированная клетка содержит молекулу рекомбинантной или синтетической нуклеиновой кислоты, которая содержит по меньшей мере одну нуклеотидную последовательность, кодирующую указанную гетерологичную сиалилтрансферазу, способную обладать α-2,6-сиалилтрансферазной активностью, когда указанная по меньшей мере одна нуклеотидная последовательность выбрана из группы, состоящей из:In an additional and/or alternative embodiment, the genetically modified cell contains a recombinant or synthetic nucleic acid molecule that contains at least one nucleotide sequence encoding said heterologous sialyltransferase capable of having α-2,6-sialyltransferase activity, when said at least one nucleotide sequence the sequence is selected from the group consisting of:
i. нуклеотидных последовательностей, кодирующих полипептид, как представлено любой из SEQ ID NO: 28-33;i. nucleotide sequences encoding a polypeptide as represented by any of SEQ ID NOs: 28-33;
ii. нуклеотидных последовательностей, как представлено любой из SEQ ID NOs: 61-66;ii. nucleotide sequences as represented by any of SEQ ID NOs: 61-66;
iii. нуклеотидных последовательностей, имеющих по меньшей мере 80%-ое сходство последовательностей с одной из нуклеотидных последовательностей, кодирующих полипептид, как представлено любой из SEQ ID NO: 28-33;iii. nucleotide sequences having at least 80% sequence similarity to one of the nucleotide sequences encoding a polypeptide as represented by any of SEQ ID NOs: 28-33;
iv. нуклеотидных последовательностей, имеющих сходство последовательностей, составляющее по меньшей мере 80%, с любой из нуклеотидных последовательностей, представленных SEQ ID NO: 61-66;iv. nucleotide sequences having sequence similarity of at least 80% to any of the nucleotide sequences represented by SEQ ID NOs: 61-66;
v. нуклеотидных последовательностей, которые комплементарны любой из нуклеотидных последовательностей i., ii., iii. и iv; иv. nucleotide sequences that are complementary to any of the nucleotide sequences i., ii., iii. and iv; And
vi. фрагментов любой из нуклеотидных последовательностей i., ii., iii., iv. и v.vi. fragments of any of the nucleotide sequences i., ii., iii., iv. and v.
В дополнительном и/или альтернативном воплощении гетерологичная сиалилтрансфераза, способная обладать α-2,3-сиалилтрансферазной активностью, обладает относительной эффективностью по меньшей мере 100-кратной, более предпочтительно по меньшей мере 200-кратной, наиболее предпочтительно по меньшей мере 300-кратной, по сравнению с относительной эффективностью SiaT5, как представлено SEQ ID NO: 33, посредством количественного анализа сиалирования LNT с использованием ЖХ-МС/МС в соответствии со способом, как описано в примере 5.In a further and/or alternative embodiment, the heterologous sialyltransferase capable of having α-2,3-sialyltransferase activity has a relative potency of at least 100-fold, more preferably at least 200-fold, most preferably at least 300-fold, according to compared to the relative potency of SiaT5 as represented by SEQ ID NO: 33 by quantitative analysis of LNT sialylation using LC-MS/MS according to the method described in Example 5.
В дополнительном и/или альтернативном воплощении по меньшей мере одна генетически модифицированная клетка имеет повышенную продукцию одного или более нуклеотид-активированных сахаров, выбранных из группы, состоящей из CMP-N-ацетилнейраминовой кислоты (Neu5Ac), УДФ (уридиндифосфат)-N-ацетилглюкозамина, УДФ-галактозы и ГДФ (гуанозиндифосфат)-фукозы. Предпочтительно, по меньшей мере одна генетически модифицированная клетка дополнительно генетически модифицирована для возможности обладать увеличенной продукцией одного или более указанных нуклеотид-активированных сахаров. Продукция по меньшей мере одного из указанных нуклеотид-активированных сахаров увеличена в дополнительно генетически модифицированной клетке, по сравнению с продукцией того (тех) же нуклеотид-активированного(ых) сахара(ов) в клетке-предшественнике дополнительно генетически модифицированной клетки перед дополнительной генетической модификацией для того, чтобы обладать увеличенной продукцией по меньшей мере одного из указанных нуклеотид-активированных сахаров.In a further and/or alternative embodiment, the at least one genetically modified cell has increased production of one or more nucleotide-activated sugars selected from the group consisting of CMP-N-acetylneuraminic acid (Neu5Ac), UDP-N-acetylglucosamine, UDP-galactose and GDP (guanosine diphosphate)-fucose. Preferably, the at least one genetically modified cell is further genetically modified to be capable of having increased production of one or more of said nucleotide-activated sugars. The production of at least one of said nucleotide-activated sugar(s) is increased in the further genetically modified cell, compared to the production of the same nucleotide-activated sugar(s) in the progenitor cell of the further genetically modified cell before further genetic modification for in order to have increased production of at least one of these nucleotide-activated sugars.
В дополнительном и/или альтернативном воплощении по меньшей мере одна клетка дополнительно генетически модифицирована для сверхэкспрессии одного или более генов, кодирующих полипептиды, способные обладать ферментативной активностью, выбранной из группы, состоящей из следующего: L-глутамин:D-фруктозо-6-фосфат аминотрансфераза, N-ацетилглюкозамин-1-фосфат уридилтрансфераза, глюкозамин-1-фосфат ацетилтрансфераза, фосфоглюкозаминмутаза, глюкозамин-6-фосфат-N-ацетилтрансфераза, N-ацетилглюкозамин-2-эпимераза, УДФ-N-ацетилглюкозамин-2-эпимераза, синтаза сиаловой кислоты, фосфоенолпируватсинтаза, синтетаза СМР-сиаловой кислоты, УДФ-галактозо-4-эпимераза, галактозо-1-фосфат уридилилтрансфераза, фосфоглюкомутаза, глюкозо-1-фосфат уридилилтрансфераза, фосфоманномутаза, маннозо-1-фосфат гуанозилтрансфераза, ГДФ-маннозо-4,6-дегидратаза, ГДФ-L-фукозосинтаза и фукозокиназа/L-фукозо-1-фосфат-гуанилтрансфераза. Указанная сверхэкспрессия одного или более генов представляет собой сверхэкспрессию, по сравнению с клеткой-предшественником дополнительно генетически модифицированной клетки перед дополнительной генетической модификацией для возможности обладания сверхэкспрессией указанного одного или более генов.In a further and/or alternative embodiment, the at least one cell is further genetically modified to overexpress one or more genes encoding polypeptides capable of having enzymatic activity selected from the group consisting of the following: L-glutamine: D-fructose-6-phosphate aminotransferase , N-acetylglucosamine-1-phosphate uridyltransferase, glucosamine-1-phosphate acetyltransferase, phosphoglucosamine mutase, glucosamine-6-phosphate-N-acetyltransferase, N-acetylglucosamine-2-epimerase, UDP-N-acetylglucosamine-2-epimerase, sialic acid synthase , phosphoenolpyruvate synthase, CMP-sialic acid synthetase, UDP-galactose-4-epimerase, galactose-1-phosphate uridylyltransferase, phosphoglucomutase, glucose-1-phosphate uridylyltransferase, phosphomannomutase, mannose-1-phosphate guanosyltransferase, GDP-mannose-4,6- dehydratase, GDP-L-fucose synthase and fucosokinase/L-fucose-1-phosphate guanyltransferase. Said overexpression of one or more genes represents overexpression, relative to a progenitor cell, of an additional genetically modified cell before further genetic modification to be able to overexpress said one or more genes.
Сверхэкспрессия одного или более указанных генов увеличивает количество соответствующего(их) фермента(ов) в генетически модифицированной клетке и, следовательно, повышает соответствующую ферментативную активность в клетке с усилением внутриклеточной продукции по меньшей мере одного из указанных нуклеотид-активированных сахаров.Overexpression of one or more of these genes increases the amount of the corresponding enzyme(s) in the genetically modified cell and, therefore, increases the corresponding enzymatic activity in the cell with increased intracellular production of at least one of these nucleotide-activated sugars.
В дополнительном и/или альтернативном воплощении по меньшей мере одна генетически модифицированная клетка не обладает или обладает сниженной активностью одной или более ферментативных активностей, выбранных из группы, состоящей из β-галактозидазной активности, глюкозамин-6-фосфатдезаминазы, N-ацетилглюкозамин-6-фосфатдеацетилазы, N-ацетилманнозаминкиназы, N-ацетилманнозамин-6-фосфатэпимеразы и альдолазы N-ацетилнейраминовой кислоты, по сравнению с клеткой перед генетической модификацией.In a further and/or alternative embodiment, at least one genetically modified cell lacks or has reduced activity of one or more enzymatic activities selected from the group consisting of β-galactosidase activity, glucosamine-6-phosphate deaminase, N-acetylglucosamine-6-phosphate deacetylase , N-acetylmannosamine kinase, N-acetylmannosamine-6-phosphate epimerase, and N-acetylneuraminic acid aldolase, compared to a cell before genetic modification.
В дополнительном и/или альтернативном воплощении один или более генов, кодирующих β-галактозидазу, глюкозамин-6-фосфатдезаминазу, N-ацетилглюкозамин-6-фосфатдезацетилазу, N-ацетилманнозаминкиназу, N-ацетилманнозамин-6-фосфатэпимеразу и альдолазу N-ацетилнейраминовой кислоты, был/были удален(удалены) из генома генетически модифицированной клетки или экспрессия одного или более генов, кодирующих β-галактозидазу, глюкозамин-6-фосфатдезаминазу, N-ацетилглюкозамин-6-фосфатдеацетилазу, N-ацетилманнозаминкиназу, N-ацетилманнозамин-6-фосфатэпимеразу и альдолазу N-ацетилнейраминовой кислоты, была инактивирована или по меньшей мере снижена в генетически модифицированной клетке посредством дополнительной генетической модификации клетки. Уровень экспрессии указанных генов снижается в дополнительно генетически модифицированной клетке, по сравнению с клеткой-предшественником дополнительно генетически модифицированной клетки перед дополнительной генетической модификацией для обладания сниженным уровнем экспрессии указанных генов.In an additional and/or alternative embodiment, one or more genes encoding β-galactosidase, glucosamine-6-phosphate deaminase, N-acetylglucosamine-6-phosphate deacetylase, N-acetylmannosamine kinase, N-acetylmannosamine-6-phosphate epimerase, and N-acetylneuraminic acid aldolase was /have been removed from the genome of a genetically modified cell or the expression of one or more genes encoding β-galactosidase, glucosamine-6-phosphate deaminase, N-acetylglucosamine-6-phosphate deacetylase, N-acetylmannosamine kinase, N-acetylmannosamine-6-phosphate epimerase and aldolase N-acetylneuraminic acid has been inactivated or at least reduced in the genetically modified cell by further genetic modification of the cell. The level of expression of said genes is reduced in the further genetically modified cell compared to a progenitor cell of the further genetically modified cell before further genetic modification to have a reduced level of expression of said genes.
В дополнительном и/или альтернативном воплощении по меньшей мере одна генетически модифицированная клетка содержит по меньшей мере одно, выбранное из группы, состоящей из функциональной лактозопермеазы, функциональной фукозопермеазы и функционального транспортера сиаловой кислоты (импортера), предпочтительно содержит и экспрессирует по меньшей мере одну нуклеотидную последовательность, кодирующую одно, выбранное из группы, состоящей из функциональной лактозопермеазы, функциональной фукозопермеазы и функционального транспортера сиаловой кислоты (импортера).In a further and/or alternative embodiment, the at least one genetically modified cell comprises at least one selected from the group consisting of a functional lactose permease, a functional fucose permease and a functional sialic acid transporter (importer), preferably containing and expressing at least one nucleotide sequence , encoding one selected from the group consisting of a functional lactose permease, a functional fucose permease, and a functional sialic acid transporter (importer).
В дополнительном и/или альтернативном воплощении генетически модифицированная клетка обладает активностью по меньшей мере одной глюкозилтрансферазы, выбранной из группы, состоящей из β-1,3-N-ацетилглюкозаминилтрансферазы, β-1,3-галактозилтрансферазы, β-1,4-галактозилтрансферазы, α-2,3-сиалилтрансферазы и α-2,6-сиалилтрансферазы.In a further and/or alternative embodiment, the genetically modified cell has the activity of at least one glucosyltransferase selected from the group consisting of β-1,3-N-acetylglucosaminyltransferase, β-1,3-galactosyltransferase, β-1,4-galactosyltransferase, α-2,3-sialyltransferases and α-2,6-sialyltransferases.
В дополнительном и/или альтернативном воплощении по меньшей мере одну генетически модифицированную клетку культивируют в ферментационном бульоне и в условиях, являющихся пермиссивными для продукции сиалированного олигосахарида.In a further and/or alternative embodiment, the at least one genetically modified cell is cultured in a fermentation broth and under conditions that are permissive for the production of sialylated oligosaccharide.
Ферментационный бульон содержит по меньшей мере один источник углерода для генетически модифицированных клеток. По меньшей мере один источник углерода предпочтительно выбран из группы, состоящей из глюкозы, фруктозы, сахарозы, глицерина и их комбинаций.The fermentation broth contains at least one carbon source for the genetically modified cells. The at least one carbon source is preferably selected from the group consisting of glucose, fructose, sucrose, glycerol, and combinations thereof.
В дополнительном и/или альтернативном воплощении ферментационный бульон содержит по меньшей мере одно, выбранное из группы, состоящей из N-ацетилглюкозамина, галактозы и сиаловой кислоты.In a further and/or alternative embodiment, the fermentation broth contains at least one selected from the group consisting of N-acetylglucosamine, galactose and sialic acid.
В дополнительном и/или альтернативном воплощении, где по меньшей мере одну генетически модифицированную клетку культивируют при отсутствии и/или без добавления одного или более, выбранных из группы, состоящей из N-ацетилглюкозамина, галактозы и сиаловой кислоты, по меньшей мере одну генетически модифицированную клетку культивируют в присутствии лактозы, лакто-N-триозы II (LNT-II) или по меньшей мере одного ОГМ, предпочтительно ОГМ, выбранного из группы, состоящей из трисахаридов, тетрасахаридов и пентасахаридов, более предпочтительно ОГМ, выбранного из группы, состоящей из LNT и LNnT.In a further and/or alternative embodiment, wherein the at least one genetically modified cell is cultured in the absence and/or without the addition of one or more selected from the group consisting of N-acetylglucosamine, galactose and sialic acid, at least one genetically modified cell cultured in the presence of lactose, lacto-N-triose II (LNT-II) or at least one OGM, preferably an OGM selected from the group consisting of trisaccharides, tetrasaccharides and pentasaccharides, more preferably an OGM selected from the group consisting of LNT and LNnT.
Способ включает возможную стадию выделения сиалированного олигосахарида, который был продуцирован по меньшей мере одной генетически модифицированной клеткой во время ее культивирования в ферментационном бульоне. Сиалированный олигосахарид может быть выделен из ферментационного бульона после удаления генетически модифицированной клетки, например, посредством центрифугирования, и/или может быть выделен из клеток, например, в том отношении, что клетки собирают из ферментационного бульона посредством центрифугирования и подвергают стадии лизиса клеток. Затем, сиалированные олигосахариды могут быть дополнительно очищены из ферментационного бульона и/или клеточных лизатов подходящими методиками, известными специалисту в данной области. Подходящие методики включают микрофильтрацию, ультрафильтрацию, диафильтрацию, хроматографию с псевдодвижущимся слоем, электродиализ, обратный осмос, гель-фильтрацию, анионообменную хроматографию, катионообменную хроматографию и т.п.The method includes the optional step of isolating a sialylated oligosaccharide that has been produced by at least one genetically modified cell during its cultivation in a fermentation broth. The sialylated oligosaccharide can be isolated from the fermentation broth after removal of the genetically modified cell, for example by centrifugation, and/or can be isolated from the cells, for example in that the cells are collected from the fermentation broth by centrifugation and subjected to a cell lysis step. The sialylated oligosaccharides can then be further purified from the fermentation broth and/or cell lysates by suitable techniques known to one of ordinary skill in the art. Suitable techniques include microfiltration, ultrafiltration, diafiltration, pseudo-moving bed chromatography, electrodialysis, reverse osmosis, gel filtration, anion exchange chromatography, cation exchange chromatography, and the like.
Согласно второму аспекту предложена генетически модифицированная клетка для применения в способе получения сиалированных олигосахаридов. Указанная генетически модифицированная клетка и предпочтительные воплощения указанной генетически модифицированной клетки ранее описаны в данном документе в связи со способом. Следовательно, генетически модифицированная клетка содержит гетерологичную сиалилтрансферазу, причем указанная гетерологичная сиалилтрансфераза способна обладать α-2,3-сиалилтрансферазной активностью и/или α-2,6-сиалилтрансферазной активностью для переноса остатка сиаловой кислоты, например, остатка N-ацетилнейраминовой кислоты (Neu5Ac) от нуклеотид-активированной формы в качестве донорного субстрата, например, СМР-Neu5Ac, к акцепторной молекуле, где акцепторная молекула выбрана из группы, состоящей из лактозы, лакто-N-триозы II и олигосахаридов грудного молока.According to a second aspect, a genetically modified cell is provided for use in a process for producing sialylated oligosaccharides. Said genetically modified cell and preferred embodiments of said genetically modified cell are previously described herein in connection with the method. Therefore, the genetically modified cell contains a heterologous sialyltransferase, wherein said heterologous sialyltransferase is capable of having α-2,3-sialyltransferase activity and/or α-2,6-sialyltransferase activity to transfer a sialic acid residue, for example, an N-acetylneuraminic acid residue (Neu5Ac) from a nucleotide-activated form as a donor substrate, for example, CMP-Neu5Ac, to an acceptor molecule, where the acceptor molecule is selected from the group consisting of lactose, lacto-N-triose II and human milk oligosaccharides.
Согласно третьему аспекту предложены молекулы рекомбинантной нуклеиновой кислоты для экспрессии сиалилтрансферазы при накоплении в клетке, причем указанная сиалилтрансфераза представляет собой гетерологичную сиалилтрансферазу при экспрессии в клетке. Молекула(ы) рекомбинантной нуклеиновой кислоты содержит(ат) нуклеотидную последовательность, кодирующую сиалилтрансферазу, которая способна переносить остаток сиаловой кислоты, например, остаток N-ацетилнейраминовой кислоты, от донорного субстрата к акцепторной молекуле, где указанная акцепторная молекула выбрана из группы, состоящей из лактозы, лакто-N-триозы II и олигосахаридов грудного молока.According to a third aspect, recombinant nucleic acid molecules are provided for expressing a sialyltransferase when expressed in a cell, wherein said sialyltransferase is a heterologous sialyltransferase when expressed in a cell. The recombinant nucleic acid molecule(s) contains a nucleotide sequence encoding a sialyltransferase that is capable of transferring a sialic acid residue, for example an N-acetylneuraminic acid residue, from a donor substrate to an acceptor molecule, wherein said acceptor molecule is selected from the group consisting of lactose , lacto-N-triose II and breast milk oligosaccharides.
Предпочтительные воплощения нуклеотидных последовательностей, кодирующих сиалилтрансферазу, которая способна переносить остаток сиаловой кислоты от донорного субстрата к акцепторной молекуле, где акцепторная молекула выбрана из группы, состоящей из лактозы, лакто-N-триозы II и олигосахаридов грудного молока, таких как предпочтительные нуклеотидные последовательности, раскрыты ранее в данном документе, в связи со способом получения сиалированных олигосахаридов. Например, сиалилтрансфераза способна переносить остаток N-ацетилнейраминовой кислоты от CMP-Neu5Ac к лактозе, лакто-N-триозе II или олигосахариду грудного молока.Preferred embodiments of nucleotide sequences encoding a sialyltransferase that is capable of transferring a sialic acid residue from a donor substrate to an acceptor molecule, wherein the acceptor molecule is selected from the group consisting of lactose, lacto-N-triose II and human milk oligosaccharides, such as preferred nucleotide sequences are disclosed earlier in this document, in connection with the method of preparing sialylated oligosaccharides. For example, sialyltransferase is capable of transferring an N-acetylneuraminic acid residue from CMP-Neu5Ac to lactose, lacto-N-triose II, or a human milk oligosaccharide.
Нуклеотидная последовательность, кодирующая сиалилтрансферазу, функционально связана с по меньшей мере одной последовательностью контроля экспрессии. Таким образом, в дополнительном и/или альтернативном воплощении молекула рекомбинантной нуклеиновой кислоты содержит по меньшей мере одну последовательность контроля экспрессии, опосредующую транскрипцию и/или трансляцию нуклеотидной последовательности, кодирующей сиалилтрансферазу, когда указанная молекула рекомбинантной нуклеиновой кислоты нарабатывается в клетке.The nucleotide sequence encoding the sialyltransferase is operably linked to at least one expression control sequence. Thus, in a further and/or alternative embodiment, the recombinant nucleic acid molecule comprises at least one expression control sequence mediating transcription and/or translation of a nucleotide sequence encoding a sialyltransferase when said recombinant nucleic acid molecule is produced in a cell.
Согласно четвертому аспекту предложены сиалилтрансферазы, способные обладать α-2,3-сиалилтрансферазной активностью и/или α-2,6-сиалилтрансферазной активностью, переносящие остаток сиаловой кислоты, например, остаток N-ацетилнейраминовой кислоты, от донорного субстрата, например, CMP-Neu5Ac, к акцепторной молекуле, где указанная акцепторная молекула представляет собой лактозу, лакто-N-триозу II или олигосахарид грудного молока.According to a fourth aspect, sialyltransferases are provided capable of having α-2,3-sialyltransferase activity and/or α-2,6-sialyltransferase activity, transferring a sialic acid residue, for example, an N-acetylneuraminic acid residue, from a donor substrate, for example, CMP-Neu5Ac , to an acceptor molecule, wherein said acceptor molecule is lactose, lacto-N-triose II or a human milk oligosaccharide.
В одном воплощении акцепторная молекула выбрана из группы, состоящей из трисахаридов, тетрасахаридов и пентасахаридов. В дополнительном и/или альтернативном воплощении акцепторная молекула выбрана из группы, состоящей из LST-a и LST-b.In one embodiment, the acceptor molecule is selected from the group consisting of trisaccharides, tetrasaccharides and pentasaccharides. In an additional and/or alternative embodiment, the acceptor molecule is selected from the group consisting of LST-a and LST-b.
В дополнительном и/или альтернативном воплощении сиалилтрансфераза выбрана из группы, состоящей из:In an additional and/or alternative embodiment, the sialyltransferase is selected from the group consisting of:
I. полипептидов, содержащих или состоящих из аминокислотной последовательности, как представлено любой из SEQ ID NO: 1-33;I. polypeptides containing or consisting of an amino acid sequence as represented by any of SEQ ID NO: 1-33;
II. полипептидов, содержащих или состоящих из аминокислотной последовательности, обладающей сходством последовательности, составляющим по меньшей мере 80%, с любой из аминокислотных последовательностей, как представлено любой из SEQ ID NO: 1-33; иII. polypeptides containing or consisting of an amino acid sequence having at least 80% sequence similarity to any of the amino acid sequences as represented by any of SEQ ID NO: 1-33; And
III. фрагментов любого из полипептидов I. и II.III. fragments of any of polypeptides I. and II.
Согласно пятому аспекту предложено применение сиалилтрансфераз, ранее описанных в данном документе и способных переносить остаток сиаловой кислоты от донорного субстрата, например, остаток N-ацетилнейраминовой кислоты, от СМР Neu5AC к акцепторной молекуле, где указанная акцепторная молекула представляет собой лактозу, лакто-N-триозу II или олигосахарид грудного молока, для получения сиалированных олигосахаридов.A fifth aspect provides the use of sialyltransferases previously described herein that are capable of transferring a sialic acid residue from a donor substrate, for example an N-acetylneuraminic acid residue, from the Neu5AC CMP to an acceptor molecule, wherein said acceptor molecule is lactose, lacto-N-triose II or breast milk oligosaccharide, to obtain sialylated oligosaccharides.
Указанные сиалилтрансферазы способны переносить остаток сиаловой кислоты к акцепторной молекуле, представляющей собой олигосахарид грудного молока, с получением, таким образом, сиалированного олигосахарида.These sialyltransferases are capable of transferring a sialic acid residue to an acceptor molecule, which is a breast milk oligosaccharide, thereby obtaining a sialylated oligosaccharide.
Олигосахарид грудного молока может представлять собой нейтральный олигосахарид или кислотный олигосахарид, то есть олигосахарид грудного молока, содержащий по меньшей мере один остаток сиаловой кислоты.The human milk oligosaccharide may be a neutral oligosaccharide or an acidic oligosaccharide, that is, a human milk oligosaccharide containing at least one sialic acid residue.
Сиалированный олигосахарид, полученный посредством применения сиалилтрансфераз, как описано ранее в данном документе, может представлять собой олигосахарид грудного молока или может представлять собой сиалированный олигосахарид, не обнаруженный во встречающемся в природе грудном молоке.The sialylated oligosaccharide produced by the use of sialyltransferases as described earlier herein may be a human milk oligosaccharide or may be a sialylated oligosaccharide not found in naturally occurring human milk.
Согласно шестому аспекту предложен способ получения сиалированных олигосахаридов посредством биокатализа in vitro, где используется сиалилтрансфераза, причем указанная сиалилтрансфераза способна переносить остаток сиаловой кислоты от донорного субстрата, например, остаток N-ацетилнейраминовой кислоты от CMP-Neu5Ac к акцепторной молекуле, где указанная акцепторная молекула представляет собой олигосахарид грудного молока.According to a sixth aspect, a method is provided for the production of sialylated oligosaccharides by in vitro biocatalysis using a sialyltransferase, wherein said sialyltransferase is capable of transferring a sialic acid residue from a donor substrate, for example, an N-acetylneuraminic acid residue from CMP-Neu5Ac to an acceptor molecule, wherein said acceptor molecule is Breast milk oligosaccharide.
Способ включает следующие стадии:The method includes the following stages:
- предоставление - в реакционной смеси - сиалилтрансферазы, способной переносить остаток сиаловой кислоты, предпочтительно N-ацетилнейраминовой кислоты, от донорного субстрата к акцепторной молекуле, донорного субстрата и акцепторной молекулы;- providing - in the reaction mixture - a sialyltransferase capable of transferring a sialic acid residue, preferably N-acetylneuraminic acid, from a donor substrate to an acceptor molecule, a donor substrate and an acceptor molecule;
- обеспечение переноса сиалилтрансферазой остатка сиаловой кислоты от донорного субстрата к акцепторной молекуле с получением сиалированного олигосахарида; и- ensuring the transfer of sialic acid residue by sialyltransferase from the donor substrate to the acceptor molecule to produce a sialylated oligosaccharide; And
- выделение сиалированного олигосахарида из реакционной смеси. Согласно седьмому аспекту предложены сиалированные олигосахариды,- isolation of the sialylated oligosaccharide from the reaction mixture. According to a seventh aspect, sialylated oligosaccharides are provided,
получаемые способом согласно первому аспекту или способом согласно шестому аспекту.obtained by a method according to the first aspect or a method according to the sixth aspect.
В одном воплощении сиалированный олигосахарид представляет собой олигосахарид грудного молока, предпочтительно, тетрасахарид, пентасахарид или гексасахарид, более предпочтительно, сиалированный олигосахарид, выбранный из группы, состоящей из LST-a, LST-b и DSLNT.In one embodiment, the sialylated oligosaccharide is a human milk oligosaccharide, preferably a tetrasaccharide, pentasaccharide or hexasaccharide, more preferably a sialylated oligosaccharide selected from the group consisting of LST-a, LST-b and DSLNT.
Согласно восьмому аспекту предложено применение сиалированного олигосахарида, получаемого посредством подхода на основе цельноклеточной ферментации или биокатализа in vitro, как описано ранее в данном документе, для изготовления питательной композиции. Указанная питательная композиция содержит по меньшей мере один сиалированный олигосахарид, который был получен способом, как ранее раскрыто в данном документе.The eighth aspect provides the use of a sialylated oligosaccharide produced through a whole cell fermentation or in vitro biocatalysis approach as described previously herein for the manufacture of a nutritional composition. Said nutritional composition contains at least one sialylated oligosaccharide which has been prepared by a method as previously disclosed herein.
Таким образом, согласно девятому аспекту предложена питательная композиция, содержащая по меньшей мере один сиалированный олигосахарид, который был получен способом, как ранее раскрыто в данном документе. Предпочтительно, по меньшей мере один сиалированный олигосахарид представляет собой 3'-сиалиллактозу, 6'-сиалиллактозу, LST-a, LST-b, LST-c или DSLNT.Thus, according to the ninth aspect, there is provided a nutritional composition comprising at least one sialylated oligosaccharide that has been prepared by a method as previously disclosed herein. Preferably, the at least one sialylated oligosaccharide is 3'-sialyllactose, 6'-sialyllactose, LST-a, LST-b, LST-c or DSLNT.
В дополнительном и/или альтернативном воплощении питательная композиция дополнительно содержит по меньшей мере один нейтральный ОГМ, предпочтительно 2'-FL.In a further and/or alternative embodiment, the nutritional composition further comprises at least one neutral HMO, preferably 2'-FL.
В дополнительном и/или альтернативном воплощении питательная композиция содержит 3-SL, 6-SL и 2'-FL.In an additional and/or alternative embodiment, the nutritional composition contains 3-SL, 6-SL and 2'-FL.
В дополнительном воплощении питательная композиция выбрана из группы, состоящей из лекарственных составов, детской смеси и биологически активных добавок.In a further embodiment, the nutritional composition is selected from the group consisting of medicinal formulations, infant formula and dietary supplements.
Питательная композиция может быть представлена в жидкой форме или в твердой форме, включая порошки, гранулы, хлопья и пеллеты, но, не ограничиваясь ими.The nutritional composition may be presented in liquid form or solid form, including, but not limited to, powders, granules, flakes and pellets.
Согласно десятому аспекту предложена детская смесь, содержащая по меньшей мере один сиалированный ОГМ. Указанный сиалированный ОГМ представляет собой ОГМ, выбранный из группы сиалированных олигосахаридов, которые были получены способом, как описано ранее в данном документе.According to a tenth aspect, an infant formula is provided comprising at least one sialylated HMO. Said sialylated OGM is an OGM selected from the group of sialylated oligosaccharides that have been prepared by a method as described previously herein.
В одном воплощении по меньшей мере один сиалированный ОГМ, который содержится в детской питательной смеси, выбран из группы, состоящей из 3-SL, 6-SL, LST-a, LST-b, LST-c и DSLNT.In one embodiment, the at least one sialylated HGM that is contained in the infant formula is selected from the group consisting of 3-SL, 6-SL, LST-a, LST-b, LST-c and DSLNT.
В дополнительном и/или альтернативном воплощении детская питательная смесь содержит по меньшей мере один сиалированный ОГМ и один или более нейтральных ОГМ.In a further and/or alternative embodiment, the infant formula contains at least one sialylated HMO and one or more neutral HMOs.
В дополнительном и/или альтернативном воплощении детская питательная смесь содержит 3-SL, 6-SL и 2'-FL.In an additional and/or alternative embodiment, the infant formula contains 3-SL, 6-SL and 2'-FL.
Настоящее изобретение будет описано в отношении конкретных воплощений и со ссылкой на графические материалы, но изобретение не ограничивается ими, а только формулой изобретения. Кроме того, термины первый, второй и т.д. в описании и в формуле изобретения используются для проведения различия между похожими элементами и не обязательно для описания последовательности, или во времени, или в пространстве, или в расположении или любым другим образом. Следует понимать, что термины, используемые таким образом, являются взаимозаменяемыми в соответствующих обстоятельствах, и что воплощения изобретения, описанные в данном документе, способны действовать в последовательностях, отличных от последовательностей, описанных или проиллюстрированных в данном документе.The present invention will be described with respect to specific embodiments and with reference to drawings, but the invention is not limited thereto, but only by the claims. In addition, the terms first, second, etc. in the description and claims are used to distinguish between like elements and not necessarily to describe sequence, or in time, or in space, or in arrangement or in any other way. It should be understood that the terms used in this manner are interchangeable in appropriate circumstances, and that the embodiments of the invention described herein are capable of operating in sequences other than those described or illustrated herein.
Следует отметить, что термин «содержащий», используемый в формуле изобретения, не следует считать ограниченным средствами, соответственно перечисленными; он не исключает других элементов или стадий. Таким образом, его следует понимать как точно определяющий наличие установленных признаков, целых чисел, стадий или компонентов, на которые ссылаются, но он не исключает наличия или добавления одного или более признаков, целых чисел, стадий или компонентов или их групп. Таким образом, объем выражения «устройство, содержащее средства А и В», не должен ограничиваться устройствами, состоящими только из компонентов А и В. Это означает, что в отношении настоящего изобретения А и В являются лишь релевантными компонентами устройства.It should be noted that the term "comprising" used in the claims should not be considered limited to the means respectively listed; it does not exclude other elements or stages. Thus, it is to be understood as specifying precisely the presence of the stated features, integers, steps or components referred to, but does not preclude the presence or addition of one or more features, integers, steps or components or groups thereof. Thus, the scope of the expression “device comprising means A and B” should not be limited to devices consisting only of components A and B. This means that for the purposes of the present invention, A and B are only relevant components of the device.
Ссылка на всем протяжении данного описания изобретения на «одно воплощение» или «воплощение» означает, что конкретный признак, структура или характеристика, описанная в связи с воплощением, включена в по меньшей мере одно воплощение настоящего изобретения. Таким образом, появления фраз «в одном воплощении» или «в воплощении» в разных местах на всем протяжении данного описания изобретения не обязательно все относятся к одному и тому же воплощению, но могут относиться к одному и тому же воплощению. Кроме того, конкретные признаки, структуры или характеристики могут объединяться любым подходящим образом, как будет очевидно обычному специалисту в данной области на основе данного раскрытия, в одном или более воплощениях.Reference throughout this specification to “one embodiment” or “embodiment” means that the particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, appearances of the phrases “in one embodiment” or “in an embodiment” in different places throughout this specification do not necessarily all refer to the same embodiment, but may refer to the same embodiment. Moreover, specific features, structures, or characteristics may be combined in any suitable manner, as will be apparent to one of ordinary skill in the art based on this disclosure, in one or more embodiments.
Аналогично, следует понимать, что в описании иллюстративных воплощений изобретения разные признаки изобретения иногда группируют вместе в одном единственном воплощении, фигуре или их его описании в целях упрощения раскрытия и оказания помощи в понимании одного или более разных аспектов изобретения. Данный способ раскрытия, однако, не должен восприниматься как свидетельствование о намерении, что заявленное изобретение требует больше признаков, чем явным образом перечислены в каждом пункте. Напротив, о чем свидетельствует нижеприведенная формула изобретения, аспекты изобретения заключаются меньше чем во всех признаках одного единственного вышеприведенного раскрытого воплощения. Таким образом, формула изобретения, следующая после подробного описания, тем самым явным образом включена в данное подробное описание, причем каждый пункт формулы изобретения выступает сам по себе в качестве отдельного воплощения данного изобретения.Likewise, it should be understood that in the description of illustrative embodiments of the invention, various features of the invention are sometimes grouped together in one single embodiment, figure, or description thereof for the purpose of simplifying the disclosure and assisting in understanding one or more different aspects of the invention. This manner of disclosure, however, should not be taken as indicating an intention that the claimed invention requires more features than are expressly listed in each claim. On the contrary, as evidenced by the claims below, aspects of the invention are contained in less than all the features of the single embodiment disclosed above. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing by itself as a separate embodiment of the invention.
Кроме того, в то время как некоторые воплощения, описанные в данном документе, включают некоторые, но не все признаки, включенные в другие воплощения, подразумевается, что комбинации признаков разных воплощений находятся в пределах объема изобретения и образуют разные воплощения, как будет понятно специалистам в данной области. Например, в нижеследующей формуле изобретения любые из заявленных воплощений могут быть использованы в любой комбинации.In addition, while some embodiments described herein include some, but not all, of the features included in other embodiments, it is understood that combinations of features of different embodiments are within the scope of the invention and form different embodiments, as will be appreciated by those skilled in the art. this area. For example, in the following claims, any of the claimed embodiments may be used in any combination.
Кроме того, некоторые воплощения описаны в данном документе как способ или комбинация элементов способа, который может осуществляться посредством процессора компьютерной системы или посредством других средств осуществления функции. Таким образом, процессор с необходимыми инструкциями для осуществления такого способа или элемента способа образует средство осуществления способа или элемента способа. Кроме того, описанный в данном документе элемент воплощения устройства представляет собой пример средства осуществления функции, выполняемой элементом, с целью осуществления изобретения.In addition, certain embodiments are described herein as a method or combination of elements of a method that may be performed by a computer system processor or other means of performing a function. Thus, a processor with the necessary instructions for implementing such method or method element constitutes means for implementing the method or method element. In addition, the device embodiment described herein is an example of a means of implementing the function performed by the element for the purpose of carrying out the invention.
В описании и графических материалах, предложенных в данном документе, изложено множество конкретных подробностей. Однако, понятно, что воплощения изобретения могут быть осуществлены без данных конкретных подробностей. В других примерах хорошо известные способы, структуры и методики не показаны подробно для того, чтобы не мешать пониманию данного описания.Many specific details are set forth in the descriptions and graphics provided herein. However, it will be understood that embodiments of the invention may be practiced without these specific details. In other examples, well-known methods, structures and techniques are not shown in detail so as not to interfere with the understanding of this description.
Теперь изобретение будет описано посредством подробного описания нескольких воплощений изобретения. Ясно, что другие воплощения изобретения могут быть сконфигурированы в соответствии со знанием специалистов в данной области без отступления от истинной сущности или технической идеи изобретения, причем изобретение ограничивается только терминами прилагаемой формулы изобретения.The invention will now be described by way of detailed description of several embodiments of the invention. It is clear that other embodiments of the invention may be configured in accordance with the knowledge of those skilled in the art without departing from the true spirit or technical concept of the invention, the invention being limited only by the terms of the appended claims.
ПримерыExamples
Пример 1: Разработка Neu5Ac-продуцирующего штамма Е. coli, делающего возможным скрининг in vivo сиалилтрансфераз, использующих лактозу в качестве акцептораExample 1: Development of a Neu5Ac-producing E. coli strain allowing in vivo screening of sialyltransferases using lactose as an acceptor
Метаболическое конструирование включало мутагенез и осуществление делеций конкретных генов, соответственно, и геномные интеграции гетерологичных генов. Гены lacZ и araA инактивировали посредством мутагенеза, используя ошибочно спаренные олигонуклеотиды, как описано Ellis et al., (Proc. Natl. Acad. Sci. USA 98: 6742-6746 (2001)).Metabolic engineering included mutagenesis and deletion of specific genes, respectively, and genomic integration of heterologous genes. The lacZ and araA genes were inactivated by mutagenesis using mismatched oligonucleotides as described by Ellis et al. (Proc. Natl. Acad. Sci. USA 98: 6742-6746 (2001)).
Геномные делеции осуществляли в соответствии со способом Datsenko и Wanner (Proc. Natl. Acad. Sci. USA 97:6640-6645 (2000)). Для предотвращения внутриклеточной деградации N-ацетилнейраминовой кислоты осуществляли делецию генов, кодирующих N-ацетилглюкозамин-6-фосфатдеацетилазу (nagA) и глюкозамин-6-фосфатдезаминазу (nagB), а также целый кластер генов катаболизма N-ацетилнейраминовой кислоты, кодирующих N-ацетилманнозаминкиназу (nanK), N-ацетилманнозамин-6-фосфатэпимеразу (nanE), альдолазу N-ацетилнейраминовой кислоты (nanA) и пермеазу сиаловой кислоты (nanT), из генома штамма Е. coli BL21 (DE3). Также осуществляли делецию генов wzxC-wcaJ. WcaJ кодирует УДФ-глюкозо:ундекапренил-фосфат-глюкозо-1-фосфат трансферазу, катализирующую первую стадию в синтезе колановой кислоты (Stevenson et al., J. Bacteriol. 1996, 178:4885-4893). Кроме того, удаляли гены fuel и fucK, кодирующие L-фукозоизомеразу и L-фукулозокиназу, соответственно.Genomic deletions were performed according to the method of Datsenko and Wanner (Proc. Natl. Acad. Sci. USA 97:6640-6645 (2000)). To prevent intracellular degradation of N-acetylneuraminic acid, the genes encoding N-acetylglucosamine-6-phosphate deacetylase (nagA) and glucosamine-6-phosphate deaminase (nagB) were deleted, as well as a whole cluster of N-acetylneuraminic acid catabolism genes encoding N-acetylmannosamine kinase (nanK ), N-acetylmannosamine-6-phosphate epimerase (nanE), N-acetylneuraminic acid aldolase (nanA), and sialic acid permease (nanT), from the genome of E. coli strain BL21 (DE3). The wzxC-wcaJ genes were also deleted. WcaJ encodes UDP-glucose:undecaprenyl-phosphate-glucose-1-phosphate transferase, which catalyzes the first step in the synthesis of colanic acid (Stevenson et al., J. Bacteriol. 1996, 178:4885-4893). In addition, the fuel and fucK genes encoding L-fucose isomerase and L-fuculose kinase, respectively, were deleted.
Геномную интеграцию гетерологичных генов проводили посредством транспозиции. Одну из транспозаз EZ-Tn5™ (Epicentre, США) использовали для интеграции линейных ДНК-фрагментов, или для транспозиции использовали геперактивный С9-мутант mariner транспозазы Himar1 (Lampe et al., Proc. Natl. Acad. Sci. 1999, USA 96:11428-11433). Для получения транспосом EZ-Tn5 исследуемый ген вместе с маркерным геном устойчивости к антибиотику, фланкированным FRT-сайтами, амплифицировали с помощью праймера 1119 и 1120 (все используемые праймеры перечислены ниже в таблице 3); полученный ПЦР-продукт нес на обоих сайтах 19-п.н. сайты распознавания Mosaic End для транспозазы EZ-Tn5. Для интеграции с использованием тарспозазы Himar1 исследуемые экспрессионные конструкции (опероны) аналогично клонировали вместе с маркерным геном устойчивости к антибиотику, фланкированным FRT-сайтами, в вектор pEcomar. Вектор pEcomar кодирует гиперактивный С9-мутант mariner транспозазы Himar1 под контролем промотора, индуцируемого арабинозой ParaB. Экспрессионный фрагмент <Ptet-lacY-FRT-aadA-FRT> (SEQ ID NO: 67) интегрировали посредством использования транспозазы EZ-Tn5. После удачной интеграции гена для лактозного импортера LacY из Е. coli К12 TG1 (номер доступа ABN72583) ген устойчивости исключали из клонов, устойчивых к стрептомицину, посредством FLP рекомбиназы, кодируемой на плазмиде рСР20 (Datsenko and Wanner, Proc. Natl. Acad. Sci. 2000, USA 97:6640-6645), с образованием штамма #534. Кроме того, кластер csc-генов Е. coli W (номер доступа СР002185.1), содержащий гены сахарозопермеазы, фруктокиназы, сахарозогидролазы и репрессора транскрипции (гены cscB, cscK, cscA и cscR, соответственно), которые позволяют штамму расти на сахарозе в качестве единственного источника углерода, вставляли в геном. Данный csc-кластер интегрировали в геном штамма Е. coli BL21(DE3) посредством транспозиции с использованием плазмиды pEcomar-cscABKR Для усиления синтеза de novo УДФ-N-ацетилглюкозамина осуществляли оптимизацию кодонов генов, кодирующих L-глутамин:D-фруктозо-6-фосфат-аминотрансферазу (glmS), фосфоглюкозаминмутазу из подштамма Е. coli К-12 MG1655 (glmM) и N-ацетилглюкозамин-1-фосфат уридилтрансферазу/глюкозамин-1-фосфат ацетилтрансферазу (glmU) из подштамма Е. coli К-12 MG1655 (номер доступа NP_418185, NP_417643, NP_418186, соответственно), и их получали посредством синтеза генов. Оперон glmUM клонировали под контролем конститутивного тетрациклинового промотора Ptet, в то время как glmS клонировали под контролем конститутивного промотора РТ5. Кассету транспозонов <Ptet-glmUM-PT5-glmS-FRT-dhfr-FRT> (SEQ ID NO: 68), фланкированную инвертированными концевыми повторами, специфично распознаваемыми транспозазой mariner-подобного элемента Himar1, вставляли из pEcomar-glmUM-glmS. В итоге, описанные модификации генома приводили к получению штамма Е. coli BL21(DE3) #942, который представляет каркас для разработки штамма. В Таблицах 1, 2 и 3 содержатся все штаммы, олигонуклеотиды, используемые для клонирования, а также общие плазмиды, используемые в данном исследовании, соответственно.Genomic integration of heterologous genes was carried out through transposition. One of the transposases EZ-Tn5™ (Epicentre, USA) was used for the integration of linear DNA fragments, or the hyperactive C9 mutant of the mariner transposase Himar1 was used for transposition (Lampe et al., Proc. Natl. Acad. Sci. 1999, USA 96: 11428-11433). To obtain EZ-Tn5 transposomes, the gene of interest, along with an antibiotic resistance marker gene flanked by FRT sites, was amplified using primers 1119 and 1120 (all primers used are listed below in Table 3); the resulting PCR product carried 19 bp at both sites. Mosaic End recognition sites for EZ-Tn5 transposase. For integration using the Himar1 tarsposase, the expression constructs (operons) under study were similarly cloned together with an antibiotic resistance marker gene flanked by FRT sites into the pEcomar vector. The pEcomar vector encodes a hyperactive C9 mutant of the mariner transposase Himar1 under the control of the arabinose-inducible promoter P araB . The expression fragment <P tet -lacY-FRT-aadA-FRT> (SEQ ID NO: 67) was integrated using the EZ-Tn5 transposase. After successful integration of the gene for the lactose importer LacY from E. coli K12 TG1 (accession number ABN72583), the resistance gene was eliminated from streptomycin-resistant clones by FLP recombinase encoded on plasmid pCP20 (Datsenko and Wanner, Proc. Natl. Acad. Sci. 2000, USA 97:6640-6645), producing strain #534. In addition, the csc gene cluster of E. coli W (accession number CP002185.1), containing the genes for sucrose permease, fructokinase, sucrose hydrolase and transcription repressor (genes cscB, cscK, cscA and cscR, respectively), which allow the strain to grow on sucrose as the only carbon source was inserted into the genome. This csc cluster was integrated into the genome of E. coli strain BL21(DE3) via transposition using the pEcomar-cscABKR plasmid. To enhance the de novo synthesis of UDP-N-acetylglucosamine, codon optimization was carried out for genes encoding L-glutamine: D-fructose-6-phosphate -aminotransferase (glmS), phosphoglucosamine mutase from E. coli substrain K-12 MG1655 (glmM) and N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase (glmU) from E. coli substrain K-12 MG1655 (accession no. NP_418185, NP_417643, NP_418186, respectively), and they were obtained through gene synthesis. The glmUM operon was cloned under the control of the constitutive tetracycline promoter P tet , while glmS was cloned under the control of the constitutive P T5 promoter. The transposon cassette <P tet -glmUM-P T5 -glmS-FRT-dhfr-FRT> (SEQ ID NO: 68), flanked by inverted terminal repeats specifically recognized by the mariner-like element transposase Himar1, was inserted from pEcomar-glmUM-glmS. Ultimately, the described genome modifications resulted in the E. coli strain BL21(DE3) #942, which provides the framework for strain development. Tables 1, 2, and 3 contain all strains, oligonucleotides used for cloning, and common plasmids used in this study, respectively.
Штамм #942 модифицировали для продукции сиаловой кислоты посредством геномной интеграции экспрессионных кассет <Ptet-glmSm-gna1-FRT-aacC1-FRT> (SEQ ID NO: 69), <Ptet-slr1975-FRT-caf-FRT> (SEQ ID NO: 70), <Ptet-neuBC-FRT-kan-FRT> (SEQ ID NO: 71) и <Ptet-ppsA-FRT-aad1-FRT> (SEQ ID NO: 72). Для всех генов осуществляли оптимизацию кодонов для экспрессии в Е. coli и их получали синтетическим способом посредством кооперации GenScript. GlmSm представляет подвергнутую мутагенезу версию GlmS, что, таким образом, исключает ингибирование глюкозамин-6-фосфатом по типу обратной связи. Ген gna1 кодирует глюкозамин-6-фосфат-ацетилтрансферазу, происходящую из Saccharomyces cerevisiae. Гены субклонировали в виде оперона позади конститутивного промотора Ptet и сливали с геном устойчивости к гентамицину, фланкированным FRT-сайтами, используя праймеры glmSm/gna1_1-8. Аналогично, гены neuB (номер доступа AF305571), кодирующий синтазу сиаловой кислоты, и neuC (номер доступа AF305571), кодирующий УДФ-N-ацетилглюкозамин-2-эпимеразу, оба из которых происходят из Campylobacter jejuni, субклонировали в виде оперона позади конститутивного промотора Ptet и сливали с геном устойчивости к канамицину, фланкированным FRT-сайтами, с использованием праймеров neuBC_1-6. Ген slr1975 (номер доступа BAL35720), также клонируемый позади конститутивного промотора Ptet и слитый с геном устойчивости к хлорамфениколу, фланкированным FRT-сайтами, используя праймеры slr_1-4, кодирует N-ацетилглюкозамин 2-эпимеразу из Synechocystis sp. РСС6803. Ген ppsA (номер доступа АСТ43527), кодирующий фосфоенолпируватсинтазу Е. coli BL21(DE3), аналогично клонировали для конститутивной экспрессии и сливали с геном устойчивости к стрептомицину, фланкированным FRT-сайтами, используя праймеры ppsA_1-4. Геномные интеграции, в конечном итоге, приводили к Neu5Ac-продуцирующему штамму #1363, который использовали для скрининга сиалилтрансфераз 1-26.Strain #942 was modified to produce sialic acid through genomic integration of the expression cassettes <P tet -glmSm-gna1-FRT-aacC1-FRT> (SEQ ID NO: 69), <P tet -slr1975-FRT-caf-FRT> (SEQ ID NO: 70), <P tet -neuBC-FRT-kan-FRT> (SEQ ID NO: 71) and <P tet -ppsA-FRT-aad1-FRT> (SEQ ID NO: 72). All genes were codon optimized for expression in E. coli and produced synthetically through the GenScript collaboration. GlmSm is a mutagenized version of GlmS, which therefore eliminates feedback inhibition by glucosamine 6-phosphate. The gna1 gene encodes a glucosamine 6-phosphate acetyltransferase derived from Saccharomyces cerevisiae. The genes were subcloned as an operon behind the constitutive Ptet promoter and fused to the gentamicin resistance gene flanked by FRT sites using primers glmSm/gna1_1-8. Similarly, the genes neuB (accession no. AF305571), encoding sialic acid synthase, and neuC (accession no. AF305571), encoding UDP-N-acetylglucosamine-2-epimerase, both derived from Campylobacter jejuni, were subcloned as an operon behind the constitutive Ptet promoter and fused to the kanamycin resistance gene flanked by FRT sites using primers neuBC_1-6. The slr1975 gene (accession number BAL35720), also cloned behind the constitutive Ptet promoter and fused to the chloramphenicol resistance gene flanked by FRT sites using primers slr_1-4, encodes N-acetylglucosamine 2-epimerase from Synechocystis sp. RSS6803. The ppsA gene (accession number ACT43527), encoding E. coli BL21(DE3) phosphoenolpyruvate synthase, was similarly cloned for constitutive expression and fused to the streptomycin resistance gene flanked by FRT sites using primers ppsA_1-4. Genomic integrations ultimately resulted in Neu5Ac-producing strain #1363, which was used to screen for sialyltransferases 1–26.
Пример 2: Разработка штамма Е. coli, делающего возможным скрининг in vivo сиалилтрансфераз, использующих лактозу в качестве акцептора, но требующих внешнего добавления сиаловой кислотыExample 2: Development of an E. coli strain allowing in vivo screening of sialyltransferases using lactose as an acceptor but requiring external addition of sialic acid
Штамм Escherichia coli BL21(DE3) #942 использовали для создания штамма для скрининга кодируемых плазмидой сиалилтрансфераз 27-100. Таким образом, для обеспечения поглощения и активации сиаловой кислоты под действием нуклеотида интегрировали гены nanT и neuA, соответственно. Ген nanT (номер доступа В21_03035), кодирующий главный транспортер суперсемейства мембранных транспортеров Neu5Ac Е. coli, амплифицировали из геномной ДНК Е. coli BL21(DE3) и осуществляли оптимизацию кодонов гена neuA, происходящего из Campylobacter jejuni (номер доступа AF305571), и его получали посредством синтеза. Гены клонировали в виде оперона под контролем конститутивного тетрациклинового промотора Ptet, и полученный экспрессионный фрагмент <Pter-neuA-nanT-lox66-kan-lox72> (SEQ ID NO: 73) интегрировали посредством использования транспозазы EZ-Tn5 с получением штамма скрининга #1730.Escherichia coli strain BL21(DE3) #942 was used to create a strain to screen for plasmid-encoded sialyltransferases 27-100. Thus, to ensure the uptake and activation of sialic acid by the nucleotide, the nanT and neuA genes were integrated, respectively. The nanT gene (accession number B21_03035), encoding the main transporter of the E. coli Neu5Ac superfamily of membrane transporters, was amplified from the genomic DNA of E. coli BL21(DE3) and codon optimization of the neuA gene derived from Campylobacter jejuni (accession number AF305571) was carried out and obtained through synthesis. The genes were cloned as an operon under the control of the constitutive tetracycline promoter P tet and the resulting expression fragment <P ter -neuA-nanT-lox66-kan-lox72> (SEQ ID NO: 73) was integrated using the EZ-Tn5 transposase to obtain screening strain # 1730.
Пример 3: Получение коллекции плазмид, кодирующих сиалилтрансферазыExample 3: Preparation of a collection of plasmids encoding sialyltransferases
Последовательности генов охарактеризованных или предполагаемых сиалилтрансфераз получали из литературы и общедоступных баз данных. Поскольку часто описывают, что сиалилтрансферазы демонстрируют высокую активность, когда удален их сигнальный пептид, авторы изобретения проанализировали соответствующие белковые последовательности посредством инструмента прогнозирования on-line SignalP (Petersen et al., Nature Methods, 2011 Sep 29; 8(10):785-6). Гены синтетическим способом синтезировали посредством кооперации GenScript, или, как аннотировано, в полноразмерной форме или когда предсказан сигнальный пептид, в виде усеченного варианта, не обладающего N-концевым сигнальным пептидом (Таблица 4).Gene sequences of characterized or putative sialyltransferases were obtained from the literature and public databases. Since sialyltransferases are often described to exhibit high activity when their signal peptide is removed, we analyzed the corresponding protein sequences using the on-line prediction tool SignalP (Petersen et al., Nature Methods, 2011 Sep 29; 8(10):785-6 ). Genes were synthetically synthesized through GenScript collaboration, either as annotated in full-length form or, when a signal peptide is predicted, as a truncated variant lacking the N-terminal signal peptide (Table 4).
Каждую из сиалилтрансфераз 1-26 субклонировали в виде оперона с neuA в pDEST14 посредством SLIC (от англ. Sequence- and ligation-independent cloning - сиквенс-независимое безлигазное клонирование) с использованием ген-специфичных праймеров (Таблица 2) с получением плазмид общего вида: pDEST14-siaT-neuA. Оставшиеся сиалилтрансферазы 27-100 прямо субклонировали посредством кооперации GenScript в плазмиду рЕТ11а с использованием сайтов рестрикции Ndel и SamHI. Обе экспрессионные системы обеспечивают IPTG (IsoPropyl-β-D-ThioGalactoside - изопропил-β-D-тиогалактозид)-индуцибельную экспрессию генов (Фиг. 2). Для осуществления скрининга активности in vivo плазмидами трансформировали или штамм #1363 или #1730, тогда как Е. coli BL21(DE3) дикого типа или его вариант с отсутствием lacZ (штамм #287) использовали для анализов in vitro.Each of the sialyltransferases 1-26 was subcloned as an operon with neuA in pDEST14 via SLIC (sequence- and ligation-independent cloning) using gene-specific primers (Table 2) to obtain plasmids of the general form: pDEST14-siaT-neuA. The remaining sialyltransferases 27-100 were directly subcloned via GenScript cooperation into plasmid pET11a using Ndel and SamHI restriction sites. Both expression systems provide IPTG (IsoPropyl-β-D-ThioGalactoside)-inducible gene expression (Figure 2). To perform in vivo activity screening, either strain #1363 or #1730 was transformed with plasmids, while wild-type E. coli BL21(DE3) or its lacZ-deficient variant (strain #287) was used for in vitro assays.
Сиалилтрансферазы 1-11, 13-21 и 24 клонировали в виде оперона с neuA (номер доступа AY102622) Campylobacter jejuni. Клонирование сиалилтрансфераз 30-32, 34, 37, 39, 41, 42, 51 и 73 происходило через сайты Ndel и BamHI. Гены сиалилтрансфераз либо клонировали в виде полноразмерных конструкций (FL), либо без предсказанного сигнального пептида (Δ). Число, стоящее после Δ, показывает N-концевые аминокислоты, удаленные из соответствующей последовательности.Sialyltransferases 1–11, 13–21, and 24 were cloned as an operon with neuA (accession number AY102622) of Campylobacter jejuni. Cloning of sialyltransferases 30-32, 34, 37, 39, 41, 42, 51 and 73 occurred through the Ndel and BamHI sites. Sialyltransferase genes were either cloned as full-length constructs (FL) or without the predicted signal peptide (Δ). The number following Δ indicates the N-terminal amino acids removed from the corresponding sequence.
Пример 4: Идентификация и характеристика α-2,3- и α-2,6-сиалилтрансфераз, использующих лактозу в качестве акцепторного субстратаExample 4: Identification and Characterization of α-2,3- and α-2,6-sialyltransferases Using Lactose as an Acceptor Substrate
Штаммы Escherichia coli BL21(DE3) #1363 и #1730, несущие плазмиды, кодирующие 100 сиалилтрансфераз, выращивали при 30°С в 100 мл встряхиваемых колбах, наполненных 20 мл среды на основе минеральных солей (Samain et al., J. Biotech. 1999, 72:33-47) с добавлением 2% (масс./об.) глюкозы, 100 мкг/мл ампициллина, 15 мкг/мл канамицина и 40 мкг/мл зеоцина. Когда культуры достигали OD600 (от англ. optical density - оптическая плотность) 0,1-0,3, экспрессию генов индуцировали добавлением 0,3 мМ IPTG. После инкубации в течение 1 часа 1,5 мМ лактозу добавляли к культурам #1363, тогда как к культурам #1730 добавляли 1,5 мМ лактозу, а также 1,5 мМ сиаловую кислоту. Инкубация продолжалась на протяжении 72-96 часов. Затем, клетки собирали посредством центрифугирования и механически разрушали в определенном объеме с использованием стеклянных шариков. Далее, к образцам применяли тонкослойную хроматографию (ТСХ) на силикагеле 60 F254 (Merck KGaA, Дармштадт, Германия). Смесь бутанол:ацетон:уксусная кислота:H2O (35/35/7/23 (об./об./об./об.)) использовали в качестве подвижной фазы. Для выявления разделенных веществ ТСХ-пластину вымачивали в тимоловом реагенте (0,5 г тимол, растворенный в 95 мл этаноле, с добавлением 5 мл серной кислоты) и нагревали.Escherichia coli strains BL21(DE3) #1363 and #1730, carrying plasmids encoding 100 sialyltransferases, were grown at 30°C in 100 ml shake flasks filled with 20 ml of mineral salts medium (Samain et al., J. Biotech. 1999 , 72:33-47) with the addition of 2% (w/v) glucose, 100 µg/ml ampicillin, 15 µg/ml kanamycin and 40 µg/ml zeocin. When the cultures reached an OD 600 (optical density) of 0.1-0.3, gene expression was induced by adding 0.3 mM IPTG. After incubation for 1 hour, 1.5 mM lactose was added to cultures #1363, while 1.5 mM lactose was added to cultures #1730, as well as 1.5 mM sialic acid. Incubation lasted for 72-96 hours. The cells were then collected by centrifugation and mechanically disrupted to a certain extent using glass beads. Next, thin layer chromatography (TLC) on silica gel 60 F 254 (Merck KGaA, Darmstadt, Germany) was applied to the samples. A mixture of butanol:acetone:acetic acid:H 2 O (35/35/7/23 (v/v/v/v)) was used as the mobile phase. To identify separated substances, the TLC plate was soaked in thymol reagent (0.5 g thymol dissolved in 95 ml ethanol with the addition of 5 ml sulfuric acid) and heated.
Результаты обобщены в Таблице 5. При полном скрининге тридцать два гена, как было идентифицировано, кодируют α-2,3-сиалилтрансферазы, таким образом, продуцируя 3'-SL. 19 Ферментов синтезировали 6'-SL и были изображены как α-2,6-сиалилтрансферазы. α-2,3-, а также α-2,6-сиалилтрансферазную активность можно было наблюдать только в случае 3 ферментов. Соответственно, экспрессия 46 ферментов не приводила к образованию сиалиллактозы. Скрининг оказался высокоточным, поскольку смогли быть подтверждены описанные активности широко охарактеризованных сиалилтрансфераз, например, SiaT1 (Gilbert et al., J Biol Chem. 1996 Nov 8; 271(45):28271-6; Gilbert et al., Eur J Biochem. 1997 Oct 1;249(1): 187-94), SiaT6 (Tsukamoto et al., J Biochem. 2008 Feb; 143(2):187-97) и SiaT11 (Yamamoto et al., J Biochem. 1996 Jul; 120(1):104-10). Относительно образования продукта, условия проведения эксперимента сделали возможным полуколичественное сравнение ферментов, подвергающихся скринингу. Но для достижения более глубокого понимания их кинетических свойств, 3 α-2,3- и α-2,6-сиалилтранферазы с возможно наилучшей эффективностью применяли для анализов in vitro.The results are summarized in Table 5. In the full screen, thirty-two genes were identified to encode α-2,3-sialyltransferases, thereby producing 3'-SL. 19 Enzymes synthesized 6′-SL and were depicted as α-2,6-sialyltransferases. α-2,3- as well as α-2,6-sialyltransferase activity could only be observed in the case of 3 enzymes. Accordingly, the expression of 46 enzymes did not lead to the formation of sialyllactose. The screen was highly accurate because the described activities of widely characterized sialyltransferases, such as SiaT1, could be confirmed (Gilbert et al., J Biol Chem. 1996 Nov 8; 271(45):28271-6; Gilbert et al., Eur J Biochem. 1997 Oct 1;249(1): 187-94), SiaT6 (Tsukamoto et al., J Biochem. 2008 Feb; 143(2):187-97) and SiaT11 (Yamamoto et al., J Biochem. 1996 Jul; 120 (1):104-10). Regarding product formation, the experimental conditions allowed semi-quantitative comparison of the screened enzymes. But to achieve a deeper understanding of their kinetic properties, 3 α-2,3- and α-2,6-sialyltransferases have been used in in vitro assays with the best possible efficiency.
Активности определяли посредством in vivo или in vitro экспериментов с использованием лактозы или LNT в качестве субстратов на основе гликанов, соответственно. Идентификация сиалированной лактозы осуществлялась количественно посредством тонкослойной хроматографии. Продукты сиалирования LNT количественно оценивали посредством ЖХ-МС/МС. Изображено относительное количество LST-a или -b относительно фермента с наилучшей эффективностью, (н.о: неопределяемый).Activities were determined through in vivo or in vitro experiments using lactose or LNT as glycan-based substrates, respectively. Identification of sialylated lactose was carried out quantitatively by thin layer chromatography. LNT sialylation products were quantified by LC-MS/MS. The relative amount of LST-a or -b relative to the best performing enzyme is shown (no: undetectable).
Пример 5: Идентификация и характеристика α-2,3- и α-2,6-сиалилтрансфераз с использованием лакто-N-тетраозы в качестве акцепторного субстратаExample 5: Identification and Characterization of α-2,3- and α-2,6-sialyltransferases using lacto-N-tetraose as acceptor substrate
Escherichia coli BL21(DE3), несущие плазмиды, кодирующие 100 сиалилтрансфераз, выращивали при 30°С в 100 мл встряхиваемых колбах, наполненных 20 мл среды 2YT с добавлением 100 мкг/мл ампициллина. Когда OD600 культур достигала 0,1-0,3, экспрессию генов индуцировали добавлением 0,3 мМ IPTG, и инкубацию продолжали на протяжении 12-16 часов. Клетки собирали посредством центрифугирования и механически разрушали в определенном объеме 50 мМ Tris-HCl, рН 7,5, используя стеклянные шарики. Белковый экстракт выдерживали на льду до начала анализа. Анализ in vitro проводили в общем объеме 25 мкл, включающем 50 мМ Tris-HCl рН 7,5, 5 мМ MgCl2, 10 мМ CMP-Neu5Ac и 5 мМ LNT. Анализ начинался с добавления 3 мкл белкового экстракта и продолжался в течение 16 часов. Образование продукта определяли посредством масс-спектрометрии.Escherichia coli BL21(DE3), carrying plasmids encoding 100 sialyltransferases, were grown at 30°C in 100 ml shake flasks filled with 20 ml of 2YT medium supplemented with 100 μg/ml ampicillin. When the OD 600 of the cultures reached 0.1-0.3, gene expression was induced by adding 0.3 mM IPTG, and incubation was continued for 12-16 hours. Cells were collected by centrifugation and mechanically disrupted in a defined volume of 50 mM Tris-HCl, pH 7.5, using glass beads. The protein extract was kept on ice until analysis. The in vitro assay was performed in a total volume of 25 μl containing 50 mM Tris-HCl pH 7.5, 5 mM MgCl 2 , 10 mM CMP-Neu5Ac and 5 mM LNT. The assay began with the addition of 3 μl of protein extract and continued for 16 hours. Product formation was determined by mass spectrometry.
Масс-спектрометрический анализ проводили посредством MRM (от англ. multiple reaction monitoring - мониторинг множественных реакций) с использованием трехквадрупольной системы обнаружения ЖХ-МС. Ионы-предшественники отбирали и анализировали в квадруполе 1, фрагментация происходит в столкновительной ячейке с использованием аргона в качестве газа CID (от англ. collision-induced dissociation - индуцированная столкновениями диссоциация), отбор фрагментарных ионов проводят в квадруполе 3. Хроматографическое разделение лактозы, 3'-сиалиллактозы и 6'-сиалиллактозы после разведения супернатанта культуры 1:100 H2O (степень чистоты для ЖХ/МС), проводили на колонке ВЭЖХ (высоко-эффективная жидкостная хроматография) XBridge Amide (3,5 мкм, 2,1 × 50 мм (Waters, США) с защитным картриджем XBridge Amide (3,5 мкм, 2,1 × 10 мм) (Waters, США). Температура термостата колонки системы ВЭЖХ составляла 50°С. Подвижная фаза состояла из ацетонитрила:H2O с 10 мМ ацетатом аммония. 1 мкл образец инъецировали в прибор, прогон проводили в течение 3,60 мин со скоростью потока 400 мкл/мин. 3'-Сиалиллактозу и 6'-Сиалиллактозу анализировали посредством MRM в режиме положительной ионизации ИЭР (ионизация электрораспылением). Масс-спектрометр работал при единичном разрешении. Сиалиллактоза образует ион m/z 656,2 [М+Na]. Ион-предшественник сиалиллактозы дополнительно фрагментировали в столкновительной ячейке до фрагментарных ионов m/z 612,15, m/z 365,15 и m/z 314,15. Энергию столкновения, предварительную систематическую погрешность измерения Q1 и Q3 оптимизировали для каждого аналита отдельно. Хроматографическое разделение лактозы, LNT-II, LNT и LST-a или -b после разведения не содержащей частиц реакционной смеси биокатализа или неочищенного экстракта, соответственно, 1:50 H2O (степень чистоты для ЖХ/МС) проводили на колонке ВЭЖХ XBridge Amide (3,5 мкм, 2,1 × 50 мм (Waters, США) с защитным картриджем XBridge Amide (3,5 мкм, 2,1 × 10 мм) (Waters, США). Термостат колонки запускали при 35°С. Подвижная фаза состояла из ацетонитрила:H2O с 0,1% гидроксидом аммония. 1 Мкл образец инъецировали в прибор, прогон проводили в течение 3,50 мин со скоростью потока 300 мкл/мин. Лактозу, LNT-II, LNT, а также LST-a и -b анализировали посредством MRM в режиме отрицательной ионизации ИЭР. Масс-спектрометр функционировал при единичном разрешении. Лактоза образует ион m/z 341,00 [М-Н]. Ион-предшественник лактозы дополнительно фрагментировали в столкновительной ячейке до фрагментарных ионов m/z 179,15, m/z 161,15 и m/z 101,05. LNT-II образует ион m/z 544,20 [М-Н]. Ион-предшественник LNT-II дополнительно фрагментировали до фрагментарных ионов m/z 382,10, m/z 161,00 и m/z 112,90. LNT образует ион m/z 706,20 [М-Н]. Ион-предшественник LNT дополнительно фрагментировали до фрагментарных ионов m/z 382,10, m/z 202,10 и m/z 142,00. LST-a и -b образует ион m/z 997,20 [М-Н]. Ион-предшественник LST-a и -b дополнительно фрагментировали до фрагментарных ионов m/z 290,15, m/z 202,15 и m/z 142,15. Энергию столкновения, предварительную систематическую погрешность измерения Q1 и Q3 оптимизировали отдельно для каждого аналита. Способы количественной оценки устанавливали, используя имеющиеся в продаже стандарты (Carbosynth, Compton, UK).Mass spectrometric analysis was performed by MRM (multiple reaction monitoring) using a triple quadrupole LC-MS detection system. Precursor ions were selected and analyzed in quadrupole 1, fragmentation occurs in a collision cell using argon as a CID gas (collision-induced dissociation), fragment ions are selected in quadrupole 3. Chromatographic separation of lactose, 3' -sialyllactose and 6'-sialyllactose after dilution of the culture supernatant 1:100 H 2 O (LC/MS grade), carried out on an XBridge Amide HPLC column (3.5 µm, 2.1 × 50 mm (Waters, USA) with a protective cartridge XBridge Amide (3.5 μm, 2.1 × 10 mm) (Waters, USA). The temperature of the HPLC system column was 50°C. The mobile phase consisted of acetonitrile: H 2 O with. 10 mM ammonium acetate. 1 μl sample was injected into the instrument and run for 3.60 min at a flow rate of 400 μl/min. 3'-Sialyl lactose and 6'-sialyllactose were analyzed by MRM in positive ionization ESI mode (electrospray ionization). The mass spectrometer was operated at unity resolution. Sialyl lactose forms an ion m/z 656.2 [M+Na]. The sialyllactose precursor ion was further fragmented in a collision cell to fragment ions m/z 612.15, m/z 365.15, and m/z 314.15. Collision energy, preliminary systematic measurement error Q1 and Q3 were optimized for each analyte separately. Chromatographic separation of lactose, LNT-II, LNT and LST-a or -b after dilution of the particle-free biocatalysis reaction mixture or crude extract, respectively, 1:50 H 2 O (LC/MS grade) was carried out on an XBridge Amide HPLC column (3.5 µm, 2.1 × 50 mm (Waters, USA) with a protective cartridge XBridge Amide (3.5 µm, 2.1 × 10 mm) (Waters, USA). The column thermostat was started at 35°C. Movable phase consisted of acetonitrile:H 2 O with 0.1% ammonium hydroxide. 1 μl of the sample was injected into the device, run for 3.50 min with a flow rate of 300 μl/min Lactose, LNT-II, LNT, and LST. -a and -b were analyzed by MRM in ESI negative ionization mode. The mass spectrometer was operated at unity resolution. Lactose produces an ion m/z 341.00 [M-H]. The lactose precursor ion was further fragmented in the collision cell to fragment ions m. /z 179.15, m/z 161.15 and m/z 101.05. LNT-II produces an ion m/z 544.20 [M-H]. The LNT-II precursor ion was further fragmented to m/ fragment ions. z 382.10, m/z 161.00 and m/z 112.90. LNT produces an ion m/z 706.20 [M-H]. The LNT precursor ion was further fragmented to fragment ions m/z 382.10, m/z 202.10, and m/z 142.00. LST-a and -b form an ion m/z 997.20 [M-H]. The precursor ion LST-a and -b were further fragmented to fragment ions m/z 290.15, m/z 202.15, and m/z 142.15. Collision energy, preliminary systematic measurement error Q1 and Q3 were optimized separately for each analyte. Quantification methods were established using commercially available standards (Carbosynth, Compton, UK).
Результаты скрининга in vitro обобщенно приведены в Таблице 5. Идентифицировали, что двадцать восемь генов продуцируют LST-a, тогда как только 6 ферментов синтезировали LST-b. Соответственно, экспрессия 66 ферментов не приводила к образованию как LST-a, так и LST-b. Анализ считается точным, поскольку активность SiaT1, которая, как уже было описано, сиалирует LNT (Gilbert et al., J Biol Chem. 1996 Nov 8; 271(45):28271-6; Gilbert et al., Eur J Biochem. 1997 Oct 1; 249(1): 187-94), могла быть подтверждена. Безотносительно уровня сверхэкспрессии белка, сиалилтрансферазы, которые наилучшим образом продуцировались, выбирали для определения Km и Vmax.The results of the in vitro screen are summarized in Table 5. Twenty-eight genes were identified to produce LST-a, whereas only 6 enzymes synthesized LST-b. Accordingly, expression of 66 enzymes did not result in the formation of either LST-a or LST-b. The assay is considered accurate because of the activity of SiaT1, which has already been described to sialylate LNT (Gilbert et al., J Biol Chem. 1996 Nov 8; 271(45):28271-6; Gilbert et al., Eur J Biochem. 1997 Oct 1; 249(1): 187-94) could be confirmed. Regardless of the level of protein overexpression, the sialyltransferases that were best produced were selected for determination of K m and V max .
Пример 6: Характеристика кинетических свойств выбранных сиалилтрансферазExample 6: Characterization of the kinetic properties of selected sialyltransferases
Для ранжирования сиалилтрансфераз с наилучшей эффективностью их значения Km для донорных и акцепторных субстратов определяли in vitro. Штамм Escherichia coli BL21(DE3) #287 использовали для чрезмерной продукции ферментов. Клетки инкубировали в 100 мл среды 2YT во встряхиваемых колбах с добавлением 100 мкг/мл ампициллина при 30°С до тех пор, пока OD600 не достигнет 0,3. Затем добавляли 0,3 мМ IPTG, и инкубацию продолжали в течение 12-16 часов. Клетки собирали посредством центрифугирования и механически разрушали в определенном объеме 50 мМ Tris-HCl рН 7,5, используя стеклянные шарики. Белковый экстракт держали на льду до начала анализа. Анализ in vitro проводили в общем объеме 50 мл, включающем 50 мМ Tris-HCl, рН7,5, 5 мМ MgCl2 и варьирующие концентрации CMP-Neu5Ac (0,05-30 мМ), а также лактозы или LNT (0,1-50 мМ). Анализ начинался с добавления 35-750 мкг белкового экстракта. После 1-10 минут инкубации при 30°С анализ инактивировали при 95°С в течение 5 минут. Образование продукта определяли посредством масс-спектрометрии. Данные оценивали, используя модуль ферментативной кинетики SigmaPlot v12.5 для расчета Km и Vmax.To rank sialyltransferases with the best efficiency, their K m values for donor and acceptor substrates were determined in vitro. Escherichia coli strain BL21(DE3) #287 was used for enzyme overproduction. Cells were incubated in 100 ml of 2YT medium in shake flasks supplemented with 100 μg/ml ampicillin at 30°C until the OD 600 reached 0.3. 0.3 mM IPTG was then added and incubation was continued for 12-16 hours. Cells were collected by centrifugation and mechanically disrupted in a defined volume of 50 mM Tris-HCl pH 7.5 using glass beads. The protein extract was kept on ice until analysis. The in vitro assay was performed in a total volume of 50 ml containing 50 mM Tris-HCl, pH 7.5, 5 mM MgCl 2 and varying concentrations of CMP-Neu5Ac (0.05-30 mM), as well as lactose or LNT (0.1- 50 mM). The assay began with the addition of 35-750 μg of protein extract. After 1-10 minutes of incubation at 30°C, the assay was inactivated at 95°C for 5 minutes. Product formation was determined by mass spectrometry. Data were assessed using the SigmaPlot v12.5 enzyme kinetics module to calculate K m and V max .
Во время скрининга наиболее эффективные α-2,3-сиалилтрансферазы для получения LST-a, по-видимому, представляют собой SiaT8, SiaT9 и SiaT20. Для сравнения, SiaT6, SiaT18 и SiaT19, как наблюдали, сиалируют LNT наиболее эффективно среди анализируемых α-2,6-сиалилтрансфераз. Их кинетические параметры для CMP-Neu5Ac и LNT, а также лактозы, изображены в Таблице 6. Только SiaT20 не соответствует кинетике Михаэлиса-Ментен.During the screen, the most efficient α-2,3-sialyltransferases for producing LST-a appeared to be SiaT8, SiaT9 and SiaT20. In comparison, SiaT6, SiaT18, and SiaT19 were observed to sialylate LNT most efficiently among the α-2,6-sialyltransferases analyzed. Their kinetic parameters for CMP-Neu5Ac and LNT, as well as lactose, are depicted in Table 6. Only SiaT20 does not correspond to Michaelis-Menten kinetics.
Пример 7: Получение штамма, продуцирующего лакто-N-тетраозу, для скрининга активности in vivo сиалилтрансфераз, использующих LNT в качестве акцепторного субстратаExample 7: Preparation of a lacto-N-tetraose-producing strain for screening the in vivo activity of sialyltransferases using LNT as an acceptor substrate
Штамм Escherichia coli BL21(DE3) #534 использовали для конструирования штамма, продуцирующего лакто-N-тетраозу (LNT). Осуществляли оптимизацию кодонов гена β-1,3-N-ацетилглюкозаминилтрансферазы IgtA из Neisseria meningitidis МС58 (номер доступа NP_274923) для экспрессии в Е. coli и его получали синтетическим способом посредством синтеза генов. Вместе с геном galT, кодирующим галактозо-1-фосфат уридинилтрансферазу, из подштамма Е. coli К-12 MG1655 (номер доступа NP_415279), который аналогично получали посредством синтеза генов, посредством транспозиции вставляли IgtA (SEQ ID NO: 188) с использованием плазмиды pEcomar-IgtA-gaI7. Для усиления синтеза de novo УДФ-N-ацетилглюкозамина осуществляли оптимизацию кодонов генов, кодирующих L-глутамин:D-фруктозо-6-фосфат аминотрансферазу (glmS), фосфоглюкозаминмутазу из подштамма Е. coli К-12 MG1655 (glmM) и N-ацетилглюкозамин-1-фосфат уридилтрансферазу/глюкозамин-1-фосфат-ацетилтрансферазу (glmU), из подштамма Е. coli К-12 MG1655 (номер доступа NP_418185, NP_417643, NP_418186, соответственно), и их получали посредством синтеза генов. Оперон glmUM клонировали под контролем конститутивного тетрациклинового промотора Ptet, в то время как glmS клонировали под контролем конститутивного промотора РТ5- Кассету транспозонов <Ptet-glmUM-PT5-glmS-FRT-dhfr-FRT>, фланкированную инвертированными концевыми повторами, специфично распознаваемыми транспозазой mariner-подобного элемента Himar1, вставляли из pEcomar-glmUM-glmS, обнаруживая штамм, продуцирующий лакто-N-триозу II. Метаболическое конструирование дополнительно включало геномную интеграцию кассет транспозонов <Ptet-wbdO-PT5-galE-FRT-caf-FRT> (SEQ ID NO: 187), фланкированных инвертированными концевыми повторами, специфично распознаваемыми транспозазой mariner-подобного элемента Himar1, которую вставляли из pEcomar-wbdO-galE. Для предупреждения внутриклеточной деградации N-ацетилнейраминовой кислоты кластер генов nanKETA удаляли из генома штамма Е. coli BL21(DE3) в соответствии со способом Datsenko и Wanner (Proc. Natl. Acad. Sci. USA 97:6640-6645 (2000)). Для предоставления достаточного донорного субстрата (CMP-Neu5Ac) для сиалирования LNT механизм поглощения сиаловой кислоты, а так же способность ее активации под действием нуклеотида внедряли в штамм Е. coli. Как описано выше, гены nanT и neuA клонировали в виде оперона (с использованием праймеров neuA/nanT_1-6) под контролем конститутивного тетрациклинового промотора Ptet, и полученный экспрессионный фрагмент <Pter-neuA-nanT-lox66-kan-lox72> интегрировали посредством применения транспозазы EZ-Tn5, получая, в конечном итоге, штамм #2130.Escherichia coli strain BL21(DE3) #534 was used to construct a lacto-N-tetraose (LNT) producing strain. The β-1,3-N-acetylglucosaminyltransferase IgtA gene from Neisseria meningitidis MC58 (accession number NP_274923) was codon optimized for expression in E. coli and obtained synthetically through gene synthesis. Together with the galT gene encoding galactose-1-phosphate uridinyltransferase from the E. coli substrain K-12 MG1655 (accession number NP_415279), which was similarly obtained through gene synthesis, IgtA (SEQ ID NO: 188) was inserted by transposition using the pEcomar plasmid -IgtA-gaI7. To enhance the de novo synthesis of UDP-N-acetylglucosamine, we optimized the codons of the genes encoding L-glutamine:D-fructose-6-phosphate aminotransferase (glmS), phosphoglucosamine mutase from the E. coli substrain K-12 MG1655 (glmM) and N-acetylglucosamine- 1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase (glmU), from E. coli substrain K-12 MG1655 (accession numbers NP_418185, NP_417643, NP_418186, respectively), and were obtained through gene synthesis. The glmUM operon was cloned under the control of the constitutive tetracycline promoter P tet , while glmS was cloned under the control of the constitutive P T5 promoter - <P tet -glmUM-P T5 -glmS-FRT-dhfr-FRT> transposon cassette flanked by inverted terminal repeats, specifically recognized by the mariner-like element transposase Himar1, were inserted from pEcomar-glmUM-glmS, revealing a lacto-N-triose II-producing strain. Metabolic engineering additionally involved genomic integration of transposon cassettes <P tet -wbdO-P T5 -galE-FRT-caf-FRT> (SEQ ID NO: 187) flanked by inverted terminal repeats specifically recognized by the mariner-like element transposase Himar1, which was inserted from pEcomar-wbdO-galE. To prevent intracellular degradation of N-acetylneuraminic acid, the nanKETA gene cluster was deleted from the genome of E. coli strain BL21(DE3) according to the method of Datsenko and Wanner (Proc. Natl. Acad. Sci. USA 97:6640-6645 (2000)). To provide a sufficient donor substrate (CMP-Neu5Ac) for sialylation of LNT, the sialic acid uptake mechanism, as well as the ability to activate it under the influence of the nucleotide, were introduced into the E. coli strain. As described above, the nanT and neuA genes were cloned as an operon (using primers neuA/nanT_1-6) under the control of the constitutive tetracycline promoter P tet , and the resulting expression fragment <P ter -neuA-nanT-lox66-kan-lox72> was integrated via application of the EZ-Tn5 transposase, ultimately obtaining strain #2130.
Пример 8: Периодическая ферментация штамма Е. coli BL21(DE3) #2130, экспрессирующего разные сиалилтрансферазыExample 8: Batch fermentation of E. coli strain BL21(DE3) #2130 expressing different sialyltransferases
Клетки Escherichia coli BL21(DE3) #2130, несущие экспрессионные плазмиды, кодирующие сиалилтрансферазы SiaT9 или SiaT19, выращивали при 30°С в 100 мл встряхиваемых колбах, наполненных 25 мл среды на основе минеральных солей (Samain et al., J. Biotech. 1999, 72:33-47) с добавлением 2% (масс./об.) глюкозы, 5 г/л NH4Cl, 100 мкг/мл ампициллина, 15 мкн/мл канамицина и 5 мкг/мл гентамицина. Когда культуры достигали OD600 0,5-1, добавляли 3 мМ лактозу. Спустя 24 часа инкубации, экспрессию гена сиалилтрансферазы индуцировали добавлением 0,3 мМ IPTG. Одновременно, к культурам добавляли 3 мМ сиаловую кислоту. Инкубация продолжалась в течение 48 часов. Затем клетки собирали посредством центрифугирования и механически разрушали в определенном объеме, используя стеклянные шарики. Далее, тонкослойную хроматографию (ТСХ) осуществляли для подтверждения образования внутри клетки сиалиллакто-N-тетраоз-а и -b. Как показано на Фиг. 3, экспрессия siaT9 или siaT19 в штамме #2130 приводила к образованию LST-a и LST-b, соответственно. Результаты подтверждали посредством масс-спектрометрии.Escherichia coli BL21(DE3) #2130 cells carrying expression plasmids encoding SiaT9 or SiaT19 sialyltransferases were grown at 30°C in 100 ml shake flasks filled with 25 ml of mineral salts medium (Samain et al., J. Biotech. 1999 , 72:33-47) with the addition of 2% (w/v) glucose, 5 g/l NH 4 Cl, 100 µg/ml ampicillin, 15 µg/ml kanamycin and 5 µg/ml gentamicin. When cultures reached an OD 600 of 0.5-1, 3 mM lactose was added. After 24 hours of incubation, sialyltransferase gene expression was induced by the addition of 0.3 mM IPTG. At the same time, 3 mM sialic acid was added to the cultures. Incubation continued for 48 hours. The cells were then collected by centrifugation and mechanically disrupted to a certain extent using glass beads. Next, thin layer chromatography (TLC) was performed to confirm the intracellular formation of sialyllacto-N-tetraose-a and -b. As shown in FIG. 3, expression of siaT9 or siaT19 in strain #2130 resulted in the formation of LST-a and LST-b, respectively. The results were confirmed by mass spectrometry.
--->--->
ПЕРЕЧЕНЬ ПОСЛЕДОВАТЕЛЬНОСТЕЙLIST OF SEQUENCES
<110> Jennewein Biotechnologie GmbH<110> Jennewein Biotechnologie GmbH
<120> СИАЛИЛТРАНСФЕРАЗЫ И ИХ ПРИМЕНЕНИЕ В ПОЛУЧЕНИИ СИАЛИРОВАННЫХ ОЛИГОСАХАРИДОВ<120> SIALYL TRANSFERASES AND THEIR APPLICATION IN THE PREPARATION OF SIALYLATED OLIGOSACCHARIDES
<130> P 1703 WO<130>P 1703 WO
<160> 188<160> 188
<170> PatentIn version 3.5<170> Patent In version 3.5
<210> 1<210> 1
<211> 1410<211> 1410
<212> ДНК<212> DNA
<213> Campylobacter coli<213> Campylobacter coli
<400> 1<400> 1
atgcaaaacg tcattatcgc tggtaacggt ccgagcctgc aatcaatcaa ctatcaacgc 60atgcaaaacg tcattatcgc tggtaacggt ccgagcctgc aatcaatcaa ctatcaacgc 60
ctgccgaaag aatacgacat cttccgctgc aaccagttct acttcgaaga taaatactac 120ctgccgaaag aatacgacat cttccgctgc aaccagttct acttcgaaga taaatactac 120
ctgggcaaaa acatcaaagc ggcctttttc aatccgtatc cgttcctgca gcaataccat 180ctgggcaaaa acatcaaagc ggcctttttc aatccgtatc cgttcctgca gcaataccat 180
accgcgaaac agctggtgtt caacaacgaa tacaaaatcg aaaacatctt ttgtagcacg 240accgcgaaac agctggtgtt caacaacgaa tacaaaatcg aaaacatctt ttgtagcacg 240
ttcaatctgc cgttcatcga aaaagataac ttcatcaaca aattttacga tttctttccg 300ttcaatctgc cgttcatcga aaaagataac ttcatcaaca aattttacga tttctttccg 300
gacgctaaac tgggtcacaa aatcatcgaa aacctgaaag aattttacgc gtacatcaaa 360gacgctaaac tgggtcacaa aatcatcgaa aacctgaaag aattttacgc gtacatcaaa 360
tacaacgaaa tctacctgaa caaacgtatt accagcggca tctatatgtg cgcaattgct 420tacaacgaaa tctacctgaa caaacgtatt accagcggca tctatatgtg cgcaattgct 420
atcgcgctgg gttataaaaa catttacctg tgtggcatcg atttctatga aggtgaaacg 480atcgcgctgg gttataaaaa catttacctg tgtggcatcg atttctatga aggtgaaacg 480
atctacccgt tcaaagccat gtctaaaaac attaagaaaa tttttccgtg gatcaaagat 540atctacccgt tcaaagccat gtctaaaaac attaagaaaa tttttccgtg gatcaaagat 540
ttcaacccga gtaacttcca ttccaaagaa tacgacatcg aaatcctgaa actgctggaa 600ttcaacccga gtaacttcca ttccaaagaa tacgacatcg aaatcctgaa actgctggaa 600
tcaatctaca aagttaacat ctacgcactg tgcgataact cggccctggc aaattacttc 660tcaatctaca aagttaacat ctacgcactg tgcgataact cggccctggc aaattacttc 660
ccgctgctgg tgaacaccga caattcattt gttctggaaa acaaatcgga tgactgtatc 720ccgctgctgg tgaacaccga caattcattt gttctggaaa acaaatcgga tgactgtatc 720
aacgatatcc tgctgaccaa caatacgccg ggcattaact tctataaaag ccagatccaa 780aacgatatcc tgctgaccaa caatacgccg ggcattaact tctataaaag ccagatccaa 780
gtcaacaata ccgaaattct gctgctgaac tttcagaata tgatcagcgc caaagaaaac 840gtcaacaata ccgaaattct gctgctgaac tttcagaata tgatcagcgc caaagaaaac 840
gaaatttcta acctgaacaa aatcctgcaa gactcataca aaaccatcaa cacgaaagaa 900gaaatttcta acctgaacaa aatcctgcaa gactcataca aaaccatcaa cacgaaagaa 900
aacgaaatta gtaatctgaa taaaatcctg caggattcct ataaaacgat taataccaaa 960aacgaaatta gtaatctgaa taaaatcctg caggattcct ataaaacgat taataccaaa 960
gaaaatgaaa tttcgaatct gaacaaaatc ctgcaggata aagacaaact gctgatcgtt 1020gaaaatgaaa tttcgaatct gaacaaaatc ctgcaggata aagacaaact gctgatcgtt 1020
aaagaaaacc tgctgaattt caaaagccgt catggtaaag ccaaatttcg cattcagaac 1080aaagaaaacc tgctgaattt caaaagccgt catggtaaag ccaaatttcg cattcagaac 1080
caactgtctt ataaactggg ccaggcaatg atggtcaata gcaaatctct gctgggttat 1140caactgtctt ataaactggg ccaggcaatg atggtcaata gcaaatctct gctgggttat 1140
atccgtatgc cgtttgtgct gagttacatc aaagacaaac acaaacagga acaaaaaatc 1200atccgtatgc cgtttgtgct gagttacatc aaagacaaac acaaacagga acaaaaaatc 1200
tatcaggaaa aaattaagaa agatccgagc ctgaccctgc cgccgctgga agattatccg 1260tatcaggaaa aaattaagaa agatccgagc ctgaccctgc cgccgctgga agattatccg 1260
gactacaaag aagctctgaa agaaaaagaa tgcctgacct atcgcctggg ccagacgctg 1320gactacaaag aagctctgaa agaaaaagaa tgcctgacct atcgcctggg cgacgctg 1320
attaaagcgg atcaagaatg gtacaaaggt ggctatgtga aaatgtggtt cgaaatcaaa 1380attaaagcgg atcaagaatg gtacaaaggt ggctatgtga aaatgtggtt cgaaatcaaa 1380
aaactgaaga aagaatacaa aaagaaataa 1410aaactgaaga aagaatacaa aaagaaataa 1410
<210> 2<210> 2
<211> 1146<211> 1146
<212> ДНК<212> DNA
<213> Vibrio sp.<213> Vibrio sp.
<400> 2<400> 2
atgaacaacg acaactccac gaccaccaac aataacgcta ttgaaatcta tgtggatcgt 60atgaacaacg acaactccac gaccaccaac aataacgcta ttgaaatcta tgtggatcgt 60
gcgaccctgc cgacgatcca gcaaatgacc aaaattgtta gccagaaaac gtctaacaaa 120gcgaccctgc cgacgatcca gcaaatgacc aaaattgtta gccagaaaac gtctaacaaa 120
aaactgatct catggtcgcg ctacccgatt accgataaaa gcctgctgaa gaaaattaac 180aaactgatct catggtcgcg ctacccgatt accgataaaa gcctgctgaa gaaaattaac 180
gcggaatttt tcaaagaaca atttgaactg acggaaagcc tgaaaaacat catcctgtct 240gcggaatttt tcaaagaaca atttgaactg acggaaagcc tgaaaaacat catcctgtct 240
gaaaacatcg ataacctgat cattcatggc aataccctgt ggagtattga tgtggttgac 300gaaaacatcg ataacctgat cattcatggc aataccctgt ggagtattga tgtggttgac 300
attatcaaag aagtcaacct gctgggcaaa aatattccga tcgaactgca cttttatgat 360attatcaaag aagtcaacct gctgggcaaa aatattccga tcgaactgca cttttatgat 360
gacggttccg ccgaatacgt tcgtatctac gaatttagta aactgccgga atccgaacag 420gacggttccg ccgaatacgt tcgtatctac gaatttagta aactgccgga atccgaacag 420
aaatacaaaa ccagcctgtc taaaaacaac atcaaattct caatcgatgg caccgactcg 480aaatacaaaa ccagcctgtc taaaaacaac atcaaattct caatcgatgg caccgactcg 480
ttcaaaaaca cgatcgaaaa catctacggt ttcagccaac tgtatccgac cacgtaccac 540ttcaaaaaca cgatcgaaaa catctacggt ttcagccaac tgtatccgac cacgtaccac 540
atgctgcgtg cagatatctt cgacaccacg ctgaaaatta acccgctgcg cgaactgctg 600atgctgcgtg cagatatctt cgacaccacg ctgaaaatta acccgctgcg cgaactgctg 600
tcaaacaaca tcaaacagat gaaatgggat tacttcaaag acttcaacta caaacaaaaa 660tcaaacaaca tcaaacagat gaaatgggat tacttcaaag acttcaacta caaacaaaaa 660
gatatctttt actcactgac caacttcaac ccgaaagaaa tccaggaaga cttcaacaaa 720gatatctttt actcactgac caacttcaac ccgaaagaaa tccaggaaga cttcaacaaa 720
aactcgaaca aaaacttcat cttcatcggc agtaactccg cgaccgccac ggcagaagaa 780aactcgaaca aaaacttcat cttcatcggc agtaactccg cgaccgccac ggcagaagaa 780
caaatcaata ttatcagcga agcgaagaaa gaaaacagca gcattatcac caattcaatt 840caaatcaata ttatcagcga agcgaagaaa gaaaacagca gcattatcac caattcaatt 840
tcggattatg acctgttttt caaaggtcat ccgtctgcca cgtttaacga acagattatc 900tcggattatg acctgttttt caaaggtcat ccgtctgcca cgtttaacga acagattatc 900
aatgcacacg atatgatcga aatcaacaac aaaatcccgt tcgaagctct gatcatgacc 960aatgcacacg atatgatcga aatcaacaac aaaatcccgt tcgaagctct gatcatgacc 960
ggcattctgc cggatgccgt tggcggtatg ggtagttccg tctttttcag tatcccgaaa 1020ggcattctgc cggatgccgt tggcggtatg ggtagttccg tctttttcag tatcccgaaa 1020
gaagtcaaaa acaaattcgt gttctataaa agtggtacgg atatcgaaaa taactccctg 1080gaagtcaaaa acaaattcgt gttctataaa agtggtacgg atatcgaaaa taactccctg 1080
attcaggtga tgctgaaact gaatctgatt aaccgcgata atattaaact gatctctgac 1140attcaggtga tgctgaaact gaatctgatt aaccgcgata atattaaact gatctctgac 1140
atttaa 1146atttaa 1146
<210> 3<210> 3
<211> 1173<211> 1173
<212> ДНК<212> DNA
<213> Photobacterium sp.<213> Photobacterium sp.
<400> 3<400> 3
atgggctgta atagcgactc caaccacaac aactccgacg gcaacatcac caaaaacaaa 60atgggctgta atagcgactc caaccacaac aactccgacg gcaacatcac caaaaacaaa 60
acgatcgaag tttatgtcga tcgtgcaacc ctgccgacga ttcagcaaat gacccagatc 120acgatcgaag tttatgtcga tcgtgcaacc ctgccgacga ttcagcaaat gacccagatc 120
atcaacgaaa atagcaacaa caaaaaactg atttcatggt cgcgctaccc gatcaatgat 180atcaacgaaa atagcaacaa caaaaaactg atttcatggt cgcgctaccc gatcaatgat 180
gaagaactgc tggaatcaat taacggctcg tttttcaaaa acaactctga actgatcaaa 240gaagaactgc tggaatcaat taacggctcg tttttcaaaa acaactctga actgatcaaa 240
agtctggatt ccatgattct gaccaatgac attaagaaag tgatcatcaa cggtaacacg 300agtctggatt ccatgattct gaccaatgac attaagaaag tgatcatcaa cggtaacacg 300
ctgtgggcgg ccgatgtggt taacatcatc aaatcaatcg aagcgttcgg caagaaaacc 360ctgtgggcgg ccgatgtggt taacatcatc aaatcaatcg aagcgttcgg caagaaaacc 360
gaaatcgaac tgaactttta tgatgacggt tcggccgaat atgtgcgtct gtacgacttt 420gaaatcgaac tgaactttta tgatgacggt tcggccgaat atgtgcgtct gtacgacttt 420
agcaaactgc cggaatctga acaggaatac aaaattagcc tgtctaaaga taacattctg 480agcaaactgc cggaatctga acaggaatac aaaattagcc tgtctaaaga taacattctg 480
agcagcatca acggcaccca gccgttcgaa aacgtcgtgg aaaacatcta cggtttcagt 540agcagcatca acggcaccca gccgttcgaa aacgtcgtgg aaaacatcta cggtttcagt 540
caactgtacc cgaccacgta ccacatgctg cgtgccgata tctttgaaac caatctgccg 600caactgtacc cgaccacgta ccacatgctg cgtgccgata tctttgaaac caatctgccg 600
ctgcgcagtc tgaaaggcgt tctgtccaac aacatcaaac agatgaaatg ggattacttc 660ctgcgcagtc tgaaaggcgt tctgtccaac aacatcaaac agatgaaatg ggattacttc 660
aaaaccttca acagccagca aaaagacaaa ttctacaact tcacgggttt taacccggat 720aaaaccttca acagccagca aaaagacaaa ttctacaact tcacgggttt taacccggat 720
gaaattatgg aacaatacaa agcaagcccg aacaaaaatt ttatcttcgt cggcaccaat 780gaaattatgg aacaatacaa agcaagcccg aacaaaaatt ttatcttcgt cggcaccaat 780
tctggcaccg caacggctga acagcaaatt gatatcctga ccgaagctaa aaacccgaac 840tctggcaccg caacggctga acagcaaatt gatatcctga ccgaagctaa aaacccgaac 840
agcccgatta tcacgaaatc gatccagggc ttcgacctgt ttttcaaagg tcatccgtct 900agcccgatta tcacgaaatc gatccagggc ttcgacctgt ttttcaaagg tcatccgtct 900
gcaacctaca acaaacaaat catcgatgct cacaacatga tcgaaatcta caacaaaatc 960gcaacctaca acaaacaaat catcgatgct cacaacatga tcgaaatcta caacaaaatc 960
ccgttcgaag cgctgatcat gaccgatgcc ctgccggatg cggtgggcgg tatgggcagc 1020ccgttcgaag cgctgatcat gaccgatgcc ctgccggatg cggtgggcgg tatgggcagc 1020
agcgtgtttt tcagcctgcc gaataccgtg gaaaacaaat tcattttcta taaatccgat 1080agcgtgtttt tcagcctgcc gaataccgtg gaaaacaaat tcattttcta taaatccgat 1080
acggacattg aaaacaatgc cctgatccag gttatgattg aactgaatat cgtgaaccgt 1140acggacattg aaaacaatgc cctgatccag gttatgattg aactgaatat cgtgaaccgt 1140
aatgatgtga aactgatctc ggacctgcaa taa 1173aatgatgtga aactgatctc ggacctgcaa taa 1173
<210> 4<210> 4
<211> 1167<211> 1167
<212> ДНК<212> DNA
<213> Pasteurella multocida<213> Pasteurella multocida
<400> 4<400> 4
atgaaaacga ttaccctgta tctggacccg gcgtccctgc cggcactgaa ccaactgatg 60atgaaaacga ttaccctgta tctggacccg gcgtccctgc cggcactgaa ccaactgatg 60
gattttacgc agaacaatga agacaaaacc catccgcgta tctttggcct gtctcgcttc 120gattttacgc agaacaatga agacaaaacc catccgcgta tctttggcct gtctcgcttc 120
aaaattccgg ataacattat cacccaatat cagaatatcc actttgttga actgaaagac 180aaaattccgg ataacattat cacccaatat cagaatatcc actttgttga actgaaagac 180
aatcgtccga cggaagccct gttcaccatt ctggatcagt acccgggtaa cattgaactg 240aatcgtccga cggaagccct gttcaccatt ctggatcagt acccgggtaa cattgaactg 240
gacatccatc tgaatattgc tcacagcgtc cagctgattc gtccgatcct ggcgtatcgc 300gacatccatc tgaatattgc tcacagcgtc cagctgattc gtccgatcct ggcgtatcgc 300
tttaaacatc tggatcgtgt gtccatccag cgcctgaacc tgtatgatga cggctcaatg 360tttaaacatc tggatcgtgt gtccatccag cgcctgaacc tgtatgatga cggctcaatg 360
gaatacgttg atctggaaaa agaagaaaac aaagacatct cggcagaaat taaacaagct 420gaatacgttg atctggaaaa agaagaaaac aaagacatct cggcagaaat taaacaagct 420
gaaaaacagc tgagccatta tctgctgacg ggtaaaatca aattcgataa cccgaccatt 480gaaaaacagc tgagccatta tctgctgacg ggtaaaatca aattcgataa cccgaccatt 480
gcgcgctacg tttggcagtc tgcctttccg gtcaaatatc acttcctgag tacggactac 540gcgcgctacg tttggcagtc tgcctttccg gtcaaatatc acttcctgag tacggactac 540
tttgaaaaag cagaatttct gcaaccgctg aaagaatatc tggcggaaaa ttaccagaaa 600tttgaaaaag cagaatttct gcaaccgctg aaagaatatc tggcggaaaa ttaccagaaa 600
atggattgga cggcctatca gcaactgacc ccggaacagc aagcatttta cctgaccctg 660atggattgga cggcctatca gcaactgacc ccggaacagc aagcatttta cctgaccctg 660
gttggcttca acgacgaagt caaacagagt ctggaagtgc agcaagcgaa atttattttc 720gttggcttca acgacgaagt caaacagagt ctggaagtgc agcaagcgaa atttattttc 720
acgggcacca cgacctggga aggtaatacc gatgttcgtg aatattacgc ccagcaacag 780acgggcacca cgacctggga aggtaatacc gatgttcgtg aatattacgc ccagcaacag 780
ctgaacctgc tgaatcattt tacccaggcg ggcggcgacc tgtttattgg tgaccattac 840ctgaacctgc tgaatcattt tacccaggcg ggcggcgacc tgtttattgg tgaccattac 840
aaaatttact tcaaaggtca cccgcgcggc ggtgaaatca acgattacat cctgaacaac 900aaaatttact tcaaaggtca cccgcgcggc ggtgaaatca acgattacat cctgaacaac 900
gcaaaaaaca tcacgaatat cccggctaat atctctttcg aagtgctgat gatgaccggc 960gcaaaaaaca tcacgaatat cccggctaat atctctttcg aagtgctgat gatgaccggc 960
ctgctgccgg ataaagtcgg cggtgtggct agctctctgt acttcagtct gccgaaagaa 1020ctgctgccgg ataaagtcgg cggtgtggct agctctctgt acttcagtct gccgaaagaa 1020
aaaattagtc acatcatctt caccagcaac aaacaggtca aatcaaaaga agatgccctg 1080aaaattagtc acatcatctt caccagcaac aaacaggtca aatcaaaaga agatgccctg 1080
aacaatccgt acgtgaaagt tatgcgtcgc ctgggtatta tcgatgaatc gcaagtgatc 1140aacaatccgt acgtgaaagt tatgcgtcgc ctgggtatta tcgatgaatc gcaagtgatc 1140
ttttgggaca gcctgaaaca gctgtaa 1167ttttgggaca gcctgaaaca gctgtaa 1167
<210> 5<210> 5
<211> 1116<211> 1116
<212> ДНК<212> DNA
<213> Neisseria meningitidis<213> Neisseria meningitidis
<400> 5<400> 5
atgggcctga aaaaagcctg cctgaccgtg ctgtgtctga tcgtgttttg cttcggcatc 60atgggcctga aaaaagcctg cctgaccgtg ctgtgtctga tcgtgttttg cttcggcatc 60
ttttatacgt tcgatcgtgt gaaccagggt gaacgcaatg cagttagtct gctgaaagaa 120ttttatacgt tcgatcgtgt gaaccagggt gaacgcaatg cagttagtct gctgaaagaa 120
aaactgttta acgaagaagg cgaaccggtg aatctgatct tctgttacac cattctgcaa 180aaactgttta acgaagaagg cgaaccggtg aatctgatct tctgttacac cattctgcaa 180
atgaaagttg ccgaacgtat tatggcacag catccgggtg aacgctttta tgtggttctg 240atgaaagttg ccgaacgtat tatggcacag catccgggtg aacgctttta tgtggttctg 240
atgagcgaaa accgtaacga aaaatacgat tactacttca accagatcaa agataaagcg 300atgagcgaaa accgtaacga aaaatacgat tactacttca accagatcaa agataaagcg 300
gaacgcgcct atttctttca cctgccgtac ggcctgaaca aaagttttaa tttcattccg 360gaacgcgcct atttctttca cctgccgtac ggcctgaaca aaagttttaa tttcattccg 360
acgatggcgg aactgaaagt gaaaagcatg ctgctgccga aagttaaacg tatctatctg 420acgatggcgg aactgaaagt gaaaagcatg ctgctgccga aagttaaacg tatctatctg 420
gcaagcctgg aaaaagtgtc tattgcggcc tttctgagca cctacccgga tgcggaaatc 480gcaagcctgg aaaaagtgtc tattgcggcc tttctgagca cctacccgga tgcggaaatc 480
aaaaccttcg atgatggcac gggtaatctg attcagagct ctagttatct gggcgatgaa 540aaaaccttcg atgatggcac gggtaatctg attcagagct ctagttatct gggcgatgaa 540
ttttctgtta acggtacgat caaacgtaat ttcgcccgca tgatgatcgg tgattggtct 600ttttctgtta acggtacgat caaacgtaat ttcgcccgca tgatgatcgg tgattggtct 600
attgcgaaaa cccgcaacgc cagtgatgaa cattacacga tcttcaaagg cctgaaaaac 660attgcgaaaa cccgcaacgc cagtgatgaa cattacacga tcttcaaagg cctgaaaaac 660
atcatggatg atggtcgtcg caaaatgacc tacctgccgc tgttcgatgc gtctgaactg 720atcatggatg atggtcgtcg caaaatgacc tacctgccgc tgttcgatgc gtctgaactg 720
aaaacgggcg atgaaaccgg cggtacggtg cgtattctgc tgggtagccc ggataaagaa 780aaaacgggcg atgaaaccgg cggtacggtg cgtattctgc tgggtagccc ggataaagaa 780
atgaaagaaa tctctgaaaa agcagcgaaa aacttcaaaa tccagtatgt tgccccgcac 840atgaaagaaa tctctgaaaa agcagcgaaa aacttcaaaa tccagtatgt tgccccgcac 840
ccgcgtcaga cctacggcct gagtggtgtg accacgctga acagcccgta tgttattgaa 900ccgcgtcaga cctacggcct gagtggtgtg accacgctga acagcccgta tgttattgaa 900
gattacatcc tgcgtgaaat taagaaaaac ccgcataccc gctatgaaat ctacacgttt 960gattacatcc tgcgtgaaat taagaaaaac ccgcataccc gctatgaaat ctacacgttt 960
ttcagcggcg ccgcactgac catgaaagat tttccgaacg tgcacgttta tgcactgaaa 1020ttcagcggcg ccgcactgac catgaaagat tttccgaacg tgcacgttta tgcactgaaa 1020
ccggcgtctc tgccggaaga ttattggctg aaaccggtgt acgcgctgtt tacccagagt 1080ccggcgtctc tgccggaaga ttattggctg aaaccggtgt acgcgctgtt tacccagagt 1080
ggtattccga tcctgacgtt cgatgataaa aattaa 1116ggtattccga tcctgacgtt cgatgataaa aattaa 1116
<210> 6<210> 6
<211> 852<211> 852
<212> ДНК<212> DNA
<213> Pasteurella multocida<213> Pasteurella multocida
<400> 6<400> 6
atggataaat ttgcagaaca tgaaattccg aaagcagtga tcgttgctgg caacggtgaa 60atggataaat ttgcagaaca tgaaattccg aaagcagtga tcgttgctgg caacggtgaa 60
agtctgtccc agattgatta tcgtctgctg ccgaaaaact acgacgtctt ccgttgcaac 120agtctgtccc agattgatta tcgtctgctg ccgaaaaact acgacgtctt ccgttgcaac 120
caattctact tcgaagaacg ctacttcctg ggcaataaaa tcaaagccgt gtttttcacc 180caattctact tcgaagaacg ctacttcctg ggcaataaaa tcaaagccgt gtttttcacc 180
ccgggtgttt ttctggaaca gtattacacg ctgtatcatc tgaaacgcaa caatgaatac 240ccgggtgttt ttctggaaca gtattacacg ctgtatcatc tgaaacgcaa caatgaatac 240
tttgtcgata acgtgattct gagctctttc aatcacccga ccgtggacct ggaaaaatca 300tttgtcgata acgtgattct gagctctttc aatcacccga ccgtggacct ggaaaaatca 300
cagaaaatcc aagcactgtt catcgatgtt atcaacggct acgaaaaata cctgtcgaaa 360cagaaaatcc aagcactgtt catcgatgtt atcaacggct acgaaaaata cctgtcgaaa 360
ctgaccgctt tcgatgttta tctgcgttac aaagaactgt atgaaaatca gcgcattacg 420ctgaccgctt tcgatgttta tctgcgttac aaagaactgt atgaaaatca gcgcattacg 420
agcggtgttt acatgtgcgc tgtcgcgatc gccatgggct ataccgatat ttacctgacg 480agcggtgttt acatgtgcgc tgtcgcgatc gccatgggct ataccgatat ttacctgacg 480
ggtatcgact tttatcaagc gtctgaagaa aactacgcct tcgataacaa aaaaccgaat 540ggtatcgact tttatcaagc gtctgaagaa aactacgcct tcgataacaa aaaaccgaat 540
attatccgtc tgctgccgga ctttcgcaaa gaaaaaaccc tgttcagcta tcattctaaa 600attatccgtc tgctgccgga ctttcgcaaa gaaaaaaccc tgttcagcta tcattctaaa 600
gatattgacc tggaagcgct gtcatttctg cagcaacatt accacgtgaa cttctactca 660gatattgacc tggaagcgct gtcatttctg cagcaacatt accacgtgaa cttctactca 660
atctcgccga tgagtccgct gtccaaacat tttccgatcc cgacggttga agatgactgt 720atctcgccga tgagtccgct gtccaaacat tttccgatcc cgacggttga agatgactgt 720
gaaaccacgt tcgtcgcccc gctgaaagaa aactatatta atgacatcct gctgccgccg 780gaaaccacgt tcgtcgcccc gctgaaagaa aactatatta atgacatcct gctgccgccg 780
cactttgtct atgaaaaact gggcgtggat aaactggcgg ccgcactgga acatcaccat 840cactttgtct atgaaaaact gggcgtggat aaactggcgg ccgcactgga acatcaccat 840
caccatcact aa 852caccatcact aa 852
<210> 7<210> 7
<211> 1158<211> 1158
<212> ДНК<212> DNA
<213> Pasteurella dagmatis<213> Pasteurella dagmatis
<400> 7<400> 7
atgaccattt acctggaccc ggcgtctctg ccgaccctga accaactgat gcattttacg 60atgaccattt acctggaccc ggcgtctctg ccgaccctga accaactgat gcattttacg 60
aaagaaagcg aagacaaaga aaccgcacgt atttttggct tctctcgctt taaactgccg 120aaagaaagcg aagacaaaga aaccgcacgt atttttggct tctctcgctt taaactgccg 120
gaaaaaatca cggaacagta caacaacatc catttcgtgg aaatcaaaaa caatcgtccg 180gaaaaaatca cggaacagta caacaacatc catttcgtgg aaatcaaaaa caatcgtccg 180
acggaagata ttttcaccat cctggaccag tacccggaaa aactggaact ggatctgcat 240acggaagata ttttcaccat cctggaccag tacccggaaa aactggaact ggatctgcat 240
ctgaacattg cacacagcat ccagctgttt catccgattc tgcaatatcg tttcaaacac 300ctgaacattg cacacagcat ccagctgttt catccgattc tgcaatatcg tttcaaacac 300
ccggatcgca ttagtatcaa atccctgaac ctgtatgatg acggcaccat ggaatacgtt 360ccggatcgca ttagtatcaa atccctgaac ctgtatgatg acggcaccat ggaatacgtt 360
gatctggaaa aagaagaaaa caaagacatc aaaagtgcga tcaaaaaagc cgaaaaacag 420gatctggaaa aagaagaaaa caaagacatc aaaagtgcga tcaaaaaagc cgaaaaacag 420
ctgtccgatt atctgctgac gggtaaaatt aactttgaca atccgaccct ggcacgctac 480ctgtccgatt atctgctgac gggtaaaatt aactttgaca atccgaccct ggcacgctac 480
gtttggcagt cacaatatcc ggtcaaatac catttcctgt cgacggaata ttttgaaaaa 540gtttggcagt cacaatatcc ggtcaaatac catttcctgt cgacggaata ttttgaaaaa 540
gctgaattcc tgcagccgct gaaaacctat ctggcgggca aataccaaaa aatggattgg 600gctgaattcc tgcagccgct gaaaacctat ctggcgggca aataccaaaa aatggattgg 600
tcagcctatg aaaaactgtc gccggaacag caaacgtttt acctgaaact ggtcggtttc 660tcagcctatg aaaaactgtc gccggaacag caaacgtttt acctgaaact ggtcggtttc 660
agtgatgaaa ccaaacagct gtttcacacg gaacaaacca aatttatttt cacgggcacc 720agtgatgaaa ccaaacagct gtttcacacg gaacaaacca aatttatttt cacgggcacc 720
acgacctggg agggtaacac cgatatccgt gaatattacg cgaaacagca actgaatctg 780acgacctggg agggtaacac cgatatccgt gaatattacg cgaaacagca actgaatctg 780
ctgaaacatt ttacccacag cgaaggcgac ctgtttatcg gtgaccagta caaaatctac 840ctgaaacatt ttacccacag cgaaggcgac ctgtttatcg gtgaccagta caaaatctac 840
ttcaaaggcc atccgcgcgg cggtgatatt aacgactata tcctgaaaca cgcaaaagat 900ttcaaaggcc atccgcgcgg cggtgatatt aacgactata tcctgaaaca cgcaaaagat 900
attacgaaca tcccggctaa tattagcttc gaaatcctga tgatgaccgg tctgctgccg 960attacgaaca tcccggctaa tattagcttc gaaatcctga tgatgaccgg tctgctgccg 960
gacaaagtcg gcggtgtggc gagctctctg tacttctctc tgccgaaaga aaaaatcagc 1020gacaaagtcg gcggtgtggc gagctctctg tacttctctc tgccgaaaga aaaaatcagc 1020
cacattatct tcacctctaa caagaaaatt aaaaacaaag aagatgccct gaatgacccg 1080cacattatct tcacctctaa caagaaaatt aaaaacaaag aagatgccct gaatgacccg 1080
tacgtgcgtg ttatgctgcg tctgggtatg attgacaaaa gccaaattat cttctgggat 1140tacgtgcgtg ttatgctgcg tctgggtatg attgacaaaa gccaaattat cttctgggat 1140
tctctgaaac aactgtaa 1158tctctgaaac aactgtaa 1158
<210> 8<210> 8
<211> 1173<211> 1173
<212> ДНК<212> DNA
<213> Photobacterium phosphoreum<213> Photobacterium phosphoreum
<400> 8<400> 8
atgggctgta actccgatag caaacacaat aacagtgatg gcaatattac caaaaacaaa 60atgggctgta actccgatag caaacacaat aacagtgatg gcaatattac caaaaacaaa 60
acgatcgaag tctatgtgga ccgtgcgacc ctgccgacga ttcagcaaat gacccagatc 120acgatcgaag tctatgtgga ccgtgcgacc ctgccgacga ttcagcaaat gacccagatc 120
atcaacgaaa atagcaacaa caaaaaactg atttcatggt cgcgttaccc gatcaatgat 180atcaacgaaa atagcaacaa caaaaaactg atttcatggt cgcgttaccc gatcaatgat 180
gaaacgctgc tggaatcaat taatggctcg tttttcaaaa accgcccgga actgatcaaa 240gaaacgctgc tggaatcaat taatggctcg tttttcaaaa accgcccgga actgatcaaa 240
agtctggatt ccatgattct gaccaacgaa attaagaaag tgatcatcaa cggtaacacg 300agtctggatt ccatgattct gaccaacgaa attaagaaag tgatcatcaa cggtaacacg 300
ctgtgggcag ttgacgtggt taatattatc aaaagcattg aagctctggg caagaaaacc 360ctgtgggcag ttgacgtggt taatattatc aaaagcattg aagctctggg caagaaaacc 360
gaaatcgaac tgaacttcta tgatgacggt tctgcggaat atgtgcgtct gtacgatttt 420gaaatcgaac tgaacttcta tgatgacggt tctgcggaat atgtgcgtct gtacgatttt 420
agccgcctgc cggaatctga acaggaatac aaaattagcc tgtctaaaga taacattcag 480agccgcctgc cggaatctga acaggaatac aaaattagcc tgtctaaaga taacattcag 480
agcagcatca acggcaccca accgttcgac aacagcatcg aaaacatcta cggtttctct 540agcagcatca acggcaccca accgttcgac aacagcatcg aaaacatcta cggtttctct 540
cagctgtatc cgaccacgta ccacatgctg cgtgccgata tctttgaaac caatctgccg 600cagctgtatc cgaccacgta ccacatgctg cgtgccgata tctttgaaac caatctgccg 600
ctgacgagtc tgaaacgcgt tatctccaac aacatcaaac agatgaaatg ggattacttc 660ctgacgagtc tgaaacgcgt tatctccaac aacatcaaac agatgaaatg ggattacttc 660
accacgttca attcccagca gaaaaacaaa ttttacaact tcaccggctt caacccggaa 720accacgttca attcccagca gaaaaacaaa ttttacaact tcaccggctt caacccggaa 720
aaaatcaaag aacaatacaa agcgagtccg cacgaaaatt ttattttcat tggcaccaac 780aaaatcaaag aacaatacaa agcgagtccg cacgaaaatt ttattttcat tggcaccaac 780
tccggcaccg ccaccgcaga acagcaaatt gatatcctga ccgaagccaa aaaaccggac 840tccggcaccg ccaccgcaga acagcaaatt gatatcctga ccgaagccaa aaaaccggac 840
tcaccgatta tcaccaacag cattcagggc ctggacctgt ttttcaaagg tcatccgtct 900tcaccgatta tcaccaacag cattcagggc ctggacctgt ttttcaaagg tcatccgtct 900
gcgacctata accagcaaat tatcgacgcc cacaacatga tcgaaatcta caacaaaatc 960gcgacctata accagcaaat tatcgacgcc cacaacatga tcgaaatcta caacaaaatc 960
ccgttcgaag cactgatcat gaccgatgca ctgccggacg ctgttggcgg tatgggtagt 1020ccgttcgaag cactgatcat gaccgatgca ctgccggacg ctgttggcgg tatgggtagt 1020
tccgtctttt tctcactgcc gaataccgtc gaaaacaaat tcattttcta taaatcggat 1080tccgtctttt tctcactgcc gaataccgtc gaaaacaaat tcattttcta taaatcggat 1080
acggacattg aaaacaatgc tctgatccag gttatgatcg aactgaatat cgtgaaccgc 1140acggacattg aaaacaatgc tctgatccag gttatgatcg aactgaatat cgtgaaccgc 1140
aatgatgtga aactgattag tgacctgcaa taa 1173aatgatgtga aactgattag tgacctgcaa taa 1173
<210> 9<210> 9
<211> 1254<211> 1254
<212> ДНК<212> DNA
<213> Avibacterium paragallinarum<213> Avibacterium paragallinarum
<400> 9<400> 9
atgcgtaaaa tcatcacctt cttcagcctg ttcttctcga tctcagcgtg gtgtcaaaaa 60atgcgtaaaa tcatcacctt cttcagcctg ttcttctcga tctcagcgtg gtgtcaaaaa 60
atggaaatct acctggacta tgcgtcgctg ccgagcctga acatgatcct gaacctggtt 120atggaaatct acctggacta tgcgtcgctg ccgagcctga acatgatcct gaacctggtt 120
gaaaacaaaa acaacgaaaa agtcgaacgt attatcggct tcgaacgctt tgatttcaac 180gaaaacaaaa acaacgaaaa agtcgaacgt attatcggct tcgaacgctt tgatttcaac 180
aaagaaattc tgaatagctt ctctaaagaa cgtatcgaat ttagtaaagt ctccattctg 240aaagaaattc tgaatagctt ctctaaagaa cgtatcgaat ttagtaaagt ctccattctg 240
gatatcaaag aattttcaga caaactgtac ctgaacattg aaaaatcgga tacgccggtg 300gatatcaaag aattttcaga caaactgtac ctgaacattg aaaaatcgga tacgccggtg 300
gacctgatta tccataccaa tctggatcac tcagttcgtt cgctgctgag catctttaaa 360gacctgatta tccataccaa tctggatcac tcagttcgtt cgctgctgag catctttaaa 360
accctgagtc cgctgttcca taaaatcaac atcgaaaaac tgtacctgta cgatgacggc 420accctgagtc cgctgttcca taaaatcaac atcgaaaaac tgtacctgta cgatgacggc 420
agcggtaact atgttgatct gtaccagcac cgccaagaaa atatttctgc gattctgatc 480agcggtaact atgttgatct gtaccagcac cgccaagaaa atatttctgc gattctgatc 480
gaagcccaga aaaaactgaa agacgcgctg gaaaatcgtg aaacggatac cgacaaactg 540gaagcccaga aaaaactgaa agacgcgctg gaaaatcgtg aaacggatac cgacaaactg 540
catagcctga cgcgctatac ctggcacaaa atctttccga cggaatatat cctgctgcgt 600catagcctga cgcgctatac ctggcacaaa atctttccga cggaatatat cctgctgcgt 600
ccggattacc tggatattga cgaaaaaatg caaccgctga aacatttcct gagcgatacc 660ccggattacc tggatattga cgaaaaaatg caaccgctga aacatttcct gagcgatacc 660
atcgtgtcta tggacctgtc tcgctttagt catttctcca aaaaccagaa agaactgttt 720atcgtgtcta tggacctgtc tcgctttagt catttctcca aaaaccagaa agaactgttt 720
ctgaaaatca cgcacttcga tcaaaacatc ttcaacgaac tgaacatcgg caccaaaaac 780ctgaaaatca cgcacttcga tcaaaacatc ttcaacgaac tgaacatcgg caccaaaaac 780
aaagaataca aaacgttcat cttcaccggc accacgacct gggaaaaaga taagaaaaaa 840aaagaataca aaacgttcat cttcaccggc accacgacct gggaaaaaga taagaaaaaa 840
cgtctgaaca acgcgaaact gcagacggaa attctggaat cttttatcaa accgaacggc 900cgtctgaaca acgcgaaact gcagacggaa attctggaat cttttatcaa accgaacggc 900
aaattctacc tgggtaacga tatcaaaatc tttttcaaag gccacccgaa aggtgatgac 960aaattctacc tgggtaacga tatcaaaatc tttttcaaag gccacccgaa aggtgatgac 960
attaacgact acattatccg caaaaccggc gcagaaaaaa ttccggctaa catcccgttt 1020attaacgact acattatccg caaaaccggc gcagaaaaaa ttccggctaa catcccgttt 1020
gaagttctga tgatgacgaa tagtctgccg gattatgtcg gcggtattat gagtaccgtg 1080gaagttctga tgatgacgaa tagtctgccg gattatgtcg gcggtattat gagtaccgtg 1080
tacttttccc tgccgccgaa aaatattgat aaagtggttt tcctgggttc cgaaaaaatc 1140tacttttccc tgccgccgaa aaatattgat aaagtggttt tcctgggttc cgaaaaaatc 1140
aaaaacgaaa acgacgccaa atcacagacc ctgtcgaaac tgatgctgat gctgaacgtc 1200aaaaacgaaa acgacgccaa atcacagacc ctgtcgaaac tgatgctgat gctgaacgtc 1200
atcacgccgg aacagatttt ctttgaagaa atgccgaacc cgattaactt ttaa 1254atcacgccgg aacagatttt ctttgaagaa atgccgaacc cgattaactt ttaa 1254
<210> 10<210> 10
<211> 1293<211> 1293
<212> ДНК<212> DNA
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 10<400> 10
atgacccgca cccgtatgga aaacgaactg attgtgagca aaaacatgca gaacattatt 60atgacccgca cccgtatgga aaacgaactg attgtgagca aaaacatgca gaacattatt 60
atcgccggta acggtccgag cctgaaaaat attaactata aacgtctgcc gcgcgaatac 120atcgccggta acggtccgag cctgaaaaat attaactata aacgtctgcc gcgcgaatac 120
gatgtgttcc gttgcaacca gttctacttc gaagacaaat actacctggg caagaaaatt 180gatgtgttcc gttgcaacca gttctacttc gaagacaaat actacctggg caagaaaatt 180
aaagccgtgt ttttcaatcc gggcgtgttt ctgcaacaat atcataccgc aaaacagctg 240aaagccgtgt ttttcaatcc gggcgtgttt ctgcaacaat atcataccgc aaaacagctg 240
attctgaaaa acgaatacga aatcaaaaac atcttttgta gcaccttcaa tctgccgttt 300attctgaaaa acgaatacga aatcaaaaac atcttttgta gcaccttcaa tctgccgttt 300
atcgaatcta acgatttcct gcaccaattt tataactttt tcccggacgc taaactgggc 360atcgaatcta acgatttcct gcaccaattt tataactttt tcccggacgc taaactgggc 360
tacgaagtca tcgaaaacct gaaagaattt tacgcgtaca tcaaatacaa cgaaatctac 420tacgaagtca tcgaaaacct gaaagaattt tacgcgtaca tcaaatacaa cgaaatctac 420
ttcaacaaac gcatcacctc tggcgtgtat atgtgcgcga ttgccatcgc actgggttat 480ttcaacaaac gcatcacctc tggcgtgtat atgtgcgcga ttgccatcgc actgggttat 480
aaaacgattt acctgtgtgg catcgatttc tatgaaggtg acgttattta cccgtttgaa 540aaaacgattt acctgtgtgg catcgatttc tatgaaggtg acgttattta cccgtttgaa 540
gcaatgagta ccaacattaa aacgatcttc ccgggtatca aagatttcaa accgagtaac 600gcaatgagta ccaacattaa aacgatcttc ccgggtatca aagatttcaa accgagtaac 600
tgccattcca aagaatatga catcgaagcg ctgaaactgc tgaaaagcat ctacaaagtt 660tgccattcca aagaatatga catcgaagcg ctgaaactgc tgaaaagcat ctacaaagtt 660
aacatctacg ccctgtgtga tgacagtatt ctggcaaatc atttcccgct gtccattaac 720aacatctacg ccctgtgtga tgacagtatt ctggcaaatc atttcccgct gtccattaac 720
atcaacaaca acttcaccct ggaaaacaaa cacaacaact caatcaacga tattctgctg 780atcaacaaca acttcaccct ggaaaacaaa cacaacaact caatcaacga tattctgctg 780
accgacaata cgccgggcgt ctcgttttat aaaaatcagc tgaaagccga taacaaaatc 840accgacaata cgccgggcgt ctcgttttat aaaaatcagc tgaaagccga taacaaaatc 840
atgctgaact tctacaacat cctgcatagc aaagataacc tgatcaaatt cctgaacaaa 900atgctgaact tctacaacat cctgcatagc aaagataacc tgatcaaatt cctgaacaaa 900
gaaatcgctg ttctgaaaaa acagaccacg caacgtgcta aagcgcgcat tcagaaccac 960gaaatcgctg ttctgaaaaa acagaccacg caacgtgcta aagcgcgcat tcagaaccac 960
ctgagctata aactgggcca agccctgatt atcaatagca aatctgtcct gggtttcctg 1020ctgagctata aactgggcca agccctgatt atcaatagca aatctgtcct gggtttcctg 1020
tctctgccgt ttattatcct gtcaattgtg atctcgcaca aacaggaaca aaaagcgtat 1080tctctgccgt ttattatcct gtcaattgtg atctcgcaca aacaggaaca aaaagcgtat 1080
aaattcaaag tgaagaaaaa cccgaacctg gcactgccgc cgctggaaac ctatccggat 1140aaattcaaag tgaagaaaaa cccgaacctg gcactgccgc cgctggaaac ctatccggat 1140
tacaacgaag ccctgaaaga aaaagaatgc ttcacgtaca aactgggcga agaatttatc 1200tacaacgaag ccctgaaaga aaaagaatgc ttcacgtaca aactgggcga agaatttatc 1200
aaagcaggta aaaactggta tggcgaaggt tacatcaaat ttatcttcaa agatgttccg 1260aaagcaggta aaaactggta tggcgaaggt tacatcaaat ttatcttcaa agatgttccg 1260
cgtctgaaac gtgaatttga aaaaggcgaa taa 1293cgtctgaaac gtgaatttga aaaaggcgaa taa 1293
<210> 11<210> 11
<211> 1188<211> 1188
<212> ДНК<212> DNA
<213> Heliobacter acinonychis<213> Heliobacter acinonychis
<400> 11<400> 11
atgaataaga aaccgctgat tattgctggc aacgggccaa gcatcaaaga cttagattat 60atgaataaga aaccgctgat tattgctggc aacgggccaa gcatcaaaga cttagattat 60
gcgttgttcc cgaaagactt tgatgtattc cgatgtaatc aattctactt cgaggacaaa 120gcgttgttcc cgaaagactt tgatgtattc cgatgtaatc aattctactt cgaggacaaa 120
tactatttag ggcgggaaat aaaaggggtg ttctttaacg cgcacgtctt cgatctccaa 180tactatttag ggcgggaaat aaaaggggtg ttctttaacg cgcacgtctt cgatctccaa 180
atgaagatca ctaaagccat agtcaaaaac ggggaatatc acccggacca catatattgc 240atgaagatca ctaaagccat agtcaaaaac ggggaatatc acccggacca catatattgc 240
acacatgtcg aaccgtacgg ttacgttaac ggaaaccagc aactcatgca agagtacctg 300acacatgtcg aaccgtacgg ttacgttaac ggaaaccagc aactcatgca agagtacctg 300
gaaaaacatt ttgtgggagt ccgaagcacg tacgcatacc tgaaagatct agagccattc 360gaaaaacatt ttgtgggagt ccgaagcacg tacgcatacc tgaaagatct agagccattc 360
tttattctgc acagtaagta tcgcaacttc tacgaccagc acttcacaac gggcatcatg 420tttattctgc acagtaagta tcgcaacttc tacgaccagc acttcacaac gggcatcatg 420
atgctactgg tggccatcca attgggatac aaagaaatat acctgtgcgg aatagacttc 480atgctactgg tggccatcca attgggatac aaagaaatat acctgtgcgg aatagacttc 480
tacgaaaacg gattcggaca tttctacgag aaccaagggg gattctttga agaggatagc 540tacgaaaacg gattcggaca tttctacgag aaccaagggg gattctttga agaggatagc 540
gatccgatgc acgataagaa catagacatc caagcactgg aactggcaaa gaaatacgcg 600gatccgatgc acgataagaa catagacatc caagcactgg aactggcaaa gaaatacgcg 600
aaaatctacg cactggtacc gaacagcgcc ctagtgaaaa tgattccgtt gagcagccaa 660aaaatctacg cactggtacc gaacagcgcc ctagtgaaaa tgattccgtt gagcagccaa 660
aaaggagttc tggaaaaggt gaaggaccgg atcgggttgg gcgagtttaa gagagagaaa 720aaaggagttc tggaaaaggt gaaggaccgg atcgggttgg gcgagtttaa gagagagaaa 720
ttcgggcaaa aagaattgga aagacagaag gaattagaac gacaaaaaga gctcgaacgc 780ttcgggcaaa aagaattgga aagacagaag gaattagaac gacaaaaaga gctcgaacgc 780
caaaaggagc ttgaacgtca aaaggaactt gaacgacaaa aagagttgga gaggcagaaa 840caaaaggagc ttgaacgtca aaaggaactt gaacgacaaa aagagttgga gaggcagaaa 840
gaactcgaac gccaaaaaga attagagaga cagaaggaat tagagcgcca aaaggagctt 900gaactcgaac gccaaaaaga attagagaga cagaaggaat tagagcgcca aaaggagctt 900
gagcgtcaaa aagaattaga gaggcagaag gagttagaaa ggcagaaaga actggagaga 960gagcgtcaaa aagaattaga gaggcagaag gagttagaaa ggcagaaaga actggagaga 960
cagaaagaac tcgaaaggca gaaggagttg gaacgccaaa aagaactaga attagaacga 1020cagaaagaac tcgaaaggca gaaggagttg gaacgccaaa aagaactaga attagaacga 1020
tccttaaaag cacgattgaa agcggtactc gcgagcaaag gcatccgcgg cgacaacctg 1080tccttaaaag cacgattgaa agcggtactc gcgagcaaag gcatccgcgg cgacaacctg 1080
ataatcgtaa gtttaaaaga cacctaccga ctgtttaaag ggggatttgc gttactcttg 1140ataatcgtaa gtttaaaaga cacctaccga ctgtttaaag ggggatttgc gttactcttg 1140
gacctgaagg cgctaaagtc aatcattaaa gcattcctga agagataa 1188gacctgaagg cgctaaagtc aatcattaaa gcattcctga agagataa 1188
<210> 12<210> 12
<211> 783<211> 783
<212> ДНК<212> DNA
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 12<400> 12
atgggcaaaa aagtgattat tgcgggcaac ggcccgagcc tgaaagaaat tgattatagc 60atgggcaaaa aagtgattat tgcgggcaac ggcccgagcc tgaaagaaat tgattatagc 60
cgtctgccga acgattttga tgtgtttcgc tgcaaccagt tttatttcga agataaatat 120cgtctgccga acgattttga tgtgtttcgc tgcaaccagt tttatttcga agataaatat 120
tacctgggca aaaaatgcaa agcggtgttc tataatccga tcctgttctt cgaacagtat 180tacctgggca aaaaatgcaa agcggtgttc tataatccga tcctgttctt cgaacagtat 180
tacaccctga aacatctgat tcagaaccag gaatatgaaa ccgaactgat catgtgcagc 240tacaccctga aacatctgat tcagaaccag gaatatgaaa ccgaactgat catgtgcagc 240
aactataacc aggcgcatct ggaaaacgaa aactttgtga aaaccttcta cgattatttt 300aactataacc aggcgcatct ggaaaacgaa aactttgtga aaaccttcta cgattatttt 300
ccggatgcgc atctgggcta tgattttttc aaacagctga aagatttcaa cgcgtacttc 360ccggatgcgc atctgggcta tgattttttc aaacagctga aagatttcaa cgcgtacttc 360
aaattccacg aaatctattt caaccagcgt attaccagcg gcgtgtatat gtgcgcggtg 420aaattccacg aaatctattt caaccagcgt attaccagcg gcgtgtatat gtgcgcggtg 420
gcgattgcgc tgggctataa agaaatttat ctgagcggca tcgattttta tcagaacggc 480gcgattgcgc tgggctataa agaaatttat ctgagcggca tcgattttta tcagaacggc 480
agcagctatg cgtttgatac caaacagaaa aacctgctga aactggcccc gaactttaaa 540agcagctatg cgtttgatac caaacagaaa aacctgctga aactggcccc gaactttaaa 540
aacgataaca gccactatat tggccatagc aaaaacaccg atatcaaagc gctggaattt 600aacgataaca gccactatat tggccatagc aaaaacaccg atatcaaagc gctggaattt 600
ctggaaaaaa cctataaaat caaactgtat tgcctgtgcc cgaacagcct gctggccaac 660ctggaaaaaa cctataaaat caaactgtat tgcctgtgcc cgaacagcct gctggccaac 660
tttattgaac tggcaccgaa tctgaacagc aacttcatca tccaggaaaa aaacaactat 720tttattgaac tggcaccgaa tctgaacagc aacttcatca tccaggaaaa aaacaactat 720
accaaagata ttctgattcc gagcagcgaa gcgtatggca aattcagcaa aaacatcaac 780accaaagata ttctgattcc gagcagcgaa gcgtatggca aattcagcaa aaacatcaac 780
taa 783taa 783
<210> 13<210> 13
<211> 897<211> 897
<212> ДНК<212> DNA
<213> Streptococcus entericus<213> Streptococcus entericus
<400> 13<400> 13
atgaagaaag tctacttctg ccatacggtc taccatctgc tgattaccct gtgcaaaatt 60atgaagaaag tctacttctg ccatacggtc taccatctgc tgattaccct gtgcaaaatt 60
agcgttgaag aacaagttga aattattgtg ttcgataccg ttagtaatca tgaactgatt 120agcgttgaag aacaagttga aattattgtg ttcgataccg ttagtaatca tgaactgatt 120
gtccagaaaa tccgcgacgt gtttgttaac accacggtgc tgttcgcaga acaaaatacc 180gtccagaaaa tccgcgacgt gtttgttaac accacggtgc tgttcgcaga acaaaatacc 180
gatttttcca ttctggaaat cgatcgcgct acggacattt atgtgttcaa cgactggacc 240gatttttcca ttctggaaat cgatcgcgct acggacattt atgtgttcaa cgactggacc 240
ccgatcggcg cgtatctgcg taaaaacaaa ctgttttacc atctgatcga agatggttat 300ccgatcggcg cgtatctgcg taaaaacaaa ctgttttacc atctgatcga agatggttat 300
aactaccacg aatataacgt ttacgcgaat gccctgacca tgaaacgtcg cctgctgaac 360aactaccacg aatataacgt ttacgcgaat gccctgacca tgaaacgtcg cctgctgaac 360
ttcgtgctgc gtcgcgaaga accgtcaggc ttttcgcgtt atgttcgcag cattgaagtt 420ttcgtgctgc gtcgcgaaga accgtcaggc ttttcgcgtt atgttcgcag cattgaagtt 420
aaccgtgtca aatacctgcc gaatgattgc cgcaaaagca aatgggttga aaaaccgcgt 480aaccgtgtca aatacctgcc gaatgattgc cgcaaaagca aatgggttga aaaaccgcgt 480
tctgccctgt tcgaaaatct ggtcccggaa cataaacaga aaatcatcac gatcttcggc 540tctgccctgt tcgaaaatct ggtcccggaa cataaacaga aaatcatcac gatcttcggc 540
ctggaaaact atcaagatag cctgcgcggt gtcctggtgc tgacccagcc gctggtgcaa 600ctggaaaact atcaagatag cctgcgcggt gtcctggtgc tgacccagcc gctggtgcaa 600
gactactggg atcgcgacat taccacggaa gaagaacagc tggaatttta tcgtcaaatc 660gactactggg atcgcgacat taccacggaa gaagaacagc tggaatttta tcgtcaaatc 660
gtggaatctt acggcgaagg tgaacaggtg tttttcaaaa ttcacccgcg tgataaagtt 720gtggaatctt acggcgaagg tgaacaggtg tttttcaaaa ttcacccgcg tgataaagtt 720
gactatagct ctctgaccaa cgtcattttt ctgaagaaaa acgtcccgat ggaagtgtac 780gactatagct ctctgaccaa cgtcattttt ctgaagaaaa acgtcccgat ggaagtgtac 780
gaactgattg ccgattgtca ttttaccaaa ggtatcacgc acagttccac cgcactggac 840gaactgattg ccgattgtca ttttaccaaa ggtatcacgc acagttccac cgcactggac 840
ttcctgtcct gtgtggataa gaaaatcacc ctgaaacaaa tgaaagcaaa tagttaa 897ttcctgtcct gtgtggataa gaaaatcacc ctgaaacaaa tgaaagcaaa tagttaa 897
<210> 14<210> 14
<211> 888<211> 888
<212> ДНК<212> DNA
<213> Haemophilus ducreyi<213>Haemophilus ducreyi
<400> 14<400> 14
atgaaagaaa tcgccatcat ctccaaccaa cgcatgttct tcctgtactg tctgctgacc 60atgaaagaaa tcgccatcat ctccaaccaa cgcatgttct tcctgtactg tctgctgacc 60
aataaaaatg tcgaagacgt gttcttcatt tttgaaaaag gcgcgatgcc gaacaatctg 120aataaaaatg tcgaagacgt gttcttcatt tttgaaaaag gcgcgatgcc gaacaatctg 120
accagcattt ctcatttcat cgtgctggat cacagtaaat ccgaatgcta tgactttttc 180accagcattt ctcatttcat cgtgctggat cacagtaaat ccgaatgcta tgactttttc 180
tacttcaact tcatcagttg taaatatcgt ctgcgcggcc tggatgttta cggtgcagac 240tacttcaact tcatcagttg taaatatcgt ctgcgcggcc tggatgttta cggtgcagac 240
catatcaaag gcgctaaatt tttcctggaa cgtcaccgct ttttcgtggt tgaagatggt 300catatcaaag gcgctaaatt tttcctggaa cgtcaccgct ttttcgtggt tgaagatggt 300
atgatgaact acagcaaaaa catgtacgca ttctctctgt tccgtacccg caatccggtg 360atgatgaact acagcaaaaa catgtacgca ttctctctgt tccgtacccg caatccggtg 360
attctgccgg gcggttttca tccgaacgtt aaaaccatct tcctgacgaa agataatccg 420attctgccgg gcggttttca tccgaacgtt aaaaccatct tcctgacgaa agataatccg 420
attccggacc agatcgctca caaacgtgaa atcatcaaca tcaaaaccct gtggcaagcg 480attccggacc agatcgctca caaacgtgaa atcatcaaca tcaaaaccct gtggcaagcg 480
aaaaccgcca cggaaaaaac gaaaattctg agctttttcg aaatcgatat gcaggaaatt 540aaaaccgcca cggaaaaaac gaaaattctg agctttttcg aaatcgatat gcaggaaatt 540
tcagttatca aaaaccgctc gtttgtcctg tatacccaac cgctgtcaga agataaactg 600tcagttatca aaaaccgctc gtttgtcctg tatacccaac cgctgtcaga agataaactg 600
ctgacggaag cggaaaaaat tgacatctat cgtaccattc tgacgaaata caaccattcg 660ctgacggaag cggaaaaaat tgacatctat cgtaccattc tgacgaaata caaccattcg 660
cagaccgtta tcaaaccgca cccgcgcgat aaaacggact ataaacaact gtttccggat 720cagaccgtta tcaaaccgca cccgcgcgat aaaacggact ataaacaact gtttccggat 720
gcctatgtca tgaaaggcac ctacccgagt gaactgctga cgctgctggg tgtcaacttc 780gcctatgtca tgaaaggcac ctacccgagt gaactgctga cgctgctggg tgtcaacttc 780
aacaaagtga tcaccctgtt ttccacggcg gtcttcgatt atccgaaaga aaaaatcgac 840aacaaagtga tcaccctgtt ttccacggcg gtcttcgatt atccgaaaga aaaaatcgac 840
ttctacggca ccgcggtgca tccgaaactg ctggatttct ttgactaa 888ttctacggca ccgcggtgca tccgaaactg ctggatttct ttgactaa 888
<210> 15<210> 15
<211> 1467<211> 1467
<212> ДНК<212> DNA
<213> Alistipes sp.<213> Alistipes sp.
<400> 15<400> 15
atggccctgc tgagcggtac cgccgcatgc tcagatgacg aagtctcgca gaacctgatc 60atggccctgc tgagcggtac cgccgcatgc tcagatgacg aagtctcgca gaacctgatc 60
gtgattaatg gcggtgaaca ttttctgagc ctggatggtc tggcccgtgc aggtaaaatt 120gtgattaatg gcggtgaaca ttttctgagc ctggatggtc tggcccgtgc aggtaaaatt 120
agcgtgctgg caccggctcc gtggcgtgtt acgaaagcag ctggtgatac ctggtttcgc 180agcgtgctgg caccggctcc gtggcgtgtt acgaaagcag ctggtgatac ctggtttcgc 180
ctgagcgcaa ccgaaggtcc ggctggttac agcgaagtgg aactgtctct ggatgaaaat 240ctgagcgcaa ccgaaggtcc ggctggttac agcgaagtgg aactgtctct ggatgaaaat 240
ccgggtgccg cacgtagcgc acagctggcg tttgcctgtg gtgatgcgat tgtgccgttc 300ccgggtgccg cacgtagcgc acagctggcg tttgcctgtg gtgatgcgat tgtgccgttc 300
cgcctgagtc aaggcgcact gtccgctggt tatgattcac cggactatta cttttacgtt 360cgcctgagtc aaggcgcact gtccgctggt tatgattcac cggactatta cttttacgtt 360
accttcggca cgatgccgac cctgtatgcc ggtatccatc tgctgagcca cgataaaccg 420accttcggca cgatgccgac cctgtatgcc ggtatccatc tgctgagcca cgataaaccg 420
ggctatgtct tttactcacg ttcgaaaacg tttgacccgg ccgaattccc ggcacgtgct 480ggctatgtct tttactcacg ttcgaaaacg tttgacccgg ccgaattccc ggcacgtgct 480
gaagttacca ccgcagctga tcgtaccgcc gatgcaaccc aggccgaaat ggaagcaatg 540gaagttacca ccgcagctga tcgtaccgcc gatgcaaccc aggccgaaat ggaagcaatg 540
gctcgcgaaa tgaaacgtcg catcctggaa attaactctg cggatccgac cgccgtgttt 600gctcgcgaaa tgaaacgtcg catcctggaa attaactctg cggatccgac cgccgtgttt 600
ggcctgtatg ttgatgacct gcgttgccgc attggctacg attggttcgt ggcgcagggt 660ggcctgtatg ttgatgacct gcgttgccgc attggctacg attggttcgt ggcgcagggt 660
atcgacagtg cccgtgtcaa agtgagcatg ctgtctgatg gcaccggcac gtacaacaat 720atcgacagtg cccgtgtcaa agtgagcatg ctgtctgatg gcaccggcac gtacaacaat 720
ttttataact acttcggtga cgcggccacg gcggaacaaa attgggaaag ttatgcgtcc 780ttttataact acttcggtga cgcggccacg gcggaacaaa attgggaaag ttatgcgtcc 780
gaagttgaag ccctggattg gaatcacggc ggtcgttatc cggaaacccg ctcgctgccg 840gaagttgaag ccctggattg gaatcacggc ggtcgttatc cggaaacccg ctcgctgccg 840
gaatttgaaa gctacacgtg gccgtattac ctgtctaccc gtccggatta tcgcctggtg 900gaatttgaaa gctacacgtg gccgtattac ctgtctaccc gtccggatta tcgcctggtg 900
gttcaggacg gcagtctgct ggaaagctct tgtccgttta ttaccgaaaa actgggtgaa 960gttcaggacg gcagtctgct ggaaagctct tgtccgttta ttaccgaaaa actgggtgaa 960
atggaaatcg aatccattca accgtatgaa atgctgtcag ccctgccgga aagttcccgt 1020atggaaatcg aatccattca accgtatgaa atgctgtcag ccctgccgga aagttcccgt 1020
aaacgctttt atgatatggc aggcttcgat tacgacaaat ttgcagctct gttcgatgcg 1080aaacgctttt atgatatggc aggcttcgat tacgacaaat ttgcagctct gttcgatgcg 1080
tccccgaaga aaaacctgat tatcattggt acctctcatg cggatgatgc cagtgcacgt 1140tccccgaaga aaaacctgat tatcattggt acctctcatg cggatgatgc cagtgcacgt 1140
ctgcagcgtg attacgttgc acgcatcatg gaacagtatg gcgctcaata cgatgtcttt 1200ctgcagcgtg attacgttgc acgcatcatg gaacagtatg gcgctcaata cgatgtcttt 1200
ttcaaaccgc acccggcaga caccacgtca gctggttatg aaacggaatt tccgggcctg 1260ttcaaaccgc acccggcaga caccacgtca gctggttatg aaacggaatt tccgggcctg 1260
accctgctgc cgggtcaaat gccgtttgaa atcttcgttt ggtccctgat tgatcgtgtc 1320accctgctgc cgggtcaaat gccgtttgaa atcttcgttt ggtccctgat tgatcgtgtc 1320
gacatgatcg gcggttatcc gtcaacggtc tttctgaccg ttccggtcga taaagtgcgc 1380gacatgatcg gcggttatcc gtcaacggtc tttctgaccg ttccggtcga taaagtgcgc 1380
tttatttttg ccgcggatgc agcttctctg gtgcgtccgc tgaatatcct gttccgcgat 1440tttatttttg ccgcggatgc agcttctctg gtgcgtccgc tgaatatcct gttccgcgat 1440
gcgaccgacg ttgaatggat gcagtaa 1467gcgaccgacg ttgaatggat gcagtaa 1467
<210> 16<210> 16
<211> 876<211> 876
<212> ДНК<212> DNA
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 16<400> 16
atgaagaaag tgattatcgc cggcaatggt ccgagcctga aagaaattga ttattctcgt 60atgaagaaag tgattatcgc cggcaatggt ccgagcctga aagaaattga ttattctcgt 60
ctgccgaatg atttcgacgt ctttcgctgc aaccagttct actttgaaga caaatattac 120ctgccgaatg atttcgacgt ctttcgctgc aaccagttct actttgaaga caaatattac 120
ctgggcaaaa aatgtaaagc cgtgttttat accccgaact ttttctttga acagtattac 180ctgggcaaaa aatgtaaagc cgtgttttat accccgaact ttttctttga acagtattac 180
acgctgaaac atctgattca gaaccaagaa tatgaaaccg aactgatcat gtgctcaaac 240acgctgaaac atctgattca gaaccaagaa tatgaaaccg aactgatcat gtgctcaaac 240
tacaatcaag cacatctgga aaacgaaaac ttcgtcaaaa cgttctacga ttacttcccg 300tacaatcaag cacatctgga aaacgaaaac ttcgtcaaaa cgttctacga ttacttcccg 300
gacgctcacc tgggttacga tttctttaaa cagctgaaag aattcaacgc gtacttcaaa 360gacgctcacc tgggttacga tttctttaaa cagctgaaag aattcaacgc gtacttcaaa 360
ttccacgaaa tctacttcaa ccaacgtatc acctcaggcg tgtatatgtg tgcggttgcc 420ttccacgaaa tctacttcaa ccaacgtatc acctcaggcg tgtatatgtg tgcggttgcc 420
attgcactgg gttataaaga aatttacctg tcgggcatcg atttttatca gaatggtagc 480attgcactgg gttataaaga aatttacctg tcgggcatcg atttttatca gaatggtagc 480
tcttacgcct tcgacacgaa acaagaaaat ctgctgaaac tggcaccgga ttttaaaaac 540tcttacgcct tcgacacgaa acaagaaaat ctgctgaaac tggcaccgga ttttaaaaac 540
gaccgctcac attatattgg ccactcgaaa aacaccgata tcaaagctct ggaattcctg 600gaccgctcac attatattgg ccactcgaaa aacaccgata tcaaagctct ggaattcctg 600
gaaaaaacgt acaaaatcaa actgtactgc ctgtgtccga atagtctgct ggctaacttt 660gaaaaaacgt acaaaatcaa actgtactgc ctgtgtccga atagtctgct ggctaacttt 660
atcgaactgg cgccgaacct gaattccaac ttcatcatcc aggagaaaaa caactacacc 720atcgaactgg cgccgaacct gaattccaac ttcatcatcc aggagaaaaa caactacacc 720
aaagatatcc tgatcccgag ttccgaagcg tacggcaaat ttagcaaaaa catcaacttc 780aaagatatcc tgatcccgag ttccgaagcg tacggcaaat ttagcaaaaa catcaacttc 780
aagaaaatta aaatcaaaga aaacgtgtat tacaaactga ttaaagatct gctgcgtctg 840aagaaaatta aaatcaaaga aaacgtgtat tacaaactga ttaaagatct gctgcgtctg 840
ccgtctgaca tcaaacatta ttttaaaggt aaataa 876ccgtctgaca tcaaacatta ttttaaaggt aaataa 876
<210> 17<210> 17
<211> 939<211> 939
<212> ДНК<212> DNA
<213> Streptococcus agalactiae<213> Streptococcus agalactiae
<400> 17<400> 17
atgacgaatc gcaaaatcta tgtctgccac accctgtacc atctgctgat ctgcctgtat 60atgacgaatc gcaaaatcta tgtctgccac accctgtacc atctgctgat ctgcctgtat 60
aaagaagaaa tctactcaaa tctggaaatt atcctgagca gcagcattcc ggatgtggac 120aaagaagaaa tctactcaaa tctggaaatt atcctgagca gcagcattcc ggatgtggac 120
aacctggaga aaaaactgaa aagcaaaacc atcaacatcc atattctgga agaatcctca 180aacctggaga aaaaactgaa aagcaaaacc atcaacatcc atattctgga agaatcctca 180
ggcgaatctg aagaactgct gagtgttctg aaagatgcag gtctgtctta cagtaaattc 240ggcgaatctg aagaactgct gagtgttctg aaagatgcag gtctgtctta cagtaaattc 240
gatagcaact gcttcatctt caacgacgct accccgattg gccgtacgct gatcaaacac 300gatagcaact gcttcatctt caacgacgct accccgattg gccgtacgct gatcaaacac 300
ggtatttatt acaatctgat cgaagatggc ctgaactgtt ttacctactc gattttcagc 360ggtatttatt acaatctgat cgaagatggc ctgaactgtt ttacctactc gattttcagc 360
cagaaactgt ggaaatacta cgtgaaaaaa tacatcctgc ataaaattca accgcacggc 420cagaaactgt ggaaatacta cgtgaaaaaa tacatcctgc ataaaattca accgcacggc 420
ttttcccgct actgcctggg tatcgaagtg aacagtctgg ttaatctgcc gaaagatccg 480ttttcccgct actgcctggg tatcgaagtg aacagtctgg ttaatctgcc gaaagatccg 480
cgttacaaaa aattcatcga agtcccgcgc aaagaactgt tcgacaatgt tacggaatac 540cgttacaaaa aattcatcga agtcccgcgc aaagaactgt tcgacaatgt tacggaatac 540
cagaaagaaa tggcgatcaa cctgtttggc gccgtccgtg tgtctattaa atccccgtca 600cagaaagaaa tggcgatcaa cctgtttggc gccgtccgtg tgtctattaa atccccgtca 600
gttctggtcc tgacccagcc gctgtccatc gataaagaat ttatgtcata caacaacaaa 660gttctggtcc tgacccagcc gctgtccatc gataaagaat ttatgtcata caacaacaaa 660
atcgaaacgt cggaagaaca attcaacttc tacaaaagca tcgtgaacga atacatcaac 720atcgaaacgt cggaagaaca attcaacttc tacaaaagca tcgtgaacga atacatcaac 720
aaaggttaca acgtctacct gaaagtgcat ccgcgtgatg tggttgacta ttctaaactg 780aaaggttaca acgtctacct gaaagtgcat ccgcgtgatg tggttgacta ttctaaactg 780
ccggttgaac tgctgccgag taacgtcccg atggaaatta tcgaactgat gctgaccggc 840ccggttgaac tgctgccgag taacgtcccg atggaaatta tcgaactgat gctgaccggc 840
cgctttgaat gcggtattac ccatagcagc accgccctgg atttcctgac ctgtgtggac 900cgctttgaat gcggtattac ccatagcagc accgccctgg atttcctgac ctgtgtggac 900
aagaaaatta cgctggttga tctgaaagac attaaataa 939aagaaaatta cgctggttga tctgaaagac attaaataa 939
<210> 18<210> 18
<211> 1233<211> 1233
<212> ДНК<212> DNA
<213> Bibersteinia trehalosi<213> Bibersteinia trehalosi
<400> 18<400> 18
atggaattct gcaaaatggc aacgacgcaa aaaatctgtg tctacctgga ctatgctacg 60atggaattct gcaaaatggc aacgacgcaa aaaatctgtg tctacctgga ctatgctacg 60
atcccgagcc tgaactacat cctgcacttt gcgcaacatt tcgaagatca ggaaaccatt 120atcccgagcc tgaactacat cctgcacttt gcgcaacatt tcgaagatca ggaaaccatt 120
cgtctgtttg gcctgtcccg cttccacatt ccggaatcag tcatccagcg ctatccgaaa 180cgtctgtttg gcctgtcccg cttccacatt ccggaatcag tcatccagcg ctatccgaaa 180
ggtgtggttc aattttaccc gaaccaggaa aaagacttca gcgcgctgct gctggccctg 240ggtgtggttc aattttaccc gaaccaggaa aaagacttca gcgcgctgct gctggccctg 240
aaaaacatcc tgatcgaagt taaacagcaa cagcgtaaat gcgaaatcga actgcatctg 300aaaaacatcc tgatcgaagt taaacagcaa cagcgtaaat gcgaaatcga actgcatctg 300
aacctgtttc actatcagct gctgctgctg ccgttcctga gtctgtatct ggatacccag 360aacctgtttc actatcagct gctgctgctg ccgttcctga gtctgtatct ggatacccag 360
gactactgtc atctgacgct gaaattttac gatgacggct ctgaagcgat tagtgccctg 420gactactgtc atctgacgct gaaattttac gatgacggct ctgaagcgat tagtgccctg 420
caggaactgg cactggctcc ggatctggcg gcccaaatcc agtttgaaaa acaacagttc 480caggaactgg cactggctcc ggatctggcg gcccaaatcc agtttgaaaa acaacagttc 480
gacgaactgg tcgtgaaaaa atcgtttaaa ctgtcgctgc tgagccgcta tttttggggt 540gacgaactgg tcgtgaaaaa atcgtttaaa ctgtcgctgc tgagccgcta tttttggggt 540
aaactgttcg aaagcgaata catttggttc aatcaagcaa tcctgcagaa agctgaactg 600aaactgttcg aaagcgaata catttggttc aatcaagcaa tcctgcagaa agctgaactg 600
caaattctga aacaggaaat cagctctagt cgtcagatgg attttgcaat ttatcaacag 660caaattctga aacaggaaat cagctctagt cgtcagatgg attttgcaat ttatcaacag 660
atgtccgacg aacaaaaaca gctggtgctg gaaattctga acatcgatct gaataaagtt 720atgtccgacg aacaaaaaca gctggtgctg gaaattctga acatcgatct gaataaagtt 720
gcttacctga aacaactgat ggaaaaccag ccgtcttttc tgttcctggg caccacgctg 780gcttacctga aacaactgat ggaaaaccag ccgtcttttc tgttcctggg caccacgctg 780
tttaatatta cccaggaaac caaaacgtgg ctgatgcaga tgcatgtgga tctgatccaa 840tttaatatta cccaggaaac caaaacgtgg ctgatgcaga tgcatgtgga tctgatccaa 840
cagtattgcc tgccgagcgg ccagtttttc aacaataaag ccggctatct gtgtttttac 900cagtattgcc tgccgagcgg ccagtttttc aacaataaag ccggctatct gtgtttttac 900
aaaggtcacc cgaacgaaaa agaaatgaac caaatgatcc tgtctcagtt caaaaacctg 960aaaggtcacc cgaacgaaaa agaaatgaac caaatgatcc tgtctcagtt caaaaacctg 960
atcgcgctgc cggatgacat tccgctggaa atcctgctgc tgctgggcgt tattccgagt 1020atcgcgctgc cggatgacat tccgctggaa atcctgctgc tgctgggcgt tattccgagt 1020
aaagtcggcg gttttgcatc ctcagctctg tttaacttca ccccggcgca gatcgaaaat 1080aaagtcggcg gttttgcatc ctcagctctg tttaacttca ccccggcgca gatcgaaaat 1080
attatctttt tcacgccgcg ttatttcgaa aaagataatc gcctgcacgc cacgcaatac 1140attatctttt tcacgccgcg ttatttcgaa aaagataatc gcctgcacgc cacgcaatac 1140
cgtctgatgc agggcctgat tgaactgggt tatctggacg ctgaaaaatc tgtgacccac 1200cgtctgatgc agggcctgat tgaactgggt tatctggacg ctgaaaaatc tgtgacccac 1200
tttgaaatca tgcaactgct gacgaaagaa taa 1233tttgaaatca tgcaactgct gacgaaagaa taa 1233
<210> 19<210> 19
<211> 1221<211> 1221
<212> ДНК<212> DNA
<213> Haemophilus parahaemolyticus<213> Haemophilus parahaemolyticus
<400> 19<400> 19
atgaccgaac agtacatcaa aaacgtggaa gtttacctgg attacgcgac catcccgacg 60atgaccgaac agtacatcaa aaacgtggaa gtttacctgg attacgcgac catcccgacg 60
ctgaactact tctaccattt caccgaaaac aaagatgaca tcgccacgat tcgtctgttt 120ctgaactact tctaccattt caccgaaaac aaagatgaca tcgccacgat tcgtctgttt 120
ggcctgggtc gcttcaacat cagtaaatcc atcatcgaaa gctacccgga aggcattatc 180ggcctgggtc gcttcaacat cagtaaatcc atcatcgaaa gctacccgga aggcattatc 180
cgttactgcc cgattatctt tgaagatcaa accgcatttc agcaactgtt cattaccctg 240cgttactgcc cgattatctt tgaagatcaa accgcatttc agcaactgtt cattaccctg 240
ctgacggaag acagtttttg tcagtatcgc tttaacttcc atattaacct gtttcactcc 300ctgacggaag acagtttttg tcagtatcgc tttaacttcc atattaacct gtttcactcc 300
tggaaaatgc tgatcccgct gctgcatatt atctggcagt ttaaacacaa agtcctggat 360tggaaaatgc tgatcccgct gctgcatatt atctggcagt ttaaacacaa agtcctggat 360
attaaactga acttctatga tgacggcagt gaaggtctgg tgacgctgtc caaaatcgaa 420attaaactga acttctatga tgacggcagt gaaggtctgg tgacgctgtc caaaatcgaa 420
cagaactaca gctctgaaat cctgcaaaaa atcatcgata tcgactcaca gtcgttttat 480cagaactaca gctctgaaat cctgcaaaaa atcatcgata tcgactcaca gtcgttttat 480
gcagataaac tgtctttcct ggatgaagac attgctcgtt acctgtggaa cagtctgttt 540gcagataaac tgtctttcct ggatgaagac attgctcgtt acctgtggaa cagtctgttt 540
gaatcccatt attacctgct gaacgacttc ctgctgaaaa acgaaaaact gtcactgctg 600gaatcccatt attacctgct gaacgacttc ctgctgaaaa acgaaaaact gtcactgctg 600
aaaaactcga tcaaatactg ccacatcatg gatctggaac gctacctgca gtttacccaa 660aaaaactcga tcaaatactg ccacatcatg gatctggaac gctacctgca gtttacccaa 660
gaagaaaaag actttttcaa cgaactgctg ggcatcaaca tccagagtct ggaagataaa 720gaagaaaaag actttttcaa cgaactgctg ggcatcaaca tccagagtct ggaagataaa 720
atcaaaatct tccagcagaa gaaaaccttt attttcacgg gtaccacgat cttcagcctg 780atcaaaatct tccagcagaa gaaaaccttt attttcacgg gtaccacgat cttcagcctg 780
ccgaaagaag aagaagaaac cctgtatcgt ctgcatctga acgcaatcct gaattatatt 840ccgaaagaag aagaagaaac cctgtatcgt ctgcatctga acgcaatcct gaattatatt 840
cacccgaacg gcaaatactt tattggcgat ggtttcacgc tggttatcaa aggtcatccg 900cacccgaacg gcaaatactt tattggcgat ggtttcacgc tggttatcaa aggtcatccg 900
caccagaaag aaatgaacag ccgcctggaa aaatcttttg aaaaagctgt catgctgccg 960caccagaaag aaatgaacag ccgcctggaa aaatcttttg aaaaagctgt catgctgccg 960
gataatatcc cgttcgaaat tctgtatctg atcggctgca aaccggacaa aattggcggt 1020gataatatcc cgttcgaaat tctgtatctg atcggctgca aaccggacaa aattggcggt 1020
tttgtgagca cctcttactt cagctgtgat aagaaaaaca ttgcggacct gctgtttatc 1080tttgtgagca cctcttactt cagctgtgat aagaaaaaca ttgcggacct gctgtttatc 1080
tctgcccgtc aagaagaagt tcgcaaaaac gattacctgt ttaacatcca gtaccaactg 1140tctgcccgtc aagaagaagt tcgcaaaaac gattacctgt ttaacatcca gtaccaactg 1140
cgtgacatga tgattaaaac cggttttatc caggaagaaa aaacgcactt ctactcagat 1200cgtgacatga tgattaaaac cggttttatc caggaagaaa aaacgcactt ctactcagat 1200
atcccgatct tcatctcgta a 1221atcccgatct tcatctcgta a 1221
<210> 20<210> 20
<211> 903<211> 903
<212> ДНК<212> DNA
<213> Haemophilus somnus<213> Haemophilus somnus
<400> 20<400> 20
atgaaatata acatcaaaat taaagctatc gtcatcgtgt cgagcctgcg tatgctgctg 60atgaaatata acatcaaaat taaagctatc gtcatcgtgt cgagcctgcg tatgctgctg 60
atcttcctga tgctgaataa ataccacctg gatgaagttc tgtttgtctt caacgaaggc 120atcttcctga tgctgaataa ataccacctg gatgaagttc tgtttgtctt caacgaaggc 120
ttcgaactgc ataaaaaata caaaatcaaa cactatgtgg cgattaaaaa gaaaattacc 180ttcgaactgc ataaaaaata caaaatcaaa cactatgtgg cgattaaaaa gaaaattacc 180
aaattctggc gtctgtacta caaactgtac ttctaccgtt tcaaaattga ccgcatcccg 240aaattctggc gtctgtacta caaactgtac ttctaccgtt tcaaaattga ccgcatcccg 240
gtttatggcg cagatcatct gggttggacc gactattttc tgaaatactt cgatttctac 300gtttatggcg cagatcatct gggttggacc gactattttc tgaaatactt cgatttctac 300
ctgattgaag acggcatcgc taacttctcc ccgaaacgtt acgaaattaa cctgacgcgc 360ctgattgaag acggcatcgc taacttctcc ccgaaacgtt acgaaattaa cctgacgcgc 360
aatatcccgg tctttggttt ccataaaacc gtgaagaaaa tttacctgac gagtctggaa 420aatatcccgg tctttggttt ccataaaacc gtgaagaaaa tttacctgac gagtctggaa 420
aatgttccgt ccgatattcg tcataaagtc gaactgatca gcctggaaca cctgtggaaa 480aatgttccgt ccgatattcg tcataaagtc gaactgatca gcctggaaca cctgtggaaa 480
acccgcacgg cgcaggaaca acacaacatc ctggatttct ttgcctttaa tctggacagc 540acccgcacgg cgcaggaaca acacaacatc ctggatttct ttgcctttaa tctggacagc 540
ctgatctctc tgaaaatgaa aaaatacatc ctgttcaccc agtgcctgtc agaagatcgc 600ctgatctctc tgaaaatgaa aaaatacatc ctgttcaccc agtgcctgtc agaagatcgc 600
gtcatttcgg aacaggaaaa aatcgcgatc taccaacata tcatcaaaaa ctacgatgaa 660gtcatttcgg aacaggaaaa aatcgcgatc taccaacata tcatcaaaaa ctacgatgaa 660
cgtctgctgg ttatcaaacc gcacccgcgc gaaaccacgg actatcagaa atactttgaa 720cgtctgctgg ttatcaaacc gcacccgcgc gaaaccacgg actatcagaa atactttgaa 720
aatgtcttcg tgtaccaaga tgtggttccg agcgaactgt ttgaactgct ggacgtgaac 780aatgtcttcg tgtaccaaga tgtggttccg agcgaactgt ttgaactgct ggacgtgaac 780
ttcgaacgtg ttattaccct gttttctacg gccgtgttca aatatgatcg caatatcgtt 840ttcgaacgtg ttattaccct gttttctacg gccgtgttca aatatgatcg caatatcgtt 840
gacttctacg gtacgcgcat ccacgacaaa atctatcaat ggttcggcga catcaaattc 900gacttctacg gtacgcgcat ccacgacaaa atctatcaat ggttcggcga catcaaattc 900
taa 903taa 903
<210> 21<210> 21
<211> 1146<211> 1146
<212> ДНК<212> DNA
<213> Vibrio harveyi<213> Vibrio harveyi
<400> 21<400> 21
atggattctt cgccggaaaa caccagctct acgctggaaa tttacatcga ttcagcaacc 60atggattctt cgccggaaaa caccagctct acgctggaaa tttacatcga ttcagcaacc 60
ctgccgtcgc tgcagcacat ggtgaaaatt atcgacgaac aaagtggcaa caaaaaactg 120ctgccgtcgc tgcagcacat ggtgaaaatt atcgacgaac aaagtggcaa caaaaaactg 120
atcaactgga aacgttatcc gatcgatgac gaactgctgc tggataaaat caacgctctg 180atcaactgga aacgttatcc gatcgatgac gaactgctgc tggataaaat caacgctctg 180
agcttttctg ataccacgga cctgacccgt tatatggaaa gtattctgct gatcggcgat 240agcttttctg ataccacgga cctgacccgt tatatggaaa gtattctgct gatcggcgat 240
attaaacgcg tggttattaa cggtaatagt ctgtccaact acaatattgt cggcgtgatg 300attaaacgcg tggttattaa cggtaatagt ctgtccaact acaatattgt cggcgtgatg 300
cgctccatca acgccctggg tctggatctg gacgttgaaa tcaattttta tgatgacggt 360cgctccatca acgccctggg tctggatctg gacgttgaaa tcaattttta tgatgacggt 360
tcagcagaat atgtccgtct gtacaacttc tcgcagctgc cggaagctga acgcgaactg 420tcagcagaat atgtccgtct gtacaacttc tcgcagctgc cggaagctga acgcgaactg 420
ctggtgtcaa tgtcgaaaaa caatattctg gcggccgtta acggcatcgg ttcttatgat 480ctggtgtcaa tgtcgaaaaa caatattctg gcggccgtta acggcatcgg ttcttatgat 480
agcggctctc cggaaaatat ttacggtttt gcgcagattt atccggccac ctaccacatg 540agcggctctc cggaaaatat ttacggtttt gcgcagattt atccggccac ctaccacatg 540
ctgcgtgcgg acattttcga tacggacctg gaaatcggcc tgattcgcga tatcctgggt 600ctgcgtgcgg acattttcga tacggacctg gaaatcggcc tgattcgcga tatcctgggt 600
gacaacgtca aacagatgaa atggggccaa tttctgggtt tcaacgaaga acagaaagaa 660gacaacgtca aacagatgaa atggggccaa tttctgggtt tcaacgaaga acagaaagaa 660
ctgttttatc aactgaccag cttcaacccg gataaaatcc aggcgcaata caaagaatct 720ctgttttatc aactgaccag cttcaacccg gataaaatcc aggcgcaata caaagaatct 720
ccgaacaaaa acttcgtttt cgtcggcacc aacagtcgtt ccgcaacggc tgaacagcaa 780ccgaacaaaa acttcgtttt cgtcggcacc aacagtcgtt ccgcaacggc tgaacagcaa 780
atcaacatca tcaaagaagc caaaaaactg gatagcgaaa ttatcccgaa cagcatcgat 840atcaacatca tcaaagaagc caaaaaactg gatagcgaaa ttatcccgaa cagcatcgat 840
ggctatgacc tgtttttcaa aggtcatccg agcgcgacct acaaccagca aattgttgat 900ggctatgacc tgtttttcaa aggtcatccg agcgcgacct acaaccagca aattgttgat 900
gcccacgaca tgaccgaaat ctataatcgc acgccgtttg aagtcctggc aatgacgagt 960gcccacgaca tgaccgaaat ctataatcgc acgccgtttg aagtcctggc aatgacgagt 960
tccctgccgg atgctgtggg cggtatgggc tcatcgctgt ttttctcact gccgaaaacc 1020tccctgccgg atgctgtggg cggtatgggc tcatcgctgt ttttctcact gccgaaaacc 1020
gtggaaacga aattcatttt ctataaaagt ggcaccgata ttgaatccaa tgcgctgatc 1080gtggaaacga aattcatttt ctataaaagt ggcaccgata ttgaatccaa tgcgctgatc 1080
caggttatgc tgaaactggg tatcattacg gacgaaaaag tgcgctttac gacggacatc 1140caggttatgc tgaaactggg tatcattacg gacgaaaaag tgcgctttac gacggacatc 1140
aaataa 1146aaataa 1146
<210> 22<210> 22
<211> 1452<211> 1452
<212> ДНК<212> DNA
<213> Alistipes sp.<213> Alistipes sp.
<400> 22<400> 22
atggccagct gttctgatga cgataaagaa cagacgggtt ttcaaatcga cgatggctct 60atggccagct gttctgatga cgataaagaa cagacggggtt ttcaaatcga cgatggctct 60
ggtttcctga gtctggatgc agctgcgcgt agtggctcca ttgccatcac cgcaaacaat 120ggtttcctga gtctggatgc agctgcgcgt agtggctcca ttgccatcac cgcaaacaat 120
tcatggtcgg tgacgcagga taaagacagc gaatggctga ccctgagcac cacgtctggt 180tcatggtcgg tgacgcagga taaagacagc gaatggctga ccctgagcac cacgtctggt 180
gcagcaggtc gtaccgaaat tggtatcatg ctggaagcga acccgggcga agctcgtaat 240gcagcaggtc gtaccgaaat tggtatcatg ctggaagcga acccgggcga agctcgtaat 240
gcgggtctga cctttaactc tggcggtcgc acgtatccgt tcgtgattac ccagagtgcc 300gcgggtctga cctttaactc tggcggtcgc acgtatccgt tcgtgattac ccagagtgcc 300
catgttacgg cagattttga cgatgctgac cactgctttt atatcacctt tggtaccctg 360catgttacgg cagattttga cgatgctgac cactgctttt atatcacctt tggtaccctg 360
ccgaccctgt atgcaggtct gcatgtgctg tcccacgata aaccgtcata tgtgtttttc 420ccgaccctgt atgcaggtct gcatgtgctg tcccacgata aaccgtcata tgtgtttttc 420
cagcgttccc aaacctttcg cccggaagaa ttcccggccc atgcagaagt tacgattgct 480cagcgttccc aaacctttcg cccggaagaa ttcccggccc atgcagaagt tacgattgct 480
gcggatccgt cagctaatgc gaccgatgaa gacatggaac gtatgcgcac ggccatgaaa 540gcggatccgt cagctaatgc gaccgatgaa gacatggaac gtatgcgcac ggccatgaaa 540
cagcaaattc tgaaaatcaa cgttgaagat ccgaccgcag tttttggcct gtatgtcgac 600cagcaaattc tgaaaatcaa cgttgaagat ccgaccgcag tttttggcct gtatgtcgac 600
gatctgcgtt gtggcattgg ttacgattgg ttcgtcgccc agggtatcga cagtacccgc 660gatctgcgtt gtggcattgg ttacgattgg ttcgtcgccc agggtatcga cagtacccgc 660
gtgaaagtta gtatgctgtc cgatggcacc ggcacgtaca acaacttcta caactacttc 720gtgaaagtta gtatgctgtc cgatggcacc ggcacgtaca acaacttcta caactacttc 720
ggcgatccgg ccaccgcaga acaaaactgg gaaaattacg ccgcacaggt ggaagcgctg 780ggcgatccgg ccaccgcaga acaaaactgg gaaaattacg ccgcacaggt ggaagcgctg 780
gattggcaac acggcggtcg ttttccggaa acccgcatgc cggatggttt tgacttctat 840gattggcaac acggcggtcg ttttccggaa acccgcatgc cggatggttt tgacttctat 840
gaatggccgt attacctggc aacgcgtccg aactaccgcc tggttctgca ggacgatgac 900gaatggccgt attacctggc aacgcgtccg aactaccgcc tggttctgca ggacgatgac 900
ctgctggaag cgacgtctcc gtttatgacc gaacgtctgc agcaaatgcg caccgaatcg 960ctgctggaag cgacgtctcc gtttatgacc gaacgtctgc agcaaatgcg caccgaatcg 960
aaacagccgt atgaactgct ggccagcctg ccggctgaag cccgtcaacg ctttttccgt 1020aaacagccgt atgaactgct ggccagcctg ccggctgaag cccgtcaacg ctttttccgt 1020
atggctggct ttgattacga cgcgtttgct gcgctgttcg atgccagccc gaagaaaaac 1080atggctggct ttgattacga cgcgtttgct gcgctgttcg atgccagccc gaagaaaaac 1080
ctggtcatta tcggcacgtc acatacctcg gaagaaagcg aagcacagca agccgcatat 1140ctggtcatta tcggcacgtc acatacctcg gaagaaagcg aagcacagca agccgcatat 1140
gtggaacgta ttatcggcga ttatggtacc gcctacgaca ttttctttaa accgcacccg 1200gtggaacgta ttatcggcga ttatggtacc gcctacgaca ttttctttaa accgcacccg 1200
gcagatagct ctagttccaa ctacgaagaa cgctttgaag gtctgaccct gctgccgggt 1260gcagatagct ctagttccaa ctacgaagaa cgctttgaag gtctgaccct gctgccgggt 1260
cagatgccgt ttgaaatttt cgtctggtcg ctgctggata aagtggacct gatcggcggt 1320cagatgccgt ttgaaatttt cgtctggtcg ctgctggata aagtggacct gatcggcggt 1320
tattcatcga cggtgtttct gaccgtcccg gtggaaaaaa ccggctttat tttcgctgcg 1380tattcatcga cggtgtttct gaccgtcccg gtggaaaaaa ccggctttat tttcgctgcg 1380
aatgctgaaa gcctgccgcg cccgctgaac gttctgttcc gtaatgcgga acatgtccgc 1440aatgctgaaa gcctgccgcg cccgctgaac gttctgttcc gtaatgcgga acatgtccgc 1440
tggatccagt aa 1452tggatccagt aa 1452
<210> 23<210> 23
<211> 1452<211> 1452
<212> ДНК<212> DNA
<213> Alistipes shahii<213> Alistipes shahii
<400> 23<400> 23
atggacgatg gcaccccgag tgtcagcatc aacggcggca ccgacttcct gagcctggac 60atggacgatg gcaccccgag tgtcagcatc aacggcggca ccgacttcct gagcctggac 60
cacctggcac gcagcggcaa aatcacggtc aacgcaccgg ctccgtggtc tgtgaccctg 120cacctggcac gcagcggcaa aatcacggtc aacgcaccgg ctccgtggtc tgtgaccctg 120
gccccggaaa attacggcca ggatgaaaaa ccggactggc tgaccctgag cgccgaagaa 180gccccggaaa attacggcca ggatgaaaaa ccggactggc tgaccctgag cgccgaagaa 180
ggcccggcag gttatagcga aatcgatgtt acctttgcgg aaaacccggg tccggcccgt 240ggcccggcag gttatagcga aatcgatgtt acctttgcgg aaaacccggg tccggcccgt 240
tccgcatcac tgctgttcag ctgcgatggt aaaaccctgg cctttacggt ttcgcagagc 300tccgcatcac tgctgttcag ctgcgatggt aaaaccctgg cctttacggt ttcgcagagc 300
gcaggcggta cgggtttcga tgctccggac tattactttt atatttcggt cggcaccatg 360gcaggcggta cgggtttcga tgctccggac tattactttt atatttcggt cggcaccatg 360
ccgacgctgt actcgggtct gcatctgctg agccacgata aaccgtctta tgttagttac 420ccgacgctgt actcgggtct gcatctgctg agccacgata aaccgtctta tgttagttac 420
gaacgtgcga gcacctttga tgcggccgaa ttcccggacc gcgcgtttgt ctatccggtg 480gaacgtgcga gcacctttga tgcggccgaa ttcccggacc gcgcgtttgt ctatccggtg 480
gccgatccga ccggtcatgc aaccaacgaa gaactgcgtg cgatgagcga agccatgaaa 540gccgatccga ccggtcatgc aaccaacgaa gaactgcgtg cgatgagcga agccatgaaa 540
cgtcgcatcc tggaaattaa tgcagaagat ccgaccgctg ttttcggtct gtgggtcgat 600cgtcgcatcc tggaaattaa tgcagaagat ccgaccgctg ttttcggtct gtgggtcgat 600
gacctgcgtt gccgcctggg ctacgattgg tttgtggctc aaggtatcga ctctgcgcgc 660gacctgcgtt gccgcctggg ctacgattgg tttgtggctc aaggtatcga ctctgcgcgc 660
gtgaaagtta cgatgctgag tgatggcacc gcgacgtata acaattttca taactacttc 720gtgaaagtta cgatgctgag tgatggcacc gcgacgtata acaattttca taactacttc 720
ggtgacgcag ctaccgccga acagaactgg aatgattatg cggccgaagt tgaagcactg 780ggtgacgcag ctaccgccga acagaactgg aatgattatg cggccgaagt tgaagcactg 780
gactggaatc atggcggtcg ttatccggaa acccgtgccc cggaagaatt cgcctcctac 840gactggaatc atggcggtcg ttatccggaa acccgtgccc cggaagaatt cgcctcctac 840
acctggccgt attacctgtc aacgcgtccg gattatcgcc tgatgctgca aaacagctct 900acctggccgt attacctgtc aacgcgtccg gattatcgcc tgatgctgca aaacagctct 900
ctgatggaaa gttcctgtcc gtttatcgca gatcgcctgg cagctatgaa aatggaatcc 960ctgatggaaa gttcctgtcc gtttatcgca gatcgcctgg cagctatgaa aatggaatcc 960
gtgcagccgt atgaactgct gacggcactg ccggaagctt caaaacagca attctatcgt 1020gtgcagccgt atgaactgct gacggcactg ccggaagctt caaaacagca attctatcgt 1020
atggccaaat ttgattacgc acgctttgct ggcctgttcg acctgtctcc gaagaaaaac 1080atggccaaat ttgattacgc acgctttgct ggcctgttcg acctgtctcc gaagaaaaac 1080
ctgattatca ttggtacctc tcattcatcg gcggccagtg aacagcaaca ggcagcttac 1140ctgattatca ttggtacctc tcattcatcg gcggccagtg aacagcaaca ggcagcttac 1140
gtcgaacgta tcattcaaca gtatggcagt gattacgaca ttttctttaa accgcacccg 1200gtcgaacgta tcattcaaca gtatggcagt gattacgaca ttttctttaa accgcacccg 1200
gcagatagct ctagtgctgg ttatccggac cgctttgaag gtctgaccct gctgccgggt 1260gcagatagct ctagtgctgg ttatccggac cgctttgaag gtctgaccct gctgccgggt 1260
cagatgccgt ttgaaatctt cgtttgggcg ctgctggata aaatcgacat gattggcggt 1320cagatgccgt ttgaaatctt cgtttgggcg ctgctggata aaatcgacat gattggcggt 1320
tatccgtcca ccacgtttat ttcagtgccg ctggataaag ttggctttct gttcgcggcc 1380tatccgtcca ccacgtttat ttcagtgccg ctggataaag ttggctttct gttcgcggcc 1380
gatgccgacg gtctggtccg cccgctgaat atcctgttcc gtgacgctgc aaatgtcgaa 1440gatgccgacg gtctggtccg cccgctgaat atcctgttcc gtgacgctgc aaatgtcgaa 1440
tggattcaat aa 1452tggattcaat aa 1452
<210> 24<210> 24
<211> 1206<211> 1206
<212> ДНК<212> DNA
<213> Actinobacillus suis<213> Actinobacillus suis
<400> 24<400> 24
atggaacgca cgccgcaact gcaagcggtg gacatttaca ttgacttcgc aacgatcccg 60atggaacgca cgccgcaact gcaagcggtg gacatttaca ttgacttcgc aacgatcccg 60
agcctgagct actttctgca ctttctgaaa cataaacacg atgatcagcg tctgcgtctg 120agcctgagct actttctgca ctttctgaaa cataaacacg atgatcagcg tctgcgtctg 120
ttcagcctgg cccgttttga aatgccgcaa accctgattg aacagtatga aggcattatc 180ttcagcctgg cccgttttga aatgccgcaa accctgattg aacagtatga aggcattatc 180
cagttctcgc gcaacgtgga acataatgtt gaaccgctgc tggaacagct gcaaacgatc 240cagttctcgc gcaacgtgga acataatgtt gaaccgctgc tggaacagct gcaaacgatc 240
ctgtcacaag aaggtaaaca gtttgaactg catctgcacc tgaacctgtt tcattcgttc 300ctgtcacaag aaggtaaaca gtttgaactg catctgcacc tgaacctgtt tcattcgttc 300
gaaatgtttc tgaatctgag cccgacctac acgcagtaca aagaaaaaat ctctaaaatc 360gaaatgtttc tgaatctgag cccgacctac acgcagtaca aagaaaaaat ctctaaaatc 360
gttctgcacc tgtatgatga cggcagtgaa ggtgtcatga aacagtacca actgcagaaa 420gttctgcacc tgtatgatga cggcagtgaa ggtgtcatga aacagtacca actgcagaaa 420
agctctagtc tggtgcagga tctggcggcc accaaagcat ctctggttag cctgttcgaa 480agctctagtc tggtgcagga tctggcggcc accaaagcat ctctggttag cctgttcgaa 480
aacggcgaag gttcgtttag ccagattgat ctgatccgtt atgtctggaa tgctgtgctg 540aacggcgaag gttcgtttag ccagattgat ctgatccgtt atgtctggaa tgctgtgctg 540
gaaacccatt attacctgct gtctgatcac tttctgctgg acgaaaaact gcagccgctg 600gaaacccatt attacctgct gtctgatcac tttctgctgg acgaaaaact gcagccgctg 600
aaagcagaac tgggccatta ccaactgctg aacctgagtg cttatcagta cctgtcctca 660aaagcagaac tgggccatta ccaactgctg aacctgagtg cttatcagta cctgtcctca 660
gaagatctgc tgtggctgaa acagattctg aaaatcgaca ccgaactgga aagcctgatg 720gaagatctgc tgtggctgaa acagattctg aaaatcgaca ccgaactgga aagcctgatg 720
caaaaactga cggcgcagcc ggtgtatttc tttagcggta ccacgttttt caacatcagt 780caaaaactga cggcgcagcc ggtgtatttc tttagcggta ccacgttttt caacatcagt 780
ttcgaagata aacaacgtct ggcgaatatc catgccattc tgatccgcga acacctggac 840ttcgaagata aacaacgtct ggcgaatatc catgccattc tgatccgcga acacctggac 840
ccgaactccc agctgtttat tggcgaaccg tacctgtttg tcttcaaagg tcatccgaac 900ccgaactccc agctgtttat tggcgaaccg tacctgtttg tcttcaaagg tcatccgaac 900
tcaccggaaa ttaatcaggc cctgcgtgaa tattacccga acgttatctt cctgccggaa 960tcaccggaaa ttaatcaggc cctgcgtgaa tattacccga acgttatctt cctgccggaa 960
aatattccgt ttgaaatcct gaccctgctg ggcttctccc cgcaaaaaat tggcggtttt 1020aatattccgt ttgaaatcct gaccctgctg ggcttctccc cgcaaaaaat tggcggtttt 1020
gcgtcaacga tccacgttaa ttccgaacag tcaaaactgg ccaaactgtt tttcctgacc 1080gcgtcaacga tccacgttaa ttccgaacag tcaaaactgg ccaaactgtt tttcctgacc 1080
tcgacggatg aacaagaacg ccagctgagc gacggttata ttaaacaata cgcactggct 1140tcgacggatg aacaagaacg ccagctgagc gacggttata ttaaacaata cgcactggct 1140
caggctatgc tggaaatgca actggtctcg caagaacaag tctattactg ctcgctgtcg 1200caggctatgc tggaaatgca actggtctcg caagaacaag tctattactg ctcgctgtcg 1200
tcgtaa 1206tcgtaa 1206
<210> 25<210> 25
<211> 1206<211> 1206
<212> ДНК<212> DNA
<213> Actinobacillus capsulatus<213> Actinobacillus capsulatus
<400> 25<400> 25
atggaacgca tcccgcaact gcaagctgtc gatatttaca ttgacttcgc cacgatcccg 60atggaacgca tcccgcaact gcaagctgtc gatatttaca ttgacttcgc cacgatcccg 60
agcctgtcct actttctgca ctttctgaaa cataaacacg atcatcagcg tctgcgcctg 120agcctgtcct actttctgca ctttctgaaa cataaacacg atcatcagcg tctgcgcctg 120
ttcagcctgg cgcgttttga aatgccgcag accgtcattg aacaatatga aggcattatc 180ttcagcctgg cgcgttttga aatgccgcag accgtcattg aacaatatga aggcattatc 180
cagttctcac gcaacgtgga acacaatgtt gaacaactgc tggaacagct gcaaacgatc 240cagttctcac gcaacgtgga acacaatgtt gaacaactgc tggaacagct gcaaacgatc 240
ctgtcgcagg aaggtaaaca atttgaactg cacctgcatc tgaacctgtt tcacagtttc 300ctgtcgcagg aaggtaaaca atttgaactg cacctgcatc tgaacctgtt tcacagtttc 300
gaaatgtttc tgaatctgtc cccgacctac acgaaataca aagaaaaaat ctcaaaaatc 360gaaatgtttc tgaatctgtc cccgacctac acgaaataca aagaaaaaat ctcaaaaatc 360
gttctgcatc tgtatgatga cggctcggaa ggtgtcatga aacagtacca actgcagcaa 420gttctgcatc tgtatgatga cggctcggaa ggtgtcatga aacagtacca actgcagcaa 420
agtaactccc tggcacagga tctggctagc accaaagcgt cactggtttc gctgttcaaa 480agtaactccc tggcacagga tctggctagc accaaagcgt cactggtttc gctgttcaaa 480
aacggcgaag gtgccttttc tcagattgat ctgatccgtt atgtctggaa tgcagtgctg 540aacggcgaag gtgccttttc tcagattgat ctgatccgtt atgtctggaa tgcagtgctg 540
gaaacccact attacctgct gtcagaccac tttctggccc atgaaaaact gcagccgctg 600gaaacccact attacctgct gtcagaccac tttctggccc atgaaaaact gcagccgctg 600
aaaattgaac tgggccatta ccagctgctg aatctgtctg cctatcaata cctgagctct 660aaaattgaac tgggccatta ccagctgctg aatctgtctg cctatcaata cctgagctct 660
gaagatctgc tgtggctgaa acaaattctg aaaatcgacg cagaactgga aagtctgatg 720gaagatctgc tgtggctgaa acaaattctg aaaatcgacg cagaactgga aagtctgatg 720
cataaactga ccacgcagcc ggtgtatttc tttagcggta ccacgttttt caacatttcg 780cataaactga ccacgcagcc ggtgtatttc tttagcggta ccacgttttt caacatttcg 780
ttcgaagata aacagcgtct ggccaatatc cacgcaattc tgatccgcga acatctggac 840ttcgaagata aacagcgtct ggccaatatc cacgcaattc tgatccgcga acatctggac 840
ccgaacagtc agctgtttat cggcgaaccg tacctgtttg ttttcaaagg tcacccgaac 900ccgaacagtc agctgtttat cggcgaaccg tacctgtttg ttttcaaagg tcacccgaac 900
tccccggaaa ttaatcaggc tctgcgcgaa tattacccga acgcgatctt cctgccggaa 960tccccggaaa ttaatcaggc tctgcgcgaa tattacccga acgcgatctt cctgccggaa 960
aatattccgt ttgaaatcct gaccctgctg ggcttcagcc cgcagaaaat tggcggtttt 1020aatattccgt ttgaaatcct gaccctgctg ggcttcagcc cgcagaaaat tggcggtttt 1020
gcttctacga tccatgtgaa cagcgaacaa tctaaactgg cgaaactgtt tttcctgacc 1080gcttctacga tccatgtgaa cagcgaacaa tctaaactgg cgaaactgtt tttcctgacc 1080
agtacggatg aacaggaacg taatcgctcc gacggttata ttaaacagta cgcgctggcc 1140agtacggatg aacaggaacg taatcgctcc gacggttata ttaaacagta cgcgctggcc 1140
caagcaatgc tggaaatgca actggtctcg caagaacaag tctactactg ctcgctgtcg 1200caagcaatgc tggaaatgca actggtctcg caagaacaag tctactactg ctcgctgtcg 1200
tcgtaa 1206tcgtaa 1206
<210> 26<210> 26
<211> 936<211> 936
<212> ДНК<212> DNA
<213> Haemophilus somnus<213> Haemophilus somnus
<400> 26<400> 26
atgttccgtg aagacaatat gaacctgatt atctgctgta cgccgctgca agtgattatc 60atgttccgtg aagacaatat gaacctgatt atctgctgta cgccgctgca agtgattatc 60
gccgaaaaaa ttatcgaacg ctatccggaa cagaaatttt atggcgttat gctggaatca 120gccgaaaaaa ttatcgaacg ctatccggaa cagaaatttt atggcgttat gctggaatca 120
ttctacaacg ataaattcga cttctacgaa aacaaactga aacatctgtg ccacgaattt 180ttctacaacg ataaattcga cttctacgaa aacaaactga aacatctgtg ccacgaattt 180
ttctgtatca aaatcgcacg tttcaaactg gaacgctata aaaacctgct gtcactgctg 240ttctgtatca aaatcgcacg tttcaaactg gaacgctata aaaacctgct gtcactgctg 240
aaaatcaaaa acaaaacctt cgatcgtgtc ttcctggcta acatcgaaaa acgctacatc 300aaaatcaaaa acaaaacctt cgatcgtgtc ttcctggcta acatcgaaaa acgctacatc 300
catatcatcc tgtcgaacat tttctttaaa gaactgtaca ccttcgatga cggcacggcg 360catatcatcc tgtcgaacat tttctttaaa gaactgtaca ccttcgatga cggcacggcg 360
aacatcgccc cgaatagtca tctgtatcaa gaatacgatc actccctgaa aaaacgtatt 420aacatcgccc cgaatagtca tctgtatcaa gaatacgatc actccctgaa aaaacgtatt 420
accgacatcc tgctgccgaa ccattacaac agcaacaaag tgaaaaacat cagcaaactg 480accgacatcc tgctgccgaa ccattacaac agcaacaaag tgaaaaacat cagcaaactg 480
cactactcta tctaccgctg caaaaacaac atcatcgata acatcgaata catgccgctg 540cactactcta tctaccgctg caaaaacaac atcatcgata acatcgaata catgccgctg 540
tttaacctgg agaaaaaata cacggcacag gataaaagta tttccatcct gctgggtcaa 600tttaacctgg agaaaaaata cacggcacag gataaaagta tttccatcct gctgggtcaa 600
ccgattttct atgacgaaga gaaaaacatt cgtctgatca aagaagtcat cgccaaattc 660ccgattttct atgacgaaga gaaaaacatt cgtctgatca aagaagtcat cgccaaattc 660
aaaatcgatt actacttccc gcacccgcgc gaagattact acatcgacaa cgtgtcttac 720aaaatcgatt actacttccc gcacccgcgc gaagattact acatcgacaa cgtgtcttac 720
atcaaaaccc cgctgatctt tgaagaattt tacgcggaac gttcaatcga aaattcgatc 780atcaaaaccc cgctgatctt tgaagaattt tacgcggaac gttcaatcga aaattcgatc 780
aaaatctata cctttttcag ctctgccgtg ctgaacatcg ttacgaaaga aaatattgat 840aaaatctata cctttttcag ctctgccgtg ctgaacatcg ttacgaaaga aaatattgat 840
cgcatctacg cactgaaacc gaaactgacg gaaaaagcgt atctggattg ttacgacatc 900cgcatctacg cactgaaacc gaaactgacg gaaaaagcgt atctggattg ttacgacatc 900
ctgaaagatt tcggtatcaa agttatcgac atctaa 936ctgaaagatt tcggtatcaa agttatcgac atctaa 936
<210> 27<210> 27
<211> 1200<211> 1200
<212> ДНК<212> DNA
<213> Haemophilus ducreyi<213> Haemophilus ducreyi
<400> 27<400> 27
atgctgattc aacagaacct ggaaatctac ctggactacg caaccatccc gagcctggcc 60atgctgattc aacagaacct ggaaatctac ctggactacg caaccatccc gagcctggcc 60
tgctttatgc acttcattca acacaaagat gacgtcgata gtattcgtct gtttggcctg 120tgctttatgc acttcattca acacaaagat gacgtcgata gtattcgtct gtttggcctg 120
gcacgcttcg atatcccgca gtccattatc gaccgttacc cggctaacca cctgttttat 180gcacgcttcg atatcccgca gtccattatc gaccgttacc cggctaacca cctgttttat 180
cacaacatcg ataatcgcga cctgaccgca gtgctgaacc agctggcgga tattctggcc 240cacaacatcg ataatcgcga cctgaccgca gtgctgaacc agctggcgga tattctggcc 240
caggaaaata aacgttttca aatcaacctg catctgaacc tgtttcacag cattgacctg 300caggaaaata aacgttttca aatcaacctg catctgaacc tgtttcacag cattgacctg 300
tttttcgcta tttatccgat ctaccagcaa tatcagcata aaatttctac catccagctg 360tttttcgcta tttatccgat ctaccagcaa tatcagcata aaatttctac catccagctg 360
caactgtacg atgacggcag cgaaggtatt gttacgcagc attctctgtg caaaattgcg 420caactgtacg atgacggcag cgaaggtatt gttacgcagc attctctgtg caaaattgcg 420
gatctggaac agctgatcct gcaacacaaa aacgtgctgc tggaactgct gaccaaaggc 480gatctggaac agctgatcct gcaacacaaa aacgtgctgc tggaactgct gaccaaaggc 480
acggccaacg ttccgaatcc gaccctgctg cgttatctgt ggaacaatat tatcgattca 540acggccaacg ttccgaatcc gaccctgctg cgttatctgt ggaacaatat tatcgattca 540
cagtttcatc tgatctcgga ccattttctg caacacccga aactgcaacc gctgaaacgt 600cagtttcatc tgatctcgga ccattttctg caacacccga aactgcaacc gctgaaacgt 600
ctgctgaaac gctacaccat tctggatttt acgtgttatc cgcgcttcaa tgccgaacag 660ctgctgaaac gctacaccat tctggatttt acgtgttatc cgcgcttcaa tgccgaacag 660
aaacaactgc tgaaagaaat tctgcatatc tcaaacgaac tggaaaatct gctgaaactg 720aaacaactgc tgaaagaaat tctgcatatc tcaaacgaac tggaaaatct gctgaaactg 720
ctgaaacagc acaacacctt tctgttcacg ggcaccacgg cgtttaatct ggatcaggaa 780ctgaaacagc acaacacctt tctgttcacg ggcaccacgg cgtttaatct ggatcaggaa 780
aaactggacc tgctgaccca actgcatatc ctgctgctga acgaacacca gaatccgcat 840aaactggacc tgctgaccca actngcatatc ctgctgctga acgaacacca gaatccgcat 840
tcaacgcact acattggcaa caattatctg ctgctgatca aaggtcatgc aaactcgccg 900tcaacgcact acattggcaa caattatctg ctgctgatca aaggtcatgc aaactcgccg 900
gctctgaatc ataccctggc gctgcacttt ccggatgcga ttttcctgcc ggccaatatt 960gctctgaatc ataccctggc gctgcacttt ccggatgcga ttttcctgcc ggccaatatt 960
ccgtttgaaa tcttcgcgat gctgggcttt acgccgaaca aaatgggcgg tttcgccagc 1020ccgtttgaaa tcttcgcgat gctgggcttt acgccgaaca aaatgggcgg tttcgccagc 1020
acctcttaca ttaattatcc gacggaaaac atcaatcacc tgtttttcct gaccagtgat 1080acctcttaca ttaattatcc gacggaaaac atcaatcacc tgtttttcct gaccagtgat 1080
cagccgtcca ttcgcacgaa atggctggac tacgaaaaac aatttggtct gatgtattcc 1140cagccgtcca ttcgcacgaa atggctggac tacgaaaaac aatttggtct gatgtattcc 1140
ctgctggcaa tgcagaaaat caacgaagat caggcgttta tgtgcaccat tcacaattaa 1200ctgctggcaa tgcagaaaat caacgaagat caggcgttta tgtgcaccat tcacaattaa 1200
<210> 28<210> 28
<211> 1494<211> 1494
<212> ДНК<212> DNA
<213> Photobacterium leiognathi<213> Photobacterium leiognathi
<400> 28<400> 28
atgtgtaacg ataatcaaaa tacggtcgat gttgttgtga gcaccgttaa cgataacgtc 60atgtgtaacg ataatcaaaa tacggtcgat gttgttgtga gcaccgttaa cgataacgtc 60
atcgaaaaca acacgtacca agttaaaccg atcgataccc cgaccacgtt tgacagttac 120atcgaaaaca acacgtacca agttaaaccg atcgataccc cgaccacgtt tgacagttac 120
tcctggattc agacgtgcgg caccccgatc ctgaaagatg acgaaaaata ttcactgtcg 180tcctggattc agacgtgcgg caccccgatc ctgaaagatg acgaaaaata ttcactgtcg 180
tttgatttcg tcgccccgga actggatcag gacgaaaaat tctgtttcga atttaccggc 240tttgatttcg tcgccccgga actggatcag gacgaaaaat tctgtttcga atttaccggc 240
gatgttgacg gtaaacgtta tgtcacgcag accaacctga cggtggttgc accgaccctg 300gatgttgacg gtaaacgtta tgtcacgcag accaacctga cggtggttgc accgaccctg 300
gaagtttacg tcgatcatgc tagtctgccg tccctgcagc aactgatgaa aatcatccag 360gaagtttacg tcgatcatgc tagtctgccg tccctgcagc aactgatgaa aatcatccag 360
cagaaaaacg aatactcaca gaatgaacgt ttcatttcgt ggggccgcat cggtctgacg 420cagaaaaacg aatactcaca gaatgaacgt ttcatttcgt ggggccgcat cggtctgacg 420
gaagataacg cggaaaaact gaatgcccat atttatccgc tggcaggcaa caatacctca 480gaagataacg cggaaaaact gaatgcccat atttatccgc tggcaggcaa caatacctca 480
caggaactgg tggatgcagt gatcgattac gctgactcga aaaaccgtct gaatctggaa 540caggaactgg tggatgcagt gatcgattac gctgactcga aaaaccgtct gaatctggaa 540
ctgaacacga ataccgcgca cagctttccg aacctggccc cgattctgcg cattatcagc 600ctgaacacga ataccgcgca cagctttccg aacctggccc cgattctgcg cattatcagc 600
tctaaaagca acatcctgat ctctaacatc aacctgtacg atgacggcag tgctgaatat 660tctaaaagca acatcctgat ctctaacatc aacctgtacg atgacggcag tgctgaatat 660
gtgaacctgt acaattggaa agataccgaa gacaaatccg tgaaactgag cgattctttc 720gtgaacctgt acaattggaa agataccgaa gacaaatccg tgaaactgag cgattctttc 720
ctggttctga aagactactt taacggtatt agttccgaaa aaccgagcgg catctatggt 780ctggttctga aagactactt taacggtatt agttccgaaa aaccgagcgg catctatggt 780
cgctacaact ggcatcaact gtataatacg tcttattact tcctgcgtaa agattacctg 840cgctacaact ggcatcaact gtataatacg tcttattact tcctgcgtaa agattacctg 840
accgttgaac cgcagctgca cgacctgcgc gaatatctgg gcggtagtct gaaacaaatg 900accgttgaac cgcagctgca cgacctgcgc gaatatctgg gcggtagtct gaaacaaatg 900
tcctgggatg gcttttcaca gctgtcgaaa ggtgacaaag aactgttcct gaacattgtc 960tcctgggatg gcttttcaca gctgtcgaaa ggtgacaaag aactgttcct gaacattgtc 960
ggctttgatc aggaaaaact gcagcaagaa taccagcaat cagaactgcc gaatttcgtg 1020ggctttgatc aggaaaaact gcagcaagaa taccagcaat cagaactgcc gaatttcgtg 1020
tttacgggca ccacgacctg ggcaggcggt gaaaccaaag aatattacgc tcagcaacag 1080tttacgggca ccacgacctg ggcaggcggt gaaaccaaag aatattacgc tcagcaacag 1080
gtgaacgtcg tgaacaatgc gattaatgaa accagcccgt attacctggg ccgtgaacat 1140gtgaacgtcg tgaacaatgc gattaatgaa accagcccgt attacctggg ccgtgaacat 1140
gacctgtttt tcaaaggtca cccgcgcggc ggtattatca atgatattat cctgggcagt 1200gacctgtttt tcaaaggtca cccgcgcggc ggtattatca atgatattat cctgggcagt 1200
ttcaacaata tgattgacat cccggccaaa gtgtcctttg aagttctgat gatgacgggt 1260ttcaacaata tgattgacat cccggccaaa gtgtcctttg aagttctgat gatgacgggt 1260
atgctgccgg ataccgtggg cggtattgcg tcatcgctgt attttagcat cccggccgaa 1320atgctgccgg ataccgtggg cggtattgcg tcatcgctgt attttagcat cccggccgaa 1320
aaagtctctt tcattgtgtt taccagctct gatacgatca ccgatcgtga agacgcgctg 1380aaagtctctt tcattgtgtt taccagctct gatacgatca ccgatcgtga agacgcgctg 1380
aaatctccgc tggtgcaggt tatgatgacc ctgggcattg ttaaagaaaa agatgtgctg 1440aaatctccgc tggtgcaggt tatgatgacc ctgggcattg ttaaagaaaa agatgtgctg 1440
ttctggtcgg atctgccgga ttgttcctcg ggtgtttgta ttgctcagta ttaa 1494ttctggtcgg atctgccgga ttgttcctcg ggtgtttgta ttgctcagta ttaa 1494
<210> 29<210> 29
<211> 1497<211> 1497
<212> ДНК<212> DNA
<213> Photobacterium sp.<213> Photobacterium sp.
<400> 29<400> 29
atgagtgaag aaaacaccca gtccattatt aaaaacgaca tcaacaaaac catcatcgat 60atgagtgaag aaaacaccca gtccattatt aaaaacgaca tcaacaaaac catcatcgat 60
gaagaatacg ttaacctgga accgatcaac cagtctaaca tcagttttac caaacatagc 120gaagaatacg ttaacctgga accgatcaac cagtctaaca tcagttttac caaacatagc 120
tgggtccaga cctgcggtac gcagcaactg ctgacggaac aaaacaaaga atcaatttcg 180tgggtccaga cctgcggtac gcagcaactg ctgacggaac aaaacaaaga atcaatttcg 180
ctgagcgtgg ttgcgccgcg tctggatgac gatgaaaaat actgtttcga tttcaacggt 240ctgagcgtgg ttgcgccgcg tctggatgac gatgaaaaat actgtttcga tttcaacggt 240
gttagtaata aaggcgaaaa atacatcacc aaagtcacgc tgaatgtcgt ggcaccgtct 300gttagtaata aaggcgaaaa atacatcacc aaagtcacgc tgaatgtcgt ggcaccgtct 300
ctggaagttt atgtggatca tgctagtctg ccgaccctgc aacaactgat ggatattatc 360ctggaagttt atgtggatca tgctagtctg ccgaccctgc aacaactgat ggatattatc 360
aaatcggaag aagaaaaccc gaccgcacag cgttacattg cttggggccg catcgtgccg 420aaatcggaag aagaaaaccc gaccgcacag cgttacattg cttggggccg catcgtgccg 420
acggacgaac agatgaaaga actgaatatt accagctttg cgctgatcaa caatcacacg 480acggacgaac agatgaaaga actgaatatt accagctttg cgctgatcaa caatcacacg 480
ccggccgatc tggttcagga aattgtcaaa caggcgcaaa ccaaacatcg tctgaacgtg 540ccggccgatc tggttcagga aattgtcaaa caggcgcaaa ccaaacatcg tctgaacgtg 540
aaactgagca gcaatacggc ccactcgttt gacaatctgg ttccgattct gaaagaactg 600aaactgagca gcaatacggc ccactcgttt gacaatctgg ttccgattct gaaagaactg 600
aacagcttca acaatgtgac cgttacgaat atcgatctgt atgacgatgg cagcgcggaa 660aacagcttca acaatgtgac cgttacgaat atcgatctgt atgacgatgg cagcgcggaa 660
tatgttaacc tgtacaattg gcgcgacacc ctgaacaaaa cggataatct gaaaattggc 720tatgttaacc tgtacaattg gcgcgacacc ctgaacaaaa cggataatct gaaaattggc 720
aaagactatc tggaagatgt cattaacggt atcaatgaag ataccagcaa caccggcacg 780aaagactatc tggaagatgt cattaacggt atcaatgaag ataccagcaa caccggcacg 780
agttccgtgt acaattggca gaaactgtat ccggctaact accattttct gcgtaaagat 840agttccgtgt acaattggca gaaactgtat ccggctaact accattttct gcgtaaagat 840
tatctgaccc tggaaccgtc cctgcacgaa ctgcgcgact acattggtga ttcactgaaa 900tatctgaccc tggaaccgtc cctgcacgaa ctgcgcgact acattggtga ttcactgaaa 900
cagatgcaat gggacggctt caaaaaattc aactcgaaac agcaagaact gtttctgagc 960cagatgcaat gggacggctt caaaaaattc aactcgaaac agcaagaact gtttctgagc 960
atcgtgaatt tcgataaaca gaaactgcaa aacgaataca attcatcgaa cctgccgaat 1020atcgtgaatt tcgataaaca gaaactgcaa aacgaataca attcatcgaa cctgccgaat 1020
tttgtgttca ccggtaccac ggtttgggca ggcaaccacg aacgcgaata ctacgctaaa 1080tttgtgttca ccggtaccac ggtttgggca ggcaaccacg aacgcgaata ctacgctaaa 1080
cagcaaatca acgttatcaa caacgccatc aacgaaagct ctccgcatta tctgggtaat 1140cagcaaatca acgttatcaa caacgccatc aacgaaagct ctccgcatta tctgggtaat 1140
tcctacgacc tgtttttcaa aggccacccg ggcggtggca ttatcaacac cctgatcatg 1200tcctacgacc tgtttttcaa aggccacccg ggcggtggca ttatcaacac cctgatcatg 1200
cagaattatc cgtcaatggt cgatattccg tccaaaatct catttgaagt gctgatgatg 1260cagaattatc cgtcaatggt cgatattccg tccaaaatct catttgaagt gctgatgatg 1260
accgacatgc tgccggatgc cgtggcaggt attgcgagtt ccctgtactt cacgatcccg 1320accgacatgc tgccggatgc cgtggcaggt attgcgagtt ccctgtactt cacgatcccg 1320
gccgaaaaaa tcaaattcat cgttttcacc tctacggaaa ccattacgga tcgtgaaacc 1380gccgaaaaaa tcaaattcat cgttttcacc tctacggaaa ccattacgga tcgtgaaacc 1380
gccctgcgta gtccgctggt ccaggtgatg attaaactgg gcatcgtgaa agaagaaaat 1440gccctgcgta gtccgctggt ccaggtgatg attaaactgg gcatcgtgaa agaagaaaat 1440
gtgctgttct gggcggacct gccgaattgc gaaacgggtg tctgtattgc tgtctga 1497gtgctgttct gggcggacct gccgaattgc gaaacgggtg tctgtattgc tgtctga 1497
<210> 30<210> 30
<211> 1449<211> 1449
<212> ДНК<212> DNA
<213> Photobacterium leiognathi<213> Photobacterium leiognathi
<400> 30<400> 30
atgaacgata atcaaaatac ggtggacgtg gtggtctcaa ccgtcaacga taacgtgatc 60atgaacgata atcaaaatac ggtggacgtg gtggtctcaa ccgtcaacga taacgtgatc 60
gaaaacaaca cgtaccaagt caaaccgatc gataccccga ccacgttcga ctcatactcg 120gaaaacaaca cgtaccaagt caaaccgatc gataccccga ccacgttcga ctcatactcg 120
tggattcaga cgtgcggcac cccgatcctg aaagatgacg aaaaatatag cctgtctttt 180tggattcaga cgtgcggcac cccgatcctg aaagatgacg aaaaatatag cctgtctttt 180
gatttcgttg ccccggaact ggatcaagac gaaaaattct gtttcgaatt taccggcgat 240gatttcgttg ccccggaact ggatcaagac gaaaaattct gtttcgaatt taccggcgat 240
gtggatggta aacgttatgt gacgcagacc aacctgacgg tggttgcacc gaccctggaa 300gtggatggta aacgttatgt gacgcagacc aacctgacgg tggttgcacc gaccctggaa 300
gtttacgtcg atcatgcttc actgccgtcg ctgcagcaac tgatgaaaat catccagcag 360gtttacgtcg atcatgcttc actgccgtcg ctgcagcaac tgatgaaaat catccagcag 360
aaaaacgaat acagccagaa tgaacgcttt atttcttggg gccgtatccg cctgacggaa 420aaaaacgaat acagccagaa tgaacgcttt atttcttggg gccgtatccg cctgacggaa 420
gataacgcgg aaaaactgaa tgcccatatt tatccgctgg caggcaacaa taccagccag 480gataacgcgg aaaaactgaa tgcccatatt tatccgctgg caggcaacaa taccagccag 480
gaactggtgg acgcagttat cgattacgct gactctaaaa accgtctgaa tctggaactg 540gaactggtgg acgcagttat cgattacgct gactctaaaa accgtctgaa tctggaactg 540
aacacgaata ccggccacag tttccgtaac attgcgccga tcctgcgcgc caccagctct 600aacacgaata ccggccacag tttccgtaac attgcgccga tcctgcgcgc caccagctct 600
aaaaacaaca tcctgatctc caacatcaac ctgtacgatg acggtagtgc tgaatatgtg 660aaaaacaaca tcctgatctc caacatcaac ctgtacgatg acggtagtgc tgaatatgtg 660
tccctgtaca actggaaaga taccgacaat aaatcacaga aactgagtga ttcctttctg 720tccctgtaca actggaaaga taccgacaat aaatcacaga aactgagtga ttcctttctg 720
gttctgaaag actacctgaa tggcatcagt tccgaaaaac cgaacggtat ttatagcatc 780gttctgaaag actacctgaa tggcatcagt tccgaaaaac cgaacggtat ttatagcatc 780
tacaattggc atcagctgta tcactcatcg tattacttcc tgcgtaaaga ttacctgacg 840tacaattggc atcagctgta tcactcatcg tattacttcc tgcgtaaaga ttacctgacg 840
gtggaaacca aactgcacga cctgcgcgaa tatctgggcg gttcactgaa acaaatgtcg 900gtggaaacca aactgcacga cctgcgcgaa tatctgggcg gttcactgaa acaaatgtcg 900
tgggatacct ttagccagct gtctaaaggc gacaaagaac tgttcctgaa cattgttggt 960tgggatacct ttagccagct gtctaaaggc gacaaagaac tgttcctgaa cattgttggt 960
tttgatcagg aaaaactgca gcaagaatac cagcaaagcg aactgccgaa tttcgtcttt 1020tttgatcagg aaaaactgca gcaagaatac cagcaaagcg aactgccgaa tttcgtcttt 1020
acgggcacca cgacctgggc aggcggtgaa accaaagaat attacgctca gcaacaggtg 1080acgggcacca cgacctgggc aggcggtgaa accaaagaat attacgctca gcaacaggtg 1080
aacgtcgtga acaatgcgat taatgaaacc tctccgtatt acctgggccg tgaacatgac 1140aacgtcgtga acaatgcgat taatgaaacc tctccgtatt acctgggccg tgaacatgac 1140
ctgtttttca aaggtcaccc gcgcggcggt attatcaatg atattatcct gggctcattc 1200ctgtttttca aaggtcaccc gcgcggcggt attatcaatg atattatcct gggctcattc 1200
aacaatatga ttgacatccc ggccaaagtt tcgtttgaag tcctgatgat gacgggtatg 1260aacaatatga ttgacatccc ggccaaagtt tcgtttgaag tcctgatgat gacgggtatg 1260
ctgccggata ccgttggcgg tattgcgagc agcctgtatt ttagtatccc ggccgaaaaa 1320ctgccggata ccgttggcgg tattgcgagc agcctgtatt ttagtatccc ggccgaaaaa 1320
gtgtccttca ttgtttttac cagttccgat acgatcaccg atcgcgaaga cgcgctgaaa 1380gtgtccttca ttgtttttac cagttccgat acgatcaccg atcgcgaaga cgcgctgaaa 1380
agtccgctgg tccaagtgat gatgaccctg ggcattgtga aagaaaaaga tgtgctgttc 1440agtccgctgg tccaagtgat gatgaccctg ggcattgtga aagaaaaaga tgtgctgttc 1440
tggtgctaa 1449tggtgctaa 1449
<210> 31<210> 31
<211> 2028<211> 2028
<212> ДНК<212> DNA
<213> Photobacterium damsela<213> Photobacterium damsela
<400> 31<400> 31
atgaaaaaga tcctgaccgt cctgagcatc tttatcctga gcgcctgtaa tagcgacaac 60atgaaaaaga tcctgaccgt cctgagcatc tttatcctga gcgcctgtaa tagcgacaac 60
acctctctga aagaaaccgt ctccagcaac agcgcggatg tggttgaaac ggaaacctat 120acctctctga aagaaaccgt ctccagcaac agcgcggatg tggttgaaac ggaaacctat 120
cagctgaccc cgattgacgc cccgagcagc tttctgagcc attcttggga acagacgtgc 180cagctgaccc cgattgacgc cccgagcagc tttctgagcc attcttggga agacgtgc 180
ggcaccccga tcctgaatga aagtgataaa caagcgattt cctttgactt cgtggccccg 240ggcaccccga tcctgaatga aagtgataaa caagcgattt cctttgactt cgtggccccg 240
gaactgaaac aggatgaaaa atactgtttc acgttcaaag gcatcaccgg tgaccaccgc 300gaactgaaac aggatgaaaa atactgtttc acgttcaaag gcatcaccgg tgaccaccgc 300
tacattacga acaccaccct gaccgttgtg gcaccgacgc tggaagtgta tatcgatcat 360tacattacga acaccaccct gaccgttgtg gcaccgacgc tggaagtgta tatcgatcat 360
gctagtctgc cgagcctgca acaactgatt cacattatcc aggcgaaaga tgaatacccg 420gctagtctgc cgagcctgca acaactgatt cacattatcc aggcgaaaga tgaatacccg 420
tcaaaccaac gctttgtttc gtggaaacgt gttaccgtcg atgcggacaa cgccaataaa 480tcaaaccaac gctttgtttc gtggaaacgt gttaccgtcg atgcggacaa cgccaataaa 480
ctgaatattc atacctatcc gctgaaaggc aacaatacgt caccggaaat ggttgcggcc 540ctgaatattc atacctatcc gctgaaaggc aacaatacgt caccggaaat ggttgcggcc 540
atcgatgaat atgcacaatc gaaaaaccgc ctgaatattg aattttacac gaataccgct 600atcgatgaat atgcacaatc gaaaaaccgc ctgaatattg aattttacac gaataccgct 600
catgtcttca acaatctgcc gccgattatc cagccgctgt acaacaacga aaaagtcaaa 660catgtcttca acaatctgcc gccgattatc cagccgctgt acaacaacga aaaagtcaaa 660
atttcacaca tctcgctgta cgatgacggt agttccgaat atgtgagtct gtaccagtgg 720atttcacaca tctcgctgta cgatgacggt agttccgaat atgtgagtct gtaccagtgg 720
aaagataccc cgaacaaaat tgaaacgctg gaaggcgaag tgagcctgct ggcaaattat 780aaagataccc cgaacaaaat tgaaacgctg gaaggcgaag tgagcctgct ggcaaattat 780
ctggctggca ccagcccgga tgcaccgaaa ggcatgggta accgttataa ttggcataaa 840ctggctggca ccagcccgga tgcaccgaaa ggcatgggta accgttataa ttggcataaa 840
ctgtacgata ccgactatta ctttctgcgc gaagattatc tggacgtgga agcgaacctg 900ctgtacgata ccgactatta ctttctgcgc gaagattatc tggacgtgga agcgaacctg 900
cacgatctgc gtgactacct gggttcatcg gcaaaacaga tgccgtggga tgaatttgct 960cacgatctgc gtgactacct gggttcatcg gcaaaacaga tgccgtggga tgaatttgct 960
aaactgagtg actcccagca aaccctgttt ctggatatcg ttggcttcga caaagaacag 1020aaactgagtg actcccagca aaccctgttt ctggatatcg ttggcttcga caaagaacag 1020
ctgcaacaac agtattcaca atcgccgctg ccgaatttta tttttaccgg caccaccacc 1080ctgcaacaac agtattcaca atcgccgctg ccgaatttta tttttaccgg caccaccacc 1080
tgggcgggcg gtgaaacgaa agaatattac gcccaacagc aagtgaacgt tattaacaat 1140tgggcgggcg gtgaaacgaa agaatattac gcccaacagc aagtgaacgt tattaacaat 1140
gccatcaatg aaaccagccc gtattacctg ggcaaagatt acgacctgtt tttcaaaggt 1200gccatcaatg aaaccagccc gtattacctg ggcaaagatt acgacctgtt tttcaaaggt 1200
catccggcag gcggtgtgat caacgatatt atcctgggca gttttccgga catgattaat 1260catccggcag gcggtgtgat caacgatatt atcctgggca gttttccgga catgattaat 1260
atcccggcta aaatttcctt cgaagtgctg atgatgaccg atatgctgcc ggacacggtt 1320atcccggcta aaatttcctt cgaagtgctg atgatgaccg atatgctgcc ggacacggtt 1320
gcaggtatcg ctagctctct gtattttacc attccggcgg ataaagtgaa ctttatcgtt 1380gcaggtatcg ctagctctct gtattttacc attccggcgg ataaagtgaa ctttatcgtt 1380
ttcacgagtt ccgatacgat taccgaccgt gaagaagccc tgaaaagccc gctggtccag 1440ttcacgagtt ccgatacgat taccgaccgt gaagaagccc tgaaaagccc gctggtccag 1440
gtgatgctga ccctgggcat cgtcaaagaa aaagatgtgc tgttctgggc agaccacaaa 1500gtgatgctga ccctgggcat cgtcaaagaa aaagatgtgc tgttctgggc agaccacaaa 1500
gttaatagca tggaagtcgc gattgatgaa gcctgcaccc gcattatcgc aaaacgtcag 1560gttaatagca tggaagtcgc gattgatgaa gcctgcaccc gcattatcgc aaaacgtcag 1560
ccgacggctt ctgatctgcg cctggtgatt gcgattatca aaacgatcac cgatctggaa 1620ccgacggctt ctgatctgcg cctggtgatt gcgattatca aaacgatcac cgatctggaa 1620
cgtattggcg acgttgccga atctattgcg aaagtcgcgc tggaatcttt ttctaacaaa 1680cgtattggcg acgttgccga atctattgcg aaagtcgcgc tggaatcttt ttctaacaaa 1680
cagtacaatc tgctggttag cctggaatct ctgggtcaac ataccgtgcg catgctgcac 1740cagtacaatc tgctggttag cctggaatct ctgggtcaac ataccgtgcg catgctgcac 1740
gaagttctgg atgcattcgc tcgtatggac gtcaaagcag ctatcgaagt gtatcaggaa 1800gaagttctgg atgcattcgc tcgtatggac gtcaaagcag ctatcgaagt gtatcaggaa 1800
gatgaccgca tcgatcaaga atacgaaagt attgtccgtc agctgatggc ccacatgatg 1860gatgaccgca tcgatcaaga atacgaaagt attgtccgtc agctgatggc ccacatgatg 1860
gaagatccgt catcgattcc gaacgttatg aaagtcatgt gggcggcccg ttccatcgaa 1920gaagatccgt catcgattcc gaacgttatg aaagtcatgt gggcggcccg ttccatcgaa 1920
cgcgttggtg atcgttgcca gaatatttgt gaatacatca tctacttcgt gaaaggcaaa 1980cgcgttggtg atcgttgcca gaatatttgt gaatacatca tctacttcgt gaaaggcaaa 1980
gatgttcgcc acaccaaacc ggatgacttc ggtacgatgc tggactaa 2028gatgttcgcc acaccaaacc ggatgacttc ggtacgatgc tggactaa 2028
<210> 32<210> 32
<211> 1533<211> 1533
<212> ДНК<212> DNA
<213> Photobacterium damsela<213> Photobacterium damsela
<400> 32<400> 32
atgaaaaaga tcctgaccgt cctgagcatc tttatcctga gcgcctgtaa tagcgacaac 60atgaaaaaga tcctgaccgt cctgagcatc tttatcctga gcgcctgtaa tagcgacaac 60
acctctctga aagaaaccgt ctccagcaac agcgcggatg tggttgaaac ggaaacctat 120acctctctga aagaaaccgt ctccagcaac agcgcggatg tggttgaaac ggaaacctat 120
cagctgaccc cgattgacgc cccgagcagc tttctgagcc attcttggga acagacgtgc 180cagctgaccc cgattgacgc cccgagcagc tttctgagcc attcttggga agacgtgc 180
ggcaccccga tcctgaatga aagtgataaa caagcgattt cctttgactt cgtggccccg 240ggcaccccga tcctgaatga aagtgataaa caagcgattt cctttgactt cgtggccccg 240
gaactgaaac aggatgaaaa atactgtttc acgttcaaag gcatcaccgg tgaccaccgc 300gaactgaaac aggatgaaaa atactgtttc acgttcaaag gcatcaccgg tgaccaccgc 300
tacattacga acaccaccct gaccgttgtg gcaccgacgc tggaagtgta tatcgatcat 360tacattacga acaccaccct gaccgttgtg gcaccgacgc tggaagtgta tatcgatcat 360
gctagtctgc cgagcctgca acaactgatt cacattatcc aggcgaaaga tgaatacccg 420gctagtctgc cgagcctgca acaactgatt cacattatcc aggcgaaaga tgaatacccg 420
tcaaaccaac gctttgtttc gtggaaacgt gttaccgtcg atgcggacaa cgccaataaa 480tcaaaccaac gctttgtttc gtggaaacgt gttaccgtcg atgcggacaa cgccaataaa 480
ctgaatattc atacctatcc gctgaaaggc aacaatacgt caccggaaat ggttgcggcc 540ctgaatattc atacctatcc gctgaaaggc aacaatacgt caccggaaat ggttgcggcc 540
atcgatgaat atgcacaatc gaaaaaccgc ctgaatattg aattttacac gaataccgct 600atcgatgaat atgcacaatc gaaaaaccgc ctgaatattg aattttacac gaataccgct 600
catgtcttca acaatctgcc gccgattatc cagccgctgt acaacaacga aaaagtcaaa 660catgtcttca acaatctgcc gccgattatc cagccgctgt acaacaacga aaaagtcaaa 660
atttcacaca tctcgctgta cgatgacggt agttccgaat atgtgagtct gtaccagtgg 720atttcacaca tctcgctgta cgatgacggt agttccgaat atgtgagtct gtaccagtgg 720
aaagataccc cgaacaaaat tgaaacgctg gaaggcgaag tgagcctgct ggcaaattat 780aaagataccc cgaacaaaat tgaaacgctg gaaggcgaag tgagcctgct ggcaaattat 780
ctggctggca ccagcccgga tgcaccgaaa ggcatgggta accgttataa ttggcataaa 840ctggctggca ccagcccgga tgcaccgaaa ggcatgggta accgttataa ttggcataaa 840
ctgtacgata ccgactatta ctttctgcgc gaagattatc tggacgtgga agcgaacctg 900ctgtacgata ccgactatta ctttctgcgc gaagattatc tggacgtgga agcgaacctg 900
cacgatctgc gtgactacct gggttcatcg gcaaaacaga tgccgtggga tgaatttgct 960cacgatctgc gtgactacct gggttcatcg gcaaaacaga tgccgtggga tgaatttgct 960
aaactgagtg actcccagca aaccctgttt ctggatatcg ttggcttcga caaagaacag 1020aaactgagtg actcccagca aaccctgttt ctggatatcg ttggcttcga caaagaacag 1020
ctgcaacaac agtattcaca atcgccgctg ccgaatttta tttttaccgg caccaccacc 1080ctgcaacaac agtattcaca atcgccgctg ccgaatttta tttttaccgg caccaccacc 1080
tgggcgggcg gtgaaacgaa agaatattac gcccaacagc aagtgaacgt tattaacaat 1140tgggcgggcg gtgaaacgaa agaatattac gcccaacagc aagtgaacgt tattaacaat 1140
gccatcaatg aaaccagccc gtattacctg ggcaaagatt acgacctgtt tttcaaaggt 1200gccatcaatg aaaccagccc gtattacctg ggcaaagatt acgacctgtt tttcaaaggt 1200
catccggcag gcggtgtgat caacgatatt atcctgggca gttttccgga catgattaat 1260catccggcag gcggtgtgat caacgatatt atcctgggca gttttccgga catgattaat 1260
atcccggcta aaatttcctt cgaagtgctg atgatgaccg atatgctgcc ggacacggtt 1320atcccggcta aaatttcctt cgaagtgctg atgatgaccg atatgctgcc ggacacggtt 1320
gcaggtatcg ctagctctct gtattttacc attccggcgg ataaagtgaa ctttatcgtt 1380gcaggtatcg ctagctctct gtattttacc attccggcgg ataaagtgaa ctttatcgtt 1380
ttcacgagtt ccgatacgat taccgaccgt gaagaagccc tgaaaagccc gctggtccag 1440ttcacgagtt ccgatacgat taccgaccgt gaagaagccc tgaaaagccc gctggtccag 1440
gtgatgctga ccctgggcat cgtcaaagaa aaagatgtgc tgttctgggc agacctgccg 1500gtgatgctga ccctgggcat cgtcaaagaa aaagatgtgc tgttctgggc agacctgccg 1500
gactgctcgt ctggtgtgtg tatcgacaaa taa 1533gactgctcgt ctggtgtgtg tatcgacaaa taa 1533
<210> 33<210> 33
<211> 1269<211> 1269
<212> ДНК<212> DNA
<213> Heliobacter acinonychis<213> Heliobacter acinonychis
<400> 33<400> 33
atggggacca ttaaaaagcc cttaatcata gcaggaaatg gtccatcaat taaggaccta 60atggggacca ttaaaaagcc cttaatcata gcaggaaatg gtccatcaat taaggaccta 60
gactatgctt tatttccaaa agacttcgat gtctttcgct gcaaccagtt ttacttcgag 120gactatgctt tatttccaaa agacttcgat gtctttcgct gcaaccagtt ttacttcgag 120
gataaatatt acctaggacg cgaaataaaa ggagtgttct ttaacccttg tgtattaagc 180gataaatatt acctaggacg cgaaataaaa ggagtgttct ttaacccttg tgtattaagc 180
agtcaaatgc aaacagtgca ataccttatg gacaatggcg aatatagcat agaacgcttc 240agtcaaatgc aaacagtgca ataccttatg gacaatggcg aatatagcat agaacgcttc 240
ttttgcagtg tttcaacaga tcgccacgat tttgatgggg attaccaaac gattttaccg 300ttttgcagtg tttcaacaga tcgccacgat tttgatgggg attaccaaac gattttaccg 300
gtagacggtt atttaaaagc acactatccg ttcgtctgcg atacattcag cttattcaaa 360gtagacggtt atttaaaagc acactatccg ttcgtctgcg atacattcag cttattcaaa 360
ggtcacgaag aaatcttaaa acacgtgaaa taccacctga aaacgtacag caaagaactt 420ggtcacgaag aaatcttaaa acacgtgaaa taccacctga aaacgtacag caaagaactt 420
agtgcgggtg tcttaatgtt attgagtgca gtggtattag gatacaaaga aatataccta 480agtgcgggtg tcttaatgtt attgagtgca gtggtattag gatacaaaga aatataccta 480
gtaggaatcg acttcggcgc ctcatcttgg gggcacttct atgacgaaag ccaatcccaa 540gtaggaatcg acttcggcgc ctcatcttgg gggcacttct atgacgaaag ccaatcccaa 540
cactttagca atcacatggc agattgtcac aatatctatt acgacatgct gactatttgt 600cactttagca atcacatggc agattgtcac aatatctatt acgacatgct gactatttgt 600
ctctgtcaaa agtatgcaaa attgtacgca ttagcaccca attcaccatt atcacatttg 660ctctgtcaaa agtatgcaaa attgtacgca ttagcaccca attcaccatt atcacatttg 660
cttacactaa atccacaggc caaataccca tttgaactat tagataaacc tatcgggtat 720cttacactaa atccacaggc caaataccca tttgaactat tagataaacc tatcgggtat 720
actagcgacc taattattag tagcccgttg gaagagaagt tgctcgaatt taagaatatc 780actagcgacc taattattag tagcccgttg gaagagaagt tgctcgaatt taagaatatc 780
gaagagaagt tgcttgagtt caaaaacata gaagagaaac tcttagagtt caagaatatt 840gaagagaagt tgcttgagtt caaaaacata gaagagaaac tcttagagtt caagaatatt 840
gaagagaaac tattagaatt taaaaacatc gaggaaaaac ttttggagtt caaaaatata 900gaagagaaac tattagaatt taaaaacatc gaggaaaaac ttttggagtt caaaaatata 900
gaagagaaac tcctagagtt caagaacatt gaggaaaagt tgcttgagtt caaaaatatt 960gaagagaaac tcctagagtt caagaacatt gaggaaaagt tgcttgagtt caaaaatatt 960
gaggaaaagt tgctcgaatt taagaatatc gaggaaaaac ttttggaatt taagaacata 1020gaggaaaagt tgctcgaatt taagaatatc gaggaaaaac ttttggaatt taagaacata 1020
gaagaaaagt tactcgaatt taaaaacatt gaagagaaac tattggaatt taaaaatata 1080gaagaaaagt tactcgaatt taaaaacatt gaagagaaac tattggaatt taaaaatata 1080
gaggaaaagt tacttgagtt caaaaacata gaggaaaagt tacttgaatt taagaacata 1140gaggaaaagt tacttgagtt caaaaacata gaggaaaagt tacttgaatt taagaacata 1140
gaagagaaac ttctcgcaag ccgactgaac aacattctac gtaaaatcaa gcggaaaata 1200gaagagaaac ttctcgcaag ccgactgaac aacattctac gtaaaatcaa gcggaaaata 1200
cttccattct tttggggcgg aggtgtaacc ccaacattaa aagttagttt ccgttgggga 1260cttccattct tttggggcgg aggtgtaacc ccaacattaa aagttagttt ccgttgggga 1260
gctgcataa 1269gctgcataa 1269
<210> 34<210> 34
<211> 469<211> 469
<212> ПРТ<212> PRT
<213> Campylobacter coli<213> Campylobacter coli
<400> 34<400> 34
Met Gln Asn Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Gln Ser IleMet Gln Asn Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Gln Ser Ile
1 5 10 151 5 10 15
Asn Tyr Gln Arg Leu Pro Lys Glu Tyr Asp Ile Phe Arg Cys Asn GlnAsn Tyr Gln Arg Leu Pro Lys Glu Tyr Asp Ile Phe Arg Cys Asn Gln
20 25 3020 25 30
Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Asn Ile Lys Ala AlaPhe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Asn Ile Lys Ala Ala
35 40 4535 40 45
Phe Phe Asn Pro Tyr Pro Phe Leu Gln Gln Tyr His Thr Ala Lys GlnPhe Phe Asn Pro Tyr Pro Phe Leu Gln Gln Tyr His Thr Ala Lys Gln
50 55 6050 55 60
Leu Val Phe Asn Asn Glu Tyr Lys Ile Glu Asn Ile Phe Cys Ser ThrLeu Val Phe Asn Asn Glu Tyr Lys Ile Glu Asn Ile Phe Cys Ser Thr
65 70 75 8065 70 75 80
Phe Asn Leu Pro Phe Ile Glu Lys Asp Asn Phe Ile Asn Lys Phe TyrPhe Asn Leu Pro Phe Ile Glu Lys Asp Asn Phe Ile Asn Lys Phe Tyr
85 90 9585 90 95
Asp Phe Phe Pro Asp Ala Lys Leu Gly His Lys Ile Ile Glu Asn LeuAsp Phe Phe Pro Asp Ala Lys Leu Gly His Lys Ile Ile Glu Asn Leu
100 105 110100 105 110
Lys Glu Phe Tyr Ala Tyr Ile Lys Tyr Asn Glu Ile Tyr Leu Asn LysLys Glu Phe Tyr Ala Tyr Ile Lys Tyr Asn Glu Ile Tyr Leu Asn Lys
115 120 125115 120 125
Arg Ile Thr Ser Gly Ile Tyr Met Cys Ala Ile Ala Ile Ala Leu GlyArg Ile Thr Ser Gly Ile Tyr Met Cys Ala Ile Ala Ile Ala Leu Gly
130 135 140130 135 140
Tyr Lys Asn Ile Tyr Leu Cys Gly Ile Asp Phe Tyr Glu Gly Glu ThrTyr Lys Asn Ile Tyr Leu Cys Gly Ile Asp Phe Tyr Glu Gly Glu Thr
145 150 155 160145 150 155 160
Ile Tyr Pro Phe Lys Ala Met Ser Lys Asn Ile Lys Lys Ile Phe ProIle Tyr Pro Phe Lys Ala Met Ser Lys Asn Ile Lys Lys Ile Phe Pro
165 170 175165 170 175
Trp Ile Lys Asp Phe Asn Pro Ser Asn Phe His Ser Lys Glu Tyr AspTrp Ile Lys Asp Phe Asn Pro Ser Asn Phe His Ser Lys Glu Tyr Asp
180 185 190180 185 190
Ile Glu Ile Leu Lys Leu Leu Glu Ser Ile Tyr Lys Val Asn Ile TyrIle Glu Ile Leu Lys Leu Leu Glu Ser Ile Tyr Lys Val Asn Ile Tyr
195 200 205195 200 205
Ala Leu Cys Asp Asn Ser Ala Leu Ala Asn Tyr Phe Pro Leu Leu ValAla Leu Cys Asp Asn Ser Ala Leu Ala Asn Tyr Phe Pro Leu Leu Val
210 215 220210 215 220
Asn Thr Asp Asn Ser Phe Val Leu Glu Asn Lys Ser Asp Asp Cys IleAsn Thr Asp Asn Ser Phe Val Leu Glu Asn Lys Ser Asp Asp Cys Ile
225 230 235 240225 230 235 240
Asn Asp Ile Leu Leu Thr Asn Asn Thr Pro Gly Ile Asn Phe Tyr LysAsn Asp Ile Leu Leu Thr Asn Asn Thr Pro Gly Ile Asn Phe Tyr Lys
245 250 255245 250 255
Ser Gln Ile Gln Val Asn Asn Thr Glu Ile Leu Leu Leu Asn Phe GlnSer Gln Ile Gln Val Asn Asn Thr Glu Ile Leu Leu Leu Asn Phe Gln
260 265 270260 265 270
Asn Met Ile Ser Ala Lys Glu Asn Glu Ile Ser Asn Leu Asn Lys IleAsn Met Ile Ser Ala Lys Glu Asn Glu Ile Ser Asn Leu Asn Lys Ile
275 280 285275 280 285
Leu Gln Asp Ser Tyr Lys Thr Ile Asn Thr Lys Glu Asn Glu Ile SerLeu Gln Asp Ser Tyr Lys Thr Ile Asn Thr Lys Glu Asn Glu Ile Ser
290 295 300290 295 300
Asn Leu Asn Lys Ile Leu Gln Asp Ser Tyr Lys Thr Ile Asn Thr LysAsn Leu Asn Lys Ile Leu Gln Asp Ser Tyr Lys Thr Ile Asn Thr Lys
305 310 315 320305 310 315 320
Glu Asn Glu Ile Ser Asn Leu Asn Lys Ile Leu Gln Asp Lys Asp LysGlu Asn Glu Ile Ser Asn Leu Asn Lys Ile Leu Gln Asp Lys Asp Lys
325 330 335325 330 335
Leu Leu Ile Val Lys Glu Asn Leu Leu Asn Phe Lys Ser Arg His GlyLeu Leu Ile Val Lys Glu Asn Leu Leu Asn Phe Lys Ser Arg His Gly
340 345 350340 345 350
Lys Ala Lys Phe Arg Ile Gln Asn Gln Leu Ser Tyr Lys Leu Gly GlnLys Ala Lys Phe Arg Ile Gln Asn Gln Leu Ser Tyr Lys Leu Gly Gln
355 360 365355 360 365
Ala Met Met Val Asn Ser Lys Ser Leu Leu Gly Tyr Ile Arg Met ProAla Met Met Val Asn Ser Lys Ser Leu Leu Gly Tyr Ile Arg Met Pro
370 375 380370 375 380
Phe Val Leu Ser Tyr Ile Lys Asp Lys His Lys Gln Glu Gln Lys IlePhe Val Leu Ser Tyr Ile Lys Asp Lys His Lys Gln Glu Gln Lys Ile
385 390 395 400385 390 395 400
Tyr Gln Glu Lys Ile Lys Lys Asp Pro Ser Leu Thr Leu Pro Pro LeuTyr Gln Glu Lys Ile Lys Lys Asp Pro Ser Leu Thr Leu Pro Pro Leu
405 410 415405 410 415
Glu Asp Tyr Pro Asp Tyr Lys Glu Ala Leu Lys Glu Lys Glu Cys LeuGlu Asp Tyr Pro Asp Tyr Lys Glu Ala Leu Lys Glu Lys Glu Cys Leu
420 425 430420 425 430
Thr Tyr Arg Leu Gly Gln Thr Leu Ile Lys Ala Asp Gln Glu Trp TyrThr Tyr Arg Leu Gly Gln Thr Leu Ile Lys Ala Asp Gln Glu Trp Tyr
435 440 445435 440 445
Lys Gly Gly Tyr Val Lys Met Trp Phe Glu Ile Lys Lys Leu Lys LysLys Gly Gly Tyr Val Lys Met Trp Phe Glu Ile Lys Lys Leu Lys Lys
450 455 460450 455 460
Glu Tyr Lys Lys LysGlu Tyr Lys Lys Lys
465465
<210> 35<210> 35
<211> 381<211> 381
<212> ПРТ<212> PRT
<213> Vibrio sp.<213> Vibrio sp.
<400> 35<400> 35
Met Asn Asn Asp Asn Ser Thr Thr Thr Asn Asn Asn Ala Ile Glu IleMet Asn Asn Asp Asn Ser Thr Thr Thr Asn Asn Asn Ala Ile Glu Ile
1 5 10 151 5 10 15
Tyr Val Asp Arg Ala Thr Leu Pro Thr Ile Gln Gln Met Thr Lys IleTyr Val Asp Arg Ala Thr Leu Pro Thr Ile Gln Gln Met Thr Lys Ile
20 25 3020 25 30
Val Ser Gln Lys Thr Ser Asn Lys Lys Leu Ile Ser Trp Ser Arg TyrVal Ser Gln Lys Thr Ser Asn Lys Lys Leu Ile Ser Trp Ser Arg Tyr
35 40 4535 40 45
Pro Ile Thr Asp Lys Ser Leu Leu Lys Lys Ile Asn Ala Glu Phe PhePro Ile Thr Asp Lys Ser Leu Leu Lys Lys Ile Asn Ala Glu Phe Phe
50 55 6050 55 60
Lys Glu Gln Phe Glu Leu Thr Glu Ser Leu Lys Asn Ile Ile Leu SerLys Glu Gln Phe Glu Leu Thr Glu Ser Leu Lys Asn Ile Ile Leu Ser
65 70 75 8065 70 75 80
Glu Asn Ile Asp Asn Leu Ile Ile His Gly Asn Thr Leu Trp Ser IleGlu Asn Ile Asp Asn Leu Ile Ile His Gly Asn Thr Leu Trp Ser Ile
85 90 9585 90 95
Asp Val Val Asp Ile Ile Lys Glu Val Asn Leu Leu Gly Lys Asn IleAsp Val Val Asp Ile Ile Lys Glu Val Asn Leu Leu Gly Lys Asn Ile
100 105 110100 105 110
Pro Ile Glu Leu His Phe Tyr Asp Asp Gly Ser Ala Glu Tyr Val ArgPro Ile Glu Leu His Phe Tyr Asp Asp Gly Ser Ala Glu Tyr Val Arg
115 120 125115 120 125
Ile Tyr Glu Phe Ser Lys Leu Pro Glu Ser Glu Gln Lys Tyr Lys ThrIle Tyr Glu Phe Ser Lys Leu Pro Glu Ser Glu Gln Lys Tyr Lys Thr
130 135 140130 135 140
Ser Leu Ser Lys Asn Asn Ile Lys Phe Ser Ile Asp Gly Thr Asp SerSer Leu Ser Lys Asn Asn Ile Lys Phe Ser Ile Asp Gly Thr Asp Ser
145 150 155 160145 150 155 160
Phe Lys Asn Thr Ile Glu Asn Ile Tyr Gly Phe Ser Gln Leu Tyr ProPhe Lys Asn Thr Ile Glu Asn Ile Tyr Gly Phe Ser Gln Leu Tyr Pro
165 170 175165 170 175
Thr Thr Tyr His Met Leu Arg Ala Asp Ile Phe Asp Thr Thr Leu LysThr Thr Tyr His Met Leu Arg Ala Asp Ile Phe Asp Thr Thr Leu Lys
180 185 190180 185 190
Ile Asn Pro Leu Arg Glu Leu Leu Ser Asn Asn Ile Lys Gln Met LysIle Asn Pro Leu Arg Glu Leu Leu Ser Asn Asn Ile Lys Gln Met Lys
195 200 205195 200 205
Trp Asp Tyr Phe Lys Asp Phe Asn Tyr Lys Gln Lys Asp Ile Phe TyrTrp Asp Tyr Phe Lys Asp Phe Asn Tyr Lys Gln Lys Asp Ile Phe Tyr
210 215 220210 215 220
Ser Leu Thr Asn Phe Asn Pro Lys Glu Ile Gln Glu Asp Phe Asn LysSer Leu Thr Asn Phe Asn Pro Lys Glu Ile Gln Glu Asp Phe Asn Lys
225 230 235 240225 230 235 240
Asn Ser Asn Lys Asn Phe Ile Phe Ile Gly Ser Asn Ser Ala Thr AlaAsn Ser Asn Lys Asn Phe Ile Phe Ile Gly Ser Asn Ser Ala Thr Ala
245 250 255245 250 255
Thr Ala Glu Glu Gln Ile Asn Ile Ile Ser Glu Ala Lys Lys Glu AsnThr Ala Glu Glu Gln Ile Asn Ile Ile Ser Glu Ala Lys Lys Glu Asn
260 265 270260 265 270
Ser Ser Ile Ile Thr Asn Ser Ile Ser Asp Tyr Asp Leu Phe Phe LysSer Ser Ile Ile Thr Asn Ser Ile Ser Asp Tyr Asp Leu Phe Phe Lys
275 280 285275 280 285
Gly His Pro Ser Ala Thr Phe Asn Glu Gln Ile Ile Asn Ala His AspGly His Pro Ser Ala Thr Phe Asn Glu Gln Ile Ile Asn Ala His Asp
290 295 300290 295 300
Met Ile Glu Ile Asn Asn Lys Ile Pro Phe Glu Ala Leu Ile Met ThrMet Ile Glu Ile Asn Asn Lys Ile Pro Phe Glu Ala Leu Ile Met Thr
305 310 315 320305 310 315 320
Gly Ile Leu Pro Asp Ala Val Gly Gly Met Gly Ser Ser Val Phe PheGly Ile Leu Pro Asp Ala Val Gly Gly Met Gly Ser Ser Val Phe Phe
325 330 335325 330 335
Ser Ile Pro Lys Glu Val Lys Asn Lys Phe Val Phe Tyr Lys Ser GlySer Ile Pro Lys Glu Val Lys Asn Lys Phe Val Phe Tyr Lys Ser Gly
340 345 350340 345 350
Thr Asp Ile Glu Asn Asn Ser Leu Ile Gln Val Met Leu Lys Leu AsnThr Asp Ile Glu Asn Asn Ser Leu Ile Gln Val Met Leu Lys Leu Asn
355 360 365355 360 365
Leu Ile Asn Arg Asp Asn Ile Lys Leu Ile Ser Asp IleLeu Ile Asn Arg Asp Asn Ile Lys Leu Ile Ser Asp Ile
370 375 380370 375 380
<210> 36<210> 36
<211> 390<211> 390
<212> ПРТ<212> PRT
<213> Photobacterium sp.<213> Photobacterium sp.
<400> 36<400> 36
Met Gly Cys Asn Ser Asp Ser Asn His Asn Asn Ser Asp Gly Asn IleMet Gly Cys Asn Ser Asp Ser Asn His Asn Asn Ser Asp Gly Asn Ile
1 5 10 151 5 10 15
Thr Lys Asn Lys Thr Ile Glu Val Tyr Val Asp Arg Ala Thr Leu ProThr Lys Asn Lys Thr Ile Glu Val Tyr Val Asp Arg Ala Thr Leu Pro
20 25 3020 25 30
Thr Ile Gln Gln Met Thr Gln Ile Ile Asn Glu Asn Ser Asn Asn LysThr Ile Gln Gln Met Thr Gln Ile Ile Asn Glu Asn Ser Asn Asn Lys
35 40 4535 40 45
Lys Leu Ile Ser Trp Ser Arg Tyr Pro Ile Asn Asp Glu Glu Leu LeuLys Leu Ile Ser Trp Ser Arg Tyr Pro Ile Asn Asp Glu Glu Leu Leu
50 55 6050 55 60
Glu Ser Ile Asn Gly Ser Phe Phe Lys Asn Asn Ser Glu Leu Ile LysGlu Ser Ile Asn Gly Ser Phe Phe Lys Asn Asn Ser Glu Leu Ile Lys
65 70 75 8065 70 75 80
Ser Leu Asp Ser Met Ile Leu Thr Asn Asp Ile Lys Lys Val Ile IleSer Leu Asp Ser Met Ile Leu Thr Asn Asp Ile Lys Lys Val Ile Ile
85 90 9585 90 95
Asn Gly Asn Thr Leu Trp Ala Ala Asp Val Val Asn Ile Ile Lys SerAsn Gly Asn Thr Leu Trp Ala Ala Asp Val Val Asn Ile Ile Lys Ser
100 105 110100 105 110
Ile Glu Ala Phe Gly Lys Lys Thr Glu Ile Glu Leu Asn Phe Tyr AspIle Glu Ala Phe Gly Lys Lys Thr Glu Ile Glu Leu Asn Phe Tyr Asp
115 120 125115 120 125
Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr Asp Phe Ser Lys Leu ProAsp Gly Ser Ala Glu Tyr Val Arg Leu Tyr Asp Phe Ser Lys Leu Pro
130 135 140130 135 140
Glu Ser Glu Gln Glu Tyr Lys Ile Ser Leu Ser Lys Asp Asn Ile LeuGlu Ser Glu Gln Glu Tyr Lys Ile Ser Leu Ser Lys Asp Asn Ile Leu
145 150 155 160145 150 155 160
Ser Ser Ile Asn Gly Thr Gln Pro Phe Glu Asn Val Val Glu Asn IleSer Ser Ile Asn Gly Thr Gln Pro Phe Glu Asn Val Val Glu Asn Ile
165 170 175165 170 175
Tyr Gly Phe Ser Gln Leu Tyr Pro Thr Thr Tyr His Met Leu Arg AlaTyr Gly Phe Ser Gln Leu Tyr Pro Thr Thr Tyr His Met Leu Arg Ala
180 185 190180 185 190
Asp Ile Phe Glu Thr Asn Leu Pro Leu Arg Ser Leu Lys Gly Val LeuAsp Ile Phe Glu Thr Asn Leu Pro Leu Arg Ser Leu Lys Gly Val Leu
195 200 205195 200 205
Ser Asn Asn Ile Lys Gln Met Lys Trp Asp Tyr Phe Lys Thr Phe AsnSer Asn Asn Ile Lys Gln Met Lys Trp Asp Tyr Phe Lys Thr Phe Asn
210 215 220210 215 220
Ser Gln Gln Lys Asp Lys Phe Tyr Asn Phe Thr Gly Phe Asn Pro AspSer Gln Gln Lys Asp Lys Phe Tyr Asn Phe Thr Gly Phe Asn Pro Asp
225 230 235 240225 230 235 240
Glu Ile Met Glu Gln Tyr Lys Ala Ser Pro Asn Lys Asn Phe Ile PheGlu Ile Met Glu Gln Tyr Lys Ala Ser Pro Asn Lys Asn Phe Ile Phe
245 250 255245 250 255
Val Gly Thr Asn Ser Gly Thr Ala Thr Ala Glu Gln Gln Ile Asp IleVal Gly Thr Asn Ser Gly Thr Ala Thr Ala Glu Gln Gln Ile Asp Ile
260 265 270260 265 270
Leu Thr Glu Ala Lys Asn Pro Asn Ser Pro Ile Ile Thr Lys Ser IleLeu Thr Glu Ala Lys Asn Pro Asn Ser Pro Ile Ile Thr Lys Ser Ile
275 280 285275 280 285
Gln Gly Phe Asp Leu Phe Phe Lys Gly His Pro Ser Ala Thr Tyr AsnGln Gly Phe Asp Leu Phe Phe Lys Gly His Pro Ser Ala Thr Tyr Asn
290 295 300290 295 300
Lys Gln Ile Ile Asp Ala His Asn Met Ile Glu Ile Tyr Asn Lys IleLys Gln Ile Ile Asp Ala His Asn Met Ile Glu Ile Tyr Asn Lys Ile
305 310 315 320305 310 315 320
Pro Phe Glu Ala Leu Ile Met Thr Asp Ala Leu Pro Asp Ala Val GlyPro Phe Glu Ala Leu Ile Met Thr Asp Ala Leu Pro Asp Ala Val Gly
325 330 335325 330 335
Gly Met Gly Ser Ser Val Phe Phe Ser Leu Pro Asn Thr Val Glu AsnGly Met Gly Ser Ser Val Phe Phe Ser Leu Pro Asn Thr Val Glu Asn
340 345 350340 345 350
Lys Phe Ile Phe Tyr Lys Ser Asp Thr Asp Ile Glu Asn Asn Ala LeuLys Phe Ile Phe Tyr Lys Ser Asp Thr Asp Ile Glu Asn Asn Ala Leu
355 360 365355 360 365
Ile Gln Val Met Ile Glu Leu Asn Ile Val Asn Arg Asn Asp Val LysIle Gln Val Met Ile Glu Leu Asn Ile Val Asn Arg Asn Asp Val Lys
370 375 380370 375 380
Leu Ile Ser Asp Leu GlnLeu Ile Ser Asp Leu Gln
385 390385 390
<210> 37<210> 37
<211> 388<211> 388
<212> ПРТ<212> PRT
<213> Pasteurella multocida<213> Pasteurella multocida
<400> 37<400> 37
Met Lys Thr Ile Thr Leu Tyr Leu Asp Pro Ala Ser Leu Pro Ala LeuMet Lys Thr Ile Thr Leu Tyr Leu Asp Pro Ala Ser Leu Pro Ala Leu
1 5 10 151 5 10 15
Asn Gln Leu Met Asp Phe Thr Gln Asn Asn Glu Asp Lys Thr His ProAsn Gln Leu Met Asp Phe Thr Gln Asn Asn Glu Asp Lys Thr His Pro
20 25 3020 25 30
Arg Ile Phe Gly Leu Ser Arg Phe Lys Ile Pro Asp Asn Ile Ile ThrArg Ile Phe Gly Leu Ser Arg Phe Lys Ile Pro Asp Asn Ile Ile Thr
35 40 4535 40 45
Gln Tyr Gln Asn Ile His Phe Val Glu Leu Lys Asp Asn Arg Pro ThrGln Tyr Gln Asn Ile His Phe Val Glu Leu Lys Asp Asn Arg Pro Thr
50 55 6050 55 60
Glu Ala Leu Phe Thr Ile Leu Asp Gln Tyr Pro Gly Asn Ile Glu LeuGlu Ala Leu Phe Thr Ile Leu Asp Gln Tyr Pro Gly Asn Ile Glu Leu
65 70 75 8065 70 75 80
Asp Ile His Leu Asn Ile Ala His Ser Val Gln Leu Ile Arg Pro IleAsp Ile His Leu Asn Ile Ala His Ser Val Gln Leu Ile Arg Pro Ile
85 90 9585 90 95
Leu Ala Tyr Arg Phe Lys His Leu Asp Arg Val Ser Ile Gln Arg LeuLeu Ala Tyr Arg Phe Lys His Leu Asp Arg Val Ser Ile Gln Arg Leu
100 105 110100 105 110
Asn Leu Tyr Asp Asp Gly Ser Met Glu Tyr Val Asp Leu Glu Lys GluAsn Leu Tyr Asp Asp Gly Ser Met Glu Tyr Val Asp Leu Glu Lys Glu
115 120 125115 120 125
Glu Asn Lys Asp Ile Ser Ala Glu Ile Lys Gln Ala Glu Lys Gln LeuGlu Asn Lys Asp Ile Ser Ala Glu Ile Lys Gln Ala Glu Lys Gln Leu
130 135 140130 135 140
Ser His Tyr Leu Leu Thr Gly Lys Ile Lys Phe Asp Asn Pro Thr IleSer His Tyr Leu Leu Thr Gly Lys Ile Lys Phe Asp Asn Pro Thr Ile
145 150 155 160145 150 155 160
Ala Arg Tyr Val Trp Gln Ser Ala Phe Pro Val Lys Tyr His Phe LeuAla Arg Tyr Val Trp Gln Ser Ala Phe Pro Val Lys Tyr His Phe Leu
165 170 175165 170 175
Ser Thr Asp Tyr Phe Glu Lys Ala Glu Phe Leu Gln Pro Leu Lys GluSer Thr Asp Tyr Phe Glu Lys Ala Glu Phe Leu Gln Pro Leu Lys Glu
180 185 190180 185 190
Tyr Leu Ala Glu Asn Tyr Gln Lys Met Asp Trp Thr Ala Tyr Gln GlnTyr Leu Ala Glu Asn Tyr Gln Lys Met Asp Trp Thr Ala Tyr Gln Gln
195 200 205195 200 205
Leu Thr Pro Glu Gln Gln Ala Phe Tyr Leu Thr Leu Val Gly Phe AsnLeu Thr Pro Glu Gln Gln Ala Phe Tyr Leu Thr Leu Val Gly Phe Asn
210 215 220210 215 220
Asp Glu Val Lys Gln Ser Leu Glu Val Gln Gln Ala Lys Phe Ile PheAsp Glu Val Lys Gln Ser Leu Glu Val Gln Gln Ala Lys Phe Ile Phe
225 230 235 240225 230 235 240
Thr Gly Thr Thr Thr Trp Glu Gly Asn Thr Asp Val Arg Glu Tyr TyrThr Gly Thr Thr Thr Trp Glu Gly Asn Thr Asp Val Arg Glu Tyr Tyr
245 250 255245 250 255
Ala Gln Gln Gln Leu Asn Leu Leu Asn His Phe Thr Gln Ala Gly GlyAla Gln Gln Gln Leu Asn Leu Leu Asn His Phe Thr Gln Ala Gly Gly
260 265 270260 265 270
Asp Leu Phe Ile Gly Asp His Tyr Lys Ile Tyr Phe Lys Gly His ProAsp Leu Phe Ile Gly Asp His Tyr Lys Ile Tyr Phe Lys Gly His Pro
275 280 285275 280 285
Arg Gly Gly Glu Ile Asn Asp Tyr Ile Leu Asn Asn Ala Lys Asn IleArg Gly Gly Glu Ile Asn Asp Tyr Ile Leu Asn Asn Ala Lys Asn Ile
290 295 300290 295 300
Thr Asn Ile Pro Ala Asn Ile Ser Phe Glu Val Leu Met Met Thr GlyThr Asn Ile Pro Ala Asn Ile Ser Phe Glu Val Leu Met Met Thr Gly
305 310 315 320305 310 315 320
Leu Leu Pro Asp Lys Val Gly Gly Val Ala Ser Ser Leu Tyr Phe SerLeu Leu Pro Asp Lys Val Gly Gly Val Ala Ser Ser Leu Tyr Phe Ser
325 330 335325 330 335
Leu Pro Lys Glu Lys Ile Ser His Ile Ile Phe Thr Ser Asn Lys GlnLeu Pro Lys Glu Lys Ile Ser His Ile Ile Phe Thr Ser Asn Lys Gln
340 345 350340 345 350
Val Lys Ser Lys Glu Asp Ala Leu Asn Asn Pro Tyr Val Lys Val MetVal Lys Ser Lys Glu Asp Ala Leu Asn Asn Pro Tyr Val Lys Val Met
355 360 365355 360 365
Arg Arg Leu Gly Ile Ile Asp Glu Ser Gln Val Ile Phe Trp Asp SerArg Arg Leu Gly Ile Ile Asp Glu Ser Gln Val Ile Phe Trp Asp Ser
370 375 380370 375 380
Leu Lys Gln LeuLeu Lys Gln Leu
385385
<210> 38<210> 38
<211> 371<211> 371
<212> ПРТ<212> PRT
<213> Neisseria meningitidis<213> Neisseria meningitidis
<400> 38<400> 38
Met Gly Leu Lys Lys Ala Cys Leu Thr Val Leu Cys Leu Ile Val PheMet Gly Leu Lys Lys Ala Cys Leu Thr Val Leu Cys Leu Ile Val Phe
1 5 10 151 5 10 15
Cys Phe Gly Ile Phe Tyr Thr Phe Asp Arg Val Asn Gln Gly Glu ArgCys Phe Gly Ile Phe Tyr Thr Phe Asp Arg Val Asn Gln Gly Glu Arg
20 25 3020 25 30
Asn Ala Val Ser Leu Leu Lys Glu Lys Leu Phe Asn Glu Glu Gly GluAsn Ala Val Ser Leu Leu Lys Glu Lys Leu Phe Asn Glu Glu Gly Glu
35 40 4535 40 45
Pro Val Asn Leu Ile Phe Cys Tyr Thr Ile Leu Gln Met Lys Val AlaPro Val Asn Leu Ile Phe Cys Tyr Thr Ile Leu Gln Met Lys Val Ala
50 55 6050 55 60
Glu Arg Ile Met Ala Gln His Pro Gly Glu Arg Phe Tyr Val Val LeuGlu Arg Ile Met Ala Gln His Pro Gly Glu Arg Phe Tyr Val Val Leu
65 70 75 8065 70 75 80
Met Ser Glu Asn Arg Asn Glu Lys Tyr Asp Tyr Tyr Phe Asn Gln IleMet Ser Glu Asn Arg Asn Glu Lys Tyr Asp Tyr Tyr Phe Asn Gln Ile
85 90 9585 90 95
Lys Asp Lys Ala Glu Arg Ala Tyr Phe Phe His Leu Pro Tyr Gly LeuLys Asp Lys Ala Glu Arg Ala Tyr Phe Phe His Leu Pro Tyr Gly Leu
100 105 110100 105 110
Asn Lys Ser Phe Asn Phe Ile Pro Thr Met Ala Glu Leu Lys Val LysAsn Lys Ser Phe Asn Phe Ile Pro Thr Met Ala Glu Leu Lys Val Lys
115 120 125115 120 125
Ser Met Leu Leu Pro Lys Val Lys Arg Ile Tyr Leu Ala Ser Leu GluSer Met Leu Leu Pro Lys Val Lys Arg Ile Tyr Leu Ala Ser Leu Glu
130 135 140130 135 140
Lys Val Ser Ile Ala Ala Phe Leu Ser Thr Tyr Pro Asp Ala Glu IleLys Val Ser Ile Ala Ala Phe Leu Ser Thr Tyr Pro Asp Ala Glu Ile
145 150 155 160145 150 155 160
Lys Thr Phe Asp Asp Gly Thr Gly Asn Leu Ile Gln Ser Ser Ser TyrLys Thr Phe Asp Asp Gly Thr Gly Asn Leu Ile Gln Ser Ser Ser Tyr
165 170 175165 170 175
Leu Gly Asp Glu Phe Ser Val Asn Gly Thr Ile Lys Arg Asn Phe AlaLeu Gly Asp Glu Phe Ser Val Asn Gly Thr Ile Lys Arg Asn Phe Ala
180 185 190180 185 190
Arg Met Met Ile Gly Asp Trp Ser Ile Ala Lys Thr Arg Asn Ala SerArg Met Met Ile Gly Asp Trp Ser Ile Ala Lys Thr Arg Asn Ala Ser
195 200 205195 200 205
Asp Glu His Tyr Thr Ile Phe Lys Gly Leu Lys Asn Ile Met Asp AspAsp Glu His Tyr Thr Ile Phe Lys Gly Leu Lys Asn Ile Met Asp Asp
210 215 220210 215 220
Gly Arg Arg Lys Met Thr Tyr Leu Pro Leu Phe Asp Ala Ser Glu LeuGly Arg Arg Lys Met Thr Tyr Leu Pro Leu Phe Asp Ala Ser Glu Leu
225 230 235 240225 230 235 240
Lys Thr Gly Asp Glu Thr Gly Gly Thr Val Arg Ile Leu Leu Gly SerLys Thr Gly Asp Glu Thr Gly Gly Thr Val Arg Ile Leu Leu Gly Ser
245 250 255245 250 255
Pro Asp Lys Glu Met Lys Glu Ile Ser Glu Lys Ala Ala Lys Asn PhePro Asp Lys Glu Met Lys Glu Ile Ser Glu Lys Ala Ala Lys Asn Phe
260 265 270260 265 270
Lys Ile Gln Tyr Val Ala Pro His Pro Arg Gln Thr Tyr Gly Leu SerLys Ile Gln Tyr Val Ala Pro His Pro Arg Gln Thr Tyr Gly Leu Ser
275 280 285275 280 285
Gly Val Thr Thr Leu Asn Ser Pro Tyr Val Ile Glu Asp Tyr Ile LeuGly Val Thr Thr Leu Asn Ser Pro Tyr Val Ile Glu Asp Tyr Ile Leu
290 295 300290 295 300
Arg Glu Ile Lys Lys Asn Pro His Thr Arg Tyr Glu Ile Tyr Thr PheArg Glu Ile Lys Lys Asn Pro His Thr Arg Tyr Glu Ile Tyr Thr Phe
305 310 315 320305 310 315 320
Phe Ser Gly Ala Ala Leu Thr Met Lys Asp Phe Pro Asn Val His ValPhe Ser Gly Ala Ala Leu Thr Met Lys Asp Phe Pro Asn Val His Val
325 330 335325 330 335
Tyr Ala Leu Lys Pro Ala Ser Leu Pro Glu Asp Tyr Trp Leu Lys ProTyr Ala Leu Lys Pro Ala Ser Leu Pro Glu Asp Tyr Trp Leu Lys Pro
340 345 350340 345 350
Val Tyr Ala Leu Phe Thr Gln Ser Gly Ile Pro Ile Leu Thr Phe AspVal Tyr Ala Leu Phe Thr Gln Ser Gly Ile Pro Ile Leu Thr Phe Asp
355 360 365355 360 365
Asp Lys AsnAsp Lys Asn
370370
<210> 39<210> 39
<211> 283<211> 283
<212> ПРТ<212> PRT
<213> Pasteurella multocida<213> Pasteurella multocida
<400> 39<400> 39
Met Asp Lys Phe Ala Glu His Glu Ile Pro Lys Ala Val Ile Val AlaMet Asp Lys Phe Ala Glu His Glu Ile Pro Lys Ala Val Ile Val Ala
1 5 10 151 5 10 15
Gly Asn Gly Glu Ser Leu Ser Gln Ile Asp Tyr Arg Leu Leu Pro LysGly Asn Gly Glu Ser Leu Ser Gln Ile Asp Tyr Arg Leu Leu Pro Lys
20 25 3020 25 30
Asn Tyr Asp Val Phe Arg Cys Asn Gln Phe Tyr Phe Glu Glu Arg TyrAsn Tyr Asp Val Phe Arg Cys Asn Gln Phe Tyr Phe Glu Glu Arg Tyr
35 40 4535 40 45
Phe Leu Gly Asn Lys Ile Lys Ala Val Phe Phe Thr Pro Gly Val PhePhe Leu Gly Asn Lys Ile Lys Ala Val Phe Phe Thr Pro Gly Val Phe
50 55 6050 55 60
Leu Glu Gln Tyr Tyr Thr Leu Tyr His Leu Lys Arg Asn Asn Glu TyrLeu Glu Gln Tyr Tyr Thr Leu Tyr His Leu Lys Arg Asn Asn Glu Tyr
65 70 75 8065 70 75 80
Phe Val Asp Asn Val Ile Leu Ser Ser Phe Asn His Pro Thr Val AspPhe Val Asp Asn Val Ile Leu Ser Ser Phe Asn His Pro Thr Val Asp
85 90 9585 90 95
Leu Glu Lys Ser Gln Lys Ile Gln Ala Leu Phe Ile Asp Val Ile AsnLeu Glu Lys Ser Gln Lys Ile Gln Ala Leu Phe Ile Asp Val Ile Asn
100 105 110100 105 110
Gly Tyr Glu Lys Tyr Leu Ser Lys Leu Thr Ala Phe Asp Val Tyr LeuGly Tyr Glu Lys Tyr Leu Ser Lys Leu Thr Ala Phe Asp Val Tyr Leu
115 120 125115 120 125
Arg Tyr Lys Glu Leu Tyr Glu Asn Gln Arg Ile Thr Ser Gly Val TyrArg Tyr Lys Glu Leu Tyr Glu Asn Gln Arg Ile Thr Ser Gly Val Tyr
130 135 140130 135 140
Met Cys Ala Val Ala Ile Ala Met Gly Tyr Thr Asp Ile Tyr Leu ThrMet Cys Ala Val Ala Ile Ala Met Gly Tyr Thr Asp Ile Tyr Leu Thr
145 150 155 160145 150 155 160
Gly Ile Asp Phe Tyr Gln Ala Ser Glu Glu Asn Tyr Ala Phe Asp AsnGly Ile Asp Phe Tyr Gln Ala Ser Glu Glu Asn Tyr Ala Phe Asp Asn
165 170 175165 170 175
Lys Lys Pro Asn Ile Ile Arg Leu Leu Pro Asp Phe Arg Lys Glu LysLys Lys Pro Asn Ile Ile Arg Leu Leu Pro Asp Phe Arg Lys Glu Lys
180 185 190180 185 190
Thr Leu Phe Ser Tyr His Ser Lys Asp Ile Asp Leu Glu Ala Leu SerThr Leu Phe Ser Tyr His Ser Lys Asp Ile Asp Leu Glu Ala Leu Ser
195 200 205195 200 205
Phe Leu Gln Gln His Tyr His Val Asn Phe Tyr Ser Ile Ser Pro MetPhe Leu Gln Gln His Tyr His Val Asn Phe Tyr Ser Ile Ser Pro Met
210 215 220210 215 220
Ser Pro Leu Ser Lys His Phe Pro Ile Pro Thr Val Glu Asp Asp CysSer Pro Leu Ser Lys His Phe Pro Ile Pro Thr Val Glu Asp Asp Cys
225 230 235 240225 230 235 240
Glu Thr Thr Phe Val Ala Pro Leu Lys Glu Asn Tyr Ile Asn Asp IleGlu Thr Thr Phe Val Ala Pro Leu Lys Glu Asn Tyr Ile Asn Asp Ile
245 250 255245 250 255
Leu Leu Pro Pro His Phe Val Tyr Glu Lys Leu Gly Val Asp Lys LeuLeu Leu Pro Pro His Phe Val Tyr Glu Lys Leu Gly Val Asp Lys Leu
260 265 270260 265 270
Ala Ala Ala Leu Glu His His His His His HisAla Ala Ala Leu Glu His His His His His
275 280275 280
<210> 40<210> 40
<211> 385<211> 385
<212> ПРТ<212> PRT
<213> Pasteurella dagmatis<213> Pasteurella dagmatis
<400> 40<400> 40
Met Thr Ile Tyr Leu Asp Pro Ala Ser Leu Pro Thr Leu Asn Gln LeuMet Thr Ile Tyr Leu Asp Pro Ala Ser Leu Pro Thr Leu Asn Gln Leu
1 5 10 151 5 10 15
Met His Phe Thr Lys Glu Ser Glu Asp Lys Glu Thr Ala Arg Ile PheMet His Phe Thr Lys Glu Ser Glu Asp Lys Glu Thr Ala Arg Ile Phe
20 25 3020 25 30
Gly Phe Ser Arg Phe Lys Leu Pro Glu Lys Ile Thr Glu Gln Tyr AsnGly Phe Ser Arg Phe Lys Leu Pro Glu Lys Ile Thr Glu Gln Tyr Asn
35 40 4535 40 45
Asn Ile His Phe Val Glu Ile Lys Asn Asn Arg Pro Thr Glu Asp IleAsn Ile His Phe Val Glu Ile Lys Asn Asn Arg Pro Thr Glu Asp Ile
50 55 6050 55 60
Phe Thr Ile Leu Asp Gln Tyr Pro Glu Lys Leu Glu Leu Asp Leu HisPhe Thr Ile Leu Asp Gln Tyr Pro Glu Lys Leu Glu Leu Asp Leu His
65 70 75 8065 70 75 80
Leu Asn Ile Ala His Ser Ile Gln Leu Phe His Pro Ile Leu Gln TyrLeu Asn Ile Ala His Ser Ile Gln Leu Phe His Pro Ile Leu Gln Tyr
85 90 9585 90 95
Arg Phe Lys His Pro Asp Arg Ile Ser Ile Lys Ser Leu Asn Leu TyrArg Phe Lys His Pro Asp Arg Ile Ser Ile Lys Ser Leu Asn Leu Tyr
100 105 110100 105 110
Asp Asp Gly Thr Met Glu Tyr Val Asp Leu Glu Lys Glu Glu Asn LysAsp Asp Gly Thr Met Glu Tyr Val Asp Leu Glu Lys Glu Glu Asn Lys
115 120 125115 120 125
Asp Ile Lys Ser Ala Ile Lys Lys Ala Glu Lys Gln Leu Ser Asp TyrAsp Ile Lys Ser Ala Ile Lys Lys Ala Glu Lys Gln Leu Ser Asp Tyr
130 135 140130 135 140
Leu Leu Thr Gly Lys Ile Asn Phe Asp Asn Pro Thr Leu Ala Arg TyrLeu Leu Thr Gly Lys Ile Asn Phe Asp Asn Pro Thr Leu Ala Arg Tyr
145 150 155 160145 150 155 160
Val Trp Gln Ser Gln Tyr Pro Val Lys Tyr His Phe Leu Ser Thr GluVal Trp Gln Ser Gln Tyr Pro Val Lys Tyr His Phe Leu Ser Thr Glu
165 170 175165 170 175
Tyr Phe Glu Lys Ala Glu Phe Leu Gln Pro Leu Lys Thr Tyr Leu AlaTyr Phe Glu Lys Ala Glu Phe Leu Gln Pro Leu Lys Thr Tyr Leu Ala
180 185 190180 185 190
Gly Lys Tyr Gln Lys Met Asp Trp Ser Ala Tyr Glu Lys Leu Ser ProGly Lys Tyr Gln Lys Met Asp Trp Ser Ala Tyr Glu Lys Leu Ser Pro
195 200 205195 200 205
Glu Gln Gln Thr Phe Tyr Leu Lys Leu Val Gly Phe Ser Asp Glu ThrGlu Gln Gln Thr Phe Tyr Leu Lys Leu Val Gly Phe Ser Asp Glu Thr
210 215 220210 215 220
Lys Gln Leu Phe His Thr Glu Gln Thr Lys Phe Ile Phe Thr Gly ThrLys Gln Leu Phe His Thr Glu Gln Thr Lys Phe Ile Phe Thr Gly Thr
225 230 235 240225 230 235 240
Thr Thr Trp Glu Gly Asn Thr Asp Ile Arg Glu Tyr Tyr Ala Lys GlnThr Thr Trp Glu Gly Asn Thr Asp Ile Arg Glu Tyr Tyr Ala Lys Gln
245 250 255245 250 255
Gln Leu Asn Leu Leu Lys His Phe Thr His Ser Glu Gly Asp Leu PheGln Leu Asn Leu Leu Lys His Phe Thr His Ser Glu Gly Asp Leu Phe
260 265 270260 265 270
Ile Gly Asp Gln Tyr Lys Ile Tyr Phe Lys Gly His Pro Arg Gly GlyIle Gly Asp Gln Tyr Lys Ile Tyr Phe Lys Gly His Pro Arg Gly Gly
275 280 285275 280 285
Asp Ile Asn Asp Tyr Ile Leu Lys His Ala Lys Asp Ile Thr Asn IleAsp Ile Asn Asp Tyr Ile Leu Lys His Ala Lys Asp Ile Thr Asn Ile
290 295 300290 295 300
Pro Ala Asn Ile Ser Phe Glu Ile Leu Met Met Thr Gly Leu Leu ProPro Ala Asn Ile Ser Phe Glu Ile Leu Met Met Thr Gly Leu Leu Pro
305 310 315 320305 310 315 320
Asp Lys Val Gly Gly Val Ala Ser Ser Leu Tyr Phe Ser Leu Pro LysAsp Lys Val Gly Gly Val Ala Ser Ser Leu Tyr Phe Ser Leu Pro Lys
325 330 335325 330 335
Glu Lys Ile Ser His Ile Ile Phe Thr Ser Asn Lys Lys Ile Lys AsnGlu Lys Ile Ser His Ile Ile Phe Thr Ser Asn Lys Lys Ile Lys Asn
340 345 350340 345 350
Lys Glu Asp Ala Leu Asn Asp Pro Tyr Val Arg Val Met Leu Arg LeuLys Glu Asp Ala Leu Asn Asp Pro Tyr Val Arg Val Met Leu Arg Leu
355 360 365355 360 365
Gly Met Ile Asp Lys Ser Gln Ile Ile Phe Trp Asp Ser Leu Lys GlnGly Met Ile Asp Lys Ser Gln Ile Ile Phe Trp Asp Ser Leu Lys Gln
370 375 380370 375 380
LeuLeu
385385
<210> 41<210> 41
<211> 390<211> 390
<212> ПРТ<212> PRT
<213> Photobacterium phosphoreum<213> Photobacterium phosphoreum
<400> 41<400> 41
Met Gly Cys Asn Ser Asp Ser Lys His Asn Asn Ser Asp Gly Asn IleMet Gly Cys Asn Ser Asp Ser Lys His Asn Asn Ser Asp Gly Asn Ile
1 5 10 151 5 10 15
Thr Lys Asn Lys Thr Ile Glu Val Tyr Val Asp Arg Ala Thr Leu ProThr Lys Asn Lys Thr Ile Glu Val Tyr Val Asp Arg Ala Thr Leu Pro
20 25 3020 25 30
Thr Ile Gln Gln Met Thr Gln Ile Ile Asn Glu Asn Ser Asn Asn LysThr Ile Gln Gln Met Thr Gln Ile Ile Asn Glu Asn Ser Asn Asn Lys
35 40 4535 40 45
Lys Leu Ile Ser Trp Ser Arg Tyr Pro Ile Asn Asp Glu Thr Leu LeuLys Leu Ile Ser Trp Ser Arg Tyr Pro Ile Asn Asp Glu Thr Leu Leu
50 55 6050 55 60
Glu Ser Ile Asn Gly Ser Phe Phe Lys Asn Arg Pro Glu Leu Ile LysGlu Ser Ile Asn Gly Ser Phe Phe Lys Asn Arg Pro Glu Leu Ile Lys
65 70 75 8065 70 75 80
Ser Leu Asp Ser Met Ile Leu Thr Asn Glu Ile Lys Lys Val Ile IleSer Leu Asp Ser Met Ile Leu Thr Asn Glu Ile Lys Lys Val Ile Ile
85 90 9585 90 95
Asn Gly Asn Thr Leu Trp Ala Val Asp Val Val Asn Ile Ile Lys SerAsn Gly Asn Thr Leu Trp Ala Val Asp Val Val Asn Ile Ile Lys Ser
100 105 110100 105 110
Ile Glu Ala Leu Gly Lys Lys Thr Glu Ile Glu Leu Asn Phe Tyr AspIle Glu Ala Leu Gly Lys Lys Thr Glu Ile Glu Leu Asn Phe Tyr Asp
115 120 125115 120 125
Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr Asp Phe Ser Arg Leu ProAsp Gly Ser Ala Glu Tyr Val Arg Leu Tyr Asp Phe Ser Arg Leu Pro
130 135 140130 135 140
Glu Ser Glu Gln Glu Tyr Lys Ile Ser Leu Ser Lys Asp Asn Ile GlnGlu Ser Glu Gln Glu Tyr Lys Ile Ser Leu Ser Lys Asp Asn Ile Gln
145 150 155 160145 150 155 160
Ser Ser Ile Asn Gly Thr Gln Pro Phe Asp Asn Ser Ile Glu Asn IleSer Ser Ile Asn Gly Thr Gln Pro Phe Asp Asn Ser Ile Glu Asn Ile
165 170 175165 170 175
Tyr Gly Phe Ser Gln Leu Tyr Pro Thr Thr Tyr His Met Leu Arg AlaTyr Gly Phe Ser Gln Leu Tyr Pro Thr Thr Tyr His Met Leu Arg Ala
180 185 190180 185 190
Asp Ile Phe Glu Thr Asn Leu Pro Leu Thr Ser Leu Lys Arg Val IleAsp Ile Phe Glu Thr Asn Leu Pro Leu Thr Ser Leu Lys Arg Val Ile
195 200 205195 200 205
Ser Asn Asn Ile Lys Gln Met Lys Trp Asp Tyr Phe Thr Thr Phe AsnSer Asn Asn Ile Lys Gln Met Lys Trp Asp Tyr Phe Thr Thr Phe Asn
210 215 220210 215 220
Ser Gln Gln Lys Asn Lys Phe Tyr Asn Phe Thr Gly Phe Asn Pro GluSer Gln Gln Lys Asn Lys Phe Tyr Asn Phe Thr Gly Phe Asn Pro Glu
225 230 235 240225 230 235 240
Lys Ile Lys Glu Gln Tyr Lys Ala Ser Pro His Glu Asn Phe Ile PheLys Ile Lys Glu Gln Tyr Lys Ala Ser Pro His Glu Asn Phe Ile Phe
245 250 255245 250 255
Ile Gly Thr Asn Ser Gly Thr Ala Thr Ala Glu Gln Gln Ile Asp IleIle Gly Thr Asn Ser Gly Thr Ala Thr Ala Glu Gln Gln Ile Asp Ile
260 265 270260 265 270
Leu Thr Glu Ala Lys Lys Pro Asp Ser Pro Ile Ile Thr Asn Ser IleLeu Thr Glu Ala Lys Lys Pro Asp Ser Pro Ile Ile Thr Asn Ser Ile
275 280 285275 280 285
Gln Gly Leu Asp Leu Phe Phe Lys Gly His Pro Ser Ala Thr Tyr AsnGln Gly Leu Asp Leu Phe Phe Lys Gly His Pro Ser Ala Thr Tyr Asn
290 295 300290 295 300
Gln Gln Ile Ile Asp Ala His Asn Met Ile Glu Ile Tyr Asn Lys IleGln Gln Ile Ile Asp Ala His Asn Met Ile Glu Ile Tyr Asn Lys Ile
305 310 315 320305 310 315 320
Pro Phe Glu Ala Leu Ile Met Thr Asp Ala Leu Pro Asp Ala Val GlyPro Phe Glu Ala Leu Ile Met Thr Asp Ala Leu Pro Asp Ala Val Gly
325 330 335325 330 335
Gly Met Gly Ser Ser Val Phe Phe Ser Leu Pro Asn Thr Val Glu AsnGly Met Gly Ser Ser Val Phe Phe Ser Leu Pro Asn Thr Val Glu Asn
340 345 350340 345 350
Lys Phe Ile Phe Tyr Lys Ser Asp Thr Asp Ile Glu Asn Asn Ala LeuLys Phe Ile Phe Tyr Lys Ser Asp Thr Asp Ile Glu Asn Asn Ala Leu
355 360 365355 360 365
Ile Gln Val Met Ile Glu Leu Asn Ile Val Asn Arg Asn Asp Val LysIle Gln Val Met Ile Glu Leu Asn Ile Val Asn Arg Asn Asp Val Lys
370 375 380370 375 380
Leu Ile Ser Asp Leu GlnLeu Ile Ser Asp Leu Gln
385 390385 390
<210> 42<210> 42
<211> 417<211> 417
<212> ПРТ<212> PRT
<213> Avibacterium paragallinarum<213> Avibacterium paragallinarum
<400> 42<400> 42
Met Arg Lys Ile Ile Thr Phe Phe Ser Leu Phe Phe Ser Ile Ser AlaMet Arg Lys Ile Ile Thr Phe Phe Ser Leu Phe Phe Ser Ile Ser Ala
1 5 10 151 5 10 15
Trp Cys Gln Lys Met Glu Ile Tyr Leu Asp Tyr Ala Ser Leu Pro SerTrp Cys Gln Lys Met Glu Ile Tyr Leu Asp Tyr Ala Ser Leu Pro Ser
20 25 3020 25 30
Leu Asn Met Ile Leu Asn Leu Val Glu Asn Lys Asn Asn Glu Lys ValLeu Asn Met Ile Leu Asn Leu Val Glu Asn Lys Asn Asn Glu Lys Val
35 40 4535 40 45
Glu Arg Ile Ile Gly Phe Glu Arg Phe Asp Phe Asn Lys Glu Ile LeuGlu Arg Ile Ile Gly Phe Glu Arg Phe Asp Phe Asn Lys Glu Ile Leu
50 55 6050 55 60
Asn Ser Phe Ser Lys Glu Arg Ile Glu Phe Ser Lys Val Ser Ile LeuAsn Ser Phe Ser Lys Glu Arg Ile Glu Phe Ser Lys Val Ser Ile Leu
65 70 75 8065 70 75 80
Asp Ile Lys Glu Phe Ser Asp Lys Leu Tyr Leu Asn Ile Glu Lys SerAsp Ile Lys Glu Phe Ser Asp Lys Leu Tyr Leu Asn Ile Glu Lys Ser
85 90 9585 90 95
Asp Thr Pro Val Asp Leu Ile Ile His Thr Asn Leu Asp His Ser ValAsp Thr Pro Val Asp Leu Ile Ile His Thr Asn Leu Asp His Ser Val
100 105 110100 105 110
Arg Ser Leu Leu Ser Ile Phe Lys Thr Leu Ser Pro Leu Phe His LysArg Ser Leu Leu Ser Ile Phe Lys Thr Leu Ser Pro Leu Phe His Lys
115 120 125115 120 125
Ile Asn Ile Glu Lys Leu Tyr Leu Tyr Asp Asp Gly Ser Gly Asn TyrIle Asn Ile Glu Lys Leu Tyr Leu Tyr Asp Asp Gly Ser Gly Asn Tyr
130 135 140130 135 140
Val Asp Leu Tyr Gln His Arg Gln Glu Asn Ile Ser Ala Ile Leu IleVal Asp Leu Tyr Gln His Arg Gln Glu Asn Ile Ser Ala Ile Leu Ile
145 150 155 160145 150 155 160
Glu Ala Gln Lys Lys Leu Lys Asp Ala Leu Glu Asn Arg Glu Thr AspGlu Ala Gln Lys Lys Leu Lys Asp Ala Leu Glu Asn Arg Glu Thr Asp
165 170 175165 170 175
Thr Asp Lys Leu His Ser Leu Thr Arg Tyr Thr Trp His Lys Ile PheThr Asp Lys Leu His Ser Leu Thr Arg Tyr Thr Trp His Lys Ile Phe
180 185 190180 185 190
Pro Thr Glu Tyr Ile Leu Leu Arg Pro Asp Tyr Leu Asp Ile Asp GluPro Thr Glu Tyr Ile Leu Leu Arg Pro Asp Tyr Leu Asp Ile Asp Glu
195 200 205195 200 205
Lys Met Gln Pro Leu Lys His Phe Leu Ser Asp Thr Ile Val Ser MetLys Met Gln Pro Leu Lys His Phe Leu Ser Asp Thr Ile Val Ser Met
210 215 220210 215 220
Asp Leu Ser Arg Phe Ser His Phe Ser Lys Asn Gln Lys Glu Leu PheAsp Leu Ser Arg Phe Ser His Phe Ser Lys Asn Gln Lys Glu Leu Phe
225 230 235 240225 230 235 240
Leu Lys Ile Thr His Phe Asp Gln Asn Ile Phe Asn Glu Leu Asn IleLeu Lys Ile Thr His Phe Asp Gln Asn Ile Phe Asn Glu Leu Asn Ile
245 250 255245 250 255
Gly Thr Lys Asn Lys Glu Tyr Lys Thr Phe Ile Phe Thr Gly Thr ThrGly Thr Lys Asn Lys Glu Tyr Lys Thr Phe Ile Phe Thr Gly Thr Thr
260 265 270260 265 270
Thr Trp Glu Lys Asp Lys Lys Lys Arg Leu Asn Asn Ala Lys Leu GlnThr Trp Glu Lys Asp Lys Lys Lys Arg Leu Asn Asn Ala Lys Leu Gln
275 280 285275 280 285
Thr Glu Ile Leu Glu Ser Phe Ile Lys Pro Asn Gly Lys Phe Tyr LeuThr Glu Ile Leu Glu Ser Phe Ile Lys Pro Asn Gly Lys Phe Tyr Leu
290 295 300290 295 300
Gly Asn Asp Ile Lys Ile Phe Phe Lys Gly His Pro Lys Gly Asp AspGly Asn Asp Ile Lys Ile Phe Phe Lys Gly His Pro Lys Gly Asp Asp
305 310 315 320305 310 315 320
Ile Asn Asp Tyr Ile Ile Arg Lys Thr Gly Ala Glu Lys Ile Pro AlaIle Asn Asp Tyr Ile Ile Arg Lys Thr Gly Ala Glu Lys Ile Pro Ala
325 330 335325 330 335
Asn Ile Pro Phe Glu Val Leu Met Met Thr Asn Ser Leu Pro Asp TyrAsn Ile Pro Phe Glu Val Leu Met Met Thr Asn Ser Leu Pro Asp Tyr
340 345 350340 345 350
Val Gly Gly Ile Met Ser Thr Val Tyr Phe Ser Leu Pro Pro Lys AsnVal Gly Gly Ile Met Ser Thr Val Tyr Phe Ser Leu Pro Pro Lys Asn
355 360 365355 360 365
Ile Asp Lys Val Val Phe Leu Gly Ser Glu Lys Ile Lys Asn Glu AsnIle Asp Lys Val Val Phe Leu Gly Ser Glu Lys Ile Lys Asn Glu Asn
370 375 380370 375 380
Asp Ala Lys Ser Gln Thr Leu Ser Lys Leu Met Leu Met Leu Asn ValAsp Ala Lys Ser Gln Thr Leu Ser Lys Leu Met Leu Met Leu Asn Val
385 390 395 400385 390 395 400
Ile Thr Pro Glu Gln Ile Phe Phe Glu Glu Met Pro Asn Pro Ile AsnIle Thr Pro Glu Gln Ile Phe Phe Glu Glu Met Pro Asn Pro Ile Asn
405 410 415405 410 415
PhePhe
<210> 43<210> 43
<211> 430<211> 430
<212> ПРТ<212> PRT
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 43<400> 43
Met Thr Arg Thr Arg Met Glu Asn Glu Leu Ile Val Ser Lys Asn MetMet Thr Arg Thr Arg Met Glu Asn Glu Leu Ile Val Ser Lys Asn Met
1 5 10 151 5 10 15
Gln Asn Ile Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Asn Ile AsnGln Asn Ile Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Asn Ile Asn
20 25 3020 25 30
Tyr Lys Arg Leu Pro Arg Glu Tyr Asp Val Phe Arg Cys Asn Gln PheTyr Lys Arg Leu Pro Arg Glu Tyr Asp Val Phe Arg Cys Asn Gln Phe
35 40 4535 40 45
Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Ile Lys Ala Val PheTyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Ile Lys Ala Val Phe
50 55 6050 55 60
Phe Asn Pro Gly Val Phe Leu Gln Gln Tyr His Thr Ala Lys Gln LeuPhe Asn Pro Gly Val Phe Leu Gln Gln Tyr His Thr Ala Lys Gln Leu
65 70 75 8065 70 75 80
Ile Leu Lys Asn Glu Tyr Glu Ile Lys Asn Ile Phe Cys Ser Thr PheIle Leu Lys Asn Glu Tyr Glu Ile Lys Asn Ile Phe Cys Ser Thr Phe
85 90 9585 90 95
Asn Leu Pro Phe Ile Glu Ser Asn Asp Phe Leu His Gln Phe Tyr AsnAsn Leu Pro Phe Ile Glu Ser Asn Asp Phe Leu His Gln Phe Tyr Asn
100 105 110100 105 110
Phe Phe Pro Asp Ala Lys Leu Gly Tyr Glu Val Ile Glu Asn Leu LysPhe Phe Pro Asp Ala Lys Leu Gly Tyr Glu Val Ile Glu Asn Leu Lys
115 120 125115 120 125
Glu Phe Tyr Ala Tyr Ile Lys Tyr Asn Glu Ile Tyr Phe Asn Lys ArgGlu Phe Tyr Ala Tyr Ile Lys Tyr Asn Glu Ile Tyr Phe Asn Lys Arg
130 135 140130 135 140
Ile Thr Ser Gly Val Tyr Met Cys Ala Ile Ala Ile Ala Leu Gly TyrIle Thr Ser Gly Val Tyr Met Cys Ala Ile Ala Ile Ala Leu Gly Tyr
145 150 155 160145 150 155 160
Lys Thr Ile Tyr Leu Cys Gly Ile Asp Phe Tyr Glu Gly Asp Val IleLys Thr Ile Tyr Leu Cys Gly Ile Asp Phe Tyr Glu Gly Asp Val Ile
165 170 175165 170 175
Tyr Pro Phe Glu Ala Met Ser Thr Asn Ile Lys Thr Ile Phe Pro GlyTyr Pro Phe Glu Ala Met Ser Thr Asn Ile Lys Thr Ile Phe Pro Gly
180 185 190180 185 190
Ile Lys Asp Phe Lys Pro Ser Asn Cys His Ser Lys Glu Tyr Asp IleIle Lys Asp Phe Lys Pro Ser Asn Cys His Ser Lys Glu Tyr Asp Ile
195 200 205195 200 205
Glu Ala Leu Lys Leu Leu Lys Ser Ile Tyr Lys Val Asn Ile Tyr AlaGlu Ala Leu Lys Leu Leu Lys Ser Ile Tyr Lys Val Asn Ile Tyr Ala
210 215 220210 215 220
Leu Cys Asp Asp Ser Ile Leu Ala Asn His Phe Pro Leu Ser Ile AsnLeu Cys Asp Asp Ser Ile Leu Ala Asn His Phe Pro Leu Ser Ile Asn
225 230 235 240225 230 235 240
Ile Asn Asn Asn Phe Thr Leu Glu Asn Lys His Asn Asn Ser Ile AsnIle Asn Asn Asn Phe Thr Leu Glu Asn Lys His Asn Asn Ser Ile Asn
245 250 255245 250 255
Asp Ile Leu Leu Thr Asp Asn Thr Pro Gly Val Ser Phe Tyr Lys AsnAsp Ile Leu Leu Thr Asp Asn Thr Pro Gly Val Ser Phe Tyr Lys Asn
260 265 270260 265 270
Gln Leu Lys Ala Asp Asn Lys Ile Met Leu Asn Phe Tyr Asn Ile LeuGln Leu Lys Ala Asp Asn Lys Ile Met Leu Asn Phe Tyr Asn Ile Leu
275 280 285275 280 285
His Ser Lys Asp Asn Leu Ile Lys Phe Leu Asn Lys Glu Ile Ala ValHis Ser Lys Asp Asn Leu Ile Lys Phe Leu Asn Lys Glu Ile Ala Val
290 295 300290 295 300
Leu Lys Lys Gln Thr Thr Gln Arg Ala Lys Ala Arg Ile Gln Asn HisLeu Lys Lys Gln Thr Thr Gln Arg Ala Lys Ala Arg Ile Gln Asn His
305 310 315 320305 310 315 320
Leu Ser Tyr Lys Leu Gly Gln Ala Leu Ile Ile Asn Ser Lys Ser ValLeu Ser Tyr Lys Leu Gly Gln Ala Leu Ile Ile Asn Ser Lys Ser Val
325 330 335325 330 335
Leu Gly Phe Leu Ser Leu Pro Phe Ile Ile Leu Ser Ile Val Ile SerLeu Gly Phe Leu Ser Leu Pro Phe Ile Ile Leu Ser Ile Val Ile Ser
340 345 350340 345 350
His Lys Gln Glu Gln Lys Ala Tyr Lys Phe Lys Val Lys Lys Asn ProHis Lys Gln Glu Gln Lys Ala Tyr Lys Phe Lys Val Lys Lys Asn Pro
355 360 365355 360 365
Asn Leu Ala Leu Pro Pro Leu Glu Thr Tyr Pro Asp Tyr Asn Glu AlaAsn Leu Ala Leu Pro Pro Leu Glu Thr Tyr Pro Asp Tyr Asn Glu Ala
370 375 380370 375 380
Leu Lys Glu Lys Glu Cys Phe Thr Tyr Lys Leu Gly Glu Glu Phe IleLeu Lys Glu Lys Glu Cys Phe Thr Tyr Lys Leu Gly Glu Glu Phe Ile
385 390 395 400385 390 395 400
Lys Ala Gly Lys Asn Trp Tyr Gly Glu Gly Tyr Ile Lys Phe Ile PheLys Ala Gly Lys Asn Trp Tyr Gly Glu Gly Tyr Ile Lys Phe Ile Phe
405 410 415405 410 415
Lys Asp Val Pro Arg Leu Lys Arg Glu Phe Glu Lys Gly GluLys Asp Val Pro Arg Leu Lys Arg Glu Phe Glu Lys Gly Glu
420 425 430420 425 430
<210> 44<210> 44
<211> 395<211> 395
<212> ПРТ<212> PRT
<213> Heliobacter acinonychis<213> Heliobacter acinonychis
<400> 44<400> 44
Met Asn Lys Lys Pro Leu Ile Ile Ala Gly Asn Gly Pro Ser Ile LysMet Asn Lys Lys Pro Leu Ile Ile Ala Gly Asn Gly Pro Ser Ile Lys
1 5 10 151 5 10 15
Asp Leu Asp Tyr Ala Leu Phe Pro Lys Asp Phe Asp Val Phe Arg CysAsp Leu Asp Tyr Ala Leu Phe Pro Lys Asp Phe Asp Val Phe Arg Cys
20 25 3020 25 30
Asn Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Arg Glu Ile LysAsn Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Arg Glu Ile Lys
35 40 4535 40 45
Gly Val Phe Phe Asn Ala His Val Phe Asp Leu Gln Met Lys Ile ThrGly Val Phe Phe Asn Ala His Val Phe Asp Leu Gln Met Lys Ile Thr
50 55 6050 55 60
Lys Ala Ile Val Lys Asn Gly Glu Tyr His Pro Asp His Ile Tyr CysLys Ala Ile Val Lys Asn Gly Glu Tyr His Pro Asp His Ile Tyr Cys
65 70 75 8065 70 75 80
Thr His Val Glu Pro Tyr Gly Tyr Val Asn Gly Asn Gln Gln Leu MetThr His Val Glu Pro Tyr Gly Tyr Val Asn Gly Asn Gln Gln Leu Met
85 90 9585 90 95
Gln Glu Tyr Leu Glu Lys His Phe Val Gly Val Arg Ser Thr Tyr AlaGln Glu Tyr Leu Glu Lys His Phe Val Gly Val Arg Ser Thr Tyr Ala
100 105 110100 105 110
Tyr Leu Lys Asp Leu Glu Pro Phe Phe Ile Leu His Ser Lys Tyr ArgTyr Leu Lys Asp Leu Glu Pro Phe Phe Ile Leu His Ser Lys Tyr Arg
115 120 125115 120 125
Asn Phe Tyr Asp Gln His Phe Thr Thr Gly Ile Met Met Leu Leu ValAsn Phe Tyr Asp Gln His Phe Thr Thr Gly Ile Met Met Leu Leu Val
130 135 140130 135 140
Ala Ile Gln Leu Gly Tyr Lys Glu Ile Tyr Leu Cys Gly Ile Asp PheAla Ile Gln Leu Gly Tyr Lys Glu Ile Tyr Leu Cys Gly Ile Asp Phe
145 150 155 160145 150 155 160
Tyr Glu Asn Gly Phe Gly His Phe Tyr Glu Asn Gln Gly Gly Phe PheTyr Glu Asn Gly Phe Gly His Phe Tyr Glu Asn Gln Gly Gly Phe Phe
165 170 175165 170 175
Glu Glu Asp Ser Asp Pro Met His Asp Lys Asn Ile Asp Ile Gln AlaGlu Glu Asp Ser Asp Pro Met His Asp Lys Asn Ile Asp Ile Gln Ala
180 185 190180 185 190
Leu Glu Leu Ala Lys Lys Tyr Ala Lys Ile Tyr Ala Leu Val Pro AsnLeu Glu Leu Ala Lys Lys Tyr Ala Lys Ile Tyr Ala Leu Val Pro Asn
195 200 205195 200 205
Ser Ala Leu Val Lys Met Ile Pro Leu Ser Ser Gln Lys Gly Val LeuSer Ala Leu Val Lys Met Ile Pro Leu Ser Ser Gln Lys Gly Val Leu
210 215 220210 215 220
Glu Lys Val Lys Asp Arg Ile Gly Leu Gly Glu Phe Lys Arg Glu LysGlu Lys Val Lys Asp Arg Ile Gly Leu Gly Glu Phe Lys Arg Glu Lys
225 230 235 240225 230 235 240
Phe Gly Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln LysPhe Gly Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys
245 250 255245 250 255
Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu ArgGlu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg
260 265 270260 265 270
Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu LeuGln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu
275 280 285275 280 285
Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln LysGlu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys
290 295 300290 295 300
Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu ArgGlu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg
305 310 315 320305 310 315 320
Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu LeuGln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu
325 330 335325 330 335
Glu Leu Glu Arg Ser Leu Lys Ala Arg Leu Lys Ala Val Leu Ala SerGlu Leu Glu Arg Ser Leu Lys Ala Arg Leu Lys Ala Val Leu Ala Ser
340 345 350340 345 350
Lys Gly Ile Arg Gly Asp Asn Leu Ile Ile Val Ser Leu Lys Asp ThrLys Gly Ile Arg Gly Asp Asn Leu Ile Ile Val Ser Leu Lys Asp Thr
355 360 365355 360 365
Tyr Arg Leu Phe Lys Gly Gly Phe Ala Leu Leu Leu Asp Leu Lys AlaTyr Arg Leu Phe Lys Gly Gly Phe Ala Leu Leu Leu Asp Leu Lys Ala
370 375 380370 375 380
Leu Lys Ser Ile Ile Lys Ala Phe Leu Lys ArgLeu Lys Ser Ile Ile Lys Ala Phe Leu Lys Arg
385 390 395385 390 395
<210> 45<210> 45
<211> 260<211> 260
<212> ПРТ<212> PRT
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 45<400> 45
Met Gly Lys Lys Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys GluMet Gly Lys Lys Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Glu
1 5 10 151 5 10 15
Ile Asp Tyr Ser Arg Leu Pro Asn Asp Phe Asp Val Phe Arg Cys AsnIle Asp Tyr Ser Arg Leu Pro Asn Asp Phe Asp Val Phe Arg Cys Asn
20 25 3020 25 30
Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Cys Lys AlaGln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Cys Lys Ala
35 40 4535 40 45
Val Phe Tyr Asn Pro Ile Leu Phe Phe Glu Gln Tyr Tyr Thr Leu LysVal Phe Tyr Asn Pro Ile Leu Phe Phe Glu Gln Tyr Tyr Thr Leu Lys
50 55 6050 55 60
His Leu Ile Gln Asn Gln Glu Tyr Glu Thr Glu Leu Ile Met Cys SerHis Leu Ile Gln Asn Gln Glu Tyr Glu Thr Glu Leu Ile Met Cys Ser
65 70 75 8065 70 75 80
Asn Tyr Asn Gln Ala His Leu Glu Asn Glu Asn Phe Val Lys Thr PheAsn Tyr Asn Gln Ala His Leu Glu Asn Glu Asn Phe Val Lys Thr Phe
85 90 9585 90 95
Tyr Asp Tyr Phe Pro Asp Ala His Leu Gly Tyr Asp Phe Phe Lys GlnTyr Asp Tyr Phe Pro Asp Ala His Leu Gly Tyr Asp Phe Phe Lys Gln
100 105 110100 105 110
Leu Lys Asp Phe Asn Ala Tyr Phe Lys Phe His Glu Ile Tyr Phe AsnLeu Lys Asp Phe Asn Ala Tyr Phe Lys Phe His Glu Ile Tyr Phe Asn
115 120 125115 120 125
Gln Arg Ile Thr Ser Gly Val Tyr Met Cys Ala Val Ala Ile Ala LeuGln Arg Ile Thr Ser Gly Val Tyr Met Cys Ala Val Ala Ile Ala Leu
130 135 140130 135 140
Gly Tyr Lys Glu Ile Tyr Leu Ser Gly Ile Asp Phe Tyr Gln Asn GlyGly Tyr Lys Glu Ile Tyr Leu Ser Gly Ile Asp Phe Tyr Gln Asn Gly
145 150 155 160145 150 155 160
Ser Ser Tyr Ala Phe Asp Thr Lys Gln Lys Asn Leu Leu Lys Leu AlaSer Ser Tyr Ala Phe Asp Thr Lys Gln Lys Asn Leu Leu Lys Leu Ala
165 170 175165 170 175
Pro Asn Phe Lys Asn Asp Asn Ser His Tyr Ile Gly His Ser Lys AsnPro Asn Phe Lys Asn Asp Asn Ser His Tyr Ile Gly His Ser Lys Asn
180 185 190180 185 190
Thr Asp Ile Lys Ala Leu Glu Phe Leu Glu Lys Thr Tyr Lys Ile LysThr Asp Ile Lys Ala Leu Glu Phe Leu Glu Lys Thr Tyr Lys Ile Lys
195 200 205195 200 205
Leu Tyr Cys Leu Cys Pro Asn Ser Leu Leu Ala Asn Phe Ile Glu LeuLeu Tyr Cys Leu Cys Pro Asn Ser Leu Leu Ala Asn Phe Ile Glu Leu
210 215 220210 215 220
Ala Pro Asn Leu Asn Ser Asn Phe Ile Ile Gln Glu Lys Asn Asn TyrAla Pro Asn Leu Asn Ser Asn Phe Ile Ile Gln Glu Lys Asn Asn Tyr
225 230 235 240225 230 235 240
Thr Lys Asp Ile Leu Ile Pro Ser Ser Glu Ala Tyr Gly Lys Phe SerThr Lys Asp Ile Leu Ile Pro Ser Ser Glu Ala Tyr Gly Lys Phe Ser
245 250 255245 250 255
Lys Asn Ile AsnLys Asn Ile Asn
260260
<210> 46<210> 46
<211> 298<211> 298
<212> ПРТ<212> PRT
<213> Streptococcus entericus<213> Streptococcus entericus
<400> 46<400> 46
Met Lys Lys Val Tyr Phe Cys His Thr Val Tyr His Leu Leu Ile ThrMet Lys Lys Val Tyr Phe Cys His Thr Val Tyr His Leu Leu Ile Thr
1 5 10 151 5 10 15
Leu Cys Lys Ile Ser Val Glu Glu Gln Val Glu Ile Ile Val Phe AspLeu Cys Lys Ile Ser Val Glu Glu Gln Val Glu Ile Ile Val Phe Asp
20 25 3020 25 30
Thr Val Ser Asn His Glu Leu Ile Val Gln Lys Ile Arg Asp Val PheThr Val Ser Asn His Glu Leu Ile Val Gln Lys Ile Arg Asp Val Phe
35 40 4535 40 45
Val Asn Thr Thr Val Leu Phe Ala Glu Gln Asn Thr Asp Phe Ser IleVal Asn Thr Thr Val Leu Phe Ala Glu Gln Asn Thr Asp Phe Ser Ile
50 55 6050 55 60
Leu Glu Ile Asp Arg Ala Thr Asp Ile Tyr Val Phe Asn Asp Trp ThrLeu Glu Ile Asp Arg Ala Thr Asp Ile Tyr Val Phe Asn Asp Trp Thr
65 70 75 8065 70 75 80
Pro Ile Gly Ala Tyr Leu Arg Lys Asn Lys Leu Phe Tyr His Leu IlePro Ile Gly Ala Tyr Leu Arg Lys Asn Lys Leu Phe Tyr His Leu Ile
85 90 9585 90 95
Glu Asp Gly Tyr Asn Tyr His Glu Tyr Asn Val Tyr Ala Asn Ala LeuGlu Asp Gly Tyr Asn Tyr His Glu Tyr Asn Val Tyr Ala Asn Ala Leu
100 105 110100 105 110
Thr Met Lys Arg Arg Leu Leu Asn Phe Val Leu Arg Arg Glu Glu ProThr Met Lys Arg Arg Leu Leu Asn Phe Val Leu Arg Arg Glu Glu Pro
115 120 125115 120 125
Ser Gly Phe Ser Arg Tyr Val Arg Ser Ile Glu Val Asn Arg Val LysSer Gly Phe Ser Arg Tyr Val Arg Ser Ile Glu Val Asn Arg Val Lys
130 135 140130 135 140
Tyr Leu Pro Asn Asp Cys Arg Lys Ser Lys Trp Val Glu Lys Pro ArgTyr Leu Pro Asn Asp Cys Arg Lys Ser Lys Trp Val Glu Lys Pro Arg
145 150 155 160145 150 155 160
Ser Ala Leu Phe Glu Asn Leu Val Pro Glu His Lys Gln Lys Ile IleSer Ala Leu Phe Glu Asn Leu Val Pro Glu His Lys Gln Lys Ile Ile
165 170 175165 170 175
Thr Ile Phe Gly Leu Glu Asn Tyr Gln Asp Ser Leu Arg Gly Val LeuThr Ile Phe Gly Leu Glu Asn Tyr Gln Asp Ser Leu Arg Gly Val Leu
180 185 190180 185 190
Val Leu Thr Gln Pro Leu Val Gln Asp Tyr Trp Asp Arg Asp Ile ThrVal Leu Thr Gln Pro Leu Val Gln Asp Tyr Trp Asp Arg Asp Ile Thr
195 200 205195 200 205
Thr Glu Glu Glu Gln Leu Glu Phe Tyr Arg Gln Ile Val Glu Ser TyrThr Glu Glu Glu Gln Leu Glu Phe Tyr Arg Gln Ile Val Glu Ser Tyr
210 215 220210 215 220
Gly Glu Gly Glu Gln Val Phe Phe Lys Ile His Pro Arg Asp Lys ValGly Glu Gly Glu Gln Val Phe Phe Lys Ile His Pro Arg Asp Lys Val
225 230 235 240225 230 235 240
Asp Tyr Ser Ser Leu Thr Asn Val Ile Phe Leu Lys Lys Asn Val ProAsp Tyr Ser Ser Leu Thr Asn Val Ile Phe Leu Lys Lys Asn Val Pro
245 250 255245 250 255
Met Glu Val Tyr Glu Leu Ile Ala Asp Cys His Phe Thr Lys Gly IleMet Glu Val Tyr Glu Leu Ile Ala Asp Cys His Phe Thr Lys Gly Ile
260 265 270260 265 270
Thr His Ser Ser Thr Ala Leu Asp Phe Leu Ser Cys Val Asp Lys LysThr His Ser Ser Thr Ala Leu Asp Phe Leu Ser Cys Val Asp Lys Lys
275 280 285275 280 285
Ile Thr Leu Lys Gln Met Lys Ala Asn SerIle Thr Leu Lys Gln Met Lys Ala Asn Ser
290 295290 295
<210> 47<210> 47
<211> 295<211> 295
<212> ПРТ<212> PRT
<213> Haemophilus ducreyi<213> Haemophilus ducreyi
<400> 47<400> 47
Met Lys Glu Ile Ala Ile Ile Ser Asn Gln Arg Met Phe Phe Leu TyrMet Lys Glu Ile Ala Ile Ile Ser Asn Gln Arg Met Phe Phe Leu Tyr
1 5 10 151 5 10 15
Cys Leu Leu Thr Asn Lys Asn Val Glu Asp Val Phe Phe Ile Phe GluCys Leu Leu Thr Asn Lys Asn Val Glu Asp Val Phe Phe Ile Phe Glu
20 25 3020 25 30
Lys Gly Ala Met Pro Asn Asn Leu Thr Ser Ile Ser His Phe Ile ValLys Gly Ala Met Pro Asn Asn Leu Thr Ser Ile Ser His Phe Ile Val
35 40 4535 40 45
Leu Asp His Ser Lys Ser Glu Cys Tyr Asp Phe Phe Tyr Phe Asn PheLeu Asp His Ser Lys Ser Glu Cys Tyr Asp Phe Phe Tyr Phe Asn Phe
50 55 6050 55 60
Ile Ser Cys Lys Tyr Arg Leu Arg Gly Leu Asp Val Tyr Gly Ala AspIle Ser Cys Lys Tyr Arg Leu Arg Gly Leu Asp Val Tyr Gly Ala Asp
65 70 75 8065 70 75 80
His Ile Lys Gly Ala Lys Phe Phe Leu Glu Arg His Arg Phe Phe ValHis Ile Lys Gly Ala Lys Phe Phe Leu Glu Arg His Arg Phe Phe Val
85 90 9585 90 95
Val Glu Asp Gly Met Met Asn Tyr Ser Lys Asn Met Tyr Ala Phe SerVal Glu Asp Gly Met Met Asn Tyr Ser Lys Asn Met Tyr Ala Phe Ser
100 105 110100 105 110
Leu Phe Arg Thr Arg Asn Pro Val Ile Leu Pro Gly Gly Phe His ProLeu Phe Arg Thr Arg Asn Pro Val Ile Leu Pro Gly Gly Phe His Pro
115 120 125115 120 125
Asn Val Lys Thr Ile Phe Leu Thr Lys Asp Asn Pro Ile Pro Asp GlnAsn Val Lys Thr Ile Phe Leu Thr Lys Asp Asn Pro Ile Pro Asp Gln
130 135 140130 135 140
Ile Ala His Lys Arg Glu Ile Ile Asn Ile Lys Thr Leu Trp Gln AlaIle Ala His Lys Arg Glu Ile Ile Asn Ile Lys Thr Leu Trp Gln Ala
145 150 155 160145 150 155 160
Lys Thr Ala Thr Glu Lys Thr Lys Ile Leu Ser Phe Phe Glu Ile AspLys Thr Ala Thr Glu Lys Thr Lys Ile Leu Ser Phe Phe Glu Ile Asp
165 170 175165 170 175
Met Gln Glu Ile Ser Val Ile Lys Asn Arg Ser Phe Val Leu Tyr ThrMet Gln Glu Ile Ser Val Ile Lys Asn Arg Ser Phe Val Leu Tyr Thr
180 185 190180 185 190
Gln Pro Leu Ser Glu Asp Lys Leu Leu Thr Glu Ala Glu Lys Ile AspGln Pro Leu Ser Glu Asp Lys Leu Leu Thr Glu Ala Glu Lys Ile Asp
195 200 205195 200 205
Ile Tyr Arg Thr Ile Leu Thr Lys Tyr Asn His Ser Gln Thr Val IleIle Tyr Arg Thr Ile Leu Thr Lys Tyr Asn His Ser Gln Thr Val Ile
210 215 220210 215 220
Lys Pro His Pro Arg Asp Lys Thr Asp Tyr Lys Gln Leu Phe Pro AspLys Pro His Pro Arg Asp Lys Thr Asp Tyr Lys Gln Leu Phe Pro Asp
225 230 235 240225 230 235 240
Ala Tyr Val Met Lys Gly Thr Tyr Pro Ser Glu Leu Leu Thr Leu LeuAla Tyr Val Met Lys Gly Thr Tyr Pro Ser Glu Leu Leu Thr Leu Leu
245 250 255245 250 255
Gly Val Asn Phe Asn Lys Val Ile Thr Leu Phe Ser Thr Ala Val PheGly Val Asn Phe Asn Lys Val Ile Thr Leu Phe Ser Thr Ala Val Phe
260 265 270260 265 270
Asp Tyr Pro Lys Glu Lys Ile Asp Phe Tyr Gly Thr Ala Val His ProAsp Tyr Pro Lys Glu Lys Ile Asp Phe Tyr Gly Thr Ala Val His Pro
275 280 285275 280 285
Lys Leu Leu Asp Phe Phe AspLys Leu Leu Asp Phe Phe Asp
290 295290 295
<210> 48<210> 48
<211> 488<211> 488
<212> ПРТ<212> PRT
<213> Alistipes sp.<213> Alistipes sp.
<400> 48<400> 48
Met Ala Leu Leu Ser Gly Thr Ala Ala Cys Ser Asp Asp Glu Val SerMet Ala Leu Leu Ser Gly Thr Ala Ala Cys Ser Asp Asp Glu Val Ser
1 5 10 151 5 10 15
Gln Asn Leu Ile Val Ile Asn Gly Gly Glu His Phe Leu Ser Leu AspGln Asn Leu Ile Val Ile Asn Gly Gly Glu His Phe Leu Ser Leu Asp
20 25 3020 25 30
Gly Leu Ala Arg Ala Gly Lys Ile Ser Val Leu Ala Pro Ala Pro TrpGly Leu Ala Arg Ala Gly Lys Ile Ser Val Leu Ala Pro Ala Pro Trp
35 40 4535 40 45
Arg Val Thr Lys Ala Ala Gly Asp Thr Trp Phe Arg Leu Ser Ala ThrArg Val Thr Lys Ala Ala Gly Asp Thr Trp Phe Arg Leu Ser Ala Thr
50 55 6050 55 60
Glu Gly Pro Ala Gly Tyr Ser Glu Val Glu Leu Ser Leu Asp Glu AsnGlu Gly Pro Ala Gly Tyr Ser Glu Val Glu Leu Ser Leu Asp Glu Asn
65 70 75 8065 70 75 80
Pro Gly Ala Ala Arg Ser Ala Gln Leu Ala Phe Ala Cys Gly Asp AlaPro Gly Ala Ala Arg Ser Ala Gln Leu Ala Phe Ala Cys Gly Asp Ala
85 90 9585 90 95
Ile Val Pro Phe Arg Leu Ser Gln Gly Ala Leu Ser Ala Gly Tyr AspIle Val Pro Phe Arg Leu Ser Gln Gly Ala Leu Ser Ala Gly Tyr Asp
100 105 110100 105 110
Ser Pro Asp Tyr Tyr Phe Tyr Val Thr Phe Gly Thr Met Pro Thr LeuSer Pro Asp Tyr Tyr Phe Tyr Val Thr Phe Gly Thr Met Pro Thr Leu
115 120 125115 120 125
Tyr Ala Gly Ile His Leu Leu Ser His Asp Lys Pro Gly Tyr Val PheTyr Ala Gly Ile His Leu Leu Ser His Asp Lys Pro Gly Tyr Val Phe
130 135 140130 135 140
Tyr Ser Arg Ser Lys Thr Phe Asp Pro Ala Glu Phe Pro Ala Arg AlaTyr Ser Arg Ser Lys Thr Phe Asp Pro Ala Glu Phe Pro Ala Arg Ala
145 150 155 160145 150 155 160
Glu Val Thr Thr Ala Ala Asp Arg Thr Ala Asp Ala Thr Gln Ala GluGlu Val Thr Thr Ala Ala Asp Arg Thr Ala Asp Ala Thr Gln Ala Glu
165 170 175165 170 175
Met Glu Ala Met Ala Arg Glu Met Lys Arg Arg Ile Leu Glu Ile AsnMet Glu Ala Met Ala Arg Glu Met Lys Arg Arg Ile Leu Glu Ile Asn
180 185 190180 185 190
Ser Ala Asp Pro Thr Ala Val Phe Gly Leu Tyr Val Asp Asp Leu ArgSer Ala Asp Pro Thr Ala Val Phe Gly Leu Tyr Val Asp Asp Leu Arg
195 200 205195 200 205
Cys Arg Ile Gly Tyr Asp Trp Phe Val Ala Gln Gly Ile Asp Ser AlaCys Arg Ile Gly Tyr Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Ala
210 215 220210 215 220
Arg Val Lys Val Ser Met Leu Ser Asp Gly Thr Gly Thr Tyr Asn AsnArg Val Lys Val Ser Met Leu Ser Asp Gly Thr Gly Thr Tyr Asn Asn
225 230 235 240225 230 235 240
Phe Tyr Asn Tyr Phe Gly Asp Ala Ala Thr Ala Glu Gln Asn Trp GluPhe Tyr Asn Tyr Phe Gly Asp Ala Ala Thr Ala Glu Gln Asn Trp Glu
245 250 255245 250 255
Ser Tyr Ala Ser Glu Val Glu Ala Leu Asp Trp Asn His Gly Gly ArgSer Tyr Ala Ser Glu Val Glu Ala Leu Asp Trp Asn His Gly Gly Arg
260 265 270260 265 270
Tyr Pro Glu Thr Arg Ser Leu Pro Glu Phe Glu Ser Tyr Thr Trp ProTyr Pro Glu Thr Arg Ser Leu Pro Glu Phe Glu Ser Tyr Thr Trp Pro
275 280 285275 280 285
Tyr Tyr Leu Ser Thr Arg Pro Asp Tyr Arg Leu Val Val Gln Asp GlyTyr Tyr Leu Ser Thr Arg Pro Asp Tyr Arg Leu Val Val Gln Asp Gly
290 295 300290 295 300
Ser Leu Leu Glu Ser Ser Cys Pro Phe Ile Thr Glu Lys Leu Gly GluSer Leu Leu Glu Ser Ser Cys Pro Phe Ile Thr Glu Lys Leu Gly Glu
305 310 315 320305 310 315 320
Met Glu Ile Glu Ser Ile Gln Pro Tyr Glu Met Leu Ser Ala Leu ProMet Glu Ile Glu Ser Ile Gln Pro Tyr Glu Met Leu Ser Ala Leu Pro
325 330 335325 330 335
Glu Ser Ser Arg Lys Arg Phe Tyr Asp Met Ala Gly Phe Asp Tyr AspGlu Ser Ser Arg Lys Arg Phe Tyr Asp Met Ala Gly Phe Asp Tyr Asp
340 345 350340 345 350
Lys Phe Ala Ala Leu Phe Asp Ala Ser Pro Lys Lys Asn Leu Ile IleLys Phe Ala Ala Leu Phe Asp Ala Ser Pro Lys Lys Asn Leu Ile Ile
355 360 365355 360 365
Ile Gly Thr Ser His Ala Asp Asp Ala Ser Ala Arg Leu Gln Arg AspIle Gly Thr Ser His Ala Asp Asp Ala Ser Ala Arg Leu Gln Arg Asp
370 375 380370 375 380
Tyr Val Ala Arg Ile Met Glu Gln Tyr Gly Ala Gln Tyr Asp Val PheTyr Val Ala Arg Ile Met Glu Gln Tyr Gly Ala Gln Tyr Asp Val Phe
385 390 395 400385 390 395 400
Phe Lys Pro His Pro Ala Asp Thr Thr Ser Ala Gly Tyr Glu Thr GluPhe Lys Pro His Pro Ala Asp Thr Thr Ser Ala Gly Tyr Glu Thr Glu
405 410 415405 410 415
Phe Pro Gly Leu Thr Leu Leu Pro Gly Gln Met Pro Phe Glu Ile PhePhe Pro Gly Leu Thr Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe
420 425 430420 425 430
Val Trp Ser Leu Ile Asp Arg Val Asp Met Ile Gly Gly Tyr Pro SerVal Trp Ser Leu Ile Asp Arg Val Asp Met Ile Gly Gly Tyr Pro Ser
435 440 445435 440 445
Thr Val Phe Leu Thr Val Pro Val Asp Lys Val Arg Phe Ile Phe AlaThr Val Phe Leu Thr Val Pro Val Asp Lys Val Arg Phe Ile Phe Ala
450 455 460450 455 460
Ala Asp Ala Ala Ser Leu Val Arg Pro Leu Asn Ile Leu Phe Arg AspAla Asp Ala Ala Ser Leu Val Arg Pro Leu Asn Ile Leu Phe Arg Asp
465 470 475 480465 470 475 480
Ala Thr Asp Val Glu Trp Met GlnAla Thr Asp Val Glu Trp Met Gln
485485
<210> 49<210> 49
<211> 291<211> 291
<212> ПРТ<212> PRT
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 49<400> 49
Met Lys Lys Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Glu IleMet Lys Lys Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Glu Ile
1 5 10 151 5 10 15
Asp Tyr Ser Arg Leu Pro Asn Asp Phe Asp Val Phe Arg Cys Asn GlnAsp Tyr Ser Arg Leu Pro Asn Asp Phe Asp Val Phe Arg Cys Asn Gln
20 20
Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Cys Lys Ala ValPhe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Cys Lys Ala Val
35 40 4535 40 45
Phe Tyr Thr Pro Asn Phe Phe Phe Glu Gln Tyr Tyr Thr Leu Lys HisPhe Tyr Thr Pro Asn Phe Phe Phe Glu Gln Tyr Tyr Thr Leu Lys His
50 55 6050 55 60
Leu Ile Gln Asn Gln Glu Tyr Glu Thr Glu Leu Ile Met Cys Ser AsnLeu Ile Gln Asn Gln Glu Tyr Glu Thr Glu Leu Ile Met Cys Ser Asn
65 70 75 8065 70 75 80
Tyr Asn Gln Ala His Leu Glu Asn Glu Asn Phe Val Lys Thr Phe TyrTyr Asn Gln Ala His Leu Glu Asn Glu Asn Phe Val Lys Thr Phe Tyr
85 90 9585 90 95
Asp Tyr Phe Pro Asp Ala His Leu Gly Tyr Asp Phe Phe Lys Gln LeuAsp Tyr Phe Pro Asp Ala His Leu Gly Tyr Asp Phe Phe Lys Gln Leu
100 105 110100 105 110
Lys Glu Phe Asn Ala Tyr Phe Lys Phe His Glu Ile Tyr Phe Asn GlnLys Glu Phe Asn Ala Tyr Phe Lys Phe His Glu Ile Tyr Phe Asn Gln
115 120 125115 120 125
Arg Ile Thr Ser Gly Val Tyr Met Cys Ala Val Ala Ile Ala Leu GlyArg Ile Thr Ser Gly Val Tyr Met Cys Ala Val Ala Ile Ala Leu Gly
130 135 140130 135 140
Tyr Lys Glu Ile Tyr Leu Ser Gly Ile Asp Phe Tyr Gln Asn Gly SerTyr Lys Glu Ile Tyr Leu Ser Gly Ile Asp Phe Tyr Gln Asn Gly Ser
145 150 155 160145 150 155 160
Ser Tyr Ala Phe Asp Thr Lys Gln Glu Asn Leu Leu Lys Leu Ala ProSer Tyr Ala Phe Asp Thr Lys Gln Glu Asn Leu Leu Lys Leu Ala Pro
165 170 175165 170 175
Asp Phe Lys Asn Asp Arg Ser His Tyr Ile Gly His Ser Lys Asn ThrAsp Phe Lys Asn Asp Arg Ser His Tyr Ile Gly His Ser Lys Asn Thr
180 185 190180 185 190
Asp Ile Lys Ala Leu Glu Phe Leu Glu Lys Thr Tyr Lys Ile Lys LeuAsp Ile Lys Ala Leu Glu Phe Leu Glu Lys Thr Tyr Lys Ile Lys Leu
195 200 205195 200 205
Tyr Cys Leu Cys Pro Asn Ser Leu Leu Ala Asn Phe Ile Glu Leu AlaTyr Cys Leu Cys Pro Asn Ser Leu Leu Ala Asn Phe Ile Glu Leu Ala
210 215 220210 215 220
Pro Asn Leu Asn Ser Asn Phe Ile Ile Gln Glu Lys Asn Asn Tyr ThrPro Asn Leu Asn Ser Asn Phe Ile Ile Gln Glu Lys Asn Asn Tyr Thr
225 230 235 240225 230 235 240
Lys Asp Ile Leu Ile Pro Ser Ser Glu Ala Tyr Gly Lys Phe Ser LysLys Asp Ile Leu Ile Pro Ser Ser Glu Ala Tyr Gly Lys Phe Ser Lys
245 250 255245 250 255
Asn Ile Asn Phe Lys Lys Ile Lys Ile Lys Glu Asn Val Tyr Tyr LysAsn Ile Asn Phe Lys Lys Ile Lys Ile Lys Glu Asn Val Tyr Tyr Lys
260 265 270260 265 270
Leu Ile Lys Asp Leu Leu Arg Leu Pro Ser Asp Ile Lys His Tyr PheLeu Ile Lys Asp Leu Leu Arg Leu Pro Ser Asp Ile Lys His Tyr Phe
275 280 285275 280 285
Lys Gly LysLys Gly Lys
290290
<210> 50<210> 50
<211> 312<211> 312
<212> ПРТ<212> PRT
<213> Streptococcus agalactiae<213> Streptococcus agalactiae
<400> 50<400> 50
Met Thr Asn Arg Lys Ile Tyr Val Cys His Thr Leu Tyr His Leu LeuMet Thr Asn Arg Lys Ile Tyr Val Cys His Thr Leu Tyr His Leu Leu
1 5 10 151 5 10 15
Ile Cys Leu Tyr Lys Glu Glu Ile Tyr Ser Asn Leu Glu Ile Ile LeuIle Cys Leu Tyr Lys Glu Glu Ile Tyr Ser Asn Leu Glu Ile Ile Leu
20 25 3020 25 30
Ser Ser Ser Ile Pro Asp Val Asp Asn Leu Glu Lys Lys Leu Lys SerSer Ser Ser Ile Pro Asp Val Asp Asn Leu Glu Lys Lys Leu Lys Ser
35 40 4535 40 45
Lys Thr Ile Asn Ile His Ile Leu Glu Glu Ser Ser Gly Glu Ser GluLys Thr Ile Asn Ile His Ile Leu Glu Glu Ser Ser Gly Glu Ser Glu
50 55 6050 55 60
Glu Leu Leu Ser Val Leu Lys Asp Ala Gly Leu Ser Tyr Ser Lys PheGlu Leu Leu Ser Val Leu Lys Asp Ala Gly Leu Ser Tyr Ser Lys Phe
65 70 75 8065 70 75 80
Asp Ser Asn Cys Phe Ile Phe Asn Asp Ala Thr Pro Ile Gly Arg ThrAsp Ser Asn Cys Phe Ile Phe Asn Asp Ala Thr Pro Ile Gly Arg Thr
85 90 9585 90 95
Leu Ile Lys His Gly Ile Tyr Tyr Asn Leu Ile Glu Asp Gly Leu AsnLeu Ile Lys His Gly Ile Tyr Tyr Asn Leu Ile Glu Asp Gly Leu Asn
100 105 110100 105 110
Cys Phe Thr Tyr Ser Ile Phe Ser Gln Lys Leu Trp Lys Tyr Tyr ValCys Phe Thr Tyr Ser Ile Phe Ser Gln Lys Leu Trp Lys Tyr Tyr Val
115 120 125115 120 125
Lys Lys Tyr Ile Leu His Lys Ile Gln Pro His Gly Phe Ser Arg TyrLys Lys Tyr Ile Leu His Lys Ile Gln Pro His Gly Phe Ser Arg Tyr
130 135 140130 135 140
Cys Leu Gly Ile Glu Val Asn Ser Leu Val Asn Leu Pro Lys Asp ProCys Leu Gly Ile Glu Val Asn Ser Leu Val Asn Leu Pro Lys Asp Pro
145 150 155 160145 150 155 160
Arg Tyr Lys Lys Phe Ile Glu Val Pro Arg Lys Glu Leu Phe Asp AsnArg Tyr Lys Lys Phe Ile Glu Val Pro Arg Lys Glu Leu Phe Asp Asn
165 170 175165 170 175
Val Thr Glu Tyr Gln Lys Glu Met Ala Ile Asn Leu Phe Gly Ala ValVal Thr Glu Tyr Gln Lys Glu Met Ala Ile Asn Leu Phe Gly Ala Val
180 185 190180 185 190
Arg Val Ser Ile Lys Ser Pro Ser Val Leu Val Leu Thr Gln Pro LeuArg Val Ser Ile Lys Ser Pro Ser Val Leu Val Leu Thr Gln Pro Leu
195 200 205195 200 205
Ser Ile Asp Lys Glu Phe Met Ser Tyr Asn Asn Lys Ile Glu Thr SerSer Ile Asp Lys Glu Phe Met Ser Tyr Asn Asn Lys Ile Glu Thr Ser
210 215 220210 215 220
Glu Glu Gln Phe Asn Phe Tyr Lys Ser Ile Val Asn Glu Tyr Ile AsnGlu Glu Gln Phe Asn Phe Tyr Lys Ser Ile Val Asn Glu Tyr Ile Asn
225 230 235 240225 230 235 240
Lys Gly Tyr Asn Val Tyr Leu Lys Val His Pro Arg Asp Val Val AspLys Gly Tyr Asn Val Tyr Leu Lys Val His Pro Arg Asp Val Val Asp
245 250 255245 250 255
Tyr Ser Lys Leu Pro Val Glu Leu Leu Pro Ser Asn Val Pro Met GluTyr Ser Lys Leu Pro Val Glu Leu Leu Pro Ser Asn Val Pro Met Glu
260 265 270260 265 270
Ile Ile Glu Leu Met Leu Thr Gly Arg Phe Glu Cys Gly Ile Thr HisIle Ile Glu Leu Met Leu Thr Gly Arg Phe Glu Cys Gly Ile Thr His
275 280 285275 280 285
Ser Ser Thr Ala Leu Asp Phe Leu Thr Cys Val Asp Lys Lys Ile ThrSer Ser Thr Ala Leu Asp Phe Leu Thr Cys Val Asp Lys Lys Ile Thr
290 295 300290 295 300
Leu Val Asp Leu Lys Asp Ile LysLeu Val Asp Leu Lys Asp Ile Lys
305 310305 310
<210> 51<210> 51
<211> 410<211> 410
<212> ПРТ<212> PRT
<213> Bibersteinia trehalosi<213> Bibersteinia trehalosi
<400> 51<400> 51
Met Glu Phe Cys Lys Met Ala Thr Thr Gln Lys Ile Cys Val Tyr LeuMet Glu Phe Cys Lys Met Ala Thr Thr Gln Lys Ile Cys Val Tyr Leu
1 5 10 151 5 10 15
Asp Tyr Ala Thr Ile Pro Ser Leu Asn Tyr Ile Leu His Phe Ala GlnAsp Tyr Ala Thr Ile Pro Ser Leu Asn Tyr Ile Leu His Phe Ala Gln
20 25 3020 25 30
His Phe Glu Asp Gln Glu Thr Ile Arg Leu Phe Gly Leu Ser Arg PheHis Phe Glu Asp Gln Glu Thr Ile Arg Leu Phe Gly Leu Ser Arg Phe
35 40 4535 40 45
His Ile Pro Glu Ser Val Ile Gln Arg Tyr Pro Lys Gly Val Val GlnHis Ile Pro Glu Ser Val Ile Gln Arg Tyr Pro Lys Gly Val Val Gln
50 55 6050 55 60
Phe Tyr Pro Asn Gln Glu Lys Asp Phe Ser Ala Leu Leu Leu Ala LeuPhe Tyr Pro Asn Gln Glu Lys Asp Phe Ser Ala Leu Leu Leu Ala Leu
65 70 75 8065 70 75 80
Lys Asn Ile Leu Ile Glu Val Lys Gln Gln Gln Arg Lys Cys Glu IleLys Asn Ile Leu Ile Glu Val Lys Gln Gln Gln Arg Lys Cys Glu Ile
85 90 9585 90 95
Glu Leu His Leu Asn Leu Phe His Tyr Gln Leu Leu Leu Leu Pro PheGlu Leu His Leu Asn Leu Phe His Tyr Gln Leu Leu Leu Leu Pro Phe
100 105 110100 105 110
Leu Ser Leu Tyr Leu Asp Thr Gln Asp Tyr Cys His Leu Thr Leu LysLeu Ser Leu Tyr Leu Asp Thr Gln Asp Tyr Cys His Leu Thr Leu Lys
115 120 125115 120 125
Phe Tyr Asp Asp Gly Ser Glu Ala Ile Ser Ala Leu Gln Glu Leu AlaPhe Tyr Asp Asp Gly Ser Glu Ala Ile Ser Ala Leu Gln Glu Leu Ala
130 135 140130 135 140
Leu Ala Pro Asp Leu Ala Ala Gln Ile Gln Phe Glu Lys Gln Gln PheLeu Ala Pro Asp Leu Ala Ala Gln Ile Gln Phe Glu Lys Gln Gln Phe
145 150 155 160145 150 155 160
Asp Glu Leu Val Val Lys Lys Ser Phe Lys Leu Ser Leu Leu Ser ArgAsp Glu Leu Val Val Lys Lys Ser Phe Lys Leu Ser Leu Leu Ser Arg
165 170 175165 170 175
Tyr Phe Trp Gly Lys Leu Phe Glu Ser Glu Tyr Ile Trp Phe Asn GlnTyr Phe Trp Gly Lys Leu Phe Glu Ser Glu Tyr Ile Trp Phe Asn Gln
180 185 190180 185 190
Ala Ile Leu Gln Lys Ala Glu Leu Gln Ile Leu Lys Gln Glu Ile SerAla Ile Leu Gln Lys Ala Glu Leu Gln Ile Leu Lys Gln Glu Ile Ser
195 200 205195 200 205
Ser Ser Arg Gln Met Asp Phe Ala Ile Tyr Gln Gln Met Ser Asp GluSer Ser Arg Gln Met Asp Phe Ala Ile Tyr Gln Gln Met Ser Asp Glu
210 215 220210 215 220
Gln Lys Gln Leu Val Leu Glu Ile Leu Asn Ile Asp Leu Asn Lys ValGln Lys Gln Leu Val Leu Glu Ile Leu Asn Ile Asp Leu Asn Lys Val
225 230 235 240225 230 235 240
Ala Tyr Leu Lys Gln Leu Met Glu Asn Gln Pro Ser Phe Leu Phe LeuAla Tyr Leu Lys Gln Leu Met Glu Asn Gln Pro Ser Phe Leu Phe Leu
245 250 255245 250 255
Gly Thr Thr Leu Phe Asn Ile Thr Gln Glu Thr Lys Thr Trp Leu MetGly Thr Thr Leu Phe Asn Ile Thr Gln Glu Thr Lys Thr Trp Leu Met
260 265 270260 265 270
Gln Met His Val Asp Leu Ile Gln Gln Tyr Cys Leu Pro Ser Gly GlnGln Met His Val Asp Leu Ile Gln Gln Tyr Cys Leu Pro Ser Gly Gln
275 280 285275 280 285
Phe Phe Asn Asn Lys Ala Gly Tyr Leu Cys Phe Tyr Lys Gly His ProPhe Phe Asn Asn Lys Ala Gly Tyr Leu Cys Phe Tyr Lys Gly His Pro
290 295 300290 295 300
Asn Glu Lys Glu Met Asn Gln Met Ile Leu Ser Gln Phe Lys Asn LeuAsn Glu Lys Glu Met Asn Gln Met Ile Leu Ser Gln Phe Lys Asn Leu
305 310 315 320305 310 315 320
Ile Ala Leu Pro Asp Asp Ile Pro Leu Glu Ile Leu Leu Leu Leu GlyIle Ala Leu Pro Asp Asp Ile Pro Leu Glu Ile Leu Leu Leu Leu Gly
325 330 335325 330 335
Val Ile Pro Ser Lys Val Gly Gly Phe Ala Ser Ser Ala Leu Phe AsnVal Ile Pro Ser Lys Val Gly Gly Phe Ala Ser Ser Ala Leu Phe Asn
340 345 350340 345 350
Phe Thr Pro Ala Gln Ile Glu Asn Ile Ile Phe Phe Thr Pro Arg TyrPhe Thr Pro Ala Gln Ile Glu Asn Ile Ile Phe Phe Thr Pro Arg Tyr
355 360 365355 360 365
Phe Glu Lys Asp Asn Arg Leu His Ala Thr Gln Tyr Arg Leu Met GlnPhe Glu Lys Asp Asn Arg Leu His Ala Thr Gln Tyr Arg Leu Met Gln
370 375 380370 375 380
Gly Leu Ile Glu Leu Gly Tyr Leu Asp Ala Glu Lys Ser Val Thr HisGly Leu Ile Glu Leu Gly Tyr Leu Asp Ala Glu Lys Ser Val Thr His
385 390 395 400385 390 395 400
Phe Glu Ile Met Gln Leu Leu Thr Lys GluPhe Glu Ile Met Gln Leu Leu Thr Lys Glu
405 410405 410
<210> 52<210> 52
<211> 406<211> 406
<212> ПРТ<212> PRT
<213> Haemophilus parahaemolyticus<213> Haemophilus parahaemolyticus
<400> 52<400> 52
Met Thr Glu Gln Tyr Ile Lys Asn Val Glu Val Tyr Leu Asp Tyr AlaMet Thr Glu Gln Tyr Ile Lys Asn Val Glu Val Tyr Leu Asp Tyr Ala
1 5 10 151 5 10 15
Thr Ile Pro Thr Leu Asn Tyr Phe Tyr His Phe Thr Glu Asn Lys AspThr Ile Pro Thr Leu Asn Tyr Phe Tyr His Phe Thr Glu Asn Lys Asp
20 25 3020 25 30
Asp Ile Ala Thr Ile Arg Leu Phe Gly Leu Gly Arg Phe Asn Ile SerAsp Ile Ala Thr Ile Arg Leu Phe Gly Leu Gly Arg Phe Asn Ile Ser
35 40 4535 40 45
Lys Ser Ile Ile Glu Ser Tyr Pro Glu Gly Ile Ile Arg Tyr Cys ProLys Ser Ile Ile Glu Ser Tyr Pro Glu Gly Ile Ile Arg Tyr Cys Pro
50 55 6050 55 60
Ile Ile Phe Glu Asp Gln Thr Ala Phe Gln Gln Leu Phe Ile Thr LeuIle Ile Phe Glu Asp Gln Thr Ala Phe Gln Gln Leu Phe Ile Thr Leu
65 70 75 8065 70 75 80
Leu Thr Glu Asp Ser Phe Cys Gln Tyr Arg Phe Asn Phe His Ile AsnLeu Thr Glu Asp Ser Phe Cys Gln Tyr Arg Phe Asn Phe His Ile Asn
85 90 9585 90 95
Leu Phe His Ser Trp Lys Met Leu Ile Pro Leu Leu His Ile Ile TrpLeu Phe His Ser Trp Lys Met Leu Ile Pro Leu Leu His Ile Ile Trp
100 105 110100 105 110
Gln Phe Lys His Lys Val Leu Asp Ile Lys Leu Asn Phe Tyr Asp AspGln Phe Lys His Lys Val Leu Asp Ile Lys Leu Asn Phe Tyr Asp Asp
115 120 125115 120 125
Gly Ser Glu Gly Leu Val Thr Leu Ser Lys Ile Glu Gln Asn Tyr SerGly Ser Glu Gly Leu Val Thr Leu Ser Lys Ile Glu Gln Asn Tyr Ser
130 135 140130 135 140
Ser Glu Ile Leu Gln Lys Ile Ile Asp Ile Asp Ser Gln Ser Phe TyrSer Glu Ile Leu Gln Lys Ile Ile Asp Ile Asp Ser Gln Ser Phe Tyr
145 150 155 160145 150 155 160
Ala Asp Lys Leu Ser Phe Leu Asp Glu Asp Ile Ala Arg Tyr Leu TrpAla Asp Lys Leu Ser Phe Leu Asp Glu Asp Ile Ala Arg Tyr Leu Trp
165 170 175165 170 175
Asn Ser Leu Phe Glu Ser His Tyr Tyr Leu Leu Asn Asp Phe Leu LeuAsn Ser Leu Phe Glu Ser His Tyr Tyr Leu Leu Asn Asp Phe Leu Leu
180 185 190180 185 190
Lys Asn Glu Lys Leu Ser Leu Leu Lys Asn Ser Ile Lys Tyr Cys HisLys Asn Glu Lys Leu Ser Leu Leu Lys Asn Ser Ile Lys Tyr Cys His
195 200 205195 200 205
Ile Met Asp Leu Glu Arg Tyr Leu Gln Phe Thr Gln Glu Glu Lys AspIle Met Asp Leu Glu Arg Tyr Leu Gln Phe Thr Gln Glu Glu Lys Asp
210 215 220210 215 220
Phe Phe Asn Glu Leu Leu Gly Ile Asn Ile Gln Ser Leu Glu Asp LysPhe Phe Asn Glu Leu Leu Gly Ile Asn Ile Gln Ser Leu Glu Asp Lys
225 230 235 240225 230 235 240
Ile Lys Ile Phe Gln Gln Lys Lys Thr Phe Ile Phe Thr Gly Thr ThrIle Lys Ile Phe Gln Gln Lys Lys Thr Phe Ile Phe Thr Gly Thr Thr
245 250 255245 250 255
Ile Phe Ser Leu Pro Lys Glu Glu Glu Glu Thr Leu Tyr Arg Leu HisIle Phe Ser Leu Pro Lys Glu Glu Glu Glu Thr Leu Tyr Arg Leu His
260 265 270260 265 270
Leu Asn Ala Ile Leu Asn Tyr Ile His Pro Asn Gly Lys Tyr Phe IleLeu Asn Ala Ile Leu Asn Tyr Ile His Pro Asn Gly Lys Tyr Phe Ile
275 280 285275 280 285
Gly Asp Gly Phe Thr Leu Val Ile Lys Gly His Pro His Gln Lys GluGly Asp Gly Phe Thr Leu Val Ile Lys Gly His Pro His Gln Lys Glu
290 295 300290 295 300
Met Asn Ser Arg Leu Glu Lys Ser Phe Glu Lys Ala Val Met Leu ProMet Asn Ser Arg Leu Glu Lys Ser Phe Glu Lys Ala Val Met Leu Pro
305 310 315 320305 310 315 320
Asp Asn Ile Pro Phe Glu Ile Leu Tyr Leu Ile Gly Cys Lys Pro AspAsp Asn Ile Pro Phe Glu Ile Leu Tyr Leu Ile Gly Cys Lys Pro Asp
325 330 335325 330 335
Lys Ile Gly Gly Phe Val Ser Thr Ser Tyr Phe Ser Cys Asp Lys LysLys Ile Gly Gly Phe Val Ser Thr Ser Tyr Phe Ser Cys Asp Lys Lys
340 345 350340 345 350
Asn Ile Ala Asp Leu Leu Phe Ile Ser Ala Arg Gln Glu Glu Val ArgAsn Ile Ala Asp Leu Leu Phe Ile Ser Ala Arg Gln Glu Glu Val Arg
355 360 365355 360 365
Lys Asn Asp Tyr Leu Phe Asn Ile Gln Tyr Gln Leu Arg Asp Met MetLys Asn Asp Tyr Leu Phe Asn Ile Gln Tyr Gln Leu Arg Asp Met Met
370 375 380370 375 380
Ile Lys Thr Gly Phe Ile Gln Glu Glu Lys Thr His Phe Tyr Ser AspIle Lys Thr Gly Phe Ile Gln Glu Glu Lys Thr His Phe Tyr Ser Asp
385 390 395 400385 390 395 400
Ile Pro Ile Phe Ile SerIle Pro Ile Phe Ile Ser
405405
<210> 53<210> 53
<211> 300<211> 300
<212> ПРТ<212> PRT
<213> Haemophilus somnus<213> Haemophilus somnus
<400> 53<400> 53
Met Lys Tyr Asn Ile Lys Ile Lys Ala Ile Val Ile Val Ser Ser LeuMet Lys Tyr Asn Ile Lys Ile Lys Ala Ile Val Ile Val Ser Ser Leu
1 5 10 151 5 10 15
Arg Met Leu Leu Ile Phe Leu Met Leu Asn Lys Tyr His Leu Asp GluArg Met Leu Leu Ile Phe Leu Met Leu Asn Lys Tyr His Leu Asp Glu
20 25 3020 25 30
Val Leu Phe Val Phe Asn Glu Gly Phe Glu Leu His Lys Lys Tyr LysVal Leu Phe Val Phe Asn Glu Gly Phe Glu Leu His Lys Lys Tyr Lys
35 40 4535 40 45
Ile Lys His Tyr Val Ala Ile Lys Lys Lys Ile Thr Lys Phe Trp ArgIle Lys His Tyr Val Ala Ile Lys Lys Lys Ile Thr Lys Phe Trp Arg
50 55 6050 55 60
Leu Tyr Tyr Lys Leu Tyr Phe Tyr Arg Phe Lys Ile Asp Arg Ile ProLeu Tyr Tyr Lys Leu Tyr Phe Tyr Arg Phe Lys Ile Asp Arg Ile Pro
65 70 75 8065 70 75 80
Val Tyr Gly Ala Asp His Leu Gly Trp Thr Asp Tyr Phe Leu Lys TyrVal Tyr Gly Ala Asp His Leu Gly Trp Thr Asp Tyr Phe Leu Lys Tyr
85 90 9585 90 95
Phe Asp Phe Tyr Leu Ile Glu Asp Gly Ile Ala Asn Phe Ser Pro LysPhe Asp Phe Tyr Leu Ile Glu Asp Gly Ile Ala Asn Phe Ser Pro Lys
100 105 110100 105 110
Arg Tyr Glu Ile Asn Leu Thr Arg Asn Ile Pro Val Phe Gly Phe HisArg Tyr Glu Ile Asn Leu Thr Arg Asn Ile Pro Val Phe Gly Phe His
115 120 125115 120 125
Lys Thr Val Lys Lys Ile Tyr Leu Thr Ser Leu Glu Asn Val Pro SerLys Thr Val Lys Lys Ile Tyr Leu Thr Ser Leu Glu Asn Val Pro Ser
130 135 140130 135 140
Asp Ile Arg His Lys Val Glu Leu Ile Ser Leu Glu His Leu Trp LysAsp Ile Arg His Lys Val Glu Leu Ile Ser Leu Glu His Leu Trp Lys
145 150 155 160145 150 155 160
Thr Arg Thr Ala Gln Glu Gln His Asn Ile Leu Asp Phe Phe Ala PheThr Arg Thr Ala Gln Glu Gln His Asn Ile Leu Asp Phe Phe Ala Phe
165 170 175165 170 175
Asn Leu Asp Ser Leu Ile Ser Leu Lys Met Lys Lys Tyr Ile Leu PheAsn Leu Asp Ser Leu Ile Ser Leu Lys Met Lys Lys Tyr Ile Leu Phe
180 185 190180 185 190
Thr Gln Cys Leu Ser Glu Asp Arg Val Ile Ser Glu Gln Glu Lys IleThr Gln Cys Leu Ser Glu Asp Arg Val Ile Ser Glu Gln Glu Lys Ile
195 200 205195 200 205
Ala Ile Tyr Gln His Ile Ile Lys Asn Tyr Asp Glu Arg Leu Leu ValAla Ile Tyr Gln His Ile Ile Lys Asn Tyr Asp Glu Arg Leu Leu Val
210 215 220210 215 220
Ile Lys Pro His Pro Arg Glu Thr Thr Asp Tyr Gln Lys Tyr Phe GluIle Lys Pro His Pro Arg Glu Thr Thr Asp Tyr Gln Lys Tyr Phe Glu
225 230 235 240225 230 235 240
Asn Val Phe Val Tyr Gln Asp Val Val Pro Ser Glu Leu Phe Glu LeuAsn Val Phe Val Tyr Gln Asp Val Val Pro Ser Glu Leu Phe Glu Leu
245 250 255245 250 255
Leu Asp Val Asn Phe Glu Arg Val Ile Thr Leu Phe Ser Thr Ala ValLeu Asp Val Asn Phe Glu Arg Val Ile Thr Leu Phe Ser Thr Ala Val
260 265 270260 265 270
Phe Lys Tyr Asp Arg Asn Ile Val Asp Phe Tyr Gly Thr Arg Ile HisPhe Lys Tyr Asp Arg Asn Ile Val Asp Phe Tyr Gly Thr Arg Ile His
275 280 285275 280 285
Asp Lys Ile Tyr Gln Trp Phe Gly Asp Ile Lys PheAsp Lys Ile Tyr Gln Trp Phe Gly Asp Ile Lys Phe
290 295 300290 295 300
<210> 54<210> 54
<211> 381<211> 381
<212> ПРТ<212> PRT
<213> Vibrio harveyi<213> Vibrio harveyi
<400> 54<400> 54
Met Asp Ser Ser Pro Glu Asn Thr Ser Ser Thr Leu Glu Ile Tyr IleMet Asp Ser Ser Pro Glu Asn Thr Ser Ser Thr Leu Glu Ile Tyr Ile
1 5 10 151 5 10 15
Asp Ser Ala Thr Leu Pro Ser Leu Gln His Met Val Lys Ile Ile AspAsp Ser Ala Thr Leu Pro Ser Leu Gln His Met Val Lys Ile Ile Asp
20 25 3020 25 30
Glu Gln Ser Gly Asn Lys Lys Leu Ile Asn Trp Lys Arg Tyr Pro IleGlu Gln Ser Gly Asn Lys Lys Leu Ile Asn Trp Lys Arg Tyr Pro Ile
35 40 4535 40 45
Asp Asp Glu Leu Leu Leu Asp Lys Ile Asn Ala Leu Ser Phe Ser AspAsp Asp Glu Leu Leu Leu Asp Lys Ile Asn Ala Leu Ser Phe Ser Asp
50 55 6050 55 60
Thr Thr Asp Leu Thr Arg Tyr Met Glu Ser Ile Leu Leu Ile Gly AspThr Thr Asp Leu Thr Arg Tyr Met Glu Ser Ile Leu Leu Ile Gly Asp
65 70 75 8065 70 75 80
Ile Lys Arg Val Val Ile Asn Gly Asn Ser Leu Ser Asn Tyr Asn IleIle Lys Arg Val Val Ile Asn Gly Asn Ser Leu Ser Asn Tyr Asn Ile
85 90 9585 90 95
Val Gly Val Met Arg Ser Ile Asn Ala Leu Gly Leu Asp Leu Asp ValVal Gly Val Met Arg Ser Ile Asn Ala Leu Gly Leu Asp Leu Asp Val
100 105 110100 105 110
Glu Ile Asn Phe Tyr Asp Asp Gly Ser Ala Glu Tyr Val Arg Leu TyrGlu Ile Asn Phe Tyr Asp Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr
115 120 125115 120 125
Asn Phe Ser Gln Leu Pro Glu Ala Glu Arg Glu Leu Leu Val Ser MetAsn Phe Ser Gln Leu Pro Glu Ala Glu Arg Glu Leu Leu Val Ser Met
130 135 140130 135 140
Ser Lys Asn Asn Ile Leu Ala Ala Val Asn Gly Ile Gly Ser Tyr AspSer Lys Asn Asn Ile Leu Ala Ala Val Asn Gly Ile Gly Ser Tyr Asp
145 150 155 160145 150 155 160
Ser Gly Ser Pro Glu Asn Ile Tyr Gly Phe Ala Gln Ile Tyr Pro AlaSer Gly Ser Pro Glu Asn Ile Tyr Gly Phe Ala Gln Ile Tyr Pro Ala
165 170 175165 170 175
Thr Tyr His Met Leu Arg Ala Asp Ile Phe Asp Thr Asp Leu Glu IleThr Tyr His Met Leu Arg Ala Asp Ile Phe Asp Thr Asp Leu Glu Ile
180 185 190180 185 190
Gly Leu Ile Arg Asp Ile Leu Gly Asp Asn Val Lys Gln Met Lys TrpGly Leu Ile Arg Asp Ile Leu Gly Asp Asn Val Lys Gln Met Lys Trp
195 200 205195 200 205
Gly Gln Phe Leu Gly Phe Asn Glu Glu Gln Lys Glu Leu Phe Tyr GlnGly Gln Phe Leu Gly Phe Asn Glu Glu Gln Lys Glu Leu Phe Tyr Gln
210 215 220210 215 220
Leu Thr Ser Phe Asn Pro Asp Lys Ile Gln Ala Gln Tyr Lys Glu SerLeu Thr Ser Phe Asn Pro Asp Lys Ile Gln Ala Gln Tyr Lys Glu Ser
225 230 235 240225 230 235 240
Pro Asn Lys Asn Phe Val Phe Val Gly Thr Asn Ser Arg Ser Ala ThrPro Asn Lys Asn Phe Val Phe Val Gly Thr Asn Ser Arg Ser Ala Thr
245 250 255245 250 255
Ala Glu Gln Gln Ile Asn Ile Ile Lys Glu Ala Lys Lys Leu Asp SerAla Glu Gln Gln Ile Asn Ile Ile Lys Glu Ala Lys Lys Leu Asp Ser
260 265 270260 265 270
Glu Ile Ile Pro Asn Ser Ile Asp Gly Tyr Asp Leu Phe Phe Lys GlyGlu Ile Ile Pro Asn Ser Ile Asp Gly Tyr Asp Leu Phe Phe Lys Gly
275 280 285275 280 285
His Pro Ser Ala Thr Tyr Asn Gln Gln Ile Val Asp Ala His Asp MetHis Pro Ser Ala Thr Tyr Asn Gln Gln Ile Val Asp Ala His Asp Met
290 295 300290 295 300
Thr Glu Ile Tyr Asn Arg Thr Pro Phe Glu Val Leu Ala Met Thr SerThr Glu Ile Tyr Asn Arg Thr Pro Phe Glu Val Leu Ala Met Thr Ser
305 310 315 320305 310 315 320
Ser Leu Pro Asp Ala Val Gly Gly Met Gly Ser Ser Leu Phe Phe SerSer Leu Pro Asp Ala Val Gly Gly Met Gly Ser Ser Leu Phe Phe Ser
325 330 335325 330 335
Leu Pro Lys Thr Val Glu Thr Lys Phe Ile Phe Tyr Lys Ser Gly ThrLeu Pro Lys Thr Val Glu Thr Lys Phe Ile Phe Tyr Lys Ser Gly Thr
340 345 350340 345 350
Asp Ile Glu Ser Asn Ala Leu Ile Gln Val Met Leu Lys Leu Gly IleAsp Ile Glu Ser Asn Ala Leu Ile Gln Val Met Leu Lys Leu Gly Ile
355 360 365355 360 365
Ile Thr Asp Glu Lys Val Arg Phe Thr Thr Asp Ile LysIle Thr Asp Glu Lys Val Arg Phe Thr Thr Asp Ile Lys
370 375 380370 375 380
<210> 55<210> 55
<211> 483<211> 483
<212> ПРТ<212> PRT
<213> Alistipes sp.<213> Alistipes sp.
<400> 55<400> 55
Met Ala Ser Cys Ser Asp Asp Asp Lys Glu Gln Thr Gly Phe Gln IleMet Ala Ser Cys Ser Asp Asp Asp Lys Glu Gln Thr Gly Phe Gln Ile
1 5 10 151 5 10 15
Asp Asp Gly Ser Gly Phe Leu Ser Leu Asp Ala Ala Ala Arg Ser GlyAsp Asp Gly Ser Gly Phe Leu Ser Leu Asp Ala Ala Ala Arg Ser Gly
20 25 3020 25 30
Ser Ile Ala Ile Thr Ala Asn Asn Ser Trp Ser Val Thr Gln Asp LysSer Ile Ala Ile Thr Ala Asn Asn Ser Trp Ser Val Thr Gln Asp Lys
35 40 4535 40 45
Asp Ser Glu Trp Leu Thr Leu Ser Thr Thr Ser Gly Ala Ala Gly ArgAsp Ser Glu Trp Leu Thr Leu Ser Thr Thr Ser Gly Ala Ala Gly Arg
50 55 6050 55 60
Thr Glu Ile Gly Ile Met Leu Glu Ala Asn Pro Gly Glu Ala Arg AsnThr Glu Ile Gly Ile Met Leu Glu Ala Asn Pro Gly Glu Ala Arg Asn
65 70 75 8065 70 75 80
Ala Gly Leu Thr Phe Asn Ser Gly Gly Arg Thr Tyr Pro Phe Val IleAla Gly Leu Thr Phe Asn Ser Gly Gly Arg Thr Tyr Pro Phe Val Ile
85 90 9585 90 95
Thr Gln Ser Ala His Val Thr Ala Asp Phe Asp Asp Ala Asp His CysThr Gln Ser Ala His Val Thr Ala Asp Phe Asp Asp Ala Asp His Cys
100 105 110100 105 110
Phe Tyr Ile Thr Phe Gly Thr Leu Pro Thr Leu Tyr Ala Gly Leu HisPhe Tyr Ile Thr Phe Gly Thr Leu Pro Thr Leu Tyr Ala Gly Leu His
115 120 125115 120 125
Val Leu Ser His Asp Lys Pro Ser Tyr Val Phe Phe Gln Arg Ser GlnVal Leu Ser His Asp Lys Pro Ser Tyr Val Phe Phe Gln Arg Ser Gln
130 135 140130 135 140
Thr Phe Arg Pro Glu Glu Phe Pro Ala His Ala Glu Val Thr Ile AlaThr Phe Arg Pro Glu Glu Phe Pro Ala His Ala Glu Val Thr Ile Ala
145 150 155 160145 150 155 160
Ala Asp Pro Ser Ala Asn Ala Thr Asp Glu Asp Met Glu Arg Met ArgAla Asp Pro Ser Ala Asn Ala Thr Asp Glu Asp Met Glu Arg Met Arg
165 170 175165 170 175
Thr Ala Met Lys Gln Gln Ile Leu Lys Ile Asn Val Glu Asp Pro ThrThr Ala Met Lys Gln Gln Ile Leu Lys Ile Asn Val Glu Asp Pro Thr
180 185 190180 185 190
Ala Val Phe Gly Leu Tyr Val Asp Asp Leu Arg Cys Gly Ile Gly TyrAla Val Phe Gly Leu Tyr Val Asp Asp Leu Arg Cys Gly Ile Gly Tyr
195 200 205195 200 205
Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Thr Arg Val Lys Val SerAsp Trp Phe Val Ala Gln Gly Ile Asp Ser Thr Arg Val Lys Val Ser
210 215 220210 215 220
Met Leu Ser Asp Gly Thr Gly Thr Tyr Asn Asn Phe Tyr Asn Tyr PheMet Leu Ser Asp Gly Thr Gly Thr Tyr Asn Asn Phe Tyr Asn Tyr Phe
225 230 235 240225 230 235 240
Gly Asp Pro Ala Thr Ala Glu Gln Asn Trp Glu Asn Tyr Ala Ala GlnGly Asp Pro Ala Thr Ala Glu Gln Asn Trp Glu Asn Tyr Ala Ala Gln
245 250 255245 250 255
Val Glu Ala Leu Asp Trp Gln His Gly Gly Arg Phe Pro Glu Thr ArgVal Glu Ala Leu Asp Trp Gln His Gly Gly Arg Phe Pro Glu Thr Arg
260 265 270260 265 270
Met Pro Asp Gly Phe Asp Phe Tyr Glu Trp Pro Tyr Tyr Leu Ala ThrMet Pro Asp Gly Phe Asp Phe Tyr Glu Trp Pro Tyr Tyr Leu Ala Thr
275 280 285275 280 285
Arg Pro Asn Tyr Arg Leu Val Leu Gln Asp Asp Asp Leu Leu Glu AlaArg Pro Asn Tyr Arg Leu Val Leu Gln Asp Asp Asp Leu Leu Glu Ala
290 295 300290 295 300
Thr Ser Pro Phe Met Thr Glu Arg Leu Gln Gln Met Arg Thr Glu SerThr Ser Pro Phe Met Thr Glu Arg Leu Gln Gln Met Arg Thr Glu Ser
305 310 315 320305 310 315 320
Lys Gln Pro Tyr Glu Leu Leu Ala Ser Leu Pro Ala Glu Ala Arg GlnLys Gln Pro Tyr Glu Leu Leu Ala Ser Leu Pro Ala Glu Ala Arg Gln
325 330 335325 330 335
Arg Phe Phe Arg Met Ala Gly Phe Asp Tyr Asp Ala Phe Ala Ala LeuArg Phe Phe Arg Met Ala Gly Phe Asp Tyr Asp Ala Phe Ala Ala Leu
340 345 350340 345 350
Phe Asp Ala Ser Pro Lys Lys Asn Leu Val Ile Ile Gly Thr Ser HisPhe Asp Ala Ser Pro Lys Lys Asn Leu Val Ile Ile Gly Thr Ser His
355 360 365355 360 365
Thr Ser Glu Glu Ser Glu Ala Gln Gln Ala Ala Tyr Val Glu Arg IleThr Ser Glu Glu Ser Glu Ala Gln Gln Ala Ala Tyr Val Glu Arg Ile
370 375 380370 375 380
Ile Gly Asp Tyr Gly Thr Ala Tyr Asp Ile Phe Phe Lys Pro His ProIle Gly Asp Tyr Gly Thr Ala Tyr Asp Ile Phe Phe Lys Pro His Pro
385 390 395 400385 390 395 400
Ala Asp Ser Ser Ser Ser Asn Tyr Glu Glu Arg Phe Glu Gly Leu ThrAla Asp Ser Ser Ser Ser Asn Tyr Glu Glu Arg Phe Glu Gly Leu Thr
405 410 415405 410 415
Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe Val Trp Ser Leu LeuLeu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe Val Trp Ser Leu Leu
420 425 430420 425 430
Asp Lys Val Asp Leu Ile Gly Gly Tyr Ser Ser Thr Val Phe Leu ThrAsp Lys Val Asp Leu Ile Gly Gly Tyr Ser Ser Thr Val Phe Leu Thr
435 440 445435 440 445
Val Pro Val Glu Lys Thr Gly Phe Ile Phe Ala Ala Asn Ala Glu SerVal Pro Val Glu Lys Thr Gly Phe Ile Phe Ala Ala Asn Ala Glu Ser
450 455 460450 455 460
Leu Pro Arg Pro Leu Asn Val Leu Phe Arg Asn Ala Glu His Val ArgLeu Pro Arg Pro Leu Asn Val Leu Phe Arg Asn Ala Glu His Val Arg
465 470 475 480465 470 475 480
Trp Ile GlnTrp Ile Gln
<210> 56<210> 56
<211> 483<211> 483
<212> ПРТ<212> PRT
<213> Alistipes shahii<213> Alistipes shahii
<400> 56<400> 56
Met Asp Asp Gly Thr Pro Ser Val Ser Ile Asn Gly Gly Thr Asp PheMet Asp Asp Gly Thr Pro Ser Val Ser Ile Asn Gly Gly Thr Asp Phe
1 5 10 151 5 10 15
Leu Ser Leu Asp His Leu Ala Arg Ser Gly Lys Ile Thr Val Asn AlaLeu Ser Leu Asp His Leu Ala Arg Ser Gly Lys Ile Thr Val Asn Ala
20 25 3020 25 30
Pro Ala Pro Trp Ser Val Thr Leu Ala Pro Glu Asn Tyr Gly Gln AspPro Ala Pro Trp Ser Val Thr Leu Ala Pro Glu Asn Tyr Gly Gln Asp
35 40 4535 40 45
Glu Lys Pro Asp Trp Leu Thr Leu Ser Ala Glu Glu Gly Pro Ala GlyGlu Lys Pro Asp Trp Leu Thr Leu Ser Ala Glu Glu Gly Pro Ala Gly
50 55 6050 55 60
Tyr Ser Glu Ile Asp Val Thr Phe Ala Glu Asn Pro Gly Pro Ala ArgTyr Ser Glu Ile Asp Val Thr Phe Ala Glu Asn Pro Gly Pro Ala Arg
65 70 75 8065 70 75 80
Ser Ala Ser Leu Leu Phe Ser Cys Asp Gly Lys Thr Leu Ala Phe ThrSer Ala Ser Leu Leu Phe Ser Cys Asp Gly Lys Thr Leu Ala Phe Thr
85 90 9585 90 95
Val Ser Gln Ser Ala Gly Gly Thr Gly Phe Asp Ala Pro Asp Tyr TyrVal Ser Gln Ser Ala Gly Gly Thr Gly Phe Asp Ala Pro Asp Tyr Tyr
100 105 110100 105 110
Phe Tyr Ile Ser Val Gly Thr Met Pro Thr Leu Tyr Ser Gly Leu HisPhe Tyr Ile Ser Val Gly Thr Met Pro Thr Leu Tyr Ser Gly Leu His
115 120 125115 120 125
Leu Leu Ser His Asp Lys Pro Ser Tyr Val Ser Tyr Glu Arg Ala SerLeu Leu Ser His Asp Lys Pro Ser Tyr Val Ser Tyr Glu Arg Ala Ser
130 135 140130 135 140
Thr Phe Asp Ala Ala Glu Phe Pro Asp Arg Ala Phe Val Tyr Pro ValThr Phe Asp Ala Ala Glu Phe Pro Asp Arg Ala Phe Val Tyr Pro Val
145 150 155 160145 150 155 160
Ala Asp Pro Thr Gly His Ala Thr Asn Glu Glu Leu Arg Ala Met SerAla Asp Pro Thr Gly His Ala Thr Asn Glu Glu Leu Arg Ala Met Ser
165 170 175165 170 175
Glu Ala Met Lys Arg Arg Ile Leu Glu Ile Asn Ala Glu Asp Pro ThrGlu Ala Met Lys Arg Arg Ile Leu Glu Ile Asn Ala Glu Asp Pro Thr
180 185 190180 185 190
Ala Val Phe Gly Leu Trp Val Asp Asp Leu Arg Cys Arg Leu Gly TyrAla Val Phe Gly Leu Trp Val Asp Asp Leu Arg Cys Arg Leu Gly Tyr
195 200 205195 200 205
Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Ala Arg Val Lys Val ThrAsp Trp Phe Val Ala Gln Gly Ile Asp Ser Ala Arg Val Lys Val Thr
210 215 220210 215 220
Met Leu Ser Asp Gly Thr Ala Thr Tyr Asn Asn Phe His Asn Tyr PheMet Leu Ser Asp Gly Thr Ala Thr Tyr Asn Asn Phe His Asn Tyr Phe
225 230 235 240225 230 235 240
Gly Asp Ala Ala Thr Ala Glu Gln Asn Trp Asn Asp Tyr Ala Ala GluGly Asp Ala Ala Thr Ala Glu Gln Asn Trp Asn Asp Tyr Ala Ala Glu
245 250 255245 250 255
Val Glu Ala Leu Asp Trp Asn His Gly Gly Arg Tyr Pro Glu Thr ArgVal Glu Ala Leu Asp Trp Asn His Gly Gly Arg Tyr Pro Glu Thr Arg
260 265 270260 265 270
Ala Pro Glu Glu Phe Ala Ser Tyr Thr Trp Pro Tyr Tyr Leu Ser ThrAla Pro Glu Glu Phe Ala Ser Tyr Thr Trp Pro Tyr Tyr Leu Ser Thr
275 280 285275 280 285
Arg Pro Asp Tyr Arg Leu Met Leu Gln Asn Ser Ser Leu Met Glu SerArg Pro Asp Tyr Arg Leu Met Leu Gln Asn Ser Ser Leu Met Glu Ser
290 295 300290 295 300
Ser Cys Pro Phe Ile Ala Asp Arg Leu Ala Ala Met Lys Met Glu SerSer Cys Pro Phe Ile Ala Asp Arg Leu Ala Ala Met Lys Met Glu Ser
305 310 315 320305 310 315 320
Val Gln Pro Tyr Glu Leu Leu Thr Ala Leu Pro Glu Ala Ser Lys GlnVal Gln Pro Tyr Glu Leu Leu Thr Ala Leu Pro Glu Ala Ser Lys Gln
325 330 335325 330 335
Gln Phe Tyr Arg Met Ala Lys Phe Asp Tyr Ala Arg Phe Ala Gly LeuGln Phe Tyr Arg Met Ala Lys Phe Asp Tyr Ala Arg Phe Ala Gly Leu
340 345 350340 345 350
Phe Asp Leu Ser Pro Lys Lys Asn Leu Ile Ile Ile Gly Thr Ser HisPhe Asp Leu Ser Pro Lys Lys Asn Leu Ile Ile Ile Gly Thr Ser His
355 360 365355 360 365
Ser Ser Ala Ala Ser Glu Gln Gln Gln Ala Ala Tyr Val Glu Arg IleSer Ser Ala Ala Ser Glu Gln Gln Gln Ala Ala Tyr Val Glu Arg Ile
370 375 380370 375 380
Ile Gln Gln Tyr Gly Ser Asp Tyr Asp Ile Phe Phe Lys Pro His ProIle Gln Gln Tyr Gly Ser Asp Tyr Asp Ile Phe Phe Lys Pro His Pro
385 390 395 400385 390 395 400
Ala Asp Ser Ser Ser Ala Gly Tyr Pro Asp Arg Phe Glu Gly Leu ThrAla Asp Ser Ser Ser Ala Gly Tyr Pro Asp Arg Phe Glu Gly Leu Thr
405 410 415405 410 415
Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe Val Trp Ala Leu LeuLeu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe Val Trp Ala Leu Leu
420 425 430420 425 430
Asp Lys Ile Asp Met Ile Gly Gly Tyr Pro Ser Thr Thr Phe Ile SerAsp Lys Ile Asp Met Ile Gly Gly Tyr Pro Ser Thr Thr Phe Ile Ser
435 440 445435 440 445
Val Pro Leu Asp Lys Val Gly Phe Leu Phe Ala Ala Asp Ala Asp GlyVal Pro Leu Asp Lys Val Gly Phe Leu Phe Ala Ala Asp Ala Asp Gly
450 455 460450 455 460
Leu Val Arg Pro Leu Asn Ile Leu Phe Arg Asp Ala Ala Asn Val GluLeu Val Arg Pro Leu Asn Ile Leu Phe Arg Asp Ala Ala Asn Val Glu
465 470 475 480465 470 475 480
Trp Ile GlnTrp Ile Gln
<210> 57<210> 57
<211> 401<211> 401
<212> ПРТ<212> PRT
<213> Actinobacillus suis<213> Actinobacillus suis
<400> 57<400> 57
Met Glu Arg Thr Pro Gln Leu Gln Ala Val Asp Ile Tyr Ile Asp PheMet Glu Arg Thr Pro Gln Leu Gln Ala Val Asp Ile Tyr Ile Asp Phe
1 5 10 151 5 10 15
Ala Thr Ile Pro Ser Leu Ser Tyr Phe Leu His Phe Leu Lys His LysAla Thr Ile Pro Ser Leu Ser Tyr Phe Leu His Phe Leu Lys His Lys
20 25 3020 25 30
His Asp Asp Gln Arg Leu Arg Leu Phe Ser Leu Ala Arg Phe Glu MetHis Asp Asp Gln Arg Leu Arg Leu Phe Ser Leu Ala Arg Phe Glu Met
35 40 4535 40 45
Pro Gln Thr Leu Ile Glu Gln Tyr Glu Gly Ile Ile Gln Phe Ser ArgPro Gln Thr Leu Ile Glu Gln Tyr Glu Gly Ile Ile Gln Phe Ser Arg
50 55 6050 55 60
Asn Val Glu His Asn Val Glu Pro Leu Leu Glu Gln Leu Gln Thr IleAsn Val Glu His Asn Val Glu Pro Leu Leu Glu Gln Leu Gln Thr Ile
65 70 75 8065 70 75 80
Leu Ser Gln Glu Gly Lys Gln Phe Glu Leu His Leu His Leu Asn LeuLeu Ser Gln Glu Gly Lys Gln Phe Glu Leu His Leu His Leu Asn Leu
85 90 9585 90 95
Phe His Ser Phe Glu Met Phe Leu Asn Leu Ser Pro Thr Tyr Thr GlnPhe His Ser Phe Glu Met Phe Leu Asn Leu Ser Pro Thr Tyr Thr Gln
100 105 110100 105 110
Tyr Lys Glu Lys Ile Ser Lys Ile Val Leu His Leu Tyr Asp Asp GlyTyr Lys Glu Lys Ile Ser Lys Ile Val Leu His Leu Tyr Asp Asp Gly
115 120 125115 120 125
Ser Glu Gly Val Met Lys Gln Tyr Gln Leu Gln Lys Ser Ser Ser LeuSer Glu Gly Val Met Lys Gln Tyr Gln Leu Gln Lys Ser Ser Ser Leu
130 135 140130 135 140
Val Gln Asp Leu Ala Ala Thr Lys Ala Ser Leu Val Ser Leu Phe GluVal Gln Asp Leu Ala Ala Thr Lys Ala Ser Leu Val Ser Leu Phe Glu
145 150 155 160145 150 155 160
Asn Gly Glu Gly Ser Phe Ser Gln Ile Asp Leu Ile Arg Tyr Val TrpAsn Gly Glu Gly Ser Phe Ser Gln Ile Asp Leu Ile Arg Tyr Val Trp
165 170 175165 170 175
Asn Ala Val Leu Glu Thr His Tyr Tyr Leu Leu Ser Asp His Phe LeuAsn Ala Val Leu Glu Thr His Tyr Tyr Leu Leu Ser Asp His Phe Leu
180 185 190180 185 190
Leu Asp Glu Lys Leu Gln Pro Leu Lys Ala Glu Leu Gly His Tyr GlnLeu Asp Glu Lys Leu Gln Pro Leu Lys Ala Glu Leu Gly His Tyr Gln
195 200 205195 200 205
Leu Leu Asn Leu Ser Ala Tyr Gln Tyr Leu Ser Ser Glu Asp Leu LeuLeu Leu Asn Leu Ser Ala Tyr Gln Tyr Leu Ser Ser Glu Asp Leu Leu
210 215 220210 215 220
Trp Leu Lys Gln Ile Leu Lys Ile Asp Thr Glu Leu Glu Ser Leu MetTrp Leu Lys Gln Ile Leu Lys Ile Asp Thr Glu Leu Glu Ser Leu Met
225 230 235 240225 230 235 240
Gln Lys Leu Thr Ala Gln Pro Val Tyr Phe Phe Ser Gly Thr Thr PheGln Lys Leu Thr Ala Gln Pro Val Tyr Phe Phe Ser Gly Thr Thr Phe
245 250 255245 250 255
Phe Asn Ile Ser Phe Glu Asp Lys Gln Arg Leu Ala Asn Ile His AlaPhe Asn Ile Ser Phe Glu Asp Lys Gln Arg Leu Ala Asn Ile His Ala
260 265 270260 265 270
Ile Leu Ile Arg Glu His Leu Asp Pro Asn Ser Gln Leu Phe Ile GlyIle Leu Ile Arg Glu His Leu Asp Pro Asn Ser Gln Leu Phe Ile Gly
275 280 285275 280 285
Glu Pro Tyr Leu Phe Val Phe Lys Gly His Pro Asn Ser Pro Glu IleGlu Pro Tyr Leu Phe Val Phe Lys Gly His Pro Asn Ser Pro Glu Ile
290 295 300290 295 300
Asn Gln Ala Leu Arg Glu Tyr Tyr Pro Asn Val Ile Phe Leu Pro GluAsn Gln Ala Leu Arg Glu Tyr Tyr Pro Asn Val Ile Phe Leu Pro Glu
305 310 315 320305 310 315 320
Asn Ile Pro Phe Glu Ile Leu Thr Leu Leu Gly Phe Ser Pro Gln LysAsn Ile Pro Phe Glu Ile Leu Thr Leu Leu Gly Phe Ser Pro Gln Lys
325 330 335325 330 335
Ile Gly Gly Phe Ala Ser Thr Ile His Val Asn Ser Glu Gln Ser LysIle Gly Gly Phe Ala Ser Thr Ile His Val Asn Ser Glu Gln Ser Lys
340 345 350340 345 350
Leu Ala Lys Leu Phe Phe Leu Thr Ser Thr Asp Glu Gln Glu Arg GlnLeu Ala Lys Leu Phe Phe Leu Thr Ser Thr Asp Glu Gln Glu Arg Gln
355 360 365355 360 365
Leu Ser Asp Gly Tyr Ile Lys Gln Tyr Ala Leu Ala Gln Ala Met LeuLeu Ser Asp Gly Tyr Ile Lys Gln Tyr Ala Leu Ala Gln Ala Met Leu
370 375 380370 375 380
Glu Met Gln Leu Val Ser Gln Glu Gln Val Tyr Tyr Cys Ser Leu SerGlu Met Gln Leu Val Ser Gln Glu Gln Val Tyr Tyr Cys Ser Leu Ser
385 390 395 400385 390 395 400
SerSer
<210> 58<210> 58
<211> 401<211> 401
<212> ПРТ<212> PRT
<213> Actinobacillus capsulatus<213> Actinobacillus capsulatus
<400> 58<400> 58
Met Glu Arg Ile Pro Gln Leu Gln Ala Val Asp Ile Tyr Ile Asp PheMet Glu Arg Ile Pro Gln Leu Gln Ala Val Asp Ile Tyr Ile Asp Phe
1 5 10 151 5 10 15
Ala Thr Ile Pro Ser Leu Ser Tyr Phe Leu His Phe Leu Lys His LysAla Thr Ile Pro Ser Leu Ser Tyr Phe Leu His Phe Leu Lys His Lys
20 25 3020 25 30
His Asp His Gln Arg Leu Arg Leu Phe Ser Leu Ala Arg Phe Glu MetHis Asp His Gln Arg Leu Arg Leu Phe Ser Leu Ala Arg Phe Glu Met
35 40 4535 40 45
Pro Gln Thr Val Ile Glu Gln Tyr Glu Gly Ile Ile Gln Phe Ser ArgPro Gln Thr Val Ile Glu Gln Tyr Glu Gly Ile Ile Gln Phe Ser Arg
50 55 6050 55 60
Asn Val Glu His Asn Val Glu Gln Leu Leu Glu Gln Leu Gln Thr IleAsn Val Glu His Asn Val Glu Gln Leu Leu Glu Gln Leu Gln Thr Ile
65 70 75 8065 70 75 80
Leu Ser Gln Glu Gly Lys Gln Phe Glu Leu His Leu His Leu Asn LeuLeu Ser Gln Glu Gly Lys Gln Phe Glu Leu His Leu His Leu Asn Leu
85 90 9585 90 95
Phe His Ser Phe Glu Met Phe Leu Asn Leu Ser Pro Thr Tyr Thr LysPhe His Ser Phe Glu Met Phe Leu Asn Leu Ser Pro Thr Tyr Thr Lys
100 105 110100 105 110
Tyr Lys Glu Lys Ile Ser Lys Ile Val Leu His Leu Tyr Asp Asp GlyTyr Lys Glu Lys Ile Ser Lys Ile Val Leu His Leu Tyr Asp Asp Gly
115 120 125115 120 125
Ser Glu Gly Val Met Lys Gln Tyr Gln Leu Gln Gln Ser Asn Ser LeuSer Glu Gly Val Met Lys Gln Tyr Gln Leu Gln Gln Ser Asn Ser Leu
130 135 140130 135 140
Ala Gln Asp Leu Ala Ser Thr Lys Ala Ser Leu Val Ser Leu Phe LysAla Gln Asp Leu Ala Ser Thr Lys Ala Ser Leu Val Ser Leu Phe Lys
145 150 155 160145 150 155 160
Asn Gly Glu Gly Ala Phe Ser Gln Ile Asp Leu Ile Arg Tyr Val TrpAsn Gly Glu Gly Ala Phe Ser Gln Ile Asp Leu Ile Arg Tyr Val Trp
165 170 175165 170 175
Asn Ala Val Leu Glu Thr His Tyr Tyr Leu Leu Ser Asp His Phe LeuAsn Ala Val Leu Glu Thr His Tyr Tyr Leu Leu Ser Asp His Phe Leu
180 185 190180 185 190
Ala His Glu Lys Leu Gln Pro Leu Lys Ile Glu Leu Gly His Tyr GlnAla His Glu Lys Leu Gln Pro Leu Lys Ile Glu Leu Gly His Tyr Gln
195 200 205195 200 205
Leu Leu Asn Leu Ser Ala Tyr Gln Tyr Leu Ser Ser Glu Asp Leu LeuLeu Leu Asn Leu Ser Ala Tyr Gln Tyr Leu Ser Ser Glu Asp Leu Leu
210 215 220210 215 220
Trp Leu Lys Gln Ile Leu Lys Ile Asp Ala Glu Leu Glu Ser Leu MetTrp Leu Lys Gln Ile Leu Lys Ile Asp Ala Glu Leu Glu Ser Leu Met
225 230 235 240225 230 235 240
His Lys Leu Thr Thr Gln Pro Val Tyr Phe Phe Ser Gly Thr Thr PheHis Lys Leu Thr Thr Gln Pro Val Tyr Phe Phe Ser Gly Thr Thr Phe
245 250 255245 250 255
Phe Asn Ile Ser Phe Glu Asp Lys Gln Arg Leu Ala Asn Ile His AlaPhe Asn Ile Ser Phe Glu Asp Lys Gln Arg Leu Ala Asn Ile His Ala
260 265 270260 265 270
Ile Leu Ile Arg Glu His Leu Asp Pro Asn Ser Gln Leu Phe Ile GlyIle Leu Ile Arg Glu His Leu Asp Pro Asn Ser Gln Leu Phe Ile Gly
275 280 285275 280 285
Glu Pro Tyr Leu Phe Val Phe Lys Gly His Pro Asn Ser Pro Glu IleGlu Pro Tyr Leu Phe Val Phe Lys Gly His Pro Asn Ser Pro Glu Ile
290 295 300290 295 300
Asn Gln Ala Leu Arg Glu Tyr Tyr Pro Asn Ala Ile Phe Leu Pro GluAsn Gln Ala Leu Arg Glu Tyr Tyr Pro Asn Ala Ile Phe Leu Pro Glu
305 310 315 320305 310 315 320
Asn Ile Pro Phe Glu Ile Leu Thr Leu Leu Gly Phe Ser Pro Gln LysAsn Ile Pro Phe Glu Ile Leu Thr Leu Leu Gly Phe Ser Pro Gln Lys
325 330 335325 330 335
Ile Gly Gly Phe Ala Ser Thr Ile His Val Asn Ser Glu Gln Ser LysIle Gly Gly Phe Ala Ser Thr Ile His Val Asn Ser Glu Gln Ser Lys
340 345 350340 345 350
Leu Ala Lys Leu Phe Phe Leu Thr Ser Thr Asp Glu Gln Glu Arg AsnLeu Ala Lys Leu Phe Phe Leu Thr Ser Thr Asp Glu Gln Glu Arg Asn
355 360 365355 360 365
Arg Ser Asp Gly Tyr Ile Lys Gln Tyr Ala Leu Ala Gln Ala Met LeuArg Ser Asp Gly Tyr Ile Lys Gln Tyr Ala Leu Ala Gln Ala Met Leu
370 375 380370 375 380
Glu Met Gln Leu Val Ser Gln Glu Gln Val Tyr Tyr Cys Ser Leu SerGlu Met Gln Leu Val Ser Gln Glu Gln Val Tyr Tyr Cys Ser Leu Ser
385 390 395 400385 390 395 400
SerSer
<210> 59<210> 59
<211> 311<211> 311
<212> ПРТ<212> PRT
<213> Haemophilus somnus<213> Haemophilus somnus
<400> 59<400> 59
Met Phe Arg Glu Asp Asn Met Asn Leu Ile Ile Cys Cys Thr Pro LeuMet Phe Arg Glu Asp Asn Met Asn Leu Ile Ile Cys Cys Thr Pro Leu
1 5 10 151 5 10 15
Gln Val Ile Ile Ala Glu Lys Ile Ile Glu Arg Tyr Pro Glu Gln LysGln Val Ile Ile Ala Glu Lys Ile Ile Glu Arg Tyr Pro Glu Gln Lys
20 25 3020 25 30
Phe Tyr Gly Val Met Leu Glu Ser Phe Tyr Asn Asp Lys Phe Asp PhePhe Tyr Gly Val Met Leu Glu Ser Phe Tyr Asn Asp Lys Phe Asp Phe
35 40 4535 40 45
Tyr Glu Asn Lys Leu Lys His Leu Cys His Glu Phe Phe Cys Ile LysTyr Glu Asn Lys Leu Lys His Leu Cys His Glu Phe Phe Cys Ile Lys
50 55 6050 55 60
Ile Ala Arg Phe Lys Leu Glu Arg Tyr Lys Asn Leu Leu Ser Leu LeuIle Ala Arg Phe Lys Leu Glu Arg Tyr Lys Asn Leu Leu Ser Leu Leu
65 70 75 8065 70 75 80
Lys Ile Lys Asn Lys Thr Phe Asp Arg Val Phe Leu Ala Asn Ile GluLys Ile Lys Asn Lys Thr Phe Asp Arg Val Phe Leu Ala Asn Ile Glu
85 90 9585 90 95
Lys Arg Tyr Ile His Ile Ile Leu Ser Asn Ile Phe Phe Lys Glu LeuLys Arg Tyr Ile His Ile Ile Leu Ser Asn Ile Phe Phe Lys Glu Leu
100 105 110100 105 110
Tyr Thr Phe Asp Asp Gly Thr Ala Asn Ile Ala Pro Asn Ser His LeuTyr Thr Phe Asp Asp Gly Thr Ala Asn Ile Ala Pro Asn Ser His Leu
115 120 125115 120 125
Tyr Gln Glu Tyr Asp His Ser Leu Lys Lys Arg Ile Thr Asp Ile LeuTyr Gln Glu Tyr Asp His Ser Leu Lys Lys Arg Ile Thr Asp Ile Leu
130 135 140130 135 140
Leu Pro Asn His Tyr Asn Ser Asn Lys Val Lys Asn Ile Ser Lys LeuLeu Pro Asn His Tyr Asn Ser Asn Lys Val Lys Asn Ile Ser Lys Leu
145 150 155 160145 150 155 160
His Tyr Ser Ile Tyr Arg Cys Lys Asn Asn Ile Ile Asp Asn Ile GluHis Tyr Ser Ile Tyr Arg Cys Lys Asn Asn Ile Ile Asp Asn Ile Glu
165 170 175165 170 175
Tyr Met Pro Leu Phe Asn Leu Glu Lys Lys Tyr Thr Ala Gln Asp LysTyr Met Pro Leu Phe Asn Leu Glu Lys Lys Tyr Thr Ala Gln Asp Lys
180 185 190180 185 190
Ser Ile Ser Ile Leu Leu Gly Gln Pro Ile Phe Tyr Asp Glu Glu LysSer Ile Ser Ile Leu Leu Gly Gln Pro Ile Phe Tyr Asp Glu Glu Lys
195 200 205195 200 205
Asn Ile Arg Leu Ile Lys Glu Val Ile Ala Lys Phe Lys Ile Asp TyrAsn Ile Arg Leu Ile Lys Glu Val Ile Ala Lys Phe Lys Ile Asp Tyr
210 215 220210 215 220
Tyr Phe Pro His Pro Arg Glu Asp Tyr Tyr Ile Asp Asn Val Ser TyrTyr Phe Pro His Pro Arg Glu Asp Tyr Tyr Ile Asp Asn Val Ser Tyr
225 230 235 240225 230 235 240
Ile Lys Thr Pro Leu Ile Phe Glu Glu Phe Tyr Ala Glu Arg Ser IleIle Lys Thr Pro Leu Ile Phe Glu Glu Phe Tyr Ala Glu Arg Ser Ile
245 250 255245 250 255
Glu Asn Ser Ile Lys Ile Tyr Thr Phe Phe Ser Ser Ala Val Leu AsnGlu Asn Ser Ile Lys Ile Tyr Thr Phe Phe Ser Ser Ala Val Leu Asn
260 265 270260 265 270
Ile Val Thr Lys Glu Asn Ile Asp Arg Ile Tyr Ala Leu Lys Pro LysIle Val Thr Lys Glu Asn Ile Asp Arg Ile Tyr Ala Leu Lys Pro Lys
275 280 285275 280 285
Leu Thr Glu Lys Ala Tyr Leu Asp Cys Tyr Asp Ile Leu Lys Asp PheLeu Thr Glu Lys Ala Tyr Leu Asp Cys Tyr Asp Ile Leu Lys Asp Phe
290 295 300290 295 300
Gly Ile Lys Val Ile Asp IleGly Ile Lys Val Ile Asp Ile
305 310305 310
<210> 60<210> 60
<211> 399<211> 399
<212> ПРТ<212> PRT
<213> Haemophilus ducreyi<213> Haemophilus ducreyi
<400> 60<400> 60
Met Leu Ile Gln Gln Asn Leu Glu Ile Tyr Leu Asp Tyr Ala Thr IleMet Leu Ile Gln Gln Asn Leu Glu Ile Tyr Leu Asp Tyr Ala Thr Ile
1 5 10 151 5 10 15
Pro Ser Leu Ala Cys Phe Met His Phe Ile Gln His Lys Asp Asp ValPro Ser Leu Ala Cys Phe Met His Phe Ile Gln His Lys Asp Asp Val
20 25 3020 25 30
Asp Ser Ile Arg Leu Phe Gly Leu Ala Arg Phe Asp Ile Pro Gln SerAsp Ser Ile Arg Leu Phe Gly Leu Ala Arg Phe Asp Ile Pro Gln Ser
35 40 4535 40 45
Ile Ile Asp Arg Tyr Pro Ala Asn His Leu Phe Tyr His Asn Ile AspIle Ile Asp Arg Tyr Pro Ala Asn His Leu Phe Tyr His Asn Ile Asp
50 55 6050 55 60
Asn Arg Asp Leu Thr Ala Val Leu Asn Gln Leu Ala Asp Ile Leu AlaAsn Arg Asp Leu Thr Ala Val Leu Asn Gln Leu Ala Asp Ile Leu Ala
65 70 75 8065 70 75 80
Gln Glu Asn Lys Arg Phe Gln Ile Asn Leu His Leu Asn Leu Phe HisGln Glu Asn Lys Arg Phe Gln Ile Asn Leu His Leu Asn Leu Phe His
85 90 9585 90 95
Ser Ile Asp Leu Phe Phe Ala Ile Tyr Pro Ile Tyr Gln Gln Tyr GlnSer Ile Asp Leu Phe Phe Ala Ile Tyr Pro Ile Tyr Gln Gln Tyr Gln
100 105 110100 105 110
His Lys Ile Ser Thr Ile Gln Leu Gln Leu Tyr Asp Asp Gly Ser GluHis Lys Ile Ser Thr Ile Gln Leu Gln Leu Tyr Asp Asp Gly Ser Glu
115 120 125115 120 125
Gly Ile Val Thr Gln His Ser Leu Cys Lys Ile Ala Asp Leu Glu GlnGly Ile Val Thr Gln His Ser Leu Cys Lys Ile Ala Asp Leu Glu Gln
130 135 140130 135 140
Leu Ile Leu Gln His Lys Asn Val Leu Leu Glu Leu Leu Thr Lys GlyLeu Ile Leu Gln His Lys Asn Val Leu Leu Glu Leu Leu Thr Lys Gly
145 150 155 160145 150 155 160
Thr Ala Asn Val Pro Asn Pro Thr Leu Leu Arg Tyr Leu Trp Asn AsnThr Ala Asn Val Pro Asn Pro Thr Leu Leu Arg Tyr Leu Trp Asn Asn
165 170 175165 170 175
Ile Ile Asp Ser Gln Phe His Leu Ile Ser Asp His Phe Leu Gln HisIle Ile Asp Ser Gln Phe His Leu Ile Ser Asp His Phe Leu Gln His
180 185 190180 185 190
Pro Lys Leu Gln Pro Leu Lys Arg Leu Leu Lys Arg Tyr Thr Ile LeuPro Lys Leu Gln Pro Leu Lys Arg Leu Leu Lys Arg Tyr Thr Ile Leu
195 200 205195 200 205
Asp Phe Thr Cys Tyr Pro Arg Phe Asn Ala Glu Gln Lys Gln Leu LeuAsp Phe Thr Cys Tyr Pro Arg Phe Asn Ala Glu Gln Lys Gln Leu Leu
210 215 220210 215 220
Lys Glu Ile Leu His Ile Ser Asn Glu Leu Glu Asn Leu Leu Lys LeuLys Glu Ile Leu His Ile Ser Asn Glu Leu Glu Asn Leu Leu Lys Leu
225 230 235 240225 230 235 240
Leu Lys Gln His Asn Thr Phe Leu Phe Thr Gly Thr Thr Ala Phe AsnLeu Lys Gln His Asn Thr Phe Leu Phe Thr Gly Thr Thr Ala Phe Asn
245 250 255245 250 255
Leu Asp Gln Glu Lys Leu Asp Leu Leu Thr Gln Leu His Ile Leu LeuLeu Asp Gln Glu Lys Leu Asp Leu Leu Thr Gln Leu His Ile Leu Leu
260 265 270260 265 270
Leu Asn Glu His Gln Asn Pro His Ser Thr His Tyr Ile Gly Asn AsnLeu Asn Glu His Gln Asn Pro His Ser Thr His Tyr Ile Gly Asn Asn
275 280 285275 280 285
Tyr Leu Leu Leu Ile Lys Gly His Ala Asn Ser Pro Ala Leu Asn HisTyr Leu Leu Leu Ile Lys Gly His Ala Asn Ser Pro Ala Leu Asn His
290 295 300290 295 300
Thr Leu Ala Leu His Phe Pro Asp Ala Ile Phe Leu Pro Ala Asn IleThr Leu Ala Leu His Phe Pro Asp Ala Ile Phe Leu Pro Ala Asn Ile
305 310 315 320305 310 315 320
Pro Phe Glu Ile Phe Ala Met Leu Gly Phe Thr Pro Asn Lys Met GlyPro Phe Glu Ile Phe Ala Met Leu Gly Phe Thr Pro Asn Lys Met Gly
325 330 335325 330 335
Gly Phe Ala Ser Thr Ser Tyr Ile Asn Tyr Pro Thr Glu Asn Ile AsnGly Phe Ala Ser Thr Ser Tyr Ile Asn Tyr Pro Thr Glu Asn Ile Asn
340 345 350340 345 350
His Leu Phe Phe Leu Thr Ser Asp Gln Pro Ser Ile Arg Thr Lys TrpHis Leu Phe Phe Leu Thr Ser Asp Gln Pro Ser Ile Arg Thr Lys Trp
355 360 365355 360 365
Leu Asp Tyr Glu Lys Gln Phe Gly Leu Met Tyr Ser Leu Leu Ala MetLeu Asp Tyr Glu Lys Gln Phe Gly Leu Met Tyr Ser Leu Leu Ala Met
370 375 380370 375 380
Gln Lys Ile Asn Glu Asp Gln Ala Phe Met Cys Thr Ile His AsnGln Lys Ile Asn Glu Asp Gln Ala Phe Met Cys Thr Ile His Asn
385 390 395385 390 395
<210> 61<210> 61
<211> 497<211> 497
<212> ПРТ<212> PRT
<213> Photobacterium leiognathi<213> Photobacterium leiognathi
<400> 61<400> 61
Met Cys Asn Asp Asn Gln Asn Thr Val Asp Val Val Val Ser Thr ValMet Cys Asn Asp Asn Gln Asn Thr Val Asp Val Val Val Ser Thr Val
1 5 10 151 5 10 15
Asn Asp Asn Val Ile Glu Asn Asn Thr Tyr Gln Val Lys Pro Ile AspAsn Asp Asn Val Ile Glu Asn Asn Thr Tyr Gln Val Lys Pro Ile Asp
20 25 3020 25 30
Thr Pro Thr Thr Phe Asp Ser Tyr Ser Trp Ile Gln Thr Cys Gly ThrThr Pro Thr Thr Phe Asp Ser Tyr Ser Trp Ile Gln Thr Cys Gly Thr
35 40 4535 40 45
Pro Ile Leu Lys Asp Asp Glu Lys Tyr Ser Leu Ser Phe Asp Phe ValPro Ile Leu Lys Asp Asp Glu Lys Tyr Ser Leu Ser Phe Asp Phe Val
50 55 6050 55 60
Ala Pro Glu Leu Asp Gln Asp Glu Lys Phe Cys Phe Glu Phe Thr GlyAla Pro Glu Leu Asp Gln Asp Glu Lys Phe Cys Phe Glu Phe Thr Gly
65 70 75 8065 70 75 80
Asp Val Asp Gly Lys Arg Tyr Val Thr Gln Thr Asn Leu Thr Val ValAsp Val Asp Gly Lys Arg Tyr Val Thr Gln Thr Asn Leu Thr Val Val
85 90 9585 90 95
Ala Pro Thr Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Ser LeuAla Pro Thr Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Ser Leu
100 105 110100 105 110
Gln Gln Leu Met Lys Ile Ile Gln Gln Lys Asn Glu Tyr Ser Gln AsnGln Gln Leu Met Lys Ile Ile Gln Gln Lys Asn Glu Tyr Ser Gln Asn
115 120 125115 120 125
Glu Arg Phe Ile Ser Trp Gly Arg Ile Gly Leu Thr Glu Asp Asn AlaGlu Arg Phe Ile Ser Trp Gly Arg Ile Gly Leu Thr Glu Asp Asn Ala
130 135 140130 135 140
Glu Lys Leu Asn Ala His Ile Tyr Pro Leu Ala Gly Asn Asn Thr SerGlu Lys Leu Asn Ala His Ile Tyr Pro Leu Ala Gly Asn Asn Thr Ser
145 150 155 160145 150 155 160
Gln Glu Leu Val Asp Ala Val Ile Asp Tyr Ala Asp Ser Lys Asn ArgGln Glu Leu Val Asp Ala Val Ile Asp Tyr Ala Asp Ser Lys Asn Arg
165 170 175165 170 175
Leu Asn Leu Glu Leu Asn Thr Asn Thr Ala His Ser Phe Pro Asn LeuLeu Asn Leu Glu Leu Asn Thr Asn Thr Ala His Ser Phe Pro Asn Leu
180 185 190180 185 190
Ala Pro Ile Leu Arg Ile Ile Ser Ser Lys Ser Asn Ile Leu Ile SerAla Pro Ile Leu Arg Ile Ile Ser Ser Lys Ser Asn Ile Leu Ile Ser
195 200 205195 200 205
Asn Ile Asn Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Asn Leu TyrAsn Ile Asn Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Asn Leu Tyr
210 215 220210 215 220
Asn Trp Lys Asp Thr Glu Asp Lys Ser Val Lys Leu Ser Asp Ser PheAsn Trp Lys Asp Thr Glu Asp Lys Ser Val Lys Leu Ser Asp Ser Phe
225 230 235 240225 230 235 240
Leu Val Leu Lys Asp Tyr Phe Asn Gly Ile Ser Ser Glu Lys Pro SerLeu Val Leu Lys Asp Tyr Phe Asn Gly Ile Ser Ser Glu Lys Pro Ser
245 250 255245 250 255
Gly Ile Tyr Gly Arg Tyr Asn Trp His Gln Leu Tyr Asn Thr Ser TyrGly Ile Tyr Gly Arg Tyr Asn Trp His Gln Leu Tyr Asn Thr Ser Tyr
260 265 270260 265 270
Tyr Phe Leu Arg Lys Asp Tyr Leu Thr Val Glu Pro Gln Leu His AspTyr Phe Leu Arg Lys Asp Tyr Leu Thr Val Glu Pro Gln Leu His Asp
275 280 285275 280 285
Leu Arg Glu Tyr Leu Gly Gly Ser Leu Lys Gln Met Ser Trp Asp GlyLeu Arg Glu Tyr Leu Gly Gly Ser Leu Lys Gln Met Ser Trp Asp Gly
290 295 300290 295 300
Phe Ser Gln Leu Ser Lys Gly Asp Lys Glu Leu Phe Leu Asn Ile ValPhe Ser Gln Leu Ser Lys Gly Asp Lys Glu Leu Phe Leu Asn Ile Val
305 310 315 320305 310 315 320
Gly Phe Asp Gln Glu Lys Leu Gln Gln Glu Tyr Gln Gln Ser Glu LeuGly Phe Asp Gln Glu Lys Leu Gln Gln Glu Tyr Gln Gln Ser Glu Leu
325 330 335325 330 335
Pro Asn Phe Val Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu ThrPro Asn Phe Val Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr
340 345 350340 345 350
Lys Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Val Asn Asn Ala IleLys Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Val Asn Asn Ala Ile
355 360 365355 360 365
Asn Glu Thr Ser Pro Tyr Tyr Leu Gly Arg Glu His Asp Leu Phe PheAsn Glu Thr Ser Pro Tyr Tyr Leu Gly Arg Glu His Asp Leu Phe Phe
370 375 380370 375 380
Lys Gly His Pro Arg Gly Gly Ile Ile Asn Asp Ile Ile Leu Gly SerLys Gly His Pro Arg Gly Gly Ile Ile Asn Asp Ile Ile Leu Gly Ser
385 390 395 400385 390 395 400
Phe Asn Asn Met Ile Asp Ile Pro Ala Lys Val Ser Phe Glu Val LeuPhe Asn Asn Met Ile Asp Ile Pro Ala Lys Val Ser Phe Glu Val Leu
405 410 415405 410 415
Met Met Thr Gly Met Leu Pro Asp Thr Val Gly Gly Ile Ala Ser SerMet Met Thr Gly Met Leu Pro Asp Thr Val Gly Gly Ile Ala Ser Ser
420 425 430420 425 430
Leu Tyr Phe Ser Ile Pro Ala Glu Lys Val Ser Phe Ile Val Phe ThrLeu Tyr Phe Ser Ile Pro Ala Glu Lys Val Ser Phe Ile Val Phe Thr
435 440 445435 440 445
Ser Ser Asp Thr Ile Thr Asp Arg Glu Asp Ala Leu Lys Ser Pro LeuSer Ser Asp Thr Ile Thr Asp Arg Glu Asp Ala Leu Lys Ser Pro Leu
450 455 460450 455 460
Val Gln Val Met Met Thr Leu Gly Ile Val Lys Glu Lys Asp Val LeuVal Gln Val Met Met Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu
465 470 475 480465 470 475 480
Phe Trp Ser Asp Leu Pro Asp Cys Ser Ser Gly Val Cys Ile Ala GlnPhe Trp Ser Asp Leu Pro Asp Cys Ser Ser Gly Val Cys Ile Ala Gln
485 490 495485 490 495
TyrTyr
<210> 62<210> 62
<211> 498<211> 498
<212> ПРТ<212> PRT
<213> Photobacterium sp.<213> Photobacterium sp.
<400> 62<400> 62
Met Ser Glu Glu Asn Thr Gln Ser Ile Ile Lys Asn Asp Ile Asn LysMet Ser Glu Glu Asn Thr Gln Ser Ile Ile Lys Asn Asp Ile Asn Lys
1 5 10 151 5 10 15
Thr Ile Ile Asp Glu Glu Tyr Val Asn Leu Glu Pro Ile Asn Gln SerThr Ile Ile Asp Glu Glu Tyr Val Asn Leu Glu Pro Ile Asn Gln Ser
20 25 3020 25 30
Asn Ile Ser Phe Thr Lys His Ser Trp Val Gln Thr Cys Gly Thr GlnAsn Ile Ser Phe Thr Lys His Ser Trp Val Gln Thr Cys Gly Thr Gln
35 40 4535 40 45
Gln Leu Leu Thr Glu Gln Asn Lys Glu Ser Ile Ser Leu Ser Val ValGln Leu Leu Thr Glu Gln Asn Lys Glu Ser Ile Ser Leu Ser Val Val
50 55 6050 55 60
Ala Pro Arg Leu Asp Asp Asp Glu Lys Tyr Cys Phe Asp Phe Asn GlyAla Pro Arg Leu Asp Asp Asp Glu Lys Tyr Cys Phe Asp Phe Asn Gly
65 70 75 8065 70 75 80
Val Ser Asn Lys Gly Glu Lys Tyr Ile Thr Lys Val Thr Leu Asn ValVal Ser Asn Lys Gly Glu Lys Tyr Ile Thr Lys Val Thr Leu Asn Val
85 90 9585 90 95
Val Ala Pro Ser Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro ThrVal Ala Pro Ser Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Thr
100 105 110100 105 110
Leu Gln Gln Leu Met Asp Ile Ile Lys Ser Glu Glu Glu Asn Pro ThrLeu Gln Gln Leu Met Asp Ile Ile Lys Ser Glu Glu Glu Asn Pro Thr
115 120 125115 120 125
Ala Gln Arg Tyr Ile Ala Trp Gly Arg Ile Val Pro Thr Asp Glu GlnAla Gln Arg Tyr Ile Ala Trp Gly Arg Ile Val Pro Thr Asp Glu Gln
130 135 140130 135 140
Met Lys Glu Leu Asn Ile Thr Ser Phe Ala Leu Ile Asn Asn His ThrMet Lys Glu Leu Asn Ile Thr Ser Phe Ala Leu Ile Asn Asn His Thr
145 150 155 160145 150 155 160
Pro Ala Asp Leu Val Gln Glu Ile Val Lys Gln Ala Gln Thr Lys HisPro Ala Asp Leu Val Gln Glu Ile Val Lys Gln Ala Gln Thr Lys His
165 170 175165 170 175
Arg Leu Asn Val Lys Leu Ser Ser Asn Thr Ala His Ser Phe Asp AsnArg Leu Asn Val Lys Leu Ser Ser Asn Thr Ala His Ser Phe Asp Asn
180 185 190180 185 190
Leu Val Pro Ile Leu Lys Glu Leu Asn Ser Phe Asn Asn Val Thr ValLeu Val Pro Ile Leu Lys Glu Leu Asn Ser Phe Asn Asn Val Thr Val
195 200 205195 200 205
Thr Asn Ile Asp Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Asn LeuThr Asn Ile Asp Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Asn Leu
210 215 220210 215 220
Tyr Asn Trp Arg Asp Thr Leu Asn Lys Thr Asp Asn Leu Lys Ile GlyTyr Asn Trp Arg Asp Thr Leu Asn Lys Thr Asp Asn Leu Lys Ile Gly
225 230 235 240225 230 235 240
Lys Asp Tyr Leu Glu Asp Val Ile Asn Gly Ile Asn Glu Asp Thr SerLys Asp Tyr Leu Glu Asp Val Ile Asn Gly Ile Asn Glu Asp Thr Ser
245 250 255245 250 255
Asn Thr Gly Thr Ser Ser Val Tyr Asn Trp Gln Lys Leu Tyr Pro AlaAsn Thr Gly Thr Ser Ser Val Tyr Asn Trp Gln Lys Leu Tyr Pro Ala
260 265 270260 265 270
Asn Tyr His Phe Leu Arg Lys Asp Tyr Leu Thr Leu Glu Pro Ser LeuAsn Tyr His Phe Leu Arg Lys Asp Tyr Leu Thr Leu Glu Pro Ser Leu
275 280 285275 280 285
His Glu Leu Arg Asp Tyr Ile Gly Asp Ser Leu Lys Gln Met Gln TrpHis Glu Leu Arg Asp Tyr Ile Gly Asp Ser Leu Lys Gln Met Gln Trp
290 295 300290 295 300
Asp Gly Phe Lys Lys Phe Asn Ser Lys Gln Gln Glu Leu Phe Leu SerAsp Gly Phe Lys Lys Phe Asn Ser Lys Gln Gln Glu Leu Phe Leu Ser
305 310 315 320305 310 315 320
Ile Val Asn Phe Asp Lys Gln Lys Leu Gln Asn Glu Tyr Asn Ser SerIle Val Asn Phe Asp Lys Gln Lys Leu Gln Asn Glu Tyr Asn Ser Ser
325 330 335325 330 335
Asn Leu Pro Asn Phe Val Phe Thr Gly Thr Thr Val Trp Ala Gly AsnAsn Leu Pro Asn Phe Val Phe Thr Gly Thr Thr Val Trp Ala Gly Asn
340 345 350340 345 350
His Glu Arg Glu Tyr Tyr Ala Lys Gln Gln Ile Asn Val Ile Asn AsnHis Glu Arg Glu Tyr Tyr Ala Lys Gln Gln Ile Asn Val Ile Asn Asn
355 360 365355 360 365
Ala Ile Asn Glu Ser Ser Pro His Tyr Leu Gly Asn Ser Tyr Asp LeuAla Ile Asn Glu Ser Ser Pro His Tyr Leu Gly Asn Ser Tyr Asp Leu
370 375 380370 375 380
Phe Phe Lys Gly His Pro Gly Gly Gly Ile Ile Asn Thr Leu Ile MetPhe Phe Lys Gly His Pro Gly Gly Gly Ile Ile Asn Thr Leu Ile Met
385 390 395 400385 390 395 400
Gln Asn Tyr Pro Ser Met Val Asp Ile Pro Ser Lys Ile Ser Phe GluGln Asn Tyr Pro Ser Met Val Asp Ile Pro Ser Lys Ile Ser Phe Glu
405 410 415405 410 415
Val Leu Met Met Thr Asp Met Leu Pro Asp Ala Val Ala Gly Ile AlaVal Leu Met Met Thr Asp Met Leu Pro Asp Ala Val Ala Gly Ile Ala
420 425 430420 425 430
Ser Ser Leu Tyr Phe Thr Ile Pro Ala Glu Lys Ile Lys Phe Ile ValSer Ser Leu Tyr Phe Thr Ile Pro Ala Glu Lys Ile Lys Phe Ile Val
435 440 445435 440 445
Phe Thr Ser Thr Glu Thr Ile Thr Asp Arg Glu Thr Ala Leu Arg SerPhe Thr Ser Thr Glu Thr Ile Thr Asp Arg Glu Thr Ala Leu Arg Ser
450 455 460450 455 460
Pro Leu Val Gln Val Met Ile Lys Leu Gly Ile Val Lys Glu Glu AsnPro Leu Val Gln Val Met Ile Lys Leu Gly Ile Val Lys Glu Glu Asn
465 470 475 480465 470 475 480
Val Leu Phe Trp Ala Asp Leu Pro Asn Cys Glu Thr Gly Val Cys IleVal Leu Phe Trp Ala Asp Leu Pro Asn Cys Glu Thr Gly Val Cys Ile
485 490 495485 490 495
Ala ValAla Val
<210> 63<210> 63
<211> 482<211> 482
<212> ПРТ<212> PRT
<213> Photobacterium leiognathi<213> Photobacterium leiognathi
<400> 63<400> 63
Met Asn Asp Asn Gln Asn Thr Val Asp Val Val Val Ser Thr Val AsnMet Asn Asp Asn Gln Asn Thr Val Asp Val Val Val Ser Thr Val Asn
1 5 10 151 5 10 15
Asp Asn Val Ile Glu Asn Asn Thr Tyr Gln Val Lys Pro Ile Asp ThrAsp Asn Val Ile Glu Asn Asn Thr Tyr Gln Val Lys Pro Ile Asp Thr
20 25 3020 25 30
Pro Thr Thr Phe Asp Ser Tyr Ser Trp Ile Gln Thr Cys Gly Thr ProPro Thr Thr Phe Asp Ser Tyr Ser Trp Ile Gln Thr Cys Gly Thr Pro
35 40 4535 40 45
Ile Leu Lys Asp Asp Glu Lys Tyr Ser Leu Ser Phe Asp Phe Val AlaIle Leu Lys Asp Asp Glu Lys Tyr Ser Leu Ser Phe Asp Phe Val Ala
50 55 6050 55 60
Pro Glu Leu Asp Gln Asp Glu Lys Phe Cys Phe Glu Phe Thr Gly AspPro Glu Leu Asp Gln Asp Glu Lys Phe Cys Phe Glu Phe Thr Gly Asp
65 70 75 8065 70 75 80
Val Asp Gly Lys Arg Tyr Val Thr Gln Thr Asn Leu Thr Val Val AlaVal Asp Gly Lys Arg Tyr Val Thr Gln Thr Asn Leu Thr Val Val Ala
85 90 9585 90 95
Pro Thr Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Ser Leu GlnPro Thr Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Ser Leu Gln
100 105 110100 105 110
Gln Leu Met Lys Ile Ile Gln Gln Lys Asn Glu Tyr Ser Gln Asn GluGln Leu Met Lys Ile Ile Gln Gln Lys Asn Glu Tyr Ser Gln Asn Glu
115 120 125115 120 125
Arg Phe Ile Ser Trp Gly Arg Ile Arg Leu Thr Glu Asp Asn Ala GluArg Phe Ile Ser Trp Gly Arg Ile Arg Leu Thr Glu Asp Asn Ala Glu
130 135 140130 135 140
Lys Leu Asn Ala His Ile Tyr Pro Leu Ala Gly Asn Asn Thr Ser GlnLys Leu Asn Ala His Ile Tyr Pro Leu Ala Gly Asn Asn Thr Ser Gln
145 150 155 160145 150 155 160
Glu Leu Val Asp Ala Val Ile Asp Tyr Ala Asp Ser Lys Asn Arg LeuGlu Leu Val Asp Ala Val Ile Asp Tyr Ala Asp Ser Lys Asn Arg Leu
165 170 175165 170 175
Asn Leu Glu Leu Asn Thr Asn Thr Gly His Ser Phe Arg Asn Ile AlaAsn Leu Glu Leu Asn Thr Asn Thr Gly His Ser Phe Arg Asn Ile Ala
180 185 190180 185 190
Pro Ile Leu Arg Ala Thr Ser Ser Lys Asn Asn Ile Leu Ile Ser AsnPro Ile Leu Arg Ala Thr Ser Ser Lys Asn Asn Ile Leu Ile Ser Asn
195 200 205195 200 205
Ile Asn Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Ser Leu Tyr AsnIle Asn Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Ser Leu Tyr Asn
210 215 220210 215 220
Trp Lys Asp Thr Asp Asn Lys Ser Gln Lys Leu Ser Asp Ser Phe LeuTrp Lys Asp Thr Asp Asn Lys Ser Gln Lys Leu Ser Asp Ser Phe Leu
225 230 235 240225 230 235 240
Val Leu Lys Asp Tyr Leu Asn Gly Ile Ser Ser Glu Lys Pro Asn GlyVal Leu Lys Asp Tyr Leu Asn Gly Ile Ser Ser Glu Lys Pro Asn Gly
245 250 255245 250 255
Ile Tyr Ser Ile Tyr Asn Trp His Gln Leu Tyr His Ser Ser Tyr TyrIle Tyr Ser Ile Tyr Asn Trp His Gln Leu Tyr His Ser Ser Tyr Tyr
260 265 270260 265 270
Phe Leu Arg Lys Asp Tyr Leu Thr Val Glu Thr Lys Leu His Asp LeuPhe Leu Arg Lys Asp Tyr Leu Thr Val Glu Thr Lys Leu His Asp Leu
275 280 285275 280 285
Arg Glu Tyr Leu Gly Gly Ser Leu Lys Gln Met Ser Trp Asp Thr PheArg Glu Tyr Leu Gly Gly Ser Leu Lys Gln Met Ser Trp Asp Thr Phe
290 295 300290 295 300
Ser Gln Leu Ser Lys Gly Asp Lys Glu Leu Phe Leu Asn Ile Val GlySer Gln Leu Ser Lys Gly Asp Lys Glu Leu Phe Leu Asn Ile Val Gly
305 310 315 320305 310 315 320
Phe Asp Gln Glu Lys Leu Gln Gln Glu Tyr Gln Gln Ser Glu Leu ProPhe Asp Gln Glu Lys Leu Gln Gln Glu Tyr Gln Gln Ser Glu Leu Pro
325 330 335325 330 335
Asn Phe Val Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr LysAsn Phe Val Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys
340 345 350340 345 350
Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Val Asn Asn Ala Ile AsnGlu Tyr Tyr Ala Gln Gln Gln Val Asn Val Val Asn Asn Ala Ile Asn
355 360 365355 360 365
Glu Thr Ser Pro Tyr Tyr Leu Gly Arg Glu His Asp Leu Phe Phe LysGlu Thr Ser Pro Tyr Tyr Leu Gly Arg Glu His Asp Leu Phe Phe Lys
370 375 380370 375 380
Gly His Pro Arg Gly Gly Ile Ile Asn Asp Ile Ile Leu Gly Ser PheGly His Pro Arg Gly Gly Ile Ile Asn Asp Ile Ile Leu Gly Ser Phe
385 390 395 400385 390 395 400
Asn Asn Met Ile Asp Ile Pro Ala Lys Val Ser Phe Glu Val Leu MetAsn Asn Met Ile Asp Ile Pro Ala Lys Val Ser Phe Glu Val Leu Met
405 410 415405 410 415
Met Thr Gly Met Leu Pro Asp Thr Val Gly Gly Ile Ala Ser Ser LeuMet Thr Gly Met Leu Pro Asp Thr Val Gly Gly Ile Ala Ser Ser Leu
420 425 430420 425 430
Tyr Phe Ser Ile Pro Ala Glu Lys Val Ser Phe Ile Val Phe Thr SerTyr Phe Ser Ile Pro Ala Glu Lys Val Ser Phe Ile Val Phe Thr Ser
435 440 445435 440 445
Ser Asp Thr Ile Thr Asp Arg Glu Asp Ala Leu Lys Ser Pro Leu ValSer Asp Thr Ile Thr Asp Arg Glu Asp Ala Leu Lys Ser Pro Leu Val
450 455 460450 455 460
Gln Val Met Met Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu PheGln Val Met Met Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe
465 470 475 480465 470 475 480
Trp CysTrp Cys
<210> 64<210> 64
<211> 675<211> 675
<212> ПРТ<212> PRT
<213> Photobacterium damsela<213> Photobacterium damsela
<400> 64<400> 64
Met Lys Lys Ile Leu Thr Val Leu Ser Ile Phe Ile Leu Ser Ala CysMet Lys Lys Ile Leu Thr Val Leu Ser Ile Phe Ile Leu Ser Ala Cys
1 5 10 151 5 10 15
Asn Ser Asp Asn Thr Ser Leu Lys Glu Thr Val Ser Ser Asn Ser AlaAsn Ser Asp Asn Thr Ser Leu Lys Glu Thr Val Ser Ser Asn Ser Ala
20 25 3020 25 30
Asp Val Val Glu Thr Glu Thr Tyr Gln Leu Thr Pro Ile Asp Ala ProAsp Val Val Glu Thr Glu Thr Tyr Gln Leu Thr Pro Ile Asp Ala Pro
35 40 4535 40 45
Ser Ser Phe Leu Ser His Ser Trp Glu Gln Thr Cys Gly Thr Pro IleSer Ser Phe Leu Ser His Ser Trp Glu Gln Thr Cys Gly Thr Pro Ile
50 55 6050 55 60
Leu Asn Glu Ser Asp Lys Gln Ala Ile Ser Phe Asp Phe Val Ala ProLeu Asn Glu Ser Asp Lys Gln Ala Ile Ser Phe Asp Phe Val Ala Pro
65 70 75 8065 70 75 80
Glu Leu Lys Gln Asp Glu Lys Tyr Cys Phe Thr Phe Lys Gly Ile ThrGlu Leu Lys Gln Asp Glu Lys Tyr Cys Phe Thr Phe Lys Gly Ile Thr
85 90 9585 90 95
Gly Asp His Arg Tyr Ile Thr Asn Thr Thr Leu Thr Val Val Ala ProGly Asp His Arg Tyr Ile Thr Asn Thr Thr Leu Thr Val Val Ala Pro
100 105 110100 105 110
Thr Leu Glu Val Tyr Ile Asp His Ala Ser Leu Pro Ser Leu Gln GlnThr Leu Glu Val Tyr Ile Asp His Ala Ser Leu Pro Ser Leu Gln Gln
115 120 125115 120 125
Leu Ile His Ile Ile Gln Ala Lys Asp Glu Tyr Pro Ser Asn Gln ArgLeu Ile His Ile Ile Gln Ala Lys Asp Glu Tyr Pro Ser Asn Gln Arg
130 135 140130 135 140
Phe Val Ser Trp Lys Arg Val Thr Val Asp Ala Asp Asn Ala Asn LysPhe Val Ser Trp Lys Arg Val Thr Val Asp Ala Asp Asn Ala Asn Lys
145 150 155 160145 150 155 160
Leu Asn Ile His Thr Tyr Pro Leu Lys Gly Asn Asn Thr Ser Pro GluLeu Asn Ile His Thr Tyr Pro Leu Lys Gly Asn Asn Thr Ser Pro Glu
165 170 175165 170 175
Met Val Ala Ala Ile Asp Glu Tyr Ala Gln Ser Lys Asn Arg Leu AsnMet Val Ala Ala Ile Asp Glu Tyr Ala Gln Ser Lys Asn Arg Leu Asn
180 185 190180 185 190
Ile Glu Phe Tyr Thr Asn Thr Ala His Val Phe Asn Asn Leu Pro ProIle Glu Phe Tyr Thr Asn Thr Ala His Val Phe Asn Asn Leu Pro Pro
195 200 205195 200 205
Ile Ile Gln Pro Leu Tyr Asn Asn Glu Lys Val Lys Ile Ser His IleIle Ile Gln Pro Leu Tyr Asn Asn Glu Lys Val Lys Ile Ser His Ile
210 215 220210 215 220
Ser Leu Tyr Asp Asp Gly Ser Ser Glu Tyr Val Ser Leu Tyr Gln TrpSer Leu Tyr Asp Asp Gly Ser Ser Glu Tyr Val Ser Leu Tyr Gln Trp
225 230 235 240225 230 235 240
Lys Asp Thr Pro Asn Lys Ile Glu Thr Leu Glu Gly Glu Val Ser LeuLys Asp Thr Pro Asn Lys Ile Glu Thr Leu Glu Gly Glu Val Ser Leu
245 250 255245 250 255
Leu Ala Asn Tyr Leu Ala Gly Thr Ser Pro Asp Ala Pro Lys Gly MetLeu Ala Asn Tyr Leu Ala Gly Thr Ser Pro Asp Ala Pro Lys Gly Met
260 265 270260 265 270
Gly Asn Arg Tyr Asn Trp His Lys Leu Tyr Asp Thr Asp Tyr Tyr PheGly Asn Arg Tyr Asn Trp His Lys Leu Tyr Asp Thr Asp Tyr Tyr Phe
275 280 285275 280 285
Leu Arg Glu Asp Tyr Leu Asp Val Glu Ala Asn Leu His Asp Leu ArgLeu Arg Glu Asp Tyr Leu Asp Val Glu Ala Asn Leu His Asp Leu Arg
290 295 300290 295 300
Asp Tyr Leu Gly Ser Ser Ala Lys Gln Met Pro Trp Asp Glu Phe AlaAsp Tyr Leu Gly Ser Ser Ala Lys Gln Met Pro Trp Asp Glu Phe Ala
305 310 315 320305 310 315 320
Lys Leu Ser Asp Ser Gln Gln Thr Leu Phe Leu Asp Ile Val Gly PheLys Leu Ser Asp Ser Gln Gln Thr Leu Phe Leu Asp Ile Val Gly Phe
325 330 335325 330 335
Asp Lys Glu Gln Leu Gln Gln Gln Tyr Ser Gln Ser Pro Leu Pro AsnAsp Lys Glu Gln Leu Gln Gln Gln Tyr Ser Gln Ser Pro Leu Pro Asn
340 345 350340 345 350
Phe Ile Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys GluPhe Ile Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys Glu
355 360 365355 360 365
Tyr Tyr Ala Gln Gln Gln Val Asn Val Ile Asn Asn Ala Ile Asn GluTyr Tyr Ala Gln Gln Gln Val Asn Val Ile Asn Asn Ala Ile Asn Glu
370 375 380370 375 380
Thr Ser Pro Tyr Tyr Leu Gly Lys Asp Tyr Asp Leu Phe Phe Lys GlyThr Ser Pro Tyr Tyr Leu Gly Lys Asp Tyr Asp Leu Phe Phe Lys Gly
385 390 395 400385 390 395 400
His Pro Ala Gly Gly Val Ile Asn Asp Ile Ile Leu Gly Ser Phe ProHis Pro Ala Gly Gly Val Ile Asn Asp Ile Ile Leu Gly Ser Phe Pro
405 410 415405 410 415
Asp Met Ile Asn Ile Pro Ala Lys Ile Ser Phe Glu Val Leu Met MetAsp Met Ile Asn Ile Pro Ala Lys Ile Ser Phe Glu Val Leu Met Met
420 425 430420 425 430
Thr Asp Met Leu Pro Asp Thr Val Ala Gly Ile Ala Ser Ser Leu TyrThr Asp Met Leu Pro Asp Thr Val Ala Gly Ile Ala Ser Ser Leu Tyr
435 440 445435 440 445
Phe Thr Ile Pro Ala Asp Lys Val Asn Phe Ile Val Phe Thr Ser SerPhe Thr Ile Pro Ala Asp Lys Val Asn Phe Ile Val Phe Thr Ser Ser
450 455 460450 455 460
Asp Thr Ile Thr Asp Arg Glu Glu Ala Leu Lys Ser Pro Leu Val GlnAsp Thr Ile Thr Asp Arg Glu Glu Ala Leu Lys Ser Pro Leu Val Gln
465 470 475 480465 470 475 480
Val Met Leu Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe TrpVal Met Leu Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe Trp
485 490 495485 490 495
Ala Asp His Lys Val Asn Ser Met Glu Val Ala Ile Asp Glu Ala CysAla Asp His Lys Val Asn Ser Met Glu Val Ala Ile Asp Glu Ala Cys
500 505 510500 505 510
Thr Arg Ile Ile Ala Lys Arg Gln Pro Thr Ala Ser Asp Leu Arg LeuThr Arg Ile Ile Ala Lys Arg Gln Pro Thr Ala Ser Asp Leu Arg Leu
515 520 525515 520 525
Val Ile Ala Ile Ile Lys Thr Ile Thr Asp Leu Glu Arg Ile Gly AspVal Ile Ala Ile Ile Lys Thr Ile Thr Asp Leu Glu Arg Ile Gly Asp
530 535 540530 535 540
Val Ala Glu Ser Ile Ala Lys Val Ala Leu Glu Ser Phe Ser Asn LysVal Ala Glu Ser Ile Ala Lys Val Ala Leu Glu Ser Phe Ser Asn Lys
545 550 555 560545 550 555 560
Gln Tyr Asn Leu Leu Val Ser Leu Glu Ser Leu Gly Gln His Thr ValGln Tyr Asn Leu Leu Val Ser Leu Glu Ser Leu Gly Gln His Thr Val
565 570 575565 570 575
Arg Met Leu His Glu Val Leu Asp Ala Phe Ala Arg Met Asp Val LysArg Met Leu His Glu Val Leu Asp Ala Phe Ala Arg Met Asp Val Lys
580 585 590580 585 590
Ala Ala Ile Glu Val Tyr Gln Glu Asp Asp Arg Ile Asp Gln Glu TyrAla Ala Ile Glu Val Tyr Gln Glu Asp Asp Arg Ile Asp Gln Glu Tyr
595 600 605595 600 605
Glu Ser Ile Val Arg Gln Leu Met Ala His Met Met Glu Asp Pro SerGlu Ser Ile Val Arg Gln Leu Met Ala His Met Met Glu Asp Pro Ser
610 615 620610 615 620
Ser Ile Pro Asn Val Met Lys Val Met Trp Ala Ala Arg Ser Ile GluSer Ile Pro Asn Val Met Lys Val Met Trp Ala Ala Arg Ser Ile Glu
625 630 635 640625 630 635 640
Arg Val Gly Asp Arg Cys Gln Asn Ile Cys Glu Tyr Ile Ile Tyr PheArg Val Gly Asp Arg Cys Gln Asn Ile Cys Glu Tyr Ile Ile Tyr Phe
645 650 655645 650 655
Val Lys Gly Lys Asp Val Arg His Thr Lys Pro Asp Asp Phe Gly ThrVal Lys Gly Lys Asp Val Arg His Thr Lys Pro Asp Asp Phe Gly Thr
660 665 670660 665 670
Met Leu AspMet Leu Asp
675675
<210> 65<210> 65
<211> 510<211> 510
<212> ПРТ<212> PRT
<213> Photobacterium damsela<213> Photobacterium damsela
<400> 65<400> 65
Met Lys Lys Ile Leu Thr Val Leu Ser Ile Phe Ile Leu Ser Ala CysMet Lys Lys Ile Leu Thr Val Leu Ser Ile Phe Ile Leu Ser Ala Cys
1 5 10 151 5 10 15
Asn Ser Asp Asn Thr Ser Leu Lys Glu Thr Val Ser Ser Asn Ser AlaAsn Ser Asp Asn Thr Ser Leu Lys Glu Thr Val Ser Ser Asn Ser Ala
20 25 3020 25 30
Asp Val Val Glu Thr Glu Thr Tyr Gln Leu Thr Pro Ile Asp Ala ProAsp Val Val Glu Thr Glu Thr Tyr Gln Leu Thr Pro Ile Asp Ala Pro
35 40 4535 40 45
Ser Ser Phe Leu Ser His Ser Trp Glu Gln Thr Cys Gly Thr Pro IleSer Ser Phe Leu Ser His Ser Trp Glu Gln Thr Cys Gly Thr Pro Ile
50 55 6050 55 60
Leu Asn Glu Ser Asp Lys Gln Ala Ile Ser Phe Asp Phe Val Ala ProLeu Asn Glu Ser Asp Lys Gln Ala Ile Ser Phe Asp Phe Val Ala Pro
65 70 75 8065 70 75 80
Glu Leu Lys Gln Asp Glu Lys Tyr Cys Phe Thr Phe Lys Gly Ile ThrGlu Leu Lys Gln Asp Glu Lys Tyr Cys Phe Thr Phe Lys Gly Ile Thr
85 90 9585 90 95
Gly Asp His Arg Tyr Ile Thr Asn Thr Thr Leu Thr Val Val Ala ProGly Asp His Arg Tyr Ile Thr Asn Thr Thr Leu Thr Val Val Ala Pro
100 105 110100 105 110
Thr Leu Glu Val Tyr Ile Asp His Ala Ser Leu Pro Ser Leu Gln GlnThr Leu Glu Val Tyr Ile Asp His Ala Ser Leu Pro Ser Leu Gln Gln
115 120 125115 120 125
Leu Ile His Ile Ile Gln Ala Lys Asp Glu Tyr Pro Ser Asn Gln ArgLeu Ile His Ile Ile Gln Ala Lys Asp Glu Tyr Pro Ser Asn Gln Arg
130 135 140130 135 140
Phe Val Ser Trp Lys Arg Val Thr Val Asp Ala Asp Asn Ala Asn LysPhe Val Ser Trp Lys Arg Val Thr Val Asp Ala Asp Asn Ala Asn Lys
145 150 155 160145 150 155 160
Leu Asn Ile His Thr Tyr Pro Leu Lys Gly Asn Asn Thr Ser Pro GluLeu Asn Ile His Thr Tyr Pro Leu Lys Gly Asn Asn Thr Ser Pro Glu
165 170 175165 170 175
Met Val Ala Ala Ile Asp Glu Tyr Ala Gln Ser Lys Asn Arg Leu AsnMet Val Ala Ala Ile Asp Glu Tyr Ala Gln Ser Lys Asn Arg Leu Asn
180 185 190180 185 190
Ile Glu Phe Tyr Thr Asn Thr Ala His Val Phe Asn Asn Leu Pro ProIle Glu Phe Tyr Thr Asn Thr Ala His Val Phe Asn Asn Leu Pro Pro
195 200 205195 200 205
Ile Ile Gln Pro Leu Tyr Asn Asn Glu Lys Val Lys Ile Ser His IleIle Ile Gln Pro Leu Tyr Asn Asn Glu Lys Val Lys Ile Ser His Ile
210 215 220210 215 220
Ser Leu Tyr Asp Asp Gly Ser Ser Glu Tyr Val Ser Leu Tyr Gln TrpSer Leu Tyr Asp Asp Gly Ser Ser Glu Tyr Val Ser Leu Tyr Gln Trp
225 230 235 240225 230 235 240
Lys Asp Thr Pro Asn Lys Ile Glu Thr Leu Glu Gly Glu Val Ser LeuLys Asp Thr Pro Asn Lys Ile Glu Thr Leu Glu Gly Glu Val Ser Leu
245 250 255245 250 255
Leu Ala Asn Tyr Leu Ala Gly Thr Ser Pro Asp Ala Pro Lys Gly MetLeu Ala Asn Tyr Leu Ala Gly Thr Ser Pro Asp Ala Pro Lys Gly Met
260 265 270260 265 270
Gly Asn Arg Tyr Asn Trp His Lys Leu Tyr Asp Thr Asp Tyr Tyr PheGly Asn Arg Tyr Asn Trp His Lys Leu Tyr Asp Thr Asp Tyr Tyr Phe
275 280 285275 280 285
Leu Arg Glu Asp Tyr Leu Asp Val Glu Ala Asn Leu His Asp Leu ArgLeu Arg Glu Asp Tyr Leu Asp Val Glu Ala Asn Leu His Asp Leu Arg
290 295 300290 295 300
Asp Tyr Leu Gly Ser Ser Ala Lys Gln Met Pro Trp Asp Glu Phe AlaAsp Tyr Leu Gly Ser Ser Ala Lys Gln Met Pro Trp Asp Glu Phe Ala
305 310 315 320305 310 315 320
Lys Leu Ser Asp Ser Gln Gln Thr Leu Phe Leu Asp Ile Val Gly PheLys Leu Ser Asp Ser Gln Gln Thr Leu Phe Leu Asp Ile Val Gly Phe
325 330 335325 330 335
Asp Lys Glu Gln Leu Gln Gln Gln Tyr Ser Gln Ser Pro Leu Pro AsnAsp Lys Glu Gln Leu Gln Gln Gln Tyr Ser Gln Ser Pro Leu Pro Asn
340 345 350340 345 350
Phe Ile Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys GluPhe Ile Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys Glu
355 360 365355 360 365
Tyr Tyr Ala Gln Gln Gln Val Asn Val Ile Asn Asn Ala Ile Asn GluTyr Tyr Ala Gln Gln Gln Val Asn Val Ile Asn Asn Ala Ile Asn Glu
370 375 380370 375 380
Thr Ser Pro Tyr Tyr Leu Gly Lys Asp Tyr Asp Leu Phe Phe Lys GlyThr Ser Pro Tyr Tyr Leu Gly Lys Asp Tyr Asp Leu Phe Phe Lys Gly
385 390 395 400385 390 395 400
His Pro Ala Gly Gly Val Ile Asn Asp Ile Ile Leu Gly Ser Phe ProHis Pro Ala Gly Gly Val Ile Asn Asp Ile Ile Leu Gly Ser Phe Pro
405 410 415405 410 415
Asp Met Ile Asn Ile Pro Ala Lys Ile Ser Phe Glu Val Leu Met MetAsp Met Ile Asn Ile Pro Ala Lys Ile Ser Phe Glu Val Leu Met Met
420 425 430420 425 430
Thr Asp Met Leu Pro Asp Thr Val Ala Gly Ile Ala Ser Ser Leu TyrThr Asp Met Leu Pro Asp Thr Val Ala Gly Ile Ala Ser Ser Leu Tyr
435 440 445435 440 445
Phe Thr Ile Pro Ala Asp Lys Val Asn Phe Ile Val Phe Thr Ser SerPhe Thr Ile Pro Ala Asp Lys Val Asn Phe Ile Val Phe Thr Ser Ser
450 455 460450 455 460
Asp Thr Ile Thr Asp Arg Glu Glu Ala Leu Lys Ser Pro Leu Val GlnAsp Thr Ile Thr Asp Arg Glu Glu Ala Leu Lys Ser Pro Leu Val Gln
465 470 475 480465 470 475 480
Val Met Leu Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe TrpVal Met Leu Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe Trp
485 490 495485 490 495
Ala Asp Leu Pro Asp Cys Ser Ser Gly Val Cys Ile Asp LysAla Asp Leu Pro Asp Cys Ser Ser Gly Val Cys Ile Asp Lys
500 505 510500 505 510
<210> 66<210> 66
<211> 422<211> 422
<212> ПРТ<212> PRT
<213> Heliobacter acinonychis<213> Heliobacter acinonychis
<400> 66<400> 66
Met Gly Thr Ile Lys Lys Pro Leu Ile Ile Ala Gly Asn Gly Pro SerMet Gly Thr Ile Lys Lys Pro Leu Ile Ile Ala Gly Asn Gly Pro Ser
1 5 10 151 5 10 15
Ile Lys Asp Leu Asp Tyr Ala Leu Phe Pro Lys Asp Phe Asp Val PheIle Lys Asp Leu Asp Tyr Ala Leu Phe Pro Lys Asp Phe Asp Val Phe
20 25 3020 25 30
Arg Cys Asn Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Arg GluArg Cys Asn Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Arg Glu
35 40 4535 40 45
Ile Lys Gly Val Phe Phe Asn Pro Cys Val Leu Ser Ser Gln Met GlnIle Lys Gly Val Phe Phe Asn Pro Cys Val Leu Ser Ser Gln Met Gln
50 55 6050 55 60
Thr Val Gln Tyr Leu Met Asp Asn Gly Glu Tyr Ser Ile Glu Arg PheThr Val Gln Tyr Leu Met Asp Asn Gly Glu Tyr Ser Ile Glu Arg Phe
65 70 75 8065 70 75 80
Phe Cys Ser Val Ser Thr Asp Arg His Asp Phe Asp Gly Asp Tyr GlnPhe Cys Ser Val Ser Thr Asp Arg His Asp Phe Asp Gly Asp Tyr Gln
85 90 9585 90 95
Thr Ile Leu Pro Val Asp Gly Tyr Leu Lys Ala His Tyr Pro Phe ValThr Ile Leu Pro Val Asp Gly Tyr Leu Lys Ala His Tyr Pro Phe Val
100 105 110100 105 110
Cys Asp Thr Phe Ser Leu Phe Lys Gly His Glu Glu Ile Leu Lys HisCys Asp Thr Phe Ser Leu Phe Lys Gly His Glu Glu Ile Leu Lys His
115 120 125115 120 125
Val Lys Tyr His Leu Lys Thr Tyr Ser Lys Glu Leu Ser Ala Gly ValVal Lys Tyr His Leu Lys Thr Tyr Ser Lys Glu Leu Ser Ala Gly Val
130 135 140130 135 140
Leu Met Leu Leu Ser Ala Val Val Leu Gly Tyr Lys Glu Ile Tyr LeuLeu Met Leu Leu Ser Ala Val Val Leu Gly Tyr Lys Glu Ile Tyr Leu
145 150 155 160145 150 155 160
Val Gly Ile Asp Phe Gly Ala Ser Ser Trp Gly His Phe Tyr Asp GluVal Gly Ile Asp Phe Gly Ala Ser Ser Trp Gly His Phe Tyr Asp Glu
165 170 175165 170 175
Ser Gln Ser Gln His Phe Ser Asn His Met Ala Asp Cys His Asn IleSer Gln Ser Gln His Phe Ser Asn His Met Ala Asp Cys His Asn Ile
180 185 190180 185 190
Tyr Tyr Asp Met Leu Thr Ile Cys Leu Cys Gln Lys Tyr Ala Lys LeuTyr Tyr Asp Met Leu Thr Ile Cys Leu Cys Gln Lys Tyr Ala Lys Leu
195 200 205195 200 205
Tyr Ala Leu Ala Pro Asn Ser Pro Leu Ser His Leu Leu Thr Leu AsnTyr Ala Leu Ala Pro Asn Ser Pro Leu Ser His Leu Leu Thr Leu Asn
210 215 220210 215 220
Pro Gln Ala Lys Tyr Pro Phe Glu Leu Leu Asp Lys Pro Ile Gly TyrPro Gln Ala Lys Tyr Pro Phe Glu Leu Leu Asp Lys Pro Ile Gly Tyr
225 230 235 240225 230 235 240
Thr Ser Asp Leu Ile Ile Ser Ser Pro Leu Glu Glu Lys Leu Leu GluThr Ser Asp Leu Ile Ile Ser Ser Pro Leu Glu Glu Lys Leu Leu Glu
245 250 255245 250 255
Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu GluPhe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu
260 265 270260 265 270
Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe LysLys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys
275 280 285275 280 285
Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys LeuAsn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu
290 295 300290 295 300
Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn IleLeu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile
305 310 315 320305 310 315 320
Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu GluGlu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu
325 330 335325 330 335
Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu GluPhe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu
340 345 350340 345 350
Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe LysLys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys
355 360 365355 360 365
Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys LeuAsn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu
370 375 380370 375 380
Leu Ala Ser Arg Leu Asn Asn Ile Leu Arg Lys Ile Lys Arg Lys IleLeu Ala Ser Arg Leu Asn Asn Ile Leu Arg Lys Ile Lys Arg Lys Ile
385 390 395 400385 390 395 400
Leu Pro Phe Phe Trp Gly Gly Gly Val Thr Pro Thr Leu Lys Val SerLeu Pro Phe Phe Trp Gly Gly Gly Val Thr Pro Thr Leu Lys Val Ser
405 410 415405 410 415
Phe Arg Trp Gly Ala AlaPhe Arg Trp Gly Ala Ala
420420
<210> 67<210> 67
<211> 2851<211> 2851
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Фрагмент экспрессии < Ptet-lacY-FRT-add1-FRT><223> Expression fragment <Ptet-lacY-FRT-add1-FRT>
<400> 67<400> 67
tggccagatg attaattcct aatttttgtt gacactctat cattgataga gttattttac 60tggccagatg attaattcct aatttttgtt gacactctat cattgataga gttattttac 60
cactccctat cagtgataga gaaaagtgaa atgaatagtt cgacaaaaat ctagaaataa 120cactccctat cagtgataga gaaaagtgaa atgaatagtt cgacaaaaat ctagaaataa 120
ttttgtttaa ctttaagaag gagatataca aatgtactat ttaaaaaaca caaacttttg 180ttttgtttaa ctttaagaag gagatataca aatgtactat ttaaaaaaca caaacttttg 180
gatgttcggt ttattctttt tcttttactt ttttatcatg ggagcctact tcccgttttt 240gatgttcggt ttattctttt tcttttactt ttttatcatg ggagcctact tcccgttttt 240
cccgatttgg ctacatgaca tcaaccatat cagcaaaagt gatacgggta ttatttttgc 300cccgatttgg ctacatgaca tcaaccatat cagcaaaagt gatacgggta ttatttttgc 300
cgctatttct ctgttctcgc tattattcca accgctgttt ggtctgcttt ctgacaaact 360cgctatttct ctgttctcgc tattattcca accgctgttt ggtctgcttt ctgacaaact 360
cgggctgcgc aaatacctgc tgtggattat taccggcatg ttagtgatgt ttgcgccgtt 420cgggctgcgc aaatacctgc tgtggattat taccggcatg ttagtgatgt ttgcgccgtt 420
ctttattttt atcttcgggc cactgttaca atacaacatt ttagtaggat cgattgttgg 480ctttattttt atcttcgggc cactgttaca atacaacatt ttagtaggat cgattgttgg 480
tggtatttat ctaggctttt gttttaacgc cggtgcgcca gcagtagagg catttattga 540tggtatttat ctaggctttt gttttaacgc cggtgcgcca gcagtagagg catttattga 540
gaaagtcagc cgtcgcagta atttcgaatt tggtcgcgcg cggatgtttg gctgtgttgg 600gaaagtcagc cgtcgcagta atttcgaatt tggtcgcgcg cggatgtttg gctgtgttgg 600
ctgggcgctg tgtgcctcga ttgtcggcat catgttcacc atcaataatc agtttgtttt 660ctgggcgctg tgtgcctcga ttgtcggcat catgttcacc atcaataatc agtttgtttt 660
ctggctgggc tctggctgtg cactcatcct cgccgtttta ctctttttcg ccaaaacgga 720ctggctgggc tctggctgtg cactcatcct cgccgtttta ctctttttcg ccaaaacgga 720
tgcgccctct tctgccacgg ttgccaatgc ggtaggtgcc aaccattcgg catttagcct 780tgcgccctct tctgccacgg ttgccaatgc ggtaggtgcc aaccattcgg catttagcct 780
taagctggca ctggaactgt tcagacagcc aaaactgtgg tttttgtcac tgtatgttat 840taagctggca ctggaactgt tcagacagcc aaaactgtgg tttttgtcac tgtatgttat 840
tggcgtttcc tgcacctacg atgtttttga ccaacagttt gctaatttct ttacttcgtt 900tggcgtttcc tgcacctacg atgtttttga ccaacagttt gctaatttct ttacttcgtt 900
ctttgctacc ggtgaacagg gtacgcgggt atttggctac gtaacgacaa tgggcgaatt 960ctttgctacc ggtgaacagg gtacgcgggt atttggctac gtaacgacaa tgggcgaatt 960
acttaacgcc tcgattatgt tctttgcgcc actgatcatt aatcgcatcg gtgggaaaaa 1020acttaacgcc tcgattatgt tctttgcgcc actgatcatt aatcgcatcg gtgggaaaaa 1020
cgccctgctg ctggctggca ctattatgtc tgtacgtatt attggctcat cgttcgccac 1080cgccctgctg ctggctggca ctattatgtc tgtacgtatt attggctcat cgttcgccac 1080
ctcagcgctg gaagtggtta ttctgaaaac gctgcatatg tttgaagtac cgttcctgct 1140ctcagcgctg gaagtggtta ttctgaaaac gctgcatatg tttgaagtac cgttcctgct 1140
ggtgggctgc tttaaatata ttaccagcca gtttgaagtg cgtttttcag cgacgattta 1200ggtgggctgc tttaaatata ttaccagcca gtttgaagtg cgtttttcag cgacgattta 1200
tctggtctgt ttctgcttct ttaagcaact ggcgatgatt tttatgtctg tactggcggg 1260tctggtctgt ttctgcttct ttaagcaact ggcgatgatt tttatgtctg tactggcggg 1260
caatatgtat gaaagcatcg gtttccaggg cgcttatctg gtgctgggtc tggtggcgct 1320caatatgtat gaaagcatcg gtttccaggg cgcttatctg gtgctgggtc tggtggcgct 1320
gggcttcacc ttaatttccg tgttcacgct tagcggcccc ggcccgcttt ccctgctgcg 1380gggcttcacc ttaatttccg tgttcacgct tagcggcccc ggcccgcttt ccctgctgcg 1380
tcgtcaggtg aatgaagtcg ctgggagcta agcggccgcg tcgacacgca aaaaggccat 1440tcgtcaggtg aatgaagtcg ctgggagcta agcggccgcg tcgacacgca aaaaggccat 1440
ccgtcaggat ggccttctgc ttaatttgat gcctggcagt ttatggcggg cgtcctgccc 1500ccgtcaggat ggccttctgc ttaatttgat gcctggcagt ttatggcggg cgtcctgccc 1500
gccaccctcc gggccgttgc ttcgcaacgt tcaaatccgc tcccggcgga tttgtcctac 1560gccaccctcc gggccgttgc ttcgcaacgt tcaaatccgc tcccggcgga tttgtcctac 1560
tcaggagagc gttcaccgac aaacaacaga taaaacgaaa ggcccagtct ttcgactgag 1620tcaggagagc gttcaccgac aaacaacaga taaaacgaaa ggcccagtct ttcgactgag 1620
cctttcgttt tatttgatgc ctggcagttc cctactctcg catggggaga ccccacacta 1680cctttcgttt tatttgatgc ctggcagttc cctactctcg catggggaga cccccacacta 1680
ccatcatgta tgaatatcct ccttagttcc tattccgaag ttcctattct ctagaaagta 1740ccatcatgta tgaatatcct ccttagttcc tattccgaag ttcctattct ctagaaagta 1740
taggaacttc ggcgcgtcct acctgtgaca cgcgtgccgc agtctcacgc ccggagcgta 1800taggaacttc ggcgcgtcct acctgtgaca cgcgtgccgc agtctcacgc ccggagcgta 1800
gcgaccgagt gagctagcta tttgtttatt tttctaaata cattcaaata tgtatccgct 1860gcgaccgagt gagctagcta tttgtttatt tttctaaata cattcaaata tgtatccgct 1860
catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgaggga 1920catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgaggga 1920
agcggtgatc gccgaagtat cgactcaact atcagaggta gttggcgtca tcgagcgcca 1980agcggtgatc gccgaagtat cgactcaact atcagaggta gttggcgtca tcgagcgcca 1980
tctcgaaccg acgttgctgg ccgtacattt gtacggctcc gcagtggatg gcggcctgaa 2040tctcgaaccg acgttgctgg ccgtacattt gtacggctcc gcagtggatg gcggcctgaa 2040
gccacacagt gatattgatt tgctggttac ggtgaccgta aggcttgatg aaacaacgcg 2100gccacacagt gatattgatt tgctggttac ggtgaccgta aggcttgatg aaacaacgcg 2100
gcgagctttg atcaacgacc ttttggaaac ttcggcttcc cctggagaga gcgagattct 2160gcgagctttg atcaacgacc ttttggaaac ttcggcttcc cctggagaga gcgagattct 2160
ccgcgctgta gaagtcacca ttgttgtgca cgacgacatc attccgtggc gttatccagc 2220ccgcgctgta gaagtcacca ttgttgtgca cgacgacatc attccgtggc gttatccagc 2220
taagcgcgaa ctgcaatttg gagaatggca gcgcaatgac attcttgcag gtatcttcga 2280taagcgcgaa ctgcaatttg gagaatggca gcgcaatgac attcttgcag gtatcttcga 2280
gccagccacg atcgacattg atctggctat cttgctgaca aaagcaagag aacatagcgt 2340gccagccacg atcgacattg atctggctat cttgctgaca aaagcaagag aacatagcgt 2340
tgccttggta ggtccagcgg cggaggaact ctttgatccg gttcctgaac aggatctatt 2400tgccttggta ggtccagcgg cggaggaact ctttgatccg gttcctgaac aggatctatt 2400
tgaggcgcta aatgaaacct taacgctatg gaactcgccg cccgactggg ctggcgatga 2460tgaggcgcta aatgaaacct taacgctatg gaactcgccg cccgactggg ctggcgatga 2460
gcgaaatgta gtgcttacgt tgtcccgcat ttggtacagc gcagtaaccg gcaaaatcgc 2520gcgaaatgta gtgcttacgt tgtcccgcat ttggtacagc gcagtaaccg gcaaaatcgc 2520
gccgaaggat gtcgctgccg actgggcaat ggagcgcctg ccggcccagt atcagcccgt 2580gccgaaggat gtcgctgccg actgggcaat ggagcgcctg ccggcccagt atcagcccgt 2580
catacttgaa gctagacagg cttatcttgg acaagaagaa gatcgcttgg cctcgcgcgc 2640catacttgaa gctagacagg cttatcttgg acaagaagaa gatcgcttgg cctcgcgcgc 2640
agatcagttg gaagaatttg tccactacgt gaaaggcgag atcaccaagg tagtcggcaa 2700agatcagttg gaagaatttg tccactacgt gaaaggcgag atcaccaagg tagtcggcaa 2700
ataatgtcta acaattcgtt caagccgagg ggccgcaaga tccggccacg atgacccggt 2760ataatgtcta acaattcgtt caagccgagg ggccgcaaga tccggccacg atgacccggt 2760
cgtcgggtac cggcagggcg gggcgtaagg cgcgccattt aaatgaagtt cctattccga 2820cgtcgggtac cggcagggcg gggcgtaagg cgcgccattt aaatgaagtt cctattccga 2820
agttcctatt ctctagaaag tataggaact t 2851agttcctatt ctctagaaag tataggaact t 2851
<210> 68<210> 68
<211> 6521<211> 6521
<212> ДНК<212> DNA
<213> Искусственная ПОследовательность<213> Artificial SEQUENCE
<220><220>
<223> Кассета транспонзонов <Ptet-glmUM-PT5-glmS-FRT-dhfr-FRT><223> Transponson cassette <Ptet-glmUM-PT5-glmS-FRT-dhfr-FRT>
<400> 68<400> 68
acaggttggc tgataagtcc ccggtctagc ttgcatgcag attgcagcat tacacgtctt 60acaggttggc tgataagtcc ccggtctagc ttgcatgcag attgcagcat tacacgtctt 60
gagcgattgt gtaggctgga gctgcttcga aattaatacg actcactata ggggaattga 120gagcgattgt gtaggctgga gctgcttcga aattaatacg actcactata ggggaattga 120
ttctggtacc aaatgagtcg accggccaga tgattaattc ctaatttttg ttgacactct 180ttctggtacc aaatgagtcg accggccaga tgattaattc ctaatttttg ttgacactct 180
atcattgata gagttatttt accactccct atcagtgata gagaaaagtg aaatgaatag 240atcattgata gagttatttt accactccct atcagtgata gagaaaagtg aaatgaatag 240
ttcgacaaaa atctagaaat aattttgttt aactttaaga aggagatata caaatgctga 300ttcgacaaaa atctagaaat aattttgttt aactttaaga aggagatata caaatgctga 300
acaacgcgat gtctgttgtt atcctggcgg cgggtaaagg tacccgtatg tactctgacc 360acaacgcgat gtctgttgtt atcctggcgg cgggtaaagg tacccgtatg tactctgacc 360
tgccgaaagt tctgcacacc ctggcgggta aagcgatggt tcagcacgtt atcgacgcgg 420tgccgaaagt tctgcacacc ctggcgggta aagcgatggt tcagcacgtt atcgacgcgg 420
cgaacgaact gggtgcggcg cacgttcacc tggtttacgg tcacggtggt gacctgctga 480cgaacgaact gggtgcggcg cacgttcacc tggtttacgg tcacggtggt gacctgctga 480
aacaggcgct gaaagacgac aacctgaact gggttctgca ggcggaacag ctgggtaccg 540aacaggcgct gaaagacgac aacctgaact gggttctgca ggcggaacag ctgggtaccg 540
gtcacgcgat gcagcaggcg gcgccgttct tcgcggacga cgaagacatc ctgatgctgt 600gtcacgcgat gcagcaggcg gcgccgttct tcgcggacga cgaagacatc ctgatgctgt 600
acggtgacgt tccgctgatc tctgttgaaa ccctgcagcg tctgcgtgac gcgaaaccgc 660acggtgacgt tccgctgatc tctgttgaaa ccctgcagcg tctgcgtgac gcgaaaccgc 660
agggtggtat cggtctgctg accgttaaac tggacgaccc gaccggttac ggtcgtatca 720agggtggtat cggtctgctg accgttaaac tggacgaccc gaccggttac ggtcgtatca 720
cccgtgaaaa cggtaaagta accggtatcg ttgaacacaa agacgcgacc gacgaacagc 780cccgtgaaaa cggtaaagta accggtatcg ttgaacacaa agacgcgacc gacgaacagc 780
gtcagatcca ggagatcaac accggtatcc tgatcgcgaa cggtgcagac atgaaacgtt 840gtcagatcca ggagatcaac accggtatcc tgatcgcgaa cggtgcagac atgaaacgtt 840
ggctggcgaa actgaccaac aacaacgcgc agggtgaata ctacatcacc gacatcatcg 900ggctggcgaa actgaccaac aacaacgcgc agggtgaata ctacatcacc gacatcatcg 900
cgctggcgta ccaggaaggt cgtgaaatcg ttgcggttca cccgcagcgt ctgtctgaag 960cgctggcgta ccaggaaggt cgtgaaatcg ttgcggttca cccgcagcgt ctgtctgaag 960
ttgaaggtgt taacaaccgt ctgcagctgt ctcgtctgga acgtgtttac cagtctgaac 1020ttgaaggtgt taacaaccgt ctgcagctgt ctcgtctgga acgtgtttac cagtctgaac 1020
aggcggaaaa actgctgctg gcgggtgtta tgctgcgtga cccggcgcgt ttcgacctgc 1080aggcggaaaa actgctgctg gcgggtgtta tgctgcgtga cccggcgcgt ttcgacctgc 1080
gtggtaccct gacccacggt cgtgacgttg aaatcgacac caacgttatc atcgaaggta 1140gtggtaccct gacccacggt cgtgacgttg aaatcgacac caacgttatc atcgaaggta 1140
acgttaccct gggtcaccgt gtaaaaatcg gcaccggttg cgttatcaaa aactctgtta 1200acgttaccct gggtcaccgt gtaaaaatcg gcaccggttg cgttatcaaa aactctgtta 1200
tcggtgacga ctgcgaaatc tctccgtaca ccgttgttga agacgcgaac ctggcggcgg 1260tcggtgacga ctgcgaaatc tctccgtaca ccgttgttga agacgcgaac ctggcggcgg 1260
cgtgcaccat cggtccgttc gcgcgtctgc gtccgggtgc ggaactgctg gaaggtgcgc 1320cgtgcaccat cggtccgttc gcgcgtctgc gtccgggtgc ggaactgctg gaaggtgcgc 1320
acgttggtaa cttcgttgaa atgaaaaaag cgcgtctggg taaaggttct aaagcgggtc 1380acgttggtaa cttcgttgaa atgaaaaaag cgcgtctggg taaaggttct aaagcgggtc 1380
acctgaccta cctgggtgac gcggaaatcg gtgacaacgt taacatcggt gcgggtacca 1440acctgaccta cctgggtgac gcggaaatcg gtgacaacgt taacatcggt gcgggtacca 1440
tcacctgcaa ctacgacggt gcgaacaaat tcaaaaccat catcggtgac gacgttttcg 1500tcacctgcaa ctacgacggt gcgaacaaat tcaaaaccat catcggtgac gacgttttcg 1500
ttggttctga cacccagctg gttgcgccgg ttaccgttgg taaaggtgcg accatcgcgg 1560ttggttctga cacccagctg gttgcgccgg ttaccgttgg taaaggtgcg accatcgcgg 1560
cgggtaccac cgttacccgt aacgttggtg aaaacgcgct ggcgatctct cgtgttccgc 1620cgggtaccac cgttacccgt aacgttggtg aaaacgcgct ggcgatctct cgtgttccgc 1620
agacccagaa agaaggttgg cgtcgtccgg ttaaaaaaaa ataacgaagg agatagaacc 1680agacccagaa agaaggttgg cgtcgtccgg ttaaaaaaaa ataacgaagg agatagaacc 1680
atgtccaacc gtaaatactt cggtacggac ggtatccgtg gtcgtgtagg tgatgctccg 1740atgtccaacc gtaaatactt cggtacggac ggtatccgtg gtcgtgtagg tgatgctccg 1740
attacgccgg atttcgtcct gaaactcggt tgggcagcgg gtaaagttct cgcacgtcac 1800attacgccgg atttcgtcct gaaactcggt tgggcagcgg gtaaagttct cgcacgtcac 1800
ggctctcgta aaatcatcat cggtaaagac acccgtatct ctggttacat gctcgaatct 1860ggctctcgta aaatcatcat cggtaaagac acccgtatct ctggttacat gctcgaatct 1860
gcactggaag cgggtctggc tgcagctggt ctgtctgcac tgttcacggg tccgatgcca 1920gcactggaag cgggtctggc tgcagctggt ctgtctgcac tgttcacggg tccgatgcca 1920
accccagctg tagcgtacct gactcgcact ttccgtgcag aagcaggtat cgtgatctct 1980accccagctg tagcgtacct gactcgcact ttccgtgcag aagcaggtat cgtgatctct 1980
gcctctcaca acccgttcta cgacaacggt atcaaattct tcagcatcga tggtaccaaa 2040gcctctcaca acccgttcta cgacaacggt atcaaattct tcagcatcga tggtaccaaa 2040
ctcccagacg cggttgaaga ggctatcgaa gcggaaatgg agaaagaaat ctcttgtgta 2100ctcccagacg cggttgaaga ggctatcgaa gcggaaatgg agaaagaaat ctcttgtgta 2100
gactctgccg aactcggtaa agcgtctcgt atcgttgatg cagcgggtcg ttacatcgag 2160gactctgccg aactcggtaa agcgtctcgt atcgttgatg cagcgggtcg ttacatcgag 2160
ttctgcaaag ccacctttcc gaacgaactg agcctgtctg agctgaaaat cgtcgtagac 2220ttctgcaaag ccacctttcc gaacgaactg agcctgtctg agctgaaaat cgtcgtagac 2220
tgtgccaacg gtgcgactta ccacattgcc ccaaacgtac tgcgtgagct gggtgctaac 2280tgtgccaacg gtgcgactta ccacattgcc ccaaacgtac tgcgtgagct gggtgctaac 2280
gtcatcgcga tcggttgtga accgaacggt gtcaacatca acgcggaagt aggtgcgacc 2340gtcatcgcga tcggttgtga accgaacggt gtcaacatca acgcggaagt aggtgcgacc 2340
gatgttcgtg cactgcaggc tcgtgtactc gcggagaaag cggatctcgg tatcgccttt 2400gatgttcgtg cactgcaggc tcgtgtactc gcggagaaag cggatctcgg tatcgccttt 2400
gacggtgatg gtgaccgtgt tatcatggtt gaccacgaag gtaacaaagt ggatggtgac 2460gacggtgatg gtgaccgtgt tatcatggtt gaccacgaag gtaacaaagt ggatggtgac 2460
cagatcatgt acatcattgc ccgtgaaggt ctgcgtcagg gtcagctgcg tggtggtgca 2520cagatcatgt acatcattgc ccgtgaaggt ctgcgtcagg gtcagctgcg tggtggtgca 2520
gtaggtaccc tcatgagcaa catgggtctg gaactggccc tgaaacagct gggtatccca 2580gtaggtaccc tcatgagcaa catgggtctg gaactggccc tgaaacagct gggtatccca 2580
ttcgctcgtg ctaaagtagg cgaccgttac gttctggaga aaatgcagga gaaaggttgg 2640ttcgctcgtg ctaaagtagg cgaccgttac gttctggaga aaatgcagga gaaaggttgg 2640
cgtatcggtg ccgaaaactc tggtcacgtc atcctgctgg acaaaaccac taccggtgac 2700cgtatcggtg ccgaaaactc tggtcacgtc atcctgctgg acaaaaccac taccggtgac 2700
ggtatcgtag caggtctgca ggtactcgcc gctatggccc gtaaccacat gtccctccat 2760ggtatcgtag caggtctgca ggtactcgcc gctatggccc gtaaccacat gtccctccat 2760
gacctctgct ctggtatgaa aatgttcccg cagatcctgg ttaacgttcg ttacaccgca 2820gacctctgct ctggtatgaa aatgttcccg cagatcctgg ttaacgttcg ttacaccgca 2820
ggttctggtg atccgctgga acacgagtct gtgaaagccg ttaccgcaga agtggaagcg 2880ggttctggtg atccgctgga acacgagtct gtgaaagccg ttaccgcaga agtggaagcg 2880
gccctgggta accgtggtcg tgtactgctg cgtaaatccg gtactgagcc actgatccgt 2940gccctgggta accgtggtcg tgtactgctg cgtaaatccg gtactgagcc actgatccgt 2940
gttatggttg agggcgaaga tgaagcccag gtcaccgaat ttgcgcaccg tattgccgac 3000gttatggttg agggcgaaga tgaagcccag gtcaccgaat ttgcgcaccg tattgccgac 3000
gcagtcaaag cggtttaatt tcgtcgacac acaggaaaca tattaaaaat taaaacctgc 3060gcagtcaaag cggtttaatt tcgtcgacac acaggaaaca tattaaaaat taaaacctgc 3060
aggagtttaa acgcggccgc gatatcgttg taaaacgacg gccagtgcaa gaatcataaa 3120aggagtttaa acgcggccgc gatatcgttg taaaacgacg gccagtgcaa gaatcataaa 3120
aaatttattt gctttcagga aaatttttct gtataataga ttcataaatt tgagagagga 3180aaatttattt gctttcagga aaatttttct gtataataga ttcataaatt tgagagagga 3180
gtttttgtga gcggataaca attccccatc ttagtatatt agttaagtat aaatacacaa 3240gtttttgtga gcggataaca attccccatc ttagtatatt agttaagtat aaatacacaa 3240
ggagatatac atatgtgcgg tatcgttggt gctatcgcac agcgtgatgt agcggagatc 3300ggagatatac atatgtgcgg tatcgttggt gctatcgcac agcgtgatgt agcggagatc 3300
ctcctggaag gtctgcgtcg tctcgaatac cgtggttacg actctgccgg tctggcagta 3360ctcctggaag gtctgcgtcg tctcgaatac cgtggttacg actctgccgg tctggcagta 3360
gtggatgcag aaggtcacat gactcgtctg cgtcgtctgg gtaaagtgca gatgctcgcg 3420gtggatgcag aaggtcacat gactcgtctg cgtcgtctgg gtaaagtgca gatgctcgcg 3420
caggcggcgg aagaacaccc actccacggt ggtacgggta tcgcacacac tcgttgggca 3480caggcggcgg aagaacaccc actccacggt ggtacgggta tcgcacacac tcgttgggca 3480
acccacggtg aaccgtctga ggtcaacgca cacccgcatg ttagcgagca catcgtagtc 3540acccacggtg aaccgtctga ggtcaacgca cacccgcatg ttagcgagca catcgtagtc 3540
gttcacaacg gtatcatcga gaaccacgaa ccactccgtg aggaactcaa agcccgtggt 3600gttcacaacg gtatcatcga gaaccacgaa ccactccgtg aggaactcaa agcccgtggt 3600
tacaccttcg taagcgaaac cgacacggaa gttatcgccc acctcgttaa ctgggaactc 3660tacaccttcg taagcgaaac cgacacggaa gttatcgccc acctcgttaa ctgggaactc 3660
aaacagggtg gtactctgcg tgaagcagtt ctgcgtgcca ttccacagct gcgtggtgca 3720aaacagggtg gtactctgcg tgaagcagtt ctgcgtgcca ttccacagct gcgtggtgca 3720
tacggtaccg tgatcatgga ctctcgtcat ccggataccc tgctcgccgc acgttctggt 3780tacggtaccg tgatcatgga ctctcgtcat ccggataccc tgctcgccgc acgttctggt 3780
tctccactcg ttatcggtct gggtatgggt gagaacttca tcgcctctga tcagctggcc 3840tctccactcg ttatcggtct gggtatgggt gagaacttca tcgcctctga tcagctggcc 3840
ctgctcccag ttacccgtcg cttcatcttc ctggaagagg gtgacatcgc cgaaatcacc 3900ctgctcccag ttacccgtcg cttcatcttc ctggaagagg gtgacatcgc cgaaatcacc 3900
cgtcgttccg ttaacatctt cgacaaaacg ggtgcggaag ttaaacgtca ggacatcgag 3960cgtcgttccg ttaacatctt cgacaaaacg ggtgcggaag ttaaacgtca ggacatcgag 3960
tctaacctgc agtatgacgc tggtgacaaa ggcatctacc gtcactacat gcagaaagag 4020tctaacctgc agtatgacgc tggtgacaaa ggcatctacc gtcactacat gcagaaagag 4020
atctacgaac agccgaacgc gatcaaaaac accctgaccg gtcgtatctc tcacggtcag 4080atctacgaac agccgaacgc gatcaaaaac accctgaccg gtcgtatctc tcacggtcag 4080
gttgacctgt ctgagctggg tccaaacgcg gacgaactcc tgtccaaagt cgagcacatc 4140gttgacctgt ctgagctggg tccaaacgcg gacgaactcc tgtccaaagt cgagcacatc 4140
cagatcctgg cttgtggtac ctcttacaac tccggtatgg tttctcgtta ctggttcgaa 4200cagatcctgg cttgtggtac ctcttacaac tccggtatgg tttctcgtta ctggttcgaa 4200
tctctggcag gtatcccatg cgacgttgaa atcgcctccg aattccgtta tcgtaaatct 4260tctctggcag gtatcccatg cgacgttgaa atcgcctccg aattccgtta tcgtaaatct 4260
gcggtacgtc gtaactccct catgatcacc ctgtctcagt ctggtgaaac cgctgatact 4320gcggtacgtc gtaactccct catgatcacc ctgtctcagt ctggtgaaac cgctgatact 4320
ctggcaggtc tgcgtctcag caaagaactg ggttacctgg gttctctggc catctgcaac 4380ctggcaggtc tgcgtctcag caaagaactg ggttacctgg gttctctggc catctgcaac 4380
gttccgggtt ctagcctggt tcgtgagtct gacctggctc tgatgaccaa cgcgggtacg 4440gttccgggtt ctagcctggt tcgtgagtct gacctggctc tgatgaccaa cgcgggtacg 4440
gagatcggtg ttgcctctac caaagcgttc actacccagc tcactgtcct gctgatgctg 4500gagatcggtg ttgcctctac caaagcgttc actacccagc tcactgtcct gctgatgctg 4500
gttgccaaac tgtctcgtct caaaggcctc gacgctagca tcgaacacga catcgtacac 4560gttgccaaac tgtctcgtct caaaggcctc gacgctagca tcgaacacga catcgtacac 4560
ggtctgcagg ccctcccatc tcgtatcgag cagatgctgt ctcaggacaa acgtatcgaa 4620ggtctgcagg ccctcccatc tcgtatcgag cagatgctgt ctcaggacaa acgtatcgaa 4620
gcactggcag aagacttcag cgacaaacac cacgcgctgt ttctgggtcg tggtgaccag 4680gcactggcag aagacttcag cgacaaacac cacgcgctgt ttctgggtcg tggtgaccag 4680
tacccaattg cgctggaagg tgccctgaaa ctgaaagaga tcagctacat ccatgcagag 4740tacccaattg cgctggaagg tgccctgaaa ctgaaagaga tcagctacat ccatgcagag 4740
gcatacgcag cgggtgagct gaaacatggt ccactggccc tgatcgacgc agatatgccg 4800gcatacgcag cgggtgagct gaaacatggt ccactggccc tgatcgacgc agatatgccg 4800
gttattgtgg ttgctccgaa caacgaactg ctggagaaac tgaaatccaa catcgaggaa 4860gttattgtgg ttgctccgaa caacgaactg ctggagaaac tgaaatccaa catcgaggaa 4860
gtacgtgcgc gtggtggtca gctgtacgtg tttgctgacc aggacgcggg tttcgtttcc 4920gtacgtgcgc gtggtggtca gctgtacgtg tttgctgacc aggacgcggg tttcgtttcc 4920
agcgacaaca tgcacatcat cgaaatgccg catgttgaag aggtaatcgc gccaatcttc 4980agcgacaaca tgcacatcat cgaaatgccg catgttgaag aggtaatcgc gccaatcttc 4980
tacaccgtac cgctgcagct gctggcgtac catgtagccc tgatcaaagg tacggacgtt 5040tacaccgtac cgctgcagct gctggcgtac catgtagccc tgatcaaagg tacggacgtt 5040
gaccagccgc gtaacctggc gaaatccgtg accgtggaat aacgcggagg cgcgccattt 5100gaccagccgc gtaacctggc gaaatccgtg accgtggaat aacgcggagg cgcgccattt 5100
aaatcaacct cagcggtcat agctgtttcc tgtgactgag caataactag cataacccct 5160aaatcaacct cagcggtcat agctgtttcc tgtgactgag caataactag cataacccct 5160
tggggcctct aaacgggtct tgaggggttt tttgctgaaa ccaatttgcc tggcggcagt 5220tggggcctct aaacgggtct tgaggggttt tttgctgaaa ccaatttgcc tggcggcagt 5220
agcgcggtgg tcccacctga ccccatgccg aactcagaag tgaaacgccg tagcgccgat 5280agcgcggtgg tcccacctga ccccatgccg aactcagaag tgaaacgccg tagcgccgat 5280
ggtagtgtgg ggtctcccca tgcgagagta gggaactgcc aggcatcaaa taaaacgaaa 5340ggtagtgtgg ggtctcccca tgcgagagta gggaactgcc aggcatcaaa taaaacgaaa 5340
ggctcagtcg aaagactggg cctttcggga tccaggccgg cctgttaacg aattaatctt 5400ggctcagtcg aaagactggg cctttcggga tccaggccgg cctgttaacg aattaatctt 5400
ccgcggcggt atcgataagc ttgatatcga attccgaagt tcctattctc tagaaagtat 5460ccgcggcggt atcgataagc ttgatatcga attccgaagt tcctattctc tagaaagtat 5460
aggaacttca ggtctgaaga ggagtttacg tccagccaag ctagcttggc tgcaggtcgt 5520aggaacttca ggtctgaaga ggagtttacg tccagccaag ctagcttggc tgcaggtcgt 5520
cgaaattcta ccgggtaggg gaggcgcttt tcccaaggca gtctggagca tgcgctttag 5580cgaaattcta ccgggtaggg gaggcgcttt tcccaaggca gtctggagca tgcgctttag 5580
cagccccgct gggcacttgg cgctacacaa gtggcctctg gcctcgcaca cattccacat 5640cagccccgct gggcacttgg cgctacacaa gtggcctctg gcctcgcaca cattccacat 5640
ccaccggtag gcgccaaccg gctccgttct ttggtggccc cttcgcgcca ccttctactc 5700ccaccggtag gcgccaaccg gctccgttct ttggtggccc cttcgcgcca ccttctactc 5700
ctcccctagt caggaagttc ccccccgccc cgcagctcgc gtcgtgcagg acgtgacaaa 5760ctcccctagt caggaagttc ccccccgccc cgcagctcgc gtcgtgcagg acgtgacaaa 5760
tggaagtagc acgtctcact agtctcgtgc agatggacag caccgctgag caatggaagc 5820tggaagtagc acgtctcact agtctcgtgc agatggacag caccgctgag caatggaagc 5820
gggtaggcct ttggggcagc ggccaatagc agctttgctc cttcgctttc tgggctcaga 5880gggtaggcct ttggggcagc ggccaatagc agctttgctc cttcgctttc tgggctcaga 5880
ggctgggaag gggtgggtcc gggggcgggc tcaggggcgg gctcaggggc ggggcgggcg 5940ggctgggaag gggtggggtcc gggggcgggc tcaggggcgg gctcaggggc ggggcgggcg 5940
cccgaaggtc ctccggaggc ccggcattct gcacgcttca aaagcgcacg tctgccgcgc 6000cccgaaggtc ctccggaggc ccggcattct gcacgcttca aaagcgcacg tctgccgcgc 6000
tgttctcctc ttcctcatct ccgggccttt cgacctgcag cctgttgaca attaatcatc 6060tgttctcctc ttcctcatct ccggggccttt cgacctgcag cctgttgaca attaatcatc 6060
ggcatagtat atcggcatag tataatacga caaggtgagg aactaaacca tgggtcaaag 6120ggcatagtat atcggcatag tataatacga caaggtgagg aactaaacca tgggtcaaag 6120
tagcgatgaa gccaacgctc ccgttgcagg gcagtttgcg cttcccctga gtgccacctt 6180tagcgatgaa gccaacgctc ccgttgcagg gcagtttgcg cttcccctga gtgccacctt 6180
tggcttaggg gatcgcgtac gcaagaaatc tggtgccgct tggcagggtc aagtcgtcgg 6240tggcttaggg gatcgcgtac gcaagaaatc tggtgccgct tggcagggtc aagtcgtcgg 6240
ttggtattgc acaaaactca ctcctgaagg ctatgcggtc gagtccgaat cccacccagg 6300ttggtattgc acaaaactca ctcctgaagg ctatgcggtc gagtccgaat cccacccagg 6300
ctcagtgcaa atttatcctg tggctgcact tgaacgtgtg gcctaatgag gggatcaatt 6360ctcagtgcaa atttatcctg tggctgcact tgaacgtgtg gcctaatgag gggatcaatt 6360
ctctagagct cgctgatcag aagttcctat tctctagaaa gtataggaac ttcgatggcg 6420ctctagagct cgctgatcag aagttcctat tctctagaaa gtataggaac ttcgatggcg 6420
cctcatccct gaagccaata caacaaaaat taggaattaa tcatctggcc aatttcaggt 6480cctcatccct gaagccaata caacaaaaat taggaattaa tcatctggcc aatttcaggt 6480
ggcacttttc gggcagaccg gggacttatc agccaacctg t 6521ggcacttttc gggcagaccg gggacttatc agccaacctg t 6521
<210> 69<210> 69
<211> 3919<211> 3919
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Экспрессионная кассета <Ptet-glmSm-gna1-FRT-aacC1-FRT><223> Expression cassette <Ptet-glmSm-gna1-FRT-aacC1-FRT>
<400> 69<400> 69
ggtacccaaa tatgcataat cgaaattaat acgactcact ataggggaat tgattctggt 60ggtacccaaa tatgcataat cgaaattaat acgactcact ataggggaat tgattctggt 60
accaaatgag tcgaccggcc agatgattaa ttcctaattt ttgttgacac tctatcattg 120accaaatgag tcgaccggcc agatgattaa ttcctaattt ttgttgacac tctatcattg 120
atagagttat tttaccactc cctatcagtg atagagaaaa gtgaaatgaa tagttcgaca 180atagagttat tttaccactc cctatcagtg atagagaaaa gtgaaatgaa tagttcgaca 180
aaaatctaga aataattttg tttaacttta agaaggagat atacaaatgt gcggtatcgt 240aaaatctaga aataattttg tttaacttta agaaggagat atacaaatgt gcggtatcgt 240
tggtgctatc gcacagcgtg atgtagcgaa aatcctcctg gaaggtctgc gtcgtctcga 300tggtgctatc gcacagcgtg atgtagcgaa aatcctcctg gaaggtctgc gtcgtctcga 300
ataccgtggt tacgactctg ccggtctggc agtagtggat gcagaaggtc acatgactcg 360ataccgtggt tacgactctg ccggtctggc agtagtggat gcagaaggtc acatgactcg 360
tctgcgtcgt ctgggtaaag tgcagatgct cgcgcaggcg gcggaagaac acccactcca 420tctgcgtcgt ctgggtaaag tgcagatgct cgcgcaggcg gcggaagaac acccactcca 420
cggtggtacg ggtatcgcac acactcgttg ggcaacccac ggtgaaccgt ctgaggtcaa 480cggtggtacg ggtatcgcac acactcgttg ggcaacccac ggtgaaccgt ctgaggtcaa 480
cgcacacccg catgttagcg agcacatcgt agtcgttcac aacggtatca tcgagaacca 540cgcacacccg catgttagcg agcacatcgt agtcgttcac aacggtatca tcgagaacca 540
cgaaccactc cgtgaggaac tcaaagcccg tggttacacc ttcgtaagcg aaaccgacac 600cgaaccactc cgtgaggaac tcaaagcccg tggttacacc ttcgtaagcg aaaccgacac 600
ggaagttatc gcccacctcg ttaactggga actcaaacag ggtggtactc tgcgtgaagc 660ggaagttatc gcccacctcg ttaactggga actcaaacag ggtggtactc tgcgtgaagc 660
agttctgcgt gccattccac agctgcgtgg tgcatacggt accgtgatca tggactctcg 720agttctgcgt gccattccac agctgcgtgg tgcatacggt accgtgatca tggactctcg 720
tcatccggat accctgctcg ccgcacgttc tggttctcca ctcgttatcg gtctgggtat 780tcatccggat accctgctcg ccgcacgttc tggttctcca ctcgttatcg gtctgggtat 780
gggtgagaac ttcatcgcct ctgatcagct ggccctgctc ccagttaccc gtcgcttcat 840gggtgagaac ttcatcgcct ctgatcagct ggccctgctc ccagttaccc gtcgcttcat 840
cttcctggaa gagggtgaca tcgccgaaat cacccgtcgt tccgttaaca tcttcgacaa 900cttcctggaa gagggtgaca tcgccgaaat cacccgtcgt tccgttaaca tcttcgacaa 900
aacgggtgcg gaagttaaac gtcaggacat cgagtctaac ctgcagtatg acgctggtga 960aacgggtgcg gaagttaaac gtcaggacat cgagtctaac ctgcagtatg acgctggtga 960
caaaggcatc taccgtcact acatgcagaa agagatctac gaacagccga acgcgatcaa 1020caaaggcatc taccgtcact acatgcagaa agagatctac gaacagccga acgcgatcaa 1020
aaacaccctg accggtcgta tctctcacgg tcaggttgac ctgtctgagc tgggtccaaa 1080aaacaccctg accggtcgta tctctcacgg tcaggttgac ctgtctgagc tgggtccaaa 1080
cgcggacgaa ctcctgtcca aagtcgagca catccagatc ctggcttgtg gtacctctta 1140cgcggacgaa ctcctgtcca aagtcgagca catccagatc ctggcttgtg gtacctctta 1140
caactccggt atggtttctc gttactggtt cgaatctctg gcaggtatcc catgcgacgt 1200caactccggt atggtttctc gttactggtt cgaatctctg gcaggtatcc catgcgacgt 1200
tgaaatcgcc tccgaattcc gttatcgtaa atctgcggta cgtcgtaact ccctcatgat 1260tgaaatcgcc tccgaattcc gttatcgtaa atctgcggta cgtcgtaact ccctcatgat 1260
caccctgtct cagtctggtg aaaccgctga tactctggca ggtctgcgtc tcagcaaaga 1320caccctgtct cagtctggtg aaaccgctga tactctggca ggtctgcgtc tcagcaaaga 1320
actgggttac ctgggttctc tggccatctg caacgttccg ggttctagcc tggttcgtga 1380actgggttac ctgggttctc tggccatctg caacgttccg ggttctagcc tggttcgtga 1380
gtctgtgctg gctctgatga ccaacgcggg tacggagatc ggtgttgcct ctaccaaagc 1440gtctgtgctg gctctgatga ccaacgcggg tacggagatc ggtgttgcct ctaccaaagc 1440
gttcactacc cagctcactg tcctgctgat gctggttgcc aaactgtctc gtctcaaagg 1500gttcactacc cagctcactg tcctgctgat gctggttgcc aaactgtctc gtctcaaagg 1500
cctcgacgct agcatcgaac acgacatcgt acacggtctg caggccctcc catctcgtat 1560cctcgacgct agcatcgaac acgacatcgt acacggtctg caggccctcc catctcgtat 1560
cgagcagatg ctgccgcagg acaaacgtat cgaagcactg gcagaagact tcagcgacaa 1620cgagcagatg ctgccgcagg acaaacgtat cgaagcactg gcagaagact tcagcgacaa 1620
acaccacgcg ctgtttctgg gtcgtggtga ccagtaccca attgcgctgg aaggtgccct 1680acaccacgcg ctgtttctgg gtcgtggtga ccagtaccca attgcgctgg aaggtgccct 1680
gaaactgaaa gagatcagct acatccatgc agaggcatac gcagcgggtg agctgaaaca 1740gaaactgaaa gagatcagct acatccatgc agaggcatac gcagcgggtg agctgaaaca 1740
tggtccactg gccctgatcg acgcagatat gccggttatt gtggttgctc cgaacaacgg 1800tggtccactg gccctgatcg acgcagatat gccggttatt gtggttgctc cgaacaacgg 1800
cctgctggag aaactgaaat ccaacatcga ggaagtacgt gcgcgtggtg gtcagctgta 1860cctgctggag aaactgaaat ccaacatcga ggaagtacgt gcgcgtggtg gtcagctgta 1860
cgtgtttgct gaccaggacg cgggtttcgt ttccagcgac aacatgcaca tcatcgaaat 1920cgtgtttgct gaccaggacg cgggtttcgt ttccagcgac aacatgcaca tcatcgaaat 1920
gccgcatgtt gaagaggtaa tcgcgccaat cttctacacc gtaccgctgc agctgctggc 1980gccgcatgtt gaagaggtaa tcgcgccaat cttctacacc gtaccgctgc agctgctggc 1980
gtaccatgta gccctgatca aaggtacgga cgttgaccag ccgcgtaacc tggcgaaatc 2040gtaccatgta gccctgatca aaggtacgga cgttgaccag ccgcgtaacc tggcgaaatc 2040
cgtgaccgtg gaataacgaa ggagatagaa ccatgagctt acccgatgga ttttatataa 2100cgtgaccgtg gaataacgaa ggagatagaa ccatgagctt acccgatgga ttttatataa 2100
ggcgaatgga agagggggat ttggaacagg tcactgagac gctaaaggtt ttgaccaccg 2160ggcgaatgga agagggggat ttggaacagg tcactgagac gctaaaggtt ttgaccaccg 2160
tgggcactat tacccccgaa tccttcagca aactcataaa atactggaat gaagccacag 2220tgggcactat tacccccgaa tccttcagca aactcataaa atactggaat gaagccacag 2220
tatggaatga taacgaagat aaaaaaataa tgcaatataa ccccatggtg attgtggaca 2280tatggaatga taacgaagat aaaaaaataa tgcaatataa ccccatggtg attgtggaca 2280
agcgcaccga gacggttgcc gctacgggga atatcatcat cgaaagaaag atcattcatg 2340agcgcaccga gacggttgcc gctacgggga atatcatcat cgaaagaaag atcattcatg 2340
aactggggct atgtggccac atcgaggaca ttgcagtaaa ctccaagtat cagggccaag 2400aactggggct atgtggccac atcgaggaca ttgcagtaaa ctccaagtat cagggccaag 2400
gtttgggcaa gctcttgatt gatcaattgg taactatcgg ctttgactac ggttgttata 2460gtttgggcaa gctcttgatt gatcaattgg taactatcgg ctttgactac ggttgttata 2460
agattatttt agattgcgat gagaaaaatg tcaaattcta tgaaaaatgt gggtttagca 2520agattatttt agattgcgat gagaaaaatg tcaaattcta tgaaaaatgt gggtttagca 2520
acgcaggcgt ggaaatgcaa attagaaaat agaataacta gcataacccc ttggggcctc 2580acgcaggcgt ggaaatgcaa attagaaaat agaataacta gcataacccc ttggggcctc 2580
taaacgggtc ttgaggggtt ttttgctgaa accaatttgc ctggcggcag tagcgcggtg 2640taaacgggtc ttgaggggtt ttttgctgaa accaatttgc ctggcggcag tagcgcggtg 2640
gtcccacctg accccatgcc gaactcagaa gtgaaacgcc gtagcgccga tggtagtgtg 2700gtcccacctg accccatgcc gaactcagaa gtgaaacgcc gtagcgccga tggtagtgtg 2700
gggtctcccc atgcgagagt agggaactgc caggcatcaa ataaaacgaa aggctcagtc 2760gggtctcccc atgcgagagt agggaactgc caggcatcaa ataaaacgaa aggctcagtc 2760
gaaagactgg gcctttcggg atccaggccg gcctgttaac gaattaatct tccgcggcgg 2820gaaagactgg gcctttcggg atccaggccg gcctgttaac gaattaatct tccgcggcgg 2820
tatcgataag cttgatatcg aattccgaag ttcctattct ctagaaagta taggaacttc 2880tatcgataag cttgatatcg aattccgaag ttcctattct ctagaaagta taggaacttc 2880
aggtctgaag aggagtttac gtccagccaa gctagcttgg ctgcaggtcg tcgaaattct 2940aggtctgaag aggagtttac gtccagccaa gctagcttgg ctgcaggtcg tcgaaattct 2940
acgatctcgg cttgaacgaa ttgttaggtg gcggtacttg ggtcgatatc aaagtgcatc 3000acgatctcgg cttgaacgaa ttgttaggtg gcggtacttg ggtcgatatc aaagtgcatc 3000
acttcttccc gtatgcccaa ctttgtatag agagccactg cgggatcgtc accgtaatct 3060acttcttccc gtatgcccaa ctttgtatag agagccactg cgggatcgtc accgtaatct 3060
gcttgcacgt agatcacata agcaccaagc gcgttggcct catgcttgag gagattgatg 3120gcttgcacgt agatcacata agcaccaagc gcgttggcct catgcttgag gagattgatg 3120
agcgcggtgg caatgccctg cctccggtgc tcgccggaga ctgcgagatc atagatatag 3180agcgcggtgg caatgccctg cctccggtgc tcgccggaga ctgcgagatc atagatatag 3180
atctcactac gcggctgctc aaacctgggc agaacgtaag ccgcgagagc gccaacaacc 3240atctcactac gcggctgctc aaacctgggc agaacgtaag ccgcgagagc gccaacaacc 3240
gcttcttggt cgaaggcagc aagcgcgatg aatgtcttac tacggagcaa gttcccgagg 3300gcttcttggt cgaaggcagc aagcgcgatg aatgtcttac tacggagcaa gttcccgagg 3300
taatcggagt ccggctgatg ttgggagtag gtggctacgt ctccgaactc acgaccgaaa 3360taatcggagt ccggctgatg ttgggagtag gtggctacgt ctccgaactc acgaccgaaa 3360
agatcaagag cagcccgcat ggatttgact tggtcagggc cgagcctaca tgtgcgaatg 3420agatcaagag cagcccgcat ggatttgact tggtcagggc cgagcctaca tgtgcgaatg 3420
atgcccatac ttgagccacc taactttgtt ttagggcgac tgccctgctg cgtaacatcg 3480atgcccatac ttgagccacc taactttgtt ttagggcgac tgccctgctg cgtaacatcg 3480
ttgctgctgc gtaacatcgt tgctgctcca taacatcaaa catcgaccca cggcgtaacg 3540ttgctgctgc gtaacatcgt tgctgctcca taacatcaaa catcgaccca cggcgtaacg 3540
cgcttgctgc ttggatgccc gaggcataga ctgtacaaaa aaacagtcat aacaagccat 3600cgcttgctgc ttggatgccc gaggcataga ctgtacaaaa aaacagtcat aacaagccat 3600
gaaaaccgcc actgcgccgt taccaccgct gcgttcggtc aaggttctgg accagttgcg 3660gaaaaccgcc actgcgccgt taccaccgct gcgttcggtc aaggttctgg accagttgcg 3660
tgagcgcata cgctacttgc attacagttt acgaaccgaa caggcttatg tcaactgggt 3720tgagcgcata cgctacttgc attacagttt acgaaccgaa caggcttatg tcaactgggt 3720
tcgtgccttc atccgtttcc acggtgtgcg ctgcacttga acgtgtggcc taatgagggg 3780tcgtgccttc atccgtttcc acggtgtgcg ctgcacttga acgtgtggcc taatgagggg 3780
atcaattctc tagagctcgc tgatcagaag ttcctattct ctagaaagta taggaacttc 3840atcaattctc tagagctcgc tgatcagaag ttcctattct ctagaaagta taggaacttc 3840
gatggcgcct catccctgaa gccaataggg ataacagggt aatgatcgga tcccgggccc 3900gatggcgcct catccctgaa gccaataggg ataacagggt aatgatcgga tcccgggccc 3900
gtcgactgca gaggcctgc 3919gtcgactgca gaggcctgc 3919
<210> 70<210> 70
<211> 2850<211> 2850
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Экспрессионная кассета <Ptet-slr1975-FRT-cat-FRT><223> Expression cassette <Ptet-slr1975-FRT-cat-FRT>
<400> 70<400> 70
atgcataatc gaaattaata cgactcacta taggggaatt gattctggta ccaaatgagt 60atgcataatc gaaattaata cgactcacta taggggaatt gattctggta ccaaatgagt 60
cgaccggcca gatgattaat tcctaatttt tgttgacact ctatcattga tagagttatt 120cgaccggcca gatgattaat tcctaatttt tgttgacact ctatcattga tagagttatt 120
ttaccactcc ctatcagtga tagagaaaag tgaaatgaat agttcgacaa aaatctagaa 180ttaccactcc ctatcagtga tagagaaaag tgaaatgaat agttcgacaa aaatctagaa 180
ataattttgt ttaactttaa gaaggagata tacaaatgat cgctcaccgt cgtcaggaac 240ataattttgt ttaactttaa gaaggagata tacaaatgat cgctcaccgt cgtcaggaac 240
tggctcaaca gtattatcag gctctgcacc aagatgtgct gccgttctgg gaaaagtatt 300tggctcaaca gtattatcag gctctgcacc aagatgtgct gccgttctgg gaaaagtatt 300
cgctggatcg tcaaggcggt ggctatttta cctgcctgga ccgcaagggt caggtttttg 360cgctggatcg tcaaggcggt ggctatttta cctgcctgga ccgcaagggt caggtttttg 360
atacggacaa gttcatttgg ctgcaaaacc gtcaagtgtg gcaatttgcg gttttctaca 420atacggacaa gttcatttgg ctgcaaaacc gtcaagtgtg gcaatttgcg gttttctaca 420
atcgcctgga accgaaaccg cagtggctgg aaatcgctcg tcatggtgcg gattttctgg 480atcgcctgga accgaaaccg cagtggctgg aaatcgctcg tcatggtgcg gattttctgg 480
cacgtcacgg tcgtgatcag gacggtaact ggtatttcgc cctggatcag gaaggcaaac 540cacgtcacgg tcgtgatcag gacggtaact ggtatttcgc cctggatcag gaaggcaaac 540
cgctgcgcca accgtacaat gtgttttccg actgtttcgc ggcgatggcg tttagccagt 600cgctgcgcca accgtacaat gtgttttccg actgtttcgc ggcgatggcg tttagccagt 600
atgcactggc ttctggtgct caagaagcga aggccattgc actgcaagcg tataacaatg 660atgcactggc ttctggtgct caagaagcga aggccattgc actgcaagcg tataacaatg 660
ttctgcgtcg ccagcataac ccgaaaggtc aatatgaaaa gagttacccg ggtacccgtc 720ttctgcgtcg ccagcataac ccgaaaggtc aatatgaaaa gagttacccg ggtacccgtc 720
cgctgaaatc cctggcagtg ccgatgatcc tggctaatct gacgctggaa atggaatggc 780cgctgaaatc cctggcagtg ccgatgatcc tggctaatct gacgctggaa atggaatggc 780
tgctgccgcc gaccacggtc gaagaagtgc tggcccagac cgttcgtgaa gtcatgacgg 840tgctgccgcc gaccacggtc gaagaagtgc tggcccagac cgttcgtgaa gtcatgacgg 840
attttctgga cccggaaatt ggcctgatgc gcgaagcagt taccccgacg ggtgaatttg 900attttctgga cccggaaatt ggcctgatgc gcgaagcagt taccccgacg ggtgaatttg 900
tcgattcatt cgaaggccgc ctgctgaacc cgggtcatgg cattgaagcg atgtggttta 960tcgattcatt cgaaggccgc ctgctgaacc cgggtcatgg cattgaagcg atgtggttta 960
tgatggatat tgcccagcgt tcgggtgacc gccagctgca agaacaggct attgcggtgg 1020tgatggatat tgcccagcgt tcgggtgacc gccagctgca agaacaggct attgcggtgg 1020
ttctgaatac cctggaatat gcatgggatg aagaatttgg tggcatcttt tacttcctgg 1080ttctgaatac cctggaatat gcatgggatg aagaatttgg tggcatcttt tacttcctgg 1080
accgtcaagg tcacccgccg cagcaactgg aatgggatca gaaactgtgg tgggtccatc 1140accgtcaagg tcacccgccg cagcaactgg aatgggatca gaaactgtgg tgggtccatc 1140
tggaaaccct ggtggccctg gcaaaaggtc accaggcgac gggccaagaa aagtgctggc 1200tggaaaccct ggtggccctg gcaaaaggtc accaggcgac gggccaagaa aagtgctggc 1200
agtggtttga acgcgtgcat gattatgcat ggagccactt tgctgacccg gaatatggtg 1260agtggtttga acgcgtgcat gattatgcat ggagccactt tgctgacccg gaatatggtg 1260
aatggttcgg ctacctgaac cgtcgcggtg aagtgctgct gaatctgaaa ggtggcaaat 1320aatggttcgg ctacctgaac cgtcgcggtg aagtgctgct gaatctgaaa ggtggcaaat 1320
ggaagggctg cttccacgtt ccgcgtgcgc tgtggctgtg tgccgaaacc ctgcaactgc 1380ggaagggctg cttccacgtt ccgcgtgcgc tgtggctgtg tgccgaaacc ctgcaactgc 1380
cggtctctta aaataactag cataacccct tggggcctct aaacgggtct tgaggggttt 1440cggtctctta aaataactag cataacccct tggggcctct aaacggggtct tgaggggttt 1440
tttgctgaaa ccaatttgcc tggcggcagt agcgcggtgg tcccacctga ccccatgccg 1500tttgctgaaa ccaatttgcc tggcggcagt agcgcggtgg tcccacctga ccccatgccg 1500
aactcagaag tgaaacgccg tagcgccgat ggtagtgtgg ggtctcccca tgcgagagta 1560aactcagaag tgaaacgccg tagcgccgat ggtagtgtgg ggtctcccca tgcgagagta 1560
gggaactgcc aggcatcaaa taaaacgaaa ggctcagtcg aaagactggg cctttcggga 1620gggaactgcc aggcatcaaa taaaacgaaa ggctcagtcg aaagactggg cctttcggga 1620
tccaggccgg cctgttaacg aattaatctt ccgcggcggt atcgataagc ttgatatcga 1680tccaggccgg cctgttaacg aattaatctt ccgcggcggt atcgataagc ttgatatcga 1680
ggctgacatg ggaattagcc atggtccata tgaatatcct ccttagttcc tattccgaag 1740ggctgacatg ggaattagcc atggtccata tgaatatcct ccttagttcc tattccgaag 1740
ttcctattct ctagaaagta taggaacttc ggcgcgccta cctgtgacgg aagatcactt 1800ttcctattct ctagaaagta taggaacttc ggcgcgccta cctgtgacgg aagatcactt 1800
cgcagaataa ataaatcctg gtgtccctgt tgataccggg aagccctggg ccaacttttg 1860cgcagaataa ataaatcctg gtgtccctgt tgataccggg aagccctggg ccaacttttg 1860
gcgaaaatga gacgttgatc ggcacgtaag aggttccaac tttcaccata atgaaataag 1920gcgaaaatga gacgttgatc ggcacgtaag aggttccaac tttcaccata atgaaataag 1920
atcactaccg ggcgtatttt ttgagttgtc gagattttca ggagctaagg aagctaaaat 1980atcactaccg ggcgtatttt ttgagttgtc gagatttca ggagctaagg aagctaaaat 1980
ggagaaaaaa atcactggat ataccaccgt tgatatatcc caatggcatc gtaaagaaca 2040ggagaaaaaa atcactggat ataccaccgt tgatatatcc caatggcatc gtaaagaaca 2040
ttttgaggca tttcagtcag ttgctcaatg tacctataac cagaccgttc agctggatat 2100ttttgaggca tttcagtcag ttgctcaatg tacctataac cagaccgttc agctggatat 2100
tacggccttt ttaaagaccg taaagaaaaa taagcacaag ttttatccgg cctttattca 2160tacggccttt ttaaagaccg taaagaaaaa taagcacaag ttttatccgg cctttattca 2160
cattcttgcc cgcctgatga atgctcatcc ggaattacgt atggcaatga aagacggtga 2220cattcttgcc cgcctgatga atgctcatcc ggaattacgt atggcaatga aagacggtga 2220
gctggtgata tgggatagtg ttcacccttg ttacaccgtt ttccatgagc aaactgaaac 2280gctggtgata tgggatagtg ttcacccttg ttacaccgtt ttccatgagc aaactgaaac 2280
gttttcatcg ctctggagtg aataccacga cgatttccgg cagtttctac acatatattc 2340gttttcatcg ctctggagtg aataccacga cgatttccgg cagtttctac acatatattc 2340
gcaagatgtg gcgtgttacg gtgaaaacct ggcctatttc cctaaagggt ttattgagaa 2400gcaagatgtg gcgtgttacg gtgaaaacct ggcctatttc cctaaagggt ttattgagaa 2400
tatgtttttc gtctcagcca atccctgggt gagtttcacc agttttgatt taaacgtggc 2460tatgtttttc gtctcagcca atccctgggt gagtttcacc agttttgatt taaacgtggc 2460
caatatggac aacttcttcg cccccgtttt caccatgggc aaatattata cgcaaggcga 2520caatatggac aacttcttcg cccccgtttt caccatgggc aaatattata cgcaaggcga 2520
caaggtgctg atgccgctgg cgattcaggt tcatcatgcc gtttgtgatg gcttccatgt 2580caaggtgctg atgccgctgg cgattcaggt tcatcatgcc gtttgtgatg gcttccatgt 2580
cggcagatgc ttaatgaata caacagtact gcgatgagtg gcagggcggg gcgtaaggcg 2640cggcagatgc ttaatgaata caacagtact gcgatgagtg gcagggcggg gcgtaaggcg 2640
cgccatttaa atgaagttcc tattccgaag ttcctattct ctagaaagta taggaacttc 2700cgccatttaa atgaagttcc tattccgaag ttcctattct ctagaaagta taggaacttc 2700
gaagcagctc cagcctacac aatcgctcaa gacgtgtaat gctgcaatct gcatgcaagc 2760gaagcagctc cagcctacac aatcgctcaa gacgtgtaat gctgcaatct gcatgcaagc 2760
ttggcactgg cgatggcgcc tcatccctga agccaatagg gataacaggg taatgatcgg 2820ttggcactgg cgatggcgcc tcatccctga agccaatagg gataacaggg taatgatcgg 2820
atcccgggcc cgtcgactgc agaggcctgc 2850atcccgggcc cgtcgactgc agaggcctgc 2850
<210> 71<210> 71
<211> 4360<211> 4360
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета <Ptet-neuBC-FRT-kan-FRT><223> Expression cassette <Ptet-neuBC-FRT-kan-FRT>
<400> 71<400> 71
ggtacccaaa tatgcataat cgaaattaat acgactcact ataggggaat tgattctggt 60ggtacccaaa tatgcataat cgaaattaat acgactcact ataggggaat tgattctggt 60
accaaatgag tcgaccggcc agatgattaa ttcctaattt ttgttgacac tctatcattg 120accaaatgag tcgaccggcc agatgattaa ttcctaattt ttgttgacac tctatcattg 120
atagagttat tttaccactc cctatcagtg atagagaaaa gtgaaatgaa tagttcgaca 180atagagttat tttaccactc cctatcagtg atagagaaaa gtgaaatgaa tagttcgaca 180
aaaatctaga aataattttg tttaacttta agaaggagat atacaaatga aagaaatcaa 240aaaatctaga aataattttg tttaacttta agaaggagat atacaaatga aagaaatcaa 240
aatccagaac atcatcatca gcgaagaaaa agcgccgctg gttgtgccgg aaatcggcat 300aatccagaac atcatcatca gcgaagaaaa agcgccgctg gttgtgccgg aaatcggcat 300
taaccataat ggtagtctgg aactggcaaa aatcatggtg gatgcggcct ttagcgccgg 360taaccataat ggtagtctgg aactggcaaa aatcatggtg gatgcggcct ttagcgccgg 360
tgcaaaaatc attaaacatc agacccacat tgtggaagat gaaatgtcta aagcagcgaa 420tgcaaaaatc attaaacatc agacccacat tgtggaagat gaaatgtcta aagcagcgaa 420
aaaagttatc ccgggcaacg cgaaaatcag tatctacgaa atcatgcaga aatgcgcgct 480aaaagttatc ccgggcaacg cgaaaatcag tatctacgaa atcatgcaga aatgcgcgct 480
ggattacaaa gatgaactgg ccctgaaaga atataccgaa aaactgggtc tggtgtacct 540ggattacaaa gatgaactgg ccctgaaaga atataccgaa aaactgggtc tggtgtacct 540
gtctaccccg tttagtcgtg cgggtgcaaa ccgtctggaa gatatgggtg ttagtgcgtt 600gtctaccccg tttagtcgtg cgggtgcaaa ccgtctggaa gatatgggtg ttagtgcgtt 600
caaaatcggc agcggtgaat gtaacaatta tccgctgatc aaacatattg ccgcatttaa 660caaaatcggc agcggtgaat gtaacaatta tccgctgatc aaacatattg ccgcatttaa 660
aaaaccgatg attgttagca ccggcatgaa tagcatcgaa tctattaaac cgacggtgaa 720aaaaccgatg attgttagca ccggcatgaa tagcatcgaa tctattaaac cgacggtgaa 720
aatcctgctg gataacgaaa ttccgtttgt tctgatgcat accacgaatc tgtacccgac 780aatcctgctg gataacgaaa ttccgtttgt tctgatgcat accacgaatc tgtacccgac 780
cccgcacaac ctggtgcgtc tgaatgccat gctggaactg aaaaaagaat tctcttgcat 840cccgcacaac ctggtgcgtc tgaatgccat gctggaactg aaaaaagaat tctcttgcat 840
ggttggtctg agtgatcaca ccacggataa tctggcatgc ctgggtgcag tggttctggg 900ggttggtctg agtgatcaca ccacggataa tctggcatgc ctgggtgcag tggttctggg 900
tgcgtgtgtg ctggaacgtc atttcaccga tagcatgcac cgctctggtc cggatattgt 960tgcgtgtgtg ctggaacgtc atttcaccga tagcatgcac cgctctggtc cggatattgt 960
ttgtagtatg gatacgaaag cactgaaaga actgatcatt cagagcgaac agatggcgat 1020ttgtagtatg gatacgaaag cactgaaaga actgatcatt cagagcgaac agatggcgat 1020
cattcgcggc aacaatgaat ctaaaaaagc ggccaaacag gaacaggtga ccatcgattt 1080cattcgcggc aacaatgaat ctaaaaaagc ggccaaacag gaacaggtga ccatcgattt 1080
tgcattcgcg agtgtggtta gcatcaaaga tatcaaaaaa ggcgaagtgc tgagcatgga 1140tgcattcgcg agtgtggtta gcatcaaaga tatcaaaaaa ggcgaagtgc tgagcatgga 1140
taatatttgg gttaaacgtc cgggtctggg cggtatctct gcagcggaat ttgaaaacat 1200taatatttgg gttaaacgtc cgggtctggg cggtatctct gcagcggaat ttgaaaacat 1200
tctgggcaaa aaagcactgc gcgatattga aaatgatgcg cagctgtctt atgaagattt 1260tctgggcaaa aaagcactgc gcgatattga aaatgatgcg cagctgtctt atgaagattt 1260
cgcctaatcg aaggagatac aaccatgaag aaaattctgt ttatcaccgg cagccgtgca 1320cgcctaatcg aaggagatac aaccatgaag aaaattctgt ttatcaccgg cagccgtgca 1320
gattacagta aaatcaaaag cctgatgtac cgcgtgcaga acagctctga atttgaactg 1380gattacagta aaatcaaaag cctgatgtac cgcgtgcaga acagctctga atttgaactg 1380
tatattttcg cgaccggcat gcatctgagc aaaaacttcg gttacacggt taaagaactg 1440tatattttcg cgaccggcat gcatctgagc aaaaacttcg gttacacggt taaagaactg 1440
tacaaaaacg gtttcaaaaa catctacgaa ttcatcaact acgataaata ctaccagacc 1500tacaaaaacg gtttcaaaaa catctacgaa ttcatcaact acgataaata ctaccagacc 1500
gataaagcgc tggccaccac gatcgatggt ttcagccgtt acgccaatga actgaaaccg 1560gataaagcgc tggccaccac gatcgatggt ttcagccgtt acgccaatga actgaaaccg 1560
gatctgattg tggttcacgg cgatcgtatt gaaccgctgg cggcggcaat tgtgggtgca 1620gatctgattg tggttcacgg cgatcgtatt gaaccgctgg cggcggcaat tgtgggtgca 1620
ctgaacaata ttctggttgc gcatatcgaa ggcggtgaaa tttctggtac gatcgatgat 1680ctgaacaata ttctggttgc gcatatcgaa ggcggtgaaa tttctggtac gatcgatgat 1680
agtctgcgtc acgcaattag caaactggcg catatccacc tggtgaacga tgaatttgcg 1740agtctgcgtc acgcaattag caaactggcg catatccacc tggtgaacga tgaatttgcg 1740
aaacgtcgcc tgatgcagct gggcgaagat gaaaaatcta tcttcatcat cggtagtccg 1800aaacgtcgcc tgatgcagct gggcgaagat gaaaaatcta tcttcatcat cggtagtccg 1800
gatctggaac tgctgaacga taataaaatc agcctgtctg aagcgaaaaa atactacgat 1860gatctggaac tgctgaacga taataaaatc agcctgtctg aagcgaaaaa atactacgat 1860
atcaactacg aaaactacgc cctgctgatg tttcatccgg ttaccacgga aattaccagt 1920atcaactacg aaaactacgc cctgctgatg tttcatccgg ttaccacgga aattaccagt 1920
atcaaaaacc aggccgataa tctggtgaaa gcactgatcc agagcaacaa aaactacatc 1980atcaaaaacc aggccgataa tctggtgaaa gcactgatcc agagcaacaa aaactacatc 1980
gttatctacc cgaacaacga tctgggcttt gaactgattc tgcagtctta tgaagaattc 2040gttatctacc cgaacaacga tctgggcttt gaactgattc tgcagtctta tgaagaattc 2040
aaaaacaatc cgcgttttaa actgttcccg agtctgcgct ttgaatactt cattaccctg 2100aaaaacaatc cgcgttttaa actngttcccg agtctgcgct ttgaatactt cattaccctg 2100
ctgaaaaacg ccgattttat tatcggtaat agtagctgca tcctgaaaga agcgctgtat 2160ctgaaaaacg ccgattttat tatcggtaat agtagctgca tcctgaaaga agcgctgtat 2160
ctgaaaacgg ccggcattct ggtgggtagc cgtcagaatg gtcgtctggg taacgaaaat 2220ctgaaaacgg ccggcattct ggtgggtagc cgtcagaatg gtcgtctggg taacgaaaat 2220
accctgaaag ttaacgccaa ctctgatgaa atcctgaaag caatcaacac gatccacaaa 2280accctgaaag ttaacgccaa ctctgatgaa atcctgaaag caatcaacac gatccacaaa 2280
aaacaggatc tgtttagcgc aaaactggaa attctggatt ctagtaaact gtttttcgaa 2340aaacaggatc tgtttagcgc aaaactggaa attctggatt ctagtaaact gtttttcgaa 2340
tatctgcaga gcggcgattt ctttaaactg tctacccaga aagtgttcaa agatattaaa 2400tatctgcaga gcggcgattt ctttaaactg tctacccaga aagtgttcaa agatattaaa 2400
taaaataact agcataaccc cttggggcct ctaaacgggt cttgaggggt tttttgctga 2460taaaataact agcataaccc cttggggcct ctaaacggggt cttgaggggt tttttgctga 2460
aaccaatttg cctggcggca gtagcgcggt ggtcccacct gaccccatgc cgaactcaga 2520aaccaatttg cctggcggca gtagcgcggt ggtcccacct gaccccatgc cgaactcaga 2520
agtgaaacgc cgtagcgccg atggtagtgt ggggtctccc catgcgagag tagggaactg 2580agtgaaacgc cgtagcgccg atggtagtgt ggggtctccc catgcgagag tagggaactg 2580
ccaggcatca aataaaacga aaggctcagt cgaaagactg ggcctttcgg gatccaggcc 2640ccaggcatca aataaaacga aaggctcagt cgaaagactg ggcctttcgg gatccaggcc 2640
ggcctgttaa cgaattaatc ttccgcggcg gtatcgataa gcttgatatc gaggctgaca 2700ggcctgttaa cgaattaatc ttccgcggcg gtatcgataa gcttgatatc gaggctgaca 2700
tgggaattag ccatggtcca tatgaatatc ctccttagtt cctattccga agttcctatt 2760tgggaattag ccatggtcca tatgaatatc ctccttagtt cctattccga agttcctatt 2760
ctctagaaag tataggaact tcggcgcgtc ctacctgtga cacgcgtcaa gatcccctca 2820ctctagaaag tataggaact tcggcgcgtc ctacctgtga cacgcgtcaa gatcccctca 2820
cgctgccgca agcactcagg gcgcaagggc tgctaaagga agcggaacac gtagaaagcc 2880cgctgccgca agcactcagg gcgcaagggc tgctaaagga agcggaacac gtagaaagcc 2880
agtccgcaga aacggtgctg accccggatg aatgtcagct actgggctat ctggacaagg 2940agtccgcaga aacggtgctg accccggatg aatgtcagct actgggctat ctggacaagg 2940
gaaaacgcaa gcgcaaagag aaagcaggta gcttgcagtg ggcttacatg gcgatagcta 3000gaaaacgcaa gcgcaaagag aaagcaggta gcttgcagtg ggcttacatg gcgatagcta 3000
gactgggcgg ttttatggac agcaagcgaa ccggaattgc cagctggggc gccctctggt 3060gactgggcgg ttttatggac agcaagcgaa ccggaattgc cagctggggc gccctctggt 3060
aaggttggga agccctgcaa agtaaactgg atggctttct tgccgccaag gatctgatgg 3120aaggttggga agccctgcaa agtaaactgg atggctttct tgccgccaag gatctgatgg 3120
cgcaggggat caagatctga tcaagagaca ggatgaggat cgtttcgcat gattgaacaa 3180cgcaggggat caagatctga tcaagagaca ggatgaggat cgtttcgcat gattgaacaa 3180
gatggattgc acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg 3240gatggattgc acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg 3240
gcacaacaga caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc 3300gcacaacaga caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc 3300
ccggttcttt ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca 3360ccggttcttt ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca 3360
gcgcggctat cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc 3420gcgcggctat cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc 3420
actgaagcgg gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca 3480actgaagcgg gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca 3480
tctcaccttg ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat 3540tctcaccttg ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat 3540
acgcttgatc cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca 3600acgcttgatc cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca 3600
cgtactcgga tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg 3660cgtactcgga tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg 3660
ctcgcgccag ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc 3720ctcgcgccag ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc 3720
gtcgtgaccc atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct 3780gtcgtgaccc atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct 3780
ggattcatcg actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct 3840ggattcatcg actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct 3840
acccgtgata ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac 3900acccgtgata ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac 3900
ggtatcgccg ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc 3960ggtatcgccg ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc 3960
tgagcgggac tctggggttc gaaatgaccg accaagcgac gcccaacctg ccatcacgag 4020tgagcgggac tctggggttc gaaatgaccg accaagcgac gcccaacctg ccatcacgag 4020
atttcgattc caccgccgcc ttctatgaaa ggttgggctt cggaatcgtt ttccgggacg 4080atttcgattc caccgccgcc ttctatgaaa ggttgggctt cggaatcgtt ttccgggacg 4080
ccggctggat gatcctccag cgcggggatc tcatgctgga gttcttcgcc caccccagct 4140ccggctggat gatcctccag cgcggggatc tcatgctgga gttcttcgcc caccccagct 4140
tcaaaagcgc tctcggtacc ggcagggcgg ggcgtaaggc gcgccattta aatgaagaag 4200tcaaaagcgc tctcggtacc ggcagggcgg ggcgtaaggc gcgccattta aatgaagaag 4200
ttcctattcc gaagttccta ttctctagaa agtataggaa cttcgaagca gctccagcct 4260ttcctattcc gaagttccta ttctctagaa agtataggaa cttcgaagca gctccagcct 4260
acacaatcgc tcaagacgtg taatgctgca atctgcatgc aagcttggca ctggcgatgg 4320acacaatcgc tcaagacgtg taatgctgca atctgcatgc aagcttggca ctggcgatgg 4320
cgcctcatcc ctgaagccaa tagggataac agggtaatga 4360cgcctcatcc ctgaagccaa tagggataac agggtaatga 4360
<210> 72<210> 72
<211> 3981<211> 3981
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Экспрессионная кассета <Ptet-ppsA-FRT-aad1-FRT><223> Expression cassette <Ptet-ppsA-FRT-aad1-FRT>
<400> 72<400> 72
attctggtac caaatgagtc gaccggccag atgattaatt cctaattttt gttgacactc 60attctggtac caaatgagtc gaccggccag atgattaatt cctaattttt gttgacactc 60
tatcattgat agagttattt taccactccc tatcagtgat agagaaaagt gaaatgaata 120tatcattgat agagttattt taccactccc tatcagtgat agagaaaagt gaaatgaata 120
gttcgacaaa aatctagaaa taattttgtt tggcgtcgag aaggagatag aaccatgtcc 180gttcgacaaa aatctagaaa taattttgtt tggcgtcgag aaggagatag aaccatgtcc 180
aacaatggct cgtcaccgct ggtgctttgg tataaccaac tcggcatgaa tgatgtagac 240aacaatggct cgtcaccgct ggtgctttgg tataaccaac tcggcatgaa tgatgtagac 240
agggttgggg gcaaaaatgc ctccctgggt gaaatgatta ctaacctttc cggaatgggt 300agggttgggg gcaaaaatgc ctccctgggt gaaatgatta ctaacctttc cggaatgggt 300
gtttccgttc cgaatggttt cgccacaacc gccgacgcgt ttaaccagtt tctggaccaa 360gtttccgttc cgaatggttt cgccacaacc gccgacgcgt ttaaccagtt tctggaccaa 360
agcggcgtaa accagcgcat ttatgaactg ctggataaaa cggatattga cgatgttact 420agcggcgtaa accagcgcat ttatgaactg ctggataaaa cggatattga cgatgttact 420
cagcttgcga aagcgggcgc gcaaatccgc cagtggatta tcgacactcc cttccagcct 480cagcttgcga aagcgggcgc gcaaatccgc cagtggatta tcgacactcc cttccagcct 480
gagctggaaa acgccatcag cgaagcctat gcacagcttt ctgccgatga cgaaaacgcc 540gagctggaaa acgccatcag cgaagcctat gcacagcttt ctgccgatga cgaaaacgcc 540
tcttttgcgg tgcgctcctc cgccaccgca gaagatatgc cggacgcttc ttttgccggt 600tcttttgcgg tgcgctcctc cgccaccgca gaagatatgc cggacgcttc ttttgccggt 600
cagcaggaaa ccttcctcaa cgttcagggt tttgacgccg ttctcgtggc agtgaaacat 660cagcaggaaa ccttcctcaa cgttcagggt tttgacgccg ttctcgtggc agtgaaacat 660
gtatttgctt ctctgtttaa cgatcgcgcc atctcttatc gtgtgcacca gggttacgat 720gtatttgctt ctctgtttaa cgatcgcgcc atctcttatc gtgtgcacca gggttacgat 720
caccgtggtg tggcgctctc cgccggtgtt caacggatgg tgcgctctga cctcgcatca 780caccgtggtg tggcgctctc cgccggtgtt caacggatgg tgcgctctga cctcgcatca 780
tctggcgtga tgttctccat tgataccgaa tccggctttg accaggtggt gtttatcact 840tctggcgtga tgttctccat tgataccgaa tccggctttg accaggtggt gtttatcact 840
tccgcatggg gccttggtga gatggtcgtg cagggtgcgg ttaacccgga tgagttttac 900tccgcatggg gccttggtga gatggtcgtg cagggtgcgg ttaacccgga tgagttttac 900
gtgcataaac cgacactggc ggcgaatcgc ccggctatcg tgcgccgcac catggggtcg 960gtgcataaac cgacactggc ggcgaatcgc ccggctatcg tgcgccgcac catggggtcg 960
aaaaaaatcc gcatggttta cgcgccgacc caggagcacg gcaagcaggt taaaatcgaa 1020aaaaaaatcc gcatggttta cgcgccgacc caggagcacg gcaagcaggt taaaatcgaa 1020
gacgtaccgc aggaacagcg tgacatcttc tcgctgacca acgaagaagt gcaggaactg 1080gacgtaccgc aggaacagcg tgacatcttc tcgctgacca acgaagaagt gcaggaactg 1080
gcaaaacagg ccgtacaaat tgagaaacac tacggtcgcc cgatggatat tgagtgggcg 1140gcaaaacagg ccgtacaaat tgagaaacac tacggtcgcc cgatggatat tgagtgggcg 1140
aaagatggcc acaccggtaa actgttcatt gtgcaggcgc gtccggaaac cgtgcgctca 1200aaagatggcc acaccggtaa actngttcatt gtgcaggcgc gtccggaaac cgtgcgctca 1200
cgcggtcagg tcatggagcg ttatacgctg cattcacagg gtaagattat cgccgaaggc 1260cgcggtcagg tcatggagcg ttatacgctg cattcacagg gtaagattat cgccgaaggc 1260
cgtgctatcg gtcatcgcat cggtgcgggt ccggtgaaag tcatccatga tatcagcgaa 1320cgtgctatcg gtcatcgcat cggtgcgggt ccggtgaaag tcatccatga tatcagcgaa 1320
atgaaccgca tcgaacctgg tgacgtgctg gtcactgaca tgaccgaccc ggactgggaa 1380atgaaccgca tcgaacctgg tgacgtgctg gtcactgaca tgaccgaccc ggactgggaa 1380
ccgatcatga agaaagcatc tgccatcgtc accaaccgtg gcggtcgtac ctgtcacgcg 1440ccgatcatga agaaagcatc tgccatcgtc accaaccgtg gcggtcgtac ctgtcacgcg 1440
gcgatcatcg ctcgtgaact gggcattccg gcggtagtgg gctgtggtga tgcaacagaa 1500gcgatcatcg ctcgtgaact gggcattccg gcggtagtgg gctgtggtga tgcaacagaa 1500
cggatgaaag acggtgagaa cgtcactgtt tcttgtgccg aaggtgatac cggttacgtc 1560cggatgaaag acggtgagaa cgtcactgtt tcttgtgccg aaggtgatac cggttacgtc 1560
tatgcggagt tgctggaatt tagcgtgaaa agctccagcg tagaaacgat gccggatctg 1620tatgcggagt tgctggaatt tagcgtgaaa agctccagcg tagaaacgat gccggatctg 1620
ccgttgaaag tgatgatgaa cgtcggtaac ccggaccgag ctttcgactt cgcctgtctg 1680ccgttgaaag tgatgatgaa cgtcggtaac ccggaccgag ctttcgactt cgcctgtctg 1680
ccgaacgaag gcgtgggact tgcgcgtctg gaatttatca tcaaccgtat gattggcgtc 1740ccgaacgaag gcgtgggact tgcgcgtctg gaatttatca tcaaccgtat gattggcgtc 1740
cacccacgcg cactgcttga gtttgacgat caggaaccgc agttgcaaaa cgaaatccgc 1800cacccacgcg cactgcttga gtttgacgat caggaaccgc agttgcaaaa cgaaatccgc 1800
gagatgatga aaggttttga ttctccgcgt gaattttacg ttggtcgtct gactgaaggg 1860gagatgatga aaggttttga ttctccgcgt gaattttacg ttggtcgtct gactgaaggg 1860
atcgcgacgc tgggtgccgc gttttatccg aagcgcgtca ttgtccgtct ctctgatttt 1920atcgcgacgc tgggtgccgc gttttatccg aagcgcgtca ttgtccgtct ctctgatttt 1920
aaatcgaacg aatatgccaa cctggtcggt ggtgagcgtt acgagccaga tgaagagaac 1980aaatcgaacg aatatgccaa cctggtcggt ggtgagcgtt acgagccaga tgaagagaac 1980
ccgatgctcg gcttccgtgg cgcgggacgc tatatttccg acagcttccg cgactgtttc 2040ccgatgctcg gcttccgtgg cgcgggacgc tatatttccg acagcttccg cgactgtttc 2040
gcgctggagt gcgaagcagt gaaacgtgtg cgcaacgaca tggggctgac caacgttgag 2100gcgctggagt gcgaagcagt gaaacgtgtg cgcaacgaca tggggctgac caacgttgag 2100
atcatgatcc cgttcgtgcg aaccgtagat caggcgaaag cggtggttga ggaactggcg 2160atcatgatcc cgttcgtgcg aaccgtagat caggcgaaag cggtggttga ggaactggcg 2160
cgtcaggggc tgaaacgtgg tgagaacggg ctgaaaatca tcatgatgtg tgaaatcccg 2220cgtcaggggc tgaaacgtgg tgagaacggg ctgaaaatca tcatgatgtg tgaaatcccg 2220
tccaacgcct tgctggccga gcagttcctc gaatatttcg acggcttctc aattggctca 2280tccaacgcct tgctggccga gcagttcctc gaatatttcg acggcttctc aattggctca 2280
aacgacatga cgcagctggc gctcggtctg gatcgtgact ccggcgtggt gtctgaactg 2340aacgacatga cgcagctggc gctcggtctg gatcgtgact ccggcgtggt gtctgaactg 2340
ttcgatgagc gcaacgatgc ggtgaaagca ctgctgtcga tggcgattcg tgccgcgaag 2400ttcgatgagc gcaacgatgc ggtgaaagca ctgctgtcga tggcgattcg tgccgcgaag 2400
aaacagggca aatatgtcgg gatttgcggt cagggtccgt ccgaccacga agactttgcc 2460aaacagggca aatatgtcgg gatttgcggt cagggtccgt ccgaccacga agactttgcc 2460
gcatggttga tggaagaggg gatcgatagc ctgtctctga acccggacac cgtggtgcaa 2520gcatggttga tggaagaggg gatcgatagc ctgtctctga acccggacac cgtggtgcaa 2520
acctggttaa gcctggctga actgaagaaa taaaataact agcataaccc cttggggcct 2580acctggttaa gcctggctga actgaagaaa taaaataact agcataaccc cttggggcct 2580
ctaaacgggt cttgaggggt tttttgctga aaccaatttg cctggcggca gtagcgcggt 2640ctaaacgggt cttgaggggt tttttgctga aaccaatttg cctggcggca gtagcgcggt 2640
ggtcccacct gaccccatgc cgaactcaga agtgaaacgc cgtagcgccg atggtagtgt 2700ggtcccacct gaccccatgc cgaactcaga agtgaaacgc cgtagcgccg atggtagtgt 2700
ggggtctccc catgcgagag tagggaactg ccaggcatca aataaaacga aaggctcagt 2760ggggtctccc catgcgagag tagggaactg ccaggcatca aataaaacga aaggctcagt 2760
cgaaagactg ggcctttcgg gatccaggcc ggcctgttaa cgaattaatc ttccgcggcg 2820cgaaagactg ggcctttcgg gatccaggcc ggcctgttaa cgaattaatc ttccgcggcg 2820
gtatcgataa gcttgatatc gaggctgaca tgggaattag ccatggtcca tatgaatatc 2880gtatcgataa gcttgatatc gaggctgaca tgggaattag ccatggtcca tatgaatatc 2880
ctccttagtt cctattccga agttcctatt ctctagaaag tataggaact tccgagctct 2940ctccttagtt cctattccga agttcctatt ctctagaaag tataggaact tccgagctct 2940
agagaatgat ccctaaatgc ttcaataata ttgaaaaagg aagagtatga gggaagcggt 3000agagaatgat ccctaaatgc ttcaataata ttgaaaaagg aagagtatga gggaagcggt 3000
gatcgccgaa gtatcgactc aactatcaga ggtagttggc gtcatcgagc gccatctcga 3060gatcgccgaa gtatcgactc aactatcaga ggtagttggc gtcatcgagc gccatctcga 3060
accgacgttg ctggccgtac atttgtacgg ctccgcagtg gatggcggcc tgaagccaca 3120accgacgttg ctggccgtac atttgtacgg ctccgcagtg gatggcggcc tgaagccaca 3120
cagtgatatt gatttgctgg ttacggtgac cgtaaggctt gatgaaacaa cgcggcgagc 3180cagtgatatt gatttgctgg ttacggtgac cgtaaggctt gatgaaacaa cgcggcgagc 3180
tttgatcaac gaccttttgg aaacttcggc ttcccctgga gagagcgaga ttctccgcgc 3240tttgatcaac gaccttttgg aaacttcggc ttcccctgga gagagcgaga ttctccgcgc 3240
tgtagaagtc accattgttg tgcacgacga catcattccg tggcgttatc cagctaagcg 3300tgtagaagtc accattgttg tgcacgacga catcattccg tggcgttatc cagctaagcg 3300
cgaactgcaa tttggagaat ggcagcgcaa tgacattctt gcaggtatct tcgagccagc 3360cgaactgcaa tttggagaat ggcagcgcaa tgacattctt gcaggtatct tcgagccagc 3360
cacgatcgac attgatctgg ctatcttgct gacaaaagca agagaacata gcgttgcctt 3420cacgatcgac attgatctgg ctatcttgct gacaaaagca agagaacata gcgttgcctt 3420
ggtaggtcca gcggcggagg aactctttga tccggttcct gaacaggatc tatttgaggc 3480ggtaggtcca gcggcggagg aactctttga tccggttcct gaacaggatc tatttgaggc 3480
gctaaatgaa accttaacgc tatggaactc gccgcccgac tgggctggcg atgagcgaaa 3540gctaaatgaa accttaacgc tatggaactc gccgcccgac tgggctggcg atgagcgaaa 3540
tgtagtgctt acgttgtccc gcatttggta cagcgcagta accggcaaaa tcgcgccgaa 3600tgtagtgctt acgttgtccc gcatttggta cagcgcagta accggcaaaa tcgcgccgaa 3600
ggatgtcgct gccgactggg caatggagcg cctgccggcc cagtatcagc ccgtcatact 3660ggatgtcgct gccgactggg caatggagcg cctgccggcc cagtatcagc ccgtcatact 3660
tgaagctaga caggcttatc ttggacaaga agaagatcgc ttggcctcgc gcgcagatca 3720tgaagctaga caggcttatc ttggacaaga agaagatcgc ttggcctcgc gcgcagatca 3720
gttggaagaa tttgtccact acgtgaaagg cgagatcacc aaggtagtcg gcaaataata 3780gttggaagaa tttgtccact acgtgaaagg cgagatcacc aaggtagtcg gcaaataata 3780
gcgggactct gggaatttcg acgacctgca gccaagcgaa gttcctattc cgaagttcct 3840gcgggactct gggaatttcg acgacctgca gccaagcgaa gttcctattc cgaagttcct 3840
attctctaga aagtatagga acttcgaagc agctccagcc tacacaatcg ctcaagacgt 3900attctctaga aagtatagga acttcgaagc agctccagcc tacacaatcg ctcaagacgt 3900
gtaatgctgc aatctgcatg caagcttggc actggcgatg gcgcctcatc cctgaagcca 3960gtaatgctgc aatctgcatg caagcttggc actggcgatg gcgcctcatc cctgaagcca 3960
atagggataa cagggtaatg a 3981atagggataa cagggtaatg a 3981
<210> 73<210> 73
<211> 4097<211> 4097
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Экспрессионный фрагмент <Ptet-neuA-nanT-lox66-kan-lox72><223> Expression fragment <Ptet-neuA-nanT-lox66-kan-lox72>
<400> 73<400> 73
tgagcgattg tgtaggctgg agctgcttgg ccagatgatt aattcctaat ttttgttgac 60tgagcgattg tgtaggctgg agctgcttgg ccagatgatt aattcctaat ttttgttgac 60
actctatcat tgatagagtt attttaccac tccctatcag tgatagagaa aagtgaaatg 120actctatcat tgatagagtt attttaccac tccctatcag tgatagagaa aagtgaaatg 120
aatagttcga caaaaatcta gaaataattt tgtttaactt taagaaggag atatacatga 180aatagttcga caaaaatcta gaaataattt tgtttaactt taagaaggag atatacatga 180
gcctggccat tatcccggca cgtggcggtt ctaaaggcat caaaaacaaa aacctggttc 240gcctggccat tatcccggca cgtggcggtt ctaaaggcat caaaaacaaa aacctggttc 240
tgctgaacaa taaaccgctg atttattaca ccatcaaagc ggccctgaac gccaaaagta 300tgctgaacaa taaaccgctg atttattaca ccatcaaagc ggccctgaac gccaaaagta 300
ttagcaaagt ggttgtgagc tctgattctg atgaaatcct gaactacgca aaaagtcaga 360ttagcaaagt ggttgtgagc tctgattctg atgaaatcct gaactacgca aaaagtcaga 360
acgttgatat cctgaaacgt ccgatcagtc tggcacagga tgataccacg agcgataaag 420acgttgatat cctgaaacgt ccgatcagtc tggcacagga tgataccacg agcgataaag 420
tgctgctgca tgcgctgaaa ttctacaaag attacgaaga tgttgtgttc ctgcagccga 480tgctgctgca tgcgctgaaa ttctacaaag attacgaaga tgttgtgttc ctgcagccga 480
ccagcccgct gcgtacgaat attcacatca acgaagcgtt caacctgtac aaaaacagca 540ccagcccgct gcgtacgaat attcacatca acgaagcgtt caacctgtac aaaaacagca 540
acgcaaacgc gctgatttct gttagtgaat gcgataacaa aatcctgaaa gcgtttgtgt 600acgcaaacgc gctgatttct gttagtgaat gcgataacaa aatcctgaaa gcgtttgtgt 600
gcaatgattg tggcgatctg gccggtattt gtaacgatga atacccgttc atgccgcgcc 660gcaatgattg tggcgatctg gccggtattt gtaacgatga atacccgttc atgccgcgcc 660
agaaactgcc gaaaacctat atgagcaatg gtgccatcta catcctgaaa atcaaagaat 720agaaactgcc gaaaacctat atgagcaatg gtgccatcta catcctgaaa atcaaagaat 720
tcctgaacaa cccgagcttc ctgcagtcta aaacgaaaca tttcctgatg gatgaaagta 780tcctgaacaa cccgagcttc ctgcagtcta aaacgaaaca tttcctgatg gatgaaagta 780
gctctctgga tattgattgc ctggaagatc tgaaaaaagt ggaacagatc tggaaaaaat 840gctctctgga tattgattgc ctggaagatc tgaaaaaagt ggaacagatc tggaaaaaat 840
aagagctcga gtcgaaggag atagaaccat gagtactaca acccagaata tcccgtggta 900aagagctcga gtcgaaggag atagaaccat gagtactaca acccagaata tcccgtggta 900
tcgccatctc aaccgtgcac aatggcgcgc attttccgct gcctggttgg gatatctgct 960tcgccatctc aaccgtgcac aatggcgcgc attttccgct gcctggttgg gatatctgct 960
tgacggtttt gatttcgttt taatcgccct ggtactcacc gaagtacaag gtgaattcgg 1020tgacggtttt gatttcgttt taatcgccct ggtactcacc gaagtacaag gtgaattcgg 1020
gctgacgacg gtgcaggcgg caagtctgat ctctgcagcc tttatctctc gctggttcgg 1080gctgacgacg gtgcaggcgg caagtctgat ctctgcagcc tttatctctc gctggttcgg 1080
cggcctgatg ctcggcgcta tgggtgaccg ctacgggcgt cgtctggcaa tggtcaccag 1140cggcctgatg ctcggcgcta tgggtgaccg ctacgggcgt cgtctggcaa tggtcaccag 1140
catcgttctc ttctcggccg ggacgctggc ctgcggcttt gcgccaggct acatcaccat 1200catcgttctc ttctcggccg ggacgctggc ctgcggcttt gcgccaggct acatcaccat 1200
gtttatcgct cgtctggtca tcggcatggg gatggcgggt gaatacggtt ccagcgccac 1260gtttatcgct cgtctggtca tcggcatggg gatggcgggt gaatacggtt ccagcgccac 1260
ctatgtcatt gaaagctggc caaaacatct gcgtaacaaa gccagtggtt ttttgatttc 1320ctatgtcatt gaaagctggc caaaacatct gcgtaacaaa gccagtggtt ttttgatttc 1320
aggcttctct gtgggggccg tcgttgccgc tcaggtctat agcctggtgg ttccggtctg 1380aggcttctct gtggggggccg tcgttgccgc tcaggtctat agcctggtgg ttccggtctg 1380
gggctggcgt gcgctgttct ttatcggcat tttgccaatc atctttgctc tctggctgcg 1440gggctggcgt gcgctgttct ttatcggcat tttgccaatc atctttgctc tctggctgcg 1440
taaaaacatc ccggaagcgg aagactggaa agagaaacac gcaggtaaag caccagtacg 1500taaaaacatc ccggaagcgg aagactggaa agagaaacac gcaggtaaag caccagtacg 1500
cacaatggtg gatattctct accgtggtga acatcgcatt gccaatatcg taatgacact 1560cacaatggtg gatattctct accgtggtga acatcgcatt gccaatatcg taatgacact 1560
ggcggcggct actgcgctgt ggttctgctt cgccggtaac ctgcaaaatg ccgcgatcgt 1620ggcggcggct actgcgctgt ggttctgctt cgccggtaac ctgcaaaatg ccgcgatcgt 1620
cgctgttctt gggctgttat gcgccgcaat ctttatcagc tttatggtgc agagtgcagg 1680cgctgttctt gggctgttat gcgccgcaat ctttatcagc tttatggtgc agagtgcagg 1680
caaacgctgg ccaacgggcg taatgctgat ggtggtcgtg ttgtttgctt tcctctactc 1740caaacgctgg ccaacgggcg taatgctgat ggtggtcgtg ttgtttgctt tcctctactc 1740
atggccgatt caggcgctgc tgccaacgta tctgaaaacc gatctggctt ataacccgca 1800atggccgatt caggcgctgc tgccaacgta tctgaaaacc gatctggctt ataacccgca 1800
tactgtagcc aatgtgctgt tctttagtgg ctttggcgcg gcggtgggat gctgcgtagg 1860tactgtagcc aatgtgctgt tctttagtgg ctttggcgcg gcggtgggat gctgcgtagg 1860
tggcttcctc ggtgactggc tgggaacccg caaagcgtac gtttgtagcc tgctggcctc 1920tggcttcctc ggtgactggc tgggaacccg caaagcgtac gtttgtagcc tgctggcctc 1920
gcagctgctg attattccgg tatttgcgat tggcggcgca aacgtctggg tgctcggtct 1980gcagctgctg attattccgg tatttgcgat tggcggcgca aacgtctggg tgctcggtct 1980
gttactgttc ttccagcaaa tgcttggaca agggatcgcc gggatcttac caaaactgat 2040gttactgttc ttccagcaaa tgcttggaca agggatcgcc gggatcttac caaaactgat 2040
tggcggttat ttcgataccg accagcgtgc agcgggcctg ggctttacct acaacgttgg 2100tggcggttat ttcgataccg accagcgtgc agcgggcctg ggctttacct acaacgttgg 2100
cgcattgggc ggtgcactgg ccccaatcat cggcgcgttg atcgctcaac gtctggatct 2160cgcattgggc ggtgcactgg ccccaatcat cggcgcgttg atcgctcaac gtctggatct 2160
gggtactgcg ctggcatcgc tctcgttcag tctgacgttc gtggtgatcc tgctgattgg 2220gggtactgcg ctggcatcgc tctcgttcag tctgacgttc gtggtgatcc tgctgattgg 2220
gctggatatg ccttctcgcg ttcagcgttg gttgcgcccg gaagcgttgc gtactcatga 2280gctggatatg ccttctcgcg ttcagcgttg gttgcgcccg gaagcgttgc gtactcatga 2280
cgctatcgac ggtaaaccat tcagcggtgc cgtgccgttt ggcagcgcca aaaacgattt 2340cgctatcgac ggtaaaccat tcagcggtgc cgtgccgttt ggcagcgcca aaaacgattt 2340
agtcaaaacc aaaagttaat aaatcgatac tagcataacc ccttggggcc tctaaacgcg 2400agtcaaaacc aaaagttaat aaatcgatac tagcataacc ccttggggcc tctaaacgcg 2400
tcgacacgca aaaaggccat ccgtcaggat ggccttctgc ttaatttgat gcctggcagt 2460tcgacacgca aaaaggccat ccgtcaggat ggccttctgc ttaatttgat gcctggcagt 2460
ttatggcggg cgtcctgccc gccaccctcc gggccgttgc ttcgcaacgt tcaaatccgc 2520ttatggcggg cgtcctgccc gccaccctcc gggccgttgc ttcgcaacgt tcaaatccgc 2520
tcccggcgga tttgtcctac tcaggagagc gttcaccgac aaacaacaga taaaacgaaa 2580tcccggcgga tttgtcctac tcaggagagc gttcaccgac aaacaacaga taaaacgaaa 2580
ggcccagtct ttcgactgag cctttcgttt tatttgatgc ctggcagttc cctactctcg 2640ggcccagtct ttcgactgag cctttcgttt tatttgatgc ctggcagttc cctactctcg 2640
catggggaga ccccacacta ccatccggta tcgataagct tgatggcgaa agggggatgt 2700catggggaga ccccacacta ccatccggta tcgataagct tgatggcgaa agggggatgt 2700
gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg 2760gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg 2760
acggccagtg aattcgagct cggtacctac cgttcgtata atgtatgcta tacgaagtta 2820acggccagtg aattcgagct cggtacctac cgttcgtata atgtatgcta tacgaagtta 2820
tcgagctcta gagaatgatc ccctccctca cgctgccgca agcactcagg gcgcaagggc 2880tcgagctcta gagaatgatc ccctccctca cgctgccgca agcactcagg gcgcaagggc 2880
tgctaaagga agcggaacac gtagaaagcc agtccgcaga aacggtgctg accccggatg 2940tgctaaagga agcggaacac gtagaaagcc agtccgcaga aacggtgctg accccggatg 2940
aatgtcagct actgggctat ctggacaagg gaaaacgcaa gcgcaaagag aaagcaggta 3000aatgtcagct actgggctat ctggacaagg gaaaacgcaa gcgcaaagag aaagcaggta 3000
gcttgcagtg ggcttacatg gcgatagcta gactgggcgg ttttatggac agcaagcgaa 3060gcttgcagtg ggcttacatg gcgatagcta gactgggcgg ttttatggac agcaagcgaa 3060
ccggaattgc cagctggggc gccctctggt aaggttggga agccctgcaa agtaaactgg 3120ccggaattgc cagctggggc gccctctggt aaggttggga agccctgcaa agtaaactgg 3120
atggctttct tgccgccaag gatctgatgg cgcaggggat caagatctga tcaagagaca 3180atggctttct tgccgccaag gatctgatgg cgcaggggat caagatctga tcaagagaca 3180
ggatgaggat cgtttcgcat gattgaacaa gatggattgc acgcaggttc tccggccgct 3240ggatgaggat cgtttcgcat gattgaacaa gatggattgc acgcaggttc tccggccgct 3240
tgggtggaga ggctattcgg ctatgactgg gcacaacaga caatcggctg ctctgatgcc 3300tgggtggaga ggctattcgg ctatgactgg gcacaacaga caatcggctg ctctgatgcc 3300
gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt ttgtcaagac cgacctgtcc 3360gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt ttgtcaagac cgacctgtcc 3360
ggtgccctga atgaactgca ggacgaggca gcgcggctat cgtggctggc cacgacgggc 3420ggtgccctga atgaactgca ggacgaggca gcgcggctat cgtggctggc cacgacgggc 3420
gttccttgcg cagctgtgct cgacgttgtc actgaagcgg gaagggactg gctgctattg 3480gttccttgcg cagctgtgct cgacgttgtc actgaagcgg gaagggactg gctgctattg 3480
ggcgaagtgc cggggcagga tctcctgtca tctcaccttg ctcctgccga gaaagtatcc 3540ggcgaagtgc cggggcagga tctcctgtca tctcaccttg ctcctgccga gaaagtatcc 3540
atcatggctg atgcaatgcg gcggctgcat acgcttgatc cggctacctg cccattcgac 3600atcatggctg atgcaatgcg gcggctgcat acgcttgatc cggctacctg cccattcgac 3600
caccaagcga aacatcgcat cgagcgagca cgtactcgga tggaagccgg tcttgtcgat 3660caccaagcga aacatcgcat cgagcgagca cgtactcgga tggaagccgg tcttgtcgat 3660
caggatgatc tggacgaaga gcatcagggg ctcgcgccag ccgaactgtt cgccaggctc 3720caggatgatc tggacgaaga gcatcagggg ctcgcgccag ccgaactgtt cgccaggctc 3720
aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc atggcgatgc ctgcttgccg 3780aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc atggcgatgc ctgcttgccg 3780
aatatcatgg tggaaaatgg ccgcttttct ggattcatcg actgtggccg gctgggtgtg 3840aatatcatgg tggaaaatgg ccgcttttct ggattcatcg actgtggccg gctgggtgtg 3840
gcggaccgct atcaggacat agcgttggct acccgtgata ttgctgaaga gcttggcggc 3900gcggaccgct atcaggacat agcgttggct acccgtgata ttgctgaaga gcttggcggc 3900
gaatgggctg accgcttcct cgtgctttac ggtatcgccg ctcccgattc gcagcgcatc 3960gaatgggctg accgcttcct cgtgctttac ggtatcgccg ctcccgattc gcagcgcatc 3960
gccttctatc gccttcttga cgagttcttc tgagcgggac tctgggaatt tcgacgacct 4020gccttctatc gccttcttga cgagttcttc tgagcgggac tctgggaatt tcgacgacct 4020
gcagccaagc ataacttcgt ataatgtatg ctatacgaac ggtaggatcc tctagagtcg 4080gcagccaagc ataacttcgt ataatgtatg ctatacgaac ggtaggatcc tctagagtcg 4080
acctgcaggc atgcaag 4097acctgcaggc atgcaag 4097
<210> 74<210> 74
<211> 58<211> 58
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 74<400> 74
ctgtctctta tacacatctc ctgaaattgg ccagatgatt aattcctaat ttttgttg 58ctgtctctta tacacatctc ctgaaattgg ccagatgatt aattcctaat ttttgttg 58
<210> 75<210> 75
<211> 50<211> 50
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 75<400> 75
ctgtctctta tacacatctc agcattacac gtcttgagcg attgtgtagg 50ctgtctctta tacacatctc agcattacac gtcttgagcg attgtgtagg 50
<210> 76<210> 76
<211> 60<211> 60
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 76<400> 76
cctgacgacg gtgagcgatc atttgtatat ctccttctta aagttaaaca aaattatttc 60cctgacgacg gtgagcgatc atttgtatat ctccttctta aagttaaaca aaattatttc 60
<210> 77<210> 77
<211> 59<211> 59
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 77<400> 77
aaccctgcaa ctgccggtct cttaaaataa ctagcataac cccttggggc ctctaaacg 59aaccctgcaa ctgccggtct cttaaaataa ctagcataac cccttggggc ctctaaacg 59
<210> 78<210> 78
<211> 59<211> 59
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 78<400> 78
aactttaaga aggagatata caaatgatcg ctcaccgtcg tcaggaactg gctcaacag 59aactttaaga aggagatata caaatgatcg ctcaccgtcg tcaggaactg gctcaacag 59
<210> 79<210> 79
<211> 57<211> 57
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 79<400> 79
aggccccaag gggttatgct agttatttta agagaccggc agttgcaggg tttcggc 57aggccccaag gggttatgct agttatttta agagaccggc agttgcaggg tttcggc 57
<210> 80<210> 80
<211> 62<211> 62
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 80<400> 80
gccccaaggg gttatgctag ttattttatt ccacggtcac ggatttcgcc aggttacgcg 60gccccaaggg gttatgctag ttattttatt ccacggtcac ggatttcgcc aggttacgcg 60
gc 62gc 62
<210> 81<210> 81
<211> 62<211> 62
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 81<400> 81
agcaccaacg ataccgcaca tttgtatatc tccttcttaa agttaaacaa aattatttct 60agcaccaacg ataccgcaca tttgtatatc tccttcttaa agttaaacaa aattatttct 60
ag 62ag 62
<210> 82<210> 82
<211> 62<211> 62
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 82<400> 82
gcgaaatccg tgaccgtgga ataaaataac tagcataacc ccttggggcc tctaaacggg 60gcgaaatccg tgaccgtgga ataaaataac tagcataacc ccttggggcc tctaaacggg 60
tc 62tc 62
<210> 83<210> 83
<211> 58<211> 58
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 83<400> 83
ctttaagaag gagatataca aatgtgcggt atcgttggtg ctatcgcaca gcgtgatg 58ctttaagaag gagatataca aatgtgcggt atcgttggtg ctatcgcaca gcgtgatg 58
<210> 84<210> 84
<211> 60<211> 60
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 84<400> 84
cgtggaaatg caaattagaa aatagaataa ctagcataac cccttggggc ctctaaacgg 60cgtggaaatg caaattagaa aatagaataa ctagcataac cccttggggc ctctaaacgg 60
<210> 85<210> 85
<211> 64<211> 64
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 85<400> 85
tataaaatcc atcgggtaag ctcatggttc tatctccttc gttattccac ggtcacggat 60tataaaatcc atcgggtaag ctcatggttc tatctccttc gttattccac ggtcacggat 60
ttcg 64ttcg 64
<210> 86<210> 86
<211> 65<211> 65
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 86<400> 86
atccgtgacc gtggaataac gaaggagata gaaccatgag cttacccgat ggattttata 60atccgtgacc gtggaataac gaaggagata gaaccatgag cttacccgat ggattttata 60
taagg 65taagg 65
<210> 87<210> 87
<211> 60<211> 60
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 87<400> 87
ccgtttagag gccccaaggg gttatgctag ttattctatt ttctaatttg catttccacg 60ccgtttagag gccccaaggg gttatgctag ttattctatt ttctaatttg catttccacg 60
<210> 88<210> 88
<211> 59<211> 59
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 88<400> 88
ctggttaagc ctggctgaac tgaagaaata aaataactag cataacccct tggggcctc 59ctggttaagc ctggctgaac tgaagaaata aaataactag cataacccct tggggcctc 59
<210> 89<210> 89
<211> 59<211> 59
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 89<400> 89
tagaggcccc aaggggttat gctagttatt ttatttcttc agttcagcca ggcttaacc 59tagaggcccc aaggggttat gctagttatt ttatttcttc agttcagcca ggcttaacc 59
<210> 90<210> 90
<211> 60<211> 60
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 90<400> 90
tttgtttggc gtcgagaagg agatagaacc atgtccaaca atggctcgtc accgctggtg 60tttgtttggc gtcgagaagg agatagaacc atgtccaaca atggctcgtc accgctggtg 60
<210> 91<210> 91
<211> 60<211> 60
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 91<400> 91
aagcaccagc ggtgacgagc cattgttgga catggttcta tctccttctc gacgccaaac 60aagcaccagc ggtgacgagc cattgttgga catggttcta tctccttctc gacgccaaac 60
<210> 92<210> 92
<211> 60<211> 60
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 92<400> 92
ctacccagaa agtgttcaaa gatattaaat aaaataacta gcataacccc ttggggcctc 60ctacccagaa agtgttcaaa gatattaaat aaaataacta gcataacccc ttggggcctc 60
<210> 93<210> 93
<211> 61<211> 61
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 93<400> 93
tttagaggcc ccaaggggtt atgctagtta ttttatttaa tatctttgaa cactttctgg 60tttagaggcc ccaaggggtt atgctagtta ttttatttaa tatctttgaa cactttctgg 60
g 61g 61
<210> 94<210> 94
<211> 60<211> 60
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 94<400> 94
gatgatgatg ttctggattt tgatttcttt catttgtata tctccttctt aaagttaaac 60gatgatgatg ttctggattt tgatttcttt catttgtata tctccttctt aaagttaaac 60
<210> 95<210> 95
<211> 60<211> 60
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 95<400> 95
tttgtttaac tttaagaagg agatatacaa atgaaagaaa tcaaaatcca gaacatcatc 60tttgtttaac tttaagaagg agatatacaa atgaaagaaa tcaaaatcca gaacatcatc 60
<210> 96<210> 96
<211> 70<211> 70
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 96<400> 96
agctgtctta tgaagatttc gcctaatcga aggagataca accatgaaga aaattctgtt 60agctgtctta tgaagatttc gcctaatcga aggagataca accatgaaga aaattctgtt 60
tatcaccggc 70tatcaccggc 70
<210> 97<210> 97
<211> 70<211> 70
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 97<400> 97
tgccggtgat aaacagaatt ttcttcatgg ttgtatctcc ttcgattagg cgaaatcttc 60tgccggtgat aaacagaatt ttcttcatgg ttgtatctcc ttcgattagg cgaaatcttc 60
ataagacagc 70agacagc 70
<210> 98<210> 98
<211> 60<211> 60
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 98<400> 98
taattttgtt taactttaag aaggagatat acatgagcct ggccattatc ccggcacgtg 60taattttgtt taactttaag aaggagatat acatgagcct ggccattatc ccggcacgtg 60
<210> 99<210> 99
<211> 60<211> 60
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 99<400> 99
tgccgggata atggccaggc tcatgtatat ctccttctta aagttaaaca aaattatttc 60tgccgggata atggccaggc tcatgtatat ctccttctta aagttaaaca aaattatttc 60
<210> 100<210> 100
<211> 60<211> 60
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 100<400> 100
aaaataagag ctcgagtcga aggagataga accatgagta ctacaaccca gaatatcccg 60aaaataagag ctcgagtcga aggagataga accatgagta ctacaaccca gaatatcccg 60
<210> 101<210> 101
<211> 60<211> 60
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 101<400> 101
tagtactcat ggttctatct ccttcgactc gagctcttat tttttccaga tctgttccac 60tagtactcat ggttctatct ccttcgactc gagctcttat tttttccaga tctgttccac 60
<210> 102<210> 102
<211> 60<211> 60
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 102<400> 102
aaaacgattt agtcaaaacc aaaagttaat aaatcgatac tagcataacc ccttggggcc 60aaaacgattt agtcaaaacc aaaagttaat aaatcgatac tagcataacc ccttggggcc 60
<210> 103<210> 103
<211> 56<211> 56
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 103<400> 103
aaggggttat gctagtatcg atttattaac ttttggtttt gactaaatcg tttttg 56aaggggttat gctagtatcg atttattaac ttttggtttt gactaaatcg tttttg 56
<210> 104<210> 104
<211> 73<211> 73
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 104<400> 104
ggggacaagt ttgtacaaaa aagcaggcta gaaggaggta tacaaatggg cctgaaaaaa 60ggggacaagt ttgtacaaaa aagcaggcta gaaggaggta tacaaatggg cctgaaaaaa 60
gcctgcctga ccg 73gcctgcctga ccg 73
<210> 105<210> 105
<211> 75<211> 75
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 105<400> 105
ggggacaagt ttgtacaaaa aagcaggcta gaaggagata tacaaatgac ccgcacccgt 60ggggacaagt ttgtacaaaa aagcaggcta gaaggagata tacaaatgac ccgcacccgt 60
atggaaaacg aactg 75atggaaaacg aactg 75
<210> 106<210> 106
<211> 62<211> 62
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 106<400> 106
ggggaccact ttgtacaaga aagctgggtt tattttttcc agatctgttc cacttttttc 60ggggaccact ttgtacaaga aagctgggtt tattttttcc agatctgttc cacttttttc 60
ag 62ag 62
<210> 107<210> 107
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 107<400> 107
aaaaagcagg ctagaaggag gtatacaaat gggcaaaaaa gtgattattg cgggcaacgg 60aaaaagcagg ctagaaggag gtatacaaat gggcaaaaaa gtgattattg cgggcaacgg 60
cccgagcc 68cccgagcc 68
<210> 108<210> 108
<211> 66<211> 66
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 108<400> 108
aggctcataa ttgtacctcc ttcgaggttt agttgatgtt tttgctgaat ttgccatacg 60aggctcataa ttgtacctcc ttcgaggttt agttgatgtt tttgctgaat ttgccatacg 60
cttcgc 66cttcgc 66
<210> 109<210> 109
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 109<400> 109
tggcaaattc agcaaaaaca tcaactaaac ctcgaaggag gtacaattat gagcctggcc 60tggcaaattc agcaaaaaca tcaactaaac ctcgaaggag gtacaattat gagcctggcc 60
attatccc 68attacccc 68
<210> 110<210> 110
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 110<400> 110
tgcccgcaat aatcactttt ttgcccattt gtatacctcc ttctagcctg cttttttgta 60tgcccgcaat aatcactttt ttgcccattt gtatacctcc ttctagcctg cttttttgta 60
caaacttg 68caaacttg 68
<210> 111<210> 111
<211> 65<211> 65
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 111<400> 111
aaaagcaggc tagaaggagg tatacaaatg aataagaaac cgctgattat tgctggcaac 60aaaagcaggc tagaaggagg tatacaaatg aataagaaac cgctgattat tgctggcaac 60
gggcc 65gggcc 65
<210> 112<210> 112
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 112<400> 112
aggctcataa ttgtacctcc ttcgaggttt atctcttcag gaatgcttta atgattgact 60aggctcataa ttgtacctcc ttcgaggttt atctcttcag gaatgcttta atgattgact 60
ttagcgcc 68ttagcgcc 68
<210> 113<210> 113
<211> 65<211> 65
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 113<400> 113
atcattaaag cattcctgaa gagataaacc tcgaaggagg tacaattatg agcctggcca 60atcattaaag cattcctgaa gagataaacc tcgaaggagg tacaattatg agcctggcca 60
ttatc 65ttatc 65
<210> 114<210> 114
<211> 69<211> 69
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 114<400> 114
agcaataatc agcggtttct tattcatttg tatacctcct tctagcctgc ttttttgtac 60agcaataatc agcggtttct tattcatttg tatacctcct tctagcctgc ttttttgtac 60
aaacttgtg 69aaacttgtg 69
<210> 115<210> 115
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 115<400> 115
aaaagcaggc tagaaggagg tatacaaatg gggaccatta aaaagccctt aatcatagca 60aaaagcaggc tagaaggagg tatacaaatg gggaccatta aaaagccctt aatcatagca 60
ggaaatgg 68gggaaatgg 68
<210> 116<210> 116
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 116<400> 116
aggctcataa ttgtacctcc ttcgaggttt atgcagctcc ccaacggaaa ctaactttta 60aggctcataa ttgtacctcc ttcgaggttt atgcagctcc ccaacggaaa ctaactttta 60
atgttggg 68atgttggg 68
<210> 117<210> 117
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 117<400> 117
tagtttccgt tggggagctg cataaacctc gaaggaggta caattatgag cctggccatt 60tagtttccgt tggggagctg cataaacctc gaaggaggta caattatgag cctggccatt 60
atcccggc 68atcccggc 68
<210> 118<210> 118
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 118<400> 118
attaagggct ttttaatggt ccccatttgt atacctcctt ctagcctgct tttttgtaca 60attaagggct ttttaatggt ccccatttgt atacctcctt ctagcctgct tttttgtaca 60
aacttgtg 68aacttgtg 68
<210> 119<210> 119
<211> 66<211> 66
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 119<400> 119
aaaagcaggc tagaaggagg tatacaaatg agtgaagaaa acacccagtc cattattaaa 60aaaagcaggc tagaaggagg tatacaaatg agtgaagaaa acacccagtc cattattaaa 60
aacgac 66aacgac 66
<210> 120<210> 120
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 120<400> 120
aggctcataa ttgtacctcc ttcgaggttc agacagcaat acagacaccc gtttcgcaat 60aggctcataa ttgtacctcc ttcgaggttc agacagcaat agacaccc gtttcgcaat 60
tcggcagg 68tcggcagg 68
<210> 121<210> 121
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 121<400> 121
aaacgggtgt ctgtattgct gtctgaacct cgaaggaggt acaattatga gcctggccat 60aaacgggtgt ctgtattgct gtctgaacct cgaaggaggt acaattatga gcctggccat 60
tatcccg 67tatcccg 67
<210> 122<210> 122
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 122<400> 122
atggactggg tgttttcttc actcatttgt atacctcctt ctagcctgct tttttgtaca 60atggactggg tgttttcttc actcatttgt atacctcctt ctagcctgct tttttgtaca 60
aacttgtg 68aacttgtg 68
<210> 123<210> 123
<211> 64<211> 64
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 123<400> 123
aaaagcaggc tagaaggagg tatacaaatg accatttacc tggacccggc gtctctgccg 60aaaagcaggc tagaaggagg tatacaaatg accatttacc tggacccggc gtctctgccg 60
accc 64accc 64
<210> 124<210> 124
<211> 66<211> 66
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 124<400> 124
aggctcataa ttgtacctcc ttcgaggttt acagttgttt cagagaatcc cagaagataa 60aggctcataa ttgtacctcc ttcgaggttt acagttgttt cagagaatcc cagagataa 60
tttggc 66tttggc 66
<210> 125<210> 125
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 125<400> 125
ttctgggatt ctctgaaaca actgtaaacc tcgaaggagg tacaattatg agcctggcca 60ttctgggatt ctctgaaaca actgtaaacc tcgaaggagg tacaattatg agcctggcca 60
ttatccc 67ttatccc 67
<210> 126<210> 126
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 126<400> 126
acgccgggtc caggtaaatg gtcatttgta tacctccttc tagcctgctt ttttgtacaa 60acgccgggtc caggtaaatg gtcatttgta tacctccttc tagcctgctt ttttgtacaa 60
acttgtg 67acttgtg 67
<210> 127<210> 127
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 127<400> 127
aaagcaggct agaaggaggt atacaaatgg gctgtaatag cgactccaac cacaacaact 60aaagcaggct agaaggaggt atacaaatgg gctgtaatag cgactccaac cacaacaact 60
ccgacgg 67ccgacgg 67
<210> 128<210> 128
<211> 65<211> 65
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 128<400> 128
aggctcataa ttgtacctcc ttcgaggttt attgcaggtc cgagatcagt ttcacatcat 60aggctcataa ttgtacctcc ttcgaggttt attgcaggtc cgagatcagt ttcacatcat 60
tacgg 65tacgg 65
<210> 129<210> 129
<211> 66<211> 66
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 129<400> 129
tgaaactgat ctcggacctg caataaacct cgaaggaggt acaattatga gcctggccat 60tgaaactgat ctcggacctg caataaacct cgaaggaggt acaattatga gcctggccat 60
tatccc 66tatccc 66
<210> 130<210> 130
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 130<400> 130
tggttggagt cgctattaca gcccatttgt atacctcctt ctagcctgct tttttgtaca 60tggttggagt cgctattaca gcccatttgt atacctcctt ctagcctgct tttttgtaca 60
aacttgtg 68aacttgtg 68
<210> 131<210> 131
<211> 64<211> 64
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 131<400> 131
aaagcaggct agaaggaggt atacaaatga acaacgacaa ctccacgacc accaacaata 60aaagcaggct agaaggaggt atacaaatga acaacgacaa ctccacgacc accaacaata 60
acgc 64acgc 64
<210> 132<210> 132
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 132<400> 132
ctcataattg tacctccttc gaggtttaaa tgtcagagat cagtttaata ttatcgcggt 60ctcataattg tacctccttc gaggtttaaa tgtcagagat cagtttaata ttatcgcggt 60
taatcag 67taatcag 67
<210> 133<210> 133
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 133<400> 133
atattaaact gatctctgac atttaaacct cgaaggaggt acaattatga gcctggccat 60atattaaact gatctctgac atttaaacct cgaaggaggt acaattatga gcctggccat 60
tatcccg 67tatcccg 67
<210> 134<210> 134
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 134<400> 134
tggtcgtgga gttgtcgttg ttcatttgta tacctccttc tagcctgctt ttttgtacaa 60tggtcgtgga gttgtcgttg ttcatttgta tacctccttc tagcctgctt ttttgtacaa 60
acttgtg 67acttgtg 67
<210> 135<210> 135
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 135<400> 135
aaagcaggct agaaggaggt atacaaatga aaacgattac cctgtatctg gacccggcgt 60aaagcaggct agaaggaggt atacaaatga aaacgattac cctgtatctg gacccggcgt 60
ccctgcc 67ccctgcc 67
<210> 136<210> 136
<211> 66<211> 66
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 136<400> 136
aggctcataa ttgtacctcc ttcgaggttt acagctgttt caggctgtcc caaaagatca 60aggctcataa ttgtacctcc ttcgaggttt acagctgttt caggctgtcc caaaagatca 60
cttgcg 66cttgcg 66
<210> 137<210> 137
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 137<400> 137
tttgggacag cctgaaacag ctgtaaacct cgaaggaggt acaattatga gcctggccat 60tttgggacag cctgaaacag ctgtaaacct cgaaggaggt acaattatga gcctggccat 60
tatcccg 67tatcccg 67
<210> 138<210> 138
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 138<400> 138
tccagataca gggtaatcgt tttcatttgt atacctcctt ctagcctgct tttttgtaca 60tccagataca gggtaatcgt tttcatttgt atacctcctt ctagcctgct tttttgtaca 60
aacttgtg 68aacttgtg 68
<210> 139<210> 139
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 139<400> 139
aaagcaggct agaaggaggt atacaaatga aaaagatcct gaccgtcctg agcatcttta 60aaagcaggct agaaggaggt atacaaatga aaaagatcct gaccgtcctg agcatcttta 60
tcctgagc 68tcctgagc 68
<210> 140<210> 140
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 140<400> 140
aggctcataa ttgtacctcc ttcgaggttt agtccagcat cgtaccgaag tcatccggtt 60aggctcataa ttgtacctcc ttcgaggttt agtccagcat cgtaccgaag tcatccggtt 60
tggtgtgg 68tggtgtgg 68
<210> 141<210> 141
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 141<400> 141
atgacttcgg tacgatgctg gactaaacct cgaaggaggt acaattatga gcctggccat 60atgacttcgg tacgatgctg gactaaacct cgaaggaggt acaattatga gcctggccat 60
tatcccg 67tatcccg 67
<210> 142<210> 142
<211> 70<211> 70
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 142<400> 142
tgctcaggac ggtcaggatc tttttcattt gtatacctcc ttctagcctg cttttttgta 60tgctcaggac ggtcaggatc tttttcattt gtatacctcc ttctagcctg cttttttgta 60
caaacttgtg 70caaacttgtg 70
<210> 143<210> 143
<211> 66<211> 66
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 143<400> 143
aaagcaggct agaaggaggt atacaaatga cgaatcgcaa aatctatgtc tgccacaccc 60aaagcaggct agaaggaggt atacaaatga cgaatcgcaa aatctatgtc tgccacaccc 60
tgtacc 66tgtacc 66
<210> 144<210> 144
<211> 70<211> 70
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 144<400> 144
aggctcataa ttgtacctcc ttcgaggttt atttaatgtc tttcagatca accagcgtaa 60aggctcataa ttgtacctcc ttcgaggttt atttaatgtc tttcagatca accagcgtaa 60
ttttcttgtc 70ttttcttgtc 70
<210> 145<210> 145
<211> 66<211> 66
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 145<400> 145
tggttgatct gaaagacatt aaataaacct cgaaggaggt acaattatga gcctggccat 60tggttgatct gaaagacatt aaataaacct cgaaggaggt acaattatga gcctggccat 60
tatccc 66tatccc 66
<210> 146<210> 146
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 146<400> 146
agacatagat tttgcgattc gtcatttgta tacctccttc tagcctgctt ttttgtacaa 60agacatagat tttgcgattc gtcatttgta tacctccttc tagcctgctt ttttgtacaa 60
acttgtg 67acttgtg 67
<210> 147<210> 147
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 147<400> 147
aaagcaggct agaaggaggt atacaaatgt tccgtgaaga caatatgaac ctgattatct 60aaagcaggct agaaggaggt atacaaatgt tccgtgaaga caatatgaac ctgattatct 60
gctgtacg 68gctgtacg 68
<210> 148<210> 148
<211> 65<211> 65
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 148<400> 148
aggctcataa ttgtacctcc ttcgaggttt agatgtcgat aactttgata ccgaaatctt 60aggctcataa ttgtacctcc ttcgaggttt agatgtcgat aactttgata ccgaaatctt 60
tcagg 65tcagg 65
<210> 149<210> 149
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 149<400> 149
tcggtatcaa agttatcgac atctaaacct cgaaggaggt acaattatga gcctggccat 60tcggtatcaa agttatcgac atctaaacct cgaaggaggt acaattatga gcctggccat 60
tatcccg 67tatcccg 67
<210> 150<210> 150
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 150<400> 150
aggttcatat tgtcttcacg gaacatttgt atacctcctt ctagcctgct tttttgtaca 60aggttcatat tgtcttcacg gaacatttgt atacctcctt ctagcctgct tttttgtaca 60
aacttgtg 68aacttgtg 68
<210> 151<210> 151
<211> 69<211> 69
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 151<400> 151
aaagcaggct agaaggaggt atacaaatga aagaaatcgc catcatctcc aaccaacgca 60aaagcaggct agaaggaggt atacaaatga aagaaatcgc catcatctcc aaccaacgca 60
tgttcttcc 69tgttcttcc 69
<210> 152<210> 152
<211> 69<211> 69
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 152<400> 152
aggctcataa ttgtacctcc ttcgaggttt agtcaaagaa atccagcagt ttcggatgca 60aggctcataa ttgtacctcc ttcgaggttt agtcaaagaa atccagcagt ttcggatgca 60
ccgcggtgc 69ccgcggtgc 69
<210> 153<210> 153
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 153<400> 153
tccgaaactg ctggatttct ttgactaaac ctcgaaggag gtacaattat gagcctggcc 60tccgaaactg ctggatttct ttgactaaac ctcgaaggag gtacaattat gagcctggcc 60
attatccc 68attacccc 68
<210> 154<210> 154
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 154<400> 154
ttggagatga tggcgatttc tttcatttgt atacctcctt ctagcctgct tttttgtaca 60ttggagatga tggcgatttc tttcatttgt atacctcctt ctagcctgct tttttgtaca 60
aacttgtg 68aacttgtg 68
<210> 155<210> 155
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 155<400> 155
aaagcaggct agaaggaggt atacaaatgc tgattcaaca gaacctggaa atctacctgg 60aaagcaggct agaaggaggt atacaaatgc tgattcaaca gaacctggaa atctacctgg 60
actacgc 67actacgc 67
<210> 156<210> 156
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 156<400> 156
aggctcataa ttgtacctcc ttcgaggttt aattgtgaat ggtgcacata aacgcctgat 60aggctcataa ttgtacctcc ttcgaggttt aattgtgaat ggtgcacata aacgcctgat 60
cttcgttg 68cttcgttg 68
<210> 157<210> 157
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 157<400> 157
aggcgtttat gtgcaccatt cacaattaaa cctcgaagga ggtacaatta tgagcctggc 60aggcgtttat gtgcaccatt cacaattaaa cctcgaagga ggtacaatta tgagcctggc 60
cattatcc 68cattattcc 68
<210> 158<210> 158
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 158<400> 158
atttccaggt tctgttgaat cagcatttgt atacctcctt ctagcctgct tttttgtaca 60atttccaggt tctgttgaat cagcatttgt atacctcctt ctagcctgct tttttgtaca 60
aacttgtg 68aacttgtg 68
<210> 159<210> 159
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 159<400> 159
aaagcaggct agaaggaggt atacaaatgg gctgtaactc cgatagcaaa cacaataaca 60aaagcaggct agaaggaggt atacaaatgg gctgtaactc cgatagcaaa cacaataaca 60
gtgatggc 68gtgatggc 68
<210> 160<210> 160
<211> 65<211> 65
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 160<400> 160
aggctcataa ttgtacctcc ttcgaggttt attgcaggtc actaatcagt ttcacatcat 60aggctcataa ttgtacctcc ttcgaggttt attgcaggtc actaatcagt ttcacatcat 60
tgcgg 65tgcgg 65
<210> 161<210> 161
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 161<400> 161
tgaaactgat tagtgacctg caataaacct cgaaggaggt acaattatga gcctggccat 60tgaaactgat tagtgacctg caataaacct cgaaggaggt acaattatga gcctggccat 60
tatcccgg 68tatcccgg 68
<210> 162<210> 162
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 162<400> 162
tgtttgctat cggagttaca gcccatttgt atacctcctt ctagcctgct tttttgtaca 60tgtttgctat cggagttaca gcccatttgt atacctcctt ctagcctgct tttttgtaca 60
aacttgtg 68aacttgtg 68
<210> 163<210> 163
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 163<400> 163
aaagcaggct agaaggaggt atacaaatgt gtaacgataa tcaaaatacg gtcgatgttg 60aaagcaggct agaaggaggt atacaaatgt gtaacgataa tcaaaatacg gtcgatgttg 60
ttgtgagc 68ttgtgagc 68
<210> 164<210> 164
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 164<400> 164
aggctcataa ttgtacctcc ttcgaggttt aatactgagc aatacaaaca cccgaggaac 60aggctcataa ttgtacctcc ttcgaggttt aatactgagc aatacaaaca cccgaggaac 60
aatccggc 68aatccggc 68
<210> 165<210> 165
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 165<400> 165
tcgggtgttt gtattgctca gtattaaacc tcgaaggagg tacaattatg agcctggcca 60tcgggtgttt gtattgctca gtattaaacc tcgaaggagg tacaattatg agcctggcca 60
ttatccc 67ttatccc 67
<210> 166<210> 166
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 166<400> 166
accgtatttt gattatcgtt acacatttgt atacctcctt ctagcctgct tttttgtaca 60accgtatttt gattatcgtt acacatttgt atacctcctt ctagcctgct tttttgtaca 60
aacttgtg 68aacttgtg 68
<210> 167<210> 167
<211> 65<211> 65
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 167<400> 167
aaaagcaggc tagaaggagg tatacaaatg aacgataatc aaaatacggt ggacgtggtg 60aaaagcaggc tagaaggagg tatacaaatg aacgataatc aaaatacggt ggacgtggtg 60
gtctc 65gtctc 65
<210> 168<210> 168
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 168<400> 168
aggctcataa ttgtacctcc ttcgaggttt agcaccagaa cagcacatct ttttctttca 60aggctcataa ttgtacctcc ttcgaggttt agcaccagaa cagcacatct ttttctttca 60
caatgcc 67caatgcc 67
<210> 169<210> 169
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 169<400> 169
aaaaagatgt gctgttctgg tgctaaacct cgaaggaggt acaattatga gcctggccat 60aaaaagatgt gctgttctgg tgctaaacct cgaaggaggt acaattatga gcctggccat 60
tatcccg 67tatcccg 67
<210> 170<210> 170
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 170<400> 170
tccaccgtat tttgattatc gttcatttgt atacctcctt ctagcctgct tttttgtaca 60tccaccgtat tttgattatc gttcatttgt atacctcctt ctagcctgct tttttgtaca 60
aacttgtg 68aacttgtg 68
<210> 171<210> 171
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 171<400> 171
aaaagcaggc tagaaggagg tatacaaatg caaaacgtca ttatcgctgg taacggtccg 60aaaagcaggc tagaaggagg tatacaaatg caaaacgtca ttatcgctgg taacggtccg 60
agcctgc 67agcctgc 67
<210> 172<210> 172
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 172<400> 172
aggctcataa ttgtacctcc ttcgaggttt atttcttttt gtattctttc ttcagttttt 60aggctcataa ttgtacctcc ttcgaggttt atttcttttt gtattctttc ttcagttttt 60
tgatttcg 68tgatttcg 68
<210> 173<210> 173
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 173<400> 173
tgaagaaaga atacaaaaag aaataaacct cgaaggaggt acaattatga gcctggccat 60tgaagaaaga atacaaaaag aaataaacct cgaaggaggt acaattatga gcctggccat 60
tatcccgg 68tatcccgg 68
<210> 174<210> 174
<211> 68<211> 68
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 174<400> 174
ttaccagcga taatgacgtt ttgcatttgt atacctcctt ctagcctgct tttttgtaca 60ttaccagcga taatgacgtt ttgcatttgt atacctcctt ctagcctgct tttttgtaca 60
aacttgtg 68aacttgtg 68
<210> 175<210> 175
<211> 64<211> 64
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 175<400> 175
aaaagcaggc tagaaggagg tatacaaatg gattcttcgc cggaaaacac cagctctacg 60aaaagcaggc tagaaggagg tatacaaatg gattcttcgc cggaaaacac cagctctacg 60
ctgg 64ctgg 64
<210> 176<210> 176
<211> 66<211> 66
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 176<400> 176
aggctcataa ttgtacctcc ttcgaggttt atttgatgtc cgtcgtaaag cgcacttttt 60aggctcataa ttgtacctcc ttcgaggttt atttgatgtc cgtcgtaaag cgcacttttt 60
cgtccg 66cgtccg 66
<210> 177<210> 177
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 177<400> 177
tgcgctttac gacggacatc aaataaacct cgaaggaggt acaattatga gcctggccat 60tgcgctttac gacggacatc aaataaacct cgaaggaggt acaattatga gcctggccat 60
tatcccg 67tatcccg 67
<210> 178<210> 178
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 178<400> 178
tggtgttttc cggcgaagaa tccatttgta tacctccttc tagcctgctt ttttgtacaa 60tggtgttttc cggcgaagaa tccatttgta tacctccttc tagcctgctt ttttgtacaa 60
acttgtg 67acttgtg 67
<210> 179<210> 179
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 179<400> 179
aaaagcaggc tagaaggagg tatacaaatg aagaaagtct acttctgcca tacggtctac 60aaaagcaggc tagaaggagg tatacaaatg aagaaagtct acttctgcca tacggtctac 60
catctgc 67catctgc 67
<210> 180<210> 180
<211> 65<211> 65
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 180<400> 180
aggctcataa ttgtacctcc ttcgaggttt aactatttgc tttcatttgt ttcagggtga 60aggctcataa ttgtacctcc ttcgaggttt aactatttgc tttcatttgt ttcagggtga 60
ttttc 65ttttc 65
<210> 181<210> 181
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 181<400> 181
tgaaacaaat gaaagcaaat agttaaacct cgaaggaggt acaattatga gcctggccat 60tgaaacaaat gaaagcaaat agttaaacct cgaaggaggt acaattatga gcctggccat 60
tatcccg 67tatcccg 67
<210> 182<210> 182
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 182<400> 182
tatggcagaa gtagactttc ttcatttgta tacctccttc tagcctgctt ttttgtacaa 60tatggcagaa gtagactttc ttcatttgta tacctccttc tagcctgctt ttttgtacaa 60
acttgtg 67acttgtg 67
<210> 183<210> 183
<211> 65<211> 65
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 183<400> 183
aaagcaggct agaaggaggt atacaaatgc gtaaaatcat caccttcttc agcctgttct 60aaagcaggct agaaggaggt atacaaatgc gtaaaatcat caccttcttc agcctgttct 60
tctcg 65tctcg 65
<210> 184<210> 184
<211> 70<211> 70
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 184<400> 184
aggctcataa ttgtacctcc ttcgaggttt aaaagttaat cgggttcggc atttcttcaa 60aggctcataa ttgtacctcc ttcgaggttt aaaagttaat cgggttcggc atttcttcaa 60
agaaaatctg 70agaaaatctg 70
<210> 185<210> 185
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 185<400> 185
aaatgccgaa cccgattaac ttttaaacct cgaaggaggt acaattatga gcctggccat 60aaatgccgaa cccgattaac ttttaaacct cgaaggaggt acaattatga gcctggccat 60
tatcccg 67tatcccg 67
<210> 186<210> 186
<211> 67<211> 67
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Праймер<223> Primer
<400> 186<400> 186
tgaagaaggt gatgatttta cgcatttgta tacctccttc tagcctgctt ttttgtacaa 60tgaagaaggt gatgatttta cgcatttgta tacctccttc tagcctgctt ttttgtacaa 60
acttgtg 67acttgtg 67
<210> 187<210> 187
<211> 3856<211> 3856
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Экспрессионная кассета <Ptet-wbdO-PT5-galE-FRT-cat-FRT><223> Expression cassette <Ptet-wbdO-PT5-galE-FRT-cat-FRT>
<400> 187<400> 187
acaggttggc tgataagtcc ccggtctgcc cgaaaagtgc cacctgaaat tggccagatg 60acaggttggc tgataagtcc ccggtctgcc cgaaaagtgc cacctgaaat tggccagatg 60
attaattcct aatttttgtt gattctggta ccaaatgagt cgaccggcca gatgattaat 120attaattcct aatttttgtt gattctggta ccaaatgagt cgaccggcca gatgattaat 120
tcctaatttt tgttgacact ctatcattga tagagttatt ttaccactcc ctatcagtga 180tcctaatttt tgttgacact ctatcattga tagagttatt ttaccactcc ctatcagtga 180
tagagaaaag tgaaatgaat agttcgacaa aaatctagaa ataattttgt ttaactttaa 240tagagaaaag tgaaatgaat agttcgacaa aaatctagaa ataattttgt ttaactttaa 240
gaaggagata tacaaatgct gacggaagtg cgcccggtct ctacgacgaa accgctggtg 300gaaggagata tacaaatgct gacggaagtg cgcccggtct ctacgacgaa accgctggtg 300
tctgtgattc tgccggtgaa caaattcaac ccgtatctgg atcgtgcaat tcattcaatc 360tctgtgattc tgccggtgaa caaattcaac ccgtatctgg atcgtgcaat tcattcaatc 360
ctgagtcagt cctatccgtc gattgaactg attatcattg caaacaattg caccaatgac 420ctgagtcagt cctatccgtc gattgaactg attatcattg caaacaattg caccaatgac 420
tttttcgatg ctctgaaaaa acgtgaatgt gaaaccatta aagtgctgcg cacgaacatc 480tttttcgatg ctctgaaaaa acgtgaatgt gaaaccatta aagtgctgcg cacgaacatc 480
gcgtatctgc cgtactgcct gaataaaggc ctggatctgt gtaacggtga ctttgttgcc 540gcgtatctgc cgtactgcct gaataaaggc ctggatctgt gtaacggtga ctttgttgcc 540
cgcatggatt cagatgacat ttcgcacccg gaacgtatcg atcgccaggt cgacttcctg 600cgcatggatt cagatgacat ttcgcacccg gaacgtatcg atcgccaggt cgacttcctg 600
attaacaatc cggacatcga tgtggttggc accaatgcag tctatattga tgaagatgac 660attaacaatc cggacatcga tgtggttggc accaatgcag tctatattga tgaagatgac 660
atcgaactgg aaaaaagcaa cctgccggtg aacaataacg ctattcgtaa aatgctgccg 720atcgaactgg aaaaaagcaa cctgccggtg aacaataacg ctattcgtaa aatgctgccg 720
tataaatgct gtctggtgca tccgtctgtt atgtttcgca aaaatgtcgt gatcaccagc 780tataaatgct gtctggtgca tccgtctgtt atgtttcgca aaaatgtcgt gatcaccagc 780
ggcggttaca tgttcgcgaa ttattctgaa gattacgaac tgtggaaccg tctggccgtt 840ggcggttaca tgttcgcgaa ttattctgaa gattacgaac tgtggaaccg tctggccgtt 840
gaaggccgca atttttataa cctgagcgaa tacctgctgt attaccgtct gcacaataac 900gaaggccgca atttttataa cctgagcgaa tacctgctgt attaccgtct gcacaataac 900
caatcaacgt cgaaaaataa cctgtttatg gtgatggcga acgatgtcgc cattaaagtg 960caatcaacgt cgaaaaataa cctgtttatg gtgatggcga acgatgtcgc cattaaagtg 960
aaatatttcc tgctgaccaa gaaaattagc tacctgctgg gtatcattcg cacggtcttt 1020aaatatttcc tgctgaccaa gaaaattagc tacctgctgg gtatcattcg cacggtcttt 1020
tctgtgttct attgcaaata catcaaatga tttcgtcgac acacaggaaa catattaaaa 1080tctgtgttct attgcaaata catcaaatga tttcgtcgac acacaggaaa catattaaaa 1080
attaaaacct gcaggagttt aaacgcggcc gcgatatcgt tgtaaaacga cggccagtgc 1140attaaaacct gcaggagttt aaacgcggcc gcgatatcgt tgtaaaacga cggccagtgc 1140
aagaatcata aaaaatttat ttgctttcag gaaaattttt ctgtataata gattcataaa 1200aagaatcata aaaaatttat ttgctttcag gaaaattttt ctgtataata gattcataaa 1200
tttgagagag gagtttttgt gagcggataa caattcccca tcttagtata ttagttaagt 1260tttgagagag gagtttttgt gagcggataa caattcccca tcttagtata ttagttaagt 1260
ataaatacac cgcggaggcg tcgaaggaga tacaaccatg agagttctgg ttaccggtgg 1320ataaatacac cgcggaggcg tcgaaggaga tacaaccatg agagttctgg ttaccggtgg 1320
tagcggttac attggaagtc atacctgtgt gcaattactg caaaacggtc atgatgtcat 1380tagcggttac attggaagtc atacctgtgt gcaattactg caaaacggtc atgatgtcat 1380
cattcttgat aacctctgta acagtaagcg cagcgtactg cctgttatcg agcgtttagg 1440cattcttgat aacctctgta acagtaagcg cagcgtactg cctgttatcg agcgtttagg 1440
cggcaaacat ccaacgtttg ttgaaggcga tattcgtaac gaagcgttga tgaccgagat 1500cggcaaacat ccaacgtttg ttgaaggcga tattcgtaac gaagcgttga tgaccgagat 1500
cctgcacgat cacgctatcg acaccgtgat ccacttcgcc gggctgaaag ccgtgggcga 1560cctgcacgat cacgctatcg acaccgtgat ccacttcgcc gggctgaaag ccgtgggcga 1560
atcggtacaa aaaccgctgg aatattacga caacaatgtc aacggcactc tgcgcctgat 1620atcggtacaa aaaccgctgg aatattacga caacaatgtc aacggcactc tgcgcctgat 1620
tagcgccatg cgcgccgcta acgtcaaaaa ctttattttt agctcctccg ccaccgttta 1680tagcgccatg cgcgccgcta acgtcaaaaa ctttattttt agctcctccg ccaccgttta 1680
tggcgatcag cccaaaattc catacgttga aagcttcccg accggcacac cgcaaagccc 1740tggcgatcag cccaaaattc catacgttga aagcttcccg accggcacac cgcaaagccc 1740
ttacggcaaa agcaagctga tggtggaaca gatcctcacc gatctgcaaa aagcccagcc 1800ttacggcaaa agcaagctga tggtggaaca gatcctcacc gatctgcaaa aagcccagcc 1800
ggactggagc attgccctgc tgcgctactt caacccggtt ggcgcgcatc cgtcgggcga 1860ggactggagc attgccctgc tgcgctactt caacccggtt ggcgcgcatc cgtcgggcga 1860
tatgggcgaa gatccgcaag gcattccgaa taacctgatg ccatacatcg cccaggttgc 1920tatgggcgaa gatccgcaag gcattccgaa taacctgatg ccatacatcg cccaggttgc 1920
tgtaggccgt cgcgactcgc tggcgatttt tggtaacgat tatccgaccg aagatggtac 1980tgtaggccgt cgcgactcgc tggcgatttt tggtaacgat tatccgaccg aagatggtac 1980
tggcgtacgc gattacatcc acgtaatgga tctggcggac ggtcacgtcg tggcgatgga 2040tggcgtacgc gattacatcc acgtaatgga tctggcggac ggtcacgtcg tggcgatgga 2040
aaaactggcg aacaagccag gcgtacacat ctacaacctc ggcgctggcg taggcaacag 2100aaaactggcg aacaagccag gcgtacacat ctacaacctc ggcgctggcg taggcaacag 2100
cgtgctggac gtggttaatg ccttcagcaa agcctgcggc aaaccggtta attatcattt 2160cgtgctggac gtggttaatg ccttcagcaa agcctgcggc aaaccggtta attatcattt 2160
tgcaccgcgt cgcgagggcg accttccggc ctactgggcg gacgccagca aagccgaccg 2220tgcaccgcgt cgcgagggcg accttccggc ctactgggcg gacgccagca aagccgaccg 2220
tgaactgaac tggcgcgtaa cgcgcacact cgatgaaatg gcgcaggaca cctggcactg 2280tgaactgaac tggcgcgtaa cgcgcacact cgatgaaatg gcgcaggaca cctggcactg 2280
gcagtcacgc catccacagg gatatcccga ttaacgccat ttaaatcaac ctcagcggtc 2340gcagtcacgc catccacagg gatatcccga ttaacgccat ttaaatcaac ctcagcggtc 2340
atagctgttt cctgtgactg agcaataact agcataaccc cttggggcct ctaaacgggt 2400atagctgttt cctgtgactg agcaataact agcataaccc cttggggcct ctaaacgggt 2400
cttgaggggt tttttgctga aaccaatttg cctggcggca gtagcgcggt ggtcccacct 2460cttgaggggt tttttgctga aaccaatttg cctggcggca gtagcgcggt ggtcccacct 2460
gaccccatgc cgaactcaga agtgaaacgc cgtagcgccg atggtagtgt ggggtctccc 2520gaccccatgc cgaactcaga agtgaaacgc cgtagcgccg atggtagtgt ggggtctccc 2520
catgcgagag tagggaactg ccaggcatca aataaaacga aaggctcagt cgaaagactg 2580catgcgagag tagggaactg ccaggcatca aataaaacga aaggctcagt cgaaagactg 2580
ggcctttcgg gatccaggcc ggcctgttaa cgaattaatc ttccgcggcg gtatcgataa 2640ggcctttcgg gatccaggcc ggcctgttaa cgaattaatc ttccgcggcg gtatcgataa 2640
gcttgatatc gaggctgaca tgggaattag ccatggtcca tatgaatatc ctccttagtt 2700gcttgatatc gaggctgaca tgggaattag ccatggtcca tatgaatatc ctccttagtt 2700
cctattccga agttcctatt ctctagaaag tataggaact tcggcgcgcc tacctgtgac 2760cctattccga agttcctatt ctctagaaag tataggaact tcggcgcgcc tacctgtgac 2760
ggaagatcac ttcgcagaat aaataaatcc tggtgtccct gttgataccg ggaagccctg 2820ggaagatcac ttcgcagaat aaataaatcc tggtgtccct gttgataccg ggaagccctg 2820
ggccaacttt tggcgaaaat gagacgttga tcggcacgta agaggttcca actttcacca 2880ggccaacttt tggcgaaaat gagacgttga tcggcacgta agaggttcca actttcacca 2880
taatgaaata agatcactac cgggcgtatt ttttgagttg tcgagatttt caggagctaa 2940taatgaaata agatcactac cgggcgtatt ttttgagttg tcgagatttt caggagctaa 2940
ggaagctaaa atggagaaaa aaatcactgg atataccacc gttgatatat cccaatggca 3000ggaagctaaa atggagaaaa aaatcactgg atataccacc gttgatatat cccaatggca 3000
tcgtaaagaa cattttgagg catttcagtc agttgctcaa tgtacctata accagaccgt 3060tcgtaaagaa cattttgagg catttcagtc agttgctcaa tgtacctata accagaccgt 3060
tcagctggat attacggcct ttttaaagac cgtaaagaaa aataagcaca agttttatcc 3120tcagctggat attacggcct ttttaaagac cgtaaagaaa aataagcaca agttttatcc 3120
ggcctttatt cacattcttg cccgcctgat gaatgctcat ccggaattac gtatggcaat 3180ggcctttatt cacattcttg cccgcctgat gaatgctcat ccggaattac gtatggcaat 3180
gaaagacggt gagctggtga tatgggatag tgttcaccct tgttacaccg ttttccatga 3240gaaagacggt gagctggtga tatgggatag tgttcaccct tgttacaccg ttttccatga 3240
gcaaactgaa acgttttcat cgctctggag tgaataccac gacgatttcc ggcagtttct 3300gcaaactgaa acgttttcat cgctctggag tgaataccac gacgatttcc ggcagtttct 3300
acacatatat tcgcaagatg tggcgtgtta cggtgaaaac ctggcctatt tccctaaagg 3360acacatatat tcgcaagatg tggcgtgtta cggtgaaaac ctggcctatt tccctaaagg 3360
gtttattgag aatatgtttt tcgtctcagc caatccctgg gtgagtttca ccagttttga 3420gtttattgag aatatgtttt tcgtctcagc caatccctgg gtgagtttca ccagttttga 3420
tttaaacgtg gccaatatgg acaacttctt cgcccccgtt ttcaccatgg gcaaatatta 3480tttaaacgtg gccaatatgg acaacttctt cgcccccgtt ttcaccatgg gcaaatatta 3480
tacgcaaggc gacaaggtgc tgatgccgct ggcgattcag gttcatcatg ccgtttgtga 3540tacgcaaggc gacaaggtgc tgatgccgct ggcgattcag gttcatcatg ccgtttgtga 3540
tggcttccat gtcggcagat gcttaatgaa tacaacagta ctgcgatgag tggcagggcg 3600tggcttccat gtcggcagat gcttaatgaa tacaacagta ctgcgatgag tggcagggcg 3600
gggcgtaagg cgcgccattt aaatgaagtt cctattccga agttcctatt ctctagaaag 3660gggcgtaagg cgcgccattt aaatgaagtt cctattccga agttcctatt ctctagaaag 3660
tataggaact tcgaagcagc tccagcctac acaatcgctc aagacgtgta atgctgcaat 3720tataggaact tcgaagcagc tccagcctac acaatcgctc aagacgtgta atgctgcaat 3720
ctgcatgcaa gcttggcact ggcgatggcg cctcatccct gaagccaata agcagctcca 3780ctgcatgcaa gcttggcact ggcgatggcg cctcatccct gaagccaata agcagctcca 3780
gcctacacaa tcgctcaaga cgtgtaatgc tgcaatctgc atgcaagcta gaccggggac 3840gcctacacaa tcgctcaaga cgtgtaatgc tgcaatctgc atgcaagcta gaccggggac 3840
ttatcagcca acctgt 3856ttatcagcca acctgt 3856
<210> 188<210> 188
<211> 4568<211> 4568
<212> ДНК<212> DNA
<213> Искусственная Последовательность<213> Artificial Sequence
<220><220>
<223> Экспрессионная кассета <Ptet-lgtA-PT5-galT-FRT-kan-FRT>.<223> Expression cassette <Ptet-lgtA-PT5-galT-FRT-kan-FRT>.
<400> 188<400> 188
acaggttggc tgataagtcc ccggtctagc ttgcatgcag attgcagcat tacacgtctt 60acaggttggc tgataagtcc ccggtctagc ttgcatgcag attgcagcat tacacgtctt 60
gagcgattgt gtaggctgga gctgcttcga agttcctata ctttctagag aataggaact 120gagcgattgt gtaggctgga gctgcttcga agttcctata ctttctagag aataggaact 120
tcggaatagg aacttcattt aaatggcgcg ccttacgccc cgccctgccg gtaccgagag 180tcggaatagg aacttcattt aaatggcgcg ccttacgccc cgccctgccg gtaccgagag 180
cgcttttgaa gctggggtgg gcgaagaact ccagcatgag atccccgcgc tggaggatca 240cgcttttgaa gctggggtgg gcgaagaact ccagcatgag atccccgcgc tggaggatca 240
tccagccggc gtcccggaaa acgattccga agcccaacct ttcatagaag gcggcggtgg 300tccagccggc gtcccggaaa acgattccga agcccaacct ttcatagaag gcggcggtgg 300
aatcgaaatc tcgtgatggc aggttgggcg tcgcttggtc ggtcatttcg aaccccagag 360aatcgaaatc tcgtgatggc aggttgggcg tcgcttggtc ggtcatttcg aaccccagag 360
tcccgctcag aagaactcgt caagaaggcg atagaaggcg atgcgctgcg aatcgggagc 420tcccgctcag aagaactcgt caagaaggcg atagaaggcg atgcgctgcg aatcgggagc 420
ggcgataccg taaagcacga ggaagcggtc agcccattcg ccgccaagct cttcagcaat 480ggcgataccg taaagcacga ggaagcggtc agcccattcg ccgccaagct cttcagcaat 480
atcacgggta gccaacgcta tgtcctgata gcggtccgcc acacccagcc ggccacagtc 540atcacgggta gccaacgcta tgtcctgata gcggtccgcc acacccagcc ggccacagtc 540
gatgaatcca gaaaagcggc cattttccac catgatattc ggcaagcagg catcgccatg 600gatgaatcca gaaaagcggc cattttccac catgatattc ggcaagcagg catcgccatg 600
ggtcacgacg agatcctcgc cgtcgggcat gcgcgccttg agcctggcga acagttcggc 660ggtcacgacg agatcctcgc cgtcgggcat gcgcgccttg agcctggcga acagttcggc 660
tggcgcgagc ccctgatgct cttcgtccag atcatcctga tcgacaagac cggcttccat 720tggcgcgagc ccctgatgct cttcgtccag atcatcctga tcgacaagac cggcttccat 720
ccgagtacgt gctcgctcga tgcgatgttt cgcttggtgg tcgaatgggc aggtagccgg 780ccgagtacgt gctcgctcga tgcgatgttt cgcttggtgg tcgaatgggc aggtagccgg 780
atcaagcgta tgcagccgcc gcattgcatc agccatgatg gatactttct cggcaggagc 840atcaagcgta tgcagccgcc gcattgcatc agccatgatg gatactttct cggcaggagc 840
aaggtgagat gacaggagat cctgccccgg cacttcgccc aatagcagcc agtcccttcc 900aaggtgagat gacaggagat cctgccccgg cacttcgccc aatagcagcc agtcccttcc 900
cgcttcagtg acaacgtcga gcacagctgc gcaaggaacg cccgtcgtgg ccagccacga 960cgcttcagtg acaacgtcga gcacagctgc gcaaggaacg cccgtcgtgg ccagccacga 960
tagccgcgct gcctcgtcct gcagttcatt cagggcaccg gacaggtcgg tcttgacaaa 1020tagccgcgct gcctcgtcct gcagttcatt cagggcaccg gacaggtcgg tcttgacaaa 1020
aagaaccggg cgcccctgcg ctgacagccg gaacacggcg gcatcagagc agccgattgt 1080aagaaccggg cgcccctgcg ctgacagccg gaacacggcg gcatcagagc agccgattgt 1080
ctgttgtgcc cagtcatagc cgaatagcct ctccacccaa gcggccggag aacctgcgtg 1140ctgttgtgcc cagtcatagc cgaatagcct ctccacccaa gcggccggag aacctgcgtg 1140
caatccatct tgttcaatca tgcgaaacga tcctcatcct gtctcttgat cagatcttga 1200caatccatct tgttcaatca tgcgaaacga tcctcatcct gtctcttgat cagatcttga 1200
tcccctgcgc catcagatcc ttggcggcaa gaaagccatc cagtttactt tgcagggctt 1260tcccctgcgc catcagatcc ttggcggcaa gaaagccatc cagtttactt tgcagggctt 1260
cccaacctta ccagagggcg ccccagctgg caattccggt tcgcttgctg tccataaaac 1320cccaacctta cccagagggcg ccccagctgg caattccggt tcgcttgctg tccataaaac 1320
cgcccagtct agctatcgcc atgtaagccc actgcaagct acctgctttc tctttgcgct 1380cgcccagtct agctatcgcc atgtaagccc actgcaagct acctgctttc tctttgcgct 1380
tgcgttttcc cttgtccaga tagcccagta gctgacattc atccggggtc agcaccgttt 1440tgcgttttcc cttgtccaga tagcccagta gctgacattc atccggggtc agcaccgttt 1440
ctgcggactg gctttctacg tgttccgctt cctttagcag cccttgcgcc ctgagtgctt 1500ctgcggactg gctttctacg tgttccgctt cctttagcag cccttgcgcc ctgagtgctt 1500
gcggcagcgt gaggggatct tgacgcgtgt cacaggtagg acgcgccgaa gttcctatac 1560gcggcagcgt gaggggatct tgacgcgtgt cacaggtagg acgcgccgaa gttcctatac 1560
tttctagaga ataggaactt cggaatagga actaaggagg atattcatac atgatggtag 1620tttctagaga ataggaactt cggaatagga actaaggagg atattcatac atgatggtag 1620
tgttcgaaat taatacgact cactataggg gaattgattc tggtaccaaa tgagtcgacc 1680tgttcgaaat taatacgact cactataggg gaattgattc tggtaccaaa tgagtcgacc 1680
ggccagatga ttaattccta atttttgttg acactctatc attgatagag ttattttacc 1740ggccagatga ttaattccta atttttgttg acactctatc attgatagag ttattttacc 1740
actccctatc agtgatagag aaaagtgaaa tgaatagttc gacaaaaatc tagaaataat 1800actccctatc agtgatagag aaaagtgaaa tgaatagttc gacaaaaatc tagaaataat 1800
tttgtttaac tttaagaagg agatatacaa atgccgtccg aagcattccg tcgtcaccgt 1860tttgtttaac tttaagaagg agatatacaa atgccgtccg aagcattccg tcgtcaccgt 1860
gcttatcgcg aaaacaaact gcagccactg gtctctgtcc tgatctgcgc atacaacgtt 1920gcttatcgcg aaaacaaact gcagccactg gtctctgtcc tgatctgcgc atacaacgtt 1920
gagaaatact tcgcacagtc tctggcagct gtagttaacc agacctggcg taacctggat 1980gagaaatact tcgcacagtc tctggcagct gtagttaacc agacctggcg taacctggat 1980
atcctgatcg tagatgacgg ctctacggat ggtacgctgg cgatcgcaca gcgtttccag 2040atcctgatcg tagatgacgg ctctacggat ggtacgctgg cgatcgcaca gcgtttccag 2040
gaacaggacg gtcgtatccg cattctcgct cagccgcgta actctggtct gatcccgtct 2100gaacaggacg gtcgtatccg cattctcgct cagccgcgta actctggtct gatcccgtct 2100
ctgaacatcg gtctggacga actggccaaa tctggtggtg gtggcgaata catcgcccgt 2160ctgaacatcg gtctggacga actggccaaa tctggtggtg gtggcgaata catcgcccgt 2160
actgacgccg acgacattgc ggccccggat tggatcgaaa aaatcgtagg tgaaatggag 2220actgacgccg acgacattgc ggccccggat tggatcgaaa aaatcgtagg tgaaatggag 2220
aaagaccgct ctatcatcgc gatgggtgct tggctggaag ttctgtccga agagaaagac 2280aaagaccgct ctatcatcgc gatgggtgct tggctggaag ttctgtccga agagaaagac 2280
ggtaaccgtc tggcccgtca ccatgaacac ggcaaaatct ggaaaaaacc gacccgtcac 2340ggtaaccgtc tggcccgtca ccatgaacac ggcaaaatct ggaaaaaacc gacccgtcac 2340
gaagatatcg cggacttctt cccgttcggt aacccgatcc ataacaacac catgatcatg 2400gaagatatcg cggacttctt cccgttcggt aacccgatcc ataacaacac catgatcatg 2400
cgtcgtagcg taatcgacgg tggtctgcgt tacaacaccg aacgtgattg ggcagaagac 2460cgtcgtagcg taatcgacgg tggtctgcgt tacaacaccg aacgtgattg ggcagaagac 2460
taccagtttt ggtatgacgt gtctaaactg ggtcgtctgg cttactaccc agaagcgctg 2520taccagtttt ggtatgacgt gtctaaactg ggtcgtctgg cttactaccc agaagcgctg 2520
gttaaatacc gtctgcacgc caaccaggtt agctccaaat actccatccg tcagcacgaa 2580gttaaatacc gtctgcacgc caaccaggtt agctccaaat actccatccg tcagcacgaa 2580
atcgcacagg gtatccagaa aacggctcgt aacgacttcc tgcagtccat gggtttcaaa 2640atcgcacagg gtatccagaa aacggctcgt aacgacttcc tgcagtccat gggtttcaaa 2640
acccgtttcg actctctgga gtaccgtcag atcaaagcgg ttgcgtatga gctgctggag 2700acccgtttcg actctctgga gtaccgtcag atcaaagcgg ttgcgtatga gctgctggag 2700
aaacacctgc cggaagagga ctttgaacgt gcgcgtcgtt tcctgtacca gtgcttcaaa 2760aaacacctgc cggaagagga ctttgaacgt gcgcgtcgtt tcctgtacca gtgcttcaaa 2760
cgtaccgaca ctctgccggc gggtgcatgg ctcgactttg cagcggatgg tcgtatgcgt 2820cgtaccgaca ctctgccggc gggtgcatgg ctcgactttg cagcggatgg tcgtatgcgt 2820
cgtctgttta ccctgcgtca gtacttcggt atcctgcatc gtctcctgaa aaaccgctaa 2880cgtctgttta ccctgcgtca gtacttcggt atcctgcatc gtctcctgaa aaaccgctaa 2880
tgatttcgtc gacacacagg aaacatatta aaaattaaaa cctgcaggag tttaaacgcg 2940tgatttcgtc gacacacagg aaacatatta aaaattaaaa cctgcaggag tttaaacgcg 2940
gccgcgatat cgttgtaaaa cgacggccag tgcaagaatc ataaaaaatt tatttgcttt 3000gccgcgatat cgttgtaaaa cgacggccag tgcaagaatc ataaaaaatt tatttgcttt 3000
caggaaaatt tttctgtata atagattcat aaatttgaga gaggagtttt tgtgagcgga 3060caggaaaatt tttctgtata atagattcat aaatttgaga gaggagtttt tgtgagcgga 3060
taacaattcc ccatcttagt atattagtta agtataaata cacaaggaga tataccatga 3120taacaattcc ccatcttagt atattagtta agtataaata cacaaggaga tataccatga 3120
cgcaatttaa tcccgttgat catccacatc gccgctacaa cccgctcacc gggcaatgga 3180cgcaatttaa tcccgttgat catccacatc gccgctacaa cccgctcacc gggcaatgga 3180
ttctggtttc accgcaccgc gctaagcgcc cctggcaggg ggcgcaggaa acgccagcca 3240ttctggtttc accgcaccgc gctaagcgcc cctggcaggg ggcgcaggaa acgccagcca 3240
aacaggtgtt acctgcgcac gatccagatt gcttcctctg cgcaggtaat gtgcgggtga 3300aacaggtgtt acctgcgcac gatccagatt gcttcctctg cgcaggtaat gtgcgggtga 3300
caggcgataa aaaccccgat tacaccggga cttacgtttt cactaatgac tttgcggctt 3360caggcgataa aaaccccgat tacaccggga cttacgtttt cactaatgac tttgcggctt 3360
tgatgtctga cacgccagat gcgccagaaa gtcacgatcc gctgatgcgt tgccagagcg 3420tgatgtctga cacgccagat gcgccagaaa gtcacgatcc gctgatgcgt tgccagagcg 3420
cgcgcggcac cagccgggtg atctgctttt caccggatca cagtaaaacg ctgccagagc 3480cgcgcggcac cagccgggtg atctgctttt caccggatca cagtaaaacg ctgccagagc 3480
tcagcgttgc agcattgacg gaaatcgtca aaacctggca ggagcaaacc gcagaactgg 3540tcagcgttgc agcattgacg gaaatcgtca aaacctggca ggagcaaacc gcagaactgg 3540
ggaaaacgta cccatgggtg caggtttttg aaaacaaagg cgcggcgatg ggctgctcta 3600ggaaaacgta cccatgggtg caggtttttg aaaacaaagg cgcggcgatg ggctgctcta 3600
acccgcatcc gcacggtcag atttgggcaa atagcttcct gcctaacgaa gctgagcgcg 3660acccgcatcc gcacggtcag atttgggcaa atagcttcct gcctaacgaa gctgagcgcg 3660
aagaccgcct gcaaaaagaa tattttgccg aacagaaatc accaatgctg gtggattatg 3720aagaccgcct gcaaaaagaa tattttgccg aacagaaatc accaatgctg gtggattatg 3720
ttcagcgcga gctggcagac ggtagccgta ccgttgtcga aaccgaacac tggttagccg 3780ttcagcgcga gctggcagac ggtagccgta ccgttgtcga aaccgaacac tggttagccg 3780
tcgtgcctta ctgggctgcc tggccgttcg aaacgctact gctgcccaaa gcccacgttt 3840tcgtgcctta ctgggctgcc tggccgttcg aaacgctact gctgcccaaa gcccacgttt 3840
tacggatcac cgatttgacc gacgcccagc gcagcgatct ggcgctggcg ttgaaaaagc 3900tacggatcac cgatttgacc gacgcccagc gcagcgatct ggcgctggcg ttgaaaaagc 3900
tgaccagtcg ttatgacaac ctcttccagt gctccttccc ctactctatg ggctggcacg 3960tgaccagtcg ttatgacaac ctcttccagt gctccttccc ctactctatg ggctggcacg 3960
gcgcgccatt taatggcgaa gagaatcaac actggcagct gcacgcgcac ttttatccgc 4020gcgcgccatt taatggcgaa gagaatcaac actggcagct gcacgcgcac ttttatccgc 4020
ctctgctgcg ctccgccacc gtacgtaaat ttatggttgg ttatgaaatg ctggcagaga 4080ctctgctgcg ctccgccacc gtacgtaaat ttatggttgg ttatgaaatg ctggcagaga 4080
cccagcgaga cctgaccgca gaacaggcag cagagcgttt gcgcgcagtc agcgatatcc 4140cccagcgaga cctgaccgca gaacaggcag cagagcgttt gcgcgcagtc agcgatatcc 4140
attttcgcga atccggagtg taacgcggag gcgcgccatt taaatcaacc tcagcggtca 4200attttcgcga atccggagtg taacgcggag gcgcgccatt taaatcaacc tcagcggtca 4200
tagctgtttc ctgtgactga gcaataacta gcataacccc ttggggcctc taaacgggtc 4260tagctgtttc ctgtgactga gcaataacta gcataacccc ttggggcctc taaacgggtc 4260
ttgaggggtt ttttgctgaa accaatttgc ctggcggcag tagcgcggtg gtcccacctg 4320ttgaggggtt ttttgctgaa accaatttgc ctggcggcag tagcgcggtg gtcccacctg 4320
accccatgcc gaactcagaa gtgaaacgcc gtagcgccga tggtagtgtg gggtctcccc 4380accccatgcc gaactcagaa gtgaaacgcc gtagcgccga tggtagtgtg gggtctcccc 4380
atgcgagagt agggaactgc caggcatcaa ataaaacgaa aggctcagtc gaaagactgg 4440atgcgagagt agggaactgc caggcatcaa ataaaacgaa aggctcagtc gaaagactgg 4440
gcctttcggg atccaggccg gcctgttaac gaattaatct tccgcggcaa caaaaattag 4500gcctttcggg atccaggccg gcctgttaac gaattaatct tccgcggcaa caaaaattag 4500
gaattaatca tctggccaat ttcaggtggc acttttcggg cagaccgggg acttatcagc 4560gaattaatca tctggccaat ttcaggtggc acttttcggg cagaccgggg acttatcagc 4560
caacctgt 4568caacctgt 4568
<---<---
Claims (23)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP17183391.6 | 2017-07-26 | ||
| EP17183391 | 2017-07-26 | ||
| PCT/EP2018/070214 WO2019020707A1 (en) | 2017-07-26 | 2018-07-25 | Sialyltransferases and their use in producing sialylated oligosaccharides |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| RU2020105821A RU2020105821A (en) | 2021-08-26 |
| RU2020105821A3 RU2020105821A3 (en) | 2021-11-19 |
| RU2822039C2 true RU2822039C2 (en) | 2024-06-28 |
Family
ID=
Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2001077314A1 (en) * | 2000-04-11 | 2001-10-18 | Kyowa Hakko Kogyo Co., Ltd. | MODIFIED α2,3-SIALYLTRANSFERASE GENE AND PROCESS FOR PRODUCING α2,3-SIALYLTRANSFERASE AND COMPLEX SACCHARIDE CONTAINING SIALIC ACID |
| US20050089956A1 (en) * | 2001-09-26 | 2005-04-28 | Kyowa Hakko Kogyo Co. Ltd. | Process for producing alpha 2,3/ alpha 2,8-sialyltransferase and sialic acid-containing complex sugar |
| US7968310B2 (en) * | 2006-02-01 | 2011-06-28 | Biogenerix Ag | Tagged sialyltransferase proteins |
| EP2441832A1 (en) * | 2009-06-12 | 2012-04-18 | Japan Tobacco, Inc. | Novel protein and gene that codes therefor |
| US8187838B2 (en) * | 2006-03-14 | 2012-05-29 | Japan Tobacco Inc. | β-Galactoside-α2, 6-sialyltransferase, a gene encoding thereof, and a method for producing thereof |
| US8187853B2 (en) * | 2007-03-02 | 2012-05-29 | Japan Tobacco Inc. | β-galactoside-α2,6-sialyltransferase, a gene encoding thereof, and a method for enhancing enzyme activity |
| WO2014153253A1 (en) * | 2013-03-14 | 2014-09-25 | Glycosyn LLC | Microorganisms and methods for producing sialylated and n-acetylglucosamine-containing oligosaccharides |
| RU2560190C2 (en) * | 2010-11-23 | 2015-08-20 | Нестек С.А. | Oligosaccharide mixture and food product containing said mixture, particularly infant formula |
| WO2017101958A1 (en) * | 2015-12-18 | 2017-06-22 | Glycom A/S | Fermentative production of oligosaccharides |
Patent Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2001077314A1 (en) * | 2000-04-11 | 2001-10-18 | Kyowa Hakko Kogyo Co., Ltd. | MODIFIED α2,3-SIALYLTRANSFERASE GENE AND PROCESS FOR PRODUCING α2,3-SIALYLTRANSFERASE AND COMPLEX SACCHARIDE CONTAINING SIALIC ACID |
| US20050089956A1 (en) * | 2001-09-26 | 2005-04-28 | Kyowa Hakko Kogyo Co. Ltd. | Process for producing alpha 2,3/ alpha 2,8-sialyltransferase and sialic acid-containing complex sugar |
| US7968310B2 (en) * | 2006-02-01 | 2011-06-28 | Biogenerix Ag | Tagged sialyltransferase proteins |
| US8187838B2 (en) * | 2006-03-14 | 2012-05-29 | Japan Tobacco Inc. | β-Galactoside-α2, 6-sialyltransferase, a gene encoding thereof, and a method for producing thereof |
| US8187853B2 (en) * | 2007-03-02 | 2012-05-29 | Japan Tobacco Inc. | β-galactoside-α2,6-sialyltransferase, a gene encoding thereof, and a method for enhancing enzyme activity |
| EP2441832A1 (en) * | 2009-06-12 | 2012-04-18 | Japan Tobacco, Inc. | Novel protein and gene that codes therefor |
| RU2560190C2 (en) * | 2010-11-23 | 2015-08-20 | Нестек С.А. | Oligosaccharide mixture and food product containing said mixture, particularly infant formula |
| WO2014153253A1 (en) * | 2013-03-14 | 2014-09-25 | Glycosyn LLC | Microorganisms and methods for producing sialylated and n-acetylglucosamine-containing oligosaccharides |
| WO2017101958A1 (en) * | 2015-12-18 | 2017-06-22 | Glycom A/S | Fermentative production of oligosaccharides |
Non-Patent Citations (1)
| Title |
|---|
| CHEN X. Human Milk Oligosaccharides (HMOS): Structure, Function, and Enzyme-Catalyzed Synthesis. Adv Carbohydr Chem Biochem. 2015;72:113-90. https://doi.org/10.1016/bs.accb.2015.08.002. * |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN111133112B (en) | Sialyltransferase and its use in producing sialylated oligosaccharides | |
| AU2019278599B2 (en) | Fermentative production of sialylated saccharides | |
| CN113195509B (en) | α-1,3-Fucosyltransferase for the production of 3-fucosyllactose and inversion of lactose | |
| AU2018296557B2 (en) | Fucosyltransferases and their use in producing fucosylated oligosaccharides | |
| KR102726984B1 (en) | Fermentative production of N-acetylneuraminic acid | |
| AU2017385601B2 (en) | In vivo synthesis of sialylated compounds | |
| WO2018077892A1 (en) | Improved process for the production of fucosylated oligosaccharides | |
| CN114466934A (en) | Production of fucosyllactose in host cells | |
| KR20220116243A (en) | Lactose converting alpha-1,2-fucosyltransferase enzyme | |
| CN116745430A (en) | Production of oligosaccharides comprising LN3 as core structure in host cells | |
| KR20220042350A (en) | Biosynthesis of enzymes for use in the treatment of maple diabetes mellitus (MSUD) | |
| RU2822039C2 (en) | Sialyltransferases and use thereof in producing sialylated oligosaccharides | |
| US12344875B2 (en) | Sialyltransferases for the production of sialylated oligosaccharides | |
| HK40021260A (en) | Sialyltransferases and their use in producing sialylated oligosaccharides | |
| RU2819876C2 (en) | Enzymatic production of sialylated saccharides | |
| DK202200591A1 (en) | New sialyltransferases for in vivo synthesis of lst-c | |
| HK40038571A (en) | Fermentative production of sialylated saccharides | |
| TW202221137A (en) | Production of fucosylated lactose structures by a cell | |
| AU2023236935A1 (en) | Sialyltransferases for the production of sialylated oligosaccharides | |
| HK40019749A (en) | Fucosyltransferases and their use in producing fucosylated oligosaccharides |