RU2819876C2 - Enzymatic production of sialylated saccharides - Google Patents
Enzymatic production of sialylated saccharides Download PDFInfo
- Publication number
- RU2819876C2 RU2819876C2 RU2020139928A RU2020139928A RU2819876C2 RU 2819876 C2 RU2819876 C2 RU 2819876C2 RU 2020139928 A RU2020139928 A RU 2020139928A RU 2020139928 A RU2020139928 A RU 2020139928A RU 2819876 C2 RU2819876 C2 RU 2819876C2
- Authority
- RU
- Russia
- Prior art keywords
- leu
- ile
- lys
- glu
- ser
- Prior art date
Links
- 150000001720 carbohydrates Chemical class 0.000 title claims abstract description 65
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 26
- 230000002255 enzymatic effect Effects 0.000 title claims abstract description 23
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 113
- 230000000694 effects Effects 0.000 claims abstract description 82
- 108090000141 Sialyltransferases Proteins 0.000 claims abstract description 80
- 102000003838 Sialyltransferases Human genes 0.000 claims abstract description 80
- 241000588724 Escherichia coli Species 0.000 claims abstract description 77
- SQVRNKJHWKZAKO-UHFFFAOYSA-N beta-N-Acetyl-D-neuraminic acid Natural products CC(=O)NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO SQVRNKJHWKZAKO-UHFFFAOYSA-N 0.000 claims abstract description 66
- 230000014509 gene expression Effects 0.000 claims abstract description 52
- 238000000034 method Methods 0.000 claims abstract description 48
- SQVRNKJHWKZAKO-PFQGKNLYSA-N N-acetyl-beta-neuraminic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)O[C@H]1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-PFQGKNLYSA-N 0.000 claims abstract description 46
- SQVRNKJHWKZAKO-OQPLDHBCSA-N sialic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)OC1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-OQPLDHBCSA-N 0.000 claims abstract description 32
- 230000006696 biosynthetic metabolic pathway Effects 0.000 claims abstract description 21
- 108010043841 Glucosamine 6-Phosphate N-Acetyltransferase Proteins 0.000 claims abstract description 19
- 102000002740 Glucosamine 6-Phosphate N-Acetyltransferase Human genes 0.000 claims abstract description 19
- 238000000855 fermentation Methods 0.000 claims abstract description 18
- 230000004151 fermentation Effects 0.000 claims abstract description 18
- 102000003960 Ligases Human genes 0.000 claims abstract description 16
- 108090000364 Ligases Proteins 0.000 claims abstract description 16
- 102000048245 N-acetylneuraminate lyases Human genes 0.000 claims abstract description 10
- 108700023220 N-acetylneuraminate lyases Proteins 0.000 claims abstract description 10
- 101710200202 N-acetylgalactosamine-6-phosphate deacetylase Proteins 0.000 claims abstract description 9
- 108010062110 water dikinase pyruvate Proteins 0.000 claims abstract description 9
- 108010010750 N-acetylmannosamine-6-phosphate epimerase Proteins 0.000 claims abstract description 8
- -1 fuculokinase Proteins 0.000 claims abstract description 8
- 102100041034 Glucosamine-6-phosphate isomerase 1 Human genes 0.000 claims abstract description 7
- 108010069483 N-acetylglucosamine-6-phosphate deacetylase Proteins 0.000 claims abstract description 7
- 108010022717 glucosamine-6-phosphate isomerase Proteins 0.000 claims abstract description 7
- 101150066555 lacZ gene Proteins 0.000 claims abstract description 6
- 108010084372 D-arabinose isomerase Proteins 0.000 claims abstract description 5
- 101150035354 araA gene Proteins 0.000 claims abstract description 4
- 230000000813 microbial effect Effects 0.000 claims description 130
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 115
- 239000002773 nucleotide Substances 0.000 claims description 91
- 125000003729 nucleotide group Chemical group 0.000 claims description 91
- 229920001184 polypeptide Polymers 0.000 claims description 63
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 63
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 63
- 150000007523 nucleic acids Chemical class 0.000 claims description 45
- 108020004707 nucleic acids Proteins 0.000 claims description 44
- 102000039446 nucleic acids Human genes 0.000 claims description 44
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acelyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 claims description 31
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 claims description 31
- 239000012634 fragment Substances 0.000 claims description 24
- BRGMHAYQAZFZDJ-PVFLNQBWSA-N N-Acetylglucosamine 6-phosphate Chemical compound CC(=O)N[C@H]1[C@@H](O)O[C@H](COP(O)(O)=O)[C@@H](O)[C@@H]1O BRGMHAYQAZFZDJ-PVFLNQBWSA-N 0.000 claims description 20
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 claims description 19
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 claims description 19
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 claims description 19
- 229930006000 Sucrose Natural products 0.000 claims description 19
- 239000005720 sucrose Substances 0.000 claims description 19
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 claims description 18
- 229950006780 n-acetylglucosamine Drugs 0.000 claims description 18
- DVGKRPYUFRZAQW-UHFFFAOYSA-N 3 prime Natural products CC(=O)NC1OC(CC(O)C1C(O)C(O)CO)(OC2C(O)C(CO)OC(OC3C(O)C(O)C(O)OC3CO)C2O)C(=O)O DVGKRPYUFRZAQW-UHFFFAOYSA-N 0.000 claims description 15
- AXQLFFDZXPOFPO-UHFFFAOYSA-N UNPD216 Natural products O1C(CO)C(O)C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(=O)C)C1OC(C1O)C(O)C(CO)OC1OC1C(O)C(O)C(O)OC1CO AXQLFFDZXPOFPO-UHFFFAOYSA-N 0.000 claims description 15
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 claims description 15
- TYALNJQZQRNQNQ-JLYOMPFMSA-N alpha-Neup5Ac-(2->6)-beta-D-Galp-(1->4)-beta-D-Glcp Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)OC[C@@H]1[C@H](O)[C@H](O)[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)O[C@@H]2CO)O)O1 TYALNJQZQRNQNQ-JLYOMPFMSA-N 0.000 claims description 15
- AXQLFFDZXPOFPO-UNTPKZLMSA-N beta-D-Galp-(1->3)-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-beta-D-Glcp Chemical compound O([C@@H]1O[C@H](CO)[C@H](O)[C@@H]([C@H]1O)O[C@H]1[C@@H]([C@H]([C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O1)O)NC(=O)C)[C@H]1[C@H](O)[C@@H](O)[C@H](O)O[C@@H]1CO AXQLFFDZXPOFPO-UNTPKZLMSA-N 0.000 claims description 15
- USIPEGYTBGEPJN-UHFFFAOYSA-N lacto-N-tetraose Natural products O1C(CO)C(O)C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(=O)C)C1OC1C(O)C(CO)OC(OC(C(O)CO)C(O)C(O)C=O)C1O USIPEGYTBGEPJN-UHFFFAOYSA-N 0.000 claims description 15
- 239000000758 substrate Substances 0.000 claims description 15
- 229930182830 galactose Natural products 0.000 claims description 14
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 13
- 229910052799 carbon Inorganic materials 0.000 claims description 13
- TYALNJQZQRNQNQ-UHFFFAOYSA-N #alpha;2,6-sialyllactose Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OCC1C(O)C(O)C(O)C(OC2C(C(O)C(O)OC2CO)O)O1 TYALNJQZQRNQNQ-UHFFFAOYSA-N 0.000 claims description 12
- CILYIEBUXJIHCO-UHFFFAOYSA-N 102778-91-6 Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OC1C(O)C(OC2C(C(O)C(O)OC2CO)O)OC(CO)C1O CILYIEBUXJIHCO-UHFFFAOYSA-N 0.000 claims description 12
- CILYIEBUXJIHCO-UITFWXMXSA-N N-acetyl-alpha-neuraminyl-(2->3)-beta-D-galactosyl-(1->4)-beta-D-glucose Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)O[C@@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)O[C@@H]2CO)O)O[C@H](CO)[C@@H]1O CILYIEBUXJIHCO-UITFWXMXSA-N 0.000 claims description 12
- OIZGSVFYNBZVIK-UHFFFAOYSA-N N-acetylneuraminosyl-D-lactose Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OC1C(O)C(OC(C(O)CO)C(O)C(O)C=O)OC(CO)C1O OIZGSVFYNBZVIK-UHFFFAOYSA-N 0.000 claims description 12
- 230000000295 complement effect Effects 0.000 claims description 12
- 238000012546 transfer Methods 0.000 claims description 11
- 125000005629 sialic acid group Chemical group 0.000 claims description 10
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 claims description 9
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 claims description 8
- QUOQJNYANJQSDA-MHQSSNGYSA-N Sialyllacto-N-tetraose a Chemical compound O1C([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)O[C@@H]1[C@@H](O)[C@H](OC2[C@H]([C@H](OC3[C@H]([C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]3O)O)O[C@H](CO)[C@H]2O)NC(C)=O)O[C@H](CO)[C@@H]1O QUOQJNYANJQSDA-MHQSSNGYSA-N 0.000 claims description 8
- 239000008101 lactose Substances 0.000 claims description 8
- OVRNDRQMDRJTHS-CBQIKETKSA-N N-Acetyl-D-Galactosamine Chemical compound CC(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@H](O)[C@@H]1O OVRNDRQMDRJTHS-CBQIKETKSA-N 0.000 claims description 7
- 102100033341 N-acetylmannosamine kinase Human genes 0.000 claims description 7
- 108010029147 N-acylmannosamine kinase Proteins 0.000 claims description 7
- SFMRPVLZMVJKGZ-JRZQLMJNSA-N Sialyllacto-N-tetraose b Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)OC[C@@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](NC(C)=O)[C@H](O[C@@H]2[C@H]([C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]2O)O)O1 SFMRPVLZMVJKGZ-JRZQLMJNSA-N 0.000 claims description 7
- MSWZFWKMSRAUBD-UHFFFAOYSA-N beta-D-galactosamine Natural products NC1C(O)OC(CO)C(O)C1O MSWZFWKMSRAUBD-UHFFFAOYSA-N 0.000 claims description 7
- MSWZFWKMSRAUBD-IVMDWMLBSA-N 2-amino-2-deoxy-D-glucopyranose Chemical compound N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O MSWZFWKMSRAUBD-IVMDWMLBSA-N 0.000 claims description 6
- RPSBVJXBTXEJJG-RAMSCCQBSA-N 6-Sialyl-N-acetyllactosamine Chemical compound O[C@@H]1[C@@H](NC(=O)C)C(O)O[C@H](CO)[C@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO[C@@]2(O[C@H]([C@H](NC(C)=O)[C@@H](O)C2)[C@H](O)[C@H](O)CO)C(O)=O)O1 RPSBVJXBTXEJJG-RAMSCCQBSA-N 0.000 claims description 6
- 229960002442 glucosamine Drugs 0.000 claims description 6
- SXMGGNXBTZBGLU-UHFFFAOYSA-N sialyllacto-n-tetraose c Chemical compound OCC1OC(OC2C(C(OC(C(O)CO)C(O)C(O)C=O)OC(CO)C2O)O)C(NC(=O)C)C(O)C1OC(C(C(O)C1O)O)OC1COC1(C(O)=O)CC(O)C(NC(C)=O)C(C(O)C(O)CO)O1 SXMGGNXBTZBGLU-UHFFFAOYSA-N 0.000 claims description 6
- MBLBDJOUHNCFQT-UHFFFAOYSA-N N-acetyl-D-galactosamine Natural products CC(=O)NC(C=O)C(O)C(O)C(O)CO MBLBDJOUHNCFQT-UHFFFAOYSA-N 0.000 claims description 5
- SNFSYLYCDAVZGP-UHFFFAOYSA-N UNPD26986 Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(OC(O)C(O)C2O)CO)OC(CO)C(O)C1O SNFSYLYCDAVZGP-UHFFFAOYSA-N 0.000 claims description 5
- FCIROHDMPFOSFG-LAVSNGQLSA-N disialyllacto-N-tetraose Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)OC[C@@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O[C@]3(O[C@H]([C@H](NC(C)=O)[C@@H](O)C3)[C@H](O)[C@H](O)CO)C(O)=O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](NC(C)=O)[C@H](O[C@@H]2[C@H]([C@H](O[C@H]3[C@@H]([C@@H](O)C(O)O[C@@H]3CO)O)O[C@H](CO)[C@@H]2O)O)O1 FCIROHDMPFOSFG-LAVSNGQLSA-N 0.000 claims description 5
- IEQCXFNWPAHHQR-UHFFFAOYSA-N lacto-N-neotetraose Natural products OCC1OC(OC2C(C(OC3C(OC(O)C(O)C3O)CO)OC(CO)C2O)O)C(NC(=O)C)C(O)C1OC1OC(CO)C(O)C(O)C1O IEQCXFNWPAHHQR-UHFFFAOYSA-N 0.000 claims description 5
- 229940062780 lacto-n-neotetraose Drugs 0.000 claims description 5
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 claims description 4
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 claims description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 4
- RJTOFDPWCJDYFZ-SPVZFZGWSA-N Lacto-N-triaose Chemical compound CC(=O)N[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]1O RJTOFDPWCJDYFZ-SPVZFZGWSA-N 0.000 claims description 4
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 claims description 4
- 239000008103 glucose Substances 0.000 claims description 4
- RBMYDHMFFAVMMM-PLQWBNBWSA-N neolactotetraose Chemical compound O([C@H]1[C@H](O)[C@H]([C@@H](O[C@@H]1CO)O[C@@H]1[C@H]([C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]1O)O)NC(=O)C)[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O RBMYDHMFFAVMMM-PLQWBNBWSA-N 0.000 claims description 4
- KFEUJDWYNGMDBV-UHFFFAOYSA-N (N-Acetyl)-glucosamin-4-beta-galaktosid Natural products OC1C(NC(=O)C)C(O)OC(CO)C1OC1C(O)C(O)C(O)C(CO)O1 KFEUJDWYNGMDBV-UHFFFAOYSA-N 0.000 claims description 3
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 claims description 3
- 229940062827 2'-fucosyllactose Drugs 0.000 claims description 3
- HWHQUWQCBPAQQH-UHFFFAOYSA-N 2-O-alpha-L-Fucosyl-lactose Natural products OC1C(O)C(O)C(C)OC1OC1C(O)C(O)C(CO)OC1OC(C(O)CO)C(O)C(O)C=O HWHQUWQCBPAQQH-UHFFFAOYSA-N 0.000 claims description 3
- ODDPRQJTYDIWJU-UHFFFAOYSA-N 3'-beta-D-galactopyranosyl-lactose Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(OC2C(OC(O)C(O)C2O)CO)OC(CO)C1O ODDPRQJTYDIWJU-UHFFFAOYSA-N 0.000 claims description 3
- AUNPEJDACLEKSC-ZAYDSPBTSA-N 3-fucosyllactose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@H](OC(O)[C@H](O)[C@H]2O)CO)O[C@H](CO)[C@@H]1O AUNPEJDACLEKSC-ZAYDSPBTSA-N 0.000 claims description 3
- WJPIUUDKRHCAEL-UHFFFAOYSA-N 3FL Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(CO)OC(O)C1O WJPIUUDKRHCAEL-UHFFFAOYSA-N 0.000 claims description 3
- 229930091371 Fructose Natural products 0.000 claims description 3
- 239000005715 Fructose Substances 0.000 claims description 3
- KFEUJDWYNGMDBV-LODBTCKLSA-N N-acetyllactosamine Chemical compound O[C@@H]1[C@@H](NC(=O)C)[C@H](O)O[C@H](CO)[C@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 KFEUJDWYNGMDBV-LODBTCKLSA-N 0.000 claims description 3
- HESSGHHCXGBPAJ-UHFFFAOYSA-N N-acetyllactosamine Natural products CC(=O)NC(C=O)C(O)C(C(O)CO)OC1OC(CO)C(O)C(O)C1O HESSGHHCXGBPAJ-UHFFFAOYSA-N 0.000 claims description 3
- HMQPEDMEOBLSQB-RCBHQUQDSA-N beta-D-Galp-(1->3)-alpha-D-GlcpNAc Chemical compound CC(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HMQPEDMEOBLSQB-RCBHQUQDSA-N 0.000 claims description 3
- ODDPRQJTYDIWJU-OAUIKNEUSA-N beta-D-Galp-(1->3)-beta-D-Galp-(1->4)-beta-D-Glcp Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@H](O[C@@H](O)[C@H](O)[C@H]2O)CO)O[C@H](CO)[C@@H]1O ODDPRQJTYDIWJU-OAUIKNEUSA-N 0.000 claims description 3
- 230000001771 impaired effect Effects 0.000 claims description 3
- 229930191176 lacto-N-biose Natural products 0.000 claims description 3
- JCQLYHFGKNRPGE-FCVZTGTOSA-N lactulose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 JCQLYHFGKNRPGE-FCVZTGTOSA-N 0.000 claims description 3
- 229960000511 lactulose Drugs 0.000 claims description 3
- PFCRQPBOOFTZGQ-UHFFFAOYSA-N lactulose keto form Natural products OCC(=O)C(O)C(C(O)CO)OC1OC(CO)C(O)C(O)C1O PFCRQPBOOFTZGQ-UHFFFAOYSA-N 0.000 claims description 3
- TVVLIFCVJJSLBL-SEHWTJTBSA-N Lacto-N-fucopentaose V Chemical compound O[C@H]1C(O)C(O)[C@H](C)O[C@H]1OC([C@@H](O)C=O)[C@@H](C(O)CO)O[C@H]1[C@H](O)[C@@H](OC2[C@@H](C(OC3[C@@H](C(O)C(O)[C@@H](CO)O3)O)[C@H](O)[C@@H](CO)O2)NC(C)=O)[C@@H](O)[C@@H](CO)O1 TVVLIFCVJJSLBL-SEHWTJTBSA-N 0.000 claims description 2
- FZIVHOUANIQOMU-YIHIYSSUSA-N alpha-L-Fucp-(1->2)-beta-D-Galp-(1->3)-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-D-Glcp Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@@H]2[C@H]([C@H](O[C@@H]3[C@H]([C@H](O[C@@H]4[C@H](OC(O)[C@H](O)[C@H]4O)CO)O[C@H](CO)[C@@H]3O)O)O[C@H](CO)[C@H]2O)NC(C)=O)O[C@H](CO)[C@H](O)[C@@H]1O FZIVHOUANIQOMU-YIHIYSSUSA-N 0.000 claims description 2
- CMQZRJBJDCVIEY-JEOLMMCMSA-N alpha-L-Fucp-(1->3)-[beta-D-Galp-(1->4)]-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-D-Glcp Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](CO)O[C@@H](O[C@@H]2[C@H]([C@H](O[C@@H]3[C@H](OC(O)[C@H](O)[C@H]3O)CO)O[C@H](CO)[C@@H]2O)O)[C@@H]1NC(C)=O CMQZRJBJDCVIEY-JEOLMMCMSA-N 0.000 claims description 2
- DUKURNFHYQXCJG-JEOLMMCMSA-N alpha-L-Fucp-(1->4)-[beta-D-Galp-(1->3)]-beta-D-GlcpNAc-(1->3)-beta-D-Galp-(1->4)-D-Glcp Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](NC(C)=O)[C@H](O[C@@H]2[C@H]([C@H](O[C@@H]3[C@H](OC(O)[C@H](O)[C@H]3O)CO)O[C@H](CO)[C@@H]2O)O)O[C@@H]1CO DUKURNFHYQXCJG-JEOLMMCMSA-N 0.000 claims description 2
- FZIVHOUANIQOMU-UHFFFAOYSA-N lacto-N-fucopentaose I Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(OC3C(C(OC4C(OC(O)C(O)C4O)CO)OC(CO)C3O)O)OC(CO)C2O)NC(C)=O)OC(CO)C(O)C1O FZIVHOUANIQOMU-UHFFFAOYSA-N 0.000 claims description 2
- FKADDOYBRRMBPP-UHFFFAOYSA-N lacto-N-fucopentaose II Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(C)=O)C(OC2C(C(OC(C(O)CO)C(O)C(O)C=O)OC(CO)C2O)O)OC1CO FKADDOYBRRMBPP-UHFFFAOYSA-N 0.000 claims description 2
- CMQZRJBJDCVIEY-UHFFFAOYSA-N lacto-N-fucopentaose III Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(CO)OC(OC2C(C(OC3C(OC(O)C(O)C3O)CO)OC(CO)C2O)O)C1NC(C)=O CMQZRJBJDCVIEY-UHFFFAOYSA-N 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 5
- 102100031324 N-acetylglucosamine-6-phosphate deacetylase Human genes 0.000 claims 2
- HWHQUWQCBPAQQH-BWRPKUOHSA-N 2-fucosyllactose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O HWHQUWQCBPAQQH-BWRPKUOHSA-N 0.000 claims 1
- OVRNDRQMDRJTHS-OZRXBMAMSA-N N-acetyl-beta-D-mannosamine Chemical compound CC(=O)N[C@@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-OZRXBMAMSA-N 0.000 claims 1
- 150000002712 melibioses Chemical class 0.000 claims 1
- 102100031317 Alpha-N-acetylgalactosaminidase Human genes 0.000 abstract description 6
- 239000000126 substance Substances 0.000 abstract description 3
- 238000012258 culturing Methods 0.000 abstract description 2
- OIRDTQYFTABQOQ-UHTZMRCNSA-N Vidarabine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@@H]1O OIRDTQYFTABQOQ-UHTZMRCNSA-N 0.000 abstract 1
- OIRDTQYFTABQOQ-UHFFFAOYSA-N ara-adenosine Natural products Nc1ncnc2n(cnc12)C1OC(CO)C(O)C1O OIRDTQYFTABQOQ-UHFFFAOYSA-N 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 189
- 102000004169 proteins and genes Human genes 0.000 description 41
- 108020004414 DNA Proteins 0.000 description 38
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 37
- 102000004190 Enzymes Human genes 0.000 description 36
- 108090000790 Enzymes Proteins 0.000 description 36
- 229920001542 oligosaccharide Polymers 0.000 description 33
- 150000002482 oligosaccharides Chemical class 0.000 description 33
- 150000001413 amino acids Chemical group 0.000 description 26
- 108010009298 lysylglutamic acid Proteins 0.000 description 26
- 108010050848 glycylleucine Proteins 0.000 description 25
- 238000006243 chemical reaction Methods 0.000 description 24
- 108010034529 leucyl-lysine Proteins 0.000 description 24
- 230000015572 biosynthetic process Effects 0.000 description 23
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 22
- 230000003834 intracellular effect Effects 0.000 description 21
- GSXOAOHZAIYLCY-UHFFFAOYSA-N D-F6P Natural products OCC(=O)C(O)C(O)C(O)COP(O)(O)=O GSXOAOHZAIYLCY-UHFFFAOYSA-N 0.000 description 20
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 20
- 108010057821 leucylproline Proteins 0.000 description 19
- 108010012581 phenylalanylglutamate Proteins 0.000 description 19
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 18
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 17
- 108010089804 glycyl-threonine Proteins 0.000 description 17
- 108010051110 tyrosyl-lysine Proteins 0.000 description 17
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 16
- 108090000340 Transaminases Proteins 0.000 description 16
- 238000013518 transcription Methods 0.000 description 16
- 230000035897 transcription Effects 0.000 description 16
- 108010035265 N-acetylneuraminate synthase Proteins 0.000 description 15
- 102100029954 Sialic acid synthase Human genes 0.000 description 15
- 102000003929 Transaminases Human genes 0.000 description 15
- BGWGXPAPYGQALX-ARQDHWQXSA-N beta-D-fructofuranose 6-phosphate Chemical compound OC[C@@]1(O)O[C@H](COP(O)(O)=O)[C@@H](O)[C@@H]1O BGWGXPAPYGQALX-ARQDHWQXSA-N 0.000 description 15
- 108010054155 lysyllysine Proteins 0.000 description 15
- 244000005700 microbiome Species 0.000 description 15
- 108010073969 valyllysine Proteins 0.000 description 15
- 108010068265 aspartyltyrosine Proteins 0.000 description 14
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 14
- 239000000203 mixture Substances 0.000 description 14
- 108010026333 seryl-proline Proteins 0.000 description 14
- 108010061238 threonyl-glycine Proteins 0.000 description 14
- 241000589875 Campylobacter jejuni Species 0.000 description 13
- OVRNDRQMDRJTHS-RTRLPJTCSA-N N-acetyl-D-glucosamine Chemical compound CC(=O)N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-RTRLPJTCSA-N 0.000 description 13
- 108010005233 alanylglutamic acid Proteins 0.000 description 13
- 108010092854 aspartyllysine Proteins 0.000 description 13
- 108010005774 beta-Galactosidase Proteins 0.000 description 13
- 108010085325 histidylproline Proteins 0.000 description 13
- 238000013519 translation Methods 0.000 description 13
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 12
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 12
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 12
- 108010020764 Transposases Proteins 0.000 description 12
- 102000008579 Transposases Human genes 0.000 description 12
- 108010047495 alanylglycine Proteins 0.000 description 12
- 108010038633 aspartylglutamate Proteins 0.000 description 12
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 12
- 108010003700 lysyl aspartic acid Proteins 0.000 description 12
- 108010064235 lysylglycine Proteins 0.000 description 12
- 102100026189 Beta-galactosidase Human genes 0.000 description 11
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 11
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 11
- 108090001066 Racemases and epimerases Proteins 0.000 description 11
- 102000004879 Racemases and epimerases Human genes 0.000 description 11
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 11
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 11
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 11
- 108010051242 phenylalanylserine Proteins 0.000 description 11
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 10
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 10
- 108010092114 histidylphenylalanine Proteins 0.000 description 10
- 108010017391 lysylvaline Proteins 0.000 description 10
- 150000002772 monosaccharides Chemical class 0.000 description 10
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 9
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 9
- OVRNDRQMDRJTHS-UOLFYFMNSA-N N-acetyl-alpha-D-mannosamine Chemical compound CC(=O)N[C@@H]1[C@@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-UOLFYFMNSA-N 0.000 description 9
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 9
- 108010077245 asparaginyl-proline Proteins 0.000 description 9
- 108010047857 aspartylglycine Proteins 0.000 description 9
- 108010025306 histidylleucine Proteins 0.000 description 9
- 108010015796 prolylisoleucine Proteins 0.000 description 9
- 108010008005 sugar-phosphatase Proteins 0.000 description 9
- 108010020532 tyrosyl-proline Proteins 0.000 description 9
- 108010003137 tyrosyltyrosine Proteins 0.000 description 9
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 8
- 108090000156 Fructokinases Proteins 0.000 description 8
- 102000003793 Fructokinases Human genes 0.000 description 8
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 8
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 8
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 8
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 8
- BRGMHAYQAZFZDJ-ZTVVOAFPSA-N N-acetyl-D-mannosamine 6-phosphate Chemical compound CC(=O)N[C@@H]1C(O)O[C@H](COP(O)(O)=O)[C@@H](O)[C@@H]1O BRGMHAYQAZFZDJ-ZTVVOAFPSA-N 0.000 description 8
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 8
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 8
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 8
- 108010044940 alanylglutamine Proteins 0.000 description 8
- XHMJOUIAFHJHBW-VFUOTHLCSA-N glucosamine 6-phosphate Chemical compound N[C@H]1[C@H](O)O[C@H](COP(O)(O)=O)[C@H](O)[C@@H]1O XHMJOUIAFHJHBW-VFUOTHLCSA-N 0.000 description 8
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 8
- 108010038320 lysylphenylalanine Proteins 0.000 description 8
- 230000004048 modification Effects 0.000 description 8
- 238000012986 modification Methods 0.000 description 8
- 239000013612 plasmid Substances 0.000 description 8
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 7
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 7
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 7
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 7
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 7
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 7
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 7
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 7
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 7
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 7
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 7
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 7
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 7
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 7
- 108700026244 Open Reading Frames Proteins 0.000 description 7
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 7
- 108010070944 alanylhistidine Proteins 0.000 description 7
- 108010087924 alanylproline Proteins 0.000 description 7
- 108010013835 arginine glutamate Proteins 0.000 description 7
- 108010008355 arginyl-glutamine Proteins 0.000 description 7
- 108010068380 arginylarginine Proteins 0.000 description 7
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- 108010037850 glycylvaline Proteins 0.000 description 7
- 230000001965 increasing effect Effects 0.000 description 7
- 108010060845 lactose permease Proteins 0.000 description 7
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 7
- 108010031719 prolyl-serine Proteins 0.000 description 7
- 235000000346 sugar Nutrition 0.000 description 7
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 6
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 6
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 6
- 238000004977 Hueckel calculation Methods 0.000 description 6
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 6
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 6
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 6
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 6
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 6
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 6
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 6
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 6
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 6
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 6
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 6
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 6
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 6
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 6
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 6
- LFTYTUAZOPRMMI-CFRASDGPSA-N UDP-N-acetyl-alpha-D-glucosamine Chemical compound O1[C@H](CO)[C@@H](O)[C@H](O)[C@@H](NC(=O)C)[C@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-CFRASDGPSA-N 0.000 description 6
- LFTYTUAZOPRMMI-UHFFFAOYSA-N UNPD164450 Natural products O1C(CO)C(O)C(O)C(NC(=O)C)C1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-UHFFFAOYSA-N 0.000 description 6
- 101150044535 agaA gene Proteins 0.000 description 6
- 108010064886 beta-D-galactoside alpha 2-6-sialyltransferase Proteins 0.000 description 6
- 235000013305 food Nutrition 0.000 description 6
- 238000012239 gene modification Methods 0.000 description 6
- 230000005017 genetic modification Effects 0.000 description 6
- 235000013617 genetically modified food Nutrition 0.000 description 6
- 230000010354 integration Effects 0.000 description 6
- 235000011073 invertase Nutrition 0.000 description 6
- 150000002500 ions Chemical class 0.000 description 6
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 6
- 108010012058 leucyltyrosine Proteins 0.000 description 6
- 108010084572 phenylalanyl-valine Proteins 0.000 description 6
- DTBNBXWJWCWCIK-UHFFFAOYSA-K phosphonatoenolpyruvate Chemical compound [O-]C(=O)C(=C)OP([O-])([O-])=O DTBNBXWJWCWCIK-UHFFFAOYSA-K 0.000 description 6
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 6
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 6
- OIZGSVFYNBZVIK-FHHHURIISA-N 3'-sialyllactose Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@@]1(C(O)=O)O[C@@H]1[C@@H](O)[C@H](O[C@H]([C@H](O)CO)[C@H](O)[C@@H](O)C=O)O[C@H](CO)[C@@H]1O OIZGSVFYNBZVIK-FHHHURIISA-N 0.000 description 5
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 5
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 5
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 5
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 5
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 5
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 5
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 description 5
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 5
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 5
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 5
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 5
- 102000004894 Glutamine-fructose-6-phosphate transaminase (isomerizing) Human genes 0.000 description 5
- 108090001031 Glutamine-fructose-6-phosphate transaminase (isomerizing) Proteins 0.000 description 5
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 5
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 5
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 5
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 5
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 5
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 5
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 5
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 5
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 5
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 5
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 5
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 5
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 5
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 5
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 5
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 5
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 5
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 5
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 5
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 5
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 5
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 5
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 5
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 5
- OVRNDRQMDRJTHS-ZTVVOAFPSA-N N-acetyl-D-mannosamine Chemical compound CC(=O)N[C@@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-ZTVVOAFPSA-N 0.000 description 5
- 108010066427 N-valyltryptophan Proteins 0.000 description 5
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 5
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 5
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 5
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 5
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 5
- 108091000080 Phosphotransferase Proteins 0.000 description 5
- 241000493790 Photobacterium leiognathi Species 0.000 description 5
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 5
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 5
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 5
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 5
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 5
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 5
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 5
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 5
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 5
- 230000001580 bacterial effect Effects 0.000 description 5
- 108010051210 beta-Fructofuranosidase Proteins 0.000 description 5
- 101150018392 cscA gene Proteins 0.000 description 5
- 101150091121 cscR gene Proteins 0.000 description 5
- 108010016616 cysteinylglycine Proteins 0.000 description 5
- 238000012217 deletion Methods 0.000 description 5
- 230000037430 deletion Effects 0.000 description 5
- 101150015731 fucI gene Proteins 0.000 description 5
- 101150025078 fucK gene Proteins 0.000 description 5
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 5
- 108010081551 glycylphenylalanine Proteins 0.000 description 5
- 101150100121 gna1 gene Proteins 0.000 description 5
- 108010018006 histidylserine Proteins 0.000 description 5
- 235000020256 human milk Nutrition 0.000 description 5
- 210000004251 human milk Anatomy 0.000 description 5
- 239000001573 invertase Substances 0.000 description 5
- GSXOAOHZAIYLCY-HSUXUTPPSA-N keto-D-fructose 6-phosphate Chemical compound OCC(=O)[C@@H](O)[C@H](O)[C@H](O)COP(O)(O)=O GSXOAOHZAIYLCY-HSUXUTPPSA-N 0.000 description 5
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 5
- 108010056582 methionylglutamic acid Proteins 0.000 description 5
- 230000002018 overexpression Effects 0.000 description 5
- 229930029653 phosphoenolpyruvate Natural products 0.000 description 5
- 102000020233 phosphotransferase Human genes 0.000 description 5
- 101150067185 ppsA gene Proteins 0.000 description 5
- 108010029020 prolylglycine Proteins 0.000 description 5
- 108010090894 prolylleucine Proteins 0.000 description 5
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 5
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 5
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 4
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 4
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 4
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 4
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 4
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 4
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 4
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 4
- 241000099223 Alistipes sp. Species 0.000 description 4
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 4
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 4
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 4
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 4
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 4
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 4
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 4
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 4
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 4
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 4
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 4
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 4
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 4
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 4
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 4
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 4
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 4
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 4
- TXCIAUNLDRJGJZ-UHFFFAOYSA-N CMP-N-acetyl neuraminic acid Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(N=C(N)C=C2)=O)O1 TXCIAUNLDRJGJZ-UHFFFAOYSA-N 0.000 description 4
- TXCIAUNLDRJGJZ-BILDWYJOSA-N CMP-N-acetyl-beta-neuraminic acid Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@]1(C(O)=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(N=C(N)C=C2)=O)O1 TXCIAUNLDRJGJZ-BILDWYJOSA-N 0.000 description 4
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 4
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 4
- 101100061504 Escherichia coli cscB gene Proteins 0.000 description 4
- 101100309698 Escherichia coli cscK gene Proteins 0.000 description 4
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 4
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 4
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 4
- CMBXOSFZCFGDLE-IHRRRGAJSA-N Gln-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O CMBXOSFZCFGDLE-IHRRRGAJSA-N 0.000 description 4
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 4
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 4
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 4
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 4
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 4
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 4
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 4
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 4
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 4
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 4
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 4
- 241000606831 Histophilus somni Species 0.000 description 4
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 4
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 4
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 4
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 4
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 4
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 4
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 4
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 4
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 4
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 4
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 4
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 4
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 4
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 4
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 4
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 4
- 102000004195 Isomerases Human genes 0.000 description 4
- 108090000769 Isomerases Proteins 0.000 description 4
- QZNPNKJXABGCRC-LFRDXLMFSA-N L-fuculose Chemical compound C[C@H](O)[C@@H](O)[C@@H](O)C(=O)CO QZNPNKJXABGCRC-LFRDXLMFSA-N 0.000 description 4
- 229930182816 L-glutamine Natural products 0.000 description 4
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 4
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 4
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 4
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 4
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 4
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 4
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 4
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 4
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 4
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 4
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 4
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 4
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 4
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 4
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 4
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 4
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 4
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 4
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 4
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 4
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 4
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 4
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 4
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 4
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 4
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 4
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 4
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 4
- ILKCLLLOGPDNIP-RCWTZXSCSA-N Met-Met-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ILKCLLLOGPDNIP-RCWTZXSCSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 241000588650 Neisseria meningitidis Species 0.000 description 4
- 241000606856 Pasteurella multocida Species 0.000 description 4
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 4
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 4
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 4
- 241001517016 Photobacterium damselae Species 0.000 description 4
- 241000607606 Photobacterium sp. Species 0.000 description 4
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 4
- 108010076504 Protein Sorting Signals Proteins 0.000 description 4
- 108010079005 RDV peptide Proteins 0.000 description 4
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 4
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 4
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 4
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 4
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 4
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 4
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 4
- 241000192581 Synechocystis sp. Species 0.000 description 4
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 4
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 4
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 4
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 4
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 4
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 4
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 4
- LRHBBGDMBLFYGL-FHWLQOOXSA-N Tyr-Phe-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LRHBBGDMBLFYGL-FHWLQOOXSA-N 0.000 description 4
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 4
- 101710196080 UDP-glucose:undecaprenyl-phosphate glucose-1-phosphate transferase Proteins 0.000 description 4
- 108010061048 UDPacetylglucosamine pyrophosphorylase Proteins 0.000 description 4
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 4
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 4
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 4
- 241000606834 [Haemophilus] ducreyi Species 0.000 description 4
- 229960000723 ampicillin Drugs 0.000 description 4
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 230000003115 biocidal effect Effects 0.000 description 4
- 230000015556 catabolic process Effects 0.000 description 4
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 4
- 101150117187 glmS gene Proteins 0.000 description 4
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 4
- 108010010147 glycylglutamine Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010036413 histidylglycine Proteins 0.000 description 4
- 108010000761 leucylarginine Proteins 0.000 description 4
- 108010091871 leucylmethionine Proteins 0.000 description 4
- 108010005942 methionylglycine Proteins 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 101150019075 neuA gene Proteins 0.000 description 4
- 229940051027 pasteurella multocida Drugs 0.000 description 4
- 230000037361 pathway Effects 0.000 description 4
- 108010032867 phosphoglucosamine mutase Proteins 0.000 description 4
- 108091000115 phosphomannomutase Proteins 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- 230000009450 sialylation Effects 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 210000000130 stem cell Anatomy 0.000 description 4
- 150000008163 sugars Chemical class 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- MGSRCZKZVOBKFT-UHFFFAOYSA-N thymol Chemical compound CC(C)C1=CC=C(C)C=C1O MGSRCZKZVOBKFT-UHFFFAOYSA-N 0.000 description 4
- 230000032258 transport Effects 0.000 description 4
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- 229920001817 Agar Polymers 0.000 description 3
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 3
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 3
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 3
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 3
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 3
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 3
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 3
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 3
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 3
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 3
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 3
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 3
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 3
- NAARDJBSSPUGCF-FXQIFTODSA-N Arg-Cys-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N NAARDJBSSPUGCF-FXQIFTODSA-N 0.000 description 3
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 3
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 3
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 3
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 3
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 3
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 3
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 3
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 3
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 3
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 3
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 3
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 3
- KMCRKVOLRCOMBG-DJFWLOJKSA-N Asn-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KMCRKVOLRCOMBG-DJFWLOJKSA-N 0.000 description 3
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 3
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 3
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 3
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 3
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 3
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 3
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 3
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 3
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 3
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 3
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 3
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 3
- VWWAFGHMPWBKEP-GMOBBJLQSA-N Asp-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)N VWWAFGHMPWBKEP-GMOBBJLQSA-N 0.000 description 3
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 3
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 3
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 3
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 3
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 3
- 108010037637 E coli beta-galactoside permease Proteins 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 229930182566 Gentamicin Natural products 0.000 description 3
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 3
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 3
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 3
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 3
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 3
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 3
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 3
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 3
- ZVQZXPADLZIQFF-FHWLQOOXSA-N Gln-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 ZVQZXPADLZIQFF-FHWLQOOXSA-N 0.000 description 3
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 3
- GJLXZITZLUUXMJ-NHCYSSNCSA-N Gln-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GJLXZITZLUUXMJ-NHCYSSNCSA-N 0.000 description 3
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 3
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 3
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 3
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 3
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 3
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 3
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 3
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 3
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 3
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 3
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 3
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 3
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 3
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 3
- ARIORLIIMJACKZ-KKUMJFAQSA-N Glu-Pro-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ARIORLIIMJACKZ-KKUMJFAQSA-N 0.000 description 3
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 3
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 3
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 3
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 3
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 3
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 3
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 3
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 3
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 3
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 3
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 3
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 3
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 3
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 3
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 3
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 3
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 3
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 3
- JSQIXEHORHLQEE-MEYUZBJRSA-N His-Phe-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JSQIXEHORHLQEE-MEYUZBJRSA-N 0.000 description 3
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 3
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 3
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 3
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 3
- HIJIJPFILYPTFR-ACRUOGEOSA-N His-Tyr-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HIJIJPFILYPTFR-ACRUOGEOSA-N 0.000 description 3
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 3
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 3
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 3
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 3
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 3
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 3
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 3
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 3
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 3
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 3
- IDMNOFVUXYYZPF-DKIMLUQUSA-N Ile-Lys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IDMNOFVUXYYZPF-DKIMLUQUSA-N 0.000 description 3
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 3
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 3
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 3
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 3
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 3
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 3
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 3
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 3
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 3
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 description 3
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 3
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 3
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 3
- SHZGCJCMOBCMKK-DHVFOXMCSA-N L-fucopyranose Chemical compound C[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O SHZGCJCMOBCMKK-DHVFOXMCSA-N 0.000 description 3
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 3
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 3
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 3
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 3
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 3
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 3
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 3
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 3
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 3
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 3
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 3
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 3
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 3
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 3
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 3
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 3
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 3
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 3
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 3
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 3
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 3
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 3
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 3
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 3
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 3
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 3
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 3
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 3
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 3
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 3
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 3
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 3
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 3
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 3
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 3
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 3
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 3
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 3
- CKSBRMUOQDNPKZ-SRVKXCTJSA-N Lys-Gln-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CKSBRMUOQDNPKZ-SRVKXCTJSA-N 0.000 description 3
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 3
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 3
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 3
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 3
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 3
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 3
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 3
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 3
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 3
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 3
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 3
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 3
- HONVOXINDBETTI-KKUMJFAQSA-N Lys-Tyr-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 HONVOXINDBETTI-KKUMJFAQSA-N 0.000 description 3
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 3
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 3
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 3
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 3
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 3
- 108060005182 N-acylglucosamine 2-epimerase Proteins 0.000 description 3
- 108010081778 N-acylneuraminate cytidylyltransferase Proteins 0.000 description 3
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 3
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 3
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 3
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 3
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 3
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 3
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 3
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 3
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 3
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 3
- FRMKIPSIZSFTTE-HJOGWXRNSA-N Phe-Tyr-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FRMKIPSIZSFTTE-HJOGWXRNSA-N 0.000 description 3
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 3
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 3
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 3
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 3
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 3
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 3
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 3
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 3
- 108010003201 RGH 0205 Proteins 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 3
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 3
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 3
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 3
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 3
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 3
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 3
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 3
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 3
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 3
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 3
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 3
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 3
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 3
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 3
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 3
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 3
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 3
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- 241000193985 Streptococcus agalactiae Species 0.000 description 3
- 101710180600 Sucrose operon repressor Proteins 0.000 description 3
- 101710117283 Sucrose permease Proteins 0.000 description 3
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 3
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 3
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 3
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 3
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 3
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 3
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 3
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 3
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 3
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 3
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 3
- ABCLYRRGTZNIFU-BWAGICSOSA-N Thr-Tyr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O ABCLYRRGTZNIFU-BWAGICSOSA-N 0.000 description 3
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 3
- OCCYDHCUKXRPSJ-SXNHZJKMSA-N Trp-Ile-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OCCYDHCUKXRPSJ-SXNHZJKMSA-N 0.000 description 3
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 3
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 3
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 3
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 3
- IUQDEKCCHWRHRW-IHPCNDPISA-N Tyr-Asn-Trp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IUQDEKCCHWRHRW-IHPCNDPISA-N 0.000 description 3
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 3
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 3
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 3
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 3
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 3
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 3
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 3
- 101710091363 UDP-N-acetylglucosamine 2-epimerase Proteins 0.000 description 3
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 3
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 3
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 3
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 3
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 3
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 3
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 3
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 3
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 3
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 3
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 3
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 3
- 241000607284 Vibrio sp. Species 0.000 description 3
- 230000009471 action Effects 0.000 description 3
- 230000002730 additional effect Effects 0.000 description 3
- 239000008272 agar Substances 0.000 description 3
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 3
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 239000012228 culture supernatant Substances 0.000 description 3
- IERHLVCPSMICTF-UHFFFAOYSA-N cytidine monophosphate Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(COP(O)(O)=O)O1 IERHLVCPSMICTF-UHFFFAOYSA-N 0.000 description 3
- IERHLVCPSMICTF-ZAKLUEHWSA-N cytidine-5'-monophosphate Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](COP(O)(O)=O)O1 IERHLVCPSMICTF-ZAKLUEHWSA-N 0.000 description 3
- 150000002016 disaccharides Chemical class 0.000 description 3
- 235000013350 formula milk Nutrition 0.000 description 3
- 101150045500 galK gene Proteins 0.000 description 3
- 229960002518 gentamicin Drugs 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 3
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 3
- 108010040030 histidinoalanine Proteins 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 3
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 238000004949 mass spectrometry Methods 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 230000037353 metabolic pathway Effects 0.000 description 3
- 230000004060 metabolic process Effects 0.000 description 3
- 108010085203 methionylmethionine Proteins 0.000 description 3
- 108010068488 methionylphenylalanine Proteins 0.000 description 3
- 238000002552 multiple reaction monitoring Methods 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 108010004914 prolylarginine Proteins 0.000 description 3
- 230000035945 sensitivity Effects 0.000 description 3
- 108010015840 seryl-prolyl-lysyl-lysine Proteins 0.000 description 3
- 239000011734 sodium Substances 0.000 description 3
- 150000004044 tetrasaccharides Chemical class 0.000 description 3
- 238000004809 thin layer chromatography Methods 0.000 description 3
- 150000004043 trisaccharides Chemical class 0.000 description 3
- 108010029384 tryptophyl-histidine Proteins 0.000 description 3
- 101150018163 wcaJ gene Proteins 0.000 description 3
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 2
- PAHHYDSPOXDASW-VGWMRTNUSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-1-[(2s)-2-amino-3-hydroxypropanoyl]pyrrolidine-2-carbonyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO PAHHYDSPOXDASW-VGWMRTNUSA-N 0.000 description 2
- GVJHHUAWPYXKBD-UHFFFAOYSA-N (±)-α-Tocopherol Chemical compound OC1=C(C)C(C)=C2OC(CCCC(C)CCCC(C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-UHFFFAOYSA-N 0.000 description 2
- SNFSYLYCDAVZGP-OLAZETNGSA-N 2'-fucosyllactose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@@H]2[C@H](OC(O)[C@H](O)[C@H]2O)CO)O[C@H](CO)[C@H](O)[C@@H]1O SNFSYLYCDAVZGP-OLAZETNGSA-N 0.000 description 2
- 241000606730 Actinobacillus capsulatus Species 0.000 description 2
- 241000606731 Actinobacillus suis Species 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 2
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 2
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 2
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 2
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 2
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 2
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 2
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 2
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- 241000030716 Alistipes shahii Species 0.000 description 2
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 2
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 2
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 2
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 2
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 2
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 2
- ITHMWNNUDPJJER-ULQDDVLXSA-N Arg-His-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ITHMWNNUDPJJER-ULQDDVLXSA-N 0.000 description 2
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 2
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 2
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 2
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 2
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 2
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 2
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 2
- DRDWXKWUSIKKOB-PJODQICGSA-N Arg-Trp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O DRDWXKWUSIKKOB-PJODQICGSA-N 0.000 description 2
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 2
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 2
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 2
- XKRFYHLGVUSROY-UHFFFAOYSA-N Argon Chemical compound [Ar] XKRFYHLGVUSROY-UHFFFAOYSA-N 0.000 description 2
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 2
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 2
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 2
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 2
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 2
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 2
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 2
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 2
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 2
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 2
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 2
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 2
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 2
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 2
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 2
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 2
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 2
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 2
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 2
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 2
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 2
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 2
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 2
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 2
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 2
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 2
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 2
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 2
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 2
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 2
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 2
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 2
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 2
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 2
- ATHZHGQSAIJHQU-XIRDDKMYSA-N Asn-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ATHZHGQSAIJHQU-XIRDDKMYSA-N 0.000 description 2
- KSZHWTRZPOTIGY-AVGNSLFASA-N Asn-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KSZHWTRZPOTIGY-AVGNSLFASA-N 0.000 description 2
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 2
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 2
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 2
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 2
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 2
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 2
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 2
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 2
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- RWHHSFSWKFBTCF-KKUMJFAQSA-N Asp-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N RWHHSFSWKFBTCF-KKUMJFAQSA-N 0.000 description 2
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 2
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 2
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 2
- QNIACYURSSCLRP-GUBZILKMSA-N Asp-Lys-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O QNIACYURSSCLRP-GUBZILKMSA-N 0.000 description 2
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 2
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 2
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 2
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 2
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 2
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 2
- CXEFNHOVIIDHFU-IHPCNDPISA-N Asp-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N CXEFNHOVIIDHFU-IHPCNDPISA-N 0.000 description 2
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 2
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 2
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 2
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 2
- 241000606767 Avibacterium paragallinarum Species 0.000 description 2
- 241001135228 Bacteroides ovatus Species 0.000 description 2
- 101710173142 Beta-fructofuranosidase, cell wall isozyme Proteins 0.000 description 2
- 241000218561 Bibersteinia trehalosi Species 0.000 description 2
- VTYYLEPIZMXCLO-UHFFFAOYSA-L Calcium carbonate Chemical compound [Ca+2].[O-]C([O-])=O VTYYLEPIZMXCLO-UHFFFAOYSA-L 0.000 description 2
- 241000589877 Campylobacter coli Species 0.000 description 2
- UISYPAHPLXGLNH-ACZMJKKPSA-N Cys-Asn-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UISYPAHPLXGLNH-ACZMJKKPSA-N 0.000 description 2
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 2
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 2
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 2
- 101710088194 Dehydrogenase Proteins 0.000 description 2
- 241000194033 Enterococcus Species 0.000 description 2
- QGWNDRXFNXRZMB-UUOKFMHZSA-N GDP Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O QGWNDRXFNXRZMB-UUOKFMHZSA-N 0.000 description 2
- 102100024515 GDP-L-fucose synthase Human genes 0.000 description 2
- 108030006298 GDP-L-fucose synthases Proteins 0.000 description 2
- 102000048120 Galactokinases Human genes 0.000 description 2
- 108700023157 Galactokinases Proteins 0.000 description 2
- 102100036291 Galactose-1-phosphate uridylyltransferase Human genes 0.000 description 2
- 101710090046 Galactose-1-phosphate uridylyltransferase Proteins 0.000 description 2
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 2
- YNNXQZDEOCYJJL-CIUDSAMLSA-N Gln-Arg-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N YNNXQZDEOCYJJL-CIUDSAMLSA-N 0.000 description 2
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 2
- TWHDOEYLXXQYOZ-FXQIFTODSA-N Gln-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N TWHDOEYLXXQYOZ-FXQIFTODSA-N 0.000 description 2
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 2
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 2
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 2
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 2
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 2
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 2
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 2
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 2
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 2
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 2
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 2
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 2
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 2
- WEAVZFWWIPIANL-SRVKXCTJSA-N Gln-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N WEAVZFWWIPIANL-SRVKXCTJSA-N 0.000 description 2
- ROHVCXBMIAAASL-HJGDQZAQSA-N Gln-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)N)N)O ROHVCXBMIAAASL-HJGDQZAQSA-N 0.000 description 2
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 2
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 2
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 2
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 2
- XIYWAJQIWLXXAF-XKBZYTNZSA-N Gln-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XIYWAJQIWLXXAF-XKBZYTNZSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 2
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 2
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 2
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 2
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 2
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 2
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 2
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 2
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 2
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 2
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 2
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 2
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 2
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 2
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 2
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 2
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 2
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 2
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 2
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 2
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 2
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 2
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 2
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 2
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 2
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 2
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 2
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 2
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 2
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 2
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 2
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 2
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 2
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 2
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 2
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 2
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 2
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 2
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 2
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 2
- KXRORHJIRAOQPG-SOUVJXGZSA-N Glu-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KXRORHJIRAOQPG-SOUVJXGZSA-N 0.000 description 2
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 2
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 2
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 2
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 2
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 2
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 2
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 2
- YZPVGIVFMZLQMM-YUMQZZPRSA-N Gly-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN YZPVGIVFMZLQMM-YUMQZZPRSA-N 0.000 description 2
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 2
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 2
- YNIMVVJTPWCUJH-KBPBESRZSA-N Gly-His-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YNIMVVJTPWCUJH-KBPBESRZSA-N 0.000 description 2
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 2
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 2
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 2
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 2
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 2
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 2
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 2
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 2
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 2
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- 241000606822 Haemophilus parahaemolyticus Species 0.000 description 2
- VIVSWEBJUHXCDS-DCAQKATOSA-N His-Asn-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O VIVSWEBJUHXCDS-DCAQKATOSA-N 0.000 description 2
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 2
- BDHUXUFYNUOUIT-SRVKXCTJSA-N His-Asp-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BDHUXUFYNUOUIT-SRVKXCTJSA-N 0.000 description 2
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 2
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 2
- WJGSTIMGSIWHJX-HVTMNAMFSA-N His-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WJGSTIMGSIWHJX-HVTMNAMFSA-N 0.000 description 2
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 2
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 2
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 2
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 2
- GUXQAPACZVVOKX-AVGNSLFASA-N His-Lys-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GUXQAPACZVVOKX-AVGNSLFASA-N 0.000 description 2
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 2
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 2
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 2
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 2
- XIGFLVCAVQQGNS-IHRRRGAJSA-N His-Pro-His Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 XIGFLVCAVQQGNS-IHRRRGAJSA-N 0.000 description 2
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 2
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 2
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 2
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 2
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 2
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 2
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 2
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 2
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 2
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 2
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 2
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 2
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 2
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 2
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 2
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 2
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 2
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 2
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 2
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 2
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 2
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 2
- VISRCHQHQCLODA-NAKRPEOUSA-N Ile-Pro-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N VISRCHQHQCLODA-NAKRPEOUSA-N 0.000 description 2
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 2
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 2
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 2
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 2
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 2
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 2
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 2
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 2
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 2
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 2
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 2
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 2
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 2
- SHZGCJCMOBCMKK-PQMKYFCFSA-N L-Fucose Natural products C[C@H]1O[C@H](O)[C@@H](O)[C@@H](O)[C@@H]1O SHZGCJCMOBCMKK-PQMKYFCFSA-N 0.000 description 2
- PNNNRSAQSRJVSB-UHFFFAOYSA-N L-rhamnose Natural products CC(O)C(O)C(O)C(O)C=O PNNNRSAQSRJVSB-UHFFFAOYSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- 241000194036 Lactococcus Species 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 2
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 2
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 2
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 2
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- PPQRKXHCLYCBSP-IHRRRGAJSA-N Leu-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N PPQRKXHCLYCBSP-IHRRRGAJSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 2
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 2
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 2
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 2
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 2
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 2
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 2
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 2
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 2
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 2
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 2
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 2
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 2
- 108010071324 Livagen Proteins 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 2
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 2
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 2
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 2
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 2
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 2
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 2
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 2
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 2
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 2
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 2
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 2
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 2
- CRNNMTHBMRFQNG-GUBZILKMSA-N Lys-Glu-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N CRNNMTHBMRFQNG-GUBZILKMSA-N 0.000 description 2
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 2
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 2
- PRCHKVGXZVTALR-KKUMJFAQSA-N Lys-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N PRCHKVGXZVTALR-KKUMJFAQSA-N 0.000 description 2
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 2
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 2
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 2
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 2
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 2
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 2
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 2
- 108010003266 Lys-Leu-Tyr-Asp Proteins 0.000 description 2
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 2
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 2
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 2
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 2
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 2
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 2
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 2
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 2
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 2
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- RPEPZINUYHUBKG-FXQIFTODSA-N Met-Cys-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O RPEPZINUYHUBKG-FXQIFTODSA-N 0.000 description 2
- AVTWKENDGGUWDC-BQBZGAKWSA-N Met-Cys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O AVTWKENDGGUWDC-BQBZGAKWSA-N 0.000 description 2
- NCVJJAJVWILAGI-SRVKXCTJSA-N Met-Gln-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NCVJJAJVWILAGI-SRVKXCTJSA-N 0.000 description 2
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 2
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 2
- STLBOMUOQNIALW-BQBZGAKWSA-N Met-Gly-Cys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O STLBOMUOQNIALW-BQBZGAKWSA-N 0.000 description 2
- XPCLRYNQMZOOFB-ULQDDVLXSA-N Met-His-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N XPCLRYNQMZOOFB-ULQDDVLXSA-N 0.000 description 2
- PZUUMQPMHBJJKE-AVGNSLFASA-N Met-Leu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N PZUUMQPMHBJJKE-AVGNSLFASA-N 0.000 description 2
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 2
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 2
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 2
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 2
- DJJBHQHOZLUBCN-WDSOQIARSA-N Met-Lys-Trp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DJJBHQHOZLUBCN-WDSOQIARSA-N 0.000 description 2
- KVNOBVKRBOYSIV-SZMVWBNQSA-N Met-Pro-Trp Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N KVNOBVKRBOYSIV-SZMVWBNQSA-N 0.000 description 2
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 2
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 2
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 2
- 102000002307 N-acylglucosamine 2-epimerase Human genes 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 241000588912 Pantoea agglomerans Species 0.000 description 2
- 241000606594 Pasteurella dagmatis Species 0.000 description 2
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 2
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 2
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 2
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 2
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 2
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 2
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 2
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 2
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 2
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 2
- XXAOSEUPEMQJOF-KKUMJFAQSA-N Phe-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XXAOSEUPEMQJOF-KKUMJFAQSA-N 0.000 description 2
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 2
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 2
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 2
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 2
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 2
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 2
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 2
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 2
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 2
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 2
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 2
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 2
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 2
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 2
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 2
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 2
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 2
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 2
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 2
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 2
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 2
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 2
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 2
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 2
- 102000009569 Phosphoglucomutase Human genes 0.000 description 2
- 102000030605 Phosphomannomutase Human genes 0.000 description 2
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 description 2
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 description 2
- 241000607565 Photobacterium phosphoreum Species 0.000 description 2
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M Potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 2
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 2
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 2
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 2
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 2
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 2
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 2
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 2
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 2
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 2
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 2
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 2
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 2
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 2
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 2
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 2
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 2
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 2
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 2
- 241000589516 Pseudomonas Species 0.000 description 2
- 108091006161 SLC17A5 Proteins 0.000 description 2
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 2
- WXWDPFVKQRVJBJ-CIUDSAMLSA-N Ser-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N WXWDPFVKQRVJBJ-CIUDSAMLSA-N 0.000 description 2
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 2
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 2
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 2
- WEQAYODCJHZSJZ-KKUMJFAQSA-N Ser-His-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 WEQAYODCJHZSJZ-KKUMJFAQSA-N 0.000 description 2
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 2
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 2
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- SRKMDKACHDVPMD-SRVKXCTJSA-N Ser-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N SRKMDKACHDVPMD-SRVKXCTJSA-N 0.000 description 2
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 2
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 2
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 2
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 2
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 2
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 2
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 2
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 241000009877 Streptococcus entericus Species 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- 101000693115 Sulfurisphaera tokodaii (strain DSM 16993 / JCM 10545 / NBRC 100140 / 7) Sugar-1-phosphate acetyltransferase Proteins 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 2
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 2
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 2
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 2
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 2
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 2
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 2
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 2
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 2
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 2
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 2
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 2
- JRAUIKJSEAKTGD-TUBUOCAGSA-N Thr-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N JRAUIKJSEAKTGD-TUBUOCAGSA-N 0.000 description 2
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 2
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 2
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 2
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 2
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 2
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 2
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 2
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 2
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 2
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 2
- 239000005844 Thymol Substances 0.000 description 2
- 108091023040 Transcription factor Proteins 0.000 description 2
- 102000040945 Transcription factor Human genes 0.000 description 2
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 2
- ZHZLQVLQBDBQCQ-WDSOQIARSA-N Trp-Lys-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ZHZLQVLQBDBQCQ-WDSOQIARSA-N 0.000 description 2
- YTVJTXJTNRWJCR-JBACZVJFSA-N Trp-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N YTVJTXJTNRWJCR-JBACZVJFSA-N 0.000 description 2
- LORJKYIPJIRIRT-BVSLBCMMSA-N Trp-Pro-Tyr Chemical compound C([C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 LORJKYIPJIRIRT-BVSLBCMMSA-N 0.000 description 2
- HIZDHWHVOLUGOX-BPUTZDHNSA-N Trp-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O HIZDHWHVOLUGOX-BPUTZDHNSA-N 0.000 description 2
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 2
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 2
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 2
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 2
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 2
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 2
- YLRLHDFMMWDYTK-KKUMJFAQSA-N Tyr-Cys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 YLRLHDFMMWDYTK-KKUMJFAQSA-N 0.000 description 2
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 2
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 2
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 2
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 2
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 2
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 2
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 2
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 2
- AVIQBBOOTZENLH-KKUMJFAQSA-N Tyr-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AVIQBBOOTZENLH-KKUMJFAQSA-N 0.000 description 2
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 2
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 2
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 2
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 2
- QMNWABHLJOHGDS-IHRRRGAJSA-N Tyr-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QMNWABHLJOHGDS-IHRRRGAJSA-N 0.000 description 2
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 2
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 2
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 2
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 2
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 2
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 2
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 2
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 2
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 2
- AKKYBQGHUAWPJR-MNSWYVGCSA-N Tyr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O AKKYBQGHUAWPJR-MNSWYVGCSA-N 0.000 description 2
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 2
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 2
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 2
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 2
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 2
- 108010075202 UDP-glucose 4-epimerase Proteins 0.000 description 2
- 102100021436 UDP-glucose 4-epimerase Human genes 0.000 description 2
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 2
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 2
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 2
- NMPXRFYMZDIBRF-ZOBUZTSGSA-N Val-Asn-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NMPXRFYMZDIBRF-ZOBUZTSGSA-N 0.000 description 2
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 2
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 2
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 2
- FBVUOEYVGNMRMD-NAKRPEOUSA-N Val-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N FBVUOEYVGNMRMD-NAKRPEOUSA-N 0.000 description 2
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 2
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 2
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 2
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 2
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 2
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 2
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 2
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 2
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 2
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 2
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 2
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 2
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 2
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 2
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 2
- 241000607618 Vibrio harveyi Species 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- HXXFSFRBOHSIMQ-VFUOTHLCSA-N alpha-D-glucose 1-phosphate Chemical compound OC[C@H]1O[C@H](OP(O)(O)=O)[C@H](O)[C@@H](O)[C@@H]1O HXXFSFRBOHSIMQ-VFUOTHLCSA-N 0.000 description 2
- PHTAQVMXYWFMHF-GJGMMKECSA-N alpha-L-Fucp-(1->2)-beta-D-Galp-(1->4)-D-GlcpNAc Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@@H]2[C@H](OC(O)[C@H](NC(C)=O)[C@H]2O)CO)O[C@H](CO)[C@H](O)[C@@H]1O PHTAQVMXYWFMHF-GJGMMKECSA-N 0.000 description 2
- 150000001408 amides Chemical class 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 2
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 2
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 2
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 230000002238 attenuated effect Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- RPKLZQLYODPWTM-KBMWBBLPSA-N cholanoic acid Chemical compound C1CC2CCCC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@@H](CCC(O)=O)C)[C@@]1(C)CC2 RPKLZQLYODPWTM-KBMWBBLPSA-N 0.000 description 2
- 101150075169 cscB gene Proteins 0.000 description 2
- 101150013880 cscK gene Proteins 0.000 description 2
- 239000012531 culture fluid Substances 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 238000000132 electrospray ionisation Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- OVBPIULPVIDEAO-LBPRGKRZSA-N folic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-LBPRGKRZSA-N 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 239000007789 gas Substances 0.000 description 2
- 108091008053 gene clusters Proteins 0.000 description 2
- 108010084034 glucosamine-1-phosphate acetyltransferase Proteins 0.000 description 2
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 2
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 2
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- QGWNDRXFNXRZMB-UHFFFAOYSA-N guanidine diphosphate Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(COP(O)(=O)OP(O)(O)=O)C(O)C1O QGWNDRXFNXRZMB-UHFFFAOYSA-N 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 230000008676 import Effects 0.000 description 2
- 230000002779 inactivation Effects 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- ZKLLSNQJRLJIGT-UYFOZJQFSA-N keto-D-fructose 1-phosphate Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C(=O)COP(O)(O)=O ZKLLSNQJRLJIGT-UYFOZJQFSA-N 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 101150063315 nanE gene Proteins 0.000 description 2
- 101150098382 neuB gene Proteins 0.000 description 2
- 235000016709 nutrition Nutrition 0.000 description 2
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 230000001681 protective effect Effects 0.000 description 2
- LXNHXLLTXMVWPM-UHFFFAOYSA-N pyridoxine Chemical compound CC1=NC=C(CO)C(CO)=C1O LXNHXLLTXMVWPM-UHFFFAOYSA-N 0.000 description 2
- 238000004445 quantitative analysis Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 235000020183 skimmed milk Nutrition 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 229960000790 thymol Drugs 0.000 description 2
- 230000017105 transposition Effects 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- 239000003643 water by type Substances 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- LJJBSCMJXPZTOP-VBBGBFMKSA-N (2s)-2,5-diamino-5-oxopentanoic acid;[(2r,3r,4s)-2,3,4,6-tetrahydroxy-5-oxohexyl] dihydrogen phosphate Chemical compound OC(=O)[C@@H](N)CCC(N)=O.OCC(=O)[C@@H](O)[C@H](O)[C@H](O)COP(O)(O)=O LJJBSCMJXPZTOP-VBBGBFMKSA-N 0.000 description 1
- IGXNPQWXIRIGBF-KEOOTSPTSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IGXNPQWXIRIGBF-KEOOTSPTSA-N 0.000 description 1
- BSABBBMNWQWLLU-VKHMYHEASA-N (S)-lactaldehyde Chemical compound C[C@H](O)C=O BSABBBMNWQWLLU-VKHMYHEASA-N 0.000 description 1
- FPIPGXGPPPQFEQ-UHFFFAOYSA-N 13-cis retinol Natural products OCC=C(C)C=CC=C(C)C=CC1=C(C)CCCC1(C)C FPIPGXGPPPQFEQ-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- 108010036211 5-HT-moduline Proteins 0.000 description 1
- 241000007909 Acaryochloris Species 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 1
- 241001468163 Acetobacterium woodii Species 0.000 description 1
- 102100033647 Activity-regulated cytoskeleton-associated protein Human genes 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- HGRBNYQIMKTUNT-XVYDVKMFSA-N Ala-Asn-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HGRBNYQIMKTUNT-XVYDVKMFSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 1
- IYCZBJXFSZSHPN-DLOVCJGASA-N Ala-Cys-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IYCZBJXFSZSHPN-DLOVCJGASA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- FDAZDMAFZYTHGS-XVYDVKMFSA-N Ala-His-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FDAZDMAFZYTHGS-XVYDVKMFSA-N 0.000 description 1
- GRPHQEMIFDPKOE-HGNGGELXSA-N Ala-His-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GRPHQEMIFDPKOE-HGNGGELXSA-N 0.000 description 1
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 1
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- VEAPAYQQLSEKEM-GUBZILKMSA-N Ala-Met-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VEAPAYQQLSEKEM-GUBZILKMSA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 1
- JNJHNBXBGNJESC-KKXDTOCCSA-N Ala-Tyr-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JNJHNBXBGNJESC-KKXDTOCCSA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- RRGPUNYIPJXJBU-GUBZILKMSA-N Arg-Asp-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O RRGPUNYIPJXJBU-GUBZILKMSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- CVKOQHYVDVYJSI-QTKMDUPCSA-N Arg-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N)O CVKOQHYVDVYJSI-QTKMDUPCSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- OKKMBOSPBDASEP-CYDGBPFRSA-N Arg-Ile-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O OKKMBOSPBDASEP-CYDGBPFRSA-N 0.000 description 1
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- ZRNWJUAQKFUUKV-SRVKXCTJSA-N Arg-Met-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O ZRNWJUAQKFUUKV-SRVKXCTJSA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 1
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 1
- BFDDUDQCPJWQRQ-IHRRRGAJSA-N Arg-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O BFDDUDQCPJWQRQ-IHRRRGAJSA-N 0.000 description 1
- NMTANZXPDAHUKU-ULQDDVLXSA-N Arg-Tyr-Lys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 NMTANZXPDAHUKU-ULQDDVLXSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- WTUZDHWWGUQEKN-SRVKXCTJSA-N Arg-Val-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O WTUZDHWWGUQEKN-SRVKXCTJSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 1
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 1
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 1
- GMCOADLDNLGOFE-ZLUOBGJFSA-N Asn-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N GMCOADLDNLGOFE-ZLUOBGJFSA-N 0.000 description 1
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 1
- FANGHKQYFPYDNB-UBHSHLNASA-N Asn-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N FANGHKQYFPYDNB-UBHSHLNASA-N 0.000 description 1
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 1
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- QEQVUHQQYDZUEN-GUBZILKMSA-N Asn-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N QEQVUHQQYDZUEN-GUBZILKMSA-N 0.000 description 1
- VXLBDJWTONZHJN-YUMQZZPRSA-N Asn-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N VXLBDJWTONZHJN-YUMQZZPRSA-N 0.000 description 1
- AITGTTNYKAWKDR-CIUDSAMLSA-N Asn-His-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O AITGTTNYKAWKDR-CIUDSAMLSA-N 0.000 description 1
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- VOGCFWDZYYTEOY-DCAQKATOSA-N Asn-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N VOGCFWDZYYTEOY-DCAQKATOSA-N 0.000 description 1
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 1
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- OSZBYGVKAFZWKC-FXQIFTODSA-N Asn-Pro-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O OSZBYGVKAFZWKC-FXQIFTODSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- JPSODRNUDXONAS-XIRDDKMYSA-N Asn-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CC(=O)N)N JPSODRNUDXONAS-XIRDDKMYSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- SKQTXVZTCGSRJS-SRVKXCTJSA-N Asn-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O SKQTXVZTCGSRJS-SRVKXCTJSA-N 0.000 description 1
- KTDWFWNZLLFEFU-KKUMJFAQSA-N Asn-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KTDWFWNZLLFEFU-KKUMJFAQSA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 1
- ICAYWNTWHRRAQP-FXQIFTODSA-N Asp-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N ICAYWNTWHRRAQP-FXQIFTODSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- MFMJRYHVLLEMQM-DCAQKATOSA-N Asp-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N MFMJRYHVLLEMQM-DCAQKATOSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 1
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 1
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 1
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- AKPLMZMNJGNUKT-ZLUOBGJFSA-N Asp-Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O AKPLMZMNJGNUKT-ZLUOBGJFSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- YBMUFUWSMIKJQA-GUBZILKMSA-N Asp-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N YBMUFUWSMIKJQA-GUBZILKMSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 1
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 1
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- SCQIQCWLOMOEFP-DCAQKATOSA-N Asp-Leu-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SCQIQCWLOMOEFP-DCAQKATOSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- NVXLFIPTHPKSKL-UBHSHLNASA-N Asp-Trp-Asn Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 NVXLFIPTHPKSKL-UBHSHLNASA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- BOXNGMVEVOGXOJ-UBHSHLNASA-N Asp-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N BOXNGMVEVOGXOJ-UBHSHLNASA-N 0.000 description 1
- LEYKQPDPZJIRTA-AQZXSJQPSA-N Asp-Trp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LEYKQPDPZJIRTA-AQZXSJQPSA-N 0.000 description 1
- HCOQNGIHSXICCB-IHRRRGAJSA-N Asp-Tyr-Arg Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O HCOQNGIHSXICCB-IHRRRGAJSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 1
- GZYDPEJSZYZWEF-MXAVVETBSA-N Asp-Val-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O GZYDPEJSZYZWEF-MXAVVETBSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 241000193752 Bacillus circulans Species 0.000 description 1
- 241000193749 Bacillus coagulans Species 0.000 description 1
- 241000193422 Bacillus lentus Species 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 241000194107 Bacillus megaterium Species 0.000 description 1
- 241000194106 Bacillus mycoides Species 0.000 description 1
- 241000194103 Bacillus pumilus Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 241000770536 Bacillus thermophilus Species 0.000 description 1
- 241000962950 Bacteroides ovatus ATCC 8483 Species 0.000 description 1
- 241000186000 Bifidobacterium Species 0.000 description 1
- 241000186016 Bifidobacterium bifidum Species 0.000 description 1
- 241001608472 Bifidobacterium longum Species 0.000 description 1
- 241000186015 Bifidobacterium longum subsp. infantis Species 0.000 description 1
- 241000193417 Brevibacillus laterosporus Species 0.000 description 1
- 241000605902 Butyrivibrio Species 0.000 description 1
- 241000589876 Campylobacter Species 0.000 description 1
- 101100245749 Campylobacter jejuni subsp. jejuni serotype O:23/36 (strain 81-176) pseF gene Proteins 0.000 description 1
- 241000661436 Candidatus Scalindua Species 0.000 description 1
- 241000588923 Citrobacter Species 0.000 description 1
- 241000193401 Clostridium acetobutylicum Species 0.000 description 1
- 241001656809 Clostridium autoethanogenum Species 0.000 description 1
- 241000186566 Clostridium ljungdahlii Species 0.000 description 1
- 229910021591 Copper(I) chloride Inorganic materials 0.000 description 1
- 241000186226 Corynebacterium glutamicum Species 0.000 description 1
- RRIJEABIXPKSGP-FXQIFTODSA-N Cys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CS RRIJEABIXPKSGP-FXQIFTODSA-N 0.000 description 1
- LHLSSZYQFUNWRZ-NAKRPEOUSA-N Cys-Arg-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LHLSSZYQFUNWRZ-NAKRPEOUSA-N 0.000 description 1
- VZKXOWRNJDEGLZ-WHFBIAKZSA-N Cys-Asp-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O VZKXOWRNJDEGLZ-WHFBIAKZSA-N 0.000 description 1
- YMBAVNPKBWHDAW-CIUDSAMLSA-N Cys-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N YMBAVNPKBWHDAW-CIUDSAMLSA-N 0.000 description 1
- ASHTVGGFIMESRD-LKXGYXEUSA-N Cys-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)O ASHTVGGFIMESRD-LKXGYXEUSA-N 0.000 description 1
- VKAWJBQTFCBHQY-GUBZILKMSA-N Cys-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N VKAWJBQTFCBHQY-GUBZILKMSA-N 0.000 description 1
- PFAQXUDMZVMADG-AVGNSLFASA-N Cys-Gln-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PFAQXUDMZVMADG-AVGNSLFASA-N 0.000 description 1
- AOZBJZBKFHOYHL-AVGNSLFASA-N Cys-Glu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O AOZBJZBKFHOYHL-AVGNSLFASA-N 0.000 description 1
- UPURLDIGQGTUPJ-ZKWXMUAHSA-N Cys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N UPURLDIGQGTUPJ-ZKWXMUAHSA-N 0.000 description 1
- VNXXMHTZQGGDSG-CIUDSAMLSA-N Cys-His-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O VNXXMHTZQGGDSG-CIUDSAMLSA-N 0.000 description 1
- XIZWKXATMJODQW-KKUMJFAQSA-N Cys-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N XIZWKXATMJODQW-KKUMJFAQSA-N 0.000 description 1
- UQHYQYXOLIYNSR-CUJWVEQBSA-N Cys-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N)O UQHYQYXOLIYNSR-CUJWVEQBSA-N 0.000 description 1
- LYSHSHHDBVKJRN-JBDRJPRFSA-N Cys-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N LYSHSHHDBVKJRN-JBDRJPRFSA-N 0.000 description 1
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 1
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 1
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 1
- LBSKYJOZIIOZIO-DCAQKATOSA-N Cys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N LBSKYJOZIIOZIO-DCAQKATOSA-N 0.000 description 1
- XMVZMBGFIOQONW-GARJFASQSA-N Cys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)C(=O)O XMVZMBGFIOQONW-GARJFASQSA-N 0.000 description 1
- OZSBRCONEMXYOJ-AVGNSLFASA-N Cys-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N OZSBRCONEMXYOJ-AVGNSLFASA-N 0.000 description 1
- HEPLXMBVMCXTBP-QWRGUYRKSA-N Cys-Phe-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O HEPLXMBVMCXTBP-QWRGUYRKSA-N 0.000 description 1
- MFMDKTLJCUBQIC-MXAVVETBSA-N Cys-Phe-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MFMDKTLJCUBQIC-MXAVVETBSA-N 0.000 description 1
- QQOWCDCBFFBRQH-IXOXFDKPSA-N Cys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N)O QQOWCDCBFFBRQH-IXOXFDKPSA-N 0.000 description 1
- JEKIARHEWURQRJ-BZSNNMDCSA-N Cys-Phe-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CS)N JEKIARHEWURQRJ-BZSNNMDCSA-N 0.000 description 1
- CAXGCBSRJLADPD-FXQIFTODSA-N Cys-Pro-Asn Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CAXGCBSRJLADPD-FXQIFTODSA-N 0.000 description 1
- CNAMJJOZGXPDHW-IHRRRGAJSA-N Cys-Pro-Phe Chemical compound N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O CNAMJJOZGXPDHW-IHRRRGAJSA-N 0.000 description 1
- IQXSTXKVEMRMMB-XAVMHZPKSA-N Cys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N)O IQXSTXKVEMRMMB-XAVMHZPKSA-N 0.000 description 1
- CLEFUAZULXANBU-MELADBBJSA-N Cys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N)C(=O)O CLEFUAZULXANBU-MELADBBJSA-N 0.000 description 1
- VXDXZGYXHIADHF-YJRXYDGGSA-N Cys-Tyr-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VXDXZGYXHIADHF-YJRXYDGGSA-N 0.000 description 1
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 1
- WVWRADGCZPIJJR-IHRRRGAJSA-N Cys-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N WVWRADGCZPIJJR-IHRRRGAJSA-N 0.000 description 1
- RFSUNEUAIZKAJO-VRPWFDPXSA-N D-Fructose Natural products OC[C@H]1OC(O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-VRPWFDPXSA-N 0.000 description 1
- ZZZCUOFIHGPKAK-UHFFFAOYSA-N D-erythro-ascorbic acid Natural products OCC1OC(=O)C(O)=C1O ZZZCUOFIHGPKAK-UHFFFAOYSA-N 0.000 description 1
- XHMJOUIAFHJHBW-GASJEMHNSA-N D-galactosamine 6-phosphate Chemical compound N[C@H]1C(O)O[C@H](COP(O)(O)=O)[C@H](O)[C@@H]1O XHMJOUIAFHJHBW-GASJEMHNSA-N 0.000 description 1
- ZAQJHHRNXZUBTE-NQXXGFSBSA-N D-ribulose Chemical compound OC[C@@H](O)[C@@H](O)C(=O)CO ZAQJHHRNXZUBTE-NQXXGFSBSA-N 0.000 description 1
- ZAQJHHRNXZUBTE-UHFFFAOYSA-N D-threo-2-Pentulose Natural products OCC(O)C(O)C(=O)CO ZAQJHHRNXZUBTE-UHFFFAOYSA-N 0.000 description 1
- ZAQJHHRNXZUBTE-WUJLRWPWSA-N D-xylulose Chemical compound OC[C@@H](O)[C@H](O)C(=O)CO ZAQJHHRNXZUBTE-WUJLRWPWSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 101150074155 DHFR gene Proteins 0.000 description 1
- 241001135747 Desulfobacula toluolica Species 0.000 description 1
- 241001407023 Desulfotignum phosphitoxidans Species 0.000 description 1
- 241000194031 Enterococcus faecium Species 0.000 description 1
- 241001646716 Escherichia coli K-12 Species 0.000 description 1
- 241000901842 Escherichia coli W Species 0.000 description 1
- 108010046276 FLP recombinase Proteins 0.000 description 1
- VTLYFUHAOXGGBS-UHFFFAOYSA-N Fe3+ Chemical compound [Fe+3] VTLYFUHAOXGGBS-UHFFFAOYSA-N 0.000 description 1
- 241000605986 Fusobacterium nucleatum Species 0.000 description 1
- LQEBEXMHBLQMDB-UHFFFAOYSA-N GDP-L-fucose Natural products OC1C(O)C(O)C(C)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C3=C(C(N=C(N)N3)=O)N=C2)O1 LQEBEXMHBLQMDB-UHFFFAOYSA-N 0.000 description 1
- LQEBEXMHBLQMDB-JGQUBWHWSA-N GDP-beta-L-fucose Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C3=C(C(NC(N)=N3)=O)N=C2)O1 LQEBEXMHBLQMDB-JGQUBWHWSA-N 0.000 description 1
- 108010062427 GDP-mannose 4,6-dehydratase Proteins 0.000 description 1
- 102000002312 GDPmannose 4,6-dehydratase Human genes 0.000 description 1
- 241000029369 Galerina nana Species 0.000 description 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 1
- KJRXLVZYJJLUCV-DCAQKATOSA-N Gln-Arg-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KJRXLVZYJJLUCV-DCAQKATOSA-N 0.000 description 1
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 1
- DXMPMSWUZVNBSG-QEJZJMRPSA-N Gln-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N DXMPMSWUZVNBSG-QEJZJMRPSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 1
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 1
- GHYJGDCPHMSFEJ-GUBZILKMSA-N Gln-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GHYJGDCPHMSFEJ-GUBZILKMSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- WVUZERSNWGUKJY-BPUTZDHNSA-N Gln-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N WVUZERSNWGUKJY-BPUTZDHNSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- DQPOBSRQNWOBNA-GUBZILKMSA-N Gln-His-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O DQPOBSRQNWOBNA-GUBZILKMSA-N 0.000 description 1
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 1
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 1
- NNXIQPMZGZUFJJ-AVGNSLFASA-N Gln-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N NNXIQPMZGZUFJJ-AVGNSLFASA-N 0.000 description 1
- LKVCNGLNTAPMSZ-JYJNAYRXSA-N Gln-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N LKVCNGLNTAPMSZ-JYJNAYRXSA-N 0.000 description 1
- OOLCSQQPSLIETN-JYJNAYRXSA-N Gln-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)O OOLCSQQPSLIETN-JYJNAYRXSA-N 0.000 description 1
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- KHNJVFYHIKLUPD-SRVKXCTJSA-N Gln-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHNJVFYHIKLUPD-SRVKXCTJSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- XBWGJWXGUNSZAT-CIUDSAMLSA-N Gln-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XBWGJWXGUNSZAT-CIUDSAMLSA-N 0.000 description 1
- NMYFPKCIGUJMIK-GUBZILKMSA-N Gln-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N NMYFPKCIGUJMIK-GUBZILKMSA-N 0.000 description 1
- XUZQMPGBGFQJMY-SRVKXCTJSA-N Gln-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XUZQMPGBGFQJMY-SRVKXCTJSA-N 0.000 description 1
- DOMHVQBSRJNNKD-ZPFDUUQYSA-N Gln-Met-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DOMHVQBSRJNNKD-ZPFDUUQYSA-N 0.000 description 1
- ZXGLLNZQSBLQLT-SRVKXCTJSA-N Gln-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZXGLLNZQSBLQLT-SRVKXCTJSA-N 0.000 description 1
- LVRKAFPPFJRIOF-GARJFASQSA-N Gln-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N LVRKAFPPFJRIOF-GARJFASQSA-N 0.000 description 1
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- WPJDPEOQUIXXOY-AVGNSLFASA-N Gln-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WPJDPEOQUIXXOY-AVGNSLFASA-N 0.000 description 1
- BJVBMSTUUWGZKX-JYJNAYRXSA-N Gln-Tyr-His Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BJVBMSTUUWGZKX-JYJNAYRXSA-N 0.000 description 1
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 1
- VCUNGPMMPNJSGS-JYJNAYRXSA-N Gln-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VCUNGPMMPNJSGS-JYJNAYRXSA-N 0.000 description 1
- HPBKQFJXDUVNQV-FHWLQOOXSA-N Gln-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O HPBKQFJXDUVNQV-FHWLQOOXSA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- KIMXNQXJJWWVIN-AVGNSLFASA-N Glu-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O KIMXNQXJJWWVIN-AVGNSLFASA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- ZJFNRQHUIHKZJF-GUBZILKMSA-N Glu-His-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ZJFNRQHUIHKZJF-GUBZILKMSA-N 0.000 description 1
- QLPYYTDOUQNJGQ-AVGNSLFASA-N Glu-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N QLPYYTDOUQNJGQ-AVGNSLFASA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 1
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- 108030000121 Glucosamine-6-phosphate deaminases Proteins 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- LPHQAFLNEHWKFF-QXEWZRGKSA-N Gly-Met-Ile Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LPHQAFLNEHWKFF-QXEWZRGKSA-N 0.000 description 1
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 1
- RUDRIZRGOLQSMX-IUCAKERBSA-N Gly-Met-Met Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O RUDRIZRGOLQSMX-IUCAKERBSA-N 0.000 description 1
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 241000006448 Halorhabdus tiamatea Species 0.000 description 1
- 241000590002 Helicobacter pylori Species 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 1
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 1
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 description 1
- VOKCBYNCZVSILJ-KKUMJFAQSA-N His-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O VOKCBYNCZVSILJ-KKUMJFAQSA-N 0.000 description 1
- MVADCDSCFTXCBT-CIUDSAMLSA-N His-Asp-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MVADCDSCFTXCBT-CIUDSAMLSA-N 0.000 description 1
- YJBMLTVVVRJNOK-SRVKXCTJSA-N His-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N YJBMLTVVVRJNOK-SRVKXCTJSA-N 0.000 description 1
- UOAVQQRILDGZEN-SRVKXCTJSA-N His-Asp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UOAVQQRILDGZEN-SRVKXCTJSA-N 0.000 description 1
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 1
- NWGXCPUKPVISSJ-AVGNSLFASA-N His-Gln-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NWGXCPUKPVISSJ-AVGNSLFASA-N 0.000 description 1
- IMCHNUANCIGUKS-SRVKXCTJSA-N His-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IMCHNUANCIGUKS-SRVKXCTJSA-N 0.000 description 1
- TXLQHACKRLWYCM-DCAQKATOSA-N His-Glu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O TXLQHACKRLWYCM-DCAQKATOSA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- WGHJXSONOOTTCZ-JYJNAYRXSA-N His-Glu-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WGHJXSONOOTTCZ-JYJNAYRXSA-N 0.000 description 1
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 1
- DYKZGTLPSNOFHU-DEQVHRJGSA-N His-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DYKZGTLPSNOFHU-DEQVHRJGSA-N 0.000 description 1
- WTJBVCUCLWFGAH-JUKXBJQTSA-N His-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WTJBVCUCLWFGAH-JUKXBJQTSA-N 0.000 description 1
- LJUIEESLIAZSFR-SRVKXCTJSA-N His-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LJUIEESLIAZSFR-SRVKXCTJSA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- VGYOLSOFODKLSP-IHPCNDPISA-N His-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CN=CN1 VGYOLSOFODKLSP-IHPCNDPISA-N 0.000 description 1
- RNAYRCNHRYEBTH-IHRRRGAJSA-N His-Met-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RNAYRCNHRYEBTH-IHRRRGAJSA-N 0.000 description 1
- KQJBFMJFUXAYPK-AVGNSLFASA-N His-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KQJBFMJFUXAYPK-AVGNSLFASA-N 0.000 description 1
- WYSJPCTWSBJFCO-AVGNSLFASA-N His-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N WYSJPCTWSBJFCO-AVGNSLFASA-N 0.000 description 1
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 1
- YAEKRYQASVCDLK-JYJNAYRXSA-N His-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N YAEKRYQASVCDLK-JYJNAYRXSA-N 0.000 description 1
- YXXKBPJEIYFGOD-MGHWNKPDSA-N His-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N YXXKBPJEIYFGOD-MGHWNKPDSA-N 0.000 description 1
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 1
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 1
- WHKLDLQHSYAVGU-ACRUOGEOSA-N His-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WHKLDLQHSYAVGU-ACRUOGEOSA-N 0.000 description 1
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 1
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 1
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 1
- RXKFKJVJVHLRIE-XIRDDKMYSA-N His-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CN=CN3)N RXKFKJVJVHLRIE-XIRDDKMYSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 1
- SWBUZLFWGJETAO-KKUMJFAQSA-N His-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O SWBUZLFWGJETAO-KKUMJFAQSA-N 0.000 description 1
- RNVUQLOKVIPNEM-BZSNNMDCSA-N His-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O RNVUQLOKVIPNEM-BZSNNMDCSA-N 0.000 description 1
- JATYGDHMDRAISQ-KKUMJFAQSA-N His-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O JATYGDHMDRAISQ-KKUMJFAQSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 1
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 1
- 101000588377 Homo sapiens N-acylneuraminate cytidylyltransferase Proteins 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- UNDGQKWQNSTPPW-CYDGBPFRSA-N Ile-Arg-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N UNDGQKWQNSTPPW-CYDGBPFRSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 description 1
- ZGGWRNBSBOHIGH-HVTMNAMFSA-N Ile-Gln-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZGGWRNBSBOHIGH-HVTMNAMFSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- ZXIGYKICRDFISM-DJFWLOJKSA-N Ile-His-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZXIGYKICRDFISM-DJFWLOJKSA-N 0.000 description 1
- KOPIAUWNLKKELG-SIGLWIIPSA-N Ile-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N KOPIAUWNLKKELG-SIGLWIIPSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- LNJLOZYNZFGJMM-DEQVHRJGSA-N Ile-His-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N LNJLOZYNZFGJMM-DEQVHRJGSA-N 0.000 description 1
- AFERFBZLVUFWRA-HTFCKZLJSA-N Ile-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)O)N AFERFBZLVUFWRA-HTFCKZLJSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 1
- VUPHVQCDULLACF-NAKRPEOUSA-N Ile-Met-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N VUPHVQCDULLACF-NAKRPEOUSA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- FHPZJWJWTWZKNA-LLLHUVSDSA-N Ile-Phe-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N FHPZJWJWTWZKNA-LLLHUVSDSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- JDCQDJVYUXNCGF-SPOWBLRKSA-N Ile-Ser-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JDCQDJVYUXNCGF-SPOWBLRKSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 241000170280 Kluyveromyces sp. Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- SRBFZHDQGSBBOR-HWQSCIPKSA-N L-arabinopyranose Chemical compound O[C@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-HWQSCIPKSA-N 0.000 description 1
- 101710186049 L-fuculokinase Proteins 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 240000001046 Lactobacillus acidophilus Species 0.000 description 1
- 235000013956 Lactobacillus acidophilus Nutrition 0.000 description 1
- 244000199885 Lactobacillus bulgaricus Species 0.000 description 1
- 235000013960 Lactobacillus bulgaricus Nutrition 0.000 description 1
- 244000199866 Lactobacillus casei Species 0.000 description 1
- 235000013958 Lactobacillus casei Nutrition 0.000 description 1
- 241000218492 Lactobacillus crispatus Species 0.000 description 1
- 241000186673 Lactobacillus delbrueckii Species 0.000 description 1
- 241000186606 Lactobacillus gasseri Species 0.000 description 1
- 241001561398 Lactobacillus jensenii Species 0.000 description 1
- 240000006024 Lactobacillus plantarum Species 0.000 description 1
- 235000013965 Lactobacillus plantarum Nutrition 0.000 description 1
- 241000186604 Lactobacillus reuteri Species 0.000 description 1
- 241000218588 Lactobacillus rhamnosus Species 0.000 description 1
- 241000186869 Lactobacillus salivarius Species 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 1
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 1
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 1
- FZMNAYBEFGZEIF-AVGNSLFASA-N Leu-Met-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N FZMNAYBEFGZEIF-AVGNSLFASA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- YWFZWQKWNDOWPA-XIRDDKMYSA-N Leu-Trp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O YWFZWQKWNDOWPA-XIRDDKMYSA-N 0.000 description 1
- SNOUHRPNNCAOPI-SZMVWBNQSA-N Leu-Trp-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SNOUHRPNNCAOPI-SZMVWBNQSA-N 0.000 description 1
- WBRJVRXEGQIDRK-XIRDDKMYSA-N Leu-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 WBRJVRXEGQIDRK-XIRDDKMYSA-N 0.000 description 1
- SXOFUVGLPHCPRQ-KKUMJFAQSA-N Leu-Tyr-Cys Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(O)=O SXOFUVGLPHCPRQ-KKUMJFAQSA-N 0.000 description 1
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- ZAENPHCEQXALHO-GUBZILKMSA-N Lys-Cys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZAENPHCEQXALHO-GUBZILKMSA-N 0.000 description 1
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 1
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- SQJSXOQXJYAVRV-SRVKXCTJSA-N Lys-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N SQJSXOQXJYAVRV-SRVKXCTJSA-N 0.000 description 1
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- CBNMHRCLYBJIIZ-XUXIUFHCSA-N Lys-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N CBNMHRCLYBJIIZ-XUXIUFHCSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- WKUXWMWQTOYTFI-SRVKXCTJSA-N Lys-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N WKUXWMWQTOYTFI-SRVKXCTJSA-N 0.000 description 1
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 1
- IPSDPDAOSAEWCN-RHYQMDGZSA-N Lys-Met-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IPSDPDAOSAEWCN-RHYQMDGZSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- NQSFIPWBPXNJII-PMVMPFDFSA-N Lys-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 NQSFIPWBPXNJII-PMVMPFDFSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- ZFNYWKHYUMEZDZ-WDSOQIARSA-N Lys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N ZFNYWKHYUMEZDZ-WDSOQIARSA-N 0.000 description 1
- ZVZRQKJOQQAFCF-ULQDDVLXSA-N Lys-Tyr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZVZRQKJOQQAFCF-ULQDDVLXSA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- VWJFOUBDZIUXGA-AVGNSLFASA-N Lys-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N VWJFOUBDZIUXGA-AVGNSLFASA-N 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- OLWAOWXIADGIJG-AVGNSLFASA-N Met-Arg-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(O)=O OLWAOWXIADGIJG-AVGNSLFASA-N 0.000 description 1
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 1
- FRWZTWWOORIIBA-FXQIFTODSA-N Met-Asn-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FRWZTWWOORIIBA-FXQIFTODSA-N 0.000 description 1
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 1
- TUSOIZOVPJCMFC-FXQIFTODSA-N Met-Asp-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O TUSOIZOVPJCMFC-FXQIFTODSA-N 0.000 description 1
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 1
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- SDTSLIMYROCDNS-FXQIFTODSA-N Met-Cys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O SDTSLIMYROCDNS-FXQIFTODSA-N 0.000 description 1
- RMHHNLKYPOOKQN-FXQIFTODSA-N Met-Cys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O RMHHNLKYPOOKQN-FXQIFTODSA-N 0.000 description 1
- CEGVMWAVGBRVFS-XGEHTFHBSA-N Met-Cys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CEGVMWAVGBRVFS-XGEHTFHBSA-N 0.000 description 1
- MYKLINMAGAIRPJ-CIUDSAMLSA-N Met-Gln-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MYKLINMAGAIRPJ-CIUDSAMLSA-N 0.000 description 1
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- RNAGAJXCSPDPRK-KKUMJFAQSA-N Met-Glu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 RNAGAJXCSPDPRK-KKUMJFAQSA-N 0.000 description 1
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- OBCRZLRPJFNLAN-DCAQKATOSA-N Met-His-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OBCRZLRPJFNLAN-DCAQKATOSA-N 0.000 description 1
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 1
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 1
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 1
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 1
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 1
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 1
- AOFZWWDTTJLHOU-ULQDDVLXSA-N Met-Lys-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AOFZWWDTTJLHOU-ULQDDVLXSA-N 0.000 description 1
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 1
- LNXGEYIEEUZGGH-JYJNAYRXSA-N Met-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=CC=C1 LNXGEYIEEUZGGH-JYJNAYRXSA-N 0.000 description 1
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 1
- WYDFQSJOARJAMM-GUBZILKMSA-N Met-Pro-Asp Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WYDFQSJOARJAMM-GUBZILKMSA-N 0.000 description 1
- LUYURUYVNYGKGM-RCWTZXSCSA-N Met-Pro-Thr Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUYURUYVNYGKGM-RCWTZXSCSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 1
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 1
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- WXJLBSXNUHIGSS-OSUNSFLBSA-N Met-Thr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WXJLBSXNUHIGSS-OSUNSFLBSA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 1
- HMEVNCOJHJTLNB-BVSLBCMMSA-N Met-Trp-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N HMEVNCOJHJTLNB-BVSLBCMMSA-N 0.000 description 1
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 1
- 241000202987 Methanobrevibacter Species 0.000 description 1
- 241000192041 Micrococcus Species 0.000 description 1
- 241000187708 Micromonospora Species 0.000 description 1
- 241000907999 Mortierella alpina Species 0.000 description 1
- 101100174763 Mus musculus Galk1 gene Proteins 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- OVBPIULPVIDEAO-UHFFFAOYSA-N N-Pteroyl-L-glutaminsaeure Natural products C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)NC(CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-UHFFFAOYSA-N 0.000 description 1
- 125000003047 N-acetyl group Chemical group 0.000 description 1
- BRGMHAYQAZFZDJ-KEWYIRBNSA-N N-acetyl-D-galactosamine 6-phosphate Chemical compound CC(=O)N[C@H]1C(O)O[C@H](COP(O)(O)=O)[C@H](O)[C@@H]1O BRGMHAYQAZFZDJ-KEWYIRBNSA-N 0.000 description 1
- 102100035286 N-acetyl-D-glucosamine kinase Human genes 0.000 description 1
- 108010032040 N-acetylglucosamine kinase Proteins 0.000 description 1
- 101710179749 N-acetylmannosamine kinase Proteins 0.000 description 1
- SQVRNKJHWKZAKO-LUWBGTNYSA-N N-acetylneuraminic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)CC(O)(C(O)=O)O[C@H]1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-LUWBGTNYSA-N 0.000 description 1
- 102100034977 N-acylglucosamine 2-epimerase Human genes 0.000 description 1
- 102100031349 N-acylneuraminate cytidylyltransferase Human genes 0.000 description 1
- 206010051606 Necrotising colitis Diseases 0.000 description 1
- PVNIIMVLHYAWGP-UHFFFAOYSA-N Niacin Chemical compound OC(=O)C1=CC=CN=C1 PVNIIMVLHYAWGP-UHFFFAOYSA-N 0.000 description 1
- 241000080590 Niso Species 0.000 description 1
- 241000424623 Nostoc punctiforme Species 0.000 description 1
- 241000192673 Nostoc sp. Species 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 235000019482 Palm oil Nutrition 0.000 description 1
- 241000588701 Pectobacterium carotovorum Species 0.000 description 1
- 241001442654 Percnon planissimum Species 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 1
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 1
- OXUMFAOVGFODPN-KKUMJFAQSA-N Phe-Asn-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OXUMFAOVGFODPN-KKUMJFAQSA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 1
- UEEVBGHEGJMDDV-AVGNSLFASA-N Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEEVBGHEGJMDDV-AVGNSLFASA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- CPTJPDZTFNKFOU-MXAVVETBSA-N Phe-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N CPTJPDZTFNKFOU-MXAVVETBSA-N 0.000 description 1
- HPECNYCQLSVCHH-BZSNNMDCSA-N Phe-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N HPECNYCQLSVCHH-BZSNNMDCSA-N 0.000 description 1
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 1
- IILUKIJNFMUBNF-IHRRRGAJSA-N Phe-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O IILUKIJNFMUBNF-IHRRRGAJSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 1
- UEADQPLTYBWWTG-AVGNSLFASA-N Phe-Glu-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEADQPLTYBWWTG-AVGNSLFASA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- ISYSEOWLRQKQEQ-JYJNAYRXSA-N Phe-His-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISYSEOWLRQKQEQ-JYJNAYRXSA-N 0.000 description 1
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 1
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- BVHFFNYBKRTSIU-MEYUZBJRSA-N Phe-His-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BVHFFNYBKRTSIU-MEYUZBJRSA-N 0.000 description 1
- DZVXMMSUWWUIQE-ACRUOGEOSA-N Phe-His-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N DZVXMMSUWWUIQE-ACRUOGEOSA-N 0.000 description 1
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- WURZLPSMYZLEGH-UNQGMJICSA-N Phe-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N)O WURZLPSMYZLEGH-UNQGMJICSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 1
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- JLDZQPPLTJTJLE-IHPCNDPISA-N Phe-Trp-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JLDZQPPLTJTJLE-IHPCNDPISA-N 0.000 description 1
- ZVJGAXNBBKPYOE-HKUYNNGSSA-N Phe-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 ZVJGAXNBBKPYOE-HKUYNNGSSA-N 0.000 description 1
- WDOCBGZHAQQIBL-IHPCNDPISA-N Phe-Trp-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 WDOCBGZHAQQIBL-IHPCNDPISA-N 0.000 description 1
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- QUUCAHIYARMNBL-FHWLQOOXSA-N Phe-Tyr-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N QUUCAHIYARMNBL-FHWLQOOXSA-N 0.000 description 1
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 1
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 1
- 101710188351 Phosphoenolpyruvate-dependent phosphotransferase system Proteins 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 241000235061 Pichia sp. Species 0.000 description 1
- 241000605861 Prevotella Species 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 1
- WECYCNFPGZLOOU-FXQIFTODSA-N Pro-Asn-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O WECYCNFPGZLOOU-FXQIFTODSA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- NGNNPLJHUFCOMZ-FXQIFTODSA-N Pro-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 NGNNPLJHUFCOMZ-FXQIFTODSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- XQHGISDMVBTGAL-ULQDDVLXSA-N Pro-His-Phe Chemical compound C([C@@H](C(=O)[O-])NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1[NH2+]CCC1)C1=CC=CC=C1 XQHGISDMVBTGAL-ULQDDVLXSA-N 0.000 description 1
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 1
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 1
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 1
- XFFIGWGYMUFCCQ-ULQDDVLXSA-N Pro-His-Tyr Chemical compound C1=CC(O)=CC=C1C[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)[C@H]1[NH2+]CCC1)CC1=CN=CN1 XFFIGWGYMUFCCQ-ULQDDVLXSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 1
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- 241000169446 Promethis Species 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 1
- 241000231663 Puffinus auricularis Species 0.000 description 1
- 108010054530 RGDN peptide Proteins 0.000 description 1
- 108010025216 RVF peptide Proteins 0.000 description 1
- MUPFEKGTMRGPLJ-RMMQSMQOSA-N Raffinose Natural products O(C[C@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O[C@@]2(CO)[C@H](O)[C@@H](O)[C@@H](CO)O2)O1)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 MUPFEKGTMRGPLJ-RMMQSMQOSA-N 0.000 description 1
- 235000019484 Rapeseed oil Nutrition 0.000 description 1
- 241000316848 Rhodococcus <scale insect> Species 0.000 description 1
- 241001030146 Rhodotorula sp. Species 0.000 description 1
- 241000235088 Saccharomyces sp. Species 0.000 description 1
- 241001360381 Saccharomycopsis sp. Species 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 241000720795 Schizosaccharomyces sp. Species 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 1
- YMAWDPHQVABADW-CIUDSAMLSA-N Ser-Gln-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YMAWDPHQVABADW-CIUDSAMLSA-N 0.000 description 1
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 1
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 1
- XPVIVVLLLOFBRH-XIRDDKMYSA-N Ser-Trp-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)CO)C(O)=O XPVIVVLLLOFBRH-XIRDDKMYSA-N 0.000 description 1
- HXPNJVLVHKABMJ-KKUMJFAQSA-N Ser-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N)O HXPNJVLVHKABMJ-KKUMJFAQSA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- 101001010097 Shigella phage SfV Bactoprenol-linked glucose translocase Proteins 0.000 description 1
- 101710161071 Sialic acid transporter NanT Proteins 0.000 description 1
- 241000204117 Sporolactobacillus Species 0.000 description 1
- 208000007107 Stomach Ulcer Diseases 0.000 description 1
- 241000194022 Streptococcus sp. Species 0.000 description 1
- 235000019486 Sunflower oil Nutrition 0.000 description 1
- 241000520244 Tatumella citrea Species 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- XVNZSJIKGJLQLH-RCWTZXSCSA-N Thr-Arg-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N)O XVNZSJIKGJLQLH-RCWTZXSCSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 1
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 1
- RJBFAHKSFNNHAI-XKBZYTNZSA-N Thr-Gln-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O RJBFAHKSFNNHAI-XKBZYTNZSA-N 0.000 description 1
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- RCEHMXVEMNXRIW-IRIUXVKKSA-N Thr-Gln-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O RCEHMXVEMNXRIW-IRIUXVKKSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- UDNVOQMPQBEITB-MEYUZBJRSA-N Thr-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UDNVOQMPQBEITB-MEYUZBJRSA-N 0.000 description 1
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 1
- YDWLCDQXLCILCZ-BWAGICSOSA-N Thr-His-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YDWLCDQXLCILCZ-BWAGICSOSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 1
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 1
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- FBQHKSPOIAFUEI-OWLDWWDNSA-N Thr-Trp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O FBQHKSPOIAFUEI-OWLDWWDNSA-N 0.000 description 1
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 1
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 1
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 1
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 1
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 1
- DXDMNBJJEXYMLA-UBHSHLNASA-N Trp-Asn-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 DXDMNBJJEXYMLA-UBHSHLNASA-N 0.000 description 1
- LTLBNCDNXQCOLB-UBHSHLNASA-N Trp-Asp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 LTLBNCDNXQCOLB-UBHSHLNASA-N 0.000 description 1
- NKUIXQOJUAEIET-AQZXSJQPSA-N Trp-Asp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 NKUIXQOJUAEIET-AQZXSJQPSA-N 0.000 description 1
- OFCKFBGRYHOKFP-IHPCNDPISA-N Trp-Asp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N OFCKFBGRYHOKFP-IHPCNDPISA-N 0.000 description 1
- SMDQRGAERNMJJF-JQWIXIFHSA-N Trp-Cys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 SMDQRGAERNMJJF-JQWIXIFHSA-N 0.000 description 1
- OFSLQLHHDQOWDB-QEJZJMRPSA-N Trp-Cys-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 OFSLQLHHDQOWDB-QEJZJMRPSA-N 0.000 description 1
- PTAWAMWPRFTACW-SZMVWBNQSA-N Trp-Gln-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PTAWAMWPRFTACW-SZMVWBNQSA-N 0.000 description 1
- OENGVSDBQHHGBU-QEJZJMRPSA-N Trp-Glu-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OENGVSDBQHHGBU-QEJZJMRPSA-N 0.000 description 1
- HQJOVVWAPQPYDS-ZFWWWQNUSA-N Trp-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQJOVVWAPQPYDS-ZFWWWQNUSA-N 0.000 description 1
- OZUJUVFWMHTWCZ-HOCLYGCPSA-N Trp-Gly-His Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OZUJUVFWMHTWCZ-HOCLYGCPSA-N 0.000 description 1
- PVRRBEROBJQPJX-SZMVWBNQSA-N Trp-His-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PVRRBEROBJQPJX-SZMVWBNQSA-N 0.000 description 1
- ILDJYIDXESUBOE-HSCHXYMDSA-N Trp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ILDJYIDXESUBOE-HSCHXYMDSA-N 0.000 description 1
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 1
- KRCPXGSWDOGHAM-XIRDDKMYSA-N Trp-Lys-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O KRCPXGSWDOGHAM-XIRDDKMYSA-N 0.000 description 1
- SLOYNOMYOAOUCX-BVSLBCMMSA-N Trp-Phe-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SLOYNOMYOAOUCX-BVSLBCMMSA-N 0.000 description 1
- LFMLXCJYCFZBKE-IHPCNDPISA-N Trp-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N LFMLXCJYCFZBKE-IHPCNDPISA-N 0.000 description 1
- UQHPXCFAHVTWFU-BVSLBCMMSA-N Trp-Phe-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UQHPXCFAHVTWFU-BVSLBCMMSA-N 0.000 description 1
- ADMHZNPMMVKGJW-BPUTZDHNSA-N Trp-Ser-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ADMHZNPMMVKGJW-BPUTZDHNSA-N 0.000 description 1
- UMIACFRBELJMGT-GQGQLFGLSA-N Trp-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UMIACFRBELJMGT-GQGQLFGLSA-N 0.000 description 1
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 1
- XGEUYEOEZYFHRL-KKXDTOCCSA-N Tyr-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XGEUYEOEZYFHRL-KKXDTOCCSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 1
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 1
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- QNJYPWZACBACER-KKUMJFAQSA-N Tyr-Asp-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O QNJYPWZACBACER-KKUMJFAQSA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- BVOCLAPFOBSJHR-KKUMJFAQSA-N Tyr-Cys-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BVOCLAPFOBSJHR-KKUMJFAQSA-N 0.000 description 1
- BVDHHLMIZFCAAU-BZSNNMDCSA-N Tyr-Cys-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BVDHHLMIZFCAAU-BZSNNMDCSA-N 0.000 description 1
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 1
- RIJPHPUJRLEOAK-JYJNAYRXSA-N Tyr-Gln-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O RIJPHPUJRLEOAK-JYJNAYRXSA-N 0.000 description 1
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 1
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 1
- PDKILSUYSUGCAO-JBACZVJFSA-N Tyr-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PDKILSUYSUGCAO-JBACZVJFSA-N 0.000 description 1
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 1
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- ARSHSYUZHSIYKR-ACRUOGEOSA-N Tyr-His-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ARSHSYUZHSIYKR-ACRUOGEOSA-N 0.000 description 1
- CVXURBLRELTJKO-BWAGICSOSA-N Tyr-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O CVXURBLRELTJKO-BWAGICSOSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 1
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 1
- DZKFGCNKEVMXFA-JUKXBJQTSA-N Tyr-Ile-His Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O DZKFGCNKEVMXFA-JUKXBJQTSA-N 0.000 description 1
- HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 1
- DAOREBHZAKCOEN-ULQDDVLXSA-N Tyr-Leu-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O DAOREBHZAKCOEN-ULQDDVLXSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 1
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- XDGPTBVOSHKDFT-KKUMJFAQSA-N Tyr-Met-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XDGPTBVOSHKDFT-KKUMJFAQSA-N 0.000 description 1
- GYBVHTWOQJMYAM-HRCADAONSA-N Tyr-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N GYBVHTWOQJMYAM-HRCADAONSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 1
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 1
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 1
- PLXQRTXVLZUNMU-RNXOBYDBSA-N Tyr-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC4=CC=C(C=C4)O)N PLXQRTXVLZUNMU-RNXOBYDBSA-N 0.000 description 1
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 1
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 1
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 1
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- PLVVHGFEMSDRET-IHPCNDPISA-N Tyr-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PLVVHGFEMSDRET-IHPCNDPISA-N 0.000 description 1
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- NAHUCETZGZZSEX-IHPCNDPISA-N Tyr-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NAHUCETZGZZSEX-IHPCNDPISA-N 0.000 description 1
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 1
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 1
- TYGHOWWWMTWVKM-HJOGWXRNSA-N Tyr-Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 TYGHOWWWMTWVKM-HJOGWXRNSA-N 0.000 description 1
- KSGKJSFPWSMJHK-JNPHEJMOSA-N Tyr-Tyr-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSGKJSFPWSMJHK-JNPHEJMOSA-N 0.000 description 1
- BQASAMYRHNCKQE-IHRRRGAJSA-N Tyr-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BQASAMYRHNCKQE-IHRRRGAJSA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- MUPFEKGTMRGPLJ-UHFFFAOYSA-N UNPD196149 Natural products OC1C(O)C(CO)OC1(CO)OC1C(O)C(O)C(O)C(COC2C(C(O)C(O)C(CO)O2)O)O1 MUPFEKGTMRGPLJ-UHFFFAOYSA-N 0.000 description 1
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- RKIGNDAHUOOIMJ-BQFCYCMXSA-N Val-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 RKIGNDAHUOOIMJ-BQFCYCMXSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- IEBGHUMBJXIXHM-AVGNSLFASA-N Val-Lys-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N IEBGHUMBJXIXHM-AVGNSLFASA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 1
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- NGXQOQNXSGOYOI-BQFCYCMXSA-N Val-Trp-Gln Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 NGXQOQNXSGOYOI-BQFCYCMXSA-N 0.000 description 1
- SVLAAUGFIHSJPK-JYJNAYRXSA-N Val-Trp-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SVLAAUGFIHSJPK-JYJNAYRXSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 1
- PDASTHRLDFOZMG-JYJNAYRXSA-N Val-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 PDASTHRLDFOZMG-JYJNAYRXSA-N 0.000 description 1
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- 241000607626 Vibrio cholerae Species 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- FPIPGXGPPPQFEQ-BOOMUCAASA-N Vitamin A Natural products OC/C=C(/C)\C=C\C=C(\C)/C=C/C1=C(C)CCCC1(C)C FPIPGXGPPPQFEQ-BOOMUCAASA-N 0.000 description 1
- 229930003451 Vitamin B1 Natural products 0.000 description 1
- 229930003268 Vitamin C Natural products 0.000 description 1
- 229930003316 Vitamin D Natural products 0.000 description 1
- QYSXJUFSXHHAJI-XFEUOLMDSA-N Vitamin D3 Natural products C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@H](C)CCCC(C)C)=C/C=C1\C[C@@H](O)CCC1=C QYSXJUFSXHHAJI-XFEUOLMDSA-N 0.000 description 1
- 229930003427 Vitamin E Natural products 0.000 description 1
- 229930003448 Vitamin K Natural products 0.000 description 1
- 241000589636 Xanthomonas campestris Species 0.000 description 1
- 241000490645 Yarrowia sp. Species 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 241000193453 [Clostridium] cellulolyticum Species 0.000 description 1
- USAZACJQJDHAJH-KDEXOMDGSA-N [[(2r,3s,4r,5s)-5-(2,4-dioxo-1h-pyrimidin-6-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2r,3r,4s,5r,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl] hydrogen phosphate Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](C=2NC(=O)NC(=O)C=2)O1 USAZACJQJDHAJH-KDEXOMDGSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 108010066829 alanyl-glutamyl-aspartylprolyine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- FPIPGXGPPPQFEQ-OVSJKPMPSA-N all-trans-retinol Chemical compound OC\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C FPIPGXGPPPQFEQ-OVSJKPMPSA-N 0.000 description 1
- 108010015684 alpha-N-Acetylgalactosaminidase Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- AWUCVROLDVIAJX-UHFFFAOYSA-N alpha-glycerophosphate Natural products OCC(O)COP(O)(O)=O AWUCVROLDVIAJX-UHFFFAOYSA-N 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 239000012491 analyte Substances 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 239000002518 antifoaming agent Substances 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 229910052786 argon Inorganic materials 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 229940054340 bacillus coagulans Drugs 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- DLRVVLDZNNYCBX-ZZFZYMBESA-N beta-melibiose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1OC[C@@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@H](O)O1 DLRVVLDZNNYCBX-ZZFZYMBESA-N 0.000 description 1
- 229940002008 bifidobacterium bifidum Drugs 0.000 description 1
- 229940004120 bifidobacterium infantis Drugs 0.000 description 1
- 229940009291 bifidobacterium longum Drugs 0.000 description 1
- KGBXLFKZBHKPEV-UHFFFAOYSA-N boric acid Chemical compound OB(O)O KGBXLFKZBHKPEV-UHFFFAOYSA-N 0.000 description 1
- 239000004327 boric acid Substances 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- FAPWYRCQGJNNSJ-UBKPKTQASA-L calcium D-pantothenic acid Chemical compound [Ca+2].OCC(C)(C)[C@@H](O)C(=O)NCCC([O-])=O.OCC(C)(C)[C@@H](O)C(=O)NCCC([O-])=O FAPWYRCQGJNNSJ-UBKPKTQASA-L 0.000 description 1
- 229910000019 calcium carbonate Inorganic materials 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 238000005277 cation exchange chromatography Methods 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 230000030570 cellular localization Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000013375 chromatographic separation Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 230000003930 cognitive ability Effects 0.000 description 1
- 238000001360 collision-induced dissociation Methods 0.000 description 1
- 238000006482 condensation reaction Methods 0.000 description 1
- 235000009508 confectionery Nutrition 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 229910000365 copper sulfate Inorganic materials 0.000 description 1
- OXBLHERUFWYNTN-UHFFFAOYSA-M copper(I) chloride Chemical compound [Cu]Cl OXBLHERUFWYNTN-UHFFFAOYSA-M 0.000 description 1
- ARUVKPQLZAKDPS-UHFFFAOYSA-L copper(II) sulfate Chemical compound [Cu+2].[O-][S+2]([O-])([O-])[O-] ARUVKPQLZAKDPS-UHFFFAOYSA-L 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011026 diafiltration Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 235000015872 dietary supplement Nutrition 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- IJKVHSBPTUYDLN-UHFFFAOYSA-N dihydroxy(oxo)silane Chemical compound O[Si](O)=O IJKVHSBPTUYDLN-UHFFFAOYSA-N 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- BVTBRVFYZUCAKH-UHFFFAOYSA-L disodium selenite Chemical compound [Na+].[Na+].[O-][Se]([O-])=O BVTBRVFYZUCAKH-UHFFFAOYSA-L 0.000 description 1
- 208000000718 duodenal ulcer Diseases 0.000 description 1
- 238000000909 electrodialysis Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000000369 enteropathogenic effect Effects 0.000 description 1
- 239000000147 enterotoxin Substances 0.000 description 1
- 231100000655 enterotoxin Toxicity 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 229960000304 folic acid Drugs 0.000 description 1
- 235000019152 folic acid Nutrition 0.000 description 1
- 239000011724 folic acid Substances 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- WIGCFUFOHFEKBI-UHFFFAOYSA-N gamma-tocopherol Natural products CC(C)CCCC(C)CCCC(C)CCCC1CCC2C(C)C(O)C(C)C(C)C2O1 WIGCFUFOHFEKBI-UHFFFAOYSA-N 0.000 description 1
- 230000002496 gastric effect Effects 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 150000002337 glycosamines Chemical class 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 150000004820 halides Chemical class 0.000 description 1
- 229940037467 helicobacter pylori Drugs 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000000099 in vitro assay Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 230000010189 intracellular transport Effects 0.000 description 1
- PVFSDGKDKFSOTB-UHFFFAOYSA-K iron(3+);triacetate Chemical compound [Fe+3].CC([O-])=O.CC([O-])=O.CC([O-])=O PVFSDGKDKFSOTB-UHFFFAOYSA-K 0.000 description 1
- JEIPFZHSYJVQDO-UHFFFAOYSA-N iron(III) oxide Inorganic materials O=[Fe]O[Fe]=O JEIPFZHSYJVQDO-UHFFFAOYSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 101150001899 lacY gene Proteins 0.000 description 1
- RJTOFDPWCJDYFZ-UHFFFAOYSA-N lacto-N-triose Natural products CC(=O)NC1C(O)C(O)C(CO)OC1OC1C(O)C(OC(C(O)CO)C(O)C(O)C=O)OC(CO)C1O RJTOFDPWCJDYFZ-UHFFFAOYSA-N 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 229940039695 lactobacillus acidophilus Drugs 0.000 description 1
- 229940004208 lactobacillus bulgaricus Drugs 0.000 description 1
- 229940017800 lactobacillus casei Drugs 0.000 description 1
- 229940072205 lactobacillus plantarum Drugs 0.000 description 1
- 229940001882 lactobacillus reuteri Drugs 0.000 description 1
- 229960001375 lactose Drugs 0.000 description 1
- 108010044538 lactostatin Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 1
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010010679 lysyl-valyl-leucyl-aspartic acid Proteins 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 101150036529 manZ gene Proteins 0.000 description 1
- 102000016470 mariner transposase Human genes 0.000 description 1
- 108060004631 mariner transposase Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000012269 metabolic engineering Methods 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 238000001471 micro-filtration Methods 0.000 description 1
- 230000002906 microbiologic effect Effects 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 101150070589 nagB gene Proteins 0.000 description 1
- 101150027065 nagE gene Proteins 0.000 description 1
- 101150043097 nagK gene Proteins 0.000 description 1
- 101150076570 nanK gene Proteins 0.000 description 1
- 208000004995 necrotizing enterocolitis Diseases 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229960003512 nicotinic acid Drugs 0.000 description 1
- 235000001968 nicotinic acid Nutrition 0.000 description 1
- 239000011664 nicotinic acid Substances 0.000 description 1
- MGFYIUFZLHCRTH-UHFFFAOYSA-N nitrilotriacetic acid Chemical compound OC(=O)CN(CC(O)=O)CC(O)=O MGFYIUFZLHCRTH-UHFFFAOYSA-N 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 239000003921 oil Substances 0.000 description 1
- 235000019198 oils Nutrition 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000002540 palm oil Substances 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 201000006195 perinatal necrotizing enterocolitis Diseases 0.000 description 1
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- SHUZOJHMOBOZST-UHFFFAOYSA-N phylloquinone Natural products CC(C)CCCCC(C)CCC(C)CCCC(=CCC1=C(C)C(=O)c2ccccc2C1=O)C SHUZOJHMOBOZST-UHFFFAOYSA-N 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 239000001103 potassium chloride Substances 0.000 description 1
- 235000011164 potassium chloride Nutrition 0.000 description 1
- JLKDVMWYMMLWTI-UHFFFAOYSA-M potassium iodate Chemical compound [K+].[O-]I(=O)=O JLKDVMWYMMLWTI-UHFFFAOYSA-M 0.000 description 1
- 239000001230 potassium iodate Substances 0.000 description 1
- 235000006666 potassium iodate Nutrition 0.000 description 1
- 229940093930 potassium iodate Drugs 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 108010065320 prolyl-lysyl-glutamyl-lysine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- RADKZDMFGJYCBB-UHFFFAOYSA-N pyridoxal hydrochloride Natural products CC1=NC=C(CO)C(C=O)=C1O RADKZDMFGJYCBB-UHFFFAOYSA-N 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 239000002516 radical scavenger Substances 0.000 description 1
- MUPFEKGTMRGPLJ-ZQSKZDJDSA-N raffinose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO[C@@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)O1 MUPFEKGTMRGPLJ-ZQSKZDJDSA-N 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000001223 reverse osmosis Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- AWUCVROLDVIAJX-GSVOUGTGSA-N sn-glycerol 3-phosphate Chemical compound OC[C@@H](O)COP(O)(O)=O AWUCVROLDVIAJX-GSVOUGTGSA-N 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000011781 sodium selenite Substances 0.000 description 1
- 235000015921 sodium selenite Nutrition 0.000 description 1
- 229960001471 sodium selenite Drugs 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000002600 sunflower oil Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 229960003495 thiamine Drugs 0.000 description 1
- DPJRMOMPQZCRJU-UHFFFAOYSA-M thiamine hydrochloride Chemical compound Cl.[Cl-].CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N DPJRMOMPQZCRJU-UHFFFAOYSA-M 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 239000011573 trace mineral Substances 0.000 description 1
- 235000013619 trace mineral Nutrition 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 108091006107 transcriptional repressors Proteins 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- YWYZEGXAUVWDED-UHFFFAOYSA-N triammonium citrate Chemical compound [NH4+].[NH4+].[NH4+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O YWYZEGXAUVWDED-UHFFFAOYSA-N 0.000 description 1
- IEDVJHCEMCRBQM-UHFFFAOYSA-N trimethoprim Chemical compound COC1=C(OC)C(OC)=CC(CC=2C(=NC(N)=NC=2)N)=C1 IEDVJHCEMCRBQM-UHFFFAOYSA-N 0.000 description 1
- 229960001082 trimethoprim Drugs 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010029599 tyrosyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 235000015112 vegetable and seed oil Nutrition 0.000 description 1
- 239000008158 vegetable oil Substances 0.000 description 1
- 229940118696 vibrio cholerae Drugs 0.000 description 1
- 235000019155 vitamin A Nutrition 0.000 description 1
- 239000011719 vitamin A Substances 0.000 description 1
- 235000010374 vitamin B1 Nutrition 0.000 description 1
- 239000011691 vitamin B1 Substances 0.000 description 1
- 235000019158 vitamin B6 Nutrition 0.000 description 1
- 239000011726 vitamin B6 Substances 0.000 description 1
- 235000019154 vitamin C Nutrition 0.000 description 1
- 239000011718 vitamin C Substances 0.000 description 1
- 235000019166 vitamin D Nutrition 0.000 description 1
- 239000011710 vitamin D Substances 0.000 description 1
- 150000003710 vitamin D derivatives Chemical class 0.000 description 1
- 235000019165 vitamin E Nutrition 0.000 description 1
- 229940046009 vitamin E Drugs 0.000 description 1
- 239000011709 vitamin E Substances 0.000 description 1
- 235000019168 vitamin K Nutrition 0.000 description 1
- 239000011712 vitamin K Substances 0.000 description 1
- 150000003721 vitamin K derivatives Chemical class 0.000 description 1
- 229940045997 vitamin a Drugs 0.000 description 1
- 229940011671 vitamin b6 Drugs 0.000 description 1
- 229940046008 vitamin d Drugs 0.000 description 1
- 229940046010 vitamin k Drugs 0.000 description 1
- 108010000998 wheylin-2 peptide Proteins 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
- NWONKYPBYAMBJT-UHFFFAOYSA-L zinc sulfate Chemical compound [Zn+2].[O-]S([O-])(=O)=O NWONKYPBYAMBJT-UHFFFAOYSA-L 0.000 description 1
- 229910000368 zinc sulfate Inorganic materials 0.000 description 1
- 229960001763 zinc sulfate Drugs 0.000 description 1
Images
Abstract
Description
Предшествующий уровень техникиPrior Art
Настоящее изобретение относится к способу ферментативного получения сиалилированных сахаридов, а также к используемым в нем рекомбинантным или генетически модифицированным микробным клеткам.The present invention relates to a method for the enzymatic production of sialylated saccharides, as well as to the recombinant or genetically modified microbial cells used in it.
На сегодняшний день идентифицировано более 150 различающихся по структуре олигосахаридов грудного молока (НМО, от англ. human milk oligosaccharides). Несмотря на то, что НМО представлены лишь в незначительном количестве среди общих питательных веществ грудного молока, их благотворное влияние на развитие вскармливаемых грудью детей стало очевидным за последние десятилетия.To date, more than 150 breast milk oligosaccharides (HMO, from the English human milk oligosaccharides) differing in structure have been identified. Although HMOs are represented in only minor amounts among the total nutrients in breast milk, their beneficial effects on the development of breastfed infants have become evident over recent decades.
Было обнаружено, что среди НМО сиалилированные НМО (SHMO, от англ. sialylated HMOs) поддерживают устойчивость к энтеропатогенным бактериям и вирусам. Интересно, что недавние исследования дополнительно продемонстрировали защитное действие длинноцепочечных SHMO в отношении некротизирующего энтероколита, который является одним из самых распространенных и приводящих к летальному исходу заболеваний среди недоношенных младенцев. В дополнение к этому, считается, что SHMO поддерживают развитие головного мозга младенцев и их когнитивных способностей. Помимо этого показано, что сиалилированные олигосахариды нейтрализуют энтеротоксины различных патогенных микроорганизмов, включая Escherichia coli, Vibrio cholerae и Salmonella. Кроме того, было обнаружено, что сиалилированные олигосахариды препятствуют колонизации кишечника Helicobacter pylori и, таким образом, предотвращают или подавляют образование язв желудка и двенадцатиперстной кишки.Among HMOs, sialylated HMOs (SHMOs) have been found to support resistance to enteropathogenic bacteria and viruses. Interestingly, recent studies have further demonstrated the protective effects of long-chain SHMOs against necrotizing enterocolitis, which is one of the most common and fatal diseases among preterm infants. In addition to this, SHMOs are thought to support the development of infants' brains and cognitive abilities. In addition, sialylated oligosaccharides have been shown to neutralize enterotoxins from various pathogens, including Escherichia coli, Vibrio cholerae and Salmonella. In addition, sialylated oligosaccharides have been found to inhibit colonization of the intestine by Helicobacter pylori and thus prevent or suppress the formation of gastric and duodenal ulcers.
Среди сиалилированных олигосахаридов наиболее распространенными компонентами грудного молока являются 3'-сиалиллактоза, 6'-сиалиллактоза, сиалиллакто-N-тетраоза а, сиалиллакто-N-тетраоза b, сиалиллакто-N-тетраоза с и дисиалиллакто-N-тетраоза.Among the sialylated oligosaccharides, the most common components of breast milk are 3'-sialyllactose, 6'-sialyllactose, sialyllacto-N-tetraose a, sialyllacto-N-tetraose b, sialyllacto-N-tetraose c and disialyllacto-N-tetraose.
Поскольку сиалилированные олигосахариды имеют сложную структуру, методы их химического или (химико-)ферментативного синтеза являются нестандартными и связаны с большими трудностями, например, с необходимостью регулирования стереохимической конфигурации, образования специфических связей, доступностью исходного сырья и так далее. Соответственно, имеющиеся в продаже сиалилированные олигосахариды весьма дороги вследствие их низкого содержания в природных источниках.Since sialylated oligosaccharides have a complex structure, methods for their chemical or (chemo-)enzymatic synthesis are non-standard and associated with great difficulties, for example, the need to regulate the stereochemical configuration, the formation of specific bonds, the availability of starting materials, and so on. Accordingly, commercially available sialylated oligosaccharides are quite expensive due to their low abundance in natural sources.
Ввиду этого были предприняты усилия по конструированию путей обмена веществ у микроорганизмов, продуцирующих сиалилированные олигосахариды, поскольку этот подход является наиболее перспективным способом получения НМО в промышленном масштабе. Для получения SHMO посредством микробиологической ферментации такой микроорганизм обычно культивируют в присутствии экзогенной сиаловой кислоты.In view of this, efforts have been made to engineer the metabolic pathways of microorganisms that produce sialylated oligosaccharides, since this approach is the most promising way to produce HMOs on an industrial scale. To produce SHMO by microbial fermentation, such a microorganism is usually cultured in the presence of exogenous sialic acid.
В публикации международной заявки WO 2007/101862 А1 описывается способ крупномасштабного синтеза in vivo сиалилированных олигосахаридов с использованием внутриклеточного пула уридиндифосфат-N-ацетилглюкозамина (УДФ-GlcNAc, от англ. N-acetylglucosamine) путем культивирования микроорганизма в культуральной среде, при этом указанный микроорганизм содержит гетерологичные гены, кодирующие синтетазу цитидин-5'-монофосфо-N-ацетилнейраминовой кислоты (ЦМФ-Neu5Ac-синтетазу, от англ. N-acetylneuraminic acid), синтазу сиаловых кислот, С1 cNAc-6-фосфат-2-эпимеразу и сиалилтрансферазу. Вдобавок были делетированы эндогенные гены, кодирующие альдолазу сиаловых кислот (NanA) и ManNac-киназу (NanK).International application publication WO 2007/101862 A1 describes a method for the large-scale in vivo synthesis of sialylated oligosaccharides using an intracellular pool of uridine diphosphate-N-acetylglucosamine (UDP-GlcNAc, from the English N-acetylglucosamine) by cultivating a microorganism in a culture medium, wherein said microorganism contains heterologous genes encoding cytidine-5'-monophospho-N-acetylneuraminic acid synthetase (CMF-Neu5Ac synthetase, N-acetylneuraminic acid), sialic acid synthase, C1 cNAc-6-phosphate-2-epimerase and sialyltransferase. In addition, endogenous genes encoding sialic acid aldolase (NanA) and ManNac kinase (NanK) were deleted.
В публикации международной заявки WO 2014/153253 А1 описываются способы и композиции для модификации бактерий, продуцирующих сиалилированные олигосахариды, а также способ получения сиалилированного олигосахарида в бактерии, причем указанная бактерия содержит экзогенную сиалилтрансферазу, дефектный путь катаболизма сиаловых кислот, обладает способностью синтезировать сиаловые кислоты и содержит функционально активный ген пермеазы лактозы, при этом указанную бактерию культивируют в присутствии лактозы. Способность синтезировать сиаловые кислоты заключается в экспрессии экзогенной ЦМФ-Neu5Ac-синтетазы, экзогенной синтазы сиаловых кислот и экзогенной УДФ-ClcNAc-2-эпимеразы.International application publication WO 2014/153253 A1 describes methods and compositions for modifying bacteria that produce sialylated oligosaccharides, as well as a method for producing a sialylated oligosaccharide in a bacterium, wherein said bacterium contains an exogenous sialyltransferase, a defective sialic acid catabolism pathway, has the ability to synthesize sialic acids, and contains a functionally active lactose permease gene, wherein said bacterium is cultivated in the presence of lactose. The ability to synthesize sialic acids consists of the expression of exogenous CMP-Neu5Ac synthetase, exogenous sialic acid synthase and exogenous UDP-ClcNAc-2 epimerase.
Однако, получение сиалилированных олигосахаридов желательно осуществлять путем микробиологической ферментации, в ходе которой не требуется наличия и/или добавления экзогенной сиаловой кислоты. Кроме того, получение сиалилированных олигосахаридов желательно осуществлять с использованием микроорганизмов, для которых нет необходимости в использовании внутриклеточного пула УДФ-N-ClcNAc, поскольку считается, что это будет энергетически выгодно для клетки.However, the production of sialylated oligosaccharides is preferably carried out by microbiological fermentation, during which the presence and/or addition of exogenous sialic acid is not required. In addition, the production of sialylated oligosaccharides is preferably carried out using microorganisms for which there is no need to use the intracellular pool of UDP-N-ClcNAc, since it is believed that this will be energetically beneficial for the cell.
Краткое описание сущности изобретенияBrief description of the invention
Данная задача решается, помимо прочего, путем разработки способа ферментативного получения сиалилированных сахаридов с применением цельных клеток, который не требует добавления экзогенной сиаловой кислоты, и с использованием генетически модифицированной микробной клетки, которая может синтезировать сиалилированные сахариды в отсутствие экзогенной сиаловой кислоты.This problem is achieved, among other things, by developing a method for the enzymatic production of sialylated saccharides using whole cells, which does not require the addition of exogenous sialic acid, and using a genetically modified microbial cell that can synthesize sialylated saccharides in the absence of exogenous sialic acid.
Согласно одному из аспектов предложен способ получения сиалилированного сахарида, включающий стадии а) предоставления по меньшей мере одной генетически модифицированной микробной клетки, которая содержит (1) путь биосинтеза сиаловой кислоты для внутриклеточного биосинтеза N-ацетилнейраминовой кислоты (Neu5Ac, NeuNAc), при этом указанный путь биосинтеза сиаловой кислоты включает в себя глюкозамин-6-фосфат-N-ацетилтрансферазу, (2) синтетазу цитидин-5'-монофосфо-(ЦМФ)-сиаловых кислот и (3) гетерологичную сиалилтрансферазу; b) культивирования по меньшей мере одной генетически модифицированной микробной клетки в ферментационном бульоне и в условиях, позволяющих получать указанный сиалилированный сахарид; и возможно с) извлечения указанного сиалилированного сахарида.According to one aspect, there is provided a method for producing a sialylated saccharide, comprising the steps of a) providing at least one genetically modified microbial cell that contains (1) a sialic acid biosynthetic pathway for the intracellular biosynthesis of N-acetylneuraminic acid (Neu5Ac, NeuNAc), wherein said pathway sialic acid biosynthesis includes glucosamine-6-phosphate-N-acetyltransferase, (2) cytidine-5'-monophospho-(CMP)-sialic acid synthetase, and (3) heterologous sialyltransferase; b) cultivating at least one genetically modified microbial cell in a fermentation broth and under conditions allowing the production of said sialylated saccharide; and optionally c) recovering said sialylated saccharide.
Согласно другому аспекту предложена генетически модифицированная микробная клетка для получения сиалилированного сахарида, причем данная микробная клетка содержит (1) путь биосинтеза сиаловой кислоты для внутриклеточного биосинтеза N-ацетилнейраминовой кислоты, при этом указанный путь биосинтеза сиаловой кислоты содержит глюкозамин-6-фосфат-N-ацетилтрансферазу; (2) синтетазу цитидин-5'-монофосфо-(ЦМФ)-N-ацетилнейраминовой кислоты для переноса N-ацетилнейраминовой кислоты на цитидин-5'-монофосфат с образованием ЦМФ-активированной N-ацетилнейраминовой кислоты; и (3) гетерологичную сиалилтрансферазу.In another aspect, a genetically modified microbial cell is provided for producing a sialylated saccharide, wherein the microbial cell comprises (1) a sialic acid biosynthetic pathway for intracellular biosynthesis of N-acetylneuraminic acid, wherein said sialic acid biosynthetic pathway comprises glucosamine-6-phosphate-N-acetyltransferase ; (2) cytidine 5'-monophospho-(CMP)-N-acetylneuraminic acid synthetase to transfer N-acetylneuraminic acid to cytidine 5'-monophosphate to form CMP-activated N-acetylneuraminic acid; and (3) heterologous sialyltransferase.
Согласно другому аспекту предложен сиалилированный сахарид, который может быть получен способом или с использованием генетически модифицированной микробной клетки по изобретению.According to another aspect, a sialylated saccharide is provided that can be produced by the method or using a genetically modified microbial cell of the invention.
Согласно другому аспекту предложено применение сиалилированного сахарида, который получают способом или с использованием генетически модифицированной микробной клетки по изобретению, для приготовления пищевой композиции, предпочтительно композиции для грудных детей.According to another aspect, the use of a sialylated saccharide, which is obtained by the method or using a genetically modified microbial cell according to the invention, for the preparation of a nutritional composition, preferably a composition for infants, is provided.
Согласно еще одному аспекту предложена пищевая композиция, содержащая по меньшей мере один сиалилированный сахарид, полученный способом или с использованием генетически модифицированной микробной клетки по изобретению.According to yet another aspect, a food composition is provided comprising at least one sialylated saccharide produced by a method or using a genetically modified microbial cell of the invention.
Краткое описание графических материаловBrief description of graphic materials
На Фиг. 1 показано схематичное представление пути биосинтеза сиаловой кислоты, который может быть использован генетически модифицированной микробной клеткой для ферментативного получения сиалилированных сахаридов, при этом в указанном пути биосинтеза сиаловой кислоты используется УДФ-GlcNAc.In FIG. 1 shows a schematic representation of a sialic acid biosynthetic pathway that can be used by a genetically modified microbial cell to enzymatically produce sialylated saccharides, wherein said sialic acid biosynthetic pathway utilizes UDP-GlcNAc.
На Фиг. 2 показано схематичное представление пути биосинтеза сиаловой кислоты, который может быть использован генетически модифицированной микробной клеткой по изобретению для ферментативного получения сиалилированных сахаридов.In FIG. 2 is a schematic representation of the sialic acid biosynthetic pathway that can be used by a genetically modified microbial cell of the invention to enzymatically produce sialylated saccharides.
На Фиг. 3 показано схематичное представление другого пути биосинтеза сиаловой кислоты, который может быть использован генетически модифицированной микробной клеткой по изобретению для ферментативного получения сиалилированных сахаридов.In FIG. 3 is a schematic representation of another sialic acid biosynthetic pathway that can be used by a genetically modified microbial cell of the invention to enzymatically produce sialylated saccharides.
Подробное описаниеDetailed description
Согласно первому аспекту предложен способ ферментативного получения сиалилированного сахарида. Способ включает стадии а) предоставления по меньшей мере одной генетически модифицированной микробной клетки, способной синтезировать данный сиалилированный сахарид, при этом указанная по меньшей мере одна генетически модифицированная микробная клетка содержит (1) путь биосинтеза сиаловой кислоты, включающий в себя глюкозамин-6-фосфат-N-ацетилтрансферазу; (2) синтетазу цитидин-5'-монофосфо-(ЦМФ)-N-ацетилнейраминовой кислоты; и (3) гетерологичную сиалилтрансферазу; b) культивирования по меньшей мере одной генетически модифицированной микробной клетки в ферментационном бульоне и в условиях, позволяющихполучать указанный сиалилированный сахарид, и возможно с) извлечения указанного сиалилированного сахарида.According to a first aspect, a method is provided for the enzymatic production of a sialylated saccharide. The method includes the steps of a) providing at least one genetically modified microbial cell capable of synthesizing a given sialylated saccharide, wherein said at least one genetically modified microbial cell comprises (1) a sialic acid biosynthetic pathway comprising glucosamine-6-phosphate- N-acetyltransferase; (2) cytidine-5'-monophospho-(CMP)-N-acetylneuraminic acid synthetase; and (3) heterologous sialyltransferase; b) cultivating at least one genetically modified microbial cell in a fermentation broth and under conditions allowing the production of said sialylated saccharide, and optionally c) recovering said sialylated saccharide.
Соответственно, согласно второму аспекту изобретение также относится к генетически модифицированной микробной клетке для ферментативного получения сиалилированного сахарида, причем данная микробная клетка содержит (1) путь биосинтеза сиаловой кислоты для внутриклеточного биосинтеза N-ацетилнейраминовой кислоты, при этом указанный путь биосинтеза сиаловой кислоты включает в себя глюкозамин-6-фосфат-N-ацетилтрансферазу; (2) синтетазу цитидин-5'-монофосфо-(ЦМФ)-сиаловых кислот для переноса N-ацетилнейраминовой кислоты на цитидин-5'-монофосфат с образованием ЦМФ-активированной сиаловой кислоты; и (3) сиалилтрансферазу для переноса группировки N-ацетилнейраминовой кислоты с ЦМФ-активированной сиаловой кислоты в качестве донорного субстрата на акцепторную молекулу, причем акцепторной молекулой является молекула сахарида, в результате чего осуществляется внутриклеточный биосинтез сиалилированного сахарида.Accordingly, according to a second aspect, the invention also provides a genetically modified microbial cell for the enzymatic production of a sialylated saccharide, wherein the microbial cell comprises (1) a sialic acid biosynthetic pathway for the intracellular biosynthesis of N-acetylneuraminic acid, wherein said sialic acid biosynthetic pathway includes glucosamine -6-phosphate-N-acetyltransferase; (2) cytidine 5'-monophospho-(CMP)-sialic acid synthetase to transfer N-acetylneuraminic acid to cytidine 5'-monophosphate to form CMP-activated sialic acid; and (3) a sialyltransferase to transfer the N-acetylneuraminic acid moiety from CMP-activated sialic acid as a donor substrate to an acceptor molecule, the acceptor molecule being a saccharide molecule, resulting in intracellular biosynthesis of the sialylated saccharide.
Генетически модифицированная микробная клетка содержит путь биосинтеза сиаловой кислоты для внутриклеточного биосинтеза N-ацетилнейраминовой кислоты, в котором не используется УДФ-GlcNAc. Генетически модифицированная микробная клетка содержит путь биосинтеза сиаловой кислоты для внутриклеточного биосинтеза N-ацетилнейраминовой кислоты с использованием глюкозамин-6-фосфат-N-ацетилтрансферазы. В пути биосинтеза сиаловой кислоты с использованием глюкозамин-6-фосфат-N-ацетилтрансферазы для внутриклеточного биосинтеза N-ацетилнейраминовой кислоты не используется УДФ-GlcNAc для биосинтеза сиаловой кислоты (Фиг. 2 и Фиг. 3).The genetically modified microbial cell contains a sialic acid biosynthesis pathway for the intracellular biosynthesis of N-acetylneuraminic acid, which does not use UDP-GlcNAc. The genetically modified microbial cell contains a sialic acid biosynthetic pathway for the intracellular biosynthesis of N-acetylneuraminic acid using glucosamine 6-phosphate N-acetyltransferase. The sialic acid biosynthesis pathway using glucosamine-6-phosphate-N-acetyltransferase for intracellular N-acetylneuraminic acid biosynthesis does not use UDP-GlcNAc for sialic acid biosynthesis (Figure 2 and Figure 3).
Путь биосинтеза сиаловой кислоты содержит активности ферментов глутамин:фруктозо-6-фосфат-аминотрансферазы и синтазы N-ацетилнейраминовой кислоты. Путь биосинтеза сиаловой кислоты также содержит а) активности ферментов глюкозамин-6-фосфат-N-ацетилтрансферазы, N-ацетилглюкозамин-6-фосфат-фосфатазы и N-ацетилглюкозамин-2-эпимеразы (Фиг. 2); и/или b) активности ферментов глюкозамин-6-фосфат-N-ацетилтрансферазы, N-ацетилглюкозамин-6-фосфат-эпимеразы и N-ацетилманнозамин-6-фосфатфосфатазы (Фиг. 3). Поэтому, для внутриклеточного биосинтеза сиаловой кислоты нет необходимости в том, чтобы генетически модифицированная микробная клетка содержала активности ферментов фосфоглюкозамин-мутазы, N-ацетилглюкозамин-1-фосфат-уридилтрансферазы и УДФ-N-ацетилглюкозамин-2-эпимеразы с одновременным высвобождением УДФ (Фиг. 1). Таким образом, в дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка, способная синтезировать сиаловую кислоту, не содержит активности одного или более ферментов, выбранных из группы, состоящей из активности ферментов фосфоглюкозамин-мутазы, N-ацетилглюкозамин-1-фосфат-уридилтрансферазы и УДФ-N-ацетилглюкозамин-2-эпимеразы с одновременным высвобождением УДФ.The sialic acid biosynthetic pathway contains the activities of the enzymes glutamine:fructose-6-phosphate aminotransferase and N-acetylneuraminic acid synthase. The sialic acid biosynthetic pathway also contains a) the enzyme activities of glucosamine 6-phosphate N-acetyltransferase, N-acetylglucosamine 6-phosphate phosphatase and N-acetylglucosamine 2-epimerase (Figure 2); and/or b) the activities of the enzymes glucosamine 6-phosphate N-acetyltransferase, N-acetylglucosamine 6-phosphate epimerase and N-acetylmannosamine 6-phosphate phosphatase (Figure 3). Therefore, for the intracellular biosynthesis of sialic acid, it is not necessary for the genetically modified microbial cell to contain the activities of the enzymes phosphoglucosamine mutase, N-acetylglucosamine-1-phosphate-uridyltransferase and UDP-N-acetylglucosamine-2-epimerase with the simultaneous release of UDP (Fig. 1). Thus, in a further and/or alternative embodiment, the genetically modified microbial cell capable of synthesizing sialic acid does not contain the activity of one or more enzymes selected from the group consisting of the activity of the enzymes phosphoglucosamine mutase, N-acetylglucosamine-1-phosphate uridyltransferase and UDP-N-acetylglucosamine-2-epimerase with simultaneous release of UDP.
Фермент глутамин:фруктозо-6-фосфат-аминотрансфераза (ЕС 2.6.1.16) катализирует превращение фруктозо-6-фосфата (Frc-6P) в глюкозамин-6-фосфат (GlcN-6P) с использованием глутамина. Обычно считается, что эта ферментативная реакция является первой стадией в пути биосинтеза гексозаминов. Альтернативными названиями глутамин:фруктозо-6-фосфат-аминотрансферазы являются D-фруктозо-6-фосфат-аминотрансфераза, GFAT (от англ. Glutamine-Fructose-6-Phosphate Aminotransferase - глутамин-фруктозо-6-фосфат-аминотрансфераза), глюкозамин-6-фосфат-синтаза, гексозофосфат-аминотрансфераза и L-глутамин-D-фруктозо-6-фосфат-аминотрансфераза.The enzyme glutamine:fructose-6-phosphate aminotransferase (EC 2.6.1.16) catalyzes the conversion of fructose-6-phosphate (Frc-6P) to glucosamine-6-phosphate (GlcN-6P) using glutamine. This enzymatic reaction is generally believed to be the first step in the hexosamine biosynthetic pathway. Alternative names for glutamine:fructose-6-phosphate aminotransferase are D-fructose-6-phosphate aminotransferase, GFAT (Glutamine-Fructose-6-Phosphate Aminotransferase), glucosamine-6 -phosphate synthase, hexose phosphate aminotransferase and L-glutamine-D-fructose-6-phosphate aminotransferase.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка содержит глутамин:фруктозо-6-фосфат-аминотрансферазу, предпочтительно гетерологичную глутамин:фруктозо-6-фосфат-аминотрансферазу, более предпочтительно глутамин:фруктозо-6-фосфат-аминотрансферазу, которая происходит из Е. coli (GlmS из Е. coli (UniProtKB - Р17169; SEQ ID NO 67)) или функционально активный вариант GlmS из Е. coli. Наиболее предпочтительно, функционально активный вариант представляет собой версию GlmS из Е. coli, которая демонстрирует значительно сниженную чувствительность к ингибированию глюкозамин-6-фосфатом по сравнению с ферментом дикого типа. Пример функционально активного варианта GlmS из Е. coli, который демонстрирует значительно сниженную чувствительность к ингибированию глюкозамин-6-фосфатом, представлен в SEQ ID NO 68.In a further and/or alternative embodiment, the genetically modified microbial cell comprises a glutamine:fructose-6-phosphate aminotransferase, preferably a heterologous glutamine:fructose-6-phosphate aminotransferase, more preferably a glutamine:fructose-6-phosphate aminotransferase, which is derived from E . coli (GlmS from E. coli (UniProtKB - P17169; SEQ ID NO 67)) or a functionally active variant of GlmS from E. coli. Most preferably, the functionally active variant is a version of GlmS from E. coli that exhibits significantly reduced sensitivity to inhibition by glucosamine 6-phosphate compared to the wild type enzyme. An example of a functionally active GlmS variant from E. coli that exhibits significantly reduced sensitivity to glucosamine 6-phosphate inhibition is provided in SEQ ID NO 68.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка включает молекулу нуклеиновой кислоты, содержащую нуклеотидную последовательность, которая кодирует глутамин:фруктозо-6-фосфат-аминотрансферазу, предпочтительно глутамин:фруктозо-6-фосфат-аминотрансферазу GlmS из Е. coli (SEQ ID NO 69), или нуклеотидную последовательность, кодирующую функционально активный вариант, представляющий собой версию GlmS из E. coli, которая демонстрирует значительно сниженную чувствительность к ингибированию глюкозамин-6-фосфатом по сравнению с ферментом дикого типа, (glmS*54 или glmS* (как представлено в SEQ ID NO 70)).In a further and/or alternative embodiment, the genetically modified microbial cell includes a nucleic acid molecule comprising a nucleotide sequence that encodes a glutamine:fructose-6-phosphate aminotransferase, preferably glutamine:fructose-6-phosphate aminotransferase GlmS from E. coli (SEQ ID NO 69), or a nucleotide sequence encoding a functionally active variant that is a version of GlmS from E. coli that exhibits significantly reduced sensitivity to inhibition by glucosamine-6-phosphate compared with the wild-type enzyme, (glmS*54 or glmS* (as presented in SEQ ID NO 70)).
Таким образом, в дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка включает молекулу нуклеиновой кислоты, содержащую и экспрессирующую нуклеотидную последовательность, выбранную из группы, состоящей из:Thus, in a further and/or alternative embodiment, the genetically modified microbial cell includes a nucleic acid molecule containing and expressing a nucleotide sequence selected from the group consisting of:
1) нуклеотидных последовательностей, кодирующих полипептид, который представлен в любой из SEQ ID NO 67 и SEQ ID NO 68;1) nucleotide sequences encoding a polypeptide that is presented in any of SEQ ID NO 67 and SEQ ID NO 68;
2) нуклеотидных последовательностей, которые представлены в любой из SEQ ID NO 69 и SEQ ID NO 70;2) nucleotide sequences that are presented in any of SEQ ID NO 69 and SEQ ID NO 70;
3) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с одной из нуклеотидных последовательностей, кодирующих полипептид, который представлен в любой из SEQ ID NO 67 и SEQ ID NO 68;3) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% with one of the nucleotide sequences encoding a polypeptide that is presented in any of SEQ ID NO 67 and SEQ ID NO 68;
4) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с одной из нуклеотидных последовательностей, которые представлены в любой из SEQ ID NO 69 и SEQ ID NO 70;4) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% with one of the nucleotide sequences that are presented in any of SEQ ID NO 69 and SEQ ID NO 70;
5) нуклеотидных последовательностей, комплементарных любой из нуклеотидных последовательностей из (1), (2), (3) и (4); и5) nucleotide sequences complementary to any of the nucleotide sequences from (1), (2), (3) and (4); And
6) фрагментов любой из нуклеотидных последовательностей из (1), (2), (3), (4) и (5);6) fragments of any of the nucleotide sequences from (1), (2), (3), (4) and (5);
при этом указанная нуклеотидная последовательность функционально связана по меньшей мере с одной последовательностью контроля экспрессии нуклеиновой кислоты, воздействуя на транскрипцию и/или трансляцию указанной нуклеотидной последовательности в генетически модифицированной микробной клетке, для обеспечения внутриклеточной активности глутамин:фруктозо-6-фосфат-аминотрансферазы.wherein said nucleotide sequence is operably linked to at least one nucleic acid expression control sequence, affecting the transcription and/or translation of said nucleotide sequence in a genetically modified microbial cell to ensure intracellular glutamine:fructose-6-phosphate aminotransferase activity.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка обладает глюкозамин-6-фосфат-N-ацетилтрансферазной активностью. Под действием указанной глюкозамин-6-фосфат-N-ацетилтрансферазы GlcN-6P превращается в N-ацетилглюкозамин-6-фосфат (GlcNAc-6P). Примером глюкозамин-6-фосфат-N-ацетилтрансферазы является Gna1 из Saccharomyces cerevisiae (UniProtKB - Р43577; SEQ ID NO 77).In a further and/or alternative embodiment, the genetically modified microbial cell has glucosamine 6-phosphate N-acetyltransferase activity. Under the action of the specified glucosamine-6-phosphate-N-acetyltransferase, GlcN-6P is converted into N-acetylglucosamine-6-phosphate (GlcNAc-6P). An example of glucosamine-6-phosphate-N-acetyltransferase is Gna1 from Saccharomyces cerevisiae (UniProtKB - P43577; SEQ ID NO 77).
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка содержит глюкозамин-6-фосфат-N-ацетилтрансферазу, предпочтительно гетерологичную глюкозамин-6-фосфат-N-ацетилтрансферазу, более предпочтительно Gna1 из S. cerevisiae (кодируемую нуклеотидной последовательностью, представленной в SEQ ID NO 78) или ее функционально активный вариант.In a further and/or alternative embodiment, the genetically modified microbial cell comprises a glucosamine 6-phosphate N-acetyltransferase, preferably a heterologous glucosamine 6-phosphate N-acetyltransferase, more preferably Gna1 from S. cerevisiae (encoded by the nucleotide sequence set forth in SEQ ID NO 78) or its functionally active variant.
Таким образом, в дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка включает молекулу нуклеиновой кислоты, содержащую и экспрессирующую нуклеотидную последовательность, выбранную из группы, состоящей из:Thus, in a further and/or alternative embodiment, the genetically modified microbial cell includes a nucleic acid molecule containing and expressing a nucleotide sequence selected from the group consisting of:
1) нуклеотидных последовательностей, кодирующих полипептид, представленный в SEQ ID NO 77;1) nucleotide sequences encoding the polypeptide presented in SEQ ID NO 77;
2) нуклеотидных последовательностей, представленных в SEQ ID NO 78;2) nucleotide sequences presented in SEQ ID NO 78;
3) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с нуклеотидными последовательностями, кодирующими полипептид, представленный в SEQ ID NO 77;3) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99%, or greater than 99% with the nucleotide sequences encoding the polypeptide presented in SEQ ID NO 77;
4) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с одной из нуклеотидных последовательностей, представленных в SEQ ID NO 78;4) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99%, or greater than 99% with one of the nucleotide sequences presented in SEQ ID NO 78;
5) нуклеотидных последовательностей, комплементарных любой из нуклеотидных последовательностей из (1), (2), (3) и (4); и5) nucleotide sequences complementary to any of the nucleotide sequences from (1), (2), (3) and (4); And
6) фрагментов любой из нуклеотидных последовательностей из (1), (2), (3), (4) и (5);6) fragments of any of the nucleotide sequences from (1), (2), (3), (4) and (5);
при этом указанная нуклеотидная последовательность функционально связана по меньшей мере с одной последовательностью контроля экспрессии нуклеиновой кислоты, воздействуя на транскрипцию и/или трансляцию указанной нуклеотидной последовательности в генетически модифицированной микробной клетке, для обеспечения внутриклеточной активности глутамин:фруктозо-6-фосфат-аминотрансферазы.wherein said nucleotide sequence is operably linked to at least one nucleic acid expression control sequence, affecting the transcription and/or translation of said nucleotide sequence in a genetically modified microbial cell to ensure intracellular glutamine:fructose-6-phosphate aminotransferase activity.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка обладает N-ацетилглюкозамин-6-фосфат-фосфатазной активностью. Под действием указанной N-ацетилглюкозамин-6-фосфат-фосфатазы GlcNAc-6P превращается в N-ацетилглюкозамин (GlcNAc). Примерами N-ацетилглюкозамин-6-фосфат-фосфатазы являются фосфатазы сахаров HAD-подобного суперсемейства (от англ. haloacid dehydrogenase - дегидрогеназа галогенкислоты), которые катализируют превращение GlcNAc-6P в GlcNAc. HAD-подобное суперсемейство ферментов названо по имени бактериального фермента дегидрогеназы галогенкислоты и включает фосфатазы. Подходящая фосфатаза HAD-подобного суперсемейства, катализирующая превращение GlcNAc-6P в GlcNAc, может быть выбрана из группы, состоящей из фруктозо-1-фосфат-фосфатазы (YqaB, UniProtKB - Р77475; SEQ ID NO 79) и альфа-D-глюкозо-1-фосфат-фосфатазы (YihX, UniProtKB - P0A8Y3; SEQ ID NO 80). Считается, что ферменты YqaB из E, coli и YihX из E.coli также воздействуют на GlcNAc-6P (Lee, S.-W. and Oh, M.-K. (2015) Metabolic Engineering, 28: 143-150).In a further and/or alternative embodiment, the genetically modified microbial cell has N-acetylglucosamine-6-phosphate phosphatase activity. Under the action of the specified N-acetylglucosamine-6-phosphate phosphatase, GlcNAc-6P is converted to N-acetylglucosamine (GlcNAc). Examples of N-acetylglucosamine-6-phosphate phosphatases are sugar phosphatases of the HAD-like superfamily (haloacid dehydrogenase), which catalyze the conversion of GlcNAc-6P to GlcNAc. The HAD-like superfamily of enzymes is named after the bacterial halide dehydrogenase enzyme and includes phosphatases. A suitable HAD-like superfamily phosphatase catalyzing the conversion of GlcNAc-6P to GlcNAc may be selected from the group consisting of fructose-1-phosphate phosphatase (YqaB, UniProtKB - P77475; SEQ ID NO 79) and alpha-D-glucose-1 -phosphate phosphatase (YihX, UniProtKB - P0A8Y3; SEQ ID NO 80). The enzymes YqaB from E. coli and YihX from E. coli are also thought to act on GlcNAc-6P (Lee, S.-W. and Oh, M.-K. (2015) Metabolic Engineering, 28: 143-150).
В дополнительном и/или альтернативном воплощении фосфатаза сахаров HAD-подобного суперсемейства, катализирующая превращение GlcNAc-6P в GlcNAc, представляет собой гетерологичный фермент в генетически модифицированной микробной клетке. В дополнительном и/или альтернативном воплощении фосфатаза сахаров HAD-подобного суперсемейства, катализирующая превращение GlcNAc6P в GlcNAc, выбрана из группы, состоящей из YqaB из Е. coli, YihX из Е. coli и их функциональных вариантов.In an additional and/or alternative embodiment, the HAD-like superfamily sugar phosphatase that catalyzes the conversion of GlcNAc-6P to GlcNAc is a heterologous enzyme in a genetically modified microbial cell. In an additional and/or alternative embodiment, the HAD-like superfamily sugar phosphatase catalyzing the conversion of GlcNAc6P to GlcNAc is selected from the group consisting of E. coli YqaB, E. coli YihX, and functional variants thereof.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка включает молекулу нуклеиновой кислоты, которая содержит и экспрессирует нуклеотидную последовательность, кодирующую фосфатазу сахаров HAD-подобного суперсемейства, катализирующую превращение GlcNAc-6P в GlcNAc. В дополнительном и/или альтернативном воплощении нуклеотидная последовательность, кодирующая фосфатазу сахаров HAD-подобного суперсемейства, катализирующую превращение GlcNAc-6P в GlcNAc, представляет собой гетерологичную нуклеотидную последовательность. В дополнительном и/или альтернативном воплощении нуклеотидная последовательность, кодирующая фосфатазу сахаров HAD-подобного суперсемейства, катализирующую превращение GlcNAc-6P в GlcNAc, кодирует фруктозо-1-фосфат-фосфатазу из Е. coli или альфа-D-глюкозо-1-фосфат-фосфатазу из Е, coli либо функциональный фрагмент одного из этих двух ферментов.In an additional and/or alternative embodiment, the genetically modified microbial cell includes a nucleic acid molecule that contains and expresses a nucleotide sequence encoding a HAD-like superfamily sugar phosphatase that catalyzes the conversion of GlcNAc-6P to GlcNAc. In an additional and/or alternative embodiment, the nucleotide sequence encoding a HAD-like superfamily sugar phosphatase that catalyzes the conversion of GlcNAc-6P to GlcNAc is a heterologous nucleotide sequence. In an additional and/or alternative embodiment, the nucleotide sequence encoding a HAD-like superfamily sugar phosphatase catalyzing the conversion of GlcNAc-6P to GlcNAc encodes E. coli fructose-1-phosphate phosphatase or alpha-D-glucose-1-phosphate phosphatase from E, coli or a functional fragment of one of these two enzymes.
YqaB из Е. coli кодируется нуклеотидной последовательностью, представленной в SEQ ID NO 81, в то время как YihX из Е. coli кодируется нуклеотидными последовательностями, представленными в SEQ ID NO 82. Таким образом, в дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка включает молекулу нуклеиновой кислоты, содержащую и экспрессирующую нуклеотидную последовательность, выбранную из группы, состоящей из:YqaB from E. coli is encoded by the nucleotide sequence set forth in SEQ ID NO 81, while YihX from E. coli is encoded by the nucleotide sequence set forth in SEQ ID NO 82. Thus, in a further and/or alternative embodiment, a genetically modified microbial cell includes a nucleic acid molecule containing and expressing a nucleotide sequence selected from the group consisting of:
1) нуклеотидных последовательностей, кодирующих полипептид, который представлен в любой из SEQ ID NO 79 и SEQ ID NO 80;1) nucleotide sequences encoding a polypeptide that is presented in any of SEQ ID NO 79 and SEQ ID NO 80;
2) нуклеотидных последовательностей, которые представлены в любой из SEQ ID NO 81 и SEQ ID NO 82;2) nucleotide sequences that are presented in any of SEQ ID NO 81 and SEQ ID NO 82;
3) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с любой из нуклеотидных последовательностей, кодирующих полипептид, который представлен в любой из SEQ ID NO 79 и SEQ ID NO 80;3) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99%, or greater than 99% with any of the nucleotide sequences encoding a polypeptide that is presented in any of SEQ ID NO 79 and SEQ ID NO 80;
4) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с любой из нуклеотидных последовательностей, которые представлены в одной из SEQ ID NO 81 и SEQ ID NO 82;4) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% with any of the nucleotide sequences that are presented in one of SEQ ID NO 81 and SEQ ID NO 82;
5) нуклеотидных последовательностей, комплементарных любой из нуклеотидных последовательностей из (1), (2), (3) и (4); и5) nucleotide sequences complementary to any of the nucleotide sequences from (1), (2), (3) and (4); And
6) фрагментов любой из нуклеотидных последовательностей из (1), (2), (3), (4) и (5);6) fragments of any of the nucleotide sequences from (1), (2), (3), (4) and (5);
при этом указанная нуклеотидная последовательность функционально связана по меньшей мере с одной последовательностью контроля экспрессии нуклеиновой кислоты, воздействуя на транскрипцию и/или трансляцию указанной нуклеотидной последовательности в генетически модифицированной микробной клетке, для обеспечения внутриклеточной активности фосфатазы сахаров, которая катализирует превращение GlcNAc-6P в GlcNAc.wherein said nucleotide sequence is operably linked to at least one nucleic acid expression control sequence, affecting the transcription and/or translation of said nucleotide sequence in the genetically modified microbial cell to provide intracellular sugar phosphatase activity that catalyzes the conversion of GlcNAc-6P to GlcNAc.
В дополнительном и/или альтернативном воплощении не встречающийся в природе микроорганизм генетически модифицирован таким образом, чтобы содержать молекулу нуклеиновой кислоты, включающую и экспрессирующую нуклеотидную последовательность, которая кодирует фосфатазу сахаров HAD-подобного суперсемейства, катализирующую превращение GlcNAc-6P в GlcNAc, или функциональный фрагмент указанной HAD-фосфатазы, и/или, чтобы содержать фосфатазу сахаров HAD-подобного суперсемейства.In an additional and/or alternative embodiment, the non-naturally occurring microorganism is genetically modified to contain a nucleic acid molecule comprising and expressing a nucleotide sequence that encodes a HAD-like superfamily sugar phosphatase catalyzing the conversion of GlcNAc-6P to GlcNAc, or a functional fragment thereof HAD phosphatases, and/or to contain a sugar phosphatase of the HAD-like superfamily.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка обладает N-ацетилглюкозамин-2-эпимеразной активностью. N-Ацетилглюкозамин-2-эпимераза (ЕС 5.1.3.8) представляет собой фермент, который катализирует превращение N-ацетилглюкозамина (GlcNAc) в N-ацетилманнозамин (ManNAc). Данный фермент представляет собой рацемазу, действующую на углеводы и их производные. Систематическим названием фермента этого класса является N-ацил-D-глюкозамин-2-эпимераза. Этот фермент принимает участие в метаболизме амино-сахаров и метаболизме нуклеотид-сахаров, предпочтительно представляет собой гетерологичную N-ацетилглюкозамин-2-эпимеразу.In a further and/or alternative embodiment, the genetically modified microbial cell has N-acetylglucosamine-2-epimerase activity. N-Acetylglucosamine 2-epimerase (EC 5.1.3.8) is an enzyme that catalyzes the conversion of N-acetylglucosamine (GlcNAc) to N-acetylmannosamine (ManNAc). This enzyme is a racemase that acts on carbohydrates and their derivatives. The systematic name for this class of enzyme is N-acyl-D-glucosamine-2-epimerase. This enzyme is involved in amino sugar metabolism and nucleotide sugar metabolism and is preferably a heterologous N-acetylglucosamine-2-epimerase.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка содержит N-ацетилглюкозамин-2-эпимеразу, предпочтительно гетерологичную N-ацетилглюкозамин-2-эпимеразу. Были описаны примеры N-ацетилглюкозамин-2-эпимераз из Anabena variabilis, Acaryochloris sp., Nostoc sp., Nostoc punctiforme, Bacteroides ovatus или Synechocystis sp. Примером подходящей N-ацетилглюкозамин-2-эпимеразы является N-ацетилглюкозамин-2-эпимераза из В. ovatus АТСС (Американская коллекция типовых культур) 8483 (UniProtKB - A7LVG6, SEQ ID NO 83), которая кодируется геном BACOVA_01816 (SEQ ID NO 85). Другим примером является Л/-ацетилглюкозамин-2-эпимераза из Synechocystis sp.(штамма РСС 6803) (UniProtKB - Р74124; SEQ ID NO 84), которая также известна как ренин-связывающий белок и кодируется геном slr1975 (SEQ ID NO 86).In a further and/or alternative embodiment, the genetically modified microbial cell contains N-acetylglucosamine-2-epimerase, preferably a heterologous N-acetylglucosamine-2-epimerase. Examples of N-acetylglucosamine-2-epimerases from Anabena variabilis, Acaryochloris sp., Nostoc sp., Nostoc punctiforme, Bacteroides ovatus or Synechocystis sp. have been described. An example of a suitable N-acetylglucosamine-2-epimerase is N-acetylglucosamine-2-epimerase from B. ovatus ATCC (American Type Culture Collection) 8483 (UniProtKB - A7LVG6, SEQ ID NO 83), which is encoded by the gene BACOVA_01816 (SEQ ID NO 85) . Another example is L-acetylglucosamine-2-epimerase from Synechocystis sp. (strain PCC 6803) (UniProtKB - P74124; SEQ ID NO 84), which is also known as renin-binding protein and is encoded by the slr1975 gene (SEQ ID NO 86).
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка включает молекулу нуклеиновой кислоты, содержащую нуклеотидную последовательность, которая кодирует N-ацетилглюкозамин-2-эпимеразу, предпочтительно N-ацетилглюкозамин-2-эпимеразу из В. ovatus АТСС 8483 или Synechocystis sp.(штамма РСС 6803) либо их функциональный вариант.In a further and/or alternative embodiment, the genetically modified microbial cell includes a nucleic acid molecule containing a nucleotide sequence that encodes N-acetylglucosamine-2-epimerase, preferably N-acetylglucosamine-2-epimerase from B. ovatus ATCC 8483 or Synechocystis sp. (strain RSS 6803) or their functional version.
Таким образом, в дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка включает молекулу нуклеиновой кислоты, содержащую и экспрессирующую нуклеотидную последовательность, выбранную из группы, состоящей из:Thus, in a further and/or alternative embodiment, the genetically modified microbial cell includes a nucleic acid molecule containing and expressing a nucleotide sequence selected from the group consisting of:
1) нуклеотидных последовательностей, кодирующих полипептид, который представлен в любой из SEQ ID NO 83 и SEQ ID NO 84;1) nucleotide sequences encoding a polypeptide that is presented in any of SEQ ID NO 83 and SEQ ID NO 84;
2) нуклеотидных последовательностей, которые представлены в любой из SEQ ID NO 85 и SEQ ID NO 86;2) nucleotide sequences that are presented in any of SEQ ID NO 85 and SEQ ID NO 86;
3) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с одной из нуклеотидных последовательностей, кодирующих полипептид, который представлен в любой из SEQ ID NO 83 и SEQ ID NO 84;3) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% with one of the nucleotide sequences encoding a polypeptide that is presented in any of SEQ ID NO 83 and SEQ ID NO 84;
4) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с одной из нуклеотидных последовательностей, которые представлены в любой из SEQ ID NO 85 и SEQ ID NO 86;4) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% with one of the nucleotide sequences that are presented in any of SEQ ID NO 85 and SEQ ID NO 86;
5) нуклеотидных последовательностей, комплементарных любой из нуклеотидных последовательностей из (1), (2), (3) и (4); и5) nucleotide sequences complementary to any of the nucleotide sequences from (1), (2), (3) and (4); And
6) фрагментов любой из нуклеотидных последовательностей из (1), (2), (3), (4) и (5);6) fragments of any of the nucleotide sequences from (1), (2), (3), (4) and (5);
при этом указанная нуклеотидная последовательность функционально связана по меньшей мере с одной последовательностью контроля экспрессии нуклеиновой кислоты, воздействуя на транскрипцию и/или трансляцию указанной нуклеотидной последовательности для обеспечения внутриклеточной активности N-ацетилглюкозамин-2-эпимеразы.wherein said nucleotide sequence is operably linked to at least one nucleic acid expression control sequence, affecting the transcription and/or translation of said nucleotide sequence to ensure intracellular N-acetylglucosamine-2-epimerase activity.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка обладает N-ацетилглюкозамин-6-фосфат-эпимеразной активностью и N-ацетилманнозамин-6-фосфат-фосфатазной активностью. N-Ацетилглюкозамин-6-фосфат-эпимераза катализирует превращение N-ацетилглюкозамин-6-фосфата (GlcNAc-6P) в N-ацетилманнозамин-6-фосфат (ManNAc-6P), в то время как N-ацетилманнозамин-6-фосфат-фосфатаза дефосфорилирует ManNAc-6P с получением N-ацетилманнозамина (ManNAc). Наличие N-ацетилглюкозамин-6-фосфат-эпимеразной активности и N-ацетилманнозамин-6-фосфат-фосфатазной активности обеспечивает дополнительный или альтернативный путь предоставления ManNAc для получения Neu5Ac.In a further and/or alternative embodiment, the genetically modified microbial cell has N-acetylglucosamine 6-phosphate epimerase activity and N-acetylmannosamine 6-phosphate phosphatase activity. N-Acetylglucosamine 6-phosphate epimerase catalyzes the conversion of N-acetylglucosamine 6-phosphate (GlcNAc-6P) to N-acetylmannosamine 6-phosphate (ManNAc-6P), while N-acetylmannosamine 6-phosphate phosphatase dephosphorylates ManNAc-6P to produce N-acetylmannosamine (ManNAc). The presence of N-acetylglucosamine 6-phosphate epimerase activity and N-acetylmannosamine 6-phosphate phosphatase activity provides an additional or alternative route for providing ManNAc to produce Neu5Ac.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка содержит N-ацетилглюкозамин-6-фосфат-эпимеразу. Примером подходящей N-ацетилглюкозамин-6-фосфат-эпимеразы является NanE из Е. coli (UniProtKB - Р0А761, SEQ ID NO 87), которая кодируется геном nanE Е. coli (SEQ ID NO 88).In a further and/or alternative embodiment, the genetically modified microbial cell contains N-acetylglucosamine 6-phosphate epimerase. An example of a suitable N-acetylglucosamine-6-phosphate epimerase is NanE from E. coli (UniProtKB - P0A761, SEQ ID NO 87), which is encoded by the E. coli nanE gene (SEQ ID NO 88).
Таким образом, в дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка включает молекулу нуклеиновой кислоты, содержащую и экспрессирующую нуклеотидную последовательность, кодирующую N-ацетилглюкозамин-6-фосфат-эпимеразу, предпочтительно нуклеотидную последовательность, кодирующую NanE из Е. coilThus, in a further and/or alternative embodiment, the genetically modified microbial cell comprises a nucleic acid molecule comprising and expressing a nucleotide sequence encoding N-acetylglucosamine 6-phosphate epimerase, preferably a nucleotide sequence encoding NanE from E. coli
Таким образом, в дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка включает молекулу нуклеиновой кислоты, содержащую и экспрессирующую нуклеотидную последовательность, выбранную из группы, состоящей из:Thus, in a further and/or alternative embodiment, the genetically modified microbial cell includes a nucleic acid molecule containing and expressing a nucleotide sequence selected from the group consisting of:
1) нуклеотидных последовательностей, кодирующих полипептид, представленный в SEQ ID NO 87;1) nucleotide sequences encoding the polypeptide presented in SEQ ID NO 87;
2) нуклеотидных последовательностей, представленных в SEQ ID NO 88;2) nucleotide sequences presented in SEQ ID NO 88;
3) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с нуклеотидными последовательностями, кодирующими полипептид, представленный в SEQ ID NO 87;3) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99%, or greater than 99% with the nucleotide sequences encoding the polypeptide presented in SEQ ID NO 87;
4) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с одной из нуклеотидных последовательностей, представленных в SEQ ID NO 88;4) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99%, or greater than 99% with one of the nucleotide sequences presented in SEQ ID NO 88;
5) нуклеотидных последовательностей, комплементарных любой из нуклеотидных последовательностей из (1), (2), (3) и (4); и5) nucleotide sequences complementary to any of the nucleotide sequences from (1), (2), (3) and (4); And
6) фрагментов любой из нуклеотидных последовательностей из (1), (2), (3), (4) и (5);6) fragments of any of the nucleotide sequences from (1), (2), (3), (4) and (5);
при этом указанная нуклеотидная последовательность функционально связана по меньшей мере с одной последовательностью контроля экспрессии нуклеиновой кислоты, воздействуя на транскрипцию и/или трансляцию указанной нуклеотидной последовательности в генетически модифицированной микробной клетке, для обеспечения внутриклеточной активности N-ацетилглюкозамин-6-фосфат-эпимеразы.wherein said nucleotide sequence is operably linked to at least one nucleic acid expression control sequence, affecting the transcription and/or translation of said nucleotide sequence in a genetically modified microbial cell to ensure intracellular activity of N-acetylglucosamine-6-phosphate epimerase.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка содержит N-ацетилманнозамин-6-фосфат-фосфатазу.In a further and/or alternative embodiment, the genetically modified microbial cell contains N-acetylmannosamine-6-phosphate phosphatase.
Таким образом, в дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка включает молекулу нуклеиновой кислоты, содержащую и экспрессирующую нуклеотидную последовательность, кодирующую N-ацетилманнозамин-6-фосфат-фосфатазу.Thus, in an additional and/or alternative embodiment, the genetically modified microbial cell includes a nucleic acid molecule containing and expressing a nucleotide sequence encoding N-acetylmannosamine 6-phosphate phosphatase.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка содержит активность синтазы сиаловых кислот. Синтаза сиаловых кислот катализирует реакцию конденсации ManNAc и фосфоенолпирувата (ФЕП) с образованием N-ацетилнейраминовой кислоты (NeuNAc).In a further and/or alternative embodiment, the genetically modified microbial cell contains sialic acid synthase activity. Sialic acid synthase catalyzes the condensation reaction of ManNAc and phosphoenolpyruvate (PEP) to form N-acetylneuraminic acid (NeuNAc).
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка содержит синтазу сиаловых кислот или ее функциональный вариант, предпочтительно гетерологичную синтазу сиаловых кислот. Известны примеры синтаз сиаловых кислот из бактерий различных видов, таких как Campylobacter jejuni, Streptococcus agalactiae, Butyrivibrio proteociasticus, Methanobrevibacter ruminatium, Acetobacterium woodii, Desulfobacula toluolica, Escherichia coli, Prevotella nigescens, Halorhabdus tiamatea, Desulfotignum phosphitoxidans или Candidates Scalindua sp., Idomarina loihiensis, Fusobacterium nucleatum или Neisseria meningitidis. Предпочтительно, синтазой сиаловых кислот является синтаза N-ацетилнейраминовой кислоты NeuB из С. jejuni (SEQ ID NO 89), которая кодируется геном neuB С. jejuni (SEQ ID NO 90).In a further and/or alternative embodiment, the genetically modified microbial cell comprises a sialic acid synthase or a functional variant thereof, preferably a heterologous sialic acid synthase. There are known examples of sialic acid synthases from bacteria of various species, such as Campylobacter jejuni, Streptococcus agalactiae, Butyrivibrio proteociasticus, Methanobrevibacter ruminatium, Acetobacterium woodii, Desulfobacula toluolica, Escherichia coli, Prevotella nigescens, Halorhabdus tiamatea, Desulfotignum phosphitoxidans or Candi dates Scalindua sp., Idomarina loihiensis, Fusobacterium nucleatum or Neisseria meningitidis. Preferably, the sialic acid synthase is N-acetylneuraminic acid synthase NeuB from C. jejuni (SEQ ID NO 89), which is encoded by the C. jejuni neuB gene (SEQ ID NO 90).
Таким образом, в дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка включает молекулу нуклеиновой кислоты, содержащую и экспрессирующую нуклеотидную последовательность, выбранную из группы, состоящей из:Thus, in a further and/or alternative embodiment, the genetically modified microbial cell includes a nucleic acid molecule containing and expressing a nucleotide sequence selected from the group consisting of:
1) нуклеотидных последовательностей, кодирующих полипептид, представленный в SEQ ID NO 89;1) nucleotide sequences encoding the polypeptide presented in SEQ ID NO 89;
2) нуклеотидных последовательностей, представленных в SEQ ID NO 90;2) nucleotide sequences presented in SEQ ID NO 90;
3) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с нуклеотидными последовательностями, кодирующими полипептид, представленный в SEQ ID NO 89;3) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99%, or greater than 99% with the nucleotide sequences encoding the polypeptide presented in SEQ ID NO 89;
4) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с одной из нуклеотидных последовательностей, представленных в SEQ ID NO 90;4) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99%, or greater than 99% with one of the nucleotide sequences presented in SEQ ID NO 90;
5) нуклеотидных последовательностей, комплементарных любой из нуклеотидных последовательностей из (1), (2), (3) и (4); и5) nucleotide sequences complementary to any of the nucleotide sequences from (1), (2), (3) and (4); And
6) фрагментов любой из нуклеотидных последовательностей из (1), (2), (3), (4) и (5);6) fragments of any of the nucleotide sequences from (1), (2), (3), (4) and (5);
при этом указанная нуклеотидная последовательность функционально связана по меньшей мере с одной последовательностью контроля экспрессии нуклеиновой кислоты, воздействуя на транскрипцию и/или трансляцию указанной нуклеотидной последовательности в генетически модифицированной микробной клетке, для обеспечения внутриклеточной активности синтазы N-ацетилнейраминовой кислоты.wherein said nucleotide sequence is operably linked to at least one nucleic acid expression control sequence, affecting the transcription and/or translation of said nucleotide sequence in a genetically modified microbial cell to ensure intracellular activity of N-acetylneuraminic acid synthase.
Генетически модифицированная микробная клетка обладает активностью синтетазы цитидин-5'-монофосфо-(ЦМФ)-N-ацетилнейраминовой кислоты для переноса цитидин-5'-монофосфата на N-ацетилнейраминовую кислоту с образованием ЦМФ-активированной N-ацетилнейраминовой кислоты (ЦМФ-NeuNAc). В данной области техники известно и описано несколько синтетаз 5'-монофосфо-(ЦМФ)-сиаловой кислоты, например, синтетазы 5'-монофосфо-(ЦМФ)-сиаловой кислоты из Е. coli, Neisseria meningitidis, Campylobacter jejuni, Streptococcus sp. и так далее.The genetically modified microbial cell has cytidine 5'-monophospho-(CMP)-N-acetylneuraminic acid synthetase activity to transfer cytidine 5'-monophosphate to N-acetylneuraminic acid to form CMP-activated N-acetylneuraminic acid (CMP-NeuNAc). Several 5'-monophospho-(CMP)-sialic acid synthetases are known and described in the art, for example, 5'-monophospho-(CMP)-sialic acid synthetases from E. coli, Neisseria meningitidis, Campylobacter jejuni, Streptococcus sp. and so on.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка содержит синтетазу цитидин-5'-монофосфо-(ЦМФ)-N-ацетилнейраминовой кислоты, предпочтительно гетерологичную синтетазу цитидин-5'-монофосфо-(ЦМФ)-N-ацетилнейраминовой кислоты, более предпочтительно N-ацетилнейраминат-цитидилтрансферазу NeuA из Е. coli. NeuA из Е. coli (UnitProtKB - Р13266; SEQ ID NO 91) кодируется геном neuA E. coli (SEQ ID NO 92).In a further and/or alternative embodiment, the genetically modified microbial cell comprises cytidine-5'-monophospho-(CMP)-N-acetylneuraminic acid synthetase, preferably a heterologous cytidine-5'-monophospho-(CMP)-N-acetylneuraminic acid synthetase, more preferably N-acetylneuraminate cytidyltransferase NeuA from E. coli. NeuA from E. coli (UnitProtKB - P13266; SEQ ID NO 91) is encoded by the E. coli neuA gene (SEQ ID NO 92).
Таким образом, в дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка включает молекулу нуклеиновой кислоты, содержащую и экспрессирующую нуклеотидную последовательность, выбранную из группы, состоящей из:Thus, in a further and/or alternative embodiment, the genetically modified microbial cell includes a nucleic acid molecule containing and expressing a nucleotide sequence selected from the group consisting of:
1) нуклеотидных последовательностей, кодирующих полипептид, представленный в SEQ ID NO 91;1) nucleotide sequences encoding the polypeptide presented in SEQ ID NO 91;
2) нуклеотидных последовательностей, представленных в SEQ ID NO 92;2) nucleotide sequences presented in SEQ ID NO 92;
3) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с нуклеотидными последовательностями, кодирующими полипептид, представленный в SEQ ID NO 91;3) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99%, or greater than 99% with the nucleotide sequences encoding the polypeptide presented in SEQ ID NO 91;
4) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с одной из нуклеотидных последовательностей, представленных в SEQ ID NO 92;4) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99%, or greater than 99% with one of the nucleotide sequences presented in SEQ ID NO 92;
5) нуклеотидных последовательностей, комплементарных любой из нуклеотидных последовательностей из (1), (2), (3) и (4); и5) nucleotide sequences complementary to any of the nucleotide sequences from (1), (2), (3) and (4); And
6) фрагментов любой из нуклеотидных последовательностей из (1), (2), (3), (4) и (5);6) fragments of any of the nucleotide sequences from (1), (2), (3), (4) and (5);
при этом указанная нуклеотидная последовательность функционально связана по меньшей мере с одной последовательностью контроля экспрессии нуклеиновой кислоты, воздействуя на транскрипцию и/или трансляцию указанной нуклеотидной последовательности в генетически модифицированной микробной клетке, для обеспечения активности N-ацетилнейраминат-цитидилтрансферазы.wherein said nucleotide sequence is operably linked to at least one nucleic acid expression control sequence, affecting the transcription and/or translation of said nucleotide sequence in the genetically modified microbial cell to ensure N-acetylneuraminate cytidyl transferase activity.
Генетически модифицированная микробная клетка обладает сиалилтрансферазной активностью, предпочтительно активностью гетерологичной сиалилтрансферазы и более предпочтительно сиалилтрансферазной активностью, выбранной из группы, состоящей из α-2,3-сиалилтрансферазной активности, α-2,6-сиалилтрансферазной активности и/или α-2,8-сиалилтрансферазной активности. В результате проявления сиалилтрансферазной активности возможно осуществление переноса группировки N-ацетилнейраминовой кислоты с ЦМФ-NeuNAc на акцепторную молекулу, причем указанной акцепторной молекулой является молекула сахарида, с получением сиалилированного сахарида.The genetically modified microbial cell has a sialyltransferase activity, preferably a heterologous sialyltransferase activity and more preferably a sialyltransferase activity selected from the group consisting of α-2,3-sialyltransferase activity, α-2,6-sialyltransferase activity and/or α-2,8- sialyltransferase activity. As a result of the manifestation of sialyltransferase activity, it is possible to transfer the N-acetylneuraminic acid group from CMP-NeuNAc to an acceptor molecule, wherein said acceptor molecule is a saccharide molecule, producing a sialylated saccharide.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка содержит по меньшей мере одну сиалилтрансферазу, предпочтительно по меньшей мере одну гетерологичную сиалилтрансферазу, при этом указанная сиалилтрансфераза может обладать α-2,3-сиалилтрансферазной активностью, и/или α-2,6-сиалилтрансферазной активностью, и/или α-2,8-сиалилтрансферазной активностью для переноса группировки NeuNAc с ЦМФ-NeuNAc в качестве донорного субстрата на акцепторный сахарид.In a further and/or alternative embodiment, the genetically modified microbial cell contains at least one sialyltransferase, preferably at least one heterologous sialyltransferase, which sialyltransferase may have α-2,3-sialyltransferase activity, and/or α-2,6- sialyltransferase activity, and/or α-2,8-sialyltransferase activity to transfer the NeuNAc moiety from CMP-NeuNAc as a donor substrate to an acceptor saccharide.
Термин "сиалилтрансфераза", использованный в данном описании, относится к полипептидам, которые могут обладать сиалилтрансферазной активностью. "Сиалилтрансферазная активность" относится к переносу остатка сиаловой кислоты, предпочтительно остатка N-ацетилнейраминовой кислоты (Neu5Ac), с донорного субстрата на акцепторную молекулу. Термин "сиалилтрансфераза" включает в себя функциональные фрагменты сиалилтрансфераз, описанных в данной заявке, функциональные варианты сиалилтрансфераз, описанных в данной заявке, и функциональные фрагменты функциональных вариантов. "Функциональный" в этом отношении означает, что фрагменты и/или варианты могут обладать сиалилтрансферазной активностью. Функциональные фрагменты сиалилтрансферазы охватывают укороченные версии сиалилтрансферазы, которая кодируется своим встречающимся в природе геном, при этом такая укороченная версия может обладать сиалилтрансферазной активностью. Примерами укороченных версий являются сиалилтрансферазы, не содержащие так называемой лидерной последовательности, которая обычно служит для придания полипептиду конкретной внутриклеточной локализации. В типичном случае такие лидерные последовательности удаляются из полипептида во время его внутриклеточного транспорта, и они также отсутствуют во встречающейся в природе зрелой сиалилтрансферазе.The term "sialyltransferase" as used herein refers to polypeptides that may have sialyltransferase activity. "Sialyltransferase activity" refers to the transfer of a sialic acid residue, preferably an N-acetylneuraminic acid residue (Neu5Ac), from a donor substrate to an acceptor molecule. The term "sialyltransferase" includes functional fragments of the sialyltransferases described in this application, functional variants of the sialyltransferases described in this application, and functional fragments of functional variants. "Functional" in this regard means that the fragments and/or variants may have sialyltransferase activity. Functional sialyltransferase fragments comprise truncated versions of the sialyltransferase that is encoded by its naturally occurring gene, which truncated version may have sialyltransferase activity. Examples of shortened versions are sialyltransferases that do not contain the so-called leader sequence, which usually serves to give the polypeptide a specific intracellular localization. Typically, such leader sequences are removed from the polypeptide during intracellular transport, and they are also absent from naturally occurring mature sialyltransferase.
Гетерологичная сиалилтрансфераза способна осуществлять перенос остатка сиаловой кислоты с донорного субстрата на акцепторную молекулу. Термин "способный осуществлять" применительно к гетерологичной сиалилтрансферазе относится к сиалилтрансферазной активности гетерологичной сиалилтрансферазы и к тому положению, что для проявления гетерологичной сиалилтрансферазой своей ферментативной активности необходимы подходящие условия реакции. В отсутствие подходящих условий реакции гетерологичная сиалилтрансфераза не обладает своей ферментативной активностью, однако сохраняет свою ферментативную активность и обладает своей ферментативной активностью при возвращении к подходящим условиям реакции. Подходящие условия реакции включают наличие подходящего донорного субстрата, наличие подходящих акцепторных молекул, наличие важных кофакторов, таких как, например, одновалентные или двухвалентные ионы, значение рН в соответствующем диапазоне, подходящая температура и тому подобное. Нет необходимости в том, чтобы соблюдались оптимальные значения для каждого отдельного фактора, влияющего на ферментативную реакцию с участием гетерологичной сиалилтрансферазы, однако условия реакции должны быть такими, чтобы гетерологичная сиалилтрансфераза проявляла свою ферментативную активность. Соответственно, термин "способный осуществлять" исключает любые условия, в которых ферментативная активность гетерологичной сиалилтрансферазы была бы необратимо нарушена, и также исключает воздействие на гетерологичную сиалилтрансферазу любого такого условия. Напротив, "способный осуществлять" означает, что сиалилтрансфераза является ферментативно активной, т.е. обладает своей сиалилтрансферазной активностью, если для сиалилтрансферазы предусмотрены разрешающие условия реакции (включающие все требования, необходимые для осуществления сиалилтрансферазой своей ферментативной активности).Heterologous sialyltransferase is capable of transferring a sialic acid residue from a donor substrate to an acceptor molecule. The term "capable of performing" in relation to a heterologous sialyltransferase refers to the sialyltransferase activity of the heterologous sialyltransferase and the concept that suitable reaction conditions are required for the heterologous sialyltransferase to exhibit its enzymatic activity. In the absence of suitable reaction conditions, the heterologous sialyltransferase does not have its enzymatic activity, but retains its enzymatic activity and has its enzymatic activity when returned to suitable reaction conditions. Suitable reaction conditions include the presence of a suitable donor substrate, the presence of suitable acceptor molecules, the presence of important cofactors such as, for example, monovalent or divalent ions, a pH value in the appropriate range, a suitable temperature, and the like. It is not necessary that optimal values be observed for each individual factor affecting the enzymatic reaction involving the heterologous sialyltransferase, but the reaction conditions must be such that the heterologous sialyltransferase exhibits its enzymatic activity. Accordingly, the term “capable of performing” excludes any conditions under which the enzymatic activity of the heterologous sialyltransferase would be irreversibly impaired, and also excludes exposure of the heterologous sialyltransferase to any such condition. In contrast, “capable of performing” means that the sialyltransferase is enzymatically active, i.e. possesses its sialyltransferase activity if the permissive reaction conditions are provided for the sialyltransferase (including all the requirements necessary for the sialyltransferase to carry out its enzymatic activity).
Сиалилтрансферазы можно различить по типу связи с сахаром, которую они образуют. Использованные в данном описании термины "α-2,3-сиалилтрансфераза" и "α-2,3-сиалилтрансферазная активность" относятся к полипептидам и их ферментативной активности, которые катализируют добавление к галактозе, N-ацетилгалактозамину либо к остатку галактозы или N-ацетилгалактозамина, в качестве акцепторной молекулы, остатка сиаловой кислоты с образованием α-2,3-связи. Аналогичным образом, термины "α-2,6-сиалилтрансфераза" и "α-2,6-сиалилтрансферазная активность" относятся к полипептидам и их ферментативной активности, которые катализируют добавление к галактозе, N-ацетил галактоза ми ну либо к остатку галактозы или N-ацетилгалактозамина, в качестве акцепторной молекулы, остатка сиаловой кислоты с образованием α-2,6-связи. Аналогичным образом, термины "α-2,8-сиалилтрансфераза" и "α-2,8-сиалилтрансферазная активность" относятся к полипептидам и их ферментативной активности, которые катализируют добавление к галактозе, N-ацетилгалактозамину либо к остатку галактозы или N-ацетилгалактозамина, в качестве акцепторной молекулы, остатка сиаловой кислоты с образованием α-2,8-связи.Sialyltransferases can be distinguished by the type of sugar bond they form. As used herein, the terms "α-2,3-sialyltransferase" and "α-2,3-sialyltransferase activity" refer to polypeptides and their enzymatic activity that catalyze addition to galactose, N-acetylgalactosamine, or a galactose or N-acetylgalactosamine residue. , as an acceptor molecule, a sialic acid residue to form an α-2,3 bond. Likewise, the terms "α-2,6-sialyltransferase" and "α-2,6-sialyltransferase activity" refer to polypeptides and their enzymatic activities that catalyze the addition of galactose, N-acetyl galactose or a galactose moiety. -acetylgalactosamine, as an acceptor molecule, a sialic acid residue to form an α-2,6 bond. Likewise, the terms "α-2,8-sialyltransferase" and "α-2,8-sialyltransferase activity" refer to polypeptides and their enzymatic activity that catalyze addition to galactose, N-acetylgalactosamine, or a galactose or N-acetylgalactosamine residue, as an acceptor molecule, a sialic acid residue to form an α-2,8 bond.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка содержит гетерологичную сиалилтрансферазу, которая предпочтительно выбрана из группы, состоящей из:In an additional and/or alternative embodiment, the genetically modified microbial cell contains a heterologous sialyltransferase, which is preferably selected from the group consisting of:
1) полипептидов, содержащих аминокислотную последовательность или состоящих из аминокислотной последовательности, которая представлена в любой из SEQ ID NO: 1-33;1) polypeptides containing an amino acid sequence or consisting of an amino acid sequence that is presented in any of SEQ ID NO: 1-33;
2) полипептидов, содержащих аминокислотную последовательность или состоящих из аминокислотной последовательности, которая имеет идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с любой из аминокислотных последовательностей, которые представлены в любой из SEQ ID NO: 1-33; и2) polypeptides containing an amino acid sequence or consisting of an amino acid sequence that has at least 80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% sequence identity with any of the amino acid sequences, which are presented in any of SEQ ID NO: 1-33; And
3) фрагментов любого из полипептидов из (1) и (2).3) fragments of any of the polypeptides from (1) and (2).
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка трансформирована для того, чтобы содержать молекулу нуклеиновой кислоты, которая включает и экспрессирует нуклеотидную последовательность, кодирующую гетерологичную сиалилтрансферазу. Предпочтительно, нуклеотидную последовательность, которая может быть выбрана из Таблицы 1. В дополнительном и/или альтернативном воплощении нуклеотидная последовательность выбрана из группы, состоящей из:In a further and/or alternative embodiment, the genetically modified microbial cell is transformed to contain a nucleic acid molecule that includes and expresses a nucleotide sequence encoding a heterologous sialyltransferase. Preferably, a nucleotide sequence which may be selected from Table 1. In an additional and/or alternative embodiment, the nucleotide sequence is selected from the group consisting of:
1) нуклеотидных последовательностей, кодирующих полипептид, который представлен в любой из SEQ ID NO: 1-33;1) nucleotide sequences encoding a polypeptide that is presented in any of SEQ ID NO: 1-33;
2) нуклеотидных последовательностей, которые представлены в любой из SEQ ID NO: 34-66;2) nucleotide sequences that are presented in any of SEQ ID NO: 34-66;
3) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с одной из нуклеотидных последовательностей, кодирующих полипептид, который представлен в любой из SEQ ID NO: 1-33;3) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% with one of the nucleotide sequences encoding a polypeptide that is presented in any of SEQ ID NO: 1-33;
4) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с любой из нуклеотидных последовательностей, представленных в SEQ ID NO: 34-66;4) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% with any of the nucleotide sequences presented in SEQ ID NO: 34-66 ;
5) нуклеотидных последовательностей, комплементарных любой из нуклеотидных последовательностей из (1), (2), (3) и (4); и5) nucleotide sequences complementary to any of the nucleotide sequences from (1), (2), (3) and (4); And
6) фрагментов любой из нуклеотидных последовательностей из (1), (2), (3), (4) и (5);6) fragments of any of the nucleotide sequences from (1), (2), (3), (4) and (5);
при этом указанная нуклеотидная последовательность функционально связана по меньшей мере с одной последовательностью контроля экспрессии нуклеиновой кислоты, воздействуя на транскрипцию и/или трансляцию указанной нуклеотидной последовательности в генетически модифицированной микробной клетке, для обеспечения проявления сиалилтрансферазной активности.wherein said nucleotide sequence is operably linked to at least one nucleic acid expression control sequence, affecting the transcription and/or translation of said nucleotide sequence in the genetically modified microbial cell to ensure the manifestation of sialyltransferase activity.
Выражение "любая из SEQ ID NO: 1-33" относится к "любой SEQ ID NO из группы, состоящей из SEQ ID NO 1, SEQ ID NO 2, SEQ ID NO 3, SEQ ID NO 4, SEQ ID NO 5, SEQ ID NO 6, SEQ ID NO 7, SEQ ID NO 8, SEQ ID NO 9, SEQ ID NO 10, SEQ ID NO 11, SEQ ID NO 12, SEQ ID NO 13. SEQ ID NO 14. SEQ ID NO 15, SEQ ID NO 16, SEQ ID NO 17, SEQ ID NO 18, SEQ ID NO 19, SEQ ID NO 20, SEQ ID NO 21, SEQ ID NO 22, SEQ ID NO 23, SEQ ID NO 24, SEQ ID NO 25, SEQ ID NO 26, SEQ ID NO 27, SEQ ID NO 28, SEQ ID NO 29, SEQ ID NO 30, SEQ ID NO 31, SEQ ID NO 32 и SEQ ID NO 33. Тот же принцип применим к выражению "любая из SEQ ID NO: 34-66". Вообще говоря, выражение "любая из SEQ ID NO: X-Z", где "X" и "Z" представляют собой натуральные числа, относится ко всем последовательностям (нуклеотидным последовательностям или аминокислотным последовательностям), представленным в любой из "SEQ ID NO", содержащих идентификационный номер от X до Z.The expression "any of SEQ ID NO: 1-33" refers to "any SEQ ID NO from the group consisting of SEQ ID NO 1, SEQ ID NO 2, SEQ ID NO 3, SEQ ID NO 4, SEQ ID NO 5, SEQ ID NO 6, SEQ ID NO 7, SEQ ID NO 8, SEQ ID NO 9, SEQ ID NO 10, SEQ ID NO 11, SEQ ID NO 12, SEQ ID NO 13. SEQ ID NO 14. SEQ ID NO 15, SEQ ID NO 16, SEQ ID NO 17, SEQ ID NO 18, SEQ ID NO 19, SEQ ID NO 20, SEQ ID NO 21, SEQ ID NO 22, SEQ ID NO 23, SEQ ID NO 24, SEQ ID NO 25, SEQ ID NO 26, SEQ ID NO 27, SEQ ID NO 28, SEQ ID NO 29, SEQ ID NO 30, SEQ ID NO 31, SEQ ID NO 32 and SEQ ID NO 33. The same principle applies to the expression "any of SEQ ID NO: 34-66". Generally speaking, the expression "any of SEQ ID NO: X-Z", where "X" and "Z" are natural numbers, refers to all sequences (nucleotide sequences or amino acid sequences) present in any of "SEQ ID NO" containing the identification number from X to Z.
Помимо этого, генетически модифицированная микробная клетка генетически модифицирована с возможностью экспрессировать нуклеотидную последовательность, кодирующую гетерологичную сиалилтрансферазу. Для этого нуклеотидная последовательность, кодирующая гетерологичную сиалилтрансферазу, функционально связана по меньшей мере с одной последовательностью контроля экспрессии, воздействуя на транскрипцию и/или трансляцию указанной нуклеотидной последовательности, кодирующей гетерологичную сиалилтрансферазу в генетически модифицированной клетке.In addition, the genetically modified microbial cell is genetically modified to express a nucleotide sequence encoding a heterologous sialyltransferase. To do this, a nucleotide sequence encoding a heterologous sialyltransferase is operably linked to at least one expression control sequence, affecting the transcription and/or translation of said nucleotide sequence encoding a heterologous sialyltransferase in a genetically modified cell.
Термин "функционально связанный", использованный в данном описании, относится к функциональной связи между нуклеотидной последовательностью, кодирующей гетерологичную сиалилтрансферазу, и второй нуклеотидной последовательностью - последовательностью контроля экспрессии нуклеиновой кислоты (такой как промотор, оператор, энхансер, регулятор, ряд сайтов связывания транскрипционных факторов, терминатор транскрипции, сайт связывания рибосомы), причем данная последовательность контроля экспрессии оказывает влияние на транскрипцию и/или трансляцию нуклеиновой кислоты, соответствующей нуклеотидной последовательности, кодирующей гетерологичную сиалилтрансферазу. Соответственно, термин "промотор" означает последовательности ДНК, которые обычно "предшествуют" гену" в полимерной молекуле ДНК и предоставляют сайт инициации транскрипции в матричной РНК (мРНК). "Регуляторные" последовательности ДНК, обычно также расположенные "вверх по течению от" гена (т.е. предшествующие гену) в указанной полимерной молекуле ДНК, связывают белки, которые определяют частоту (или степень) инициации транскрипции. Совместно именуемые как "промоторная/регуляторная" или "контрольная" последовательность ДНК, эти последовательности, "предшествующие" выбранному гену (или серии генов) в функциональной полимерной молекуле ДНК, действуют совместно в отношении определения того, будет ли происходить транскрипция (и возможно экспрессия) гена. Последовательности ДНК, которые "следуют" за геном в полимерной молекуле ДНК и обеспечивают наличие сигнала терминации транскрипции в мРНК, именуются как последовательности "терминатора" транскрипции.The term "operably linked" as used herein refers to a functional relationship between a nucleotide sequence encoding a heterologous sialyltransferase and a second nucleotide sequence, a nucleic acid expression control sequence (such as a promoter, operator, enhancer, regulator, a number of transcription factor binding sites, transcription terminator, ribosome binding site), and this expression control sequence affects the transcription and/or translation of the nucleic acid corresponding to the nucleotide sequence encoding the heterologous sialyltransferase. Accordingly, the term "promoter" refers to DNA sequences that typically "precede" a gene" in a polymeric DNA molecule and provide a transcription initiation site in messenger RNA (mRNA). "Regulatory" DNA sequences are also typically located "upstream" of a gene ( i.e., preceding the gene) in a specified polymeric DNA molecule, bind proteins that determine the frequency (or extent) of transcription initiation. Collectively referred to as the “promoter/regulator” or “control” DNA sequence, these sequences “precede” the selected gene (. or series of genes) in a functional DNA polymer molecule, act together to determine whether transcription (and possibly expression) of a gene will occur. DNA sequences that “follow” the gene in the DNA polymer molecule and provide a transcription termination signal in the mRNA, are referred to as transcription "terminator" sequences.
В дополнительном и/или альтернативном воплощении гетерологичная сиалилтрансфераза, которая может обладать α-2,3-сиалилтрансферазной активностью, выбрана из группы, состоящей из:In a further and/or alternative embodiment, the heterologous sialyltransferase, which may have α-2,3-sialyltransferase activity, is selected from the group consisting of:
1) полипептидов, содержащих аминокислотную последовательность или состоящих из аминокислотной последовательности, представленной в любой из SEQ ID NO: 1-27;1) polypeptides containing an amino acid sequence or consisting of an amino acid sequence presented in any of SEQ ID NO: 1-27;
2) полипептидов, содержащих аминокислотную последовательность или состоящих из аминокислотной последовательности, которая имеет идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с любой из аминокислотных последовательностей, которые представлены в любой из SEQ ID NO: 1-27; и2) polypeptides containing an amino acid sequence or consisting of an amino acid sequence that has at least 80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% sequence identity with any of the amino acid sequences, which are presented in any of SEQ ID NO: 1-27; And
3) фрагментов любого из полипептидов из (1) и (2).3) fragments of any of the polypeptides from (1) and (2).
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка включает молекулу нуклеиновой кислоты, которая содержит по меньшей мере одну нуклеотидную последовательность, кодирующую указанную гетерологичную сиалилтрансферазу, которая может обладать а-2,3-сиалилтрансферазной активностью, при этом указанная по меньшей мере одна нуклеотидная последовательность выбрана из группы, состоящей из:In an additional and/or alternative embodiment, the genetically modified microbial cell includes a nucleic acid molecule that contains at least one nucleotide sequence encoding said heterologous sialyltransferase, which may have α-2,3-sialyltransferase activity, wherein said at least one nucleotide sequence the sequence is selected from the group consisting of:
1) нуклеотидных последовательностей, кодирующих полипептид, представленный в любой из SEQ ID NO: 1-27;1) nucleotide sequences encoding a polypeptide presented in any of SEQ ID NO: 1-27;
2) нуклеотидных последовательностей, которые представлены в любой из SEQ ID NO: 34-60;2) nucleotide sequences that are presented in any of SEQ ID NO: 34-60;
3) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с одной из нуклеотидных последовательностей, кодирующих полипептид, который представлен в любой из SEQ ID NO: 1-27;3) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% with one of the nucleotide sequences encoding a polypeptide that is presented in any of SEQ ID NO: 1-27;
4) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с любой из нуклеотидных последовательностей, представленных в SEQ ID NO: 34-60;4) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% with any of the nucleotide sequences presented in SEQ ID NO: 34-60 ;
5) нуклеотидных последовательностей, комплементарных любой из нуклеотидных последовательностей из (1), (2), (3) и (4); и5) nucleotide sequences complementary to any of the nucleotide sequences from (1), (2), (3) and (4); And
6) фрагментов любой из нуклеотидных последовательностей из (1), (2), (3), (4) и (5);6) fragments of any of the nucleotide sequences from (1), (2), (3), (4) and (5);
при этом указанная нуклеотидная последовательность функционально связана по меньшей мере с одной последовательностью контроля экспрессии нуклеиновой кислоты, воздействуя на транскрипцию и/или трансляцию указанной нуклеотидной последовательности в генетически модифицированной клетке, для обеспечения проявления α-2,3-сиалилтрансферазной активности.wherein said nucleotide sequence is operably linked to at least one nucleic acid expression control sequence, affecting the transcription and/or translation of said nucleotide sequence in the genetically modified cell to ensure the manifestation of α-2,3-sialyltransferase activity.
В дополнительном и/или альтернативном воплощении гетерологичная сиалилтрансфераза, которая может обладать α-2,3-сиалилтрансферазной активностью, имеет относительную эффективность по меньшей мере в 100 раз, по меньшей мере в 200 раз, по меньшей мере в 300 раз, по меньшей мере в 1000 раз, по меньшей мере в 10000 раз превышающую относительную эффективность сиалилтрансферазы, представленной в SEQ ID NO 27, по данным количественного анализа сиалилирования лакто-N-тетраозы (LNT) с применением жидкостной хроматографии в сочетании стандемной масс-спектрометрией (LC-MS/MS).In a further and/or alternative embodiment, the heterologous sialyltransferase, which may have α-2,3-sialyltransferase activity, has a relative potency of at least 100-fold, at least 200-fold, at least 300-fold, at least 1000 times, at least 10,000 times the relative efficiency of the sialyltransferase shown in SEQ ID NO 27, as determined by quantitative analysis of sialylation of lacto-N-tetraose (LNT) using liquid chromatography coupled to standard mass spectrometry (LC-MS/MS ).
В другом воплощении гетерологичная сиалилтрансфераза может обладать α-2,6-сиалилтрансферазной активностью.In another embodiment, the heterologous sialyltransferase may have α-2,6-sialyltransferase activity.
В дополнительном воплощении гетерологичная сиалилтрансфераза, которая может обладать α-2,6-сиалилтрансферазной активностью, выбрана из группы, состоящей из:In a further embodiment, the heterologous sialyltransferase, which may have α-2,6-sialyltransferase activity, is selected from the group consisting of:
1) полипептидов, содержащих аминокислотную последовательность или состоящих из аминокислотной последовательности, представленной в любой из SEQ ID NO: 28-33;1) polypeptides containing an amino acid sequence or consisting of an amino acid sequence presented in any of SEQ ID NO: 28-33;
2) полипептидов, содержащих аминокислотную последовательность или состоящих из аминокислотной последовательности, которая имеет идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с любой из аминокислотных последовательностей, которые представлены в любой из SEQ ID NO: 28-33; и2) polypeptides containing an amino acid sequence or consisting of an amino acid sequence that has at least 80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% sequence identity with any of the amino acid sequences, which are presented in any of SEQ ID NO: 28-33; And
3) фрагментов любого из полипептидов из (1) и (2).3) fragments of any of the polypeptides from (1) and (2).
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка включает молекулу нуклеиновой кислоты, которая содержит по меньшей мере одну нуклеотидную последовательность, кодирующую указанную гетерологичную сиалилтрансферазу, которая может обладать α-2,6-сиалилтрансферазной активностью, при этом указанная по меньшей мере одна нуклеотидная последовательность выбрана из группы, состоящей из:In an additional and/or alternative embodiment, the genetically modified microbial cell includes a nucleic acid molecule that contains at least one nucleotide sequence encoding said heterologous sialyltransferase, which may have α-2,6-sialyltransferase activity, wherein said at least one nucleotide sequence the sequence is selected from the group consisting of:
1) нуклеотидных последовательностей, кодирующих полипептид, представленный в любой из SEQ ID NO: 28-33;1) nucleotide sequences encoding a polypeptide presented in any of SEQ ID NO: 28-33;
2) нуклеотидных последовательностей, которые представлены в любой из SEQ ID NO: 61-66;2) nucleotide sequences that are presented in any of SEQ ID NO: 61-66;
3) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с одной из нуклеотидных последовательностей, кодирующих полипептид, который представлен в любой из SEQ ID NO: 28-33;3) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% with one of the nucleotide sequences encoding a polypeptide that is presented in any of SEQ ID NO: 28-33;
4) нуклеотидных последовательностей, имеющих идентичность последовательности по меньшей мере 80%, 90%, 95%, 96%, 97%, 98%, 99% или больше 99% с любой из нуклеотидных последовательностей, представленных в SEQ ID NO: 61-66;4) nucleotide sequences having sequence identity of at least 80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% with any of the nucleotide sequences presented in SEQ ID NO: 61-66 ;
5) нуклеотидных последовательностей, комплементарных любой из нуклеотидных последовательностей из (1), (2), (3) и (4); и5) nucleotide sequences complementary to any of the nucleotide sequences from (1), (2), (3) and (4); And
6) фрагментов любой из нуклеотидных последовательностей из (1), (2), (3), (4) и (5);6) fragments of any of the nucleotide sequences from (1), (2), (3), (4) and (5);
при этом указанная нуклеотидная последовательность функционально связана по меньшей мере с одной последовательностью контроля экспрессии нуклеиновой кислоты, воздействуя на транскрипцию и/или трансляцию указанной нуклеотидной последовательности в генетически модифицированной клетке, для обеспечения проявления α-2,6-сиалилтрансферазной активности.wherein said nucleotide sequence is operably linked to at least one nucleic acid expression control sequence, affecting the transcription and/or translation of said nucleotide sequence in the genetically modified cell to ensure the manifestation of α-2,6-sialyltransferase activity.
В дополнительном и/или альтернативном воплощении гетерологичная сиалилтрансфераза, которая может обладать α-2,6-сиалилтрансферазной активностью, имеет относительную эффективность по меньшей мере в 100 раз, более предпочтительно по меньшей мере в 200 раз, наиболее предпочтительно по меньшей мере в 300 раз превышающую относительную эффективность сиалилтрансферазы, представленной в SEQ ID NO 33, по данным количественного анализа сиалилирования LNT (от англ. lacto-N-neotetraose - лакто-N-неотетраоза).In a further and/or alternative embodiment, the heterologous sialyltransferase, which may have α-2,6-sialyltransferase activity, has a relative potency of at least 100 times, more preferably at least 200 times, most preferably at least 300 times greater the relative efficiency of the sialyltransferase presented in SEQ ID NO 33, according to quantitative analysis of sialylation of LNT (from the English lacto-N-neotetraose - lacto-N-neotetraose).
В дополнительном и/или альтернативном воплощении гетерологичная сиалилтрансфераза может обладать α-2,8-сиалилтрансферазной активностью. Примером гетерологичной сиалилтрансферазы, которая может обладать α-2,8-сиалилтрансферазной активностью, является сиалилтрансфераза CstlI из Campylobacter jejunii ОН4384.In a further and/or alternative embodiment, the heterologous sialyltransferase may have α-2,8-sialyltransferase activity. An example of a heterologous sialyltransferase that may have α-2,8-sialyltransferase activity is the sialyltransferase CstlI from Campylobacter jejunii OH4384.
Сиалилтрансфераза способна осуществлять перенос остатка сиаловой кислоты, например, остатка N-ацетилнейраминовой кислоты (NeuSAc), с донорного субстрата, например, ЦМФ-Neu5Ac, на акцепторную молекулу. Акцепторной молекулой является молекула сахарида, предпочтительно молекула сахарида, приведенного в Таблице 2.Sialyltransferase is capable of transferring a sialic acid residue, such as N-acetylneuraminic acid (NeuSAc), from a donor substrate, such as CMP-Neu5Ac, to an acceptor molecule. The acceptor molecule is a saccharide molecule, preferably a saccharide molecule listed in Table 2.
В дополнительном и/или альтернативном воплощении акцепторная молекула представляет собой моносахарид, предпочтительно моносахарид, выбранный из группы, состоящей из N-ацетилглюкозамина, галактозы и N-ацетилгалактозамина.In a further and/or alternative embodiment, the acceptor molecule is a monosaccharide, preferably a monosaccharide selected from the group consisting of N-acetylglucosamine, galactose and N-acetylgalactosamine.
В дополнительном и/или альтернативном воплощении акцепторная молекула представляет собой дисахарид, предпочтительно дисахарид, выбранный из группы, состоящей из лактозы, N-ацетиллактозамина, лакто-N-биозы, лактулозы и мелибиозы.In a further and/or alternative embodiment, the acceptor molecule is a disaccharide, preferably a disaccharide selected from the group consisting of lactose, N-acetyllactosamine, lacto-N-biose, lactulose and melibiose.
В дополнительном и/или альтернативном воплощении акцепторная молекула представляет собой трисахарид, предпочтительно трисахарид, выбранный из группы, состоящей из раффинозы, лакто-N-триозы II, 2'-фукозиллактозы, 3-фукозиллактозы, 3'-сиалиллактозы, 6'-сиалиллактозы, 3'-сиалил-N-ацетиллактозамина, 6'-сиалил-N-ацетиллактозамина, 3'-галактозиллактозы и 6'-галактозиллактозы.In an additional and/or alternative embodiment, the acceptor molecule is a trisaccharide, preferably a trisaccharide selected from the group consisting of raffinose, lacto-N-triose II, 2'-fucosyllactose, 3-fucosyllactose, 3'-sialyllactose, 6'-sialyllactose, 3'-sialyl-N-acetyllactosamine, 6'-sialyl-N-acetyllactosamine, 3'-galactosyllactose and 6'-galactosyllactose.
В дополнительном и/или альтернативном воплощении акцепторная молекула представляет собой тетрасахарид, предпочтительно тетрасахарид, выбранный из группы, состоящей из лакто-N-тетраозы, лакто-N-неотетраозы, 2',3-дифукозиллактозы, 3-фукозил-3'-сиалиллактозы и 3-фукозил-6'-сиалиллактозы.In an additional and/or alternative embodiment, the acceptor molecule is a tetrasaccharide, preferably a tetrasaccharide selected from the group consisting of lacto-N-tetraose, lacto-N-neotetraose, 2',3-difucosyllactose, 3-fucosyl-3'-sialyllactose and 3-fucosyl-6'-sialyllactose.
В дополнительном и/или альтернативном воплощении акцепторная молекула представляет собой пентасахарид, предпочтительно пентасахарид, выбранный из группы, состоящей из сиалиллакто-N-тетраозы а, сиалиллакто-N-тетраозы b, сиалиллакто-N-тетраозы с, лакто-N-фукопентаозы I, лакто-N-фукопентаозы II, лакто-N-фукопентаозы III, лакто-N-фукопентаозы V, лакто-N-неофукопентаозы I и лакто-N-неофукопентаозы V.In an additional and/or alternative embodiment, the acceptor molecule is a pentasaccharide, preferably a pentasaccharide selected from the group consisting of sialyllacto-N-tetraose a, sialyllacto-N-tetraose b, sialyllacto-N-tetraose c, lacto-N-fucopentaose I, lacto-N-fucopentaose II, lacto-N-fucopentaose III, lacto-N-fucopentaose V, lacto-N-neofucopentaose I and lacto-N-neofucopentaose V.
Термин "функциональный вариант", использованный в данном описании применительно к упомянутому в данной заявке ферменту, относится к полипептидным вариантам указанных ферментов, не утратившим активности, и последовательность которых по меньшей мере на 70%, предпочтительно по меньшей мере на 80%, по меньшей мере на 85%, по меньшей мере на 90%, по меньшей мере на 95%, по меньшей мере на 98% или по меньшей мере на 99% идентична аминокислотной последовательности указанного фермента. При этом принимается во внимание возможность некоторой вариабельности в данных по геномным последовательностям, из которых эти полипептиды происходят, и также возможность того, что некоторые из аминокислот, присутствующих в этих полипептидах, могут быть заменены без существенного затрагивания каталитической активности фермента.The term "functional variant" as used herein in relation to an enzyme mentioned herein refers to polypeptide variants of said enzymes that have not lost activity and are at least 70%, preferably at least 80%, consistent 85%, at least 90%, at least 95%, at least 98%, or at least 99% identical to the amino acid sequence of the specified enzyme. This takes into account the possibility of some variability in the genomic sequence data from which these polypeptides are derived, and also the possibility that some of the amino acids present in these polypeptides can be replaced without significantly affecting the catalytic activity of the enzyme.
Термин "функциональный вариант" также включает в себя полипептидные варианты указанных ферментов, которые представляют собой укороченные варианты фермента без существенной утраты каталитической активности. Таким образом, аминокислотная последовательность укороченных вариантов может отличаться от аминокислотных последовательностей указанного фермента в том смысле, что отсутствует одна аминокислота, отсутствуют две аминокислоты или участок, состоящий из следующих друг за другом аминокислот числом больше двух. Укорочение может быть произведено на амино-конце (N-конце), на карбоксильном конце (С-конце) и/или внутри аминокислотной последовательности указанного фермента.The term "functional variant" also includes polypeptide variants of these enzymes, which are shortened versions of the enzyme without significant loss of catalytic activity. Thus, the amino acid sequence of the truncated variants may differ from the amino acid sequences of the specified enzyme in the sense that one amino acid is missing, two amino acids are missing, or a region consisting of more than two consecutive amino acids. The truncation can be made at the amino terminus (N-terminus), at the carboxyl terminus (C-terminus) and/or within the amino acid sequence of the enzyme.
Термин "функционально связанный" относится к функциональной связи между последовательностью контроля экспрессии нуклеиновой кислоты (такой как промотор, сигнальная последовательность или ряд сайтов связывания транскрипционных факторов) и второй нуклеиновокислотной последовательностью, причем данная последовательность контроля экспрессии оказывает влияние на транскрипцию и/или трансляцию нуклеиновой кислоты, соответствующей второй последовательности.The term "operably linked" refers to a functional relationship between a nucleic acid expression control sequence (such as a promoter, a signal sequence, or a series of transcription factor binding sites) and a second nucleic acid sequence, wherein the expression control sequence affects the transcription and/or translation of the nucleic acid, the corresponding second sequence.
Следует понимать, что в случае микробной клетки, уже несущей один или несколько генов, кодирующих указанные ферменты, и экспрессирующей указанные гены способом, достаточным для продуцирования NeuNAc, ЦМФ-NeuNAc и/или сиалилированного сахарида, нет необходимости в проведении генетической модификации с целью завершения биосинтеза сиаловой кислоты и переноса группировки сиаловой кислоты на акцепторный сахарид, но тем не менее генетическая модификация может быть осуществлена для того, чтобы изменить уровень экспрессии одного или более указанных генов с целью повышения содержания внутри клетки одного или нескольких продуктов указанных генов, как например, с целью повышения количества глутамин:фруктозо-6-фосфат-аминотрансферазы, глюкозамин-6-фосфат-N-ацетилтрансферазы, N-ацетилглюкозамин-6-фосфат-фосфатазы, N-ацетилглюкозамин-2-эпимеразы и/или синтазы N-ацетилнейраминовой кислоты, что повышает таким образом скорость биосинтеза NeuSAc и, в связи с этим, сиалилированного сахарида, в генетически модифицированной клетке.It should be understood that in the case of a microbial cell already carrying one or more genes encoding these enzymes, and expressing these genes in a manner sufficient to produce NeuNAc, CMP-NeuNAc and/or sialylated saccharide, there is no need for genetic modification to complete the biosynthesis sialic acid and transfer of the sialic acid moiety to an acceptor saccharide, but genetic modification may nevertheless be carried out to change the expression level of one or more of these genes in order to increase the intracellular content of one or more products of these genes, such as for the purpose of increasing the amount of glutamine:fructose-6-phosphate aminotransferase, glucosamine-6-phosphate-N-acetyltransferase, N-acetylglucosamine-6-phosphate-phosphatase, N-acetylglucosamine-2-epimerase and/or N-acetylneuraminic acid synthase, which increases thus the rate of biosynthesis of NeuSAc and, in connection with this, sialylated saccharide, in a genetically modified cell.
В дополнительном и/или альтернативном воплощении в генетически модифицированной микробной клетке синтезируется больше ФЕП, чем в клетке дикого типа. В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка генетически модифицирована для того, чтобы обладать повышенным путем биосинтеза ФЕП. Предпочтительно, генетически модифицированная микробная клетка генетически модифицирована для того, чтобы обладать более высокой активностью фосфоенолпируват-синтазы, например, в том смысле, что ген ppsA, кодирующий фосфоенолпируват-синтазу, сверхэкспрессирован, и/или в том смысле, что не встречающиеся в природе микроорганизмы содержат по меньшей мере одну дополнительную копию нуклеотидной последовательности, что обеспечивает экспрессию фосфоенолпируват-синтазы или ее функционального варианта. Сверхэкспрессия гена ppsA повышает внутриклеточный синтез ФЕП, в результате чего больше молекул ФЕП доступны для продуцирования сиаловой кислоты. Например, подходящей фосфоенолпируват-синтазой является PpsA из Е. coli.In a further and/or alternative embodiment, more PEP is synthesized in the genetically modified microbial cell than in the wild type cell. In a further and/or alternative embodiment, the genetically modified microbial cell is genetically modified to have an enhanced PEP biosynthetic pathway. Preferably, the genetically modified microbial cell is genetically modified to have higher phosphoenolpyruvate synthase activity, for example in the sense that the ppsA gene encoding phosphoenolpyruvate synthase is overexpressed, and/or in the sense that non-naturally occurring microorganisms contain at least one additional copy of the nucleotide sequence, which provides expression of phosphoenolpyruvate synthase or a functional variant thereof. Overexpression of the ppsA gene increases intracellular PEP synthesis, resulting in more PEP molecules available for the production of sialic acid. For example, a suitable phosphoenolpyruvate synthase is PpsA from E. coli.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка содержит молекулу нуклеиновой кислоты, содержащую нуклеотидную последовательность, кодирующую PpsA из Е. coli или ее функциональный вариант. Указанная нуклеотидная последовательность, кодирующая PpsA из Е. coli или ее функциональный вариант, имеет идентичность последовательности по меньшей мере 80%, по меньшей мере 85%, по меньшей мере 90%, по меньшей мере 95%, по меньшей мере 98% или по меньшей мере 99% с последовательностью гена ppsA Е. coli.In a further and/or alternative embodiment, the genetically modified microbial cell comprises a nucleic acid molecule comprising a nucleotide sequence encoding PpsA from E. coli or a functional variant thereof. Said nucleotide sequence encoding PpsA from E. coli or a functional variant thereof has a sequence identity of at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least at least 99% with the E. coli ppsA gene sequence.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка помимо этого содержит один или более генов, кодирующих полипептид, который может обладать активностью фермента, выбранного из группы, состоящей из пермеазы сахарозы, гидролазы сахарозы, фруктокиназы, L-глутамин:D-фруктозо-6-фосфат-аминотрансферазы, глюкозамин-6-фосфат-N-ацетилтрансферазы, N-ацетилглюкозамин-2-эпимеразы, синтазы сиаловых кислот, фосфоенолпируват-синтазы, при этом предпочтительно, что по меньшей мере один из этих генов, предпочтительно все гены, сверхэкспрессируется/сверхэкспрессируются в генетически модифицированной микробной клетке по сравнению с микробной клеткой дикого типа.In an additional and/or alternative embodiment, the genetically modified microbial cell further contains one or more genes encoding a polypeptide that may have enzyme activity selected from the group consisting of sucrose permease, sucrose hydrolase, fructokinase, L-glutamine:D-fructose- 6-phosphate aminotransferases, glucosamine-6-phosphate-N-acetyltransferases, N-acetylglucosamine-2-epimerase, sialic acid synthase, phosphoenolpyruvate synthase, preferably at least one of these genes, preferably all genes, is overexpressed /overexpressed in a genetically modified microbial cell compared to a wild-type microbial cell.
В дополнительном и/или альтернативном воплощении путь катаболизма сиаловых кислот, который протекает естественным образом в линии клеток-предшественников для генетически модифицированной микробной клетки, не работает в генетически модифицированной микробной клетке.In a further and/or alternative embodiment, the sialic acid catabolism pathway that occurs naturally in the progenitor cell line for the genetically modified microbial cell does not operate in the genetically modified microbial cell.
В дополнительном и/или альтернативном воплощении способа и генетически модифицированной микробной клетки у данной генетически модифицированной микробной клетки отсутствует активность, или она обладает более низкой активностью, по сравнению с клеткой-предшественником генетически модифицированной микробной клетки, одного или более ферментов, выбранных из группы, состоящей из α-N-ацетилгалактозаминидазы (например, NagA), N-ацетилглюкозамин-киназы (например, NagK), N-ацетилнейраминат-лиазы (синоним: альдолазы N-ацетилнейраминовой кислоты, например, NanA), β-галактозидазы, глюкозамин-6-фосфат-дезаминазы, N-ацетилглюкозамин-6-фосфат-дезацетилазы, N-ацетилманнозамин-киназы и/или N-ацетилманнозамин-6-фосфат-эпимеразы.In an additional and/or alternative embodiment of the method and the genetically modified microbial cell, the genetically modified microbial cell lacks activity, or has lower activity, compared to the progenitor cell of the genetically modified microbial cell, of one or more enzymes selected from the group consisting of from α-N-acetylgalactosaminidase (e.g. NagA), N-acetylglucosamine kinase (e.g. NagK), N-acetylneuraminate lyase (synonym: N-acetylneuraminic acid aldolase, e.g. NanA), β-galactosidase, glucosamine-6- phosphate deaminases, N-acetylglucosamine 6-phosphate deacetylase, N-acetylmannosamine kinase and/or N-acetylmannosamine 6-phosphate epimerase.
В дополнительном и/или альтернативном воплощении способа и генетически модифицированной микробной клетки генетически модифицированная микробная клетка помимо этого содержит один или более генов, кодирующих полипептид, который может обладать активностью фермента, выбранного из группы, состоящей из N-ацетилглюкозамин-1-фосфат-уридилтрансферазы, глюкозамин-1-фосфат-ацетилтрансферазы, фосфоглюкозамин-мутазы, УДФ-N-ацетилглюкозамин-2-эпимеразы, УДФ-галактозо-4-эпимеразы, галактозо-1-фосфат-уридилилтрансферазы, фосфоглюкомутазы, глюкозо-1-фосфат-уридилилтрансферазы, фосфоманномутазы, маннозо-1-фосфат-гуанозилтрансферазы, ГДФ(гуанозиндифосфат)-манноза-4,6-дегидратазы, ГДФ-L-фукозосинтазы и фукозокиназы/L-фукозо-1-фосфат-гуанилтрансферазы.In an additional and/or alternative embodiment of the method and the genetically modified microbial cell, the genetically modified microbial cell further contains one or more genes encoding a polypeptide that may have the activity of an enzyme selected from the group consisting of N-acetylglucosamine-1-phosphate-uridyltransferase, glucosamine-1-phosphate acetyltransferase, phosphoglucosamine mutase, UDP-N-acetylglucosamine-2-epimerase, UDP-galactose-4-epimerase, galactose-1-phosphate-uridylyltransferase, phosphoglucomutase, glucose-1-phosphate-uridylyltransferase, phosphomannomutase, mannose-1-phosphate-guanosyltransferase, GDP(guanosine diphosphate)-mannose-4,6-dehydratase, GDP-L-fucose synthase and fucosokinase/L-fucose-1-phosphate-guanyltransferase.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка содержит по меньшей мере один компонент, выбранный из группы, состоящей из функционально активной пермеазы лактозы, функционально активного переносчика (экспортера) сиаловых кислот, причем она предпочтительно содержит и экспрессирует по меньшей мере одну нуклеотидную последовательность, кодирующую один компонент, выбранный из группы, состоящей из функционально активной пермеазы лактозы, функционально активной пермеазы сахарозы, функционально активного переносчика (экспортера) сиаловых кислот, при этом предпочтительно, чтобы в данной клетке сверхэкспрессировалась по меньшей мере одна из этих нуклеотидных последовательностей.In an additional and/or alternative embodiment, the genetically modified microbial cell contains at least one component selected from the group consisting of a functionally active lactose permease, a functionally active sialic acid transporter (exporter), and preferably it contains and expresses at least one nucleotide sequence , encoding one component selected from the group consisting of a functionally active lactose permease, a functionally active sucrose permease, a functionally active sialic acid transporter (exporter), and preferably at least one of these nucleotide sequences is overexpressed in a given cell.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка дополнительно модифицирована с возможностью перенесения указанного единственного источника углерода в данную клетку посредством механизма без потребления ФЕП.In a further and/or alternative embodiment, the genetically modified microbial cell is further modified to transfer said single carbon source into the cell through a mechanism without consuming PEP.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка обладает функциональной системой утилизации сахарозы. Указанная функционально активная система утилизации сахарозы делает возможным импорт в клетку экзогенно поставляемой сахарозы и ее гидролиз, в результате чего получаемые моносахариды глюкоза и фруктоза могут утилизироваться метаболическим путем благодаря метаболизму генетически модифицированной клетки и в целях получения желаемого сиалилированного олигосахарида.In an additional and/or alternative embodiment, the genetically modified microbial cell has a functional sucrose utilization system. This functionally active sucrose utilization system allows the import of exogenously supplied sucrose into the cell and its hydrolysis, as a result of which the resulting monosaccharides glucose and fructose can be utilized metabolically through the metabolism of the genetically modified cell and in order to obtain the desired sialylated oligosaccharide.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка генетически модифицирована для того, чтобы обладать функционально активной системой утилизации сахарозы. В дополнительном и/или альтернативном воплощении система утилизации сахарозы у не встречающегося в природе микроорганизма содержит систему переноса молекул сахарозы и протонов по механизму симпорта, фруктокиназу, инвертазу и репрессор сахарозного оперона.In a further and/or alternative embodiment, the genetically modified microbial cell is genetically modified to have a functional sucrose utilization system. In an additional and/or alternative embodiment, the sucrose utilization system in a non-natural microorganism comprises a sucrose and proton symport transport system, fructokinase, invertase, and a sucrose operon repressor.
Подходящей системой переноса молекул сахарозы и протонов по механизму симпорта является CscB, кодируемая геном cscB, например, CscB из Е. coli (UniProtKB - Р30000), кодируемая геном cscB Е. coli.A suitable system for transferring sucrose molecules and protons by the symport mechanism is CscB, encoded by the cscB gene, for example, CscB from E. coli (UniProtKB - P30000), encoded by the cscB gene of E. coli.
Подходящей фруктокиназой (ЕС 2.7.1.4) является CscK, кодируемая геном cscK, например, CscK из Е. coli (UniProtKB - Р40713), кодируемая геном cscK E. coli.A suitable fructokinase (EC 2.7.1.4) is CscK encoded by the cscK gene, for example CscK from E. coli (UniProtKB - P40713) encoded by the cscK gene of E. coli.
Подходящей инвертазой (ЕС 3.2.1.26), которая катализирует гидролиз концевых нередуцирующих β-D-фруктофуранозидных остатков в β-D-фруктофуранозидах, является CscA, например, CscA из Е. coli (UniProtKB -086076), кодируемая геном cscA E. coli.A suitable invertase (EC 3.2.1.26) that catalyzes the hydrolysis of terminal non-reducing β-D-fructofuranoside residues in β-D-fructofuranosides is CscA, for example, CscA from E. coli (UniProtKB -086076), encoded by the E. coli cscA gene.
Подходящим репрессором сахарозного оперона является CscR, кодируемый геном cscR, например, CscR из Е. coli (UniProtKB - Р62604), кодируемый геном cscR Е. coli.A suitable repressor of the sucrose operon is CscR, encoded by the cscR gene, for example, CscR from E. coli (UniProtKB - P62604), encoded by the E. coli cscR gene.
В дополнительном и/или альтернативном воплощении генетически модифицированная клетка генетически модифицирована для того, чтобы обладать системой переноса молекул сахарозы и протонов по механизму симпорта, фруктокиназой, инвертазой и репрессором сахарозного оперона или функционально активными вариантами любого из этих белков.In a further and/or alternative embodiment, the genetically modified cell is genetically modified to possess a sucrose and proton symport transport system, fructokinase, invertase and a sucrose operon repressor, or functionally active variants of any of these proteins.
В дополнительном и/или альтернативном воплощении генетически модифицированная клетка генетически модифицирована для того, чтобы обладать молекулой нуклеиновой кислоты, содержащей нуклеотидные последовательности, кодирующие систему переноса молекул сахарозы и протонов по механизму симпорта, фруктокиназу, инвертазу и репрессор сахарозного оперона, для экспрессии указанных системы переноса молекул сахарозы и протонов по механизму симпорта, фруктокиназы, инвертазы и репрессора сахарозного оперона. В дополнительном и/или альтернативном воплощении генетически модифицированная клетка генетически модифицирована с возможностью экспрессировать гены cscB, cscK, cscA, предпочтительно гены cscB, cscK, cscA и cscR E. coli.In an additional and/or alternative embodiment, the genetically modified cell is genetically modified to possess a nucleic acid molecule containing nucleotide sequences encoding a sucrose and proton symport transport system, fructokinase, invertase and a sucrose operon repressor, for expressing said transport system. sucrose and protons through the symport mechanism, fructokinase, invertase and repressor of the sucrose operon. In a further and/or alternative embodiment, the genetically modified cell is genetically modified to express the cscB, cscK, cscA genes, preferably the cscB, cscK, cscA and cscR genes of E. coli.
В дополнительном и/или альтернативном воплощении нуклеотидная последовательность, кодирующая функционально активный вариант CscB, CscK, CscA или CscR, имеет идентичность последовательности по меньшей мере 80%, по меньшей мере 85%, по меньшей мере 90%, по меньшей мере 95%, по меньшей мере 98% или по меньшей мере 99% с последовательностью cscB, cscK, cscA или cscR Е. coli, соответственно.In a further and/or alternative embodiment, the nucleotide sequence encoding a functionally active variant of CscB, CscK, CscA or CscR has at least 80%, at least 85%, at least 90%, at least 95% sequence identity to at least 98% or at least 99% with the E. coli cscB, cscK, cscA or cscR sequence, respectively.
В дополнительном и/или альтернативном воплощении такой не встречающийся в природе микроорганизм экспрессирует β-галактозид-пермеазу и β-галактозидазу.In a further and/or alternative embodiment, such a non-naturally occurring microorganism expresses β-galactoside permease and β-galactosidase.
В дополнительном и/или альтернативном воплощении такой не встречающийся в природе микроорганизм генетически модифицирован с возможностью экспрессировать р-галактозидпермеазу, предпочтительно пермеазу лактозы LacY из Е. coli (SEQ ID NO 93) или ее функционально активный вариант и β-галактозидазу, предпочтительно LacZ из Е. coli (SEQ ID NO 95) или ее функционально активный вариант. В дополнительном и/или альтернативном воплощении такой не встречающийся в природе микроорганизм генетически модифицирован с возможностью нести в себе молекулу нуклеиновой кислоты, содержащую нуклеотидную последовательность, кодирующую β-галактозид-пермеазу, предпочтительно нуклеотидную последовательность, кодирующую LacY из Е. coli (SEQ ID NO 94) или ее функционально активный вариант, и/или нуклеотидную последовательность, кодирующую β-галактозидазу, предпочтительно нуклеотидную последовательность, кодирующую LacZ из Е. coli (SEQ ID NO 96) или ее функционально активный вариант.In a further and/or alternative embodiment, such a non-naturally occurring microorganism is genetically modified to express a β-galactosidase, preferably the lactose permease LacY from E. coli (SEQ ID NO 93) or a functionally active variant thereof, and a β-galactosidase, preferably LacZ from E . coli (SEQ ID NO 95) or a functionally active variant thereof. In a further and/or alternative embodiment, such a non-naturally occurring microorganism is genetically modified to carry a nucleic acid molecule comprising a nucleotide sequence encoding a β-galactoside permease, preferably a nucleotide sequence encoding LacY from E. coli (SEQ ID NO 94 ) or a functionally active variant thereof, and/or a nucleotide sequence encoding β-galactosidase, preferably a nucleotide sequence encoding LacZ from E. coli (SEQ ID NO 96) or a functionally active variant thereof.
В дополнительном и/или альтернативном воплощении нуклеотидная последовательность, кодирующая LacY из Е. coli или ее функционально активный вариант, имеет идентичность последовательности с последовательностью lacY Е. coli, составляющую по меньшей мере 80%, по меньшей мере 85%, по меньшей мере 90%, по меньшей мере 95%, по меньшей мере 98% или по меньшей мере 99%.In a further and/or alternative embodiment, the nucleotide sequence encoding E. coli LacY or a functionally active variant thereof has sequence identity to the E. coli lacY sequence of at least 80%, at least 85%, at least 90% , at least 95%, at least 98% or at least 99%.
В дополнительном и/или альтернативном воплощении нуклеотидная последовательность, кодирующая LacZ из Е. coli или ее функционально активный вариант, имеет идентичность последовательности с последовательностью lacZ Е. coli, составляющую по меньшей мере 80%, по меньшей мере 85%, по меньшей мере 90%, по меньшей мере 95%, по меньшей мере 98% или по меньшей мере 99%.In a further and/or alternative embodiment, the nucleotide sequence encoding E. coli LacZ or a functionally active variant thereof has sequence identity with the E. coli lacZ sequence of at least 80%, at least 85%, at least 90% , at least 95%, at least 98% or at least 99%.
Наличие не встречающегося в природе микроорганизма, который может продуцировать ЦМФ-Neu5Ac и который экспрессирует функционально активную β-галактозид-пермеазу и функционально активную β-галактозидазу, предусматривает культивирование указанного не встречающегося в природе микроорганизма на лактозе как на единственном источнике углерода.The presence of a non-naturally occurring microorganism that can produce CMP-Neu5Ac and that expresses a functionally active β-galactoside permease and a functionally active β-galactosidase involves culturing the non-naturally occurring microorganism on lactose as the sole carbon source.
Генетически модифицированная микробная клетка, которая может продуцировать сиалилированные сахариды, возможно может иметь дополнительные свойства и может быть генетически модифицирована для того, чтобы обладать этими дополнительными свойствами. Считается, что эти дополнительные свойства улучшают продуктивность такого не встречающегося в природе микроорганизма, приводя к более высоким выходам сиалилированных сахаридов.A genetically modified microbial cell that can produce sialylated saccharides may possibly have additional properties and may be genetically modified to have these additional properties. These additional properties are believed to improve the productivity of such a non-naturally occurring microorganism, leading to higher yields of sialylated saccharides.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка генетически модифицирована с возможностью аннулирования активности УДФ-глюкоза:ундекапренилфосфат-глюкозо-1-фосфат-трансферазы, предпочтительно посредством делетирования гена wcaJ или его функционально активного варианта, посредством нарушения экспрессии гена wcaJ или его функционально активного варианта или посредством аннулирования активности фермента WcaJ в результате внесения мутаций в кодирующий белок участок, вследствие чего полипептид, кодируемый измененной нуклеотидной последовательностью, не обладает активностью фермента WcaJ. WcaJ кодирует УДФ-глюкоза:ундекапренилфосфат-глюкозо-1-фосфат-трансферазу. Указанная УДФ-глюкоза:ундекапренилфосфат-глюкозо-1-фосфат-трансфераза представляет собой первый фермент в биосинтезе колановой кислоты.In a further and/or alternative embodiment, the genetically modified microbial cell is genetically modified to abolish UDP-glucose:undecaprenylphosphate-glucose-1-phosphate transferase activity, preferably by deleting the wcaJ gene or a functionally active variant thereof, by disrupting the expression of the wcaJ gene or a functionally active variant thereof. active variant or by abrogating the activity of the WcaJ enzyme as a result of introducing mutations into the protein-coding region, as a result of which the polypeptide encoded by the altered nucleotide sequence does not have the activity of the WcaJ enzyme. WcaJ encodes UDP-glucose:undecaprenylphosphate-glucose-1-phosphate transferase. This UDP-glucose:undecaprenylphosphate-glucose-1-phosphate transferase is the first enzyme in the biosynthesis of colanic acid.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка генетически модифицирована в том смысле, что ген β-галактозидазы (lacZ) делетирован, в том смысле, что экспрессия гена β-галактозидазы нарушена, или в том смысле, что нуклеотидная последовательность кодирующего белок участка в гене β-галактозидазы изменена, вследствие чего полипептид, кодируемый указанной измененной нуклеотидной последовательностью, не обладает ферментативной активностью β-галактозидазы.In a further and/or alternative embodiment, the genetically modified microbial cell is genetically modified in the sense that the β-galactosidase (lacZ) gene is deleted, in the sense that expression of the β-galactosidase gene is disrupted, or in the sense that the nucleotide sequence of the protein coding region in the β-galactosidase gene is changed, as a result of which the polypeptide encoded by the specified altered nucleotide sequence does not have β-galactosidase enzymatic activity.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка генетически модифицирована в том смысле, что ген, кодирующий галактокиназу (например, ген galK), делетирован, в том смысле, что экспрессия гена galK нарушена, или в том смысле, что нуклеотидная последовательность кодирующего белок участка в гене galK изменена, вследствие чего полипептид, кодируемый указанной(ыми) измененной(ыми) нуклеотидной(ыми) последовательностью(ями), не обладает ферментативной активностью галактокиназы. Делетирование или инактивация гена gal/K/GalK имеет то преимущество, что генетически модифицированная микробная клетка может утилизировать галактозу в качестве акцепторного субстрата только в случае реакций сиалилирования.In a further and/or alternative embodiment, the genetically modified microbial cell is genetically modified in the sense that the gene encoding galactokinase (for example, the galK gene) is deleted, in the sense that expression of the galK gene is disrupted, or in the sense that the nucleotide sequence of the encoding the protein region in the galK gene is changed, as a result of which the polypeptide encoded by the specified altered nucleotide sequence(s) does not have galactokinase enzymatic activity. Deletion or inactivation of the gal/K/GalK gene has the advantage that the genetically modified microbial cell can utilize galactose as an acceptor substrate only in the case of sialylation reactions.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка генетически модифицирована в том смысле, что ген, кодирующий N-ацетилгалактозаминидазу (nagA), делетирован, что его экспрессия нарушена или в том смысле, что нуклеотидная последовательность кодирующего белок участка изменена, вследствие чего полипептид, кодируемый указанной(ыми) измененной(ыми) нуклеотидной(ыми) последовательностью(ями), не обладает ферментативной активностью N-ацетилгалактозаминидазы. Делеция или инактивация nagA/NagA имеет то преимущество, что генетически модифицированная микробная клетка может утилизировать GlcNAc или GlcNAc-6-фосфат в качестве акцептора только в случае реакций сиалилирования.In a further and/or alternative embodiment, the genetically modified microbial cell is genetically modified in the sense that the gene encoding N-acetylgalactosaminidase (nagA) is deleted, its expression is disrupted, or in the sense that the nucleotide sequence of the protein coding region is changed, thereby causing the polypeptide , encoded by the specified altered nucleotide sequence(s), does not possess N-acetylgalactosaminidase enzymatic activity. Deletion or inactivation of nagA/NagA has the advantage that the genetically modified microbial cell can utilize GlcNAc or GlcNAc-6-phosphate as a scavenger only in the case of sialylation reactions.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка генетически модифицирована с возможностью аннулирования фукозоизомеразной активности, предпочтительно посредством делетирования гена fucI, посредством нарушения экспрессии гена fucI или посредством модификации кодирующего белок участка в гене fucI, вследствие чего полипептид, кодируемый указанной измененной нуклеотидной последовательностью, не обладает активностью фукозоизомеразы. Например, L-фукозоизомераза FucI из Е. coli (UniProtKB - Р69922) кодируется геном fucI Е. coli.In a further and/or alternative embodiment, the genetically modified microbial cell is genetically modified to abolish fucose isomerase activity, preferably by deleting the fucI gene, by disrupting the expression of the fucI gene, or by modifying a protein coding region in the fucI gene, whereby the polypeptide encoded by said altered nucleotide sequence is does not have fucose isomerase activity. For example, L-fucose isomerase FucI from E. coli (UniProtKB - P69922) is encoded by the E. coli fucI gene.
Фукулозокиназа катализирует фосфорилирование фукозы. Фукулозокиназа представляет собой второй фермент в подпути, в котором синтезируются L-лактальдегид и глицерофосфат из L-фукозы. Фукулозокиназа FucK из Е. coli (UniProtKB - Р11553) кодируется геном fucK Е. coli. Фукулозокиназа из Е. coli также может катализировать фосфорилирование, с более низкой эффективностью, D-рибулозы, D-ксилулозы и D-фруктозы.Fuculose kinase catalyzes the phosphorylation of fucose. Fuculose kinase is the second enzyme in the subpathway that synthesizes L-lactaldehyde and glycerophosphate from L-fucose. Fuculose kinase FucK from E. coli (UniProtKB - P11553) is encoded by the E. coli fucK gene. Fuculose kinase from E. coli can also catalyze the phosphorylation, with lower efficiency, of D-ribulose, D-xylulose and D-fructose.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка генетически модифицирована с возможностью аннулирования фукозоизомеразной активности, предпочтительно посредством делетирования гена fucK, или посредством нарушения экспрессии гена fucK, или посредством внесения мутаций в кодирующий белок участк гена fucK, вследствие чего полипептид, кодируемый указанной измененной нуклеотидной последовательностью, не обладает активностью фукозоизомеразы.In an additional and/or alternative embodiment, the genetically modified microbial cell is genetically modified to abolish fucose isomerase activity, preferably by deleting the fucK gene, or by disrupting the expression of the fucK gene, or by introducing mutations in the protein-coding region of the fucK gene, whereby the polypeptide encoded by said altered nucleotide sequence, does not have fucose isomerase activity.
N-Ацетилгалактозамин-6-фосфат-дезацетилаза катализирует следующую реакцию: N-ацетил-D-галактозамин-6-фосфат + H2O → D-галактозамин-6-фосфат + ацетат.N-Ацетилгалактозамин-6-фосфат-дезацетилаза кодируется геном agaA. В Е. coli N-ацетилгалактозамин-6-фосфат-дезацетилаза AgaA (UniProtKB - Р42906) кодируется геном agaA Е. coli.N-Acetylgalactosamine 6-phosphate deacetylase catalyzes the following reaction: N-acetyl-D-galactosamine 6-phosphate + H 2 O → D-galactosamine 6-phosphate + acetate. N-Acetylgalactosamine 6-phosphate deacetylase is encoded by the gene agaA. In E. coli, N-acetylgalactosamine-6-phosphate deacetylase AgaA (UniProtKB - P42906) is encoded by the E. coli agaA gene.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка генетически модифицирована с возможностью аннулирования активности N-ацетилгалактозамин-6-фосфат-дезацетилазы, предпочтительно путем делетирования гена agaA, путем нарушения экспрессии гена agaA или путем внесения мутаций в кодирующий белок участок в гене agaA, вследствие чего полипептид, кодируемый указанной измененной нуклеотидной последовательностью, не обладает активностью N-ацетилгалактозамин-6-фосфат-дезацетилазы.In a further and/or alternative embodiment, the genetically modified microbial cell is genetically modified to abolish N-acetylgalactosamine 6-phosphate deacetylase activity, preferably by deleting the agaA gene, by disrupting the expression of the agaA gene, or by introducing mutations in the protein coding region in the agaA gene, as a result, the polypeptide encoded by said altered nucleotide sequence does not have N-acetylgalactosamine-6-phosphate deacetylase activity.
В дополнительном и/или альтернативном воплощении по меньшей мере одна генетически модифицированная микробная клетка обладает повышенным продуцированием одного или более нуклеотид-активированных сахаров, выбранных из группы, состоящей из УДФ-N-ацетилглюкозамина, УДФ-галактозы и ГДФ-фукозы. Предпочтительно, чтобы по меньшей мере одна генетически модифицированная микробная клетка была дополнительно генетически модифицированадля того, чтобы обладать повышенным продуцированием одного или нескольких указанных нуклеотид-активированных сахаров. Продуцирование по меньшей мере одного из указанных нуклеотид-активированных сахаров в такой дополнительно генетически модифицированной клетке повышено по сравнению с продуцированием того(тех) же нуклеотид-активированного(ых) сахара(ов) в клетке-предшественнике дополнительно генетически модифицированной микробной клетки до подвергания ее дополнительной генетической модификации с целью обладания повышенным продуцированием по меньшей мере одного из указанных нуклеотид-активированных сахаров.In a further and/or alternative embodiment, the at least one genetically modified microbial cell has increased production of one or more nucleotide-activated sugars selected from the group consisting of UDP-N-acetylglucosamine, UDP-galactose and GDP-fucose. Preferably, the at least one genetically modified microbial cell is further genetically modified to have increased production of one or more of said nucleotide-activated sugars. The production of at least one of the specified nucleotide-activated sugars in such additional genetically modified cell is increased compared to the production of the same nucleotide-activated sugar(s) in the precursor cell of the additional genetically modified microbial cell before subjecting it to additional genetic modification to have increased production of at least one of these nucleotide-activated sugars.
В дополнительном и/или альтернативном воплощении по меньшей мере одна микробная клетка дополнительно генетически модифицирована с возможностью сверхэкспрессировать один или несколько генов, кодирующих полипептид, который может обладать активностью фермента, выбранного из группы, состоящей из L-глутамин:D-фруктозо-6-фосфат-аминотрансферазы, N-ацетилглюкозамин-1-фосфат-уридилтрансферазы, глюкозамин-1-фосфат-ацетилтрансферазы, фосфоглюкозамин-мутазы, УДФ-галактозо-4-эпимеразы, галактозо-1-фосфат-уридилилтрансферазы, фосфоглюкомутазы, глюкозо-1-фосфат-уридилилтрансферазы, фосфоманномутазы, маннозо-1-фосфат-гуанозилтрансферазы, ГДФ-маннозо-4,6-дегидратазы, ГДФ-L-фукозосинтазы и фукозокиназы/L-фукозо-1-фосфат-гуанилтрансферазы.In a further and/or alternative embodiment, the at least one microbial cell is further genetically modified to overexpress one or more genes encoding a polypeptide that may have enzyme activity selected from the group consisting of L-glutamine:D-fructose-6-phosphate -aminotransferases, N-acetylglucosamine-1-phosphate-uridyltransferase, glucosamine-1-phosphate-acetyltransferase, phosphoglucosamine-mutase, UDP-galactose-4-epimerase, galactose-1-phosphate-uridylyltransferase, phosphoglucomutase, glucose-1-phosphate-uridylyltransferase , phosphomannomutase, mannose-1-phosphate-guanosyltransferase, GDP-mannose-4,6-dehydratase, GDP-L-fucose synthase and fucosokinase/L-fucose-1-phosphate-guanyltransferase.
В настоящее время, и как очевидно в общей области техники и в данном описании применительно к каждому полинуклеотиду или каждой нуклеиновой кислоте, рассмотренных в данном описании, соответственно, указанная сверхэкспрессия одного или нескольких генов или полипептидов, представляет собой сверхэкспрессию по сравнению с клеткой-предшественником дополнительно генетически модифицированной микробной клетки до подвергания ее дополнительной генетической модификации с целью обладания сверхэкспрессией указанных одного или нескольких генов или полипептидов.At present, and as is apparent in the general art and herein with respect to each polynucleotide or each nucleic acid contemplated herein, respectively, said overexpression of one or more genes or polypeptides constitutes overexpression relative to a progenitor cell additionally a genetically modified microbial cell prior to subjecting it to further genetic modification to overexpress said one or more genes or polypeptides.
Сверхэкспрессия одного или более из указанных генов увеличивает количество соответствующих полипептидов, т.е. фермента(ов), в генетически модифицированной микробной клетке, и поэтому в данной клетке повышается активность соответствующих ферментов, необходимых для повышения внутриклеточного продуцирования сиалилированных сахаридов.Overexpression of one or more of these genes increases the amount of the corresponding polypeptides, i.e. enzyme(s) in a genetically modified microbial cell, and therefore the activity of the corresponding enzymes necessary for increasing the intracellular production of sialylated saccharides is increased in this cell.
В дополнительном и/или альтернативном воплощении по меньшей мере у одной генетически модифицированной клетки отсутствует активность, или она обладает более низкой активностью, одного или более ферментов, выбранных из группы, состоящей из β-галактозидазы, глюкозамин-6-фосфат-дезаминазы, N-ацетилглюкозамин-6-фосфат-дезацетилазы, N-ацетилманнозамин-киназы, N-ацетилманнозамин-6-фосфат-эпимеразы и альдолазы N-ацетилнейраминовой кислоты, по сравнению с клеткой до подвергания ее генетической модификации.In a further and/or alternative embodiment, at least one genetically modified cell lacks activity, or has lower activity, of one or more enzymes selected from the group consisting of β-galactosidase, glucosamine-6-phosphate deaminase, N- acetylglucosamine 6-phosphate deacetylase, N-acetylmannosamine kinase, N-acetylmannosamine 6-phosphate epimerase, and N-acetylneuraminic acid aldolase, compared to a cell before genetic modification.
В дополнительном и/или альтернативном воплощении один или несколько генов, кодирующих β-галактозидазу, глюкозамин-6-фосфат-дезаминазу, N-ацетилглюкозамин-6-фосфат-дезацетилазу, N-ацетилманнозамин-киназу, N-ацетилманнозамин-6-фосфат-эпимеразу и альдолазу N-ацетилнейраминовой кислоты, делетирован/делетированы из генома генетически модифицированной клетки, или экспрессия одного или нескольких генов, кодирующих β-галактозидазу, глюкозамин-6-фосфат-дезаминазу, N-ацетилглюкозамин-6-фосфат-дезацетилазу, N-ацетилманнозамин-киназу, N-ацетилманнозамин-6-фосфат-эпимеразу и альдолазу N-ацетилнейраминовой кислоты, инактивирована или по меньшей мере ослаблена в генетически модифицированной клетке в результате проведения дополнительной генетической модификации клетки. Экспрессия указанных генов ослаблена в дополнительно генетически модифицированной клетке по сравнению с клеткой-предшественником дополнительно генетически модифицированной клетки до подвергания ее дополнительной генетической модификации с целью обладания ослабленной сверхэкспрессией указанных генов.In an additional and/or alternative embodiment, one or more genes encoding β-galactosidase, glucosamine 6-phosphate deaminase, N-acetylglucosamine 6-phosphate deacetylase, N-acetylmannosamine kinase, N-acetylmannosamine 6-phosphate epimerase and N-acetylneuraminic acid aldolase, deleted/deleted from the genome of a genetically modified cell, or expression of one or more genes encoding β-galactosidase, glucosamine-6-phosphate deaminase, N-acetylglucosamine-6-phosphate deacetylase, N-acetylmannosamine- kinase, N-acetylmannosamine-6-phosphate epimerase and N-acetylneuraminic acid aldolase, is inactivated or at least weakened in the genetically modified cell as a result of further genetic modification of the cell. The expression of said genes is attenuated in the additionally genetically modified cell compared to the progenitor cell of the additionally genetically modified cell before it is further genetically modified to have attenuated overexpression of said genes.
Генетически модифицированная микробная клетка предпочтительно представляет собой прокариотическую клетку. Соответствующие микробные клетки включают клетки дрожжей, клетки бактерий, клетки архебактерий, клетки водорослей и клетки грибов.The genetically modified microbial cell is preferably a prokaryotic cell. Suitable microbial cells include yeast cells, bacterial cells, archaebacterial cells, algal cells and fungal cells.
В дополнительном и/или альтернативном воплощении генетически модифицированная микробная клетка представляет собой бактериальную клетку, предпочтительно бактериальную клетку, выбранную из группы, состоящей из Bacillus, Lactobacillus, Lactococcus, Enterococcus, Bifidobacterium, Sporolactobacillus spp., Micromonospora spp., Micrococcus spp., Rhodococcus spp.и Pseudomonas. Подходящими бактериальными видами являются Bacillus subtilis, Bacillus licheniformis, Bacillus coagulans, Bacillus thermophilus, Bacillus laterosporus, Bacillus megaterium, Bacillus mycoides, Bacillus pumilus, Bacillus lentus, Bacillus cereus, Bacillus circulans, Bifidobacterium longum, Bifidobacterium infantis, Bifidobacterium bifidum, Citrobacter freundii, Clostridium cellulolyticum, Clostridium ljungdahlii, Clostridium autoethanogenum, Clostridium acetobutylicum, Corynebacterium glutamicum, Enterococcus faecium, Enterococcus thermophiies, Escherichia coli, Erwinia herbicola (Pantoea agglomerans), Lactobacillus acidophilus, Lactobacillus salivarius, Lactobacillus plantarum, Lactobacillus helveticus, Lactobacillus delbrueckii, Lactobacillus rhamnosus, Lactobacillus bulgaricus, Lactobacillus crispatus, Lactobacillus gasseri, Lactobacillus casei, Lactobacillus reuteri, Lactobacillus jensenii, Lactococcus iactis, Pantoea citrea, Pectobacterium carotovorum, Proprionibacterium freudenreichii, Pseudomonas fiuorescens, Pseudomonas aeruginosa, Streptococcus thermophiies и Xanthomonas campestris.In a further and/or alternative embodiment, the genetically modified microbial cell is a bacterial cell, preferably a bacterial cell selected from the group consisting of Bacillus, Lactobacillus, Lactococcus, Enterococcus, Bifidobacterium, Sporolactobacillus spp., Micromonospora spp., Micrococcus spp., Rhodococcus spp. .and Pseudomonas. Suitable bacterial species are Bacillus subtilis, Bacillus licheniformis, Bacillus coagulans, Bacillus thermophilus, Bacillus laterosporus, Bacillus megaterium, Bacillus mycoides, Bacillus pumilus, Bacillus lentus, Bacillus cereus, Bacillus circulans, Bifidobacterium longum, Bifidobacterium infantis, Bifidobacterium bifidum, Citrobacter f reundii, Clostridium cellulolyticum, Clostridium ljungdahlii, Clostridium autoethanogenum, Clostridium acetobutylicum, Corynebacterium glutamicum, Enterococcus faecium, Enterococcus thermophiies, Escherichia coli, Erwinia herbicola (Pantoea agglomerans), Lactobacillus acidophilus, Lactobacillus salivarius, Lactobacillus plantarum, obacillus helveticus, Lactobacillus delbrueckii, Lactobacillus rhamnosus, Lactobacillus bulgaricus , Lactobacillus crispatus, Lactobacillus gasseri, Lactobacillus casei, Lactobacillus reuteri, Lactobacillus jensenii, Lactococcus iactis, Pantoea citrea, Pectobacterium carotovorum, Proprionibacterium freudenreichii, Pseudomonas fiuorescens, Pseudomonas aeruginosa, thermophies and Xanthomonas campestris.
В альтернативном воплощении генетически модифицированная клетка представляет собой клетку дрожжей, предпочтительно выбранную из группы, состоящей из Saccharomyces sp,, в частности, Saccharomyces cerevisiae, Saccharomycopsis sp., Pichia sp., в частности, Pichia pastoris, Hansenula sp., Kluyveromyces sp., Yarrowia sp., Rhodotorula sp.и Schizosaccharomyces sp.In an alternative embodiment, the genetically modified cell is a yeast cell, preferably selected from the group consisting of Saccharomyces sp., in particular Saccharomyces cerevisiae, Saccharomycopsis sp., Pichia sp., in particular Pichia pastoris, Hansenula sp., Kluyveromyces sp., Yarrowia sp., Rhodotorula sp. and Schizosaccharomyces sp.
Генетически модифицированная клетка модифицирована методами генетической инженерии, чтобы содержать путь биосинтеза NeuNAc, активность синтетазы цитидин-5'-монофосфо-(ЦМФ)-сиаловой кислоты и активность сиалилтрансферазы.A genetically modified cell is genetically engineered to contain the NeuNAc biosynthetic pathway, cytidine 5'-monophospho-(CMP)-sialic acid synthetase activity, and sialyltransferase activity.
Термин "генетически модифицированная", использованный в данном описании, относится к модификации генотипа микробной клетки с использованием методов молекулярной биологии. Модификация генотипа микробной клетки может включать перенос генов в пределах видовых границ и/или через видовые границы, вставку, делетирование, замену и/или модификацию нуклеотидов, триплетов, генов, открытых рамок считывания, промоторов, энхансеров, терминаторов и других нуклеотидных последовательностей, опосредующих и/или регулирующих генную экспрессию. Модификация генотипа микробной клетки направлена на создание генетически модифицированного микроорганизма, обладающего особыми, желаемыми свойствами. Генетически модифицированные микробные клетки могут содержать один или более генов, которых нет в нативной (не подвергнутой генетической модификации) форме данной клетки. Методы введения молекул экзогенной нуклеиновой кислоты и/или вставки молекул экзогенной нуклеиновой кислоты (рекомбинантной, гетерологичной) в наследственную информацию клетки, с целью вставки, делетирования или изменения нуклеотидной последовательности генетической информации клетки, известны специалисту в данной области. Генетически модифицированные микробные клетки могут содержать один или более генов, которые имеются в нативной форме данной клетки, причем указанные гены модифицируют и повторно вводят в эту микробную клетку искусственным способом. Термин "генетически модифицированный" также охватывает микробные клетки, которые содержат молекулу нуклеиновой кислоты, являющуюся эндогенной для данной клетки, и которая модифицирована без извлечения молекулы данной нуклеиновой кислоты из этой клетки. Такие модификации включают модификации, осуществляемые с использованием замещения генов, сайт-специфических мутаций и связанных с ними методов.The term "genetically modified" as used herein refers to modification of the genotype of a microbial cell using molecular biology techniques. Modification of the genotype of a microbial cell may include gene transfer within and/or across species boundaries, insertion, deletion, substitution and/or modification of nucleotides, triplets, genes, open reading frames, promoters, enhancers, terminators and other nucleotide sequences that mediate and /or regulating gene expression. Modification of the genotype of a microbial cell is aimed at creating a genetically modified microorganism with special, desired properties. Genetically modified microbial cells may contain one or more genes that are not present in the native (non-genetically modified) form of the cell. Methods for introducing exogenous nucleic acid molecules and/or inserting exogenous nucleic acid molecules (recombinant, heterologous) into the hereditary information of a cell, for the purpose of inserting, deleting or changing the nucleotide sequence of the genetic information of the cell, are known to a person skilled in the art. Genetically modified microbial cells may contain one or more genes that are present in the native form of the cell, which genes are modified and reintroduced into the microbial cell by artificial means. The term "genetically modified" also includes microbial cells that contain a nucleic acid molecule that is endogenous to that cell and that is modified without removing the nucleic acid molecule from that cell. Such modifications include modifications made using gene replacement, site-specific mutations and related techniques.
Термин "гетерологичный", использованный в данном описании, относится к полипептиду, аминокислотной последовательности, молекуле нуклеиновой кислоты или нуклеотидной последовательности, который(ая) является чужеродным(ой) для клетки или организма, т.е. к полипептиду, аминокислотной последовательности, молекуле нуклеиновой кислоты или нуклеотидной последовательности, который(ая) не встречается в природе в указанной клетке или указанном организме. "Гетерологичная последовательность", или "гетерологичная нуклеиновая кислота", или "гетерологичный полипептид", как использовано в данном описании, означает все только что перечисленное, которое происходит из источника, чужеродного для данной конкретной клетки хозяина (например, из другого вида) или, если происходит из того же источника, то является модифицированной(ым) по сравнению со своей исходной формой. Таким образом, гетерологичная нуклеиновая кислота, функционально связанная с промотором, происходит из источника, отличающегося от того, из которого происходит промотор, или, в случае происхождения из того же источника, является модифицированной по сравнению с ее первоначальной формой. Гетерологичная последовательность может быть стабильно введена в геном микробной клетки хозяина посредством, например, трансфекции, трансформации, конъюгирования или трансдукции, в результате чего клетка хозяина становится генетически модифицированной. Могут быть применены методы, которые будут зависеть от клетки хозяина и подлежащей введению последовательности. Специалисту в данной области техники известны различные методы, и они описаны, например, в Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). Соответственно, "гетерологичный полипептид" представляет собой полипептид, который не встречается в природе в данной клетке, и "гетерологичная сиалилтрансфераза" представляет собой сиалилтрансферазу, которая не встречается в природе в данной микробной клетке.The term “heterologous” as used herein refers to a polypeptide, amino acid sequence, nucleic acid molecule or nucleotide sequence that is foreign to a cell or organism, i.e. to a polypeptide, amino acid sequence, nucleic acid molecule, or nucleotide sequence that does not occur naturally in a specified cell or organism. "Heterologous sequence" or "heterologous nucleic acid" or "heterologous polypeptide" as used herein means anything just listed that comes from a source foreign to that particular host cell (eg, from another species) or, if it comes from the same source, it is modified from its original form. Thus, the heterologous nucleic acid operably linked to a promoter is derived from a source different from that of the promoter or, if originated from the same source, is modified from its original form. A heterologous sequence can be stably introduced into the genome of a microbial host cell by, for example, transfection, transformation, conjugation or transduction, thereby causing the host cell to be genetically modified. Methods may be used which will depend on the host cell and the sequence to be introduced. Various methods are known to those skilled in the art and are described, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). Accordingly, a “heterologous polypeptide” is a polypeptide that does not naturally occur in a given cell, and a “heterologous sialyltransferase” is a sialyltransferase that does not naturally occur in a given microbial cell.
Согласно одному из аспектов предложен способ, посредством которого сиалилированный сахарид может быть получен путем ферментации, т.е. посредством биокатализа с применением цельных клеток, используя генетически модифицированную микробную клетку, как изложено в данном описании ранее. Для получения указанного сиалилированного сахарида нет необходимости в добавлении в ферментационный бульон N-ацетилглюкозамина, N-ацетилманнозамина и/или N-ацетилнейраминовой кислоты и/или в культивировании генетически модифицированной микробной клетки в присутствии N-ацетилглюкозамина, N-ацетилманнозамина и/или N-ацетилнейраминовой кислоты для внутриклеточного биосинтеза сиалилированного сахарида.In one aspect, a method is provided whereby the sialylated saccharide can be produced by fermentation, i.e. via whole cell biocatalysis using a genetically modified microbial cell as previously described herein. To obtain the specified sialylated saccharide, there is no need to add N-acetylglucosamine, N-acetylmannosamine and/or N-acetylneuraminic acid to the fermentation broth and/or to cultivate a genetically modified microbial cell in the presence of N-acetylglucosamine, N-acetylmannosamine and/or N-acetylneuraminic acid acids for intracellular biosynthesis of sialylated saccharide.
В данном способе по меньшей мере одну генетически модифицированную микробную клетку культивируют в ферментационном бульоне и в условиях, позволяющих получать сахарид, содержащий по меньшей мере одну группировку N-ацетилнейраминовой кислоты.In this method, at least one genetically modified microbial cell is cultured in a fermentation broth and under conditions that allow the production of a saccharide containing at least one N-acetylneuraminic acid moiety.
В дополнительном и/или альтернативном воплощении ферментационный бульон содержит по меньшей мере один источник углерода, при этом по меньшей мере один источник углерода предпочтительно выбран из группы, состоящей из глюкозы, фруктозы, сахарозы, глицерина и их комбинаций.In a further and/or alternative embodiment, the fermentation broth contains at least one carbon source, wherein the at least one carbon source is preferably selected from the group consisting of glucose, fructose, sucrose, glycerol, and combinations thereof.
Хотя в способе и в генетически модифицированной/ модифицированной микробной клетке используется источник углерода, нет необходимости добавлять в ферментационный бульон глюкозамин, и/или N-ацетилнейраминовую кислоту, и/или N-ацетил глюкозамин, и/или N-ацетилманнозамин, поскольку N-ацетилнейраминовая кислота продуцируется внутриклеточно генетически модифицированной микробной клеткой. Таким образом, в дополнительном и/или альтернативном воплощении по меньшей мере одну генетически модифицированную микробную клетку культивируют в отсутствие и/или без добавления одного или более чем одного, выбранного из группы, состоящей из глюкозамина, N-ацетилглюкозамина, N-ацетилманнозамина и N-ацетилнейраминовой кислоты. Генетически модифицированную микробную клетку можно культивировать в отсутствие и/или без добавления галактозы, если галактозу не поставляют в качестве акцепторного субстрата для реакции с участием сиалилтрансферазы. В дополнительном и/или альтернативном воплощении по меньшей мере одну генетически модифицированную микробную клетку культивируют в присутствии одного или нескольких моносахаридов (например, галактозы), дисахаридов (например, лактозы), трисахаридов (например, лакто-N-триозы II), тетрасахаридов (например, лакто-N-тетраозы) и/или пентасахаридов (например, сиалиллакто-N-тетраозы а).Although the process and the genetically modified/modified microbial cell utilize a carbon source, it is not necessary to add glucosamine and/or N-acetylneuraminic acid and/or N-acetyl glucosamine and/or N-acetylmannosamine to the fermentation broth since N-acetylneuraminic acid the acid is produced intracellularly by a genetically modified microbial cell. Thus, in a further and/or alternative embodiment, the at least one genetically modified microbial cell is cultured in the absence and/or without the addition of one or more than one selected from the group consisting of glucosamine, N-acetylglucosamine, N-acetylmannosamine and N- acetylneuraminic acid. The genetically modified microbial cell can be cultured in the absence and/or without the addition of galactose, as long as galactose is not supplied as an acceptor substrate for the sialyltransferase reaction. In a further and/or alternative embodiment, at least one genetically modified microbial cell is cultured in the presence of one or more monosaccharides (e.g., galactose), disaccharides (e.g., lactose), trisaccharides (e.g., lacto-N-triose II), tetrasaccharides (e.g. , lacto-N-tetraose) and/or pentasaccharides (for example, sialyllacto-N-tetraose a).
Согласно дополнительному и/или альтернативному воплощению по меньшей мере одну генетически модифицированную микробную клетку культивируют в присутствии по меньшей мере одного акцепторного субстрата, выбранного из группы, состоящей из галактозы, N-ацетил галактозами на, N-ацетилглюкозамина, лактозы, лактулозы, N-ацетиллактозамина, лакто-N-биозы, лакто-N-триозы, 2'-фукозиллактозы, 3-фукозиллактозы, 3'-сиалиллактозы, 6'-сиалиллактозы, 3'-сиалил-N-ацетиллактозамина, 6'-сиалил-N-ацетиллактозамина, 3'-галактозиллактозы, 6'-галактозиллактозы, лакто-N-триозы II, лакто-N-тетраозы, лакто-N-неотетраозы, 2',3-дифукозиллактозы, 3-фукозил-3'-сиалиллактозы и 3-фукозил-6'-сиалиллактозы. Эти субстраты импортируются в клетку и используются в данной клетке в качестве акцепторных молекул.According to a further and/or alternative embodiment, at least one genetically modified microbial cell is cultured in the presence of at least one acceptor substrate selected from the group consisting of galactose, N-acetyl galactosamine, N-acetylglucosamine, lactose, lactulose, N-acetyllactosamine , lacto-N-biose, lacto-N-triose, 2'-fucosyllactose, 3-fucosyllactose, 3'-sialyllactose, 6'-sialyllactose, 3'-sialyl-N-acetyllactosamine, 6'-sialyl-N-acetyllactosamine, 3'-galactosyllactose, 6'-galactosyllactose, lacto-N-triose II, lacto-N-tetraose, lacto-N-neotetraose, 2',3-difucosyllactose, 3-fucosyl-3'-sialyllactose and 3-fucosyl-6 '-sialyllactose. These substrates are imported into the cell and used as acceptor molecules by the cell.
Генетически модифицированной клетке для роста, размножения и продуцирования сиалилированных олигосахаридов необходим источник углерода. В дополнительном и/или альтернативном воплощении генетически модифицированная клетка может расти на недорогом единственном источнике углерода, таком как, например, глицерин, глюкоза или сахароза. Указанный единственный источник углерода предоставляет исходный продукт для биосинтеза ЦМФ-сиаловых кислот в генетически модифицированной клетке. Поэтому, чтобы получить сиалилированные олигосахариды, нет необходимости в культивировании генетически модифицированной клетки в присутствии Neu5Ac, ManNAc, GlcNAc или глюкозамина (GlcN).A genetically modified cell requires a carbon source to grow, reproduce, and produce sialylated oligosaccharides. In a further and/or alternative embodiment, the genetically modified cell can grow on an inexpensive single carbon source, such as, for example, glycerol, glucose or sucrose. This single carbon source provides the starting product for the biosynthesis of CMP-sialic acids in the genetically modified cell. Therefore, to obtain sialylated oligosaccharides, there is no need to cultivate a genetically modified cell in the presence of Neu5Ac, ManNAc, GlcNAc or glucosamine (GlcN).
Способ включает возможную стадию извлечения сиалилированного сахарида, который был продуцирован по меньшей мере одной генетически модифицированной микробной клеткой в процессе ее культивирования в ферментационном бульоне. Сиалилированный сахарид можно извлечь из ферментационного бульона после удаления генетически модифицированных микробных клеток, например, путем центрифугирования, и/или можно извлечь из клеток, например, в том смысле, что клетки собирают из ферментационного бульона путем центрифугирования и подвергают стадии лизирования клеток. После этого сиалилированные сахариды далее можно очистить от ферментационного бульона и/или клеточных лизатов подходящими методами, известными специалисту в данной области. Подходящие методы включают микрофильтрацию, ультрафильтрацию, диафильтрацию, хроматографию по типу хроматографии с псевдодвижущимся слоем, электродиализ, обратный осмос, гель-фильтрацию, анионообменную хроматографию, катионообменную хроматографию и тому подобные методы.The method includes the optional step of recovering a sialylated saccharide that has been produced by at least one genetically modified microbial cell during its cultivation in a fermentation broth. The sialylated saccharide can be recovered from the fermentation broth after removal of the genetically modified microbial cells, for example by centrifugation, and/or can be recovered from the cells, for example in the sense that the cells are collected from the fermentation broth by centrifugation and subjected to a cell lysis step. The sialylated saccharides can then be purified from the fermentation broth and/or cell lysates by suitable methods known to one skilled in the art. Suitable methods include microfiltration, ultrafiltration, diafiltration, pseudo-moving bed chromatography, electrodialysis, reverse osmosis, gel filtration, anion exchange chromatography, cation exchange chromatography and the like.
Способ и генетически модифицированную микробную клетку, применяемую в данном способе, используют для получения сиалилированного сахарида. Термин "сиалилированный сахарид" относится к молекуле сахарида, содержащей по меньшей мере одну группировку N-ацетилнейраминовой кислоты.The method and the genetically modified microbial cell used in this method are used to produce sialylated saccharide. The term "sialylated saccharide" refers to a saccharide molecule containing at least one N-acetylneuraminic acid moiety.
В дополнительном и/или альтернативном воплощении сиалилированный сахарид представляет собой олигосахарид. Термин "олигосахарид", использованный в данном описании, относится к полимерам, состоящим из моносахаридных остатков, при этом указанные полимеры содержат по меньшей мере два моносахаридных остатка, но не больше 10 моносахаридных остатков, предпочтительно не больше 7 моносахаридных остатков. Олигосахариды либо представляют собой линейную цепь моносахаридов, либо являются разветвленными. Помимо этого, моносахаридные остатки в олигосахарид ах могут иметь ряд химических модификаций. Соответственно, олигосахариды могут содержать одну или более несахаридных группировок. Термин "сиалилированный олигосахарид", использованный в данном описании, относится к олигосахаридам, содержащим одну или более группировок N-ацетилнейраминовой кислоты.In a further and/or alternative embodiment, the sialylated saccharide is an oligosaccharide. The term "oligosaccharide" as used herein refers to polymers composed of monosaccharide residues, wherein said polymers contain at least two monosaccharide residues, but no more than 10 monosaccharide residues, preferably no more than 7 monosaccharide residues. Oligosaccharides are either a linear chain of monosaccharides or are branched. In addition, monosaccharide residues in oligosaccharides can have a number of chemical modifications. Accordingly, oligosaccharides may contain one or more non-saccharide moieties. The term "sialylated oligosaccharide" as used herein refers to oligosaccharides containing one or more N-acetylneuraminic acid moieties.
Согласно дополнительному и/или альтернативному воплощению сиалилированный олигосахарид выбран из группы, состоящей из 3'-сиалиллактозы, 6'-сиалиллактозы, сиалиллакто-N-тетраозы а, сиалиллакто-N-тетраозы b, сиалиллакто-N-тетраозы с, фукозил-сиалиллакто-N-тетраозы а, фукозил-сиалиллакто-N-тетраозы b, фукозил-сиалиллакто-N-тетраозы с, дисиалиллакто-N-тетраозы, фукозилдисиалиллакто-N-тетраозы I, фукозилдисиалиллакто-N-тетраозы II, 3'-сиалилгалактозы, 6'-сиалилгалактозы, 3'-сиалил-N-ацетиллактозамина и 6'-сиалил-N-ацетиллактозамина.In a further and/or alternative embodiment, the sialylated oligosaccharide is selected from the group consisting of 3'-sialyllactose, 6'-sialyllactose, sialyllacto-N-tetraose a, sialyllacto-N-tetraose b, sialyllacto-N-tetraose c, fucosyl-sialyllacto- N-tetraose a, fucosyl-sialyllacto-N-tetraose b, fucosyl-sialyllacto-N-tetraose c, disialyl lacto-N-tetraose, fucosyldisialyl lacto-N-tetraose I, fucosyldisialyl lacto-N-tetraose II, 3'-sialylgalactose, 6' -sialylgalactose, 3'-sialyl-N-acetyllactosamine and 6'-sialyl-N-acetyllactosamine.
Согласно другому аспекту изобретения предложено применение генетически модифицированной микробной клетки, описанной в данной заявке ранее, для получения сиалилированного сахарида в способе ферментации с применением цельных клеток, т.е. сиалилированный сахарид синтезируют, используя генетически модифицированную микробную клетку.According to another aspect of the invention, the use of a genetically modified microbial cell, previously described in this application, for the production of sialylated saccharide in a fermentation method using whole cells, i.e. The sialylated saccharide is synthesized using a genetically modified microbial cell.
Согласно другому аспекту изобретения предложен сиалилированный сахарид, полученный способом и/или посредством использования генетически модифицированной микробной клетки, которые описаны в данной заявке ранее. В дополнительном и/или альтернативном воплощении сиалилированный сахарид представляет собой сиалилированный олигосахарид, предпочтительно сиалилированный олигосахарид, выбранный из группы, состоящей из 3'-сиалиллактозы, 6'-сиалиллактозы, сиалиллакто-N-тетраозы а, сиалиллакто-N-тетраозы b, сиалиллакто-N-тетраозы с, фукозил-сиалиллакто-N-тетраозы а, фукозил-сиалиллакто-N-тетраозы b, фукозил-сиалиллакто-N-тетраозы с, дисиалиллакто-N-тетраозы, фукозилдисиалиллакто-N-тетраозы I, фукозилдисиалиллакто-N-тетраозы II, 3'-сиалилгалактозы, 6'-сиалилгалактозы, 3'-сиалил-N-ацетиллактозамина и 6'-сиалил-N-ацетиллактозамина.According to another aspect of the invention, there is provided a sialylated saccharide obtained by a method and/or through the use of a genetically modified microbial cell as previously described in this application. In an additional and/or alternative embodiment, the sialylated saccharide is a sialylated oligosaccharide, preferably a sialylated oligosaccharide selected from the group consisting of 3'-sialyllactose, 6'-sialyllactose, sialyllacto-N-tetraose a, sialyllacto-N-tetraose b, sialyllactose N-tetraose c, fucosyl-sialyllacto-N-tetraose a, fucosyl-sialyllacto-N-tetraose b, fucosyl-sialyllacto-N-tetraose c, disialyl lacto-N-tetraose, fucosyldisialyl lacto-N-tetraose I, fucosyldisialyl lacto-N-tetraose II, 3'-sialylgalactose, 6'-sialylgalactose, 3'-sialyl-N-acetyllactosamine and 6'-sialyl-N-acetyllactosamine.
Согласно другому аспекту изобретения предложено применение сиалилированного сахарида, полученного способом, описанным в данной заявке ранее, и/или посредством использования генетически модифицированной микробной клетки, описанной в данной заявке ранее, для приготовления пищевой композиции.According to another aspect of the invention, the use of a sialylated saccharide obtained by a method previously described in this application and/or through the use of a genetically modified microbial cell described earlier in this application for the preparation of a food composition is proposed.
Таким образом, согласно другому аспекту изобретения предложена пищевая композиция, содержащая по меньшей мере один сиалилированный сахарид, предпочтительно по меньшей мере один сиалилированный олигосахарид, который получен способом и/или с использованием генетически модифицированной микробной клетки, которые описаны в данной заявке ранее. В дополнительном и/или альтернативном воплощении сиалилированный олигосахарид выбран из группы, состоящей из 3'-сиалиллактозы, 6'-сиалиллактозы, сиалиллакто-N-тетраозы а, сиалиллакто-N-тетраозы b, сиалиллакто-N-тетраозы с, фукозил-сиалиллакто-N-тетраозы а, фукозил-сиалиллакто-N-тетраозы b, фукозил-сиалиллакто-N-тетраозы с, дисиалиллакто-N-тетраозы, фукозилдисиалиллакто-N-тетраозы I, фукозилдисиалиллакто-N-тетраозы II.Thus, according to another aspect of the invention, there is provided a food composition comprising at least one sialylated saccharide, preferably at least one sialylated oligosaccharide, which is produced by a method and/or using a genetically modified microbial cell as previously described herein. In an additional and/or alternative embodiment, the sialylated oligosaccharide is selected from the group consisting of 3'-sialyllactose, 6'-sialyllactose, sialyllacto-N-tetraose a, sialyllacto-N-tetraose b, sialyllacto-N-tetraose c, fucosyl-sialyllacto- N-tetraose a, fucosyl-sialyllacto-N-tetraose b, fucosyl-sialyllacto-N-tetraose c, disialyl lacto-N-tetraose, fucosyldisialyl lacto-N-tetraose I, fucosyldisialyl lacto-N-tetraose II.
В дополнительном и/или альтернативном воплощении пищевая композиция дополнительно содержит по меньшей мере один нейтральный НМО, предпочтительно 2'-FL.In a further and/or alternative embodiment, the food composition further comprises at least one neutral HMO, preferably 2'-FL.
В дополнительном и/или альтернативном воплощении пищевая композиция содержит 3-SL, 6-SL и 2'-FL.In an additional and/or alternative embodiment, the food composition contains 3-SL, 6-SL and 2'-FL.
В дополнительном воплощении пищевая композиция выбрана из группы, состоящей из лекарственных, фармацевтических композиций, смеси для грудных детей и пищевых добавок.In a further embodiment, the nutritional composition is selected from the group consisting of medicinal, pharmaceutical, infant formula and dietary supplement compositions.
Пищевая композиция может быть представлена в жидкой форме или в твердой форме, включая, но не ограничиваясь этим, порошки, гранулы, хлопья и пеллеты.The food composition may be in liquid form or in solid form, including, but not limited to, powders, granules, flakes and pellets.
Далее настоящее изобретение будет описано применительно к конкретным воплощениям, однако данное изобретение не ограничивается ими, а только формулой изобретения. Кроме того, термины "первый", "второй" и тому подобные в описании и в формуле изобретения используются для различения схожих элементов и необязательно для описания последовательности либо во времени, либо в пространстве, либо при ранжировании, либо для описания любым другим образом. Следует понимать, что используемые таким образом термины являются взаимозаменяемыми при соответствующих обстоятельствах, и что воплощения изобретения, описанные в данной заявке, могут работать в других последовательностях, чем изложено или проиллюстрировано в данном описании.Next, the present invention will be described in relation to specific embodiments, however, the present invention is not limited to them, but only to the claims. In addition, the terms “first,” “second,” and the like in the specification and claims are used to distinguish like elements and not necessarily to describe a sequence either in time, space, ranking, or any other manner of description. It should be understood that the terms so used are interchangeable under appropriate circumstances, and that the embodiments of the invention described in this application may operate in sequences other than those set forth or illustrated herein.
Нужно отметить, что термин "содержащий", использованный в формуле изобретения, не следует интерпретировать как термин, ограниченный перечисленными ниже средствами; он не исключает другие элементы или стадии. Таким образом, его следует интерпретировать как термин, конкретизирующий наличие указанных признаков, целых чисел, стадий или компонентов, которые упомянуты, и не исключающий наличия или добавления одного или нескольких других признаков, целых чисел, стадий или компонентов либо их групп. Таким образом, объем выражения "устройство, содержащее средства А и В", не следует ограничивать устройствами, состоящими только из компонентов А и В. Применительно к настоящему изобретению это означает, что единственными релевантными компонентами данного устройства являются А и В.It should be noted that the term "comprising" used in the claims should not be interpreted as a term limited to the following means; it does not exclude other elements or stages. Thus, it should be interpreted as a term specifying the presence of specified features, integers, stages or components that are mentioned, and not excluding the presence or addition of one or more other features, integers, stages or components or groups thereof. Thus, the scope of the expression "device comprising means A and B" should not be limited to devices consisting only of components A and B. For the purposes of the present invention, this means that the only relevant components of a given device are A and B.
Упоминание по всему этому описанию "одного из воплощений" или "какого-либо воплощения" означает, что по меньшей мере в одно из воплощений настоящего изобретения включены конкретный признак, конкретная структура или конкретное характерное свойство, описанные вместе с данным воплощением. Таким образом, случаи появления в различных местах по всему данному описанию фраз "в одном из воплощений" или "в воплощении" не обязательно всегда относятся к одному и тому же воплощению, но относиться могут. Кроме того, конкретные признаки, структуры или характеристики могут быть объединены любым подходящим образом в одном или нескольких воплощениях, что будет очевидно специалисту средней квалификации в данной области техники из этого описания.Reference throughout this description to “one of the embodiments” or “any embodiment” means that at least one of the embodiments of the present invention includes a particular feature, a particular structure, or a particular characteristic property described in conjunction with that embodiment. Thus, occurrences in various places throughout this specification of the phrases “in one of the embodiments” or “in an embodiment” do not necessarily always refer to the same embodiment, but they may. Moreover, specific features, structures, or characteristics may be combined in any suitable manner in one or more embodiments, as will be apparent to one of ordinary skill in the art from this disclosure.
Аналогичным образом, должно быть очевидно, что в описании типичных воплощений изобретения различные признаки данного изобретения иногда сгруппированы вместе в одном воплощении, на одном чертеже или их описании с целью оптимизации раскрытия и помощи в понимании одного или более чем одного из различных аспектов изобретения. Однако, этот способ раскрытия не следует интерпретировать как отражение намерения, что заявляемое изобретение требует большего количества признаков, чем явно указанные в каждом пункте формулы изобретения. Скорее, как отражено следующей далее формулой изобретения, аспекты изобретения заключаются не во всех признаках одного описанного выше воплощения. Таким образом, формула изобретения, приведенная после подробного описания, тем самым явно включена в это подробное описание, причем каждый пункт формулы изобретения имеет (юридическую) силу сам по себе в качестве отдельного воплощения данного изобретения.Likewise, it will be apparent that in the description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, drawing, or description thereof for the purpose of streamlining the disclosure and aiding in the understanding of one or more of the various aspects of the invention. However, this manner of disclosure should not be interpreted as reflecting the intention that the claimed invention requires more features than those expressly set forth in each claim. Rather, as reflected by the following claims, aspects of the invention are not embodied in all of the features of a single embodiment described above. Thus, the claims appearing after the detailed description are hereby expressly incorporated into the detailed description, with each claim having legal effect on its own as a separate embodiment of the invention.
Кроме того, хотя некоторые воплощения, описанные в данной заявке, включают в себя некоторые, но не другие признаки, включенные в другие воплощения, подразумевается, что комбинации признаков различных воплощений находятся в пределах объема изобретения и образуют другие воплощения, что будет понятно специалистам в данной области техники. Например, в приведенной далее формуле изобретения любое из заявляемых воплощений можно использовать в любой комбинации.In addition, although some embodiments described herein include some but not other features included in other embodiments, combinations of features of various embodiments are intended to be within the scope of the invention and form other embodiments as will be understood by those skilled in the art. field of technology. For example, in the following claims, any of the claimed embodiments can be used in any combination.
Кроме того, некоторые из воплощений описаны в данной заявке как способ или комбинация элементов способа, который может быть реализован с использованием процессора компьютерной системы или другого средства выполнения данной функции. Таким образом, процессор с необходимыми инструкциями для выполнения такого способа или элемента способа образует средство для осуществления способа или элемента способа. Кроме того, описанный в данной заявке элемент воплощения устройства представляет собой пример средства для осуществления функции, выполняемой этим элементом с целью осуществления изобретения.In addition, some of the embodiments are described herein as a method or combination of elements of a method that can be implemented using a computer system processor or other means of performing a given function. Thus, a processor with the necessary instructions for executing such method or method element constitutes means for implementing the method or method element. In addition, the device embodiment element described herein is an example of a means for implementing the function performed by this element for the purpose of carrying out the invention.
В описании и графических материалах, представленных в данной заявке, приведены многочисленные конкретные подробности. Однако очевидно, что воплощения изобретения могут быть осуществлены на практике без этих конкретных подробностей. В других случаях общеизвестные способы, структуры и методы не показаны подробно, чтобы не затруднять понимания этого описания.Numerous specific details are set forth in the specification and drawings provided herein. However, it will be appreciated that embodiments of the invention may be practiced without these specific details. In other cases, well-known methods, structures and methods are not shown in detail so as not to obscure the understanding of this description.
Теперь данное изобретение будет раскрыто посредством подробного описания нескольких воплощений изобретения. Очевидно, что в соответствии со знаниями специалистов в данной области техники могут быть созданы другие воплощения изобретения без отклонения от истинной сущности или технического учения изобретения, при этом изобретение ограничено только условиями прилагаемой формулы изобретения.The present invention will now be disclosed by way of detailed description of several embodiments of the invention. It will be appreciated that other embodiments of the invention may be made in accordance with the knowledge of those skilled in the art without departing from the true spirit or technical doctrine of the invention, and the invention is limited only by the terms of the appended claims.
ПримерыExamples
На Фиг. 1-3 показаны схемы, демонстрирующие альтернативные пути для внутриклеточного биосинтеза NeuNAc, ЦМФ-NeuNAc и сиалилированных сахаридов.In FIG. 1-3 are schematic diagrams demonstrating alternative pathways for the intracellular biosynthesis of NeuNAc, CMP-NeuNAc, and sialylated saccharides.
С использованием клеток, генетически модифицированных так, как описано в данной заявке, может быть осуществлено ферментативное получение сиалилированных сахаридов. Единственный предусмотренный источник углерода (например, сахароза) поступает в микробную клетку и метаболизируется с образованием фруктозо-6-фосфата (от Фиг. 1 до Фиг. 3). Затем L-глутамин:D-фруктозо-6-фосфат-аминотрансфераза (GlmS) осуществляет превращение фруктозо-6-фосфата в глюкозамин-6-фосфат (от Фиг. 1 до Фиг. 3), который в свою очередь метаболизируется глюкозамин-6-фосфат-N-ацетил-трансферазой (Gna1) до N-ацетилглюкозамин-6-фосфата (Фиг. 2 и Фиг. 3). N-Ацетилглюкозамин-6-фосфат может быть превращен 1) в N-ацетилманнозамин-6-фосфат при участии N-ацетилглюкозамин-6-фосфат-эпимеразы (NanE) и далее в N-ацетилманнозамин при участии N-ацетилманнозамин-6-фосфат-фосфатазы (Фиг. 3) или 2) в N-ацетилглюкозамин при участии N-ацетилглюкозамин-6-фосфат-фосфатазы (YihX/YqaB) и далее метаболизирован до N-ацетилманнозамина при участии N-ацетилглюкозамин-2-эпимеразы (Slr1975) (Фиг. 2). Синтаза сиаловых кислот (NanA) катализирует превращение N-ацетилманнозамина в N-ацетил-нейраминовую кислоту, которая преобразуется в ЦМФ-N-ацетилнейраминовую кислоту при участии синтетазы ЦМФ-сиаловых кислот (от Фиг. 1 до Фиг. 3). Акцепторный субстрат может быть добавлен в культуральную жидкость, введен в клетку и модифицирован или синтезирован de novo рекомбинантной клеткой хозяина. Акцепторный субстрат образует связь с N-ацетилнейраминовой кислотой в реакции, катализируемой сиалилтрансферазой (SiaT), с образованием сиалилированного сахарида, который может быть экспортирован в культуральную жидкость.Using cells genetically modified as described in this application, enzymatic production of sialylated saccharides can be carried out. The only carbon source provided (eg, sucrose) enters the microbial cell and is metabolized to form fructose-6-phosphate (Figure 1 to Figure 3). L-glutamine:D-fructose-6-phosphate aminotransferase (GlmS) then converts fructose-6-phosphate to glucosamine-6-phosphate (Figure 1 to Figure 3), which in turn is metabolized by glucosamine-6-phosphate. phosphate-N-acetyl-transferase (Gna1) to N-acetylglucosamine-6-phosphate (Fig. 2 and Fig. 3). N-Acetylglucosamine-6-phosphate can be converted 1) into N-acetylmannosamine-6-phosphate with the participation of N-acetylglucosamine-6-phosphate epimerase (NanE) and further into N-acetylmannosamine with the participation of N-acetylmannosamine-6-phosphate- phosphatase (Fig. 3) or 2) to N-acetylglucosamine with the participation of N-acetylglucosamine-6-phosphate phosphatase (YihX/YqaB) and further metabolized to N-acetylmannosamine with the participation of N-acetylglucosamine-2-epimerase (Slr1975) (Fig. .2). Sialic acid synthase (NanA) catalyzes the conversion of N-acetylmannosamine to N-acetyl-neuraminic acid, which is converted to CMP-N-acetylneuraminic acid by CMP-sialic acid synthetase (Figure 1 to Figure 3). The acceptor substrate can be added to the culture fluid, introduced into the cell, and modified or synthesized de novo by a recombinant host cell. The acceptor substrate forms a bond with N-acetylneuraminic acid in a reaction catalyzed by sialyltransferase (SiaT) to form a sialylated saccharide that can be exported to the culture fluid.
Пример 1. Получение различных сиалилированных олигосахаридовExample 1. Preparation of various sialylated oligosaccharides
Последовательности генов охарактеризованных или предполагаемых сиалилтрансфераз получали из литературных и общедоступных баз данных. Поскольку часто описывается, что сиалилтрансферазы проявляют более высокую активность при удалении их сигнального пептида, авторы изобретения анализировали соответствующие белковые последовательности, используя для предсказания в режиме "онлайн" программное средство SignalP (Petersen et al., Nature Methods, 2011, Sep 29; 8(10): 785-6). Гены были получены путем синтеза благодаря сотрудничеству с GenScript либо, как аннотировано, в полноразмерной форме, либо, когда предсказано наличие сигнального пептида, в виде укороченного варианта, у которого N-концевой сигнальный пептид отсутствует.Gene sequences of characterized or putative sialyltransferases were obtained from literature and public databases. Since sialyltransferases are often described to exhibit higher activity upon removal of their signal peptide, we analyzed the corresponding protein sequences using the SignalP software for online prediction (Petersen et al., Nature Methods, 2011, Sep 29; 8( 10): 785-6). The genes were synthesized through collaboration with GenScript, either as annotated in full-length form or, when predicted to have a signal peptide, as a truncated version lacking the N-terminal signal peptide.
Каждую из сиалилтрансфераз 1-26 субклонировали в составе оперона с neuА методом сиквенс-независимого безлигазного клонирования (SLIC, от англ. sequence and ligation-independent cloning) в pDEST14 с использованием ген-специфичных праймеров, получая плазмиды общего вида: pDEST14-siaT-neuA. Остальные сиалилтрансферазы 27-100 непосредственно субклонировали благодаря сотрудничеству с GenScript в плазмиду рЕТ11а с использованием сайтов рестрикции NdeI и BamHI. Обе экспрессирующие системы позволяют осуществлять изопропилтиогалактозид(IPTG)-индуцибельную (IPTG, от англ. isopropylthiogalactoside) генную экспрессию. Чтобы провести скрининг активностей in vitro, данными плазмидами трансформировали штамм Е. coli BL21(DE3), лишенный активности LacZ.Each of the sialyltransferases 1-26 was subcloned as part of the operon with neuA by sequence-independent ligation-free cloning (SLIC, from sequence and ligation-independent cloning) into pDEST14 using gene-specific primers, obtaining plasmids of the general form: pDEST14-siaT-neuA . The remaining sialyltransferases 27-100 were directly subcloned through collaboration with GenScript into plasmid pET11a using NdeI and BamHI restriction sites. Both expression systems allow for isopropylthiogalactoside (IPTG)-inducible gene expression. To screen for in vitro activities, E. coli strain BL21(DE3), lacking LacZ activity, was transformed with these plasmids.
Штаммы Е, coli, несущие плазмиды для экспрессии siaT9 (α-2,3-сиалилтрансферазы) и siaT18 (α-2,6-сиалилтрансферазы), выращивали при 30°С во встряхиваемых колбах емкостью 100 мл, в которые вносили 20 мл среды 2YT (двукратный дрожжевой экстракт-триптон), дополненной ампициллином (100 мкг/мл). Когда оптическая плотность при 600 нм (OD600) для культур достигала значений в диапазоне 0,1-0,3, индуцировали генную экспрессию, добавляя 0,3 мМ изопропилтиогалактозид, и инкубирование продолжали в течение 12-16 часов. Клетки собирали путем центрифугирования и механически разрушали в определенном объеме 50 мМ буфера на основе трис-HCl, рН 7,5, используя стеклянные шарики. Белковый экстракт хранили на льду до начала проведения анализа. Анализ in vitro проводили в общем объеме 25 мкл, содержащем 50 мМ трис-HCI, рН 7,5, 5 мМ MgCl2, 10 мМ ЦМФ-Neu5Ac и соответствующие акцепторные субстраты в концентрации от 5 до 20 мМ. Анализ начинали, добавляя 3 мкл белкового экстракта, и продолжали в течение 16 часов. Образование сиалилированных олигосахаридов в результате проявления активности сиалилтрансфераз определяли посредством тонкослойной хроматографии (TLC).E coli strains carrying plasmids for the expression of siaT9 (α-2,3-sialyltransferase) and siaT18 (α-2,6-sialyltransferase) were grown at 30°C in 100 ml shake flasks, into which 20 ml of 2YT medium was added (double tryptone yeast extract) supplemented with ampicillin (100 μg/ml). When the optical density at 600 nm (OD 600 ) for the cultures reached values in the range of 0.1-0.3, gene expression was induced by adding 0.3 mM isopropylthiogalactoside and incubation was continued for 12-16 hours. Cells were collected by centrifugation and mechanically disrupted in a defined volume of 50 mM Tris-HCl buffer, pH 7.5, using glass beads. The protein extract was kept on ice until analysis. The in vitro assay was performed in a total volume of 25 μl containing 50 mM Tris-HCI, pH 7.5, 5 mM MgCl 2 , 10 mM CMP-Neu5Ac and appropriate acceptor substrates at concentrations ranging from 5 to 20 mM. The assay was started by adding 3 μl of protein extract and continued for 16 hours. The formation of sialylated oligosaccharides as a result of sialyltransferase activity was determined by thin layer chromatography (TLC).
Для этого образцы наносили на пластинки с силикагелем 60 F254 (Merck KGaA, Darmstadt, Germany). В качестве подвижной фазы использовали смесь бутанол:ацетон:уксусная кислота:H2O (35/35/7/23 (об./об./об./об.)). Для обнаружения разделяемых веществ TLC-пластинку пропитывали реагентом на основе тимола (0,5 г тимола, растворенного в 95 мл этанола с добавлением 5 мл серной кислоты) и нагревали. Сиалилированные продукты реакции двигались медленнее, чем их акцепторные субстраты.For this purpose, the samples were applied to plates with silica gel 60 F 254 (Merck KGaA, Darmstadt, Germany). A mixture of butanol:acetone:acetic acid: H2O (35/35/7/23 (v/v/v/v)) was used as the mobile phase. To detect the substances to be separated, the TLC plate was impregnated with a thymol-based reagent (0.5 g of thymol dissolved in 95 ml of ethanol with the addition of 5 ml of sulfuric acid) and heated. The sialylated reaction products moved more slowly than their acceptor substrates.
Обе сиалилтрансферазы обладали способностью сиалилировать галактозу или различные олигосахариды, содержащие по меньшей мере один остаток галактозы. Никакого сиалилированного олигосахарида не было обнаружено, когда в реакционную смесь добавляли сахарозу (Таблица 3).Both sialyltransferases had the ability to sialylate galactose or various oligosaccharides containing at least one galactose residue. No sialylated oligosaccharide was detected when sucrose was added to the reaction mixture (Table 3).
Пример 2. Конструирование путей обмена веществ штамма BL21(DE3) Е. coli c целью получения N-ацетилнейраминовой кислотыExample 2. Design of metabolic pathways of E. coli strain BL21(DE3) to obtain N-acetylneuraminic acid
Конструирование путей обмена веществ осуществляли посредством мутагенеза и делетирования определенных эндогенных генов и интегрирования в геном гетерологичных генов. Гены lacZ и araA инактивировали посредством мутагенеза с использованием ошибочно спаривающихся олигонуклеотидов, как описано Ellis и др. (Proc. Natl. Acad. Sci. USA, 98: 6742-6746 (2001)).The construction of metabolic pathways was carried out through mutagenesis and deletion of certain endogenous genes and integration of heterologous genes into the genome. The lacZ and araA genes were inactivated by mutagenesis using mismatched oligonucleotides as described by Ellis et al. (Proc. Natl. Acad. Sci. USA, 98: 6742-6746 (2001)).
Делетирование участков генома осуществляли в соответствии со способом Datsenko и Warner (Proc. Natl. Acad. Sci. USA, 97: 6640-6645 (2000)). Чтобы предотвратить расщепление N-ацетилглюкозамина, из генома штамма BL21(DE3) Е. coli были делетированы следующие гены: ген N-ацетилглюкозамин-специфичного PTS-фермента II (nagE) (PTS - фосфотрансферазная система, от англ. phosphotransferase system), ген N-ацетилглюкозамин-6-фосфат-дезацетилазы (nagA) и ген глюкозамин-6-фосфат-дезаминазы (nagB). Также был целиком делетирован кластер генов катаболизма N-ацетилнейраминовой кислоты, кодирующий N-ацетилманнозамин-киназу (nanK), N-ацетилманнозамин-6-фосфат-эпимеразу (nanE), альдолазу N-ацетилнейраминовой кислоты (папА) и пермеазу сиаловых кислот (nanT). Гены manX, manY и manZ, кодирующие фосфоенолпируват-зависимую фосфотрансферазную систему, облегчающую импорт глюкозамина, также были делетированы. Кроме того, были делетированы гены wzxC-wcaJ. Ген wcaJ кодирует УДФ-глюкоза:ундекапренилфосфат-глюкозо-1-фосфат-трансферазу, катализирующую первую стадию синтеза колановой кислоты (Stevenson et al., J. Bacteriol., 1996, 178: 4885-4893). Помимо этого были делетированы гены fucI, fucK и agaA, кодирующие L-фукозоизомеразу, L-фукулозокиназу и N-ацетилгалактозамин-6-фосфат-дезацетилазу, соответственно.Deletion of genome regions was carried out in accordance with the method of Datsenko and Warner (Proc. Natl. Acad. Sci. USA, 97: 6640-6645 (2000)). To prevent the breakdown of N-acetylglucosamine, the following genes were deleted from the genome of E. coli strain BL21(DE3): N-acetylglucosamine-specific PTS enzyme II (nagE) gene (PTS - phosphotransferase system), N gene -acetylglucosamine 6-phosphate deacetylase (nagA) and glucosamine 6-phosphate deaminase gene (nagB). The N-acetylneuraminic acid catabolism gene cluster encoding N-acetylmannosamine kinase (nanK), N-acetylmannosamine-6-phosphate epimerase (nanE), N-acetylneuraminic acid aldolase (papA), and sialic acid permease (nanT) was also completely deleted. . The manX, manY, and manZ genes, encoding the phosphoenolpyruvate-dependent phosphotransferase system that facilitates glucosamine import, were also deleted. In addition, the wzxC-wcaJ genes were deleted. The wcaJ gene encodes UDP-glucose:undecaprenylphosphate-glucose-1-phosphate transferase, which catalyzes the first step in the synthesis of colanic acid (Stevenson et al., J. Bacteriol., 1996, 178: 4885-4893). In addition, the fucI, fucK, and agaA genes encoding L-fucose isomerase, L-fuculose kinase, and N-acetylgalactosamine-6-phosphate deacetylase, respectively, were deleted.
Интегрирование в геном гетерологичных генов выполняли посредством транспозиции, используя либо транспозазу EZ-Tn5™ (Epicenter, USA), либо высокоактивную С9-мутантную форму транспозазы "моряка" (mariner transposase) Himar1 (Proc. Natl. Acad. Sci. USA, 1999, 96: 11428-11433). Чтобы получить EZ-Tn5 транспосомы, представляющий интерес ген вместе с маркером устойчивости к антибиотикам, фланкированным FRT-сайтами (от англ. flippase recognition target -участок распознования флиппазы), (альтернативно, ген маркера устойчивости был фланкирован сайтами lox66-lox71), амплифицировали. Полученный продукт полимеразной цепной реакции (ПЦР) нес на обоих концах состоящие из 19 пар оснований (п.о.) мозаичные концы - сайты распознавания для транспозазы EZ-Tn5. Для интегрирования с использованием транспозазы Himar1 представляющие интерес экспрессирующие конструкции (опероны) аналогичным образом клонировали вместе с маркерами устойчивости к антибиотикам, фланкированными FRT-сайтами/lox66-lox71-сайтами, и переносили в вектор pEcomar, который кодирует высокоактивную С9-мутантную форму транспозазы "моряка" Himar1 под контролем арабинозного индуцибельного промотора ParaB. Все гены были кодон-оптимизированы для экспрессии в Е. coli и получены путем синтеза в GenScript Corp.Integration of heterologous genes into the genome was performed by transposition using either the EZ-Tn5™ transposase (Epicenter, USA) or the highly active C9 mutant form of the mariner transposase Himar1 (Proc. Natl. Acad. Sci. USA, 1999, 96: 11428-11433). To obtain the EZ-Tn5 transposome, the gene of interest along with an antibiotic resistance marker flanked by FRT sites (alternatively, the resistance marker gene was flanked by lox66-lox71 sites) was amplified. The resulting polymerase chain reaction (PCR) product had mosaic ends at both ends consisting of 19 base pairs (bp) - recognition sites for the EZ-Tn5 transposase. For integration using the Himar1 transposase, expression constructs of interest (operons) were similarly cloned along with antibiotic resistance markers flanked by FRT sites/lox66-lox71 sites and transferred into the pEcomar vector, which encodes a highly active C9 mutant form of the sailor transposase "Himar1 is under the control of the arabinose inducible promoter P araB . All genes were codon-optimized for expression in E. coli and synthesized by GenScript Corp.
Экспрессируемый фрагмент <Ptet-lacY-FRT-aadA-FRT> интегрировали посредством использования транспозазы EZ-Tn5. После успешного интегрирования гена импортера лактозы LacY из Е. coli K12 TG1 (GenBank: ABN72583) ген устойчивости удаляли из стрептомицин-устойчивых клонов под действием рекомбиназы FLP (флиппазы, от англ. - flippase), кодируемой плазмидой рСР20 (Proc. Natl. Acad. Sci. USA, 2000, 97: 6640-6645). Кроме того, в геном встраивали кластер генов csc Е. coli W (GenBank: СР002185.1), содержащий гены пермеазы сахарозы, фруктокиназы, гидролазы сахарозы и транскрипционного репрессора (гены cscB, cscK, cscA и cscR, соответственно), позволяющий данному штамму расти на сахарозе в качестве единственного источника углерода. Этот кластер был интегрирован в геном штамма BL21(DE3) Е, coli посредством транспозиции с использованием плазмиды pEcomar-cscABKR.The expressed fragment <P tet -lacY-FRT-aadA-FRT> was integrated using the EZ-Tn5 transposase. After successful integration of the lactose importer gene LacY from E. coli K12 TG1 (GenBank: ABN72583), the resistance gene was removed from streptomycin-resistant clones by the action of FLP recombinase (flippase) encoded by plasmid pCP20 (Proc. Natl. Acad. Sci USA, 2000, 97: 6640-6645). In addition, the csc gene cluster of E. coli W (GenBank: CP002185.1), containing the genes for sucrose permease, fructokinase, sucrose hydrolase and transcriptional repressor (genes cscB, cscK, cscA and cscR, respectively), was inserted into the genome, allowing this strain to grow on sucrose as the sole carbon source. This cluster was integrated into the genome of E coli strain BL21(DE3) by transposition using the pEcomar-cscABKR plasmid.
Полученный штамм дополнительно модифицировали для продуцирования NeuNAc посредством интегрирования в геном следующих экспрессионных кассет: <Ptet-slr1975-gna1-lox66-aacC1-lox71> (SEQ ID NO 97), <Ptet-neuB-lox66-kanR-lox71> (SEQ ID NO 98), <Ptet-slr1975-Pt5-neuB-FRT-dhfr-FRT> (SEQ ID NO 99), <Ptet-glmS*-gna1-lox66-aacC1-lox71> (SEQ ID NO 100) и <Ptet-ppsA-1ох66-аасС1-lox71> (SEQ ID NO 101). За исключением экспрессионной кассеты dhfr (дигидрофолатредуктазы) все гены маркеров устойчивости пошагово удаляли из генома (перед следующим раундом интегрирования генов) посредством введения плазмиды pKD-Cre (SEQ ID NO 102), после чего проводили селекцию на чашках с 2YT-агаром, содержащим ампициллин (100 мкг/мл) и 100 мМ L-арабинозу, при 30°С. После этого устойчивые клоны переносили на чашки с 2YT-агаром без ампициллина, а также антибиотика, использованного для отбора при интегрировании в геном. Чашки инкубировали при 42°С для устранения содержащих данную плазмиду клеток. Для дальнейших экспериментов и модификаций использовали клоны, чувствительные к ампициллину и использованному для отбора антибиотику.The resulting strain was further modified to produce NeuNAc by integrating the following expression cassettes into the genome: <P tet -slr1975-gna1-lox66-aacC1-lox71> (SEQ ID NO 97), <P tet -neuB-lox66-kanR-lox71> (SEQ ID NO 98), <P tet -slr1975-P t5 -neuB-FRT-dhfr-FRT> (SEQ ID NO 99), <P tet -glmS*-gna1-lox66-aacC1-lox71> (SEQ ID NO 100) and <P tet -ppsA-1ox66-aacC1-lox71> (SEQ ID NO 101). With the exception of the dhfr (dihydrofolate reductase) expression cassette, all resistance marker genes were stepwise removed from the genome (before the next round of gene integration) by introducing the pKD-Cre plasmid (SEQ ID NO 102), followed by selection on 2YT agar plates containing ampicillin ( 100 µg/ml) and 100 mM L-arabinose, at 30°C. After this, resistant clones were transferred to 2YT agar plates without ampicillin, as well as the antibiotic used for selection during integration into the genome. The dishes were incubated at 42°C to eliminate cells containing this plasmid. For further experiments and modifications, clones sensitive to ampicillin and the antibiotic used for selection were used.
Ген slr1975 (GenBank: BAL35720) кодирует N-ацетилглюкозамин-2-эпимеразу из Synechocystis sp. РСС6803. Ген дпа1 (GenBank: NP_116637) кодирует глюкозамин-6-фосфат-ацетилтрансферазу из Saccharomyces cerevisiae. Ген neuB (GenBank: AF305571) кодирует синтазу сиаловых кислот из Campylobacter jejuni. Ген glmS* представляет собой мутантную версию гена L-глутамин:D-фруктозо-6-фосфат-аминотрансферазы из Е. coli (Metab. Eng., 2005, May; 7(3): 201-14). Ген ppsA (GenBank: ACT43527) кодирует фосфоенолпируват-синтазу из Е. coli BL21(DE3).The slr1975 gene (GenBank: BAL35720) encodes N-acetylglucosamine-2-epimerase from Synechocystis sp. RSS6803. The dpa1 gene (GenBank: NP_116637) encodes glucosamine 6-phosphate acetyltransferase from Saccharomyces cerevisiae. The neuB gene (GenBank: AF305571) encodes sialic acid synthase from Campylobacter jejuni. The glmS* gene is a mutant version of the L-glutamine:D-fructose-6-phosphate aminotransferase gene from E. coli (Metab. Eng., 2005, May; 7(3): 201-14). The ppsA gene (GenBank: ACT43527) encodes phosphoenolpyruvate synthase from E. coli BL21(DE3).
Чтобы создать <Ptet-slr1975-gna1-lox66-aacC1-lox71>, гены slr1975 и gna1 субклонировали в виде оперона после конститутивного промотора Ptet (тетрациклинового промотора), выполняли слияние с геном устойчивости к гентамицину (фланкированному lox66/1ох71-сайтами) и встраивали в вектор pEcomar с использованием лигирования по "тупым" концам. Полученную экспрессионную кассету интегрировали в геном с использованием вектора pEcomar-slr195-gna1-aacC1 и высокоактивной С9-мутантной формы транспозазы "моряка" Himar1 под контролем арабинозного индуцибельного промотора ParaB.To create <P tet -slr1975-gna1-lox66-aacC1-lox71>, the slr1975 and gna1 genes were subcloned as an operon after the constitutive P tet promoter (tetracycline promoter) and fused with the gentamicin resistance gene (flanked by lox66/1ox71 sites) and inserted into the pEcomar vector using blunt end ligation. The resulting expression cassette was integrated into the genome using the pEcomar-slr195-gna1-aacC1 vector and the highly active C9 mutant form of the sailor transposase Himar1 under the control of the arabinose inducible promoter P araB .
Чтобы создать <Ptet-neuB-lox66-kanR-lox71>, ген пеиВ клонировали после конститутивного промотора Ptet и выполняли слияние с геном устойчивости к канамицину (фланкированному 1ох66/1ох71-сайтами). Полученную экспрессионную кассету интегрировали в геном, используя транспозазу EZ-Tn5. Чтобы создать <Ptet-slr1975-Pt5-neuB-FRT-dhfr-FRT>, гены slr1975 и пеиВ по отдельности субклонировали после конститутивных промоторов Ptet и Pt5, соответственно, и проводили слияние с геном устойчивости к триметоприму (фланкированному FRT-сайтами). Полученную экспрессионную кассету интегрировали в геном, используя транспозазу EZ-Tn5.To create <P tet -neuB-lox66-kanR-lox71>, the peiB gene was cloned downstream of the constitutive P tet promoter and fused to the kanamycin resistance gene (flanked by 1ox66/1ox71 sites). The resulting expression cassette was integrated into the genome using the EZ-Tn5 transposase. To create <P tet -slr1975-P t5 -neuB-FRT-dhfr-FRT>, the slr1975 and peiB genes were individually subcloned downstream of the constitutive P tet and P t5 promoters, respectively, and fused to the trimethoprim resistance gene (flanked by FRT- sites). The resulting expression cassette was integrated into the genome using the EZ-Tn5 transposase.
Экспрессионную кассету <Ptet-glmS*-gna1-lox66-aacC1-lox71> создавали посредством клонирования генов glmS* и gna1 в виде оперона после конститутивного промотора Ptet. Далее выполняли слияние этой конструкции с геном устойчивости к гентамицину (фланкированному 1ох66/1ох71-сайтами). Полученную экспрессионную кассету интегрировали в геном, используя транспозазу EZ-Tn5.The expression cassette <P tet -glmS*-gna1-lox66-aacC1-lox71> was created by cloning the glmS* and gna1 genes as an operon downstream of the constitutive P tet promoter. Next, this construct was fused with the gentamicin resistance gene (flanked by 1ox66/1ox71 sites). The resulting expression cassette was integrated into the genome using the EZ-Tn5 transposase.
Чтобы создать <Ptet-ppsA-lox66-aacC1-lox71>, ген ppsA клонировали после конститутивного промотора Ptet и выполняли слияние с геном устойчивости к гентамицину (фланкированному 1ох66/1ох71-сайтами). Полученную экспрессионную кассету интегрировали в геном, используя транспозазу EZ-Tn5.To create <P tet -ppsA-lox66-aacC1-lox71>, the ppsA gene was cloned downstream of the constitutive P tet promoter and fused to the gentamicin resistance gene (flanked by 1ox66/1ox71 sites). The resulting expression cassette was integrated into the genome using the EZ-Tn5 transposase.
В целом, суммарные модификации генома привели к получению Neu5Ac-продуцирующего штамма Е. coli № NANA1.In general, the overall genome modifications led to the production of the Neu5Ac-producing E. coli strain No. NANA1.
Пример 3. Создание и культивирование линии микробных клеток для получения 3'-сиалиллактозыExample 3. Creation and cultivation of a microbial cell line to produce 3'-sialyllactose
Штамм Е. coli № NANA1 далее модифицировали посредством интегрирования <Piei-siaT9-Pi5-neuA-lox66-aacC1-lox71> (SEQ ID NO 103) в геном с использованием транспозазы EZ-Tn5, получая штамм, продуцирующий 3'-SL. Ген siaT9 (GenBank: BAF91160), кодон-оптимизированный для экспрессии в Е. coli и полученный путем синтеза в GenScript, кодирует α-2,3-сиалилтрансферазу из Vibrio sp. JT-FAJ-16. Ген neuA (GenBank: AF305571) кодирует синтетазу ЦМФ-сиаловых кислот из Campylobacter jejuni.E. coli strain No. NANA1 was further modified by integrating < Piei -siaT9-P i5 -neuA-lox66-aacC1-lox71> (SEQ ID NO 103) into the genome using the EZ-Tn5 transposase, resulting in a 3'-SL producing strain . The siaT9 gene (GenBank: BAF91160), codon-optimized for expression in E. coli and synthesized in GenScript, encodes an α-2,3-sialyltransferase from Vibrio sp. JT-FAJ-16. The neuA gene (GenBank: AF305571) encodes a CMP-sialic acid synthetase from Campylobacter jejuni.
Культивирование штамма проводили в 96-луночных планшетах. С этой целью одиночные колонии штамма переносили с агаровых чашек в лунки титрационных микропланшетов с 200 мкл минимальной среды, содержащей: NH4H2PO4 - 7 г⋅л-1; K2HPO4 - 7 г⋅л-1; KОН - 2 г⋅л-1; лимонную кислоту - 0,3 г л-1; NH4Cl - 5 г л-1; противовспенивающий агент - 1 мл⋅л-1, 0,1 мМ CaCl2; 8 мМ MgSO4, микроэлементы и 2% сахарозы в качестве источника углерода. В состав микроэлементов входили: нитрилотриуксусная кислота - 0,101 г⋅л-1, рН 6,5; лимоннокислое аммиачное трехвалентное железо - 0,056 г⋅л-1; MnCl2 × 4H2O - 0,01 г⋅л-1; CoCl2 × 6H2O - 0,002 г⋅л-1; CuCl2 × 2H2O - 0,001 г⋅л-1; борная кислота - 0,002 г⋅л-1; ZnSO4 × 7H2O - 0,009 г⋅л-1; Na2MoO4 × 2H2O - 0,001 г⋅л-1; Na2SeO3 - 0,002 г⋅л-1; NiSO4 × 6H2O - 0,002 г⋅л-1. Культивирование проводили в течение приблизительно 20 часов при 30°С с энергичным встряхиванием. Затем по 50 мкл культуральной жидкости переносили в 96-луночные планшеты с глубокими лунками (2,0 мл), содержащими по 400 мкл минимальной среды из расчета на одну лунку.The strain was cultivated in 96-well plates. For this purpose, single colonies of the strain were transferred from agar plates into the wells of microtiter plates with 200 μl of minimal medium containing: NH 4 H 2 PO 4 - 7 g⋅l -1 ; K 2 HPO 4 - 7 g⋅l -1 ; KOH - 2 g⋅l -1 ; citric acid - 0.3 g l -1 ; NH 4 Cl - 5 g l -1 ; antifoaming agent - 1 ml⋅l -1 , 0.1 mM CaCl 2 ; 8 mM MgSO 4 , trace elements and 2% sucrose as a carbon source. The microelements included: nitrilotriacetic acid - 0.101 g⋅l -1 , pH 6.5; ammonium citrate ferric iron - 0.056 g⋅l -1 ; MnCl 2 × 4H 2 O - 0.01 g⋅l -1 ; CoCl 2 × 6H 2 O - 0.002 g⋅l -1 ; CuCl 2 × 2H 2 O - 0.001 g⋅l -1 ; boric acid - 0.002 g⋅l -1 ; ZnSO 4 × 7H 2 O - 0.009 g⋅l -1 ; Na 2 MoO 4 × 2H 2 O - 0.001 g⋅l -1 ; Na 2 SeO 3 - 0.002 g⋅l -1 ; NiSO 4 × 6H 2 O - 0.002 g⋅l -1 . Cultivation was carried out for approximately 20 hours at 30°C with vigorous shaking. Then, 50 μl of culture liquid was transferred into 96-well plates with deep wells (2.0 ml) containing 400 μl of minimal medium per well.
После инкубирования в течение еще 48 часов культивирование останавливали и с использованием масс-спектрометрии в супернатанте определяли уровень 3'-сиалиллактозы. Масс-спектрометрический анализ проводили в режиме мониторинга множественных реакций (MRM, от англ. multiple reaction monitoring), используя систему детектирования на основе жидкостного хромато-масс-спектрометра с тройным квадруполем (LC Triple-Quadrupole MS). В квадруполе 1 (Q1) осуществляется отбор и анализ родительских ионов, фрагментация происходит в ячейке соударений с использованием аргона в качестве инициирующего столкновительную диссоциацию газа (CID gas (от англ. collision-induced dissociation)), отбор фрагментированных ионов производится в квадруполе 3 (Q3). Хроматографическое разделение лактозы, 3'-сиалиллактозы и 6'-сиалиллактозы после разбавления культурального супернатанта в отношении 1:100 в H2O (со степенью чистоты "для LC/MS") выполняли на колонке для высокоэффективной жидкостной хроматографии (HPLC) XBridge Amide (3,5 мкм; 2,1×50 мм (Waters, USA)) с использованием защитного картриджа XBridge Amide (3,5 мкм; 2,1×10 мм (Waters, USA)). Температура термостата для колонок системы HPLC составляла 50°С. В качестве подвижной фазы использовали смесь ацетонитрил:H2O с 10 мМ ацетатом аммония. В прибор вводили образец объемом 1 мкл; процесс разделения проводили в течение 3,60 мин при скорости потока 400 мкл/мин. 3'-Сиалиллактозу и 6'-сиалиллактозу анализировали методом MRM, используя электрораспылительную ионизацию (ESI) в режиме положительных ионов. Для работы использовали масс-спектрометр с единичным разрешением. Сиалиллактоза образует ион с m/z 656,2 [М+Na]. Родительский ион сиалиллактозы далее фрагментировался в ячейке столкновений с образованием фрагментированных ионов с m/z 612,15; m/z 365,15 и m/z 314,15. "Энергию соударений, напряжение на первом и третьем квадруполях (Q1 Pre Bias и Q3 Pre Bias) оптимизировали индивидуально для каждого аналита. Методы количественного определения выполняли с использованием имеющихся в продаже стандартов (Carbosynth, Compton, UK). По окончании культивирования в культуральном супернатанте достигали титра 3'-SL приблизительно 0,6 гл-1.After incubation for another 48 hours, the culture was stopped and the level of 3'-sialyllactose in the supernatant was determined using mass spectrometry. Mass spectrometric analysis was carried out in multiple reaction monitoring (MRM) mode using a detection system based on a liquid chromatography-mass spectrometer with a triple quadrupole (LC Triple-Quadrupole MS). In quadrupole 1 (Q1), the selection and analysis of parent ions is carried out, fragmentation occurs in a collision cell using argon as a gas initiating collision-induced dissociation (CID gas), the selection of fragmented ions is carried out in quadrupole 3 (Q3 ). Chromatographic separation of lactose, 3'-sialyllactose and 6'-sialyllactose after diluting the culture supernatant 1:100 in H 2 O (LC/MS grade) was performed on an XBridge Amide high performance liquid chromatography (HPLC) column ( 3.5 µm; 2.1×50 mm (Waters, USA)) using the XBridge Amide protective cartridge (3.5 µm; 2.1×10 mm (Waters, USA)). The oven temperature for the HPLC columns was 50°C. A mixture of acetonitrile: H2O with 10 mM ammonium acetate was used as the mobile phase. A sample volume of 1 μl was injected into the device; the separation process was carried out for 3.60 min at a flow rate of 400 μL/min. 3′-Sialyl lactose and 6′-sialyllactose were analyzed by MRM using electrospray ionization (ESI) in positive ion mode. A unit resolution mass spectrometer was used for this work. Sialyl lactose forms an ion with m/z 656.2 [M+Na]. The parent sialyllactose ion was further fragmented in the collision cell to form fragment ions with m/z 612.15; m/z 365.15 and m/z 314.15. "Collision energy, voltage at the first and third quadrupoles (Q1 Pre Bias and Q3 Pre Bias) were optimized individually for each analyte. Quantification methods were performed using commercially available standards (Carbosynth, Compton, UK). At the end of cultivation, the culture supernatant reached 3'-SL titer is approximately 0.6 hl -1 .
Пример 4. Создание и культивирование линии микробных клеток для получения 6'-сиалиллактозыExample 4. Creation and cultivation of a microbial cell line to produce 6'-sialyllactose
Штамм Е. coli № NANA1 далее модифицировали посредством интегрирования <Ptet-siaT18-Pt5-neuA-lox66-aacC1-lox71> (SEQ ID NO 104) в геном с использованием транспозазы EZ-Tn5, получая штамм, продуцирующий 6'-SL. Ген siaT18 (GenBank: АВ500947), кодон-оптимизированный для экспрессии в Е. coli и полученный путем синтеза в GenScript, кодирует α-2,6-сиалилтрансферазу из Photobacterium leiognathi JT-SHIZ-119. Ген neuA (GenBank: AF305571) кодирует синтетазу ЦМФ-сиаловых кислот из Campylobacter jejuni.E. coli strain No. NANA1 was further modified by integrating <P tet -siaT18-P t5 -neuA-lox66-aacC1-lox71> (SEQ ID NO 104) into the genome using the EZ-Tn5 transposase, resulting in a 6'-SL producing strain . The siaT18 gene (GenBank: AB500947), codon-optimized for expression in E. coli and synthesized in GenScript, encodes an α-2,6-sialyltransferase from Photobacterium leiognathi JT-SHIZ-119. The neuA gene (GenBank: AF305571) encodes a CMP-sialic acid synthetase from Campylobacter jejuni.
С использованием этого продуцирующего 6'-SL штамма проводили культивирование в 96-луночном планшете, как описано в примере 2. По окончании культивирования в культуральном супернатанте достигали титра 6'-SL приблизительно 0,9 г⋅л-1.Using this 6'-SL producing strain, cultivation was carried out in a 96-well plate as described in Example 2. At the end of cultivation, a 6'-SL titer of approximately 0.9 g⋅l -1 was reached in the culture supernatant.
Пример 5. Состав смеси для грудных детей, содержащей сиалиллактозуExample 5 Composition of an infant formula containing sialyllactose
Смесь для грудных детей: Обезжиренное молокоInfant formula: Skim milk
Растительные масла (пальмовое масло, рапсовое масло, подсолнечное масло)Vegetable oils (palm oil, rapeseed oil, sunflower oil)
Олигосахариды грудного молокаBreast milk oligosaccharides
L-ФукозаL-Fucose
6'-Сиалиллактоза6'-Sialyl lactose
Сухое обезжиренное молокоSkimmed milk powder
Масло из Mortierella alpineMortierella alpine oil
Рыбий жирFish fat
Карбонат кальцияCalcium carbonate
Хлорид калияPotassium chloride
Витамин СVitamin C
Хлорид натрияSodium chloride
Витамин ЕVitamin E
Ацетат железаIron acetate
Сульфат цинкаZinc sulfate
НиацинNiacin
D-Пантотенат кальцияD-Calcium Pantothenate
Сульфат медиCopper sulfate
Витамин АVitamin A
Витамин В1Vitamin B1
Витамин В6Vitamin B6
Сульфат магнияMagnesium sulfate
Иодат калияPotassium iodate
Фолиевая кислотаFolic acid
Витамин KVitamin K
Селенит натрияSodium selenite
Витамин D.Vitamin D
--->--->
ПЕРЕЧЕНЬ ПОСЛЕДОВАТЕЛЬНОСТЕЙ LIST OF SEQUENCES
<110> Jennewein Biotechnologie GmbH<110> Jennewein Biotechnologie GmbH
<120> Получение сиалилированных сахаридов<120> Preparation of sialylated saccharides
<130> P 1802 WO<130>P 1802 WO
<160> 104 <160> 104
<170> PatentIn версия 3.5<170> PatentIn version 3.5
<210> 1<210> 1
<211> 1410<211> 1410
<212> ДНК<212> DNA
<213> Campylobacter coli<213> Campylobacter coli
<400> 1<400> 1
atgcaaaacg tcattatcgc tggtaacggt ccgagcctgc aatcaatcaa ctatcaacgc 60atgcaaaacg tcattatcgc tggtaacggt ccgagcctgc aatcaatcaa ctatcaacgc 60
ctgccgaaag aatacgacat cttccgctgc aaccagttct acttcgaaga taaatactac 120ctgccgaaag aatacgacat cttccgctgc aaccagttct acttcgaaga taaatactac 120
ctgggcaaaa acatcaaagc ggcctttttc aatccgtatc cgttcctgca gcaataccat 180ctgggcaaaa acatcaaagc ggcctttttc aatccgtatc cgttcctgca gcaataccat 180
accgcgaaac agctggtgtt caacaacgaa tacaaaatcg aaaacatctt ttgtagcacg 240accgcgaaac agctggtgtt caacaacgaa tacaaaatcg aaaacatctt ttgtagcacg 240
ttcaatctgc cgttcatcga aaaagataac ttcatcaaca aattttacga tttctttccg 300ttcaatctgc cgttcatcga aaaagataac ttcatcaaca aattttacga tttctttccg 300
gacgctaaac tgggtcacaa aatcatcgaa aacctgaaag aattttacgc gtacatcaaa 360gacgctaaac tgggtcacaa aatcatcgaa aacctgaaag aattttacgc gtacatcaaa 360
tacaacgaaa tctacctgaa caaacgtatt accagcggca tctatatgtg cgcaattgct 420tacaacgaaa tctacctgaa caaacgtatt accagcggca tctatatgtg cgcaattgct 420
atcgcgctgg gttataaaaa catttacctg tgtggcatcg atttctatga aggtgaaacg 480atcgcgctgg gttataaaaa catttacctg tgtggcatcg atttctatga aggtgaaacg 480
atctacccgt tcaaagccat gtctaaaaac attaagaaaa tttttccgtg gatcaaagat 540atctacccgt tcaaagccat gtctaaaaac attaagaaaa tttttccgtg gatcaaagat 540
ttcaacccga gtaacttcca ttccaaagaa tacgacatcg aaatcctgaa actgctggaa 600ttcaacccga gtaacttcca ttccaaagaa tacgacatcg aaatcctgaa actgctggaa 600
tcaatctaca aagttaacat ctacgcactg tgcgataact cggccctggc aaattacttc 660tcaatctaca aagttaacat ctacgcactg tgcgataact cggccctggc aaattacttc 660
ccgctgctgg tgaacaccga caattcattt gttctggaaa acaaatcgga tgactgtatc 720ccgctgctgg tgaacaccga caattcattt gttctggaaa acaaatcgga tgactgtatc 720
aacgatatcc tgctgaccaa caatacgccg ggcattaact tctataaaag ccagatccaa 780aacgatatcc tgctgaccaa caatacgccg ggcattaact tctataaaag ccagatccaa 780
gtcaacaata ccgaaattct gctgctgaac tttcagaata tgatcagcgc caaagaaaac 840gtcaacaata ccgaaattct gctgctgaac tttcagaata tgatcagcgc caaagaaaac 840
gaaatttcta acctgaacaa aatcctgcaa gactcataca aaaccatcaa cacgaaagaa 900gaaatttcta acctgaacaa aatcctgcaa gactcataca aaaccatcaa cacgaaagaa 900
aacgaaatta gtaatctgaa taaaatcctg caggattcct ataaaacgat taataccaaa 960aacgaaatta gtaatctgaa taaaatcctg caggattcct ataaaacgat taataccaaa 960
gaaaatgaaa tttcgaatct gaacaaaatc ctgcaggata aagacaaact gctgatcgtt 1020gaaaatgaaa tttcgaatct gaacaaaatc ctgcaggata aagacaaact gctgatcgtt 1020
aaagaaaacc tgctgaattt caaaagccgt catggtaaag ccaaatttcg cattcagaac 1080aaagaaaacc tgctgaattt caaaagccgt catggtaaag ccaaatttcg cattcagaac 1080
caactgtctt ataaactggg ccaggcaatg atggtcaata gcaaatctct gctgggttat 1140caactgtctt ataaactggg ccaggcaatg atggtcaata gcaaatctct gctgggttat 1140
atccgtatgc cgtttgtgct gagttacatc aaagacaaac acaaacagga acaaaaaatc 1200atccgtatgc cgtttgtgct gagttacatc aaagacaaac acaaacagga acaaaaaatc 1200
tatcaggaaa aaattaagaa agatccgagc ctgaccctgc cgccgctgga agattatccg 1260tatcaggaaa aaattaagaa agatccgagc ctgaccctgc cgccgctgga agattatccg 1260
gactacaaag aagctctgaa agaaaaagaa tgcctgacct atcgcctggg ccagacgctg 1320gactacaaag aagctctgaa agaaaaagaa tgcctgacct atcgcctggg cgacgctg 1320
attaaagcgg atcaagaatg gtacaaaggt ggctatgtga aaatgtggtt cgaaatcaaa 1380attaaagcgg atcaagaatg gtacaaaggt ggctatgtga aaatgtggtt cgaaatcaaa 1380
aaactgaaga aagaatacaa aaagaaataa 1410aaactgaaga aagaatacaa aaagaaataa 1410
<210> 2<210> 2
<211> 1146<211> 1146
<212> ДНК<212> DNA
<213> Vibrio sp.<213> Vibrio sp.
<400> 2<400> 2
atgaacaacg acaactccac gaccaccaac aataacgcta ttgaaatcta tgtggatcgt 60atgaacaacg acaactccac gaccaccaac aataacgcta ttgaaatcta tgtggatcgt 60
gcgaccctgc cgacgatcca gcaaatgacc aaaattgtta gccagaaaac gtctaacaaa 120gcgaccctgc cgacgatcca gcaaatgacc aaaattgtta gccagaaaac gtctaacaaa 120
aaactgatct catggtcgcg ctacccgatt accgataaaa gcctgctgaa gaaaattaac 180aaactgatct catggtcgcg ctacccgatt accgataaaa gcctgctgaa gaaaattaac 180
gcggaatttt tcaaagaaca atttgaactg acggaaagcc tgaaaaacat catcctgtct 240gcggaatttt tcaaagaaca atttgaactg acggaaagcc tgaaaaacat catcctgtct 240
gaaaacatcg ataacctgat cattcatggc aataccctgt ggagtattga tgtggttgac 300gaaaacatcg ataacctgat cattcatggc aataccctgt ggagtattga tgtggttgac 300
attatcaaag aagtcaacct gctgggcaaa aatattccga tcgaactgca cttttatgat 360attatcaaag aagtcaacct gctgggcaaa aatattccga tcgaactgca cttttatgat 360
gacggttccg ccgaatacgt tcgtatctac gaatttagta aactgccgga atccgaacag 420gacggttccg ccgaatacgt tcgtatctac gaatttagta aactgccgga atccgaacag 420
aaatacaaaa ccagcctgtc taaaaacaac atcaaattct caatcgatgg caccgactcg 480aaatacaaaa ccagcctgtc taaaaacaac atcaaattct caatcgatgg caccgactcg 480
ttcaaaaaca cgatcgaaaa catctacggt ttcagccaac tgtatccgac cacgtaccac 540ttcaaaaaca cgatcgaaaa catctacggt ttcagccaac tgtatccgac cacgtaccac 540
atgctgcgtg cagatatctt cgacaccacg ctgaaaatta acccgctgcg cgaactgctg 600atgctgcgtg cagatatctt cgacaccacg ctgaaaatta acccgctgcg cgaactgctg 600
tcaaacaaca tcaaacagat gaaatgggat tacttcaaag acttcaacta caaacaaaaa 660tcaaacaaca tcaaacagat gaaatgggat tacttcaaag acttcaacta caaacaaaaa 660
gatatctttt actcactgac caacttcaac ccgaaagaaa tccaggaaga cttcaacaaa 720gatatctttt actcactgac caacttcaac ccgaaagaaa tccaggaaga cttcaacaaa 720
aactcgaaca aaaacttcat cttcatcggc agtaactccg cgaccgccac ggcagaagaa 780aactcgaaca aaaacttcat cttcatcggc agtaactccg cgaccgccac ggcagaagaa 780
caaatcaata ttatcagcga agcgaagaaa gaaaacagca gcattatcac caattcaatt 840caaatcaata ttatcagcga agcgaagaaa gaaaacagca gcattatcac caattcaatt 840
tcggattatg acctgttttt caaaggtcat ccgtctgcca cgtttaacga acagattatc 900tcggattatg acctgttttt caaaggtcat ccgtctgcca cgtttaacga acagattatc 900
aatgcacacg atatgatcga aatcaacaac aaaatcccgt tcgaagctct gatcatgacc 960aatgcacacg atatgatcga aatcaacaac aaaatcccgt tcgaagctct gatcatgacc 960
ggcattctgc cggatgccgt tggcggtatg ggtagttccg tctttttcag tatcccgaaa 1020ggcattctgc cggatgccgt tggcggtatg ggtagttccg tctttttcag tatcccgaaa 1020
gaagtcaaaa acaaattcgt gttctataaa agtggtacgg atatcgaaaa taactccctg 1080gaagtcaaaa acaaattcgt gttctataaa agtggtacgg atatcgaaaa taactccctg 1080
attcaggtga tgctgaaact gaatctgatt aaccgcgata atattaaact gatctctgac 1140attcaggtga tgctgaaact gaatctgatt aaccgcgata atattaaact gatctctgac 1140
atttaa 1146atttaa 1146
<210> 3<210> 3
<211> 1173<211> 1173
<212> ДНК<212> DNA
<213> Photobacterium sp.<213> Photobacterium sp.
<400> 3<400> 3
atgggctgta atagcgactc caaccacaac aactccgacg gcaacatcac caaaaacaaa 60atgggctgta atagcgactc caaccacaac aactccgacg gcaacatcac caaaaacaaa 60
acgatcgaag tttatgtcga tcgtgcaacc ctgccgacga ttcagcaaat gacccagatc 120acgatcgaag tttatgtcga tcgtgcaacc ctgccgacga ttcagcaaat gacccagatc 120
atcaacgaaa atagcaacaa caaaaaactg atttcatggt cgcgctaccc gatcaatgat 180atcaacgaaa atagcaacaa caaaaaactg atttcatggt cgcgctaccc gatcaatgat 180
gaagaactgc tggaatcaat taacggctcg tttttcaaaa acaactctga actgatcaaa 240gaagaactgc tggaatcaat taacggctcg tttttcaaaa acaactctga actgatcaaa 240
agtctggatt ccatgattct gaccaatgac attaagaaag tgatcatcaa cggtaacacg 300agtctggatt ccatgattct gaccaatgac attaagaaag tgatcatcaa cggtaacacg 300
ctgtgggcgg ccgatgtggt taacatcatc aaatcaatcg aagcgttcgg caagaaaacc 360ctgtgggcgg ccgatgtggt taacatcatc aaatcaatcg aagcgttcgg caagaaaacc 360
gaaatcgaac tgaactttta tgatgacggt tcggccgaat atgtgcgtct gtacgacttt 420gaaatcgaac tgaactttta tgatgacggt tcggccgaat atgtgcgtct gtacgacttt 420
agcaaactgc cggaatctga acaggaatac aaaattagcc tgtctaaaga taacattctg 480agcaaactgc cggaatctga acaggaatac aaaattagcc tgtctaaaga taacattctg 480
agcagcatca acggcaccca gccgttcgaa aacgtcgtgg aaaacatcta cggtttcagt 540agcagcatca acggcaccca gccgttcgaa aacgtcgtgg aaaacatcta cggtttcagt 540
caactgtacc cgaccacgta ccacatgctg cgtgccgata tctttgaaac caatctgccg 600caactgtacc cgaccacgta ccacatgctg cgtgccgata tctttgaaac caatctgccg 600
ctgcgcagtc tgaaaggcgt tctgtccaac aacatcaaac agatgaaatg ggattacttc 660ctgcgcagtc tgaaaggcgt tctgtccaac aacatcaaac agatgaaatg ggattacttc 660
aaaaccttca acagccagca aaaagacaaa ttctacaact tcacgggttt taacccggat 720aaaaccttca acagccagca aaaagacaaa ttctacaact tcacgggttt taacccggat 720
gaaattatgg aacaatacaa agcaagcccg aacaaaaatt ttatcttcgt cggcaccaat 780gaaattatgg aacaatacaa agcaagcccg aacaaaaatt ttatcttcgt cggcaccaat 780
tctggcaccg caacggctga acagcaaatt gatatcctga ccgaagctaa aaacccgaac 840tctggcaccg caacggctga acagcaaatt gatatcctga ccgaagctaa aaacccgaac 840
agcccgatta tcacgaaatc gatccagggc ttcgacctgt ttttcaaagg tcatccgtct 900agcccgatta tcacgaaatc gatccagggc ttcgacctgt ttttcaaagg tcatccgtct 900
gcaacctaca acaaacaaat catcgatgct cacaacatga tcgaaatcta caacaaaatc 960gcaacctaca acaaacaaat catcgatgct cacaacatga tcgaaatcta caacaaaatc 960
ccgttcgaag cgctgatcat gaccgatgcc ctgccggatg cggtgggcgg tatgggcagc 1020ccgttcgaag cgctgatcat gaccgatgcc ctgccggatg cggtgggcgg tatgggcagc 1020
agcgtgtttt tcagcctgcc gaataccgtg gaaaacaaat tcattttcta taaatccgat 1080agcgtgtttt tcagcctgcc gaataccgtg gaaaacaaat tcattttcta taaatccgat 1080
acggacattg aaaacaatgc cctgatccag gttatgattg aactgaatat cgtgaaccgt 1140acggacattg aaaacaatgc cctgatccag gttatgattg aactgaatat cgtgaaccgt 1140
aatgatgtga aactgatctc ggacctgcaa taa 1173aatgatgtga aactgatctc ggacctgcaa taa 1173
<210> 4<210> 4
<211> 1167<211> 1167
<212> ДНК<212> DNA
<213> Pasteurella multocida<213> Pasteurella multocida
<400> 4<400> 4
atgaaaacga ttaccctgta tctggacccg gcgtccctgc cggcactgaa ccaactgatg 60atgaaaacga ttaccctgta tctggacccg gcgtccctgc cggcactgaa ccaactgatg 60
gattttacgc agaacaatga agacaaaacc catccgcgta tctttggcct gtctcgcttc 120gattttacgc agaacaatga agacaaaacc catccgcgta tctttggcct gtctcgcttc 120
aaaattccgg ataacattat cacccaatat cagaatatcc actttgttga actgaaagac 180aaaattccgg ataacattat cacccaatat cagaatatcc actttgttga actgaaagac 180
aatcgtccga cggaagccct gttcaccatt ctggatcagt acccgggtaa cattgaactg 240aatcgtccga cggaagccct gttcaccatt ctggatcagt acccgggtaa cattgaactg 240
gacatccatc tgaatattgc tcacagcgtc cagctgattc gtccgatcct ggcgtatcgc 300gacatccatc tgaatattgc tcacagcgtc cagctgattc gtccgatcct ggcgtatcgc 300
tttaaacatc tggatcgtgt gtccatccag cgcctgaacc tgtatgatga cggctcaatg 360tttaaacatc tggatcgtgt gtccatccag cgcctgaacc tgtatgatga cggctcaatg 360
gaatacgttg atctggaaaa agaagaaaac aaagacatct cggcagaaat taaacaagct 420gaatacgttg atctggaaaa agaagaaaac aaagacatct cggcagaaat taaacaagct 420
gaaaaacagc tgagccatta tctgctgacg ggtaaaatca aattcgataa cccgaccatt 480gaaaaacagc tgagccatta tctgctgacg ggtaaaatca aattcgataa cccgaccatt 480
gcgcgctacg tttggcagtc tgcctttccg gtcaaatatc acttcctgag tacggactac 540gcgcgctacg tttggcagtc tgcctttccg gtcaaatatc acttcctgag tacggactac 540
tttgaaaaag cagaatttct gcaaccgctg aaagaatatc tggcggaaaa ttaccagaaa 600tttgaaaaag cagaatttct gcaaccgctg aaagaatatc tggcggaaaa ttaccagaaa 600
atggattgga cggcctatca gcaactgacc ccggaacagc aagcatttta cctgaccctg 660atggattgga cggcctatca gcaactgacc ccggaacagc aagcatttta cctgaccctg 660
gttggcttca acgacgaagt caaacagagt ctggaagtgc agcaagcgaa atttattttc 720gttggcttca acgacgaagt caaacagagt ctggaagtgc agcaagcgaa atttattttc 720
acgggcacca cgacctggga aggtaatacc gatgttcgtg aatattacgc ccagcaacag 780acgggcacca cgacctggga aggtaatacc gatgttcgtg aatattacgc ccagcaacag 780
ctgaacctgc tgaatcattt tacccaggcg ggcggcgacc tgtttattgg tgaccattac 840ctgaacctgc tgaatcattt tacccaggcg ggcggcgacc tgtttattgg tgaccattac 840
aaaatttact tcaaaggtca cccgcgcggc ggtgaaatca acgattacat cctgaacaac 900aaaatttact tcaaaggtca cccgcgcggc ggtgaaatca acgattacat cctgaacaac 900
gcaaaaaaca tcacgaatat cccggctaat atctctttcg aagtgctgat gatgaccggc 960gcaaaaaaca tcacgaatat cccggctaat atctctttcg aagtgctgat gatgaccggc 960
ctgctgccgg ataaagtcgg cggtgtggct agctctctgt acttcagtct gccgaaagaa 1020ctgctgccgg ataaagtcgg cggtgtggct agctctctgt acttcagtct gccgaaagaa 1020
aaaattagtc acatcatctt caccagcaac aaacaggtca aatcaaaaga agatgccctg 1080aaaattagtc acatcatctt caccagcaac aaacaggtca aatcaaaaga agatgccctg 1080
aacaatccgt acgtgaaagt tatgcgtcgc ctgggtatta tcgatgaatc gcaagtgatc 1140aacaatccgt acgtgaaagt tatgcgtcgc ctgggtatta tcgatgaatc gcaagtgatc 1140
ttttgggaca gcctgaaaca gctgtaa 1167ttttgggaca gcctgaaaca gctgtaa 1167
<210> 5<210> 5
<211> 1116<211> 1116
<212> ДНК<212> DNA
<213> Neisseria meningitidis<213> Neisseria meningitidis
<400> 5<400> 5
atgggcctga aaaaagcctg cctgaccgtg ctgtgtctga tcgtgttttg cttcggcatc 60atgggcctga aaaaagcctg cctgaccgtg ctgtgtctga tcgtgttttg cttcggcatc 60
ttttatacgt tcgatcgtgt gaaccagggt gaacgcaatg cagttagtct gctgaaagaa 120ttttatacgt tcgatcgtgt gaaccagggt gaacgcaatg cagttagtct gctgaaagaa 120
aaactgttta acgaagaagg cgaaccggtg aatctgatct tctgttacac cattctgcaa 180aaactgttta acgaagaagg cgaaccggtg aatctgatct tctgttacac cattctgcaa 180
atgaaagttg ccgaacgtat tatggcacag catccgggtg aacgctttta tgtggttctg 240atgaaagttg ccgaacgtat tatggcacag catccgggtg aacgctttta tgtggttctg 240
atgagcgaaa accgtaacga aaaatacgat tactacttca accagatcaa agataaagcg 300atgagcgaaa accgtaacga aaaatacgat tactacttca accagatcaa agataaagcg 300
gaacgcgcct atttctttca cctgccgtac ggcctgaaca aaagttttaa tttcattccg 360gaacgcgcct atttctttca cctgccgtac ggcctgaaca aaagttttaa tttcattccg 360
acgatggcgg aactgaaagt gaaaagcatg ctgctgccga aagttaaacg tatctatctg 420acgatggcgg aactgaaagt gaaaagcatg ctgctgccga aagttaaacg tatctatctg 420
gcaagcctgg aaaaagtgtc tattgcggcc tttctgagca cctacccgga tgcggaaatc 480gcaagcctgg aaaaagtgtc tattgcggcc tttctgagca cctacccgga tgcggaaatc 480
aaaaccttcg atgatggcac gggtaatctg attcagagct ctagttatct gggcgatgaa 540aaaaccttcg atgatggcac gggtaatctg attcagagct ctagttatct gggcgatgaa 540
ttttctgtta acggtacgat caaacgtaat ttcgcccgca tgatgatcgg tgattggtct 600ttttctgtta acggtacgat caaacgtaat ttcgcccgca tgatgatcgg tgattggtct 600
attgcgaaaa cccgcaacgc cagtgatgaa cattacacga tcttcaaagg cctgaaaaac 660attgcgaaaa cccgcaacgc cagtgatgaa cattacacga tcttcaaagg cctgaaaaac 660
atcatggatg atggtcgtcg caaaatgacc tacctgccgc tgttcgatgc gtctgaactg 720atcatggatg atggtcgtcg caaaatgacc tacctgccgc tgttcgatgc gtctgaactg 720
aaaacgggcg atgaaaccgg cggtacggtg cgtattctgc tgggtagccc ggataaagaa 780aaaacgggcg atgaaaccgg cggtacggtg cgtattctgc tgggtagccc ggataaagaa 780
atgaaagaaa tctctgaaaa agcagcgaaa aacttcaaaa tccagtatgt tgccccgcac 840atgaaagaaa tctctgaaaa agcagcgaaa aacttcaaaa tccagtatgt tgccccgcac 840
ccgcgtcaga cctacggcct gagtggtgtg accacgctga acagcccgta tgttattgaa 900ccgcgtcaga cctacggcct gagtggtgtg accacgctga acagcccgta tgttattgaa 900
gattacatcc tgcgtgaaat taagaaaaac ccgcataccc gctatgaaat ctacacgttt 960gattacatcc tgcgtgaaat taagaaaaac ccgcataccc gctatgaaat ctacacgttt 960
ttcagcggcg ccgcactgac catgaaagat tttccgaacg tgcacgttta tgcactgaaa 1020ttcagcggcg ccgcactgac catgaaagat tttccgaacg tgcacgttta tgcactgaaa 1020
ccggcgtctc tgccggaaga ttattggctg aaaccggtgt acgcgctgtt tacccagagt 1080ccggcgtctc tgccggaaga ttattggctg aaaccggtgt acgcgctgtt tacccagagt 1080
ggtattccga tcctgacgtt cgatgataaa aattaa 1116ggtattccga tcctgacgtt cgatgataaa aattaa 1116
<210> 6<210> 6
<211> 852<211> 852
<212> ДНК<212> DNA
<213> Pasteurella multocida<213> Pasteurella multocida
<400> 6<400> 6
atggataaat ttgcagaaca tgaaattccg aaagcagtga tcgttgctgg caacggtgaa 60atggataaat ttgcagaaca tgaaattccg aaagcagtga tcgttgctgg caacggtgaa 60
agtctgtccc agattgatta tcgtctgctg ccgaaaaact acgacgtctt ccgttgcaac 120agtctgtccc agattgatta tcgtctgctg ccgaaaaact acgacgtctt ccgttgcaac 120
caattctact tcgaagaacg ctacttcctg ggcaataaaa tcaaagccgt gtttttcacc 180caattctact tcgaagaacg ctacttcctg ggcaataaaa tcaaagccgt gtttttcacc 180
ccgggtgttt ttctggaaca gtattacacg ctgtatcatc tgaaacgcaa caatgaatac 240ccgggtgttt ttctggaaca gtattacacg ctgtatcatc tgaaacgcaa caatgaatac 240
tttgtcgata acgtgattct gagctctttc aatcacccga ccgtggacct ggaaaaatca 300tttgtcgata acgtgattct gagctctttc aatcacccga ccgtggacct ggaaaaatca 300
cagaaaatcc aagcactgtt catcgatgtt atcaacggct acgaaaaata cctgtcgaaa 360cagaaaatcc aagcactgtt catcgatgtt atcaacggct acgaaaaata cctgtcgaaa 360
ctgaccgctt tcgatgttta tctgcgttac aaagaactgt atgaaaatca gcgcattacg 420ctgaccgctt tcgatgttta tctgcgttac aaagaactgt atgaaaatca gcgcattacg 420
agcggtgttt acatgtgcgc tgtcgcgatc gccatgggct ataccgatat ttacctgacg 480agcggtgttt acatgtgcgc tgtcgcgatc gccatgggct ataccgatat ttacctgacg 480
ggtatcgact tttatcaagc gtctgaagaa aactacgcct tcgataacaa aaaaccgaat 540ggtatcgact tttatcaagc gtctgaagaa aactacgcct tcgataacaa aaaaccgaat 540
attatccgtc tgctgccgga ctttcgcaaa gaaaaaaccc tgttcagcta tcattctaaa 600attatccgtc tgctgccgga ctttcgcaaa gaaaaaaccc tgttcagcta tcattctaaa 600
gatattgacc tggaagcgct gtcatttctg cagcaacatt accacgtgaa cttctactca 660gatattgacc tggaagcgct gtcatttctg cagcaacatt accacgtgaa cttctactca 660
atctcgccga tgagtccgct gtccaaacat tttccgatcc cgacggttga agatgactgt 720atctcgccga tgagtccgct gtccaaacat tttccgatcc cgacggttga agatgactgt 720
gaaaccacgt tcgtcgcccc gctgaaagaa aactatatta atgacatcct gctgccgccg 780gaaaccacgt tcgtcgcccc gctgaaagaa aactatatta atgacatcct gctgccgccg 780
cactttgtct atgaaaaact gggcgtggat aaactggcgg ccgcactgga acatcaccat 840cactttgtct atgaaaaact gggcgtggat aaactggcgg ccgcactgga acatcaccat 840
caccatcact aa 852caccatcact aa 852
<210> 7<210> 7
<211> 1158<211> 1158
<212> ДНК<212> DNA
<213> Pasteurella dagmatis<213> Pasteurella dagmatis
<400> 7<400> 7
atgaccattt acctggaccc ggcgtctctg ccgaccctga accaactgat gcattttacg 60atgaccattt acctggaccc ggcgtctctg ccgaccctga accaactgat gcattttacg 60
aaagaaagcg aagacaaaga aaccgcacgt atttttggct tctctcgctt taaactgccg 120aaagaaagcg aagacaaaga aaccgcacgt atttttggct tctctcgctt taaactgccg 120
gaaaaaatca cggaacagta caacaacatc catttcgtgg aaatcaaaaa caatcgtccg 180gaaaaaatca cggaacagta caacaacatc catttcgtgg aaatcaaaaa caatcgtccg 180
acggaagata ttttcaccat cctggaccag tacccggaaa aactggaact ggatctgcat 240acggaagata ttttcaccat cctggaccag tacccggaaa aactggaact ggatctgcat 240
ctgaacattg cacacagcat ccagctgttt catccgattc tgcaatatcg tttcaaacac 300ctgaacattg cacacagcat ccagctgttt catccgattc tgcaatatcg tttcaaacac 300
ccggatcgca ttagtatcaa atccctgaac ctgtatgatg acggcaccat ggaatacgtt 360ccggatcgca ttagtatcaa atccctgaac ctgtatgatg acggcaccat ggaatacgtt 360
gatctggaaa aagaagaaaa caaagacatc aaaagtgcga tcaaaaaagc cgaaaaacag 420gatctggaaa aagaagaaaa caaagacatc aaaagtgcga tcaaaaaagc cgaaaaacag 420
ctgtccgatt atctgctgac gggtaaaatt aactttgaca atccgaccct ggcacgctac 480ctgtccgatt atctgctgac gggtaaaatt aactttgaca atccgaccct ggcacgctac 480
gtttggcagt cacaatatcc ggtcaaatac catttcctgt cgacggaata ttttgaaaaa 540gtttggcagt cacaatatcc ggtcaaatac catttcctgt cgacggaata ttttgaaaaa 540
gctgaattcc tgcagccgct gaaaacctat ctggcgggca aataccaaaa aatggattgg 600gctgaattcc tgcagccgct gaaaacctat ctggcgggca aataccaaaa aatggattgg 600
tcagcctatg aaaaactgtc gccggaacag caaacgtttt acctgaaact ggtcggtttc 660tcagcctatg aaaaactgtc gccggaacag caaacgtttt acctgaaact ggtcggtttc 660
agtgatgaaa ccaaacagct gtttcacacg gaacaaacca aatttatttt cacgggcacc 720agtgatgaaa ccaaacagct gtttcacacg gaacaaacca aatttatttt cacgggcacc 720
acgacctggg agggtaacac cgatatccgt gaatattacg cgaaacagca actgaatctg 780acgacctggg agggtaacac cgatatccgt gaatattacg cgaaacagca actgaatctg 780
ctgaaacatt ttacccacag cgaaggcgac ctgtttatcg gtgaccagta caaaatctac 840ctgaaacatt ttacccacag cgaaggcgac ctgtttatcg gtgaccagta caaaatctac 840
ttcaaaggcc atccgcgcgg cggtgatatt aacgactata tcctgaaaca cgcaaaagat 900ttcaaaggcc atccgcgcgg cggtgatatt aacgactata tcctgaaaca cgcaaaagat 900
attacgaaca tcccggctaa tattagcttc gaaatcctga tgatgaccgg tctgctgccg 960attacgaaca tcccggctaa tattagcttc gaaatcctga tgatgaccgg tctgctgccg 960
gacaaagtcg gcggtgtggc gagctctctg tacttctctc tgccgaaaga aaaaatcagc 1020gacaaagtcg gcggtgtggc gagctctctg tacttctctc tgccgaaaga aaaaatcagc 1020
cacattatct tcacctctaa caagaaaatt aaaaacaaag aagatgccct gaatgacccg 1080cacattatct tcacctctaa caagaaaatt aaaaacaaag aagatgccct gaatgacccg 1080
tacgtgcgtg ttatgctgcg tctgggtatg attgacaaaa gccaaattat cttctgggat 1140tacgtgcgtg ttatgctgcg tctgggtatg attgacaaaa gccaaattat cttctgggat 1140
tctctgaaac aactgtaa 1158tctctgaaac aactgtaa 1158
<210> 8<210> 8
<211> 1173<211> 1173
<212> ДНК<212> DNA
<213> Photobacterium phosphoreum<213> Photobacterium phosphoreum
<400> 8<400> 8
atgggctgta actccgatag caaacacaat aacagtgatg gcaatattac caaaaacaaa 60atgggctgta actccgatag caaacacaat aacagtgatg gcaatattac caaaaacaaa 60
acgatcgaag tctatgtgga ccgtgcgacc ctgccgacga ttcagcaaat gacccagatc 120acgatcgaag tctatgtgga ccgtgcgacc ctgccgacga ttcagcaaat gacccagatc 120
atcaacgaaa atagcaacaa caaaaaactg atttcatggt cgcgttaccc gatcaatgat 180atcaacgaaa atagcaacaa caaaaaactg atttcatggt cgcgttaccc gatcaatgat 180
gaaacgctgc tggaatcaat taatggctcg tttttcaaaa accgcccgga actgatcaaa 240gaaacgctgc tggaatcaat taatggctcg tttttcaaaa accgcccgga actgatcaaa 240
agtctggatt ccatgattct gaccaacgaa attaagaaag tgatcatcaa cggtaacacg 300agtctggatt ccatgattct gaccaacgaa attaagaaag tgatcatcaa cggtaacacg 300
ctgtgggcag ttgacgtggt taatattatc aaaagcattg aagctctggg caagaaaacc 360ctgtgggcag ttgacgtggt taatattatc aaaagcattg aagctctggg caagaaaacc 360
gaaatcgaac tgaacttcta tgatgacggt tctgcggaat atgtgcgtct gtacgatttt 420gaaatcgaac tgaacttcta tgatgacggt tctgcggaat atgtgcgtct gtacgatttt 420
agccgcctgc cggaatctga acaggaatac aaaattagcc tgtctaaaga taacattcag 480agccgcctgc cggaatctga acaggaatac aaaattagcc tgtctaaaga taacattcag 480
agcagcatca acggcaccca accgttcgac aacagcatcg aaaacatcta cggtttctct 540agcagcatca acggcaccca accgttcgac aacagcatcg aaaacatcta cggtttctct 540
cagctgtatc cgaccacgta ccacatgctg cgtgccgata tctttgaaac caatctgccg 600cagctgtatc cgaccacgta ccacatgctg cgtgccgata tctttgaaac caatctgccg 600
ctgacgagtc tgaaacgcgt tatctccaac aacatcaaac agatgaaatg ggattacttc 660ctgacgagtc tgaaacgcgt tatctccaac aacatcaaac agatgaaatg ggattacttc 660
accacgttca attcccagca gaaaaacaaa ttttacaact tcaccggctt caacccggaa 720accacgttca attcccagca gaaaaacaaa ttttacaact tcaccggctt caacccggaa 720
aaaatcaaag aacaatacaa agcgagtccg cacgaaaatt ttattttcat tggcaccaac 780aaaatcaaag aacaatacaa agcgagtccg cacgaaaatt ttattttcat tggcaccaac 780
tccggcaccg ccaccgcaga acagcaaatt gatatcctga ccgaagccaa aaaaccggac 840tccggcaccg ccaccgcaga acagcaaatt gatatcctga ccgaagccaa aaaaccggac 840
tcaccgatta tcaccaacag cattcagggc ctggacctgt ttttcaaagg tcatccgtct 900tcaccgatta tcaccaacag cattcagggc ctggacctgt ttttcaaagg tcatccgtct 900
gcgacctata accagcaaat tatcgacgcc cacaacatga tcgaaatcta caacaaaatc 960gcgacctata accagcaaat tatcgacgcc cacaacatga tcgaaatcta caacaaaatc 960
ccgttcgaag cactgatcat gaccgatgca ctgccggacg ctgttggcgg tatgggtagt 1020ccgttcgaag cactgatcat gaccgatgca ctgccggacg ctgttggcgg tatgggtagt 1020
tccgtctttt tctcactgcc gaataccgtc gaaaacaaat tcattttcta taaatcggat 1080tccgtctttt tctcactgcc gaataccgtc gaaaacaaat tcattttcta taaatcggat 1080
acggacattg aaaacaatgc tctgatccag gttatgatcg aactgaatat cgtgaaccgc 1140acggacattg aaaacaatgc tctgatccag gttatgatcg aactgaatat cgtgaaccgc 1140
aatgatgtga aactgattag tgacctgcaa taa 1173aatgatgtga aactgattag tgacctgcaa taa 1173
<210> 9<210> 9
<211> 1254<211> 1254
<212> ДНК<212> DNA
<213> Avibacterium paragallinarum<213> Avibacterium paragallinarum
<400> 9<400> 9
atgcgtaaaa tcatcacctt cttcagcctg ttcttctcga tctcagcgtg gtgtcaaaaa 60atgcgtaaaa tcatcacctt cttcagcctg ttcttctcga tctcagcgtg gtgtcaaaaa 60
atggaaatct acctggacta tgcgtcgctg ccgagcctga acatgatcct gaacctggtt 120atggaaatct acctggacta tgcgtcgctg ccgagcctga acatgatcct gaacctggtt 120
gaaaacaaaa acaacgaaaa agtcgaacgt attatcggct tcgaacgctt tgatttcaac 180gaaaacaaaa acaacgaaaa agtcgaacgt attatcggct tcgaacgctt tgatttcaac 180
aaagaaattc tgaatagctt ctctaaagaa cgtatcgaat ttagtaaagt ctccattctg 240aaagaaattc tgaatagctt ctctaaagaa cgtatcgaat ttagtaaagt ctccattctg 240
gatatcaaag aattttcaga caaactgtac ctgaacattg aaaaatcgga tacgccggtg 300gatatcaaag aattttcaga caaactgtac ctgaacattg aaaaatcgga tacgccggtg 300
gacctgatta tccataccaa tctggatcac tcagttcgtt cgctgctgag catctttaaa 360gacctgatta tccataccaa tctggatcac tcagttcgtt cgctgctgag catctttaaa 360
accctgagtc cgctgttcca taaaatcaac atcgaaaaac tgtacctgta cgatgacggc 420accctgagtc cgctgttcca taaaatcaac atcgaaaaac tgtacctgta cgatgacggc 420
agcggtaact atgttgatct gtaccagcac cgccaagaaa atatttctgc gattctgatc 480agcggtaact atgttgatct gtaccagcac cgccaagaaa atatttctgc gattctgatc 480
gaagcccaga aaaaactgaa agacgcgctg gaaaatcgtg aaacggatac cgacaaactg 540gaagcccaga aaaaactgaa agacgcgctg gaaaatcgtg aaacggatac cgacaaactg 540
catagcctga cgcgctatac ctggcacaaa atctttccga cggaatatat cctgctgcgt 600catagcctga cgcgctatac ctggcacaaa atctttccga cggaatatat cctgctgcgt 600
ccggattacc tggatattga cgaaaaaatg caaccgctga aacatttcct gagcgatacc 660ccggattacc tggatattga cgaaaaaatg caaccgctga aacatttcct gagcgatacc 660
atcgtgtcta tggacctgtc tcgctttagt catttctcca aaaaccagaa agaactgttt 720atcgtgtcta tggacctgtc tcgctttagt catttctcca aaaaccagaa agaactgttt 720
ctgaaaatca cgcacttcga tcaaaacatc ttcaacgaac tgaacatcgg caccaaaaac 780ctgaaaatca cgcacttcga tcaaaacatc ttcaacgaac tgaacatcgg caccaaaaac 780
aaagaataca aaacgttcat cttcaccggc accacgacct gggaaaaaga taagaaaaaa 840aaagaataca aaacgttcat cttcaccggc accacgacct gggaaaaaga taagaaaaaa 840
cgtctgaaca acgcgaaact gcagacggaa attctggaat cttttatcaa accgaacggc 900cgtctgaaca acgcgaaact gcagacggaa attctggaat cttttatcaa accgaacggc 900
aaattctacc tgggtaacga tatcaaaatc tttttcaaag gccacccgaa aggtgatgac 960aaattctacc tgggtaacga tatcaaaatc tttttcaaag gccacccgaa aggtgatgac 960
attaacgact acattatccg caaaaccggc gcagaaaaaa ttccggctaa catcccgttt 1020attaacgact acattatccg caaaaccggc gcagaaaaaa ttccggctaa catcccgttt 1020
gaagttctga tgatgacgaa tagtctgccg gattatgtcg gcggtattat gagtaccgtg 1080gaagttctga tgatgacgaa tagtctgccg gattatgtcg gcggtattat gagtaccgtg 1080
tacttttccc tgccgccgaa aaatattgat aaagtggttt tcctgggttc cgaaaaaatc 1140tacttttccc tgccgccgaa aaatattgat aaagtggttt tcctgggttc cgaaaaaatc 1140
aaaaacgaaa acgacgccaa atcacagacc ctgtcgaaac tgatgctgat gctgaacgtc 1200aaaaacgaaa acgacgccaa atcacagacc ctgtcgaaac tgatgctgat gctgaacgtc 1200
atcacgccgg aacagatttt ctttgaagaa atgccgaacc cgattaactt ttaa 1254atcacgccgg aacagatttt ctttgaagaa atgccgaacc cgattaactt ttaa 1254
<210> 10<210> 10
<211> 1293<211> 1293
<212> ДНК<212> DNA
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 10<400> 10
atgacccgca cccgtatgga aaacgaactg attgtgagca aaaacatgca gaacattatt 60atgacccgca cccgtatgga aaacgaactg attgtgagca aaaacatgca gaacattatt 60
atcgccggta acggtccgag cctgaaaaat attaactata aacgtctgcc gcgcgaatac 120atcgccggta acggtccgag cctgaaaaat attaactata aacgtctgcc gcgcgaatac 120
gatgtgttcc gttgcaacca gttctacttc gaagacaaat actacctggg caagaaaatt 180gatgtgttcc gttgcaacca gttctacttc gaagacaaat actacctggg caagaaaatt 180
aaagccgtgt ttttcaatcc gggcgtgttt ctgcaacaat atcataccgc aaaacagctg 240aaagccgtgt ttttcaatcc gggcgtgttt ctgcaacaat atcataccgc aaaacagctg 240
attctgaaaa acgaatacga aatcaaaaac atcttttgta gcaccttcaa tctgccgttt 300attctgaaaa acgaatacga aatcaaaaac atcttttgta gcaccttcaa tctgccgttt 300
atcgaatcta acgatttcct gcaccaattt tataactttt tcccggacgc taaactgggc 360atcgaatcta acgatttcct gcaccaattt tataactttt tcccggacgc taaactgggc 360
tacgaagtca tcgaaaacct gaaagaattt tacgcgtaca tcaaatacaa cgaaatctac 420tacgaagtca tcgaaaacct gaaagaattt tacgcgtaca tcaaatacaa cgaaatctac 420
ttcaacaaac gcatcacctc tggcgtgtat atgtgcgcga ttgccatcgc actgggttat 480ttcaacaaac gcatcacctc tggcgtgtat atgtgcgcga ttgccatcgc actgggttat 480
aaaacgattt acctgtgtgg catcgatttc tatgaaggtg acgttattta cccgtttgaa 540aaaacgattt acctgtgtgg catcgatttc tatgaaggtg acgttattta cccgtttgaa 540
gcaatgagta ccaacattaa aacgatcttc ccgggtatca aagatttcaa accgagtaac 600gcaatgagta ccaacattaa aacgatcttc ccgggtatca aagatttcaa accgagtaac 600
tgccattcca aagaatatga catcgaagcg ctgaaactgc tgaaaagcat ctacaaagtt 660tgccattcca aagaatatga catcgaagcg ctgaaactgc tgaaaagcat ctacaaagtt 660
aacatctacg ccctgtgtga tgacagtatt ctggcaaatc atttcccgct gtccattaac 720aacatctacg ccctgtgtga tgacagtatt ctggcaaatc atttcccgct gtccattaac 720
atcaacaaca acttcaccct ggaaaacaaa cacaacaact caatcaacga tattctgctg 780atcaacaaca acttcaccct ggaaaacaaa cacaacaact caatcaacga tattctgctg 780
accgacaata cgccgggcgt ctcgttttat aaaaatcagc tgaaagccga taacaaaatc 840accgacaata cgccgggcgt ctcgttttat aaaaatcagc tgaaagccga taacaaaatc 840
atgctgaact tctacaacat cctgcatagc aaagataacc tgatcaaatt cctgaacaaa 900atgctgaact tctacaacat cctgcatagc aaagataacc tgatcaaatt cctgaacaaa 900
gaaatcgctg ttctgaaaaa acagaccacg caacgtgcta aagcgcgcat tcagaaccac 960gaaatcgctg ttctgaaaaa acagaccacg caacgtgcta aagcgcgcat tcagaaccac 960
ctgagctata aactgggcca agccctgatt atcaatagca aatctgtcct gggtttcctg 1020ctgagctata aactgggcca agccctgatt atcaatagca aatctgtcct gggtttcctg 1020
tctctgccgt ttattatcct gtcaattgtg atctcgcaca aacaggaaca aaaagcgtat 1080tctctgccgt ttattatcct gtcaattgtg atctcgcaca aacaggaaca aaaagcgtat 1080
aaattcaaag tgaagaaaaa cccgaacctg gcactgccgc cgctggaaac ctatccggat 1140aaattcaaag tgaagaaaaa cccgaacctg gcactgccgc cgctggaaac ctatccggat 1140
tacaacgaag ccctgaaaga aaaagaatgc ttcacgtaca aactgggcga agaatttatc 1200tacaacgaag ccctgaaaga aaaagaatgc ttcacgtaca aactgggcga agaatttatc 1200
aaagcaggta aaaactggta tggcgaaggt tacatcaaat ttatcttcaa agatgttccg 1260aaagcaggta aaaactggta tggcgaaggt tacatcaaat ttatcttcaa agatgttccg 1260
cgtctgaaac gtgaatttga aaaaggcgaa taa 1293cgtctgaaac gtgaatttga aaaaggcgaa taa 1293
<210> 11<210> 11
<211> 1188<211> 1188
<212> ДНК<212> DNA
<213> Heliobacter acinonychis<213> Heliobacter acinonychis
<400> 11<400> 11
atgaataaga aaccgctgat tattgctggc aacgggccaa gcatcaaaga cttagattat 60atgaataaga aaccgctgat tattgctggc aacgggccaa gcatcaaaga cttagattat 60
gcgttgttcc cgaaagactt tgatgtattc cgatgtaatc aattctactt cgaggacaaa 120gcgttgttcc cgaaagactt tgatgtattc cgatgtaatc aattctactt cgaggacaaa 120
tactatttag ggcgggaaat aaaaggggtg ttctttaacg cgcacgtctt cgatctccaa 180tactatttag ggcgggaaat aaaaggggtg ttctttaacg cgcacgtctt cgatctccaa 180
atgaagatca ctaaagccat agtcaaaaac ggggaatatc acccggacca catatattgc 240atgaagatca ctaaagccat agtcaaaaac ggggaatatc acccggacca catatattgc 240
acacatgtcg aaccgtacgg ttacgttaac ggaaaccagc aactcatgca agagtacctg 300acacatgtcg aaccgtacgg ttacgttaac ggaaaccagc aactcatgca agagtacctg 300
gaaaaacatt ttgtgggagt ccgaagcacg tacgcatacc tgaaagatct agagccattc 360gaaaaacatt ttgtgggagt ccgaagcacg tacgcatacc tgaaagatct agagccattc 360
tttattctgc acagtaagta tcgcaacttc tacgaccagc acttcacaac gggcatcatg 420tttattctgc acagtaagta tcgcaacttc tacgaccagc acttcacaac gggcatcatg 420
atgctactgg tggccatcca attgggatac aaagaaatat acctgtgcgg aatagacttc 480atgctactgg tggccatcca attgggatac aaagaaatat acctgtgcgg aatagacttc 480
tacgaaaacg gattcggaca tttctacgag aaccaagggg gattctttga agaggatagc 540tacgaaaacg gattcggaca tttctacgag aaccaagggg gattctttga agaggatagc 540
gatccgatgc acgataagaa catagacatc caagcactgg aactggcaaa gaaatacgcg 600gatccgatgc acgataagaa catagacatc caagcactgg aactggcaaa gaaatacgcg 600
aaaatctacg cactggtacc gaacagcgcc ctagtgaaaa tgattccgtt gagcagccaa 660aaaatctacg cactggtacc gaacagcgcc ctagtgaaaa tgattccgtt gagcagccaa 660
aaaggagttc tggaaaaggt gaaggaccgg atcgggttgg gcgagtttaa gagagagaaa 720aaaggagttc tggaaaaggt gaaggaccgg atcgggttgg gcgagtttaa gagagagaaa 720
ttcgggcaaa aagaattgga aagacagaag gaattagaac gacaaaaaga gctcgaacgc 780ttcgggcaaa aagaattgga aagacagaag gaattagaac gacaaaaaga gctcgaacgc 780
caaaaggagc ttgaacgtca aaaggaactt gaacgacaaa aagagttgga gaggcagaaa 840caaaaggagc ttgaacgtca aaaggaactt gaacgacaaa aagagttgga gaggcagaaa 840
gaactcgaac gccaaaaaga attagagaga cagaaggaat tagagcgcca aaaggagctt 900gaactcgaac gccaaaaaga attagagaga cagaaggaat tagagcgcca aaaggagctt 900
gagcgtcaaa aagaattaga gaggcagaag gagttagaaa ggcagaaaga actggagaga 960gagcgtcaaa aagaattaga gaggcagaag gagttagaaa ggcagaaaga actggagaga 960
cagaaagaac tcgaaaggca gaaggagttg gaacgccaaa aagaactaga attagaacga 1020cagaaagaac tcgaaaggca gaaggagttg gaacgccaaa aagaactaga attagaacga 1020
tccttaaaag cacgattgaa agcggtactc gcgagcaaag gcatccgcgg cgacaacctg 1080tccttaaaag cacgattgaa agcggtactc gcgagcaaag gcatccgcgg cgacaacctg 1080
ataatcgtaa gtttaaaaga cacctaccga ctgtttaaag ggggatttgc gttactcttg 1140ataatcgtaa gtttaaaaga cacctaccga ctgtttaaag ggggatttgc gttactcttg 1140
gacctgaagg cgctaaagtc aatcattaaa gcattcctga agagataa 1188gacctgaagg cgctaaagtc aatcattaaa gcattcctga agagataa 1188
<210> 12<210> 12
<211> 783<211> 783
<212> ДНК<212> DNA
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 12<400> 12
atgggcaaaa aagtgattat tgcgggcaac ggcccgagcc tgaaagaaat tgattatagc 60atgggcaaaa aagtgattat tgcgggcaac ggcccgagcc tgaaagaaat tgattatagc 60
cgtctgccga acgattttga tgtgtttcgc tgcaaccagt tttatttcga agataaatat 120cgtctgccga acgattttga tgtgtttcgc tgcaaccagt tttatttcga agataaatat 120
tacctgggca aaaaatgcaa agcggtgttc tataatccga tcctgttctt cgaacagtat 180tacctgggca aaaaatgcaa agcggtgttc tataatccga tcctgttctt cgaacagtat 180
tacaccctga aacatctgat tcagaaccag gaatatgaaa ccgaactgat catgtgcagc 240tacaccctga aacatctgat tcagaaccag gaatatgaaa ccgaactgat catgtgcagc 240
aactataacc aggcgcatct ggaaaacgaa aactttgtga aaaccttcta cgattatttt 300aactataacc aggcgcatct ggaaaacgaa aactttgtga aaaccttcta cgattatttt 300
ccggatgcgc atctgggcta tgattttttc aaacagctga aagatttcaa cgcgtacttc 360ccggatgcgc atctgggcta tgattttttc aaacagctga aagatttcaa cgcgtacttc 360
aaattccacg aaatctattt caaccagcgt attaccagcg gcgtgtatat gtgcgcggtg 420aaattccacg aaatctattt caaccagcgt attaccagcg gcgtgtatat gtgcgcggtg 420
gcgattgcgc tgggctataa agaaatttat ctgagcggca tcgattttta tcagaacggc 480gcgattgcgc tgggctataa agaaatttat ctgagcggca tcgattttta tcagaacggc 480
agcagctatg cgtttgatac caaacagaaa aacctgctga aactggcccc gaactttaaa 540agcagctatg cgtttgatac caaacagaaa aacctgctga aactggcccc gaactttaaa 540
aacgataaca gccactatat tggccatagc aaaaacaccg atatcaaagc gctggaattt 600aacgataaca gccactatat tggccatagc aaaaacaccg atatcaaagc gctggaattt 600
ctggaaaaaa cctataaaat caaactgtat tgcctgtgcc cgaacagcct gctggccaac 660ctggaaaaaa cctataaaat caaactgtat tgcctgtgcc cgaacagcct gctggccaac 660
tttattgaac tggcaccgaa tctgaacagc aacttcatca tccaggaaaa aaacaactat 720tttattgaac tggcaccgaa tctgaacagc aacttcatca tccaggaaaa aaacaactat 720
accaaagata ttctgattcc gagcagcgaa gcgtatggca aattcagcaa aaacatcaac 780accaaagata ttctgattcc gagcagcgaa gcgtatggca aattcagcaa aaacatcaac 780
taa 783taa 783
<210> 13<210> 13
<211> 897<211> 897
<212> ДНК<212> DNA
<213> Streptococcus entericus<213> Streptococcus entericus
<400> 13<400> 13
atgaagaaag tctacttctg ccatacggtc taccatctgc tgattaccct gtgcaaaatt 60atgaagaaag tctacttctg ccatacggtc taccatctgc tgattaccct gtgcaaaatt 60
agcgttgaag aacaagttga aattattgtg ttcgataccg ttagtaatca tgaactgatt 120agcgttgaag aacaagttga aattattgtg ttcgataccg ttagtaatca tgaactgatt 120
gtccagaaaa tccgcgacgt gtttgttaac accacggtgc tgttcgcaga acaaaatacc 180gtccagaaaa tccgcgacgt gtttgttaac accacggtgc tgttcgcaga acaaaatacc 180
gatttttcca ttctggaaat cgatcgcgct acggacattt atgtgttcaa cgactggacc 240gatttttcca ttctggaaat cgatcgcgct acggacattt atgtgttcaa cgactggacc 240
ccgatcggcg cgtatctgcg taaaaacaaa ctgttttacc atctgatcga agatggttat 300ccgatcggcg cgtatctgcg taaaaacaaa ctgttttacc atctgatcga agatggttat 300
aactaccacg aatataacgt ttacgcgaat gccctgacca tgaaacgtcg cctgctgaac 360aactaccacg aatataacgt ttacgcgaat gccctgacca tgaaacgtcg cctgctgaac 360
ttcgtgctgc gtcgcgaaga accgtcaggc ttttcgcgtt atgttcgcag cattgaagtt 420ttcgtgctgc gtcgcgaaga accgtcaggc ttttcgcgtt atgttcgcag cattgaagtt 420
aaccgtgtca aatacctgcc gaatgattgc cgcaaaagca aatgggttga aaaaccgcgt 480aaccgtgtca aatacctgcc gaatgattgc cgcaaaagca aatgggttga aaaaccgcgt 480
tctgccctgt tcgaaaatct ggtcccggaa cataaacaga aaatcatcac gatcttcggc 540tctgccctgt tcgaaaatct ggtcccggaa cataaacaga aaatcatcac gatcttcggc 540
ctggaaaact atcaagatag cctgcgcggt gtcctggtgc tgacccagcc gctggtgcaa 600ctggaaaact atcaagatag cctgcgcggt gtcctggtgc tgacccagcc gctggtgcaa 600
gactactggg atcgcgacat taccacggaa gaagaacagc tggaatttta tcgtcaaatc 660gactactggg atcgcgacat taccacggaa gaagaacagc tggaatttta tcgtcaaatc 660
gtggaatctt acggcgaagg tgaacaggtg tttttcaaaa ttcacccgcg tgataaagtt 720gtggaatctt acggcgaagg tgaacaggtg tttttcaaaa ttcacccgcg tgataaagtt 720
gactatagct ctctgaccaa cgtcattttt ctgaagaaaa acgtcccgat ggaagtgtac 780gactatagct ctctgaccaa cgtcattttt ctgaagaaaa acgtcccgat ggaagtgtac 780
gaactgattg ccgattgtca ttttaccaaa ggtatcacgc acagttccac cgcactggac 840gaactgattg ccgattgtca ttttaccaaa ggtatcacgc acagttccac cgcactggac 840
ttcctgtcct gtgtggataa gaaaatcacc ctgaaacaaa tgaaagcaaa tagttaa 897ttcctgtcct gtgtggataa gaaaatcacc ctgaaacaaa tgaaagcaaa tagttaa 897
<210> 14<210> 14
<211> 888<211> 888
<212> ДНК<212> DNA
<213> Haemophilus ducreyi<213> Haemophilus ducreyi
<400> 14<400> 14
atgaaagaaa tcgccatcat ctccaaccaa cgcatgttct tcctgtactg tctgctgacc 60atgaaagaaa tcgccatcat ctccaaccaa cgcatgttct tcctgtactg tctgctgacc 60
aataaaaatg tcgaagacgt gttcttcatt tttgaaaaag gcgcgatgcc gaacaatctg 120aataaaaatg tcgaagacgt gttcttcatt tttgaaaaag gcgcgatgcc gaacaatctg 120
accagcattt ctcatttcat cgtgctggat cacagtaaat ccgaatgcta tgactttttc 180accagcattt ctcatttcat cgtgctggat cacagtaaat ccgaatgcta tgactttttc 180
tacttcaact tcatcagttg taaatatcgt ctgcgcggcc tggatgttta cggtgcagac 240tacttcaact tcatcagttg taaatatcgt ctgcgcggcc tggatgttta cggtgcagac 240
catatcaaag gcgctaaatt tttcctggaa cgtcaccgct ttttcgtggt tgaagatggt 300catatcaaag gcgctaaatt tttcctggaa cgtcaccgct ttttcgtggt tgaagatggt 300
atgatgaact acagcaaaaa catgtacgca ttctctctgt tccgtacccg caatccggtg 360atgatgaact acagcaaaaa catgtacgca ttctctctgt tccgtacccg caatccggtg 360
attctgccgg gcggttttca tccgaacgtt aaaaccatct tcctgacgaa agataatccg 420attctgccgg gcggttttca tccgaacgtt aaaaccatct tcctgacgaa agataatccg 420
attccggacc agatcgctca caaacgtgaa atcatcaaca tcaaaaccct gtggcaagcg 480attccggacc agatcgctca caaacgtgaa atcatcaaca tcaaaaccct gtggcaagcg 480
aaaaccgcca cggaaaaaac gaaaattctg agctttttcg aaatcgatat gcaggaaatt 540aaaaccgcca cggaaaaaac gaaaattctg agctttttcg aaatcgatat gcaggaaatt 540
tcagttatca aaaaccgctc gtttgtcctg tatacccaac cgctgtcaga agataaactg 600tcagttatca aaaaccgctc gtttgtcctg tatacccaac cgctgtcaga agataaactg 600
ctgacggaag cggaaaaaat tgacatctat cgtaccattc tgacgaaata caaccattcg 660ctgacggaag cggaaaaaat tgacatctat cgtaccattc tgacgaaata caaccattcg 660
cagaccgtta tcaaaccgca cccgcgcgat aaaacggact ataaacaact gtttccggat 720cagaccgtta tcaaaccgca cccgcgcgat aaaacggact ataaacaact gtttccggat 720
gcctatgtca tgaaaggcac ctacccgagt gaactgctga cgctgctggg tgtcaacttc 780gcctatgtca tgaaaggcac ctacccgagt gaactgctga cgctgctggg tgtcaacttc 780
aacaaagtga tcaccctgtt ttccacggcg gtcttcgatt atccgaaaga aaaaatcgac 840aacaaagtga tcaccctgtt ttccacggcg gtcttcgatt atccgaaaga aaaaatcgac 840
ttctacggca ccgcggtgca tccgaaactg ctggatttct ttgactaa 888ttctacggca ccgcggtgca tccgaaactg ctggatttct ttgactaa 888
<210> 15<210> 15
<211> 1467<211> 1467
<212> ДНК<212> DNA
<213> Alistipes sp.<213> Alistipes sp.
<400> 15<400> 15
atggccctgc tgagcggtac cgccgcatgc tcagatgacg aagtctcgca gaacctgatc 60atggccctgc tgagcggtac cgccgcatgc tcagatgacg aagtctcgca gaacctgatc 60
gtgattaatg gcggtgaaca ttttctgagc ctggatggtc tggcccgtgc aggtaaaatt 120gtgattaatg gcggtgaaca ttttctgagc ctggatggtc tggcccgtgc aggtaaaatt 120
agcgtgctgg caccggctcc gtggcgtgtt acgaaagcag ctggtgatac ctggtttcgc 180agcgtgctgg caccggctcc gtggcgtgtt acgaaagcag ctggtgatac ctggtttcgc 180
ctgagcgcaa ccgaaggtcc ggctggttac agcgaagtgg aactgtctct ggatgaaaat 240ctgagcgcaa ccgaaggtcc ggctggttac agcgaagtgg aactgtctct ggatgaaaat 240
ccgggtgccg cacgtagcgc acagctggcg tttgcctgtg gtgatgcgat tgtgccgttc 300ccgggtgccg cacgtagcgc acagctggcg tttgcctgtg gtgatgcgat tgtgccgttc 300
cgcctgagtc aaggcgcact gtccgctggt tatgattcac cggactatta cttttacgtt 360cgcctgagtc aaggcgcact gtccgctggt tatgattcac cggactatta cttttacgtt 360
accttcggca cgatgccgac cctgtatgcc ggtatccatc tgctgagcca cgataaaccg 420accttcggca cgatgccgac cctgtatgcc ggtatccatc tgctgagcca cgataaaccg 420
ggctatgtct tttactcacg ttcgaaaacg tttgacccgg ccgaattccc ggcacgtgct 480ggctatgtct tttactcacg ttcgaaaacg tttgacccgg ccgaattccc ggcacgtgct 480
gaagttacca ccgcagctga tcgtaccgcc gatgcaaccc aggccgaaat ggaagcaatg 540gaagttacca ccgcagctga tcgtaccgcc gatgcaaccc aggccgaaat ggaagcaatg 540
gctcgcgaaa tgaaacgtcg catcctggaa attaactctg cggatccgac cgccgtgttt 600gctcgcgaaa tgaaacgtcg catcctggaa attaactctg cggatccgac cgccgtgttt 600
ggcctgtatg ttgatgacct gcgttgccgc attggctacg attggttcgt ggcgcagggt 660ggcctgtatg ttgatgacct gcgttgccgc attggctacg attggttcgt ggcgcagggt 660
atcgacagtg cccgtgtcaa agtgagcatg ctgtctgatg gcaccggcac gtacaacaat 720atcgacagtg cccgtgtcaa agtgagcatg ctgtctgatg gcaccggcac gtacaacaat 720
ttttataact acttcggtga cgcggccacg gcggaacaaa attgggaaag ttatgcgtcc 780ttttataact acttcggtga cgcggccacg gcggaacaaa attgggaaag ttatgcgtcc 780
gaagttgaag ccctggattg gaatcacggc ggtcgttatc cggaaacccg ctcgctgccg 840gaagttgaag ccctggattg gaatcacggc ggtcgttatc cggaaacccg ctcgctgccg 840
gaatttgaaa gctacacgtg gccgtattac ctgtctaccc gtccggatta tcgcctggtg 900gaatttgaaa gctacacgtg gccgtattac ctgtctaccc gtccggatta tcgcctggtg 900
gttcaggacg gcagtctgct ggaaagctct tgtccgttta ttaccgaaaa actgggtgaa 960gttcaggacg gcagtctgct ggaaagctct tgtccgttta ttaccgaaaa actgggtgaa 960
atggaaatcg aatccattca accgtatgaa atgctgtcag ccctgccgga aagttcccgt 1020atggaaatcg aatccattca accgtatgaa atgctgtcag ccctgccgga aagttcccgt 1020
aaacgctttt atgatatggc aggcttcgat tacgacaaat ttgcagctct gttcgatgcg 1080aaacgctttt atgatatggc aggcttcgat tacgacaaat ttgcagctct gttcgatgcg 1080
tccccgaaga aaaacctgat tatcattggt acctctcatg cggatgatgc cagtgcacgt 1140tccccgaaga aaaacctgat tatcattggt acctctcatg cggatgatgc cagtgcacgt 1140
ctgcagcgtg attacgttgc acgcatcatg gaacagtatg gcgctcaata cgatgtcttt 1200ctgcagcgtg attacgttgc acgcatcatg gaacagtatg gcgctcaata cgatgtcttt 1200
ttcaaaccgc acccggcaga caccacgtca gctggttatg aaacggaatt tccgggcctg 1260ttcaaaccgc acccggcaga caccacgtca gctggttatg aaacggaatt tccgggcctg 1260
accctgctgc cgggtcaaat gccgtttgaa atcttcgttt ggtccctgat tgatcgtgtc 1320accctgctgc cgggtcaaat gccgtttgaa atcttcgttt ggtccctgat tgatcgtgtc 1320
gacatgatcg gcggttatcc gtcaacggtc tttctgaccg ttccggtcga taaagtgcgc 1380gacatgatcg gcggttatcc gtcaacggtc tttctgaccg ttccggtcga taaagtgcgc 1380
tttatttttg ccgcggatgc agcttctctg gtgcgtccgc tgaatatcct gttccgcgat 1440tttatttttg ccgcggatgc agcttctctg gtgcgtccgc tgaatatcct gttccgcgat 1440
gcgaccgacg ttgaatggat gcagtaa 1467gcgaccgacg ttgaatggat gcagtaa 1467
<210> 16<210> 16
<211> 876<211> 876
<212> ДНК<212> DNA
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 16<400> 16
atgaagaaag tgattatcgc cggcaatggt ccgagcctga aagaaattga ttattctcgt 60atgaagaaag tgattatcgc cggcaatggt ccgagcctga aagaaattga ttattctcgt 60
ctgccgaatg atttcgacgt ctttcgctgc aaccagttct actttgaaga caaatattac 120ctgccgaatg atttcgacgt ctttcgctgc aaccagttct actttgaaga caaatattac 120
ctgggcaaaa aatgtaaagc cgtgttttat accccgaact ttttctttga acagtattac 180ctgggcaaaa aatgtaaagc cgtgttttat accccgaact ttttctttga acagtattac 180
acgctgaaac atctgattca gaaccaagaa tatgaaaccg aactgatcat gtgctcaaac 240acgctgaaac atctgattca gaaccaagaa tatgaaaccg aactgatcat gtgctcaaac 240
tacaatcaag cacatctgga aaacgaaaac ttcgtcaaaa cgttctacga ttacttcccg 300tacaatcaag cacatctgga aaacgaaaac ttcgtcaaaa cgttctacga ttacttcccg 300
gacgctcacc tgggttacga tttctttaaa cagctgaaag aattcaacgc gtacttcaaa 360gacgctcacc tgggttacga tttctttaaa cagctgaaag aattcaacgc gtacttcaaa 360
ttccacgaaa tctacttcaa ccaacgtatc acctcaggcg tgtatatgtg tgcggttgcc 420ttccacgaaa tctacttcaa ccaacgtatc acctcaggcg tgtatatgtg tgcggttgcc 420
attgcactgg gttataaaga aatttacctg tcgggcatcg atttttatca gaatggtagc 480attgcactgg gttataaaga aatttacctg tcgggcatcg atttttatca gaatggtagc 480
tcttacgcct tcgacacgaa acaagaaaat ctgctgaaac tggcaccgga ttttaaaaac 540tcttacgcct tcgacacgaa acaagaaaat ctgctgaaac tggcaccgga ttttaaaaac 540
gaccgctcac attatattgg ccactcgaaa aacaccgata tcaaagctct ggaattcctg 600gaccgctcac attatattgg ccactcgaaa aacaccgata tcaaagctct ggaattcctg 600
gaaaaaacgt acaaaatcaa actgtactgc ctgtgtccga atagtctgct ggctaacttt 660gaaaaaacgt acaaaatcaa actgtactgc ctgtgtccga atagtctgct ggctaacttt 660
atcgaactgg cgccgaacct gaattccaac ttcatcatcc aggagaaaaa caactacacc 720atcgaactgg cgccgaacct gaattccaac ttcatcatcc aggagaaaaa caactacacc 720
aaagatatcc tgatcccgag ttccgaagcg tacggcaaat ttagcaaaaa catcaacttc 780aaagatatcc tgatcccgag ttccgaagcg tacggcaaat ttagcaaaaa catcaacttc 780
aagaaaatta aaatcaaaga aaacgtgtat tacaaactga ttaaagatct gctgcgtctg 840aagaaaatta aaatcaaaga aaacgtgtat tacaaactga ttaaagatct gctgcgtctg 840
ccgtctgaca tcaaacatta ttttaaaggt aaataa 876ccgtctgaca tcaaacatta ttttaaaggt aaataa 876
<210> 17<210> 17
<211> 939<211> 939
<212> ДНК<212> DNA
<213> Streptococcus agalactiae<213> Streptococcus agalactiae
<400> 17<400> 17
atgacgaatc gcaaaatcta tgtctgccac accctgtacc atctgctgat ctgcctgtat 60atgacgaatc gcaaaatcta tgtctgccac accctgtacc atctgctgat ctgcctgtat 60
aaagaagaaa tctactcaaa tctggaaatt atcctgagca gcagcattcc ggatgtggac 120aaagaagaaa tctactcaaa tctggaaatt atcctgagca gcagcattcc ggatgtggac 120
aacctggaga aaaaactgaa aagcaaaacc atcaacatcc atattctgga agaatcctca 180aacctggaga aaaaactgaa aagcaaaacc atcaacatcc atattctgga agaatcctca 180
ggcgaatctg aagaactgct gagtgttctg aaagatgcag gtctgtctta cagtaaattc 240ggcgaatctg aagaactgct gagtgttctg aaagatgcag gtctgtctta cagtaaattc 240
gatagcaact gcttcatctt caacgacgct accccgattg gccgtacgct gatcaaacac 300gatagcaact gcttcatctt caacgacgct accccgattg gccgtacgct gatcaaacac 300
ggtatttatt acaatctgat cgaagatggc ctgaactgtt ttacctactc gattttcagc 360ggtatttatt acaatctgat cgaagatggc ctgaactgtt ttacctactc gattttcagc 360
cagaaactgt ggaaatacta cgtgaaaaaa tacatcctgc ataaaattca accgcacggc 420cagaaactgt ggaaatacta cgtgaaaaaa tacatcctgc ataaaattca accgcacggc 420
ttttcccgct actgcctggg tatcgaagtg aacagtctgg ttaatctgcc gaaagatccg 480ttttcccgct actgcctggg tatcgaagtg aacagtctgg ttaatctgcc gaaagatccg 480
cgttacaaaa aattcatcga agtcccgcgc aaagaactgt tcgacaatgt tacggaatac 540cgttacaaaa aattcatcga agtcccgcgc aaagaactgt tcgacaatgt tacggaatac 540
cagaaagaaa tggcgatcaa cctgtttggc gccgtccgtg tgtctattaa atccccgtca 600cagaaagaaa tggcgatcaa cctgtttggc gccgtccgtg tgtctattaa atccccgtca 600
gttctggtcc tgacccagcc gctgtccatc gataaagaat ttatgtcata caacaacaaa 660gttctggtcc tgacccagcc gctgtccatc gataaagaat ttatgtcata caacaacaaa 660
atcgaaacgt cggaagaaca attcaacttc tacaaaagca tcgtgaacga atacatcaac 720atcgaaacgt cggaagaaca attcaacttc tacaaaagca tcgtgaacga atacatcaac 720
aaaggttaca acgtctacct gaaagtgcat ccgcgtgatg tggttgacta ttctaaactg 780aaaggttaca acgtctacct gaaagtgcat ccgcgtgatg tggttgacta ttctaaactg 780
ccggttgaac tgctgccgag taacgtcccg atggaaatta tcgaactgat gctgaccggc 840ccggttgaac tgctgccgag taacgtcccg atggaaatta tcgaactgat gctgaccggc 840
cgctttgaat gcggtattac ccatagcagc accgccctgg atttcctgac ctgtgtggac 900cgctttgaat gcggtattac ccatagcagc accgccctgg atttcctgac ctgtgtggac 900
aagaaaatta cgctggttga tctgaaagac attaaataa 939aagaaaatta cgctggttga tctgaaagac attaaataa 939
<210> 18<210> 18
<211> 1233<211> 1233
<212> ДНК<212> DNA
<213> Bibersteinia trehalosi<213> Bibersteinia trehalosi
<400> 18<400> 18
atggaattct gcaaaatggc aacgacgcaa aaaatctgtg tctacctgga ctatgctacg 60atggaattct gcaaaatggc aacgacgcaa aaaatctgtg tctacctgga ctatgctacg 60
atcccgagcc tgaactacat cctgcacttt gcgcaacatt tcgaagatca ggaaaccatt 120atcccgagcc tgaactacat cctgcacttt gcgcaacatt tcgaagatca ggaaaccatt 120
cgtctgtttg gcctgtcccg cttccacatt ccggaatcag tcatccagcg ctatccgaaa 180cgtctgtttg gcctgtcccg cttccacatt ccggaatcag tcatccagcg ctatccgaaa 180
ggtgtggttc aattttaccc gaaccaggaa aaagacttca gcgcgctgct gctggccctg 240ggtgtggttc aattttaccc gaaccaggaa aaagacttca gcgcgctgct gctggccctg 240
aaaaacatcc tgatcgaagt taaacagcaa cagcgtaaat gcgaaatcga actgcatctg 300aaaaacatcc tgatcgaagt taaacagcaa cagcgtaaat gcgaaatcga actgcatctg 300
aacctgtttc actatcagct gctgctgctg ccgttcctga gtctgtatct ggatacccag 360aacctgtttc actatcagct gctgctgctg ccgttcctga gtctgtatct ggatacccag 360
gactactgtc atctgacgct gaaattttac gatgacggct ctgaagcgat tagtgccctg 420gactactgtc atctgacgct gaaattttac gatgacggct ctgaagcgat tagtgccctg 420
caggaactgg cactggctcc ggatctggcg gcccaaatcc agtttgaaaa acaacagttc 480caggaactgg cactggctcc ggatctggcg gcccaaatcc agtttgaaaa acaacagttc 480
gacgaactgg tcgtgaaaaa atcgtttaaa ctgtcgctgc tgagccgcta tttttggggt 540gacgaactgg tcgtgaaaaa atcgtttaaa ctgtcgctgc tgagccgcta tttttggggt 540
aaactgttcg aaagcgaata catttggttc aatcaagcaa tcctgcagaa agctgaactg 600aaactgttcg aaagcgaata catttggttc aatcaagcaa tcctgcagaa agctgaactg 600
caaattctga aacaggaaat cagctctagt cgtcagatgg attttgcaat ttatcaacag 660caaattctga aacaggaaat cagctctagt cgtcagatgg attttgcaat ttatcaacag 660
atgtccgacg aacaaaaaca gctggtgctg gaaattctga acatcgatct gaataaagtt 720atgtccgacg aacaaaaaca gctggtgctg gaaattctga acatcgatct gaataaagtt 720
gcttacctga aacaactgat ggaaaaccag ccgtcttttc tgttcctggg caccacgctg 780gcttacctga aacaactgat ggaaaaccag ccgtcttttc tgttcctggg caccacgctg 780
tttaatatta cccaggaaac caaaacgtgg ctgatgcaga tgcatgtgga tctgatccaa 840tttaatatta cccaggaaac caaaacgtgg ctgatgcaga tgcatgtgga tctgatccaa 840
cagtattgcc tgccgagcgg ccagtttttc aacaataaag ccggctatct gtgtttttac 900cagtattgcc tgccgagcgg ccagtttttc aacaataaag ccggctatct gtgtttttac 900
aaaggtcacc cgaacgaaaa agaaatgaac caaatgatcc tgtctcagtt caaaaacctg 960aaaggtcacc cgaacgaaaa agaaatgaac caaatgatcc tgtctcagtt caaaaacctg 960
atcgcgctgc cggatgacat tccgctggaa atcctgctgc tgctgggcgt tattccgagt 1020atcgcgctgc cggatgacat tccgctggaa atcctgctgc tgctgggcgt tattccgagt 1020
aaagtcggcg gttttgcatc ctcagctctg tttaacttca ccccggcgca gatcgaaaat 1080aaagtcggcg gttttgcatc ctcagctctg tttaacttca ccccggcgca gatcgaaaat 1080
attatctttt tcacgccgcg ttatttcgaa aaagataatc gcctgcacgc cacgcaatac 1140attatctttt tcacgccgcg ttatttcgaa aaagataatc gcctgcacgc cacgcaatac 1140
cgtctgatgc agggcctgat tgaactgggt tatctggacg ctgaaaaatc tgtgacccac 1200cgtctgatgc agggcctgat tgaactgggt tatctggacg ctgaaaaatc tgtgacccac 1200
tttgaaatca tgcaactgct gacgaaagaa taa 1233tttgaaatca tgcaactgct gacgaaagaa taa 1233
<210> 19<210> 19
<211> 1221<211> 1221
<212> ДНК<212> DNA
<213> Haemophilus parahaemolyticus<213> Haemophilus parahaemolyticus
<400> 19<400> 19
atgaccgaac agtacatcaa aaacgtggaa gtttacctgg attacgcgac catcccgacg 60atgaccgaac agtacatcaa aaacgtggaa gtttacctgg attacgcgac catcccgacg 60
ctgaactact tctaccattt caccgaaaac aaagatgaca tcgccacgat tcgtctgttt 120ctgaactact tctaccattt caccgaaaac aaagatgaca tcgccacgat tcgtctgttt 120
ggcctgggtc gcttcaacat cagtaaatcc atcatcgaaa gctacccgga aggcattatc 180ggcctgggtc gcttcaacat cagtaaatcc atcatcgaaa gctacccgga aggcattatc 180
cgttactgcc cgattatctt tgaagatcaa accgcatttc agcaactgtt cattaccctg 240cgttactgcc cgattatctt tgaagatcaa accgcatttc agcaactgtt cattaccctg 240
ctgacggaag acagtttttg tcagtatcgc tttaacttcc atattaacct gtttcactcc 300ctgacggaag acagtttttg tcagtatcgc tttaacttcc atattaacct gtttcactcc 300
tggaaaatgc tgatcccgct gctgcatatt atctggcagt ttaaacacaa agtcctggat 360tggaaaatgc tgatcccgct gctgcatatt atctggcagt ttaaacacaa agtcctggat 360
attaaactga acttctatga tgacggcagt gaaggtctgg tgacgctgtc caaaatcgaa 420attaaactga acttctatga tgacggcagt gaaggtctgg tgacgctgtc caaaatcgaa 420
cagaactaca gctctgaaat cctgcaaaaa atcatcgata tcgactcaca gtcgttttat 480cagaactaca gctctgaaat cctgcaaaaa atcatcgata tcgactcaca gtcgttttat 480
gcagataaac tgtctttcct ggatgaagac attgctcgtt acctgtggaa cagtctgttt 540gcagataaac tgtctttcct ggatgaagac attgctcgtt acctgtggaa cagtctgttt 540
gaatcccatt attacctgct gaacgacttc ctgctgaaaa acgaaaaact gtcactgctg 600gaatcccatt attacctgct gaacgacttc ctgctgaaaa acgaaaaact gtcactgctg 600
aaaaactcga tcaaatactg ccacatcatg gatctggaac gctacctgca gtttacccaa 660aaaaactcga tcaaatactg ccacatcatg gatctggaac gctacctgca gtttacccaa 660
gaagaaaaag actttttcaa cgaactgctg ggcatcaaca tccagagtct ggaagataaa 720gaagaaaaag actttttcaa cgaactgctg ggcatcaaca tccagagtct ggaagataaa 720
atcaaaatct tccagcagaa gaaaaccttt attttcacgg gtaccacgat cttcagcctg 780atcaaaatct tccagcagaa gaaaaccttt attttcacgg gtaccacgat cttcagcctg 780
ccgaaagaag aagaagaaac cctgtatcgt ctgcatctga acgcaatcct gaattatatt 840ccgaaagaag aagaagaaac cctgtatcgt ctgcatctga acgcaatcct gaattatatt 840
cacccgaacg gcaaatactt tattggcgat ggtttcacgc tggttatcaa aggtcatccg 900cacccgaacg gcaaatactt tattggcgat ggtttcacgc tggttatcaa aggtcatccg 900
caccagaaag aaatgaacag ccgcctggaa aaatcttttg aaaaagctgt catgctgccg 960caccagaaag aaatgaacag ccgcctggaa aaatcttttg aaaaagctgt catgctgccg 960
gataatatcc cgttcgaaat tctgtatctg atcggctgca aaccggacaa aattggcggt 1020gataatatcc cgttcgaaat tctgtatctg atcggctgca aaccggacaa aattggcggt 1020
tttgtgagca cctcttactt cagctgtgat aagaaaaaca ttgcggacct gctgtttatc 1080tttgtgagca cctcttactt cagctgtgat aagaaaaaca ttgcggacct gctgtttatc 1080
tctgcccgtc aagaagaagt tcgcaaaaac gattacctgt ttaacatcca gtaccaactg 1140tctgcccgtc aagaagaagt tcgcaaaaac gattacctgt ttaacatcca gtaccaactg 1140
cgtgacatga tgattaaaac cggttttatc caggaagaaa aaacgcactt ctactcagat 1200cgtgacatga tgattaaaac cggttttatc caggaagaaa aaacgcactt ctactcagat 1200
atcccgatct tcatctcgta a 1221atcccgatct tcatctcgta a 1221
<210> 20<210> 20
<211> 903<211> 903
<212> ДНК<212> DNA
<213> Haemophilus somnus<213> Haemophilus somnus
<400> 20<400> 20
atgaaatata acatcaaaat taaagctatc gtcatcgtgt cgagcctgcg tatgctgctg 60atgaaatata acatcaaaat taaagctatc gtcatcgtgt cgagcctgcg tatgctgctg 60
atcttcctga tgctgaataa ataccacctg gatgaagttc tgtttgtctt caacgaaggc 120atcttcctga tgctgaataa ataccacctg gatgaagttc tgtttgtctt caacgaaggc 120
ttcgaactgc ataaaaaata caaaatcaaa cactatgtgg cgattaaaaa gaaaattacc 180ttcgaactgc ataaaaaata caaaatcaaa cactatgtgg cgattaaaaa gaaaattacc 180
aaattctggc gtctgtacta caaactgtac ttctaccgtt tcaaaattga ccgcatcccg 240aaattctggc gtctgtacta caaactgtac ttctaccgtt tcaaaattga ccgcatcccg 240
gtttatggcg cagatcatct gggttggacc gactattttc tgaaatactt cgatttctac 300gtttatggcg cagatcatct gggttggacc gactattttc tgaaatactt cgatttctac 300
ctgattgaag acggcatcgc taacttctcc ccgaaacgtt acgaaattaa cctgacgcgc 360ctgattgaag acggcatcgc taacttctcc ccgaaacgtt acgaaattaa cctgacgcgc 360
aatatcccgg tctttggttt ccataaaacc gtgaagaaaa tttacctgac gagtctggaa 420aatatcccgg tctttggttt ccataaaacc gtgaagaaaa tttacctgac gagtctggaa 420
aatgttccgt ccgatattcg tcataaagtc gaactgatca gcctggaaca cctgtggaaa 480aatgttccgt ccgatattcg tcataaagtc gaactgatca gcctggaaca cctgtggaaa 480
acccgcacgg cgcaggaaca acacaacatc ctggatttct ttgcctttaa tctggacagc 540acccgcacgg cgcaggaaca acacaacatc ctggatttct ttgcctttaa tctggacagc 540
ctgatctctc tgaaaatgaa aaaatacatc ctgttcaccc agtgcctgtc agaagatcgc 600ctgatctctc tgaaaatgaa aaaatacatc ctgttcaccc agtgcctgtc agaagatcgc 600
gtcatttcgg aacaggaaaa aatcgcgatc taccaacata tcatcaaaaa ctacgatgaa 660gtcatttcgg aacaggaaaa aatcgcgatc taccaacata tcatcaaaaa ctacgatgaa 660
cgtctgctgg ttatcaaacc gcacccgcgc gaaaccacgg actatcagaa atactttgaa 720cgtctgctgg ttatcaaacc gcacccgcgc gaaaccacgg actatcagaa atactttgaa 720
aatgtcttcg tgtaccaaga tgtggttccg agcgaactgt ttgaactgct ggacgtgaac 780aatgtcttcg tgtaccaaga tgtggttccg agcgaactgt ttgaactgct ggacgtgaac 780
ttcgaacgtg ttattaccct gttttctacg gccgtgttca aatatgatcg caatatcgtt 840ttcgaacgtg ttattaccct gttttctacg gccgtgttca aatatgatcg caatatcgtt 840
gacttctacg gtacgcgcat ccacgacaaa atctatcaat ggttcggcga catcaaattc 900gacttctacg gtacgcgcat ccacgacaaa atctatcaat ggttcggcga catcaaattc 900
taa 903taa 903
<210> 21<210> 21
<211> 1146<211> 1146
<212> ДНК<212> DNA
<213> Vibrio harveyi<213> Vibrio harveyi
<400> 21<400> 21
atggattctt cgccggaaaa caccagctct acgctggaaa tttacatcga ttcagcaacc 60atggattctt cgccggaaaa caccagctct acgctggaaa tttacatcga ttcagcaacc 60
ctgccgtcgc tgcagcacat ggtgaaaatt atcgacgaac aaagtggcaa caaaaaactg 120ctgccgtcgc tgcagcacat ggtgaaaatt atcgacgaac aaagtggcaa caaaaaactg 120
atcaactgga aacgttatcc gatcgatgac gaactgctgc tggataaaat caacgctctg 180atcaactgga aacgttatcc gatcgatgac gaactgctgc tggataaaat caacgctctg 180
agcttttctg ataccacgga cctgacccgt tatatggaaa gtattctgct gatcggcgat 240agcttttctg ataccacgga cctgacccgt tatatggaaa gtattctgct gatcggcgat 240
attaaacgcg tggttattaa cggtaatagt ctgtccaact acaatattgt cggcgtgatg 300attaaacgcg tggttattaa cggtaatagt ctgtccaact acaatattgt cggcgtgatg 300
cgctccatca acgccctggg tctggatctg gacgttgaaa tcaattttta tgatgacggt 360cgctccatca acgccctggg tctggatctg gacgttgaaa tcaattttta tgatgacggt 360
tcagcagaat atgtccgtct gtacaacttc tcgcagctgc cggaagctga acgcgaactg 420tcagcagaat atgtccgtct gtacaacttc tcgcagctgc cggaagctga acgcgaactg 420
ctggtgtcaa tgtcgaaaaa caatattctg gcggccgtta acggcatcgg ttcttatgat 480ctggtgtcaa tgtcgaaaaa caatattctg gcggccgtta acggcatcgg ttcttatgat 480
agcggctctc cggaaaatat ttacggtttt gcgcagattt atccggccac ctaccacatg 540agcggctctc cggaaaatat ttacggtttt gcgcagattt atccggccac ctaccacatg 540
ctgcgtgcgg acattttcga tacggacctg gaaatcggcc tgattcgcga tatcctgggt 600ctgcgtgcgg acattttcga tacggacctg gaaatcggcc tgattcgcga tatcctgggt 600
gacaacgtca aacagatgaa atggggccaa tttctgggtt tcaacgaaga acagaaagaa 660gacaacgtca aacagatgaa atggggccaa tttctgggtt tcaacgaaga acagaaagaa 660
ctgttttatc aactgaccag cttcaacccg gataaaatcc aggcgcaata caaagaatct 720ctgttttatc aactgaccag cttcaacccg gataaaatcc aggcgcaata caaagaatct 720
ccgaacaaaa acttcgtttt cgtcggcacc aacagtcgtt ccgcaacggc tgaacagcaa 780ccgaacaaaa acttcgtttt cgtcggcacc aacagtcgtt ccgcaacggc tgaacagcaa 780
atcaacatca tcaaagaagc caaaaaactg gatagcgaaa ttatcccgaa cagcatcgat 840atcaacatca tcaaagaagc caaaaaactg gatagcgaaa ttatcccgaa cagcatcgat 840
ggctatgacc tgtttttcaa aggtcatccg agcgcgacct acaaccagca aattgttgat 900ggctatgacc tgtttttcaa aggtcatccg agcgcgacct acaaccagca aattgttgat 900
gcccacgaca tgaccgaaat ctataatcgc acgccgtttg aagtcctggc aatgacgagt 960gcccacgaca tgaccgaaat ctataatcgc acgccgtttg aagtcctggc aatgacgagt 960
tccctgccgg atgctgtggg cggtatgggc tcatcgctgt ttttctcact gccgaaaacc 1020tccctgccgg atgctgtggg cggtatgggc tcatcgctgt ttttctcact gccgaaaacc 1020
gtggaaacga aattcatttt ctataaaagt ggcaccgata ttgaatccaa tgcgctgatc 1080gtggaaacga aattcatttt ctataaaagt ggcaccgata ttgaatccaa tgcgctgatc 1080
caggttatgc tgaaactggg tatcattacg gacgaaaaag tgcgctttac gacggacatc 1140caggttatgc tgaaactggg tatcattacg gacgaaaaag tgcgctttac gacggacatc 1140
aaataa 1146aaataa 1146
<210> 22<210> 22
<211> 1452<211> 1452
<212> ДНК<212> DNA
<213> Alistipes sp.<213> Alistipes sp.
<400> 22<400> 22
atggccagct gttctgatga cgataaagaa cagacgggtt ttcaaatcga cgatggctct 60atggccagct gttctgatga cgataaagaa cagacggggtt ttcaaatcga cgatggctct 60
ggtttcctga gtctggatgc agctgcgcgt agtggctcca ttgccatcac cgcaaacaat 120ggtttcctga gtctggatgc agctgcgcgt agtggctcca ttgccatcac cgcaaacaat 120
tcatggtcgg tgacgcagga taaagacagc gaatggctga ccctgagcac cacgtctggt 180tcatggtcgg tgacgcagga taaagacagc gaatggctga ccctgagcac cacgtctggt 180
gcagcaggtc gtaccgaaat tggtatcatg ctggaagcga acccgggcga agctcgtaat 240gcagcaggtc gtaccgaaat tggtatcatg ctggaagcga acccgggcga agctcgtaat 240
gcgggtctga cctttaactc tggcggtcgc acgtatccgt tcgtgattac ccagagtgcc 300gcgggtctga cctttaactc tggcggtcgc acgtatccgt tcgtgattac ccagagtgcc 300
catgttacgg cagattttga cgatgctgac cactgctttt atatcacctt tggtaccctg 360catgttacgg cagattttga cgatgctgac cactgctttt atatcacctt tggtaccctg 360
ccgaccctgt atgcaggtct gcatgtgctg tcccacgata aaccgtcata tgtgtttttc 420ccgaccctgt atgcaggtct gcatgtgctg tcccacgata aaccgtcata tgtgtttttc 420
cagcgttccc aaacctttcg cccggaagaa ttcccggccc atgcagaagt tacgattgct 480cagcgttccc aaacctttcg cccggaagaa ttcccggccc atgcagaagt tacgattgct 480
gcggatccgt cagctaatgc gaccgatgaa gacatggaac gtatgcgcac ggccatgaaa 540gcggatccgt cagctaatgc gaccgatgaa gacatggaac gtatgcgcac ggccatgaaa 540
cagcaaattc tgaaaatcaa cgttgaagat ccgaccgcag tttttggcct gtatgtcgac 600cagcaaattc tgaaaatcaa cgttgaagat ccgaccgcag tttttggcct gtatgtcgac 600
gatctgcgtt gtggcattgg ttacgattgg ttcgtcgccc agggtatcga cagtacccgc 660gatctgcgtt gtggcattgg ttacgattgg ttcgtcgccc agggtatcga cagtacccgc 660
gtgaaagtta gtatgctgtc cgatggcacc ggcacgtaca acaacttcta caactacttc 720gtgaaagtta gtatgctgtc cgatggcacc ggcacgtaca acaacttcta caactacttc 720
ggcgatccgg ccaccgcaga acaaaactgg gaaaattacg ccgcacaggt ggaagcgctg 780ggcgatccgg ccaccgcaga acaaaactgg gaaaattacg ccgcacaggt ggaagcgctg 780
gattggcaac acggcggtcg ttttccggaa acccgcatgc cggatggttt tgacttctat 840gattggcaac acggcggtcg ttttccggaa acccgcatgc cggatggttt tgacttctat 840
gaatggccgt attacctggc aacgcgtccg aactaccgcc tggttctgca ggacgatgac 900gaatggccgt attacctggc aacgcgtccg aactaccgcc tggttctgca ggacgatgac 900
ctgctggaag cgacgtctcc gtttatgacc gaacgtctgc agcaaatgcg caccgaatcg 960ctgctggaag cgacgtctcc gtttatgacc gaacgtctgc agcaaatgcg caccgaatcg 960
aaacagccgt atgaactgct ggccagcctg ccggctgaag cccgtcaacg ctttttccgt 1020aaacagccgt atgaactgct ggccagcctg ccggctgaag cccgtcaacg ctttttccgt 1020
atggctggct ttgattacga cgcgtttgct gcgctgttcg atgccagccc gaagaaaaac 1080atggctggct ttgattacga cgcgtttgct gcgctgttcg atgccagccc gaagaaaaac 1080
ctggtcatta tcggcacgtc acatacctcg gaagaaagcg aagcacagca agccgcatat 1140ctggtcatta tcggcacgtc acatacctcg gaagaaagcg aagcacagca agccgcatat 1140
gtggaacgta ttatcggcga ttatggtacc gcctacgaca ttttctttaa accgcacccg 1200gtggaacgta ttatcggcga ttatggtacc gcctacgaca ttttctttaa accgcacccg 1200
gcagatagct ctagttccaa ctacgaagaa cgctttgaag gtctgaccct gctgccgggt 1260gcagatagct ctagttccaa ctacgaagaa cgctttgaag gtctgaccct gctgccgggt 1260
cagatgccgt ttgaaatttt cgtctggtcg ctgctggata aagtggacct gatcggcggt 1320cagatgccgt ttgaaatttt cgtctggtcg ctgctggata aagtggacct gatcggcggt 1320
tattcatcga cggtgtttct gaccgtcccg gtggaaaaaa ccggctttat tttcgctgcg 1380tattcatcga cggtgtttct gaccgtcccg gtggaaaaaa ccggctttat tttcgctgcg 1380
aatgctgaaa gcctgccgcg cccgctgaac gttctgttcc gtaatgcgga acatgtccgc 1440aatgctgaaa gcctgccgcg cccgctgaac gttctgttcc gtaatgcgga acatgtccgc 1440
tggatccagt aa 1452tggatccagt aa 1452
<210> 23<210> 23
<211> 1452<211> 1452
<212> ДНК<212> DNA
<213> Alistipes shahii<213> Alistipes shahii
<400> 23<400> 23
atggacgatg gcaccccgag tgtcagcatc aacggcggca ccgacttcct gagcctggac 60atggacgatg gcaccccgag tgtcagcatc aacggcggca ccgacttcct gagcctggac 60
cacctggcac gcagcggcaa aatcacggtc aacgcaccgg ctccgtggtc tgtgaccctg 120cacctggcac gcagcggcaa aatcacggtc aacgcaccgg ctccgtggtc tgtgaccctg 120
gccccggaaa attacggcca ggatgaaaaa ccggactggc tgaccctgag cgccgaagaa 180gccccggaaa attacggcca ggatgaaaaa ccggactggc tgaccctgag cgccgaagaa 180
ggcccggcag gttatagcga aatcgatgtt acctttgcgg aaaacccggg tccggcccgt 240ggcccggcag gttatagcga aatcgatgtt acctttgcgg aaaacccggg tccggcccgt 240
tccgcatcac tgctgttcag ctgcgatggt aaaaccctgg cctttacggt ttcgcagagc 300tccgcatcac tgctgttcag ctgcgatggt aaaaccctgg cctttacggt ttcgcagagc 300
gcaggcggta cgggtttcga tgctccggac tattactttt atatttcggt cggcaccatg 360gcaggcggta cgggtttcga tgctccggac tattactttt atatttcggt cggcaccatg 360
ccgacgctgt actcgggtct gcatctgctg agccacgata aaccgtctta tgttagttac 420ccgacgctgt actcgggtct gcatctgctg agccacgata aaccgtctta tgttagttac 420
gaacgtgcga gcacctttga tgcggccgaa ttcccggacc gcgcgtttgt ctatccggtg 480gaacgtgcga gcacctttga tgcggccgaa ttcccggacc gcgcgtttgt ctatccggtg 480
gccgatccga ccggtcatgc aaccaacgaa gaactgcgtg cgatgagcga agccatgaaa 540gccgatccga ccggtcatgc aaccaacgaa gaactgcgtg cgatgagcga agccatgaaa 540
cgtcgcatcc tggaaattaa tgcagaagat ccgaccgctg ttttcggtct gtgggtcgat 600cgtcgcatcc tggaaattaa tgcagaagat ccgaccgctg ttttcggtct gtgggtcgat 600
gacctgcgtt gccgcctggg ctacgattgg tttgtggctc aaggtatcga ctctgcgcgc 660gacctgcgtt gccgcctggg ctacgattgg tttgtggctc aaggtatcga ctctgcgcgc 660
gtgaaagtta cgatgctgag tgatggcacc gcgacgtata acaattttca taactacttc 720gtgaaagtta cgatgctgag tgatggcacc gcgacgtata acaattttca taactacttc 720
ggtgacgcag ctaccgccga acagaactgg aatgattatg cggccgaagt tgaagcactg 780ggtgacgcag ctaccgccga acagaactgg aatgattatg cggccgaagt tgaagcactg 780
gactggaatc atggcggtcg ttatccggaa acccgtgccc cggaagaatt cgcctcctac 840gactggaatc atggcggtcg ttatccggaa acccgtgccc cggaagaatt cgcctcctac 840
acctggccgt attacctgtc aacgcgtccg gattatcgcc tgatgctgca aaacagctct 900acctggccgt attacctgtc aacgcgtccg gattatcgcc tgatgctgca aaacagctct 900
ctgatggaaa gttcctgtcc gtttatcgca gatcgcctgg cagctatgaa aatggaatcc 960ctgatggaaa gttcctgtcc gtttatcgca gatcgcctgg cagctatgaa aatggaatcc 960
gtgcagccgt atgaactgct gacggcactg ccggaagctt caaaacagca attctatcgt 1020gtgcagccgt atgaactgct gacggcactg ccggaagctt caaaacagca attctatcgt 1020
atggccaaat ttgattacgc acgctttgct ggcctgttcg acctgtctcc gaagaaaaac 1080atggccaaat ttgattacgc acgctttgct ggcctgttcg acctgtctcc gaagaaaaac 1080
ctgattatca ttggtacctc tcattcatcg gcggccagtg aacagcaaca ggcagcttac 1140ctgattatca ttggtacctc tcattcatcg gcggccagtg aacagcaaca ggcagcttac 1140
gtcgaacgta tcattcaaca gtatggcagt gattacgaca ttttctttaa accgcacccg 1200gtcgaacgta tcattcaaca gtatggcagt gattacgaca ttttctttaa accgcacccg 1200
gcagatagct ctagtgctgg ttatccggac cgctttgaag gtctgaccct gctgccgggt 1260gcagatagct ctagtgctgg ttatccggac cgctttgaag gtctgaccct gctgccgggt 1260
cagatgccgt ttgaaatctt cgtttgggcg ctgctggata aaatcgacat gattggcggt 1320cagatgccgt ttgaaatctt cgtttgggcg ctgctggata aaatcgacat gattggcggt 1320
tatccgtcca ccacgtttat ttcagtgccg ctggataaag ttggctttct gttcgcggcc 1380tatccgtcca ccacgtttat ttcagtgccg ctggataaag ttggctttct gttcgcggcc 1380
gatgccgacg gtctggtccg cccgctgaat atcctgttcc gtgacgctgc aaatgtcgaa 1440gatgccgacg gtctggtccg cccgctgaat atcctgttcc gtgacgctgc aaatgtcgaa 1440
tggattcaat aa 1452tggattcaat aa 1452
<210> 24<210> 24
<211> 1206<211> 1206
<212> ДНК<212> DNA
<213> Actinobacillus suis<213> Actinobacillus suis
<400> 24<400> 24
atggaacgca cgccgcaact gcaagcggtg gacatttaca ttgacttcgc aacgatcccg 60atggaacgca cgccgcaact gcaagcggtg gacatttaca ttgacttcgc aacgatcccg 60
agcctgagct actttctgca ctttctgaaa cataaacacg atgatcagcg tctgcgtctg 120agcctgagct actttctgca ctttctgaaa cataaacacg atgatcagcg tctgcgtctg 120
ttcagcctgg cccgttttga aatgccgcaa accctgattg aacagtatga aggcattatc 180ttcagcctgg cccgttttga aatgccgcaa accctgattg aacagtatga aggcattatc 180
cagttctcgc gcaacgtgga acataatgtt gaaccgctgc tggaacagct gcaaacgatc 240cagttctcgc gcaacgtgga acataatgtt gaaccgctgc tggaacagct gcaaacgatc 240
ctgtcacaag aaggtaaaca gtttgaactg catctgcacc tgaacctgtt tcattcgttc 300ctgtcacaag aaggtaaaca gtttgaactg catctgcacc tgaacctgtt tcattcgttc 300
gaaatgtttc tgaatctgag cccgacctac acgcagtaca aagaaaaaat ctctaaaatc 360gaaatgtttc tgaatctgag cccgacctac acgcagtaca aagaaaaaat ctctaaaatc 360
gttctgcacc tgtatgatga cggcagtgaa ggtgtcatga aacagtacca actgcagaaa 420gttctgcacc tgtatgatga cggcagtgaa ggtgtcatga aacagtacca actgcagaaa 420
agctctagtc tggtgcagga tctggcggcc accaaagcat ctctggttag cctgttcgaa 480agctctagtc tggtgcagga tctggcggcc accaaagcat ctctggttag cctgttcgaa 480
aacggcgaag gttcgtttag ccagattgat ctgatccgtt atgtctggaa tgctgtgctg 540aacggcgaag gttcgtttag ccagattgat ctgatccgtt atgtctggaa tgctgtgctg 540
gaaacccatt attacctgct gtctgatcac tttctgctgg acgaaaaact gcagccgctg 600gaaacccatt attacctgct gtctgatcac tttctgctgg acgaaaaact gcagccgctg 600
aaagcagaac tgggccatta ccaactgctg aacctgagtg cttatcagta cctgtcctca 660aaagcagaac tgggccatta ccaactgctg aacctgagtg cttatcagta cctgtcctca 660
gaagatctgc tgtggctgaa acagattctg aaaatcgaca ccgaactgga aagcctgatg 720gaagatctgc tgtggctgaa acagattctg aaaatcgaca ccgaactgga aagcctgatg 720
caaaaactga cggcgcagcc ggtgtatttc tttagcggta ccacgttttt caacatcagt 780caaaaactga cggcgcagcc ggtgtatttc tttagcggta ccacgttttt caacatcagt 780
ttcgaagata aacaacgtct ggcgaatatc catgccattc tgatccgcga acacctggac 840ttcgaagata aacaacgtct ggcgaatatc catgccattc tgatccgcga acacctggac 840
ccgaactccc agctgtttat tggcgaaccg tacctgtttg tcttcaaagg tcatccgaac 900ccgaactccc agctgtttat tggcgaaccg tacctgtttg tcttcaaagg tcatccgaac 900
tcaccggaaa ttaatcaggc cctgcgtgaa tattacccga acgttatctt cctgccggaa 960tcaccggaaa ttaatcaggc cctgcgtgaa tattacccga acgttatctt cctgccggaa 960
aatattccgt ttgaaatcct gaccctgctg ggcttctccc cgcaaaaaat tggcggtttt 1020aatattccgt ttgaaatcct gaccctgctg ggcttctccc cgcaaaaaat tggcggtttt 1020
gcgtcaacga tccacgttaa ttccgaacag tcaaaactgg ccaaactgtt tttcctgacc 1080gcgtcaacga tccacgttaa ttccgaacag tcaaaactgg ccaaactgtt tttcctgacc 1080
tcgacggatg aacaagaacg ccagctgagc gacggttata ttaaacaata cgcactggct 1140tcgacggatg aacaagaacg ccagctgagc gacggttata ttaaacaata cgcactggct 1140
caggctatgc tggaaatgca actggtctcg caagaacaag tctattactg ctcgctgtcg 1200caggctatgc tggaaatgca actggtctcg caagaacaag tctattactg ctcgctgtcg 1200
tcgtaa 1206tcgtaa 1206
<210> 25<210> 25
<211> 1206<211> 1206
<212> ДНК<212> DNA
<213> Actinobacillus capsulatus<213> Actinobacillus capsulatus
<400> 25<400> 25
atggaacgca tcccgcaact gcaagctgtc gatatttaca ttgacttcgc cacgatcccg 60atggaacgca tcccgcaact gcaagctgtc gatatttaca ttgacttcgc cacgatcccg 60
agcctgtcct actttctgca ctttctgaaa cataaacacg atcatcagcg tctgcgcctg 120agcctgtcct actttctgca ctttctgaaa cataaacacg atcatcagcg tctgcgcctg 120
ttcagcctgg cgcgttttga aatgccgcag accgtcattg aacaatatga aggcattatc 180ttcagcctgg cgcgttttga aatgccgcag accgtcattg aacaatatga aggcattatc 180
cagttctcac gcaacgtgga acacaatgtt gaacaactgc tggaacagct gcaaacgatc 240cagttctcac gcaacgtgga acacaatgtt gaacaactgc tggaacagct gcaaacgatc 240
ctgtcgcagg aaggtaaaca atttgaactg cacctgcatc tgaacctgtt tcacagtttc 300ctgtcgcagg aaggtaaaca atttgaactg cacctgcatc tgaacctgtt tcacagtttc 300
gaaatgtttc tgaatctgtc cccgacctac acgaaataca aagaaaaaat ctcaaaaatc 360gaaatgtttc tgaatctgtc cccgacctac acgaaataca aagaaaaaat ctcaaaaatc 360
gttctgcatc tgtatgatga cggctcggaa ggtgtcatga aacagtacca actgcagcaa 420gttctgcatc tgtatgatga cggctcggaa ggtgtcatga aacagtacca actgcagcaa 420
agtaactccc tggcacagga tctggctagc accaaagcgt cactggtttc gctgttcaaa 480agtaactccc tggcacagga tctggctagc accaaagcgt cactggtttc gctgttcaaa 480
aacggcgaag gtgccttttc tcagattgat ctgatccgtt atgtctggaa tgcagtgctg 540aacggcgaag gtgccttttc tcagattgat ctgatccgtt atgtctggaa tgcagtgctg 540
gaaacccact attacctgct gtcagaccac tttctggccc atgaaaaact gcagccgctg 600gaaacccact attacctgct gtcagaccac tttctggccc atgaaaaact gcagccgctg 600
aaaattgaac tgggccatta ccagctgctg aatctgtctg cctatcaata cctgagctct 660aaaattgaac tgggccatta ccagctgctg aatctgtctg cctatcaata cctgagctct 660
gaagatctgc tgtggctgaa acaaattctg aaaatcgacg cagaactgga aagtctgatg 720gaagatctgc tgtggctgaa acaaattctg aaaatcgacg cagaactgga aagtctgatg 720
cataaactga ccacgcagcc ggtgtatttc tttagcggta ccacgttttt caacatttcg 780cataaactga ccacgcagcc ggtgtatttc tttagcggta ccacgttttt caacatttcg 780
ttcgaagata aacagcgtct ggccaatatc cacgcaattc tgatccgcga acatctggac 840ttcgaagata aacagcgtct ggccaatatc cacgcaattc tgatccgcga acatctggac 840
ccgaacagtc agctgtttat cggcgaaccg tacctgtttg ttttcaaagg tcacccgaac 900ccgaacagtc agctgtttat cggcgaaccg tacctgtttg ttttcaaagg tcacccgaac 900
tccccggaaa ttaatcaggc tctgcgcgaa tattacccga acgcgatctt cctgccggaa 960tccccggaaa ttaatcaggc tctgcgcgaa tattacccga acgcgatctt cctgccggaa 960
aatattccgt ttgaaatcct gaccctgctg ggcttcagcc cgcagaaaat tggcggtttt 1020aatattccgt ttgaaatcct gaccctgctg ggcttcagcc cgcagaaaat tggcggtttt 1020
gcttctacga tccatgtgaa cagcgaacaa tctaaactgg cgaaactgtt tttcctgacc 1080gcttctacga tccatgtgaa cagcgaacaa tctaaactgg cgaaactgtt tttcctgacc 1080
agtacggatg aacaggaacg taatcgctcc gacggttata ttaaacagta cgcgctggcc 1140agtacggatg aacaggaacg taatcgctcc gacggttata ttaaacagta cgcgctggcc 1140
caagcaatgc tggaaatgca actggtctcg caagaacaag tctactactg ctcgctgtcg 1200caagcaatgc tggaaatgca actggtctcg caagaacaag tctactactg ctcgctgtcg 1200
tcgtaa 1206tcgtaa 1206
<210> 26<210> 26
<211> 936<211> 936
<212> ДНК<212> DNA
<213> Haemophilus somnus<213> Haemophilus somnus
<400> 26<400> 26
atgttccgtg aagacaatat gaacctgatt atctgctgta cgccgctgca agtgattatc 60atgttccgtg aagacaatat gaacctgatt atctgctgta cgccgctgca agtgattatc 60
gccgaaaaaa ttatcgaacg ctatccggaa cagaaatttt atggcgttat gctggaatca 120gccgaaaaaa ttatcgaacg ctatccggaa cagaaatttt atggcgttat gctggaatca 120
ttctacaacg ataaattcga cttctacgaa aacaaactga aacatctgtg ccacgaattt 180ttctacaacg ataaattcga cttctacgaa aacaaactga aacatctgtg ccacgaattt 180
ttctgtatca aaatcgcacg tttcaaactg gaacgctata aaaacctgct gtcactgctg 240ttctgtatca aaatcgcacg tttcaaactg gaacgctata aaaacctgct gtcactgctg 240
aaaatcaaaa acaaaacctt cgatcgtgtc ttcctggcta acatcgaaaa acgctacatc 300aaaatcaaaa acaaaacctt cgatcgtgtc ttcctggcta acatcgaaaa acgctacatc 300
catatcatcc tgtcgaacat tttctttaaa gaactgtaca ccttcgatga cggcacggcg 360catatcatcc tgtcgaacat tttctttaaa gaactgtaca ccttcgatga cggcacggcg 360
aacatcgccc cgaatagtca tctgtatcaa gaatacgatc actccctgaa aaaacgtatt 420aacatcgccc cgaatagtca tctgtatcaa gaatacgatc actccctgaa aaaacgtatt 420
accgacatcc tgctgccgaa ccattacaac agcaacaaag tgaaaaacat cagcaaactg 480accgacatcc tgctgccgaa ccattacaac agcaacaaag tgaaaaacat cagcaaactg 480
cactactcta tctaccgctg caaaaacaac atcatcgata acatcgaata catgccgctg 540cactactcta tctaccgctg caaaaacaac atcatcgata acatcgaata catgccgctg 540
tttaacctgg agaaaaaata cacggcacag gataaaagta tttccatcct gctgggtcaa 600tttaacctgg agaaaaaata cacggcacag gataaaagta tttccatcct gctgggtcaa 600
ccgattttct atgacgaaga gaaaaacatt cgtctgatca aagaagtcat cgccaaattc 660ccgattttct atgacgaaga gaaaaacatt cgtctgatca aagaagtcat cgccaaattc 660
aaaatcgatt actacttccc gcacccgcgc gaagattact acatcgacaa cgtgtcttac 720aaaatcgatt actacttccc gcacccgcgc gaagattact acatcgacaa cgtgtcttac 720
atcaaaaccc cgctgatctt tgaagaattt tacgcggaac gttcaatcga aaattcgatc 780atcaaaaccc cgctgatctt tgaagaattt tacgcggaac gttcaatcga aaattcgatc 780
aaaatctata cctttttcag ctctgccgtg ctgaacatcg ttacgaaaga aaatattgat 840aaaatctata cctttttcag ctctgccgtg ctgaacatcg ttacgaaaga aaatattgat 840
cgcatctacg cactgaaacc gaaactgacg gaaaaagcgt atctggattg ttacgacatc 900cgcatctacg cactgaaacc gaaactgacg gaaaaagcgt atctggattg ttacgacatc 900
ctgaaagatt tcggtatcaa agttatcgac atctaa 936ctgaaagatt tcggtatcaa agttatcgac atctaa 936
<210> 27<210> 27
<211> 1200<211> 1200
<212> ДНК<212> DNA
<213> Haemophilus ducreyi<213> Haemophilus ducreyi
<400> 27<400> 27
atgctgattc aacagaacct ggaaatctac ctggactacg caaccatccc gagcctggcc 60atgctgattc aacagaacct ggaaatctac ctggactacg caaccatccc gagcctggcc 60
tgctttatgc acttcattca acacaaagat gacgtcgata gtattcgtct gtttggcctg 120tgctttatgc acttcattca acacaaagat gacgtcgata gtattcgtct gtttggcctg 120
gcacgcttcg atatcccgca gtccattatc gaccgttacc cggctaacca cctgttttat 180gcacgcttcg atatcccgca gtccattatc gaccgttacc cggctaacca cctgttttat 180
cacaacatcg ataatcgcga cctgaccgca gtgctgaacc agctggcgga tattctggcc 240cacaacatcg ataatcgcga cctgaccgca gtgctgaacc agctggcgga tattctggcc 240
caggaaaata aacgttttca aatcaacctg catctgaacc tgtttcacag cattgacctg 300caggaaaata aacgttttca aatcaacctg catctgaacc tgtttcacag cattgacctg 300
tttttcgcta tttatccgat ctaccagcaa tatcagcata aaatttctac catccagctg 360tttttcgcta tttatccgat ctaccagcaa tatcagcata aaatttctac catccagctg 360
caactgtacg atgacggcag cgaaggtatt gttacgcagc attctctgtg caaaattgcg 420caactgtacg atgacggcag cgaaggtatt gttacgcagc attctctgtg caaaattgcg 420
gatctggaac agctgatcct gcaacacaaa aacgtgctgc tggaactgct gaccaaaggc 480gatctggaac agctgatcct gcaacacaaa aacgtgctgc tggaactgct gaccaaaggc 480
acggccaacg ttccgaatcc gaccctgctg cgttatctgt ggaacaatat tatcgattca 540acggccaacg ttccgaatcc gaccctgctg cgttatctgt ggaacaatat tatcgattca 540
cagtttcatc tgatctcgga ccattttctg caacacccga aactgcaacc gctgaaacgt 600cagtttcatc tgatctcgga ccattttctg caacacccga aactgcaacc gctgaaacgt 600
ctgctgaaac gctacaccat tctggatttt acgtgttatc cgcgcttcaa tgccgaacag 660ctgctgaaac gctacaccat tctggatttt acgtgttatc cgcgcttcaa tgccgaacag 660
aaacaactgc tgaaagaaat tctgcatatc tcaaacgaac tggaaaatct gctgaaactg 720aaacaactgc tgaaagaaat tctgcatatc tcaaacgaac tggaaaatct gctgaaactg 720
ctgaaacagc acaacacctt tctgttcacg ggcaccacgg cgtttaatct ggatcaggaa 780ctgaaacagc acaacacctt tctgttcacg ggcaccacgg cgtttaatct ggatcaggaa 780
aaactggacc tgctgaccca actgcatatc ctgctgctga acgaacacca gaatccgcat 840aaactggacc tgctgaccca actngcatatc ctgctgctga acgaacacca gaatccgcat 840
tcaacgcact acattggcaa caattatctg ctgctgatca aaggtcatgc aaactcgccg 900tcaacgcact acattggcaa caattatctg ctgctgatca aaggtcatgc aaactcgccg 900
gctctgaatc ataccctggc gctgcacttt ccggatgcga ttttcctgcc ggccaatatt 960gctctgaatc ataccctggc gctgcacttt ccggatgcga ttttcctgcc ggccaatatt 960
ccgtttgaaa tcttcgcgat gctgggcttt acgccgaaca aaatgggcgg tttcgccagc 1020ccgtttgaaa tcttcgcgat gctgggcttt acgccgaaca aaatgggcgg tttcgccagc 1020
acctcttaca ttaattatcc gacggaaaac atcaatcacc tgtttttcct gaccagtgat 1080acctcttaca ttaattatcc gacggaaaac atcaatcacc tgtttttcct gaccagtgat 1080
cagccgtcca ttcgcacgaa atggctggac tacgaaaaac aatttggtct gatgtattcc 1140cagccgtcca ttcgcacgaa atggctggac tacgaaaaac aatttggtct gatgtattcc 1140
ctgctggcaa tgcagaaaat caacgaagat caggcgttta tgtgcaccat tcacaattaa 1200ctgctggcaa tgcagaaaat caacgaagat caggcgttta tgtgcaccat tcacaattaa 1200
<210> 28<210> 28
<211> 1494<211> 1494
<212> ДНК<212> DNA
<213> Photobacterium leiognathi<213> Photobacterium leiognathi
<400> 28<400> 28
atgtgtaacg ataatcaaaa tacggtcgat gttgttgtga gcaccgttaa cgataacgtc 60atgtgtaacg ataatcaaaa tacggtcgat gttgttgtga gcaccgttaa cgataacgtc 60
atcgaaaaca acacgtacca agttaaaccg atcgataccc cgaccacgtt tgacagttac 120atcgaaaaca acacgtacca agttaaaccg atcgataccc cgaccacgtt tgacagttac 120
tcctggattc agacgtgcgg caccccgatc ctgaaagatg acgaaaaata ttcactgtcg 180tcctggattc agacgtgcgg caccccgatc ctgaaagatg acgaaaaata ttcactgtcg 180
tttgatttcg tcgccccgga actggatcag gacgaaaaat tctgtttcga atttaccggc 240tttgatttcg tcgccccgga actggatcag gacgaaaaat tctgtttcga atttaccggc 240
gatgttgacg gtaaacgtta tgtcacgcag accaacctga cggtggttgc accgaccctg 300gatgttgacg gtaaacgtta tgtcacgcag accaacctga cggtggttgc accgaccctg 300
gaagtttacg tcgatcatgc tagtctgccg tccctgcagc aactgatgaa aatcatccag 360gaagtttacg tcgatcatgc tagtctgccg tccctgcagc aactgatgaa aatcatccag 360
cagaaaaacg aatactcaca gaatgaacgt ttcatttcgt ggggccgcat cggtctgacg 420cagaaaaacg aatactcaca gaatgaacgt ttcatttcgt ggggccgcat cggtctgacg 420
gaagataacg cggaaaaact gaatgcccat atttatccgc tggcaggcaa caatacctca 480gaagataacg cggaaaaact gaatgcccat atttatccgc tggcaggcaa caatacctca 480
caggaactgg tggatgcagt gatcgattac gctgactcga aaaaccgtct gaatctggaa 540caggaactgg tggatgcagt gatcgattac gctgactcga aaaaccgtct gaatctggaa 540
ctgaacacga ataccgcgca cagctttccg aacctggccc cgattctgcg cattatcagc 600ctgaacacga ataccgcgca cagctttccg aacctggccc cgattctgcg cattatcagc 600
tctaaaagca acatcctgat ctctaacatc aacctgtacg atgacggcag tgctgaatat 660tctaaaagca acatcctgat ctctaacatc aacctgtacg atgacggcag tgctgaatat 660
gtgaacctgt acaattggaa agataccgaa gacaaatccg tgaaactgag cgattctttc 720gtgaacctgt acaattggaa agataccgaa gacaaatccg tgaaactgag cgattctttc 720
ctggttctga aagactactt taacggtatt agttccgaaa aaccgagcgg catctatggt 780ctggttctga aagactactt taacggtatt agttccgaaa aaccgagcgg catctatggt 780
cgctacaact ggcatcaact gtataatacg tcttattact tcctgcgtaa agattacctg 840cgctacaact ggcatcaact gtataatacg tcttattact tcctgcgtaa agattacctg 840
accgttgaac cgcagctgca cgacctgcgc gaatatctgg gcggtagtct gaaacaaatg 900accgttgaac cgcagctgca cgacctgcgc gaatatctgg gcggtagtct gaaacaaatg 900
tcctgggatg gcttttcaca gctgtcgaaa ggtgacaaag aactgttcct gaacattgtc 960tcctgggatg gcttttcaca gctgtcgaaa ggtgacaaag aactgttcct gaacattgtc 960
ggctttgatc aggaaaaact gcagcaagaa taccagcaat cagaactgcc gaatttcgtg 1020ggctttgatc aggaaaaact gcagcaagaa taccagcaat cagaactgcc gaatttcgtg 1020
tttacgggca ccacgacctg ggcaggcggt gaaaccaaag aatattacgc tcagcaacag 1080tttacgggca ccacgacctg ggcaggcggt gaaaccaaag aatattacgc tcagcaacag 1080
gtgaacgtcg tgaacaatgc gattaatgaa accagcccgt attacctggg ccgtgaacat 1140gtgaacgtcg tgaacaatgc gattaatgaa accagcccgt attacctggg ccgtgaacat 1140
gacctgtttt tcaaaggtca cccgcgcggc ggtattatca atgatattat cctgggcagt 1200gacctgtttt tcaaaggtca cccgcgcggc ggtattatca atgatattat cctgggcagt 1200
ttcaacaata tgattgacat cccggccaaa gtgtcctttg aagttctgat gatgacgggt 1260ttcaacaata tgattgacat cccggccaaa gtgtcctttg aagttctgat gatgacgggt 1260
atgctgccgg ataccgtggg cggtattgcg tcatcgctgt attttagcat cccggccgaa 1320atgctgccgg ataccgtggg cggtattgcg tcatcgctgt attttagcat cccggccgaa 1320
aaagtctctt tcattgtgtt taccagctct gatacgatca ccgatcgtga agacgcgctg 1380aaagtctctt tcattgtgtt taccagctct gatacgatca ccgatcgtga agacgcgctg 1380
aaatctccgc tggtgcaggt tatgatgacc ctgggcattg ttaaagaaaa agatgtgctg 1440aaatctccgc tggtgcaggt tatgatgacc ctgggcattg ttaaagaaaa agatgtgctg 1440
ttctggtcgg atctgccgga ttgttcctcg ggtgtttgta ttgctcagta ttaa 1494ttctggtcgg atctgccgga ttgttcctcg ggtgtttgta ttgctcagta ttaa 1494
<210> 29<210> 29
<211> 1497<211> 1497
<212> ДНК<212> DNA
<213> Photobacterium sp.<213> Photobacterium sp.
<400> 29<400> 29
atgagtgaag aaaacaccca gtccattatt aaaaacgaca tcaacaaaac catcatcgat 60atgagtgaag aaaacaccca gtccattatt aaaaacgaca tcaacaaaac catcatcgat 60
gaagaatacg ttaacctgga accgatcaac cagtctaaca tcagttttac caaacatagc 120gaagaatacg ttaacctgga accgatcaac cagtctaaca tcagttttac caaacatagc 120
tgggtccaga cctgcggtac gcagcaactg ctgacggaac aaaacaaaga atcaatttcg 180tgggtccaga cctgcggtac gcagcaactg ctgacggaac aaaacaaaga atcaatttcg 180
ctgagcgtgg ttgcgccgcg tctggatgac gatgaaaaat actgtttcga tttcaacggt 240ctgagcgtgg ttgcgccgcg tctggatgac gatgaaaaat actgtttcga tttcaacggt 240
gttagtaata aaggcgaaaa atacatcacc aaagtcacgc tgaatgtcgt ggcaccgtct 300gttagtaata aaggcgaaaa atacatcacc aaagtcacgc tgaatgtcgt ggcaccgtct 300
ctggaagttt atgtggatca tgctagtctg ccgaccctgc aacaactgat ggatattatc 360ctggaagttt atgtggatca tgctagtctg ccgaccctgc aacaactgat ggatattatc 360
aaatcggaag aagaaaaccc gaccgcacag cgttacattg cttggggccg catcgtgccg 420aaatcggaag aagaaaaccc gaccgcacag cgttacattg cttggggccg catcgtgccg 420
acggacgaac agatgaaaga actgaatatt accagctttg cgctgatcaa caatcacacg 480acggacgaac agatgaaaga actgaatatt accagctttg cgctgatcaa caatcacacg 480
ccggccgatc tggttcagga aattgtcaaa caggcgcaaa ccaaacatcg tctgaacgtg 540ccggccgatc tggttcagga aattgtcaaa caggcgcaaa ccaaacatcg tctgaacgtg 540
aaactgagca gcaatacggc ccactcgttt gacaatctgg ttccgattct gaaagaactg 600aaactgagca gcaatacggc ccactcgttt gacaatctgg ttccgattct gaaagaactg 600
aacagcttca acaatgtgac cgttacgaat atcgatctgt atgacgatgg cagcgcggaa 660aacagcttca acaatgtgac cgttacgaat atcgatctgt atgacgatgg cagcgcggaa 660
tatgttaacc tgtacaattg gcgcgacacc ctgaacaaaa cggataatct gaaaattggc 720tatgttaacc tgtacaattg gcgcgacacc ctgaacaaaa cggataatct gaaaattggc 720
aaagactatc tggaagatgt cattaacggt atcaatgaag ataccagcaa caccggcacg 780aaagactatc tggaagatgt cattaacggt atcaatgaag ataccagcaa caccggcacg 780
agttccgtgt acaattggca gaaactgtat ccggctaact accattttct gcgtaaagat 840agttccgtgt acaattggca gaaactgtat ccggctaact accattttct gcgtaaagat 840
tatctgaccc tggaaccgtc cctgcacgaa ctgcgcgact acattggtga ttcactgaaa 900tatctgaccc tggaaccgtc cctgcacgaa ctgcgcgact acattggtga ttcactgaaa 900
cagatgcaat gggacggctt caaaaaattc aactcgaaac agcaagaact gtttctgagc 960cagatgcaat gggacggctt caaaaaattc aactcgaaac agcaagaact gtttctgagc 960
atcgtgaatt tcgataaaca gaaactgcaa aacgaataca attcatcgaa cctgccgaat 1020atcgtgaatt tcgataaaca gaaactgcaa aacgaataca attcatcgaa cctgccgaat 1020
tttgtgttca ccggtaccac ggtttgggca ggcaaccacg aacgcgaata ctacgctaaa 1080tttgtgttca ccggtaccac ggtttgggca ggcaaccacg aacgcgaata ctacgctaaa 1080
cagcaaatca acgttatcaa caacgccatc aacgaaagct ctccgcatta tctgggtaat 1140cagcaaatca acgttatcaa caacgccatc aacgaaagct ctccgcatta tctgggtaat 1140
tcctacgacc tgtttttcaa aggccacccg ggcggtggca ttatcaacac cctgatcatg 1200tcctacgacc tgtttttcaa aggccacccg ggcggtggca ttatcaacac cctgatcatg 1200
cagaattatc cgtcaatggt cgatattccg tccaaaatct catttgaagt gctgatgatg 1260cagaattatc cgtcaatggt cgatattccg tccaaaatct catttgaagt gctgatgatg 1260
accgacatgc tgccggatgc cgtggcaggt attgcgagtt ccctgtactt cacgatcccg 1320accgacatgc tgccggatgc cgtggcaggt attgcgagtt ccctgtactt cacgatcccg 1320
gccgaaaaaa tcaaattcat cgttttcacc tctacggaaa ccattacgga tcgtgaaacc 1380gccgaaaaaa tcaaattcat cgttttcacc tctacggaaa ccattacgga tcgtgaaacc 1380
gccctgcgta gtccgctggt ccaggtgatg attaaactgg gcatcgtgaa agaagaaaat 1440gccctgcgta gtccgctggt ccaggtgatg attaaactgg gcatcgtgaa agaagaaaat 1440
gtgctgttct gggcggacct gccgaattgc gaaacgggtg tctgtattgc tgtctga 1497gtgctgttct gggcggacct gccgaattgc gaaacgggtg tctgtattgc tgtctga 1497
<210> 30<210> 30
<211> 1449<211> 1449
<212> ДНК<212> DNA
<213> Photobacterium leiognathi<213> Photobacterium leiognathi
<400> 30<400> 30
atgaacgata atcaaaatac ggtggacgtg gtggtctcaa ccgtcaacga taacgtgatc 60atgaacgata atcaaaatac ggtggacgtg gtggtctcaa ccgtcaacga taacgtgatc 60
gaaaacaaca cgtaccaagt caaaccgatc gataccccga ccacgttcga ctcatactcg 120gaaaacaaca cgtaccaagt caaaccgatc gataccccga ccacgttcga ctcatactcg 120
tggattcaga cgtgcggcac cccgatcctg aaagatgacg aaaaatatag cctgtctttt 180tggattcaga cgtgcggcac cccgatcctg aaagatgacg aaaaatatag cctgtctttt 180
gatttcgttg ccccggaact ggatcaagac gaaaaattct gtttcgaatt taccggcgat 240gatttcgttg ccccggaact ggatcaagac gaaaaattct gtttcgaatt taccggcgat 240
gtggatggta aacgttatgt gacgcagacc aacctgacgg tggttgcacc gaccctggaa 300gtggatggta aacgttatgt gacgcagacc aacctgacgg tggttgcacc gaccctggaa 300
gtttacgtcg atcatgcttc actgccgtcg ctgcagcaac tgatgaaaat catccagcag 360gtttacgtcg atcatgcttc actgccgtcg ctgcagcaac tgatgaaaat catccagcag 360
aaaaacgaat acagccagaa tgaacgcttt atttcttggg gccgtatccg cctgacggaa 420aaaaacgaat acagccagaa tgaacgcttt atttcttggg gccgtatccg cctgacggaa 420
gataacgcgg aaaaactgaa tgcccatatt tatccgctgg caggcaacaa taccagccag 480gataacgcgg aaaaactgaa tgcccatatt tatccgctgg caggcaacaa taccagccag 480
gaactggtgg acgcagttat cgattacgct gactctaaaa accgtctgaa tctggaactg 540gaactggtgg acgcagttat cgattacgct gactctaaaa accgtctgaa tctggaactg 540
aacacgaata ccggccacag tttccgtaac attgcgccga tcctgcgcgc caccagctct 600aacacgaata ccggccacag tttccgtaac attgcgccga tcctgcgcgc caccagctct 600
aaaaacaaca tcctgatctc caacatcaac ctgtacgatg acggtagtgc tgaatatgtg 660aaaaacaaca tcctgatctc caacatcaac ctgtacgatg acggtagtgc tgaatatgtg 660
tccctgtaca actggaaaga taccgacaat aaatcacaga aactgagtga ttcctttctg 720tccctgtaca actggaaaga taccgacaat aaatcacaga aactgagtga ttcctttctg 720
gttctgaaag actacctgaa tggcatcagt tccgaaaaac cgaacggtat ttatagcatc 780gttctgaaag actacctgaa tggcatcagt tccgaaaaac cgaacggtat ttatagcatc 780
tacaattggc atcagctgta tcactcatcg tattacttcc tgcgtaaaga ttacctgacg 840tacaattggc atcagctgta tcactcatcg tattacttcc tgcgtaaaga ttacctgacg 840
gtggaaacca aactgcacga cctgcgcgaa tatctgggcg gttcactgaa acaaatgtcg 900gtggaaacca aactgcacga cctgcgcgaa tatctgggcg gttcactgaa acaaatgtcg 900
tgggatacct ttagccagct gtctaaaggc gacaaagaac tgttcctgaa cattgttggt 960tgggatacct ttagccagct gtctaaaggc gacaaagaac tgttcctgaa cattgttggt 960
tttgatcagg aaaaactgca gcaagaatac cagcaaagcg aactgccgaa tttcgtcttt 1020tttgatcagg aaaaactgca gcaagaatac cagcaaagcg aactgccgaa tttcgtcttt 1020
acgggcacca cgacctgggc aggcggtgaa accaaagaat attacgctca gcaacaggtg 1080acgggcacca cgacctgggc aggcggtgaa accaaagaat attacgctca gcaacaggtg 1080
aacgtcgtga acaatgcgat taatgaaacc tctccgtatt acctgggccg tgaacatgac 1140aacgtcgtga acaatgcgat taatgaaacc tctccgtatt acctgggccg tgaacatgac 1140
ctgtttttca aaggtcaccc gcgcggcggt attatcaatg atattatcct gggctcattc 1200ctgtttttca aaggtcaccc gcgcggcggt attatcaatg atattatcct gggctcattc 1200
aacaatatga ttgacatccc ggccaaagtt tcgtttgaag tcctgatgat gacgggtatg 1260aacaatatga ttgacatccc ggccaaagtt tcgtttgaag tcctgatgat gacgggtatg 1260
ctgccggata ccgttggcgg tattgcgagc agcctgtatt ttagtatccc ggccgaaaaa 1320ctgccggata ccgttggcgg tattgcgagc agcctgtatt ttagtatccc ggccgaaaaa 1320
gtgtccttca ttgtttttac cagttccgat acgatcaccg atcgcgaaga cgcgctgaaa 1380gtgtccttca ttgtttttac cagttccgat acgatcaccg atcgcgaaga cgcgctgaaa 1380
agtccgctgg tccaagtgat gatgaccctg ggcattgtga aagaaaaaga tgtgctgttc 1440agtccgctgg tccaagtgat gatgaccctg ggcattgtga aagaaaaaga tgtgctgttc 1440
tggtgctaa 1449tggtgctaa 1449
<210> 31<210> 31
<211> 2028<211> 2028
<212> ДНК<212> DNA
<213> Photobacterium damsela<213> Photobacterium damsela
<400> 31<400> 31
atgaaaaaga tcctgaccgt cctgagcatc tttatcctga gcgcctgtaa tagcgacaac 60atgaaaaaga tcctgaccgt cctgagcatc tttatcctga gcgcctgtaa tagcgacaac 60
acctctctga aagaaaccgt ctccagcaac agcgcggatg tggttgaaac ggaaacctat 120acctctctga aagaaaccgt ctccagcaac agcgcggatg tggttgaaac ggaaacctat 120
cagctgaccc cgattgacgc cccgagcagc tttctgagcc attcttggga acagacgtgc 180cagctgaccc cgattgacgc cccgagcagc tttctgagcc attcttggga agacgtgc 180
ggcaccccga tcctgaatga aagtgataaa caagcgattt cctttgactt cgtggccccg 240ggcaccccga tcctgaatga aagtgataaa caagcgattt cctttgactt cgtggccccg 240
gaactgaaac aggatgaaaa atactgtttc acgttcaaag gcatcaccgg tgaccaccgc 300gaactgaaac aggatgaaaa atactgtttc acgttcaaag gcatcaccgg tgaccaccgc 300
tacattacga acaccaccct gaccgttgtg gcaccgacgc tggaagtgta tatcgatcat 360tacattacga acaccaccct gaccgttgtg gcaccgacgc tggaagtgta tatcgatcat 360
gctagtctgc cgagcctgca acaactgatt cacattatcc aggcgaaaga tgaatacccg 420gctagtctgc cgagcctgca acaactgatt cacattatcc aggcgaaaga tgaatacccg 420
tcaaaccaac gctttgtttc gtggaaacgt gttaccgtcg atgcggacaa cgccaataaa 480tcaaaccaac gctttgtttc gtggaaacgt gttaccgtcg atgcggacaa cgccaataaa 480
ctgaatattc atacctatcc gctgaaaggc aacaatacgt caccggaaat ggttgcggcc 540ctgaatattc atacctatcc gctgaaaggc aacaatacgt caccggaaat ggttgcggcc 540
atcgatgaat atgcacaatc gaaaaaccgc ctgaatattg aattttacac gaataccgct 600atcgatgaat atgcacaatc gaaaaaccgc ctgaatattg aattttacac gaataccgct 600
catgtcttca acaatctgcc gccgattatc cagccgctgt acaacaacga aaaagtcaaa 660catgtcttca acaatctgcc gccgattatc cagccgctgt acaacaacga aaaagtcaaa 660
atttcacaca tctcgctgta cgatgacggt agttccgaat atgtgagtct gtaccagtgg 720atttcacaca tctcgctgta cgatgacggt agttccgaat atgtgagtct gtaccagtgg 720
aaagataccc cgaacaaaat tgaaacgctg gaaggcgaag tgagcctgct ggcaaattat 780aaagataccc cgaacaaaat tgaaacgctg gaaggcgaag tgagcctgct ggcaaattat 780
ctggctggca ccagcccgga tgcaccgaaa ggcatgggta accgttataa ttggcataaa 840ctggctggca ccagcccgga tgcaccgaaa ggcatgggta accgttataa ttggcataaa 840
ctgtacgata ccgactatta ctttctgcgc gaagattatc tggacgtgga agcgaacctg 900ctgtacgata ccgactatta ctttctgcgc gaagattatc tggacgtgga agcgaacctg 900
cacgatctgc gtgactacct gggttcatcg gcaaaacaga tgccgtggga tgaatttgct 960cacgatctgc gtgactacct gggttcatcg gcaaaacaga tgccgtggga tgaatttgct 960
aaactgagtg actcccagca aaccctgttt ctggatatcg ttggcttcga caaagaacag 1020aaactgagtg actcccagca aaccctgttt ctggatatcg ttggcttcga caaagaacag 1020
ctgcaacaac agtattcaca atcgccgctg ccgaatttta tttttaccgg caccaccacc 1080ctgcaacaac agtattcaca atcgccgctg ccgaatttta tttttaccgg caccaccacc 1080
tgggcgggcg gtgaaacgaa agaatattac gcccaacagc aagtgaacgt tattaacaat 1140tgggcgggcg gtgaaacgaa agaatattac gcccaacagc aagtgaacgt tattaacaat 1140
gccatcaatg aaaccagccc gtattacctg ggcaaagatt acgacctgtt tttcaaaggt 1200gccatcaatg aaaccagccc gtattacctg ggcaaagatt acgacctgtt tttcaaaggt 1200
catccggcag gcggtgtgat caacgatatt atcctgggca gttttccgga catgattaat 1260catccggcag gcggtgtgat caacgatatt atcctgggca gttttccgga catgattaat 1260
atcccggcta aaatttcctt cgaagtgctg atgatgaccg atatgctgcc ggacacggtt 1320atcccggcta aaatttcctt cgaagtgctg atgatgaccg atatgctgcc ggacacggtt 1320
gcaggtatcg ctagctctct gtattttacc attccggcgg ataaagtgaa ctttatcgtt 1380gcaggtatcg ctagctctct gtattttacc attccggcgg ataaagtgaa ctttatcgtt 1380
ttcacgagtt ccgatacgat taccgaccgt gaagaagccc tgaaaagccc gctggtccag 1440ttcacgagtt ccgatacgat taccgaccgt gaagaagccc tgaaaagccc gctggtccag 1440
gtgatgctga ccctgggcat cgtcaaagaa aaagatgtgc tgttctgggc agaccacaaa 1500gtgatgctga ccctgggcat cgtcaaagaa aaagatgtgc tgttctgggc agaccacaaa 1500
gttaatagca tggaagtcgc gattgatgaa gcctgcaccc gcattatcgc aaaacgtcag 1560gttaatagca tggaagtcgc gattgatgaa gcctgcaccc gcattatcgc aaaacgtcag 1560
ccgacggctt ctgatctgcg cctggtgatt gcgattatca aaacgatcac cgatctggaa 1620ccgacggctt ctgatctgcg cctggtgatt gcgattatca aaacgatcac cgatctggaa 1620
cgtattggcg acgttgccga atctattgcg aaagtcgcgc tggaatcttt ttctaacaaa 1680cgtattggcg acgttgccga atctattgcg aaagtcgcgc tggaatcttt ttctaacaaa 1680
cagtacaatc tgctggttag cctggaatct ctgggtcaac ataccgtgcg catgctgcac 1740cagtacaatc tgctggttag cctggaatct ctgggtcaac ataccgtgcg catgctgcac 1740
gaagttctgg atgcattcgc tcgtatggac gtcaaagcag ctatcgaagt gtatcaggaa 1800gaagttctgg atgcattcgc tcgtatggac gtcaaagcag ctatcgaagt gtatcaggaa 1800
gatgaccgca tcgatcaaga atacgaaagt attgtccgtc agctgatggc ccacatgatg 1860gatgaccgca tcgatcaaga atacgaaagt attgtccgtc agctgatggc ccacatgatg 1860
gaagatccgt catcgattcc gaacgttatg aaagtcatgt gggcggcccg ttccatcgaa 1920gaagatccgt catcgattcc gaacgttatg aaagtcatgt gggcggcccg ttccatcgaa 1920
cgcgttggtg atcgttgcca gaatatttgt gaatacatca tctacttcgt gaaaggcaaa 1980cgcgttggtg atcgttgcca gaatatttgt gaatacatca tctacttcgt gaaaggcaaa 1980
gatgttcgcc acaccaaacc ggatgacttc ggtacgatgc tggactaa 2028gatgttcgcc acaccaaacc ggatgacttc ggtacgatgc tggactaa 2028
<210> 32<210> 32
<211> 1533<211> 1533
<212> ДНК<212> DNA
<213> Photobacterium damsela<213> Photobacterium damsela
<400> 32<400> 32
atgaaaaaga tcctgaccgt cctgagcatc tttatcctga gcgcctgtaa tagcgacaac 60atgaaaaaga tcctgaccgt cctgagcatc tttatcctga gcgcctgtaa tagcgacaac 60
acctctctga aagaaaccgt ctccagcaac agcgcggatg tggttgaaac ggaaacctat 120acctctctga aagaaaccgt ctccagcaac agcgcggatg tggttgaaac ggaaacctat 120
cagctgaccc cgattgacgc cccgagcagc tttctgagcc attcttggga acagacgtgc 180cagctgaccc cgattgacgc cccgagcagc tttctgagcc attcttggga agacgtgc 180
ggcaccccga tcctgaatga aagtgataaa caagcgattt cctttgactt cgtggccccg 240ggcaccccga tcctgaatga aagtgataaa caagcgattt cctttgactt cgtggccccg 240
gaactgaaac aggatgaaaa atactgtttc acgttcaaag gcatcaccgg tgaccaccgc 300gaactgaaac aggatgaaaa atactgtttc acgttcaaag gcatcaccgg tgaccaccgc 300
tacattacga acaccaccct gaccgttgtg gcaccgacgc tggaagtgta tatcgatcat 360tacattacga acaccaccct gaccgttgtg gcaccgacgc tggaagtgta tatcgatcat 360
gctagtctgc cgagcctgca acaactgatt cacattatcc aggcgaaaga tgaatacccg 420gctagtctgc cgagcctgca acaactgatt cacattatcc aggcgaaaga tgaatacccg 420
tcaaaccaac gctttgtttc gtggaaacgt gttaccgtcg atgcggacaa cgccaataaa 480tcaaaccaac gctttgtttc gtggaaacgt gttaccgtcg atgcggacaa cgccaataaa 480
ctgaatattc atacctatcc gctgaaaggc aacaatacgt caccggaaat ggttgcggcc 540ctgaatattc atacctatcc gctgaaaggc aacaatacgt caccggaaat ggttgcggcc 540
atcgatgaat atgcacaatc gaaaaaccgc ctgaatattg aattttacac gaataccgct 600atcgatgaat atgcacaatc gaaaaaccgc ctgaatattg aattttacac gaataccgct 600
catgtcttca acaatctgcc gccgattatc cagccgctgt acaacaacga aaaagtcaaa 660catgtcttca acaatctgcc gccgattatc cagccgctgt acaacaacga aaaagtcaaa 660
atttcacaca tctcgctgta cgatgacggt agttccgaat atgtgagtct gtaccagtgg 720atttcacaca tctcgctgta cgatgacggt agttccgaat atgtgagtct gtaccagtgg 720
aaagataccc cgaacaaaat tgaaacgctg gaaggcgaag tgagcctgct ggcaaattat 780aaagataccc cgaacaaaat tgaaacgctg gaaggcgaag tgagcctgct ggcaaattat 780
ctggctggca ccagcccgga tgcaccgaaa ggcatgggta accgttataa ttggcataaa 840ctggctggca ccagcccgga tgcaccgaaa ggcatgggta accgttataa ttggcataaa 840
ctgtacgata ccgactatta ctttctgcgc gaagattatc tggacgtgga agcgaacctg 900ctgtacgata ccgactatta ctttctgcgc gaagattatc tggacgtgga agcgaacctg 900
cacgatctgc gtgactacct gggttcatcg gcaaaacaga tgccgtggga tgaatttgct 960cacgatctgc gtgactacct gggttcatcg gcaaaacaga tgccgtggga tgaatttgct 960
aaactgagtg actcccagca aaccctgttt ctggatatcg ttggcttcga caaagaacag 1020aaactgagtg actcccagca aaccctgttt ctggatatcg ttggcttcga caaagaacag 1020
ctgcaacaac agtattcaca atcgccgctg ccgaatttta tttttaccgg caccaccacc 1080ctgcaacaac agtattcaca atcgccgctg ccgaatttta tttttaccgg caccaccacc 1080
tgggcgggcg gtgaaacgaa agaatattac gcccaacagc aagtgaacgt tattaacaat 1140tgggcgggcg gtgaaacgaa agaatattac gcccaacagc aagtgaacgt tattaacaat 1140
gccatcaatg aaaccagccc gtattacctg ggcaaagatt acgacctgtt tttcaaaggt 1200gccatcaatg aaaccagccc gtattacctg ggcaaagatt acgacctgtt tttcaaaggt 1200
catccggcag gcggtgtgat caacgatatt atcctgggca gttttccgga catgattaat 1260catccggcag gcggtgtgat caacgatatt atcctgggca gttttccgga catgattaat 1260
atcccggcta aaatttcctt cgaagtgctg atgatgaccg atatgctgcc ggacacggtt 1320atcccggcta aaatttcctt cgaagtgctg atgatgaccg atatgctgcc ggacacggtt 1320
gcaggtatcg ctagctctct gtattttacc attccggcgg ataaagtgaa ctttatcgtt 1380gcaggtatcg ctagctctct gtattttacc attccggcgg ataaagtgaa ctttatcgtt 1380
ttcacgagtt ccgatacgat taccgaccgt gaagaagccc tgaaaagccc gctggtccag 1440ttcacgagtt ccgatacgat taccgaccgt gaagaagccc tgaaaagccc gctggtccag 1440
gtgatgctga ccctgggcat cgtcaaagaa aaagatgtgc tgttctgggc agacctgccg 1500gtgatgctga ccctgggcat cgtcaaagaa aaagatgtgc tgttctgggc agacctgccg 1500
gactgctcgt ctggtgtgtg tatcgacaaa taa 1533gactgctcgt ctggtgtgtg tatcgacaaa taa 1533
<210> 33<210> 33
<211> 1269<211> 1269
<212> ДНК<212> DNA
<213> Heliobacter acinonychis<213> Heliobacter acinonychis
<400> 33<400> 33
atggggacca ttaaaaagcc cttaatcata gcaggaaatg gtccatcaat taaggaccta 60atggggacca ttaaaaagcc cttaatcata gcaggaaatg gtccatcaat taaggaccta 60
gactatgctt tatttccaaa agacttcgat gtctttcgct gcaaccagtt ttacttcgag 120gactatgctt tatttccaaa agacttcgat gtctttcgct gcaaccagtt ttacttcgag 120
gataaatatt acctaggacg cgaaataaaa ggagtgttct ttaacccttg tgtattaagc 180gataaatatt acctaggacg cgaaataaaa ggagtgttct ttaacccttg tgtattaagc 180
agtcaaatgc aaacagtgca ataccttatg gacaatggcg aatatagcat agaacgcttc 240agtcaaatgc aaacagtgca ataccttatg gacaatggcg aatatagcat agaacgcttc 240
ttttgcagtg tttcaacaga tcgccacgat tttgatgggg attaccaaac gattttaccg 300ttttgcagtg tttcaacaga tcgccacgat tttgatgggg attaccaaac gattttaccg 300
gtagacggtt atttaaaagc acactatccg ttcgtctgcg atacattcag cttattcaaa 360gtagacggtt atttaaaagc acactatccg ttcgtctgcg atacattcag cttattcaaa 360
ggtcacgaag aaatcttaaa acacgtgaaa taccacctga aaacgtacag caaagaactt 420ggtcacgaag aaatcttaaa acacgtgaaa taccacctga aaacgtacag caaagaactt 420
agtgcgggtg tcttaatgtt attgagtgca gtggtattag gatacaaaga aatataccta 480agtgcgggtg tcttaatgtt attgagtgca gtggtattag gatacaaaga aatataccta 480
gtaggaatcg acttcggcgc ctcatcttgg gggcacttct atgacgaaag ccaatcccaa 540gtaggaatcg acttcggcgc ctcatcttgg gggcacttct atgacgaaag ccaatcccaa 540
cactttagca atcacatggc agattgtcac aatatctatt acgacatgct gactatttgt 600cactttagca atcacatggc agattgtcac aatatctatt acgacatgct gactatttgt 600
ctctgtcaaa agtatgcaaa attgtacgca ttagcaccca attcaccatt atcacatttg 660ctctgtcaaa agtatgcaaa attgtacgca ttagcaccca attcaccatt atcacatttg 660
cttacactaa atccacaggc caaataccca tttgaactat tagataaacc tatcgggtat 720cttacactaa atccacaggc caaataccca tttgaactat tagataaacc tatcgggtat 720
actagcgacc taattattag tagcccgttg gaagagaagt tgctcgaatt taagaatatc 780actagcgacc taattattag tagcccgttg gaagagaagt tgctcgaatt taagaatatc 780
gaagagaagt tgcttgagtt caaaaacata gaagagaaac tcttagagtt caagaatatt 840gaagagaagt tgcttgagtt caaaaacata gaagagaaac tcttagagtt caagaatatt 840
gaagagaaac tattagaatt taaaaacatc gaggaaaaac ttttggagtt caaaaatata 900gaagagaaac tattagaatt taaaaacatc gaggaaaaac ttttggagtt caaaaatata 900
gaagagaaac tcctagagtt caagaacatt gaggaaaagt tgcttgagtt caaaaatatt 960gaagagaaac tcctagagtt caagaacatt gaggaaaagt tgcttgagtt caaaaatatt 960
gaggaaaagt tgctcgaatt taagaatatc gaggaaaaac ttttggaatt taagaacata 1020gaggaaaagt tgctcgaatt taagaatatc gaggaaaaac ttttggaatt taagaacata 1020
gaagaaaagt tactcgaatt taaaaacatt gaagagaaac tattggaatt taaaaatata 1080gaagaaaagt tactcgaatt taaaaacatt gaagagaaac tattggaatt taaaaatata 1080
gaggaaaagt tacttgagtt caaaaacata gaggaaaagt tacttgaatt taagaacata 1140gaggaaaagt tacttgagtt caaaaacata gaggaaaagt tacttgaatt taagaacata 1140
gaagagaaac ttctcgcaag ccgactgaac aacattctac gtaaaatcaa gcggaaaata 1200gaagagaaac ttctcgcaag ccgactgaac aacattctac gtaaaatcaa gcggaaaata 1200
cttccattct tttggggcgg aggtgtaacc ccaacattaa aagttagttt ccgttgggga 1260cttccattct tttggggcgg aggtgtaacc ccaacattaa aagttagttt ccgttgggga 1260
gctgcataa 1269gctgcataa 1269
<210> 34<210> 34
<211> 469<211> 469
<212> Белок<212> Protein
<213> Campylobacter coli<213> Campylobacter coli
<400> 34<400> 34
Met Gln Asn Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Gln Ser Ile Met Gln Asn Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Gln Ser Ile
1 5 10 15 1 5 10 15
Asn Tyr Gln Arg Leu Pro Lys Glu Tyr Asp Ile Phe Arg Cys Asn Gln Asn Tyr Gln Arg Leu Pro Lys Glu Tyr Asp Ile Phe Arg Cys Asn Gln
20 25 30 20 25 30
Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Asn Ile Lys Ala Ala Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Asn Ile Lys Ala Ala
35 40 45 35 40 45
Phe Phe Asn Pro Tyr Pro Phe Leu Gln Gln Tyr His Thr Ala Lys Gln Phe Phe Asn Pro Tyr Pro Phe Leu Gln Gln Tyr His Thr Ala Lys Gln
50 55 60 50 55 60
Leu Val Phe Asn Asn Glu Tyr Lys Ile Glu Asn Ile Phe Cys Ser Thr Leu Val Phe Asn Asn Glu Tyr Lys Ile Glu Asn Ile Phe Cys Ser Thr
65 70 75 80 65 70 75 80
Phe Asn Leu Pro Phe Ile Glu Lys Asp Asn Phe Ile Asn Lys Phe Tyr Phe Asn Leu Pro Phe Ile Glu Lys Asp Asn Phe Ile Asn Lys Phe Tyr
85 90 95 85 90 95
Asp Phe Phe Pro Asp Ala Lys Leu Gly His Lys Ile Ile Glu Asn Leu Asp Phe Phe Pro Asp Ala Lys Leu Gly His Lys Ile Ile Glu Asn Leu
100 105 110 100 105 110
Lys Glu Phe Tyr Ala Tyr Ile Lys Tyr Asn Glu Ile Tyr Leu Asn Lys Lys Glu Phe Tyr Ala Tyr Ile Lys Tyr Asn Glu Ile Tyr Leu Asn Lys
115 120 125 115 120 125
Arg Ile Thr Ser Gly Ile Tyr Met Cys Ala Ile Ala Ile Ala Leu Gly Arg Ile Thr Ser Gly Ile Tyr Met Cys Ala Ile Ala Ile Ala Leu Gly
130 135 140 130 135 140
Tyr Lys Asn Ile Tyr Leu Cys Gly Ile Asp Phe Tyr Glu Gly Glu Thr Tyr Lys Asn Ile Tyr Leu Cys Gly Ile Asp Phe Tyr Glu Gly Glu Thr
145 150 155 160 145 150 155 160
Ile Tyr Pro Phe Lys Ala Met Ser Lys Asn Ile Lys Lys Ile Phe Pro Ile Tyr Pro Phe Lys Ala Met Ser Lys Asn Ile Lys Lys Ile Phe Pro
165 170 175 165 170 175
Trp Ile Lys Asp Phe Asn Pro Ser Asn Phe His Ser Lys Glu Tyr Asp Trp Ile Lys Asp Phe Asn Pro Ser Asn Phe His Ser Lys Glu Tyr Asp
180 185 190 180 185 190
Ile Glu Ile Leu Lys Leu Leu Glu Ser Ile Tyr Lys Val Asn Ile Tyr Ile Glu Ile Leu Lys Leu Leu Glu Ser Ile Tyr Lys Val Asn Ile Tyr
195 200 205 195 200 205
Ala Leu Cys Asp Asn Ser Ala Leu Ala Asn Tyr Phe Pro Leu Leu Val Ala Leu Cys Asp Asn Ser Ala Leu Ala Asn Tyr Phe Pro Leu Leu Val
210 215 220 210 215 220
Asn Thr Asp Asn Ser Phe Val Leu Glu Asn Lys Ser Asp Asp Cys Ile Asn Thr Asp Asn Ser Phe Val Leu Glu Asn Lys Ser Asp Asp Cys Ile
225 230 235 240 225 230 235 240
Asn Asp Ile Leu Leu Thr Asn Asn Thr Pro Gly Ile Asn Phe Tyr Lys Asn Asp Ile Leu Leu Thr Asn Asn Thr Pro Gly Ile Asn Phe Tyr Lys
245 250 255 245 250 255
Ser Gln Ile Gln Val Asn Asn Thr Glu Ile Leu Leu Leu Asn Phe Gln Ser Gln Ile Gln Val Asn Asn Thr Glu Ile Leu Leu Leu Asn Phe Gln
260 265 270 260 265 270
Asn Met Ile Ser Ala Lys Glu Asn Glu Ile Ser Asn Leu Asn Lys Ile Asn Met Ile Ser Ala Lys Glu Asn Glu Ile Ser Asn Leu Asn Lys Ile
275 280 285 275 280 285
Leu Gln Asp Ser Tyr Lys Thr Ile Asn Thr Lys Glu Asn Glu Ile Ser Leu Gln Asp Ser Tyr Lys Thr Ile Asn Thr Lys Glu Asn Glu Ile Ser
290 295 300 290 295 300
Asn Leu Asn Lys Ile Leu Gln Asp Ser Tyr Lys Thr Ile Asn Thr Lys Asn Leu Asn Lys Ile Leu Gln Asp Ser Tyr Lys Thr Ile Asn Thr Lys
305 310 315 320 305 310 315 320
Glu Asn Glu Ile Ser Asn Leu Asn Lys Ile Leu Gln Asp Lys Asp Lys Glu Asn Glu Ile Ser Asn Leu Asn Lys Ile Leu Gln Asp Lys Asp Lys
325 330 335 325 330 335
Leu Leu Ile Val Lys Glu Asn Leu Leu Asn Phe Lys Ser Arg His Gly Leu Leu Ile Val Lys Glu Asn Leu Leu Asn Phe Lys Ser Arg His Gly
340 345 350 340 345 350
Lys Ala Lys Phe Arg Ile Gln Asn Gln Leu Ser Tyr Lys Leu Gly Gln Lys Ala Lys Phe Arg Ile Gln Asn Gln Leu Ser Tyr Lys Leu Gly Gln
355 360 365 355 360 365
Ala Met Met Val Asn Ser Lys Ser Leu Leu Gly Tyr Ile Arg Met Pro Ala Met Met Val Asn Ser Lys Ser Leu Leu Gly Tyr Ile Arg Met Pro
370 375 380 370 375 380
Phe Val Leu Ser Tyr Ile Lys Asp Lys His Lys Gln Glu Gln Lys Ile Phe Val Leu Ser Tyr Ile Lys Asp Lys His Lys Gln Glu Gln Lys Ile
385 390 395 400 385 390 395 400
Tyr Gln Glu Lys Ile Lys Lys Asp Pro Ser Leu Thr Leu Pro Pro Leu Tyr Gln Glu Lys Ile Lys Lys Asp Pro Ser Leu Thr Leu Pro Pro Leu
405 410 415 405 410 415
Glu Asp Tyr Pro Asp Tyr Lys Glu Ala Leu Lys Glu Lys Glu Cys Leu Glu Asp Tyr Pro Asp Tyr Lys Glu Ala Leu Lys Glu Lys Glu Cys Leu
420 425 430 420 425 430
Thr Tyr Arg Leu Gly Gln Thr Leu Ile Lys Ala Asp Gln Glu Trp Tyr Thr Tyr Arg Leu Gly Gln Thr Leu Ile Lys Ala Asp Gln Glu Trp Tyr
435 440 445 435 440 445
Lys Gly Gly Tyr Val Lys Met Trp Phe Glu Ile Lys Lys Leu Lys Lys Lys Gly Gly Tyr Val Lys Met Trp Phe Glu Ile Lys Lys Leu Lys Lys
450 455 460 450 455 460
Glu Tyr Lys Lys Lys Glu Tyr Lys Lys Lys
465 465
<210> 35<210> 35
<211> 381<211> 381
<212> Белок<212> Protein
<213> Vibrio sp.<213> Vibrio sp.
<400> 35<400> 35
Met Asn Asn Asp Asn Ser Thr Thr Thr Asn Asn Asn Ala Ile Glu Ile Met Asn Asn Asp Asn Ser Thr Thr Thr Asn Asn Asn Ala Ile Glu Ile
1 5 10 15 1 5 10 15
Tyr Val Asp Arg Ala Thr Leu Pro Thr Ile Gln Gln Met Thr Lys Ile Tyr Val Asp Arg Ala Thr Leu Pro Thr Ile Gln Gln Met Thr Lys Ile
20 25 30 20 25 30
Val Ser Gln Lys Thr Ser Asn Lys Lys Leu Ile Ser Trp Ser Arg Tyr Val Ser Gln Lys Thr Ser Asn Lys Lys Leu Ile Ser Trp Ser Arg Tyr
35 40 45 35 40 45
Pro Ile Thr Asp Lys Ser Leu Leu Lys Lys Ile Asn Ala Glu Phe Phe Pro Ile Thr Asp Lys Ser Leu Leu Lys Lys Ile Asn Ala Glu Phe Phe
50 55 60 50 55 60
Lys Glu Gln Phe Glu Leu Thr Glu Ser Leu Lys Asn Ile Ile Leu Ser Lys Glu Gln Phe Glu Leu Thr Glu Ser Leu Lys Asn Ile Ile Leu Ser
65 70 75 80 65 70 75 80
Glu Asn Ile Asp Asn Leu Ile Ile His Gly Asn Thr Leu Trp Ser Ile Glu Asn Ile Asp Asn Leu Ile Ile His Gly Asn Thr Leu Trp Ser Ile
85 90 95 85 90 95
Asp Val Val Asp Ile Ile Lys Glu Val Asn Leu Leu Gly Lys Asn Ile Asp Val Val Asp Ile Ile Lys Glu Val Asn Leu Leu Gly Lys Asn Ile
100 105 110 100 105 110
Pro Ile Glu Leu His Phe Tyr Asp Asp Gly Ser Ala Glu Tyr Val Arg Pro Ile Glu Leu His Phe Tyr Asp Asp Gly Ser Ala Glu Tyr Val Arg
115 120 125 115 120 125
Ile Tyr Glu Phe Ser Lys Leu Pro Glu Ser Glu Gln Lys Tyr Lys Thr Ile Tyr Glu Phe Ser Lys Leu Pro Glu Ser Glu Gln Lys Tyr Lys Thr
130 135 140 130 135 140
Ser Leu Ser Lys Asn Asn Ile Lys Phe Ser Ile Asp Gly Thr Asp Ser Ser Leu Ser Lys Asn Asn Ile Lys Phe Ser Ile Asp Gly Thr Asp Ser
145 150 155 160 145 150 155 160
Phe Lys Asn Thr Ile Glu Asn Ile Tyr Gly Phe Ser Gln Leu Tyr Pro Phe Lys Asn Thr Ile Glu Asn Ile Tyr Gly Phe Ser Gln Leu Tyr Pro
165 170 175 165 170 175
Thr Thr Tyr His Met Leu Arg Ala Asp Ile Phe Asp Thr Thr Leu Lys Thr Thr Tyr His Met Leu Arg Ala Asp Ile Phe Asp Thr Thr Leu Lys
180 185 190 180 185 190
Ile Asn Pro Leu Arg Glu Leu Leu Ser Asn Asn Ile Lys Gln Met Lys Ile Asn Pro Leu Arg Glu Leu Leu Ser Asn Asn Ile Lys Gln Met Lys
195 200 205 195 200 205
Trp Asp Tyr Phe Lys Asp Phe Asn Tyr Lys Gln Lys Asp Ile Phe Tyr Trp Asp Tyr Phe Lys Asp Phe Asn Tyr Lys Gln Lys Asp Ile Phe Tyr
210 215 220 210 215 220
Ser Leu Thr Asn Phe Asn Pro Lys Glu Ile Gln Glu Asp Phe Asn Lys Ser Leu Thr Asn Phe Asn Pro Lys Glu Ile Gln Glu Asp Phe Asn Lys
225 230 235 240 225 230 235 240
Asn Ser Asn Lys Asn Phe Ile Phe Ile Gly Ser Asn Ser Ala Thr Ala Asn Ser Asn Lys Asn Phe Ile Phe Ile Gly Ser Asn Ser Ala Thr Ala
245 250 255 245 250 255
Thr Ala Glu Glu Gln Ile Asn Ile Ile Ser Glu Ala Lys Lys Glu Asn Thr Ala Glu Glu Gln Ile Asn Ile Ile Ser Glu Ala Lys Lys Glu Asn
260 265 270 260 265 270
Ser Ser Ile Ile Thr Asn Ser Ile Ser Asp Tyr Asp Leu Phe Phe Lys Ser Ser Ile Ile Thr Asn Ser Ile Ser Asp Tyr Asp Leu Phe Phe Lys
275 280 285 275 280 285
Gly His Pro Ser Ala Thr Phe Asn Glu Gln Ile Ile Asn Ala His Asp Gly His Pro Ser Ala Thr Phe Asn Glu Gln Ile Ile Asn Ala His Asp
290 295 300 290 295 300
Met Ile Glu Ile Asn Asn Lys Ile Pro Phe Glu Ala Leu Ile Met Thr Met Ile Glu Ile Asn Asn Lys Ile Pro Phe Glu Ala Leu Ile Met Thr
305 310 315 320 305 310 315 320
Gly Ile Leu Pro Asp Ala Val Gly Gly Met Gly Ser Ser Val Phe Phe Gly Ile Leu Pro Asp Ala Val Gly Gly Met Gly Ser Ser Val Phe Phe
325 330 335 325 330 335
Ser Ile Pro Lys Glu Val Lys Asn Lys Phe Val Phe Tyr Lys Ser Gly Ser Ile Pro Lys Glu Val Lys Asn Lys Phe Val Phe Tyr Lys Ser Gly
340 345 350 340 345 350
Thr Asp Ile Glu Asn Asn Ser Leu Ile Gln Val Met Leu Lys Leu Asn Thr Asp Ile Glu Asn Asn Ser Leu Ile Gln Val Met Leu Lys Leu Asn
355 360 365 355 360 365
Leu Ile Asn Arg Asp Asn Ile Lys Leu Ile Ser Asp Ile Leu Ile Asn Arg Asp Asn Ile Lys Leu Ile Ser Asp Ile
370 375 380 370 375 380
<210> 36<210> 36
<211> 390<211> 390
<212> Белок<212> Protein
<213> Photobacterium sp.<213> Photobacterium sp.
<400> 36<400> 36
Met Gly Cys Asn Ser Asp Ser Asn His Asn Asn Ser Asp Gly Asn Ile Met Gly Cys Asn Ser Asp Ser Asn His Asn Asn Ser Asp Gly Asn Ile
1 5 10 15 1 5 10 15
Thr Lys Asn Lys Thr Ile Glu Val Tyr Val Asp Arg Ala Thr Leu Pro Thr Lys Asn Lys Thr Ile Glu Val Tyr Val Asp Arg Ala Thr Leu Pro
20 25 30 20 25 30
Thr Ile Gln Gln Met Thr Gln Ile Ile Asn Glu Asn Ser Asn Asn Lys Thr Ile Gln Gln Met Thr Gln Ile Ile Asn Glu Asn Ser Asn Asn Lys
35 40 45 35 40 45
Lys Leu Ile Ser Trp Ser Arg Tyr Pro Ile Asn Asp Glu Glu Leu Leu Lys Leu Ile Ser Trp Ser Arg Tyr Pro Ile Asn Asp Glu Glu Leu Leu
50 55 60 50 55 60
Glu Ser Ile Asn Gly Ser Phe Phe Lys Asn Asn Ser Glu Leu Ile Lys Glu Ser Ile Asn Gly Ser Phe Phe Lys Asn Asn Ser Glu Leu Ile Lys
65 70 75 80 65 70 75 80
Ser Leu Asp Ser Met Ile Leu Thr Asn Asp Ile Lys Lys Val Ile Ile Ser Leu Asp Ser Met Ile Leu Thr Asn Asp Ile Lys Lys Val Ile Ile
85 90 95 85 90 95
Asn Gly Asn Thr Leu Trp Ala Ala Asp Val Val Asn Ile Ile Lys Ser Asn Gly Asn Thr Leu Trp Ala Ala Asp Val Val Asn Ile Ile Lys Ser
100 105 110 100 105 110
Ile Glu Ala Phe Gly Lys Lys Thr Glu Ile Glu Leu Asn Phe Tyr Asp Ile Glu Ala Phe Gly Lys Lys Thr Glu Ile Glu Leu Asn Phe Tyr Asp
115 120 125 115 120 125
Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr Asp Phe Ser Lys Leu Pro Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr Asp Phe Ser Lys Leu Pro
130 135 140 130 135 140
Glu Ser Glu Gln Glu Tyr Lys Ile Ser Leu Ser Lys Asp Asn Ile Leu Glu Ser Glu Gln Glu Tyr Lys Ile Ser Leu Ser Lys Asp Asn Ile Leu
145 150 155 160 145 150 155 160
Ser Ser Ile Asn Gly Thr Gln Pro Phe Glu Asn Val Val Glu Asn Ile Ser Ser Ile Asn Gly Thr Gln Pro Phe Glu Asn Val Val Glu Asn Ile
165 170 175 165 170 175
Tyr Gly Phe Ser Gln Leu Tyr Pro Thr Thr Tyr His Met Leu Arg Ala Tyr Gly Phe Ser Gln Leu Tyr Pro Thr Thr Tyr His Met Leu Arg Ala
180 185 190 180 185 190
Asp Ile Phe Glu Thr Asn Leu Pro Leu Arg Ser Leu Lys Gly Val Leu Asp Ile Phe Glu Thr Asn Leu Pro Leu Arg Ser Leu Lys Gly Val Leu
195 200 205 195 200 205
Ser Asn Asn Ile Lys Gln Met Lys Trp Asp Tyr Phe Lys Thr Phe Asn Ser Asn Asn Ile Lys Gln Met Lys Trp Asp Tyr Phe Lys Thr Phe Asn
210 215 220 210 215 220
Ser Gln Gln Lys Asp Lys Phe Tyr Asn Phe Thr Gly Phe Asn Pro Asp Ser Gln Gln Lys Asp Lys Phe Tyr Asn Phe Thr Gly Phe Asn Pro Asp
225 230 235 240 225 230 235 240
Glu Ile Met Glu Gln Tyr Lys Ala Ser Pro Asn Lys Asn Phe Ile Phe Glu Ile Met Glu Gln Tyr Lys Ala Ser Pro Asn Lys Asn Phe Ile Phe
245 250 255 245 250 255
Val Gly Thr Asn Ser Gly Thr Ala Thr Ala Glu Gln Gln Ile Asp Ile Val Gly Thr Asn Ser Gly Thr Ala Thr Ala Glu Gln Gln Ile Asp Ile
260 265 270 260 265 270
Leu Thr Glu Ala Lys Asn Pro Asn Ser Pro Ile Ile Thr Lys Ser Ile Leu Thr Glu Ala Lys Asn Pro Asn Ser Pro Ile Ile Thr Lys Ser Ile
275 280 285 275 280 285
Gln Gly Phe Asp Leu Phe Phe Lys Gly His Pro Ser Ala Thr Tyr Asn Gln Gly Phe Asp Leu Phe Phe Lys Gly His Pro Ser Ala Thr Tyr Asn
290 295 300 290 295 300
Lys Gln Ile Ile Asp Ala His Asn Met Ile Glu Ile Tyr Asn Lys Ile Lys Gln Ile Ile Asp Ala His Asn Met Ile Glu Ile Tyr Asn Lys Ile
305 310 315 320 305 310 315 320
Pro Phe Glu Ala Leu Ile Met Thr Asp Ala Leu Pro Asp Ala Val Gly Pro Phe Glu Ala Leu Ile Met Thr Asp Ala Leu Pro Asp Ala Val Gly
325 330 335 325 330 335
Gly Met Gly Ser Ser Val Phe Phe Ser Leu Pro Asn Thr Val Glu Asn Gly Met Gly Ser Ser Val Phe Phe Ser Leu Pro Asn Thr Val Glu Asn
340 345 350 340 345 350
Lys Phe Ile Phe Tyr Lys Ser Asp Thr Asp Ile Glu Asn Asn Ala Leu Lys Phe Ile Phe Tyr Lys Ser Asp Thr Asp Ile Glu Asn Asn Ala Leu
355 360 365 355 360 365
Ile Gln Val Met Ile Glu Leu Asn Ile Val Asn Arg Asn Asp Val Lys Ile Gln Val Met Ile Glu Leu Asn Ile Val Asn Arg Asn Asp Val Lys
370 375 380 370 375 380
Leu Ile Ser Asp Leu Gln Leu Ile Ser Asp Leu Gln
385 390 385 390
<210> 37<210> 37
<211> 388<211> 388
<212> Белок<212> Protein
<213> Pasteurella multocida<213> Pasteurella multocida
<400> 37<400> 37
Met Lys Thr Ile Thr Leu Tyr Leu Asp Pro Ala Ser Leu Pro Ala Leu Met Lys Thr Ile Thr Leu Tyr Leu Asp Pro Ala Ser Leu Pro Ala Leu
1 5 10 15 1 5 10 15
Asn Gln Leu Met Asp Phe Thr Gln Asn Asn Glu Asp Lys Thr His Pro Asn Gln Leu Met Asp Phe Thr Gln Asn Asn Glu Asp Lys Thr His Pro
20 25 30 20 25 30
Arg Ile Phe Gly Leu Ser Arg Phe Lys Ile Pro Asp Asn Ile Ile Thr Arg Ile Phe Gly Leu Ser Arg Phe Lys Ile Pro Asp Asn Ile Ile Thr
35 40 45 35 40 45
Gln Tyr Gln Asn Ile His Phe Val Glu Leu Lys Asp Asn Arg Pro Thr Gln Tyr Gln Asn Ile His Phe Val Glu Leu Lys Asp Asn Arg Pro Thr
50 55 60 50 55 60
Glu Ala Leu Phe Thr Ile Leu Asp Gln Tyr Pro Gly Asn Ile Glu Leu Glu Ala Leu Phe Thr Ile Leu Asp Gln Tyr Pro Gly Asn Ile Glu Leu
65 70 75 80 65 70 75 80
Asp Ile His Leu Asn Ile Ala His Ser Val Gln Leu Ile Arg Pro Ile Asp Ile His Leu Asn Ile Ala His Ser Val Gln Leu Ile Arg Pro Ile
85 90 95 85 90 95
Leu Ala Tyr Arg Phe Lys His Leu Asp Arg Val Ser Ile Gln Arg Leu Leu Ala Tyr Arg Phe Lys His Leu Asp Arg Val Ser Ile Gln Arg Leu
100 105 110 100 105 110
Asn Leu Tyr Asp Asp Gly Ser Met Glu Tyr Val Asp Leu Glu Lys Glu Asn Leu Tyr Asp Asp Gly Ser Met Glu Tyr Val Asp Leu Glu Lys Glu
115 120 125 115 120 125
Glu Asn Lys Asp Ile Ser Ala Glu Ile Lys Gln Ala Glu Lys Gln Leu Glu Asn Lys Asp Ile Ser Ala Glu Ile Lys Gln Ala Glu Lys Gln Leu
130 135 140 130 135 140
Ser His Tyr Leu Leu Thr Gly Lys Ile Lys Phe Asp Asn Pro Thr Ile Ser His Tyr Leu Leu Thr Gly Lys Ile Lys Phe Asp Asn Pro Thr Ile
145 150 155 160 145 150 155 160
Ala Arg Tyr Val Trp Gln Ser Ala Phe Pro Val Lys Tyr His Phe Leu Ala Arg Tyr Val Trp Gln Ser Ala Phe Pro Val Lys Tyr His Phe Leu
165 170 175 165 170 175
Ser Thr Asp Tyr Phe Glu Lys Ala Glu Phe Leu Gln Pro Leu Lys Glu Ser Thr Asp Tyr Phe Glu Lys Ala Glu Phe Leu Gln Pro Leu Lys Glu
180 185 190 180 185 190
Tyr Leu Ala Glu Asn Tyr Gln Lys Met Asp Trp Thr Ala Tyr Gln Gln Tyr Leu Ala Glu Asn Tyr Gln Lys Met Asp Trp Thr Ala Tyr Gln Gln
195 200 205 195 200 205
Leu Thr Pro Glu Gln Gln Ala Phe Tyr Leu Thr Leu Val Gly Phe Asn Leu Thr Pro Glu Gln Gln Ala Phe Tyr Leu Thr Leu Val Gly Phe Asn
210 215 220 210 215 220
Asp Glu Val Lys Gln Ser Leu Glu Val Gln Gln Ala Lys Phe Ile Phe Asp Glu Val Lys Gln Ser Leu Glu Val Gln Gln Ala Lys Phe Ile Phe
225 230 235 240 225 230 235 240
Thr Gly Thr Thr Thr Trp Glu Gly Asn Thr Asp Val Arg Glu Tyr Tyr Thr Gly Thr Thr Thr Trp Glu Gly Asn Thr Asp Val Arg Glu Tyr Tyr
245 250 255 245 250 255
Ala Gln Gln Gln Leu Asn Leu Leu Asn His Phe Thr Gln Ala Gly Gly Ala Gln Gln Gln Leu Asn Leu Leu Asn His Phe Thr Gln Ala Gly Gly
260 265 270 260 265 270
Asp Leu Phe Ile Gly Asp His Tyr Lys Ile Tyr Phe Lys Gly His Pro Asp Leu Phe Ile Gly Asp His Tyr Lys Ile Tyr Phe Lys Gly His Pro
275 280 285 275 280 285
Arg Gly Gly Glu Ile Asn Asp Tyr Ile Leu Asn Asn Ala Lys Asn Ile Arg Gly Gly Glu Ile Asn Asp Tyr Ile Leu Asn Asn Ala Lys Asn Ile
290 295 300 290 295 300
Thr Asn Ile Pro Ala Asn Ile Ser Phe Glu Val Leu Met Met Thr Gly Thr Asn Ile Pro Ala Asn Ile Ser Phe Glu Val Leu Met Met Thr Gly
305 310 315 320 305 310 315 320
Leu Leu Pro Asp Lys Val Gly Gly Val Ala Ser Ser Leu Tyr Phe Ser Leu Leu Pro Asp Lys Val Gly Gly Val Ala Ser Ser Leu Tyr Phe Ser
325 330 335 325 330 335
Leu Pro Lys Glu Lys Ile Ser His Ile Ile Phe Thr Ser Asn Lys Gln Leu Pro Lys Glu Lys Ile Ser His Ile Ile Phe Thr Ser Asn Lys Gln
340 345 350 340 345 350
Val Lys Ser Lys Glu Asp Ala Leu Asn Asn Pro Tyr Val Lys Val Met Val Lys Ser Lys Glu Asp Ala Leu Asn Asn Pro Tyr Val Lys Val Met
355 360 365 355 360 365
Arg Arg Leu Gly Ile Ile Asp Glu Ser Gln Val Ile Phe Trp Asp Ser Arg Arg Leu Gly Ile Ile Asp Glu Ser Gln Val Ile Phe Trp Asp Ser
370 375 380 370 375 380
Leu Lys Gln Leu Leu Lys Gln Leu
385 385
<210> 38<210> 38
<211> 371<211> 371
<212> Белок<212> Protein
<213> Neisseria meningitidis<213> Neisseria meningitidis
<400> 38<400> 38
Met Gly Leu Lys Lys Ala Cys Leu Thr Val Leu Cys Leu Ile Val Phe Met Gly Leu Lys Lys Ala Cys Leu Thr Val Leu Cys Leu Ile Val Phe
1 5 10 15 1 5 10 15
Cys Phe Gly Ile Phe Tyr Thr Phe Asp Arg Val Asn Gln Gly Glu Arg Cys Phe Gly Ile Phe Tyr Thr Phe Asp Arg Val Asn Gln Gly Glu Arg
20 25 30 20 25 30
Asn Ala Val Ser Leu Leu Lys Glu Lys Leu Phe Asn Glu Glu Gly Glu Asn Ala Val Ser Leu Leu Lys Glu Lys Leu Phe Asn Glu Glu Gly Glu
35 40 45 35 40 45
Pro Val Asn Leu Ile Phe Cys Tyr Thr Ile Leu Gln Met Lys Val Ala Pro Val Asn Leu Ile Phe Cys Tyr Thr Ile Leu Gln Met Lys Val Ala
50 55 60 50 55 60
Glu Arg Ile Met Ala Gln His Pro Gly Glu Arg Phe Tyr Val Val Leu Glu Arg Ile Met Ala Gln His Pro Gly Glu Arg Phe Tyr Val Val Leu
65 70 75 80 65 70 75 80
Met Ser Glu Asn Arg Asn Glu Lys Tyr Asp Tyr Tyr Phe Asn Gln Ile Met Ser Glu Asn Arg Asn Glu Lys Tyr Asp Tyr Tyr Phe Asn Gln Ile
85 90 95 85 90 95
Lys Asp Lys Ala Glu Arg Ala Tyr Phe Phe His Leu Pro Tyr Gly Leu Lys Asp Lys Ala Glu Arg Ala Tyr Phe Phe His Leu Pro Tyr Gly Leu
100 105 110 100 105 110
Asn Lys Ser Phe Asn Phe Ile Pro Thr Met Ala Glu Leu Lys Val Lys Asn Lys Ser Phe Asn Phe Ile Pro Thr Met Ala Glu Leu Lys Val Lys
115 120 125 115 120 125
Ser Met Leu Leu Pro Lys Val Lys Arg Ile Tyr Leu Ala Ser Leu Glu Ser Met Leu Leu Pro Lys Val Lys Arg Ile Tyr Leu Ala Ser Leu Glu
130 135 140 130 135 140
Lys Val Ser Ile Ala Ala Phe Leu Ser Thr Tyr Pro Asp Ala Glu Ile Lys Val Ser Ile Ala Ala Phe Leu Ser Thr Tyr Pro Asp Ala Glu Ile
145 150 155 160 145 150 155 160
Lys Thr Phe Asp Asp Gly Thr Gly Asn Leu Ile Gln Ser Ser Ser Tyr Lys Thr Phe Asp Asp Gly Thr Gly Asn Leu Ile Gln Ser Ser Ser Tyr
165 170 175 165 170 175
Leu Gly Asp Glu Phe Ser Val Asn Gly Thr Ile Lys Arg Asn Phe Ala Leu Gly Asp Glu Phe Ser Val Asn Gly Thr Ile Lys Arg Asn Phe Ala
180 185 190 180 185 190
Arg Met Met Ile Gly Asp Trp Ser Ile Ala Lys Thr Arg Asn Ala Ser Arg Met Met Ile Gly Asp Trp Ser Ile Ala Lys Thr Arg Asn Ala Ser
195 200 205 195 200 205
Asp Glu His Tyr Thr Ile Phe Lys Gly Leu Lys Asn Ile Met Asp Asp Asp Glu His Tyr Thr Ile Phe Lys Gly Leu Lys Asn Ile Met Asp Asp
210 215 220 210 215 220
Gly Arg Arg Lys Met Thr Tyr Leu Pro Leu Phe Asp Ala Ser Glu Leu Gly Arg Arg Lys Met Thr Tyr Leu Pro Leu Phe Asp Ala Ser Glu Leu
225 230 235 240 225 230 235 240
Lys Thr Gly Asp Glu Thr Gly Gly Thr Val Arg Ile Leu Leu Gly Ser Lys Thr Gly Asp Glu Thr Gly Gly Thr Val Arg Ile Leu Leu Gly Ser
245 250 255 245 250 255
Pro Asp Lys Glu Met Lys Glu Ile Ser Glu Lys Ala Ala Lys Asn Phe Pro Asp Lys Glu Met Lys Glu Ile Ser Glu Lys Ala Ala Lys Asn Phe
260 265 270 260 265 270
Lys Ile Gln Tyr Val Ala Pro His Pro Arg Gln Thr Tyr Gly Leu Ser Lys Ile Gln Tyr Val Ala Pro His Pro Arg Gln Thr Tyr Gly Leu Ser
275 280 285 275 280 285
Gly Val Thr Thr Leu Asn Ser Pro Tyr Val Ile Glu Asp Tyr Ile Leu Gly Val Thr Thr Leu Asn Ser Pro Tyr Val Ile Glu Asp Tyr Ile Leu
290 295 300 290 295 300
Arg Glu Ile Lys Lys Asn Pro His Thr Arg Tyr Glu Ile Tyr Thr Phe Arg Glu Ile Lys Lys Asn Pro His Thr Arg Tyr Glu Ile Tyr Thr Phe
305 310 315 320 305 310 315 320
Phe Ser Gly Ala Ala Leu Thr Met Lys Asp Phe Pro Asn Val His Val Phe Ser Gly Ala Ala Leu Thr Met Lys Asp Phe Pro Asn Val His Val
325 330 335 325 330 335
Tyr Ala Leu Lys Pro Ala Ser Leu Pro Glu Asp Tyr Trp Leu Lys Pro Tyr Ala Leu Lys Pro Ala Ser Leu Pro Glu Asp Tyr Trp Leu Lys Pro
340 345 350 340 345 350
Val Tyr Ala Leu Phe Thr Gln Ser Gly Ile Pro Ile Leu Thr Phe Asp Val Tyr Ala Leu Phe Thr Gln Ser Gly Ile Pro Ile Leu Thr Phe Asp
355 360 365 355 360 365
Asp Lys Asn Asp Lys Asn
370 370
<210> 39<210> 39
<211> 283<211> 283
<212> Белок<212> Protein
<213> Pasteurella multocida<213> Pasteurella multocida
<400> 39<400> 39
Met Asp Lys Phe Ala Glu His Glu Ile Pro Lys Ala Val Ile Val Ala Met Asp Lys Phe Ala Glu His Glu Ile Pro Lys Ala Val Ile Val Ala
1 5 10 15 1 5 10 15
Gly Asn Gly Glu Ser Leu Ser Gln Ile Asp Tyr Arg Leu Leu Pro Lys Gly Asn Gly Glu Ser Leu Ser Gln Ile Asp Tyr Arg Leu Leu Pro Lys
20 25 30 20 25 30
Asn Tyr Asp Val Phe Arg Cys Asn Gln Phe Tyr Phe Glu Glu Arg Tyr Asn Tyr Asp Val Phe Arg Cys Asn Gln Phe Tyr Phe Glu Glu Arg Tyr
35 40 45 35 40 45
Phe Leu Gly Asn Lys Ile Lys Ala Val Phe Phe Thr Pro Gly Val Phe Phe Leu Gly Asn Lys Ile Lys Ala Val Phe Phe Thr Pro Gly Val Phe
50 55 60 50 55 60
Leu Glu Gln Tyr Tyr Thr Leu Tyr His Leu Lys Arg Asn Asn Glu Tyr Leu Glu Gln Tyr Tyr Thr Leu Tyr His Leu Lys Arg Asn Asn Glu Tyr
65 70 75 80 65 70 75 80
Phe Val Asp Asn Val Ile Leu Ser Ser Phe Asn His Pro Thr Val Asp Phe Val Asp Asn Val Ile Leu Ser Ser Phe Asn His Pro Thr Val Asp
85 90 95 85 90 95
Leu Glu Lys Ser Gln Lys Ile Gln Ala Leu Phe Ile Asp Val Ile Asn Leu Glu Lys Ser Gln Lys Ile Gln Ala Leu Phe Ile Asp Val Ile Asn
100 105 110 100 105 110
Gly Tyr Glu Lys Tyr Leu Ser Lys Leu Thr Ala Phe Asp Val Tyr Leu Gly Tyr Glu Lys Tyr Leu Ser Lys Leu Thr Ala Phe Asp Val Tyr Leu
115 120 125 115 120 125
Arg Tyr Lys Glu Leu Tyr Glu Asn Gln Arg Ile Thr Ser Gly Val Tyr Arg Tyr Lys Glu Leu Tyr Glu Asn Gln Arg Ile Thr Ser Gly Val Tyr
130 135 140 130 135 140
Met Cys Ala Val Ala Ile Ala Met Gly Tyr Thr Asp Ile Tyr Leu Thr Met Cys Ala Val Ala Ile Ala Met Gly Tyr Thr Asp Ile Tyr Leu Thr
145 150 155 160 145 150 155 160
Gly Ile Asp Phe Tyr Gln Ala Ser Glu Glu Asn Tyr Ala Phe Asp Asn Gly Ile Asp Phe Tyr Gln Ala Ser Glu Glu Asn Tyr Ala Phe Asp Asn
165 170 175 165 170 175
Lys Lys Pro Asn Ile Ile Arg Leu Leu Pro Asp Phe Arg Lys Glu Lys Lys Lys Pro Asn Ile Ile Arg Leu Leu Pro Asp Phe Arg Lys Glu Lys
180 185 190 180 185 190
Thr Leu Phe Ser Tyr His Ser Lys Asp Ile Asp Leu Glu Ala Leu Ser Thr Leu Phe Ser Tyr His Ser Lys Asp Ile Asp Leu Glu Ala Leu Ser
195 200 205 195 200 205
Phe Leu Gln Gln His Tyr His Val Asn Phe Tyr Ser Ile Ser Pro Met Phe Leu Gln Gln His Tyr His Val Asn Phe Tyr Ser Ile Ser Pro Met
210 215 220 210 215 220
Ser Pro Leu Ser Lys His Phe Pro Ile Pro Thr Val Glu Asp Asp Cys Ser Pro Leu Ser Lys His Phe Pro Ile Pro Thr Val Glu Asp Asp Cys
225 230 235 240 225 230 235 240
Glu Thr Thr Phe Val Ala Pro Leu Lys Glu Asn Tyr Ile Asn Asp Ile Glu Thr Thr Phe Val Ala Pro Leu Lys Glu Asn Tyr Ile Asn Asp Ile
245 250 255 245 250 255
Leu Leu Pro Pro His Phe Val Tyr Glu Lys Leu Gly Val Asp Lys Leu Leu Leu Pro Pro His Phe Val Tyr Glu Lys Leu Gly Val Asp Lys Leu
260 265 270 260 265 270
Ala Ala Ala Leu Glu His His His His His His Ala Ala Ala Leu Glu His His His His His
275 280 275 280
<210> 40<210> 40
<211> 385<211> 385
<212> Белок<212> Protein
<213> Pasteurella dagmatis<213> Pasteurella dagmatis
<400> 40<400> 40
Met Thr Ile Tyr Leu Asp Pro Ala Ser Leu Pro Thr Leu Asn Gln Leu Met Thr Ile Tyr Leu Asp Pro Ala Ser Leu Pro Thr Leu Asn Gln Leu
1 5 10 15 1 5 10 15
Met His Phe Thr Lys Glu Ser Glu Asp Lys Glu Thr Ala Arg Ile Phe Met His Phe Thr Lys Glu Ser Glu Asp Lys Glu Thr Ala Arg Ile Phe
20 25 30 20 25 30
Gly Phe Ser Arg Phe Lys Leu Pro Glu Lys Ile Thr Glu Gln Tyr Asn Gly Phe Ser Arg Phe Lys Leu Pro Glu Lys Ile Thr Glu Gln Tyr Asn
35 40 45 35 40 45
Asn Ile His Phe Val Glu Ile Lys Asn Asn Arg Pro Thr Glu Asp Ile Asn Ile His Phe Val Glu Ile Lys Asn Asn Arg Pro Thr Glu Asp Ile
50 55 60 50 55 60
Phe Thr Ile Leu Asp Gln Tyr Pro Glu Lys Leu Glu Leu Asp Leu His Phe Thr Ile Leu Asp Gln Tyr Pro Glu Lys Leu Glu Leu Asp Leu His
65 70 75 80 65 70 75 80
Leu Asn Ile Ala His Ser Ile Gln Leu Phe His Pro Ile Leu Gln Tyr Leu Asn Ile Ala His Ser Ile Gln Leu Phe His Pro Ile Leu Gln Tyr
85 90 95 85 90 95
Arg Phe Lys His Pro Asp Arg Ile Ser Ile Lys Ser Leu Asn Leu Tyr Arg Phe Lys His Pro Asp Arg Ile Ser Ile Lys Ser Leu Asn Leu Tyr
100 105 110 100 105 110
Asp Asp Gly Thr Met Glu Tyr Val Asp Leu Glu Lys Glu Glu Asn Lys Asp Asp Gly Thr Met Glu Tyr Val Asp Leu Glu Lys Glu Glu Asn Lys
115 120 125 115 120 125
Asp Ile Lys Ser Ala Ile Lys Lys Ala Glu Lys Gln Leu Ser Asp Tyr Asp Ile Lys Ser Ala Ile Lys Lys Ala Glu Lys Gln Leu Ser Asp Tyr
130 135 140 130 135 140
Leu Leu Thr Gly Lys Ile Asn Phe Asp Asn Pro Thr Leu Ala Arg Tyr Leu Leu Thr Gly Lys Ile Asn Phe Asp Asn Pro Thr Leu Ala Arg Tyr
145 150 155 160 145 150 155 160
Val Trp Gln Ser Gln Tyr Pro Val Lys Tyr His Phe Leu Ser Thr Glu Val Trp Gln Ser Gln Tyr Pro Val Lys Tyr His Phe Leu Ser Thr Glu
165 170 175 165 170 175
Tyr Phe Glu Lys Ala Glu Phe Leu Gln Pro Leu Lys Thr Tyr Leu Ala Tyr Phe Glu Lys Ala Glu Phe Leu Gln Pro Leu Lys Thr Tyr Leu Ala
180 185 190 180 185 190
Gly Lys Tyr Gln Lys Met Asp Trp Ser Ala Tyr Glu Lys Leu Ser Pro Gly Lys Tyr Gln Lys Met Asp Trp Ser Ala Tyr Glu Lys Leu Ser Pro
195 200 205 195 200 205
Glu Gln Gln Thr Phe Tyr Leu Lys Leu Val Gly Phe Ser Asp Glu Thr Glu Gln Gln Thr Phe Tyr Leu Lys Leu Val Gly Phe Ser Asp Glu Thr
210 215 220 210 215 220
Lys Gln Leu Phe His Thr Glu Gln Thr Lys Phe Ile Phe Thr Gly Thr Lys Gln Leu Phe His Thr Glu Gln Thr Lys Phe Ile Phe Thr Gly Thr
225 230 235 240 225 230 235 240
Thr Thr Trp Glu Gly Asn Thr Asp Ile Arg Glu Tyr Tyr Ala Lys Gln Thr Thr Trp Glu Gly Asn Thr Asp Ile Arg Glu Tyr Tyr Ala Lys Gln
245 250 255 245 250 255
Gln Leu Asn Leu Leu Lys His Phe Thr His Ser Glu Gly Asp Leu Phe Gln Leu Asn Leu Leu Lys His Phe Thr His Ser Glu Gly Asp Leu Phe
260 265 270 260 265 270
Ile Gly Asp Gln Tyr Lys Ile Tyr Phe Lys Gly His Pro Arg Gly Gly Ile Gly Asp Gln Tyr Lys Ile Tyr Phe Lys Gly His Pro Arg Gly Gly
275 280 285 275 280 285
Asp Ile Asn Asp Tyr Ile Leu Lys His Ala Lys Asp Ile Thr Asn Ile Asp Ile Asn Asp Tyr Ile Leu Lys His Ala Lys Asp Ile Thr Asn Ile
290 295 300 290 295 300
Pro Ala Asn Ile Ser Phe Glu Ile Leu Met Met Thr Gly Leu Leu Pro Pro Ala Asn Ile Ser Phe Glu Ile Leu Met Met Thr Gly Leu Leu Pro
305 310 315 320 305 310 315 320
Asp Lys Val Gly Gly Val Ala Ser Ser Leu Tyr Phe Ser Leu Pro Lys Asp Lys Val Gly Gly Val Ala Ser Ser Leu Tyr Phe Ser Leu Pro Lys
325 330 335 325 330 335
Glu Lys Ile Ser His Ile Ile Phe Thr Ser Asn Lys Lys Ile Lys Asn Glu Lys Ile Ser His Ile Ile Phe Thr Ser Asn Lys Lys Ile Lys Asn
340 345 350 340 345 350
Lys Glu Asp Ala Leu Asn Asp Pro Tyr Val Arg Val Met Leu Arg Leu Lys Glu Asp Ala Leu Asn Asp Pro Tyr Val Arg Val Met Leu Arg Leu
355 360 365 355 360 365
Gly Met Ile Asp Lys Ser Gln Ile Ile Phe Trp Asp Ser Leu Lys Gln Gly Met Ile Asp Lys Ser Gln Ile Ile Phe Trp Asp Ser Leu Lys Gln
370 375 380 370 375 380
Leu Leu
385 385
<210> 41<210> 41
<211> 390<211> 390
<212> Белок<212> Protein
<213> Photobacterium phosphoreum<213> Photobacterium phosphoreum
<400> 41<400> 41
Met Gly Cys Asn Ser Asp Ser Lys His Asn Asn Ser Asp Gly Asn Ile Met Gly Cys Asn Ser Asp Ser Lys His Asn Asn Ser Asp Gly Asn Ile
1 5 10 15 1 5 10 15
Thr Lys Asn Lys Thr Ile Glu Val Tyr Val Asp Arg Ala Thr Leu Pro Thr Lys Asn Lys Thr Ile Glu Val Tyr Val Asp Arg Ala Thr Leu Pro
20 25 30 20 25 30
Thr Ile Gln Gln Met Thr Gln Ile Ile Asn Glu Asn Ser Asn Asn Lys Thr Ile Gln Gln Met Thr Gln Ile Ile Asn Glu Asn Ser Asn Asn Lys
35 40 45 35 40 45
Lys Leu Ile Ser Trp Ser Arg Tyr Pro Ile Asn Asp Glu Thr Leu Leu Lys Leu Ile Ser Trp Ser Arg Tyr Pro Ile Asn Asp Glu Thr Leu Leu
50 55 60 50 55 60
Glu Ser Ile Asn Gly Ser Phe Phe Lys Asn Arg Pro Glu Leu Ile Lys Glu Ser Ile Asn Gly Ser Phe Phe Lys Asn Arg Pro Glu Leu Ile Lys
65 70 75 80 65 70 75 80
Ser Leu Asp Ser Met Ile Leu Thr Asn Glu Ile Lys Lys Val Ile Ile Ser Leu Asp Ser Met Ile Leu Thr Asn Glu Ile Lys Lys Val Ile Ile
85 90 95 85 90 95
Asn Gly Asn Thr Leu Trp Ala Val Asp Val Val Asn Ile Ile Lys Ser Asn Gly Asn Thr Leu Trp Ala Val Asp Val Val Asn Ile Ile Lys Ser
100 105 110 100 105 110
Ile Glu Ala Leu Gly Lys Lys Thr Glu Ile Glu Leu Asn Phe Tyr Asp Ile Glu Ala Leu Gly Lys Lys Thr Glu Ile Glu Leu Asn Phe Tyr Asp
115 120 125 115 120 125
Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr Asp Phe Ser Arg Leu Pro Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr Asp Phe Ser Arg Leu Pro
130 135 140 130 135 140
Glu Ser Glu Gln Glu Tyr Lys Ile Ser Leu Ser Lys Asp Asn Ile Gln Glu Ser Glu Gln Glu Tyr Lys Ile Ser Leu Ser Lys Asp Asn Ile Gln
145 150 155 160 145 150 155 160
Ser Ser Ile Asn Gly Thr Gln Pro Phe Asp Asn Ser Ile Glu Asn Ile Ser Ser Ile Asn Gly Thr Gln Pro Phe Asp Asn Ser Ile Glu Asn Ile
165 170 175 165 170 175
Tyr Gly Phe Ser Gln Leu Tyr Pro Thr Thr Tyr His Met Leu Arg Ala Tyr Gly Phe Ser Gln Leu Tyr Pro Thr Thr Tyr His Met Leu Arg Ala
180 185 190 180 185 190
Asp Ile Phe Glu Thr Asn Leu Pro Leu Thr Ser Leu Lys Arg Val Ile Asp Ile Phe Glu Thr Asn Leu Pro Leu Thr Ser Leu Lys Arg Val Ile
195 200 205 195 200 205
Ser Asn Asn Ile Lys Gln Met Lys Trp Asp Tyr Phe Thr Thr Phe Asn Ser Asn Asn Ile Lys Gln Met Lys Trp Asp Tyr Phe Thr Thr Phe Asn
210 215 220 210 215 220
Ser Gln Gln Lys Asn Lys Phe Tyr Asn Phe Thr Gly Phe Asn Pro Glu Ser Gln Gln Lys Asn Lys Phe Tyr Asn Phe Thr Gly Phe Asn Pro Glu
225 230 235 240 225 230 235 240
Lys Ile Lys Glu Gln Tyr Lys Ala Ser Pro His Glu Asn Phe Ile Phe Lys Ile Lys Glu Gln Tyr Lys Ala Ser Pro His Glu Asn Phe Ile Phe
245 250 255 245 250 255
Ile Gly Thr Asn Ser Gly Thr Ala Thr Ala Glu Gln Gln Ile Asp Ile Ile Gly Thr Asn Ser Gly Thr Ala Thr Ala Glu Gln Gln Ile Asp Ile
260 265 270 260 265 270
Leu Thr Glu Ala Lys Lys Pro Asp Ser Pro Ile Ile Thr Asn Ser Ile Leu Thr Glu Ala Lys Lys Pro Asp Ser Pro Ile Ile Thr Asn Ser Ile
275 280 285 275 280 285
Gln Gly Leu Asp Leu Phe Phe Lys Gly His Pro Ser Ala Thr Tyr Asn Gln Gly Leu Asp Leu Phe Phe Lys Gly His Pro Ser Ala Thr Tyr Asn
290 295 300 290 295 300
Gln Gln Ile Ile Asp Ala His Asn Met Ile Glu Ile Tyr Asn Lys Ile Gln Gln Ile Ile Asp Ala His Asn Met Ile Glu Ile Tyr Asn Lys Ile
305 310 315 320 305 310 315 320
Pro Phe Glu Ala Leu Ile Met Thr Asp Ala Leu Pro Asp Ala Val Gly Pro Phe Glu Ala Leu Ile Met Thr Asp Ala Leu Pro Asp Ala Val Gly
325 330 335 325 330 335
Gly Met Gly Ser Ser Val Phe Phe Ser Leu Pro Asn Thr Val Glu Asn Gly Met Gly Ser Ser Val Phe Phe Ser Leu Pro Asn Thr Val Glu Asn
340 345 350 340 345 350
Lys Phe Ile Phe Tyr Lys Ser Asp Thr Asp Ile Glu Asn Asn Ala Leu Lys Phe Ile Phe Tyr Lys Ser Asp Thr Asp Ile Glu Asn Asn Ala Leu
355 360 365 355 360 365
Ile Gln Val Met Ile Glu Leu Asn Ile Val Asn Arg Asn Asp Val Lys Ile Gln Val Met Ile Glu Leu Asn Ile Val Asn Arg Asn Asp Val Lys
370 375 380 370 375 380
Leu Ile Ser Asp Leu Gln Leu Ile Ser Asp Leu Gln
385 390 385 390
<210> 42<210> 42
<211> 417<211> 417
<212> Белок<212> Protein
<213> Avibacterium paragallinarum<213> Avibacterium paragallinarum
<400> 42<400> 42
Met Arg Lys Ile Ile Thr Phe Phe Ser Leu Phe Phe Ser Ile Ser Ala Met Arg Lys Ile Ile Thr Phe Phe Ser Leu Phe Phe Ser Ile Ser Ala
1 5 10 15 1 5 10 15
Trp Cys Gln Lys Met Glu Ile Tyr Leu Asp Tyr Ala Ser Leu Pro Ser Trp Cys Gln Lys Met Glu Ile Tyr Leu Asp Tyr Ala Ser Leu Pro Ser
20 25 30 20 25 30
Leu Asn Met Ile Leu Asn Leu Val Glu Asn Lys Asn Asn Glu Lys Val Leu Asn Met Ile Leu Asn Leu Val Glu Asn Lys Asn Asn Glu Lys Val
35 40 45 35 40 45
Glu Arg Ile Ile Gly Phe Glu Arg Phe Asp Phe Asn Lys Glu Ile Leu Glu Arg Ile Ile Gly Phe Glu Arg Phe Asp Phe Asn Lys Glu Ile Leu
50 55 60 50 55 60
Asn Ser Phe Ser Lys Glu Arg Ile Glu Phe Ser Lys Val Ser Ile Leu Asn Ser Phe Ser Lys Glu Arg Ile Glu Phe Ser Lys Val Ser Ile Leu
65 70 75 80 65 70 75 80
Asp Ile Lys Glu Phe Ser Asp Lys Leu Tyr Leu Asn Ile Glu Lys Ser Asp Ile Lys Glu Phe Ser Asp Lys Leu Tyr Leu Asn Ile Glu Lys Ser
85 90 95 85 90 95
Asp Thr Pro Val Asp Leu Ile Ile His Thr Asn Leu Asp His Ser Val Asp Thr Pro Val Asp Leu Ile Ile His Thr Asn Leu Asp His Ser Val
100 105 110 100 105 110
Arg Ser Leu Leu Ser Ile Phe Lys Thr Leu Ser Pro Leu Phe His Lys Arg Ser Leu Leu Ser Ile Phe Lys Thr Leu Ser Pro Leu Phe His Lys
115 120 125 115 120 125
Ile Asn Ile Glu Lys Leu Tyr Leu Tyr Asp Asp Gly Ser Gly Asn Tyr Ile Asn Ile Glu Lys Leu Tyr Leu Tyr Asp Asp Gly Ser Gly Asn Tyr
130 135 140 130 135 140
Val Asp Leu Tyr Gln His Arg Gln Glu Asn Ile Ser Ala Ile Leu Ile Val Asp Leu Tyr Gln His Arg Gln Glu Asn Ile Ser Ala Ile Leu Ile
145 150 155 160 145 150 155 160
Glu Ala Gln Lys Lys Leu Lys Asp Ala Leu Glu Asn Arg Glu Thr Asp Glu Ala Gln Lys Lys Leu Lys Asp Ala Leu Glu Asn Arg Glu Thr Asp
165 170 175 165 170 175
Thr Asp Lys Leu His Ser Leu Thr Arg Tyr Thr Trp His Lys Ile Phe Thr Asp Lys Leu His Ser Leu Thr Arg Tyr Thr Trp His Lys Ile Phe
180 185 190 180 185 190
Pro Thr Glu Tyr Ile Leu Leu Arg Pro Asp Tyr Leu Asp Ile Asp Glu Pro Thr Glu Tyr Ile Leu Leu Arg Pro Asp Tyr Leu Asp Ile Asp Glu
195 200 205 195 200 205
Lys Met Gln Pro Leu Lys His Phe Leu Ser Asp Thr Ile Val Ser Met Lys Met Gln Pro Leu Lys His Phe Leu Ser Asp Thr Ile Val Ser Met
210 215 220 210 215 220
Asp Leu Ser Arg Phe Ser His Phe Ser Lys Asn Gln Lys Glu Leu Phe Asp Leu Ser Arg Phe Ser His Phe Ser Lys Asn Gln Lys Glu Leu Phe
225 230 235 240 225 230 235 240
Leu Lys Ile Thr His Phe Asp Gln Asn Ile Phe Asn Glu Leu Asn Ile Leu Lys Ile Thr His Phe Asp Gln Asn Ile Phe Asn Glu Leu Asn Ile
245 250 255 245 250 255
Gly Thr Lys Asn Lys Glu Tyr Lys Thr Phe Ile Phe Thr Gly Thr Thr Gly Thr Lys Asn Lys Glu Tyr Lys Thr Phe Ile Phe Thr Gly Thr Thr
260 265 270 260 265 270
Thr Trp Glu Lys Asp Lys Lys Lys Arg Leu Asn Asn Ala Lys Leu Gln Thr Trp Glu Lys Asp Lys Lys Lys Arg Leu Asn Asn Ala Lys Leu Gln
275 280 285 275 280 285
Thr Glu Ile Leu Glu Ser Phe Ile Lys Pro Asn Gly Lys Phe Tyr Leu Thr Glu Ile Leu Glu Ser Phe Ile Lys Pro Asn Gly Lys Phe Tyr Leu
290 295 300 290 295 300
Gly Asn Asp Ile Lys Ile Phe Phe Lys Gly His Pro Lys Gly Asp Asp Gly Asn Asp Ile Lys Ile Phe Phe Lys Gly His Pro Lys Gly Asp Asp
305 310 315 320 305 310 315 320
Ile Asn Asp Tyr Ile Ile Arg Lys Thr Gly Ala Glu Lys Ile Pro Ala Ile Asn Asp Tyr Ile Ile Arg Lys Thr Gly Ala Glu Lys Ile Pro Ala
325 330 335 325 330 335
Asn Ile Pro Phe Glu Val Leu Met Met Thr Asn Ser Leu Pro Asp Tyr Asn Ile Pro Phe Glu Val Leu Met Met Thr Asn Ser Leu Pro Asp Tyr
340 345 350 340 345 350
Val Gly Gly Ile Met Ser Thr Val Tyr Phe Ser Leu Pro Pro Lys Asn Val Gly Gly Ile Met Ser Thr Val Tyr Phe Ser Leu Pro Pro Lys Asn
355 360 365 355 360 365
Ile Asp Lys Val Val Phe Leu Gly Ser Glu Lys Ile Lys Asn Glu Asn Ile Asp Lys Val Val Phe Leu Gly Ser Glu Lys Ile Lys Asn Glu Asn
370 375 380 370 375 380
Asp Ala Lys Ser Gln Thr Leu Ser Lys Leu Met Leu Met Leu Asn Val Asp Ala Lys Ser Gln Thr Leu Ser Lys Leu Met Leu Met Leu Asn Val
385 390 395 400 385 390 395 400
Ile Thr Pro Glu Gln Ile Phe Phe Glu Glu Met Pro Asn Pro Ile Asn Ile Thr Pro Glu Gln Ile Phe Phe Glu Glu Met Pro Asn Pro Ile Asn
405 410 415 405 410 415
Phe Phe
<210> 43<210> 43
<211> 430<211> 430
<212> Белок<212> Protein
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 43<400> 43
Met Thr Arg Thr Arg Met Glu Asn Glu Leu Ile Val Ser Lys Asn Met Met Thr Arg Thr Arg Met Glu Asn Glu Leu Ile Val Ser Lys Asn Met
1 5 10 15 1 5 10 15
Gln Asn Ile Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Asn Ile Asn Gln Asn Ile Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Asn Ile Asn
20 25 30 20 25 30
Tyr Lys Arg Leu Pro Arg Glu Tyr Asp Val Phe Arg Cys Asn Gln Phe Tyr Lys Arg Leu Pro Arg Glu Tyr Asp Val Phe Arg Cys Asn Gln Phe
35 40 45 35 40 45
Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Ile Lys Ala Val Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Ile Lys Ala Val Phe
50 55 60 50 55 60
Phe Asn Pro Gly Val Phe Leu Gln Gln Tyr His Thr Ala Lys Gln Leu Phe Asn Pro Gly Val Phe Leu Gln Gln Tyr His Thr Ala Lys Gln Leu
65 70 75 80 65 70 75 80
Ile Leu Lys Asn Glu Tyr Glu Ile Lys Asn Ile Phe Cys Ser Thr Phe Ile Leu Lys Asn Glu Tyr Glu Ile Lys Asn Ile Phe Cys Ser Thr Phe
85 90 95 85 90 95
Asn Leu Pro Phe Ile Glu Ser Asn Asp Phe Leu His Gln Phe Tyr Asn Asn Leu Pro Phe Ile Glu Ser Asn Asp Phe Leu His Gln Phe Tyr Asn
100 105 110 100 105 110
Phe Phe Pro Asp Ala Lys Leu Gly Tyr Glu Val Ile Glu Asn Leu Lys Phe Phe Pro Asp Ala Lys Leu Gly Tyr Glu Val Ile Glu Asn Leu Lys
115 120 125 115 120 125
Glu Phe Tyr Ala Tyr Ile Lys Tyr Asn Glu Ile Tyr Phe Asn Lys Arg Glu Phe Tyr Ala Tyr Ile Lys Tyr Asn Glu Ile Tyr Phe Asn Lys Arg
130 135 140 130 135 140
Ile Thr Ser Gly Val Tyr Met Cys Ala Ile Ala Ile Ala Leu Gly Tyr Ile Thr Ser Gly Val Tyr Met Cys Ala Ile Ala Ile Ala Leu Gly Tyr
145 150 155 160 145 150 155 160
Lys Thr Ile Tyr Leu Cys Gly Ile Asp Phe Tyr Glu Gly Asp Val Ile Lys Thr Ile Tyr Leu Cys Gly Ile Asp Phe Tyr Glu Gly Asp Val Ile
165 170 175 165 170 175
Tyr Pro Phe Glu Ala Met Ser Thr Asn Ile Lys Thr Ile Phe Pro Gly Tyr Pro Phe Glu Ala Met Ser Thr Asn Ile Lys Thr Ile Phe Pro Gly
180 185 190 180 185 190
Ile Lys Asp Phe Lys Pro Ser Asn Cys His Ser Lys Glu Tyr Asp Ile Ile Lys Asp Phe Lys Pro Ser Asn Cys His Ser Lys Glu Tyr Asp Ile
195 200 205 195 200 205
Glu Ala Leu Lys Leu Leu Lys Ser Ile Tyr Lys Val Asn Ile Tyr Ala Glu Ala Leu Lys Leu Leu Lys Ser Ile Tyr Lys Val Asn Ile Tyr Ala
210 215 220 210 215 220
Leu Cys Asp Asp Ser Ile Leu Ala Asn His Phe Pro Leu Ser Ile Asn Leu Cys Asp Asp Ser Ile Leu Ala Asn His Phe Pro Leu Ser Ile Asn
225 230 235 240 225 230 235 240
Ile Asn Asn Asn Phe Thr Leu Glu Asn Lys His Asn Asn Ser Ile Asn Ile Asn Asn Asn Phe Thr Leu Glu Asn Lys His Asn Asn Ser Ile Asn
245 250 255 245 250 255
Asp Ile Leu Leu Thr Asp Asn Thr Pro Gly Val Ser Phe Tyr Lys Asn Asp Ile Leu Leu Thr Asp Asn Thr Pro Gly Val Ser Phe Tyr Lys Asn
260 265 270 260 265 270
Gln Leu Lys Ala Asp Asn Lys Ile Met Leu Asn Phe Tyr Asn Ile Leu Gln Leu Lys Ala Asp Asn Lys Ile Met Leu Asn Phe Tyr Asn Ile Leu
275 280 285 275 280 285
His Ser Lys Asp Asn Leu Ile Lys Phe Leu Asn Lys Glu Ile Ala Val His Ser Lys Asp Asn Leu Ile Lys Phe Leu Asn Lys Glu Ile Ala Val
290 295 300 290 295 300
Leu Lys Lys Gln Thr Thr Gln Arg Ala Lys Ala Arg Ile Gln Asn His Leu Lys Lys Gln Thr Thr Gln Arg Ala Lys Ala Arg Ile Gln Asn His
305 310 315 320 305 310 315 320
Leu Ser Tyr Lys Leu Gly Gln Ala Leu Ile Ile Asn Ser Lys Ser Val Leu Ser Tyr Lys Leu Gly Gln Ala Leu Ile Ile Asn Ser Lys Ser Val
325 330 335 325 330 335
Leu Gly Phe Leu Ser Leu Pro Phe Ile Ile Leu Ser Ile Val Ile Ser Leu Gly Phe Leu Ser Leu Pro Phe Ile Ile Leu Ser Ile Val Ile Ser
340 345 350 340 345 350
His Lys Gln Glu Gln Lys Ala Tyr Lys Phe Lys Val Lys Lys Asn Pro His Lys Gln Glu Gln Lys Ala Tyr Lys Phe Lys Val Lys Lys Asn Pro
355 360 365 355 360 365
Asn Leu Ala Leu Pro Pro Leu Glu Thr Tyr Pro Asp Tyr Asn Glu Ala Asn Leu Ala Leu Pro Pro Leu Glu Thr Tyr Pro Asp Tyr Asn Glu Ala
370 375 380 370 375 380
Leu Lys Glu Lys Glu Cys Phe Thr Tyr Lys Leu Gly Glu Glu Phe Ile Leu Lys Glu Lys Glu Cys Phe Thr Tyr Lys Leu Gly Glu Glu Phe Ile
385 390 395 400 385 390 395 400
Lys Ala Gly Lys Asn Trp Tyr Gly Glu Gly Tyr Ile Lys Phe Ile Phe Lys Ala Gly Lys Asn Trp Tyr Gly Glu Gly Tyr Ile Lys Phe Ile Phe
405 410 415 405 410 415
Lys Asp Val Pro Arg Leu Lys Arg Glu Phe Glu Lys Gly Glu Lys Asp Val Pro Arg Leu Lys Arg Glu Phe Glu Lys Gly Glu
420 425 430 420 425 430
<210> 44<210> 44
<211> 395<211> 395
<212> Белок<212> Protein
<213> Heliobacter acinonychis<213> Heliobacter acinonychis
<400> 44<400> 44
Met Asn Lys Lys Pro Leu Ile Ile Ala Gly Asn Gly Pro Ser Ile Lys Met Asn Lys Lys Pro Leu Ile Ile Ala Gly Asn Gly Pro Ser Ile Lys
1 5 10 15 1 5 10 15
Asp Leu Asp Tyr Ala Leu Phe Pro Lys Asp Phe Asp Val Phe Arg Cys Asp Leu Asp Tyr Ala Leu Phe Pro Lys Asp Phe Asp Val Phe Arg Cys
20 25 30 20 25 30
Asn Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Arg Glu Ile Lys Asn Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Arg Glu Ile Lys
35 40 45 35 40 45
Gly Val Phe Phe Asn Ala His Val Phe Asp Leu Gln Met Lys Ile Thr Gly Val Phe Phe Asn Ala His Val Phe Asp Leu Gln Met Lys Ile Thr
50 55 60 50 55 60
Lys Ala Ile Val Lys Asn Gly Glu Tyr His Pro Asp His Ile Tyr Cys Lys Ala Ile Val Lys Asn Gly Glu Tyr His Pro Asp His Ile Tyr Cys
65 70 75 80 65 70 75 80
Thr His Val Glu Pro Tyr Gly Tyr Val Asn Gly Asn Gln Gln Leu Met Thr His Val Glu Pro Tyr Gly Tyr Val Asn Gly Asn Gln Gln Leu Met
85 90 95 85 90 95
Gln Glu Tyr Leu Glu Lys His Phe Val Gly Val Arg Ser Thr Tyr Ala Gln Glu Tyr Leu Glu Lys His Phe Val Gly Val Arg Ser Thr Tyr Ala
100 105 110 100 105 110
Tyr Leu Lys Asp Leu Glu Pro Phe Phe Ile Leu His Ser Lys Tyr Arg Tyr Leu Lys Asp Leu Glu Pro Phe Phe Ile Leu His Ser Lys Tyr Arg
115 120 125 115 120 125
Asn Phe Tyr Asp Gln His Phe Thr Thr Gly Ile Met Met Leu Leu Val Asn Phe Tyr Asp Gln His Phe Thr Thr Gly Ile Met Met Leu Leu Val
130 135 140 130 135 140
Ala Ile Gln Leu Gly Tyr Lys Glu Ile Tyr Leu Cys Gly Ile Asp Phe Ala Ile Gln Leu Gly Tyr Lys Glu Ile Tyr Leu Cys Gly Ile Asp Phe
145 150 155 160 145 150 155 160
Tyr Glu Asn Gly Phe Gly His Phe Tyr Glu Asn Gln Gly Gly Phe Phe Tyr Glu Asn Gly Phe Gly His Phe Tyr Glu Asn Gln Gly Gly Phe Phe
165 170 175 165 170 175
Glu Glu Asp Ser Asp Pro Met His Asp Lys Asn Ile Asp Ile Gln Ala Glu Glu Asp Ser Asp Pro Met His Asp Lys Asn Ile Asp Ile Gln Ala
180 185 190 180 185 190
Leu Glu Leu Ala Lys Lys Tyr Ala Lys Ile Tyr Ala Leu Val Pro Asn Leu Glu Leu Ala Lys Lys Tyr Ala Lys Ile Tyr Ala Leu Val Pro Asn
195 200 205 195 200 205
Ser Ala Leu Val Lys Met Ile Pro Leu Ser Ser Gln Lys Gly Val Leu Ser Ala Leu Val Lys Met Ile Pro Leu Ser Ser Gln Lys Gly Val Leu
210 215 220 210 215 220
Glu Lys Val Lys Asp Arg Ile Gly Leu Gly Glu Phe Lys Arg Glu Lys Glu Lys Val Lys Asp Arg Ile Gly Leu Gly Glu Phe Lys Arg Glu Lys
225 230 235 240 225 230 235 240
Phe Gly Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Phe Gly Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys
245 250 255 245 250 255
Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg
260 265 270 260 265 270
Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu
275 280 285 275 280 285
Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys
290 295 300 290 295 300
Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg
305 310 315 320 305 310 315 320
Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu
325 330 335 325 330 335
Glu Leu Glu Arg Ser Leu Lys Ala Arg Leu Lys Ala Val Leu Ala Ser Glu Leu Glu Arg Ser Leu Lys Ala Arg Leu Lys Ala Val Leu Ala Ser
340 345 350 340 345 350
Lys Gly Ile Arg Gly Asp Asn Leu Ile Ile Val Ser Leu Lys Asp Thr Lys Gly Ile Arg Gly Asp Asn Leu Ile Ile Val Ser Leu Lys Asp Thr
355 360 365 355 360 365
Tyr Arg Leu Phe Lys Gly Gly Phe Ala Leu Leu Leu Asp Leu Lys Ala Tyr Arg Leu Phe Lys Gly Gly Phe Ala Leu Leu Leu Asp Leu Lys Ala
370 375 380 370 375 380
Leu Lys Ser Ile Ile Lys Ala Phe Leu Lys Arg Leu Lys Ser Ile Ile Lys Ala Phe Leu Lys Arg
385 390 395 385 390 395
<210> 45<210> 45
<211> 260<211> 260
<212> Белок<212> Protein
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 45<400> 45
Met Gly Lys Lys Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Glu Met Gly Lys Lys Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Glu
1 5 10 15 1 5 10 15
Ile Asp Tyr Ser Arg Leu Pro Asn Asp Phe Asp Val Phe Arg Cys Asn Ile Asp Tyr Ser Arg Leu Pro Asn Asp Phe Asp Val Phe Arg Cys Asn
20 25 30 20 25 30
Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Cys Lys Ala Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Cys Lys Ala
35 40 45 35 40 45
Val Phe Tyr Asn Pro Ile Leu Phe Phe Glu Gln Tyr Tyr Thr Leu Lys Val Phe Tyr Asn Pro Ile Leu Phe Phe Glu Gln Tyr Tyr Thr Leu Lys
50 55 60 50 55 60
His Leu Ile Gln Asn Gln Glu Tyr Glu Thr Glu Leu Ile Met Cys Ser His Leu Ile Gln Asn Gln Glu Tyr Glu Thr Glu Leu Ile Met Cys Ser
65 70 75 80 65 70 75 80
Asn Tyr Asn Gln Ala His Leu Glu Asn Glu Asn Phe Val Lys Thr Phe Asn Tyr Asn Gln Ala His Leu Glu Asn Glu Asn Phe Val Lys Thr Phe
85 90 95 85 90 95
Tyr Asp Tyr Phe Pro Asp Ala His Leu Gly Tyr Asp Phe Phe Lys Gln Tyr Asp Tyr Phe Pro Asp Ala His Leu Gly Tyr Asp Phe Phe Lys Gln
100 105 110 100 105 110
Leu Lys Asp Phe Asn Ala Tyr Phe Lys Phe His Glu Ile Tyr Phe Asn Leu Lys Asp Phe Asn Ala Tyr Phe Lys Phe His Glu Ile Tyr Phe Asn
115 120 125 115 120 125
Gln Arg Ile Thr Ser Gly Val Tyr Met Cys Ala Val Ala Ile Ala Leu Gln Arg Ile Thr Ser Gly Val Tyr Met Cys Ala Val Ala Ile Ala Leu
130 135 140 130 135 140
Gly Tyr Lys Glu Ile Tyr Leu Ser Gly Ile Asp Phe Tyr Gln Asn Gly Gly Tyr Lys Glu Ile Tyr Leu Ser Gly Ile Asp Phe Tyr Gln Asn Gly
145 150 155 160 145 150 155 160
Ser Ser Tyr Ala Phe Asp Thr Lys Gln Lys Asn Leu Leu Lys Leu Ala Ser Ser Tyr Ala Phe Asp Thr Lys Gln Lys Asn Leu Leu Lys Leu Ala
165 170 175 165 170 175
Pro Asn Phe Lys Asn Asp Asn Ser His Tyr Ile Gly His Ser Lys Asn Pro Asn Phe Lys Asn Asp Asn Ser His Tyr Ile Gly His Ser Lys Asn
180 185 190 180 185 190
Thr Asp Ile Lys Ala Leu Glu Phe Leu Glu Lys Thr Tyr Lys Ile Lys Thr Asp Ile Lys Ala Leu Glu Phe Leu Glu Lys Thr Tyr Lys Ile Lys
195 200 205 195 200 205
Leu Tyr Cys Leu Cys Pro Asn Ser Leu Leu Ala Asn Phe Ile Glu Leu Leu Tyr Cys Leu Cys Pro Asn Ser Leu Leu Ala Asn Phe Ile Glu Leu
210 215 220 210 215 220
Ala Pro Asn Leu Asn Ser Asn Phe Ile Ile Gln Glu Lys Asn Asn Tyr Ala Pro Asn Leu Asn Ser Asn Phe Ile Ile Gln Glu Lys Asn Asn Tyr
225 230 235 240 225 230 235 240
Thr Lys Asp Ile Leu Ile Pro Ser Ser Glu Ala Tyr Gly Lys Phe Ser Thr Lys Asp Ile Leu Ile Pro Ser Ser Glu Ala Tyr Gly Lys Phe Ser
245 250 255 245 250 255
Lys Asn Ile Asn Lys Asn Ile Asn
260 260
<210> 46<210> 46
<211> 298<211> 298
<212> Белок<212> Protein
<213> Streptococcus entericus<213> Streptococcus entericus
<400> 46<400> 46
Met Lys Lys Val Tyr Phe Cys His Thr Val Tyr His Leu Leu Ile Thr Met Lys Lys Val Tyr Phe Cys His Thr Val Tyr His Leu Leu Ile Thr
1 5 10 15 1 5 10 15
Leu Cys Lys Ile Ser Val Glu Glu Gln Val Glu Ile Ile Val Phe Asp Leu Cys Lys Ile Ser Val Glu Glu Gln Val Glu Ile Ile Val Phe Asp
20 25 30 20 25 30
Thr Val Ser Asn His Glu Leu Ile Val Gln Lys Ile Arg Asp Val Phe Thr Val Ser Asn His Glu Leu Ile Val Gln Lys Ile Arg Asp Val Phe
35 40 45 35 40 45
Val Asn Thr Thr Val Leu Phe Ala Glu Gln Asn Thr Asp Phe Ser Ile Val Asn Thr Thr Val Leu Phe Ala Glu Gln Asn Thr Asp Phe Ser Ile
50 55 60 50 55 60
Leu Glu Ile Asp Arg Ala Thr Asp Ile Tyr Val Phe Asn Asp Trp Thr Leu Glu Ile Asp Arg Ala Thr Asp Ile Tyr Val Phe Asn Asp Trp Thr
65 70 75 80 65 70 75 80
Pro Ile Gly Ala Tyr Leu Arg Lys Asn Lys Leu Phe Tyr His Leu Ile Pro Ile Gly Ala Tyr Leu Arg Lys Asn Lys Leu Phe Tyr His Leu Ile
85 90 95 85 90 95
Glu Asp Gly Tyr Asn Tyr His Glu Tyr Asn Val Tyr Ala Asn Ala Leu Glu Asp Gly Tyr Asn Tyr His Glu Tyr Asn Val Tyr Ala Asn Ala Leu
100 105 110 100 105 110
Thr Met Lys Arg Arg Leu Leu Asn Phe Val Leu Arg Arg Glu Glu Pro Thr Met Lys Arg Arg Leu Leu Asn Phe Val Leu Arg Arg Glu Glu Pro
115 120 125 115 120 125
Ser Gly Phe Ser Arg Tyr Val Arg Ser Ile Glu Val Asn Arg Val Lys Ser Gly Phe Ser Arg Tyr Val Arg Ser Ile Glu Val Asn Arg Val Lys
130 135 140 130 135 140
Tyr Leu Pro Asn Asp Cys Arg Lys Ser Lys Trp Val Glu Lys Pro Arg Tyr Leu Pro Asn Asp Cys Arg Lys Ser Lys Trp Val Glu Lys Pro Arg
145 150 155 160 145 150 155 160
Ser Ala Leu Phe Glu Asn Leu Val Pro Glu His Lys Gln Lys Ile Ile Ser Ala Leu Phe Glu Asn Leu Val Pro Glu His Lys Gln Lys Ile Ile
165 170 175 165 170 175
Thr Ile Phe Gly Leu Glu Asn Tyr Gln Asp Ser Leu Arg Gly Val Leu Thr Ile Phe Gly Leu Glu Asn Tyr Gln Asp Ser Leu Arg Gly Val Leu
180 185 190 180 185 190
Val Leu Thr Gln Pro Leu Val Gln Asp Tyr Trp Asp Arg Asp Ile Thr Val Leu Thr Gln Pro Leu Val Gln Asp Tyr Trp Asp Arg Asp Ile Thr
195 200 205 195 200 205
Thr Glu Glu Glu Gln Leu Glu Phe Tyr Arg Gln Ile Val Glu Ser Tyr Thr Glu Glu Glu Gln Leu Glu Phe Tyr Arg Gln Ile Val Glu Ser Tyr
210 215 220 210 215 220
Gly Glu Gly Glu Gln Val Phe Phe Lys Ile His Pro Arg Asp Lys Val Gly Glu Gly Glu Gln Val Phe Phe Lys Ile His Pro Arg Asp Lys Val
225 230 235 240 225 230 235 240
Asp Tyr Ser Ser Leu Thr Asn Val Ile Phe Leu Lys Lys Asn Val Pro Asp Tyr Ser Ser Leu Thr Asn Val Ile Phe Leu Lys Lys Asn Val Pro
245 250 255 245 250 255
Met Glu Val Tyr Glu Leu Ile Ala Asp Cys His Phe Thr Lys Gly Ile Met Glu Val Tyr Glu Leu Ile Ala Asp Cys His Phe Thr Lys Gly Ile
260 265 270 260 265 270
Thr His Ser Ser Thr Ala Leu Asp Phe Leu Ser Cys Val Asp Lys Lys Thr His Ser Ser Thr Ala Leu Asp Phe Leu Ser Cys Val Asp Lys Lys
275 280 285 275 280 285
Ile Thr Leu Lys Gln Met Lys Ala Asn Ser Ile Thr Leu Lys Gln Met Lys Ala Asn Ser
290 295 290 295
<210> 47<210> 47
<211> 295<211> 295
<212> Белок<212> Protein
<213> Haemophilus ducreyi<213> Haemophilus ducreyi
<400> 47<400> 47
Met Lys Glu Ile Ala Ile Ile Ser Asn Gln Arg Met Phe Phe Leu Tyr Met Lys Glu Ile Ala Ile Ile Ser Asn Gln Arg Met Phe Phe Leu Tyr
1 5 10 15 1 5 10 15
Cys Leu Leu Thr Asn Lys Asn Val Glu Asp Val Phe Phe Ile Phe Glu Cys Leu Leu Thr Asn Lys Asn Val Glu Asp Val Phe Phe Ile Phe Glu
20 25 30 20 25 30
Lys Gly Ala Met Pro Asn Asn Leu Thr Ser Ile Ser His Phe Ile Val Lys Gly Ala Met Pro Asn Asn Leu Thr Ser Ile Ser His Phe Ile Val
35 40 45 35 40 45
Leu Asp His Ser Lys Ser Glu Cys Tyr Asp Phe Phe Tyr Phe Asn Phe Leu Asp His Ser Lys Ser Glu Cys Tyr Asp Phe Phe Tyr Phe Asn Phe
50 55 60 50 55 60
Ile Ser Cys Lys Tyr Arg Leu Arg Gly Leu Asp Val Tyr Gly Ala Asp Ile Ser Cys Lys Tyr Arg Leu Arg Gly Leu Asp Val Tyr Gly Ala Asp
65 70 75 80 65 70 75 80
His Ile Lys Gly Ala Lys Phe Phe Leu Glu Arg His Arg Phe Phe Val His Ile Lys Gly Ala Lys Phe Phe Leu Glu Arg His Arg Phe Phe Val
85 90 95 85 90 95
Val Glu Asp Gly Met Met Asn Tyr Ser Lys Asn Met Tyr Ala Phe Ser Val Glu Asp Gly Met Met Asn Tyr Ser Lys Asn Met Tyr Ala Phe Ser
100 105 110 100 105 110
Leu Phe Arg Thr Arg Asn Pro Val Ile Leu Pro Gly Gly Phe His Pro Leu Phe Arg Thr Arg Asn Pro Val Ile Leu Pro Gly Gly Phe His Pro
115 120 125 115 120 125
Asn Val Lys Thr Ile Phe Leu Thr Lys Asp Asn Pro Ile Pro Asp Gln Asn Val Lys Thr Ile Phe Leu Thr Lys Asp Asn Pro Ile Pro Asp Gln
130 135 140 130 135 140
Ile Ala His Lys Arg Glu Ile Ile Asn Ile Lys Thr Leu Trp Gln Ala Ile Ala His Lys Arg Glu Ile Ile Asn Ile Lys Thr Leu Trp Gln Ala
145 150 155 160 145 150 155 160
Lys Thr Ala Thr Glu Lys Thr Lys Ile Leu Ser Phe Phe Glu Ile Asp Lys Thr Ala Thr Glu Lys Thr Lys Ile Leu Ser Phe Phe Glu Ile Asp
165 170 175 165 170 175
Met Gln Glu Ile Ser Val Ile Lys Asn Arg Ser Phe Val Leu Tyr Thr Met Gln Glu Ile Ser Val Ile Lys Asn Arg Ser Phe Val Leu Tyr Thr
180 185 190 180 185 190
Gln Pro Leu Ser Glu Asp Lys Leu Leu Thr Glu Ala Glu Lys Ile Asp Gln Pro Leu Ser Glu Asp Lys Leu Leu Thr Glu Ala Glu Lys Ile Asp
195 200 205 195 200 205
Ile Tyr Arg Thr Ile Leu Thr Lys Tyr Asn His Ser Gln Thr Val Ile Ile Tyr Arg Thr Ile Leu Thr Lys Tyr Asn His Ser Gln Thr Val Ile
210 215 220 210 215 220
Lys Pro His Pro Arg Asp Lys Thr Asp Tyr Lys Gln Leu Phe Pro Asp Lys Pro His Pro Arg Asp Lys Thr Asp Tyr Lys Gln Leu Phe Pro Asp
225 230 235 240 225 230 235 240
Ala Tyr Val Met Lys Gly Thr Tyr Pro Ser Glu Leu Leu Thr Leu Leu Ala Tyr Val Met Lys Gly Thr Tyr Pro Ser Glu Leu Leu Thr Leu Leu
245 250 255 245 250 255
Gly Val Asn Phe Asn Lys Val Ile Thr Leu Phe Ser Thr Ala Val Phe Gly Val Asn Phe Asn Lys Val Ile Thr Leu Phe Ser Thr Ala Val Phe
260 265 270 260 265 270
Asp Tyr Pro Lys Glu Lys Ile Asp Phe Tyr Gly Thr Ala Val His Pro Asp Tyr Pro Lys Glu Lys Ile Asp Phe Tyr Gly Thr Ala Val His Pro
275 280 285 275 280 285
Lys Leu Leu Asp Phe Phe Asp Lys Leu Leu Asp Phe Phe Asp
290 295 290 295
<210> 48<210> 48
<211> 488<211> 488
<212> Белок<212> Protein
<213> Alistipes sp.<213> Alistipes sp.
<400> 48<400> 48
Met Ala Leu Leu Ser Gly Thr Ala Ala Cys Ser Asp Asp Glu Val Ser Met Ala Leu Leu Ser Gly Thr Ala Ala Cys Ser Asp Asp Glu Val Ser
1 5 10 15 1 5 10 15
Gln Asn Leu Ile Val Ile Asn Gly Gly Glu His Phe Leu Ser Leu Asp Gln Asn Leu Ile Val Ile Asn Gly Gly Glu His Phe Leu Ser Leu Asp
20 25 30 20 25 30
Gly Leu Ala Arg Ala Gly Lys Ile Ser Val Leu Ala Pro Ala Pro Trp Gly Leu Ala Arg Ala Gly Lys Ile Ser Val Leu Ala Pro Ala Pro Trp
35 40 45 35 40 45
Arg Val Thr Lys Ala Ala Gly Asp Thr Trp Phe Arg Leu Ser Ala Thr Arg Val Thr Lys Ala Ala Gly Asp Thr Trp Phe Arg Leu Ser Ala Thr
50 55 60 50 55 60
Glu Gly Pro Ala Gly Tyr Ser Glu Val Glu Leu Ser Leu Asp Glu Asn Glu Gly Pro Ala Gly Tyr Ser Glu Val Glu Leu Ser Leu Asp Glu Asn
65 70 75 80 65 70 75 80
Pro Gly Ala Ala Arg Ser Ala Gln Leu Ala Phe Ala Cys Gly Asp Ala Pro Gly Ala Ala Arg Ser Ala Gln Leu Ala Phe Ala Cys Gly Asp Ala
85 90 95 85 90 95
Ile Val Pro Phe Arg Leu Ser Gln Gly Ala Leu Ser Ala Gly Tyr Asp Ile Val Pro Phe Arg Leu Ser Gln Gly Ala Leu Ser Ala Gly Tyr Asp
100 105 110 100 105 110
Ser Pro Asp Tyr Tyr Phe Tyr Val Thr Phe Gly Thr Met Pro Thr Leu Ser Pro Asp Tyr Tyr Phe Tyr Val Thr Phe Gly Thr Met Pro Thr Leu
115 120 125 115 120 125
Tyr Ala Gly Ile His Leu Leu Ser His Asp Lys Pro Gly Tyr Val Phe Tyr Ala Gly Ile His Leu Leu Ser His Asp Lys Pro Gly Tyr Val Phe
130 135 140 130 135 140
Tyr Ser Arg Ser Lys Thr Phe Asp Pro Ala Glu Phe Pro Ala Arg Ala Tyr Ser Arg Ser Lys Thr Phe Asp Pro Ala Glu Phe Pro Ala Arg Ala
145 150 155 160 145 150 155 160
Glu Val Thr Thr Ala Ala Asp Arg Thr Ala Asp Ala Thr Gln Ala Glu Glu Val Thr Thr Ala Ala Asp Arg Thr Ala Asp Ala Thr Gln Ala Glu
165 170 175 165 170 175
Met Glu Ala Met Ala Arg Glu Met Lys Arg Arg Ile Leu Glu Ile Asn Met Glu Ala Met Ala Arg Glu Met Lys Arg Arg Ile Leu Glu Ile Asn
180 185 190 180 185 190
Ser Ala Asp Pro Thr Ala Val Phe Gly Leu Tyr Val Asp Asp Leu Arg Ser Ala Asp Pro Thr Ala Val Phe Gly Leu Tyr Val Asp Asp Leu Arg
195 200 205 195 200 205
Cys Arg Ile Gly Tyr Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Ala Cys Arg Ile Gly Tyr Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Ala
210 215 220 210 215 220
Arg Val Lys Val Ser Met Leu Ser Asp Gly Thr Gly Thr Tyr Asn Asn Arg Val Lys Val Ser Met Leu Ser Asp Gly Thr Gly Thr Tyr Asn Asn
225 230 235 240 225 230 235 240
Phe Tyr Asn Tyr Phe Gly Asp Ala Ala Thr Ala Glu Gln Asn Trp Glu Phe Tyr Asn Tyr Phe Gly Asp Ala Ala Thr Ala Glu Gln Asn Trp Glu
245 250 255 245 250 255
Ser Tyr Ala Ser Glu Val Glu Ala Leu Asp Trp Asn His Gly Gly Arg Ser Tyr Ala Ser Glu Val Glu Ala Leu Asp Trp Asn His Gly Gly Arg
260 265 270 260 265 270
Tyr Pro Glu Thr Arg Ser Leu Pro Glu Phe Glu Ser Tyr Thr Trp Pro Tyr Pro Glu Thr Arg Ser Leu Pro Glu Phe Glu Ser Tyr Thr Trp Pro
275 280 285 275 280 285
Tyr Tyr Leu Ser Thr Arg Pro Asp Tyr Arg Leu Val Val Gln Asp Gly Tyr Tyr Leu Ser Thr Arg Pro Asp Tyr Arg Leu Val Val Gln Asp Gly
290 295 300 290 295 300
Ser Leu Leu Glu Ser Ser Cys Pro Phe Ile Thr Glu Lys Leu Gly Glu Ser Leu Leu Glu Ser Ser Cys Pro Phe Ile Thr Glu Lys Leu Gly Glu
305 310 315 320 305 310 315 320
Met Glu Ile Glu Ser Ile Gln Pro Tyr Glu Met Leu Ser Ala Leu Pro Met Glu Ile Glu Ser Ile Gln Pro Tyr Glu Met Leu Ser Ala Leu Pro
325 330 335 325 330 335
Glu Ser Ser Arg Lys Arg Phe Tyr Asp Met Ala Gly Phe Asp Tyr Asp Glu Ser Ser Arg Lys Arg Phe Tyr Asp Met Ala Gly Phe Asp Tyr Asp
340 345 350 340 345 350
Lys Phe Ala Ala Leu Phe Asp Ala Ser Pro Lys Lys Asn Leu Ile Ile Lys Phe Ala Ala Leu Phe Asp Ala Ser Pro Lys Lys Asn Leu Ile Ile
355 360 365 355 360 365
Ile Gly Thr Ser His Ala Asp Asp Ala Ser Ala Arg Leu Gln Arg Asp Ile Gly Thr Ser His Ala Asp Asp Ala Ser Ala Arg Leu Gln Arg Asp
370 375 380 370 375 380
Tyr Val Ala Arg Ile Met Glu Gln Tyr Gly Ala Gln Tyr Asp Val Phe Tyr Val Ala Arg Ile Met Glu Gln Tyr Gly Ala Gln Tyr Asp Val Phe
385 390 395 400 385 390 395 400
Phe Lys Pro His Pro Ala Asp Thr Thr Ser Ala Gly Tyr Glu Thr Glu Phe Lys Pro His Pro Ala Asp Thr Thr Ser Ala Gly Tyr Glu Thr Glu
405 410 415 405 410 415
Phe Pro Gly Leu Thr Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe Phe Pro Gly Leu Thr Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe
420 425 430 420 425 430
Val Trp Ser Leu Ile Asp Arg Val Asp Met Ile Gly Gly Tyr Pro Ser Val Trp Ser Leu Ile Asp Arg Val Asp Met Ile Gly Gly Tyr Pro Ser
435 440 445 435 440 445
Thr Val Phe Leu Thr Val Pro Val Asp Lys Val Arg Phe Ile Phe Ala Thr Val Phe Leu Thr Val Pro Val Asp Lys Val Arg Phe Ile Phe Ala
450 455 460 450 455 460
Ala Asp Ala Ala Ser Leu Val Arg Pro Leu Asn Ile Leu Phe Arg Asp Ala Asp Ala Ala Ser Leu Val Arg Pro Leu Asn Ile Leu Phe Arg Asp
465 470 475 480 465 470 475 480
Ala Thr Asp Val Glu Trp Met Gln Ala Thr Asp Val Glu Trp Met Gln
485 485
<210> 49<210> 49
<211> 291<211> 291
<212> Белок<212> Protein
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 49<400> 49
Met Lys Lys Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Glu Ile Met Lys Lys Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Glu Ile
1 5 10 15 1 5 10 15
Asp Tyr Ser Arg Leu Pro Asn Asp Phe Asp Val Phe Arg Cys Asn Gln Asp Tyr Ser Arg Leu Pro Asn Asp Phe Asp Val Phe Arg Cys Asn Gln
20 25 30 20 25 30
Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Cys Lys Ala Val Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Cys Lys Ala Val
35 40 45 35 40 45
Phe Tyr Thr Pro Asn Phe Phe Phe Glu Gln Tyr Tyr Thr Leu Lys His Phe Tyr Thr Pro Asn Phe Phe Phe Glu Gln Tyr Tyr Thr Leu Lys His
50 55 60 50 55 60
Leu Ile Gln Asn Gln Glu Tyr Glu Thr Glu Leu Ile Met Cys Ser Asn Leu Ile Gln Asn Gln Glu Tyr Glu Thr Glu Leu Ile Met Cys Ser Asn
65 70 75 80 65 70 75 80
Tyr Asn Gln Ala His Leu Glu Asn Glu Asn Phe Val Lys Thr Phe Tyr Tyr Asn Gln Ala His Leu Glu Asn Glu Asn Phe Val Lys Thr Phe Tyr
85 90 95 85 90 95
Asp Tyr Phe Pro Asp Ala His Leu Gly Tyr Asp Phe Phe Lys Gln Leu Asp Tyr Phe Pro Asp Ala His Leu Gly Tyr Asp Phe Phe Lys Gln Leu
100 105 110 100 105 110
Lys Glu Phe Asn Ala Tyr Phe Lys Phe His Glu Ile Tyr Phe Asn Gln Lys Glu Phe Asn Ala Tyr Phe Lys Phe His Glu Ile Tyr Phe Asn Gln
115 120 125 115 120 125
Arg Ile Thr Ser Gly Val Tyr Met Cys Ala Val Ala Ile Ala Leu Gly Arg Ile Thr Ser Gly Val Tyr Met Cys Ala Val Ala Ile Ala Leu Gly
130 135 140 130 135 140
Tyr Lys Glu Ile Tyr Leu Ser Gly Ile Asp Phe Tyr Gln Asn Gly Ser Tyr Lys Glu Ile Tyr Leu Ser Gly Ile Asp Phe Tyr Gln Asn Gly Ser
145 150 155 160 145 150 155 160
Ser Tyr Ala Phe Asp Thr Lys Gln Glu Asn Leu Leu Lys Leu Ala Pro Ser Tyr Ala Phe Asp Thr Lys Gln Glu Asn Leu Leu Lys Leu Ala Pro
165 170 175 165 170 175
Asp Phe Lys Asn Asp Arg Ser His Tyr Ile Gly His Ser Lys Asn Thr Asp Phe Lys Asn Asp Arg Ser His Tyr Ile Gly His Ser Lys Asn Thr
180 185 190 180 185 190
Asp Ile Lys Ala Leu Glu Phe Leu Glu Lys Thr Tyr Lys Ile Lys Leu Asp Ile Lys Ala Leu Glu Phe Leu Glu Lys Thr Tyr Lys Ile Lys Leu
195 200 205 195 200 205
Tyr Cys Leu Cys Pro Asn Ser Leu Leu Ala Asn Phe Ile Glu Leu Ala Tyr Cys Leu Cys Pro Asn Ser Leu Leu Ala Asn Phe Ile Glu Leu Ala
210 215 220 210 215 220
Pro Asn Leu Asn Ser Asn Phe Ile Ile Gln Glu Lys Asn Asn Tyr Thr Pro Asn Leu Asn Ser Asn Phe Ile Ile Gln Glu Lys Asn Asn Tyr Thr
225 230 235 240 225 230 235 240
Lys Asp Ile Leu Ile Pro Ser Ser Glu Ala Tyr Gly Lys Phe Ser Lys Lys Asp Ile Leu Ile Pro Ser Ser Glu Ala Tyr Gly Lys Phe Ser Lys
245 250 255 245 250 255
Asn Ile Asn Phe Lys Lys Ile Lys Ile Lys Glu Asn Val Tyr Tyr Lys Asn Ile Asn Phe Lys Lys Ile Lys Ile Lys Glu Asn Val Tyr Tyr Lys
260 265 270 260 265 270
Leu Ile Lys Asp Leu Leu Arg Leu Pro Ser Asp Ile Lys His Tyr Phe Leu Ile Lys Asp Leu Leu Arg Leu Pro Ser Asp Ile Lys His Tyr Phe
275 280 285 275 280 285
Lys Gly Lys Lys Gly Lys
290 290
<210> 50<210> 50
<211> 312<211> 312
<212> Белок<212> Protein
<213> Streptococcus agalactiae<213> Streptococcus agalactiae
<400> 50<400> 50
Met Thr Asn Arg Lys Ile Tyr Val Cys His Thr Leu Tyr His Leu Leu Met Thr Asn Arg Lys Ile Tyr Val Cys His Thr Leu Tyr His Leu Leu
1 5 10 15 1 5 10 15
Ile Cys Leu Tyr Lys Glu Glu Ile Tyr Ser Asn Leu Glu Ile Ile Leu Ile Cys Leu Tyr Lys Glu Glu Ile Tyr Ser Asn Leu Glu Ile Ile Leu
20 25 30 20 25 30
Ser Ser Ser Ile Pro Asp Val Asp Asn Leu Glu Lys Lys Leu Lys Ser Ser Ser Ser Ile Pro Asp Val Asp Asn Leu Glu Lys Lys Leu Lys Ser
35 40 45 35 40 45
Lys Thr Ile Asn Ile His Ile Leu Glu Glu Ser Ser Gly Glu Ser Glu Lys Thr Ile Asn Ile His Ile Leu Glu Glu Ser Ser Gly Glu Ser Glu
50 55 60 50 55 60
Glu Leu Leu Ser Val Leu Lys Asp Ala Gly Leu Ser Tyr Ser Lys Phe Glu Leu Leu Ser Val Leu Lys Asp Ala Gly Leu Ser Tyr Ser Lys Phe
65 70 75 80 65 70 75 80
Asp Ser Asn Cys Phe Ile Phe Asn Asp Ala Thr Pro Ile Gly Arg Thr Asp Ser Asn Cys Phe Ile Phe Asn Asp Ala Thr Pro Ile Gly Arg Thr
85 90 95 85 90 95
Leu Ile Lys His Gly Ile Tyr Tyr Asn Leu Ile Glu Asp Gly Leu Asn Leu Ile Lys His Gly Ile Tyr Tyr Asn Leu Ile Glu Asp Gly Leu Asn
100 105 110 100 105 110
Cys Phe Thr Tyr Ser Ile Phe Ser Gln Lys Leu Trp Lys Tyr Tyr Val Cys Phe Thr Tyr Ser Ile Phe Ser Gln Lys Leu Trp Lys Tyr Tyr Val
115 120 125 115 120 125
Lys Lys Tyr Ile Leu His Lys Ile Gln Pro His Gly Phe Ser Arg Tyr Lys Lys Tyr Ile Leu His Lys Ile Gln Pro His Gly Phe Ser Arg Tyr
130 135 140 130 135 140
Cys Leu Gly Ile Glu Val Asn Ser Leu Val Asn Leu Pro Lys Asp Pro Cys Leu Gly Ile Glu Val Asn Ser Leu Val Asn Leu Pro Lys Asp Pro
145 150 155 160 145 150 155 160
Arg Tyr Lys Lys Phe Ile Glu Val Pro Arg Lys Glu Leu Phe Asp Asn Arg Tyr Lys Lys Phe Ile Glu Val Pro Arg Lys Glu Leu Phe Asp Asn
165 170 175 165 170 175
Val Thr Glu Tyr Gln Lys Glu Met Ala Ile Asn Leu Phe Gly Ala Val Val Thr Glu Tyr Gln Lys Glu Met Ala Ile Asn Leu Phe Gly Ala Val
180 185 190 180 185 190
Arg Val Ser Ile Lys Ser Pro Ser Val Leu Val Leu Thr Gln Pro Leu Arg Val Ser Ile Lys Ser Pro Ser Val Leu Val Leu Thr Gln Pro Leu
195 200 205 195 200 205
Ser Ile Asp Lys Glu Phe Met Ser Tyr Asn Asn Lys Ile Glu Thr Ser Ser Ile Asp Lys Glu Phe Met Ser Tyr Asn Asn Lys Ile Glu Thr Ser
210 215 220 210 215 220
Glu Glu Gln Phe Asn Phe Tyr Lys Ser Ile Val Asn Glu Tyr Ile Asn Glu Glu Gln Phe Asn Phe Tyr Lys Ser Ile Val Asn Glu Tyr Ile Asn
225 230 235 240 225 230 235 240
Lys Gly Tyr Asn Val Tyr Leu Lys Val His Pro Arg Asp Val Val Asp Lys Gly Tyr Asn Val Tyr Leu Lys Val His Pro Arg Asp Val Val Asp
245 250 255 245 250 255
Tyr Ser Lys Leu Pro Val Glu Leu Leu Pro Ser Asn Val Pro Met Glu Tyr Ser Lys Leu Pro Val Glu Leu Leu Pro Ser Asn Val Pro Met Glu
260 265 270 260 265 270
Ile Ile Glu Leu Met Leu Thr Gly Arg Phe Glu Cys Gly Ile Thr His Ile Ile Glu Leu Met Leu Thr Gly Arg Phe Glu Cys Gly Ile Thr His
275 280 285 275 280 285
Ser Ser Thr Ala Leu Asp Phe Leu Thr Cys Val Asp Lys Lys Ile Thr Ser Ser Thr Ala Leu Asp Phe Leu Thr Cys Val Asp Lys Lys Ile Thr
290 295 300 290 295 300
Leu Val Asp Leu Lys Asp Ile Lys Leu Val Asp Leu Lys Asp Ile Lys
305 310 305 310
<210> 51<210> 51
<211> 410<211> 410
<212> Белок<212> Protein
<213> Bibersteinia trehalosi<213> Bibersteinia trehalosi
<400> 51<400> 51
Met Glu Phe Cys Lys Met Ala Thr Thr Gln Lys Ile Cys Val Tyr Leu Met Glu Phe Cys Lys Met Ala Thr Thr Gln Lys Ile Cys Val Tyr Leu
1 5 10 15 1 5 10 15
Asp Tyr Ala Thr Ile Pro Ser Leu Asn Tyr Ile Leu His Phe Ala Gln Asp Tyr Ala Thr Ile Pro Ser Leu Asn Tyr Ile Leu His Phe Ala Gln
20 25 30 20 25 30
His Phe Glu Asp Gln Glu Thr Ile Arg Leu Phe Gly Leu Ser Arg Phe His Phe Glu Asp Gln Glu Thr Ile Arg Leu Phe Gly Leu Ser Arg Phe
35 40 45 35 40 45
His Ile Pro Glu Ser Val Ile Gln Arg Tyr Pro Lys Gly Val Val Gln His Ile Pro Glu Ser Val Ile Gln Arg Tyr Pro Lys Gly Val Val Gln
50 55 60 50 55 60
Phe Tyr Pro Asn Gln Glu Lys Asp Phe Ser Ala Leu Leu Leu Ala Leu Phe Tyr Pro Asn Gln Glu Lys Asp Phe Ser Ala Leu Leu Leu Ala Leu
65 70 75 80 65 70 75 80
Lys Asn Ile Leu Ile Glu Val Lys Gln Gln Gln Arg Lys Cys Glu Ile Lys Asn Ile Leu Ile Glu Val Lys Gln Gln Gln Arg Lys Cys Glu Ile
85 90 95 85 90 95
Glu Leu His Leu Asn Leu Phe His Tyr Gln Leu Leu Leu Leu Pro Phe Glu Leu His Leu Asn Leu Phe His Tyr Gln Leu Leu Leu Leu Pro Phe
100 105 110 100 105 110
Leu Ser Leu Tyr Leu Asp Thr Gln Asp Tyr Cys His Leu Thr Leu Lys Leu Ser Leu Tyr Leu Asp Thr Gln Asp Tyr Cys His Leu Thr Leu Lys
115 120 125 115 120 125
Phe Tyr Asp Asp Gly Ser Glu Ala Ile Ser Ala Leu Gln Glu Leu Ala Phe Tyr Asp Asp Gly Ser Glu Ala Ile Ser Ala Leu Gln Glu Leu Ala
130 135 140 130 135 140
Leu Ala Pro Asp Leu Ala Ala Gln Ile Gln Phe Glu Lys Gln Gln Phe Leu Ala Pro Asp Leu Ala Ala Gln Ile Gln Phe Glu Lys Gln Gln Phe
145 150 155 160 145 150 155 160
Asp Glu Leu Val Val Lys Lys Ser Phe Lys Leu Ser Leu Leu Ser Arg Asp Glu Leu Val Val Lys Lys Ser Phe Lys Leu Ser Leu Leu Ser Arg
165 170 175 165 170 175
Tyr Phe Trp Gly Lys Leu Phe Glu Ser Glu Tyr Ile Trp Phe Asn Gln Tyr Phe Trp Gly Lys Leu Phe Glu Ser Glu Tyr Ile Trp Phe Asn Gln
180 185 190 180 185 190
Ala Ile Leu Gln Lys Ala Glu Leu Gln Ile Leu Lys Gln Glu Ile Ser Ala Ile Leu Gln Lys Ala Glu Leu Gln Ile Leu Lys Gln Glu Ile Ser
195 200 205 195 200 205
Ser Ser Arg Gln Met Asp Phe Ala Ile Tyr Gln Gln Met Ser Asp Glu Ser Ser Arg Gln Met Asp Phe Ala Ile Tyr Gln Gln Met Ser Asp Glu
210 215 220 210 215 220
Gln Lys Gln Leu Val Leu Glu Ile Leu Asn Ile Asp Leu Asn Lys Val Gln Lys Gln Leu Val Leu Glu Ile Leu Asn Ile Asp Leu Asn Lys Val
225 230 235 240 225 230 235 240
Ala Tyr Leu Lys Gln Leu Met Glu Asn Gln Pro Ser Phe Leu Phe Leu Ala Tyr Leu Lys Gln Leu Met Glu Asn Gln Pro Ser Phe Leu Phe Leu
245 250 255 245 250 255
Gly Thr Thr Leu Phe Asn Ile Thr Gln Glu Thr Lys Thr Trp Leu Met Gly Thr Thr Leu Phe Asn Ile Thr Gln Glu Thr Lys Thr Trp Leu Met
260 265 270 260 265 270
Gln Met His Val Asp Leu Ile Gln Gln Tyr Cys Leu Pro Ser Gly Gln Gln Met His Val Asp Leu Ile Gln Gln Tyr Cys Leu Pro Ser Gly Gln
275 280 285 275 280 285
Phe Phe Asn Asn Lys Ala Gly Tyr Leu Cys Phe Tyr Lys Gly His Pro Phe Phe Asn Asn Lys Ala Gly Tyr Leu Cys Phe Tyr Lys Gly His Pro
290 295 300 290 295 300
Asn Glu Lys Glu Met Asn Gln Met Ile Leu Ser Gln Phe Lys Asn Leu Asn Glu Lys Glu Met Asn Gln Met Ile Leu Ser Gln Phe Lys Asn Leu
305 310 315 320 305 310 315 320
Ile Ala Leu Pro Asp Asp Ile Pro Leu Glu Ile Leu Leu Leu Leu Gly Ile Ala Leu Pro Asp Asp Ile Pro Leu Glu Ile Leu Leu Leu Leu Gly
325 330 335 325 330 335
Val Ile Pro Ser Lys Val Gly Gly Phe Ala Ser Ser Ala Leu Phe Asn Val Ile Pro Ser Lys Val Gly Gly Phe Ala Ser Ser Ala Leu Phe Asn
340 345 350 340 345 350
Phe Thr Pro Ala Gln Ile Glu Asn Ile Ile Phe Phe Thr Pro Arg Tyr Phe Thr Pro Ala Gln Ile Glu Asn Ile Ile Phe Phe Thr Pro Arg Tyr
355 360 365 355 360 365
Phe Glu Lys Asp Asn Arg Leu His Ala Thr Gln Tyr Arg Leu Met Gln Phe Glu Lys Asp Asn Arg Leu His Ala Thr Gln Tyr Arg Leu Met Gln
370 375 380 370 375 380
Gly Leu Ile Glu Leu Gly Tyr Leu Asp Ala Glu Lys Ser Val Thr His Gly Leu Ile Glu Leu Gly Tyr Leu Asp Ala Glu Lys Ser Val Thr His
385 390 395 400 385 390 395 400
Phe Glu Ile Met Gln Leu Leu Thr Lys Glu Phe Glu Ile Met Gln Leu Leu Thr Lys Glu
405 410 405 410
<210> 52<210> 52
<211> 406<211> 406
<212> Белок<212> Protein
<213> Haemophilus parahaemolyticus<213> Haemophilus parahaemolyticus
<400> 52<400> 52
Met Thr Glu Gln Tyr Ile Lys Asn Val Glu Val Tyr Leu Asp Tyr Ala Met Thr Glu Gln Tyr Ile Lys Asn Val Glu Val Tyr Leu Asp Tyr Ala
1 5 10 15 1 5 10 15
Thr Ile Pro Thr Leu Asn Tyr Phe Tyr His Phe Thr Glu Asn Lys Asp Thr Ile Pro Thr Leu Asn Tyr Phe Tyr His Phe Thr Glu Asn Lys Asp
20 25 30 20 25 30
Asp Ile Ala Thr Ile Arg Leu Phe Gly Leu Gly Arg Phe Asn Ile Ser Asp Ile Ala Thr Ile Arg Leu Phe Gly Leu Gly Arg Phe Asn Ile Ser
35 40 45 35 40 45
Lys Ser Ile Ile Glu Ser Tyr Pro Glu Gly Ile Ile Arg Tyr Cys Pro Lys Ser Ile Ile Glu Ser Tyr Pro Glu Gly Ile Ile Arg Tyr Cys Pro
50 55 60 50 55 60
Ile Ile Phe Glu Asp Gln Thr Ala Phe Gln Gln Leu Phe Ile Thr Leu Ile Ile Phe Glu Asp Gln Thr Ala Phe Gln Gln Leu Phe Ile Thr Leu
65 70 75 80 65 70 75 80
Leu Thr Glu Asp Ser Phe Cys Gln Tyr Arg Phe Asn Phe His Ile Asn Leu Thr Glu Asp Ser Phe Cys Gln Tyr Arg Phe Asn Phe His Ile Asn
85 90 95 85 90 95
Leu Phe His Ser Trp Lys Met Leu Ile Pro Leu Leu His Ile Ile Trp Leu Phe His Ser Trp Lys Met Leu Ile Pro Leu Leu His Ile Ile Trp
100 105 110 100 105 110
Gln Phe Lys His Lys Val Leu Asp Ile Lys Leu Asn Phe Tyr Asp Asp Gln Phe Lys His Lys Val Leu Asp Ile Lys Leu Asn Phe Tyr Asp Asp
115 120 125 115 120 125
Gly Ser Glu Gly Leu Val Thr Leu Ser Lys Ile Glu Gln Asn Tyr Ser Gly Ser Glu Gly Leu Val Thr Leu Ser Lys Ile Glu Gln Asn Tyr Ser
130 135 140 130 135 140
Ser Glu Ile Leu Gln Lys Ile Ile Asp Ile Asp Ser Gln Ser Phe Tyr Ser Glu Ile Leu Gln Lys Ile Ile Asp Ile Asp Ser Gln Ser Phe Tyr
145 150 155 160 145 150 155 160
Ala Asp Lys Leu Ser Phe Leu Asp Glu Asp Ile Ala Arg Tyr Leu Trp Ala Asp Lys Leu Ser Phe Leu Asp Glu Asp Ile Ala Arg Tyr Leu Trp
165 170 175 165 170 175
Asn Ser Leu Phe Glu Ser His Tyr Tyr Leu Leu Asn Asp Phe Leu Leu Asn Ser Leu Phe Glu Ser His Tyr Tyr Leu Leu Asn Asp Phe Leu Leu
180 185 190 180 185 190
Lys Asn Glu Lys Leu Ser Leu Leu Lys Asn Ser Ile Lys Tyr Cys His Lys Asn Glu Lys Leu Ser Leu Leu Lys Asn Ser Ile Lys Tyr Cys His
195 200 205 195 200 205
Ile Met Asp Leu Glu Arg Tyr Leu Gln Phe Thr Gln Glu Glu Lys Asp Ile Met Asp Leu Glu Arg Tyr Leu Gln Phe Thr Gln Glu Glu Lys Asp
210 215 220 210 215 220
Phe Phe Asn Glu Leu Leu Gly Ile Asn Ile Gln Ser Leu Glu Asp Lys Phe Phe Asn Glu Leu Leu Gly Ile Asn Ile Gln Ser Leu Glu Asp Lys
225 230 235 240 225 230 235 240
Ile Lys Ile Phe Gln Gln Lys Lys Thr Phe Ile Phe Thr Gly Thr Thr Ile Lys Ile Phe Gln Gln Lys Lys Thr Phe Ile Phe Thr Gly Thr Thr
245 250 255 245 250 255
Ile Phe Ser Leu Pro Lys Glu Glu Glu Glu Thr Leu Tyr Arg Leu His Ile Phe Ser Leu Pro Lys Glu Glu Glu Glu Thr Leu Tyr Arg Leu His
260 265 270 260 265 270
Leu Asn Ala Ile Leu Asn Tyr Ile His Pro Asn Gly Lys Tyr Phe Ile Leu Asn Ala Ile Leu Asn Tyr Ile His Pro Asn Gly Lys Tyr Phe Ile
275 280 285 275 280 285
Gly Asp Gly Phe Thr Leu Val Ile Lys Gly His Pro His Gln Lys Glu Gly Asp Gly Phe Thr Leu Val Ile Lys Gly His Pro His Gln Lys Glu
290 295 300 290 295 300
Met Asn Ser Arg Leu Glu Lys Ser Phe Glu Lys Ala Val Met Leu Pro Met Asn Ser Arg Leu Glu Lys Ser Phe Glu Lys Ala Val Met Leu Pro
305 310 315 320 305 310 315 320
Asp Asn Ile Pro Phe Glu Ile Leu Tyr Leu Ile Gly Cys Lys Pro Asp Asp Asn Ile Pro Phe Glu Ile Leu Tyr Leu Ile Gly Cys Lys Pro Asp
325 330 335 325 330 335
Lys Ile Gly Gly Phe Val Ser Thr Ser Tyr Phe Ser Cys Asp Lys Lys Lys Ile Gly Gly Phe Val Ser Thr Ser Tyr Phe Ser Cys Asp Lys Lys
340 345 350 340 345 350
Asn Ile Ala Asp Leu Leu Phe Ile Ser Ala Arg Gln Glu Glu Val Arg Asn Ile Ala Asp Leu Leu Phe Ile Ser Ala Arg Gln Glu Glu Val Arg
355 360 365 355 360 365
Lys Asn Asp Tyr Leu Phe Asn Ile Gln Tyr Gln Leu Arg Asp Met Met Lys Asn Asp Tyr Leu Phe Asn Ile Gln Tyr Gln Leu Arg Asp Met Met
370 375 380 370 375 380
Ile Lys Thr Gly Phe Ile Gln Glu Glu Lys Thr His Phe Tyr Ser Asp Ile Lys Thr Gly Phe Ile Gln Glu Glu Lys Thr His Phe Tyr Ser Asp
385 390 395 400 385 390 395 400
Ile Pro Ile Phe Ile Ser Ile Pro Ile Phe Ile Ser
405 405
<210> 53<210> 53
<211> 300<211> 300
<212> Белок<212> Protein
<213> Haemophilus somnus<213>Haemophilus somnus
<400> 53<400> 53
Met Lys Tyr Asn Ile Lys Ile Lys Ala Ile Val Ile Val Ser Ser Leu Met Lys Tyr Asn Ile Lys Ile Lys Ala Ile Val Ile Val Ser Ser Leu
1 5 10 15 1 5 10 15
Arg Met Leu Leu Ile Phe Leu Met Leu Asn Lys Tyr His Leu Asp Glu Arg Met Leu Leu Ile Phe Leu Met Leu Asn Lys Tyr His Leu Asp Glu
20 25 30 20 25 30
Val Leu Phe Val Phe Asn Glu Gly Phe Glu Leu His Lys Lys Tyr Lys Val Leu Phe Val Phe Asn Glu Gly Phe Glu Leu His Lys Lys Tyr Lys
35 40 45 35 40 45
Ile Lys His Tyr Val Ala Ile Lys Lys Lys Ile Thr Lys Phe Trp Arg Ile Lys His Tyr Val Ala Ile Lys Lys Lys Ile Thr Lys Phe Trp Arg
50 55 60 50 55 60
Leu Tyr Tyr Lys Leu Tyr Phe Tyr Arg Phe Lys Ile Asp Arg Ile Pro Leu Tyr Tyr Lys Leu Tyr Phe Tyr Arg Phe Lys Ile Asp Arg Ile Pro
65 70 75 80 65 70 75 80
Val Tyr Gly Ala Asp His Leu Gly Trp Thr Asp Tyr Phe Leu Lys Tyr Val Tyr Gly Ala Asp His Leu Gly Trp Thr Asp Tyr Phe Leu Lys Tyr
85 90 95 85 90 95
Phe Asp Phe Tyr Leu Ile Glu Asp Gly Ile Ala Asn Phe Ser Pro Lys Phe Asp Phe Tyr Leu Ile Glu Asp Gly Ile Ala Asn Phe Ser Pro Lys
100 105 110 100 105 110
Arg Tyr Glu Ile Asn Leu Thr Arg Asn Ile Pro Val Phe Gly Phe His Arg Tyr Glu Ile Asn Leu Thr Arg Asn Ile Pro Val Phe Gly Phe His
115 120 125 115 120 125
Lys Thr Val Lys Lys Ile Tyr Leu Thr Ser Leu Glu Asn Val Pro Ser Lys Thr Val Lys Lys Ile Tyr Leu Thr Ser Leu Glu Asn Val Pro Ser
130 135 140 130 135 140
Asp Ile Arg His Lys Val Glu Leu Ile Ser Leu Glu His Leu Trp Lys Asp Ile Arg His Lys Val Glu Leu Ile Ser Leu Glu His Leu Trp Lys
145 150 155 160 145 150 155 160
Thr Arg Thr Ala Gln Glu Gln His Asn Ile Leu Asp Phe Phe Ala Phe Thr Arg Thr Ala Gln Glu Gln His Asn Ile Leu Asp Phe Phe Ala Phe
165 170 175 165 170 175
Asn Leu Asp Ser Leu Ile Ser Leu Lys Met Lys Lys Tyr Ile Leu Phe Asn Leu Asp Ser Leu Ile Ser Leu Lys Met Lys Lys Tyr Ile Leu Phe
180 185 190 180 185 190
Thr Gln Cys Leu Ser Glu Asp Arg Val Ile Ser Glu Gln Glu Lys Ile Thr Gln Cys Leu Ser Glu Asp Arg Val Ile Ser Glu Gln Glu Lys Ile
195 200 205 195 200 205
Ala Ile Tyr Gln His Ile Ile Lys Asn Tyr Asp Glu Arg Leu Leu Val Ala Ile Tyr Gln His Ile Ile Lys Asn Tyr Asp Glu Arg Leu Leu Val
210 215 220 210 215 220
Ile Lys Pro His Pro Arg Glu Thr Thr Asp Tyr Gln Lys Tyr Phe Glu Ile Lys Pro His Pro Arg Glu Thr Thr Asp Tyr Gln Lys Tyr Phe Glu
225 230 235 240 225 230 235 240
Asn Val Phe Val Tyr Gln Asp Val Val Pro Ser Glu Leu Phe Glu Leu Asn Val Phe Val Tyr Gln Asp Val Val Pro Ser Glu Leu Phe Glu Leu
245 250 255 245 250 255
Leu Asp Val Asn Phe Glu Arg Val Ile Thr Leu Phe Ser Thr Ala Val Leu Asp Val Asn Phe Glu Arg Val Ile Thr Leu Phe Ser Thr Ala Val
260 265 270 260 265 270
Phe Lys Tyr Asp Arg Asn Ile Val Asp Phe Tyr Gly Thr Arg Ile His Phe Lys Tyr Asp Arg Asn Ile Val Asp Phe Tyr Gly Thr Arg Ile His
275 280 285 275 280 285
Asp Lys Ile Tyr Gln Trp Phe Gly Asp Ile Lys Phe Asp Lys Ile Tyr Gln Trp Phe Gly Asp Ile Lys Phe
290 295 300 290 295 300
<210> 54<210> 54
<211> 381<211> 381
<212> Белок<212> Protein
<213> Vibrio harveyi<213> Vibrio harveyi
<400> 54<400> 54
Met Asp Ser Ser Pro Glu Asn Thr Ser Ser Thr Leu Glu Ile Tyr Ile Met Asp Ser Ser Pro Glu Asn Thr Ser Ser Thr Leu Glu Ile Tyr Ile
1 5 10 15 1 5 10 15
Asp Ser Ala Thr Leu Pro Ser Leu Gln His Met Val Lys Ile Ile Asp Asp Ser Ala Thr Leu Pro Ser Leu Gln His Met Val Lys Ile Ile Asp
20 25 30 20 25 30
Glu Gln Ser Gly Asn Lys Lys Leu Ile Asn Trp Lys Arg Tyr Pro Ile Glu Gln Ser Gly Asn Lys Lys Leu Ile Asn Trp Lys Arg Tyr Pro Ile
35 40 45 35 40 45
Asp Asp Glu Leu Leu Leu Asp Lys Ile Asn Ala Leu Ser Phe Ser Asp Asp Asp Glu Leu Leu Leu Asp Lys Ile Asn Ala Leu Ser Phe Ser Asp
50 55 60 50 55 60
Thr Thr Asp Leu Thr Arg Tyr Met Glu Ser Ile Leu Leu Ile Gly Asp Thr Thr Asp Leu Thr Arg Tyr Met Glu Ser Ile Leu Leu Ile Gly Asp
65 70 75 80 65 70 75 80
Ile Lys Arg Val Val Ile Asn Gly Asn Ser Leu Ser Asn Tyr Asn Ile Ile Lys Arg Val Val Ile Asn Gly Asn Ser Leu Ser Asn Tyr Asn Ile
85 90 95 85 90 95
Val Gly Val Met Arg Ser Ile Asn Ala Leu Gly Leu Asp Leu Asp Val Val Gly Val Met Arg Ser Ile Asn Ala Leu Gly Leu Asp Leu Asp Val
100 105 110 100 105 110
Glu Ile Asn Phe Tyr Asp Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr Glu Ile Asn Phe Tyr Asp Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr
115 120 125 115 120 125
Asn Phe Ser Gln Leu Pro Glu Ala Glu Arg Glu Leu Leu Val Ser Met Asn Phe Ser Gln Leu Pro Glu Ala Glu Arg Glu Leu Leu Val Ser Met
130 135 140 130 135 140
Ser Lys Asn Asn Ile Leu Ala Ala Val Asn Gly Ile Gly Ser Tyr Asp Ser Lys Asn Asn Ile Leu Ala Ala Val Asn Gly Ile Gly Ser Tyr Asp
145 150 155 160 145 150 155 160
Ser Gly Ser Pro Glu Asn Ile Tyr Gly Phe Ala Gln Ile Tyr Pro Ala Ser Gly Ser Pro Glu Asn Ile Tyr Gly Phe Ala Gln Ile Tyr Pro Ala
165 170 175 165 170 175
Thr Tyr His Met Leu Arg Ala Asp Ile Phe Asp Thr Asp Leu Glu Ile Thr Tyr His Met Leu Arg Ala Asp Ile Phe Asp Thr Asp Leu Glu Ile
180 185 190 180 185 190
Gly Leu Ile Arg Asp Ile Leu Gly Asp Asn Val Lys Gln Met Lys Trp Gly Leu Ile Arg Asp Ile Leu Gly Asp Asn Val Lys Gln Met Lys Trp
195 200 205 195 200 205
Gly Gln Phe Leu Gly Phe Asn Glu Glu Gln Lys Glu Leu Phe Tyr Gln Gly Gln Phe Leu Gly Phe Asn Glu Glu Gln Lys Glu Leu Phe Tyr Gln
210 215 220 210 215 220
Leu Thr Ser Phe Asn Pro Asp Lys Ile Gln Ala Gln Tyr Lys Glu Ser Leu Thr Ser Phe Asn Pro Asp Lys Ile Gln Ala Gln Tyr Lys Glu Ser
225 230 235 240 225 230 235 240
Pro Asn Lys Asn Phe Val Phe Val Gly Thr Asn Ser Arg Ser Ala Thr Pro Asn Lys Asn Phe Val Phe Val Gly Thr Asn Ser Arg Ser Ala Thr
245 250 255 245 250 255
Ala Glu Gln Gln Ile Asn Ile Ile Lys Glu Ala Lys Lys Leu Asp Ser Ala Glu Gln Gln Ile Asn Ile Ile Lys Glu Ala Lys Lys Leu Asp Ser
260 265 270 260 265 270
Glu Ile Ile Pro Asn Ser Ile Asp Gly Tyr Asp Leu Phe Phe Lys Gly Glu Ile Ile Pro Asn Ser Ile Asp Gly Tyr Asp Leu Phe Phe Lys Gly
275 280 285 275 280 285
His Pro Ser Ala Thr Tyr Asn Gln Gln Ile Val Asp Ala His Asp Met His Pro Ser Ala Thr Tyr Asn Gln Gln Ile Val Asp Ala His Asp Met
290 295 300 290 295 300
Thr Glu Ile Tyr Asn Arg Thr Pro Phe Glu Val Leu Ala Met Thr Ser Thr Glu Ile Tyr Asn Arg Thr Pro Phe Glu Val Leu Ala Met Thr Ser
305 310 315 320 305 310 315 320
Ser Leu Pro Asp Ala Val Gly Gly Met Gly Ser Ser Leu Phe Phe Ser Ser Leu Pro Asp Ala Val Gly Gly Met Gly Ser Ser Leu Phe Phe Ser
325 330 335 325 330 335
Leu Pro Lys Thr Val Glu Thr Lys Phe Ile Phe Tyr Lys Ser Gly Thr Leu Pro Lys Thr Val Glu Thr Lys Phe Ile Phe Tyr Lys Ser Gly Thr
340 345 350 340 345 350
Asp Ile Glu Ser Asn Ala Leu Ile Gln Val Met Leu Lys Leu Gly Ile Asp Ile Glu Ser Asn Ala Leu Ile Gln Val Met Leu Lys Leu Gly Ile
355 360 365 355 360 365
Ile Thr Asp Glu Lys Val Arg Phe Thr Thr Asp Ile Lys Ile Thr Asp Glu Lys Val Arg Phe Thr Thr Asp Ile Lys
370 375 380 370 375 380
<210> 55<210> 55
<211> 483<211> 483
<212> Белок<212> Protein
<213> Alistipes sp.<213> Alistipes sp.
<400> 55<400> 55
Met Ala Ser Cys Ser Asp Asp Asp Lys Glu Gln Thr Gly Phe Gln Ile Met Ala Ser Cys Ser Asp Asp Asp Lys Glu Gln Thr Gly Phe Gln Ile
1 5 10 15 1 5 10 15
Asp Asp Gly Ser Gly Phe Leu Ser Leu Asp Ala Ala Ala Arg Ser Gly Asp Asp Gly Ser Gly Phe Leu Ser Leu Asp Ala Ala Ala Arg Ser Gly
20 25 30 20 25 30
Ser Ile Ala Ile Thr Ala Asn Asn Ser Trp Ser Val Thr Gln Asp Lys Ser Ile Ala Ile Thr Ala Asn Asn Ser Trp Ser Val Thr Gln Asp Lys
35 40 45 35 40 45
Asp Ser Glu Trp Leu Thr Leu Ser Thr Thr Ser Gly Ala Ala Gly Arg Asp Ser Glu Trp Leu Thr Leu Ser Thr Thr Ser Gly Ala Ala Gly Arg
50 55 60 50 55 60
Thr Glu Ile Gly Ile Met Leu Glu Ala Asn Pro Gly Glu Ala Arg Asn Thr Glu Ile Gly Ile Met Leu Glu Ala Asn Pro Gly Glu Ala Arg Asn
65 70 75 80 65 70 75 80
Ala Gly Leu Thr Phe Asn Ser Gly Gly Arg Thr Tyr Pro Phe Val Ile Ala Gly Leu Thr Phe Asn Ser Gly Gly Arg Thr Tyr Pro Phe Val Ile
85 90 95 85 90 95
Thr Gln Ser Ala His Val Thr Ala Asp Phe Asp Asp Ala Asp His Cys Thr Gln Ser Ala His Val Thr Ala Asp Phe Asp Asp Ala Asp His Cys
100 105 110 100 105 110
Phe Tyr Ile Thr Phe Gly Thr Leu Pro Thr Leu Tyr Ala Gly Leu His Phe Tyr Ile Thr Phe Gly Thr Leu Pro Thr Leu Tyr Ala Gly Leu His
115 120 125 115 120 125
Val Leu Ser His Asp Lys Pro Ser Tyr Val Phe Phe Gln Arg Ser Gln Val Leu Ser His Asp Lys Pro Ser Tyr Val Phe Phe Gln Arg Ser Gln
130 135 140 130 135 140
Thr Phe Arg Pro Glu Glu Phe Pro Ala His Ala Glu Val Thr Ile Ala Thr Phe Arg Pro Glu Glu Phe Pro Ala His Ala Glu Val Thr Ile Ala
145 150 155 160 145 150 155 160
Ala Asp Pro Ser Ala Asn Ala Thr Asp Glu Asp Met Glu Arg Met Arg Ala Asp Pro Ser Ala Asn Ala Thr Asp Glu Asp Met Glu Arg Met Arg
165 170 175 165 170 175
Thr Ala Met Lys Gln Gln Ile Leu Lys Ile Asn Val Glu Asp Pro Thr Thr Ala Met Lys Gln Gln Ile Leu Lys Ile Asn Val Glu Asp Pro Thr
180 185 190 180 185 190
Ala Val Phe Gly Leu Tyr Val Asp Asp Leu Arg Cys Gly Ile Gly Tyr Ala Val Phe Gly Leu Tyr Val Asp Asp Leu Arg Cys Gly Ile Gly Tyr
195 200 205 195 200 205
Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Thr Arg Val Lys Val Ser Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Thr Arg Val Lys Val Ser
210 215 220 210 215 220
Met Leu Ser Asp Gly Thr Gly Thr Tyr Asn Asn Phe Tyr Asn Tyr Phe Met Leu Ser Asp Gly Thr Gly Thr Tyr Asn Asn Phe Tyr Asn Tyr Phe
225 230 235 240 225 230 235 240
Gly Asp Pro Ala Thr Ala Glu Gln Asn Trp Glu Asn Tyr Ala Ala Gln Gly Asp Pro Ala Thr Ala Glu Gln Asn Trp Glu Asn Tyr Ala Ala Gln
245 250 255 245 250 255
Val Glu Ala Leu Asp Trp Gln His Gly Gly Arg Phe Pro Glu Thr Arg Val Glu Ala Leu Asp Trp Gln His Gly Gly Arg Phe Pro Glu Thr Arg
260 265 270 260 265 270
Met Pro Asp Gly Phe Asp Phe Tyr Glu Trp Pro Tyr Tyr Leu Ala Thr Met Pro Asp Gly Phe Asp Phe Tyr Glu Trp Pro Tyr Tyr Leu Ala Thr
275 280 285 275 280 285
Arg Pro Asn Tyr Arg Leu Val Leu Gln Asp Asp Asp Leu Leu Glu Ala Arg Pro Asn Tyr Arg Leu Val Leu Gln Asp Asp Asp Leu Leu Glu Ala
290 295 300 290 295 300
Thr Ser Pro Phe Met Thr Glu Arg Leu Gln Gln Met Arg Thr Glu Ser Thr Ser Pro Phe Met Thr Glu Arg Leu Gln Gln Met Arg Thr Glu Ser
305 310 315 320 305 310 315 320
Lys Gln Pro Tyr Glu Leu Leu Ala Ser Leu Pro Ala Glu Ala Arg Gln Lys Gln Pro Tyr Glu Leu Leu Ala Ser Leu Pro Ala Glu Ala Arg Gln
325 330 335 325 330 335
Arg Phe Phe Arg Met Ala Gly Phe Asp Tyr Asp Ala Phe Ala Ala Leu Arg Phe Phe Arg Met Ala Gly Phe Asp Tyr Asp Ala Phe Ala Ala Leu
340 345 350 340 345 350
Phe Asp Ala Ser Pro Lys Lys Asn Leu Val Ile Ile Gly Thr Ser His Phe Asp Ala Ser Pro Lys Lys Asn Leu Val Ile Ile Gly Thr Ser His
355 360 365 355 360 365
Thr Ser Glu Glu Ser Glu Ala Gln Gln Ala Ala Tyr Val Glu Arg Ile Thr Ser Glu Glu Ser Glu Ala Gln Gln Ala Ala Tyr Val Glu Arg Ile
370 375 380 370 375 380
Ile Gly Asp Tyr Gly Thr Ala Tyr Asp Ile Phe Phe Lys Pro His Pro Ile Gly Asp Tyr Gly Thr Ala Tyr Asp Ile Phe Phe Lys Pro His Pro
385 390 395 400 385 390 395 400
Ala Asp Ser Ser Ser Ser Asn Tyr Glu Glu Arg Phe Glu Gly Leu Thr Ala Asp Ser Ser Ser Ser Asn Tyr Glu Glu Arg Phe Glu Gly Leu Thr
405 410 415 405 410 415
Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe Val Trp Ser Leu Leu Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe Val Trp Ser Leu Leu
420 425 430 420 425 430
Asp Lys Val Asp Leu Ile Gly Gly Tyr Ser Ser Thr Val Phe Leu Thr Asp Lys Val Asp Leu Ile Gly Gly Tyr Ser Ser Thr Val Phe Leu Thr
435 440 445 435 440 445
Val Pro Val Glu Lys Thr Gly Phe Ile Phe Ala Ala Asn Ala Glu Ser Val Pro Val Glu Lys Thr Gly Phe Ile Phe Ala Ala Asn Ala Glu Ser
450 455 460 450 455 460
Leu Pro Arg Pro Leu Asn Val Leu Phe Arg Asn Ala Glu His Val Arg Leu Pro Arg Pro Leu Asn Val Leu Phe Arg Asn Ala Glu His Val Arg
465 470 475 480 465 470 475 480
Trp Ile Gln Trp Ile Gln
<210> 56<210> 56
<211> 483<211> 483
<212> Белок<212> Protein
<213> Alistipes shahii<213> Alistipes shahii
<400> 56<400> 56
Met Asp Asp Gly Thr Pro Ser Val Ser Ile Asn Gly Gly Thr Asp Phe Met Asp Asp Gly Thr Pro Ser Val Ser Ile Asn Gly Gly Thr Asp Phe
1 5 10 15 1 5 10 15
Leu Ser Leu Asp His Leu Ala Arg Ser Gly Lys Ile Thr Val Asn Ala Leu Ser Leu Asp His Leu Ala Arg Ser Gly Lys Ile Thr Val Asn Ala
20 25 30 20 25 30
Pro Ala Pro Trp Ser Val Thr Leu Ala Pro Glu Asn Tyr Gly Gln Asp Pro Ala Pro Trp Ser Val Thr Leu Ala Pro Glu Asn Tyr Gly Gln Asp
35 40 45 35 40 45
Glu Lys Pro Asp Trp Leu Thr Leu Ser Ala Glu Glu Gly Pro Ala Gly Glu Lys Pro Asp Trp Leu Thr Leu Ser Ala Glu Glu Gly Pro Ala Gly
50 55 60 50 55 60
Tyr Ser Glu Ile Asp Val Thr Phe Ala Glu Asn Pro Gly Pro Ala Arg Tyr Ser Glu Ile Asp Val Thr Phe Ala Glu Asn Pro Gly Pro Ala Arg
65 70 75 80 65 70 75 80
Ser Ala Ser Leu Leu Phe Ser Cys Asp Gly Lys Thr Leu Ala Phe Thr Ser Ala Ser Leu Leu Phe Ser Cys Asp Gly Lys Thr Leu Ala Phe Thr
85 90 95 85 90 95
Val Ser Gln Ser Ala Gly Gly Thr Gly Phe Asp Ala Pro Asp Tyr Tyr Val Ser Gln Ser Ala Gly Gly Thr Gly Phe Asp Ala Pro Asp Tyr Tyr
100 105 110 100 105 110
Phe Tyr Ile Ser Val Gly Thr Met Pro Thr Leu Tyr Ser Gly Leu His Phe Tyr Ile Ser Val Gly Thr Met Pro Thr Leu Tyr Ser Gly Leu His
115 120 125 115 120 125
Leu Leu Ser His Asp Lys Pro Ser Tyr Val Ser Tyr Glu Arg Ala Ser Leu Leu Ser His Asp Lys Pro Ser Tyr Val Ser Tyr Glu Arg Ala Ser
130 135 140 130 135 140
Thr Phe Asp Ala Ala Glu Phe Pro Asp Arg Ala Phe Val Tyr Pro Val Thr Phe Asp Ala Ala Glu Phe Pro Asp Arg Ala Phe Val Tyr Pro Val
145 150 155 160 145 150 155 160
Ala Asp Pro Thr Gly His Ala Thr Asn Glu Glu Leu Arg Ala Met Ser Ala Asp Pro Thr Gly His Ala Thr Asn Glu Glu Leu Arg Ala Met Ser
165 170 175 165 170 175
Glu Ala Met Lys Arg Arg Ile Leu Glu Ile Asn Ala Glu Asp Pro Thr Glu Ala Met Lys Arg Arg Ile Leu Glu Ile Asn Ala Glu Asp Pro Thr
180 185 190 180 185 190
Ala Val Phe Gly Leu Trp Val Asp Asp Leu Arg Cys Arg Leu Gly Tyr Ala Val Phe Gly Leu Trp Val Asp Asp Leu Arg Cys Arg Leu Gly Tyr
195 200 205 195 200 205
Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Ala Arg Val Lys Val Thr Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Ala Arg Val Lys Val Thr
210 215 220 210 215 220
Met Leu Ser Asp Gly Thr Ala Thr Tyr Asn Asn Phe His Asn Tyr Phe Met Leu Ser Asp Gly Thr Ala Thr Tyr Asn Asn Phe His Asn Tyr Phe
225 230 235 240 225 230 235 240
Gly Asp Ala Ala Thr Ala Glu Gln Asn Trp Asn Asp Tyr Ala Ala Glu Gly Asp Ala Ala Thr Ala Glu Gln Asn Trp Asn Asp Tyr Ala Ala Glu
245 250 255 245 250 255
Val Glu Ala Leu Asp Trp Asn His Gly Gly Arg Tyr Pro Glu Thr Arg Val Glu Ala Leu Asp Trp Asn His Gly Gly Arg Tyr Pro Glu Thr Arg
260 265 270 260 265 270
Ala Pro Glu Glu Phe Ala Ser Tyr Thr Trp Pro Tyr Tyr Leu Ser Thr Ala Pro Glu Glu Phe Ala Ser Tyr Thr Trp Pro Tyr Tyr Leu Ser Thr
275 280 285 275 280 285
Arg Pro Asp Tyr Arg Leu Met Leu Gln Asn Ser Ser Leu Met Glu Ser Arg Pro Asp Tyr Arg Leu Met Leu Gln Asn Ser Ser Leu Met Glu Ser
290 295 300 290 295 300
Ser Cys Pro Phe Ile Ala Asp Arg Leu Ala Ala Met Lys Met Glu Ser Ser Cys Pro Phe Ile Ala Asp Arg Leu Ala Ala Met Lys Met Glu Ser
305 310 315 320 305 310 315 320
Val Gln Pro Tyr Glu Leu Leu Thr Ala Leu Pro Glu Ala Ser Lys Gln Val Gln Pro Tyr Glu Leu Leu Thr Ala Leu Pro Glu Ala Ser Lys Gln
325 330 335 325 330 335
Gln Phe Tyr Arg Met Ala Lys Phe Asp Tyr Ala Arg Phe Ala Gly Leu Gln Phe Tyr Arg Met Ala Lys Phe Asp Tyr Ala Arg Phe Ala Gly Leu
340 345 350 340 345 350
Phe Asp Leu Ser Pro Lys Lys Asn Leu Ile Ile Ile Gly Thr Ser His Phe Asp Leu Ser Pro Lys Lys Asn Leu Ile Ile Ile Gly Thr Ser His
355 360 365 355 360 365
Ser Ser Ala Ala Ser Glu Gln Gln Gln Ala Ala Tyr Val Glu Arg Ile Ser Ser Ala Ala Ser Glu Gln Gln Gln Ala Ala Tyr Val Glu Arg Ile
370 375 380 370 375 380
Ile Gln Gln Tyr Gly Ser Asp Tyr Asp Ile Phe Phe Lys Pro His Pro Ile Gln Gln Tyr Gly Ser Asp Tyr Asp Ile Phe Phe Lys Pro His Pro
385 390 395 400 385 390 395 400
Ala Asp Ser Ser Ser Ala Gly Tyr Pro Asp Arg Phe Glu Gly Leu Thr Ala Asp Ser Ser Ser Ala Gly Tyr Pro Asp Arg Phe Glu Gly Leu Thr
405 410 415 405 410 415
Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe Val Trp Ala Leu Leu Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe Val Trp Ala Leu Leu
420 425 430 420 425 430
Asp Lys Ile Asp Met Ile Gly Gly Tyr Pro Ser Thr Thr Phe Ile Ser Asp Lys Ile Asp Met Ile Gly Gly Tyr Pro Ser Thr Thr Phe Ile Ser
435 440 445 435 440 445
Val Pro Leu Asp Lys Val Gly Phe Leu Phe Ala Ala Asp Ala Asp Gly Val Pro Leu Asp Lys Val Gly Phe Leu Phe Ala Ala Asp Ala Asp Gly
450 455 460 450 455 460
Leu Val Arg Pro Leu Asn Ile Leu Phe Arg Asp Ala Ala Asn Val Glu Leu Val Arg Pro Leu Asn Ile Leu Phe Arg Asp Ala Ala Asn Val Glu
465 470 475 480 465 470 475 480
Trp Ile Gln Trp Ile Gln
<210> 57<210> 57
<211> 401<211> 401
<212> Белок<212> Protein
<213> Actinobacillus suis<213> Actinobacillus suis
<400> 57<400> 57
Met Glu Arg Thr Pro Gln Leu Gln Ala Val Asp Ile Tyr Ile Asp Phe Met Glu Arg Thr Pro Gln Leu Gln Ala Val Asp Ile Tyr Ile Asp Phe
1 5 10 15 1 5 10 15
Ala Thr Ile Pro Ser Leu Ser Tyr Phe Leu His Phe Leu Lys His Lys Ala Thr Ile Pro Ser Leu Ser Tyr Phe Leu His Phe Leu Lys His Lys
20 25 30 20 25 30
His Asp Asp Gln Arg Leu Arg Leu Phe Ser Leu Ala Arg Phe Glu Met His Asp Asp Gln Arg Leu Arg Leu Phe Ser Leu Ala Arg Phe Glu Met
35 40 45 35 40 45
Pro Gln Thr Leu Ile Glu Gln Tyr Glu Gly Ile Ile Gln Phe Ser Arg Pro Gln Thr Leu Ile Glu Gln Tyr Glu Gly Ile Ile Gln Phe Ser Arg
50 55 60 50 55 60
Asn Val Glu His Asn Val Glu Pro Leu Leu Glu Gln Leu Gln Thr Ile Asn Val Glu His Asn Val Glu Pro Leu Leu Glu Gln Leu Gln Thr Ile
65 70 75 80 65 70 75 80
Leu Ser Gln Glu Gly Lys Gln Phe Glu Leu His Leu His Leu Asn Leu Leu Ser Gln Glu Gly Lys Gln Phe Glu Leu His Leu His Leu Asn Leu
85 90 95 85 90 95
Phe His Ser Phe Glu Met Phe Leu Asn Leu Ser Pro Thr Tyr Thr Gln Phe His Ser Phe Glu Met Phe Leu Asn Leu Ser Pro Thr Tyr Thr Gln
100 105 110 100 105 110
Tyr Lys Glu Lys Ile Ser Lys Ile Val Leu His Leu Tyr Asp Asp Gly Tyr Lys Glu Lys Ile Ser Lys Ile Val Leu His Leu Tyr Asp Asp Gly
115 120 125 115 120 125
Ser Glu Gly Val Met Lys Gln Tyr Gln Leu Gln Lys Ser Ser Ser Leu Ser Glu Gly Val Met Lys Gln Tyr Gln Leu Gln Lys Ser Ser Ser Leu
130 135 140 130 135 140
Val Gln Asp Leu Ala Ala Thr Lys Ala Ser Leu Val Ser Leu Phe Glu Val Gln Asp Leu Ala Ala Thr Lys Ala Ser Leu Val Ser Leu Phe Glu
145 150 155 160 145 150 155 160
Asn Gly Glu Gly Ser Phe Ser Gln Ile Asp Leu Ile Arg Tyr Val Trp Asn Gly Glu Gly Ser Phe Ser Gln Ile Asp Leu Ile Arg Tyr Val Trp
165 170 175 165 170 175
Asn Ala Val Leu Glu Thr His Tyr Tyr Leu Leu Ser Asp His Phe Leu Asn Ala Val Leu Glu Thr His Tyr Tyr Leu Leu Ser Asp His Phe Leu
180 185 190 180 185 190
Leu Asp Glu Lys Leu Gln Pro Leu Lys Ala Glu Leu Gly His Tyr Gln Leu Asp Glu Lys Leu Gln Pro Leu Lys Ala Glu Leu Gly His Tyr Gln
195 200 205 195 200 205
Leu Leu Asn Leu Ser Ala Tyr Gln Tyr Leu Ser Ser Glu Asp Leu Leu Leu Leu Asn Leu Ser Ala Tyr Gln Tyr Leu Ser Ser Glu Asp Leu Leu
210 215 220 210 215 220
Trp Leu Lys Gln Ile Leu Lys Ile Asp Thr Glu Leu Glu Ser Leu Met Trp Leu Lys Gln Ile Leu Lys Ile Asp Thr Glu Leu Glu Ser Leu Met
225 230 235 240 225 230 235 240
Gln Lys Leu Thr Ala Gln Pro Val Tyr Phe Phe Ser Gly Thr Thr Phe Gln Lys Leu Thr Ala Gln Pro Val Tyr Phe Phe Ser Gly Thr Thr Phe
245 250 255 245 250 255
Phe Asn Ile Ser Phe Glu Asp Lys Gln Arg Leu Ala Asn Ile His Ala Phe Asn Ile Ser Phe Glu Asp Lys Gln Arg Leu Ala Asn Ile His Ala
260 265 270 260 265 270
Ile Leu Ile Arg Glu His Leu Asp Pro Asn Ser Gln Leu Phe Ile Gly Ile Leu Ile Arg Glu His Leu Asp Pro Asn Ser Gln Leu Phe Ile Gly
275 280 285 275 280 285
Glu Pro Tyr Leu Phe Val Phe Lys Gly His Pro Asn Ser Pro Glu Ile Glu Pro Tyr Leu Phe Val Phe Lys Gly His Pro Asn Ser Pro Glu Ile
290 295 300 290 295 300
Asn Gln Ala Leu Arg Glu Tyr Tyr Pro Asn Val Ile Phe Leu Pro Glu Asn Gln Ala Leu Arg Glu Tyr Tyr Pro Asn Val Ile Phe Leu Pro Glu
305 310 315 320 305 310 315 320
Asn Ile Pro Phe Glu Ile Leu Thr Leu Leu Gly Phe Ser Pro Gln Lys Asn Ile Pro Phe Glu Ile Leu Thr Leu Leu Gly Phe Ser Pro Gln Lys
325 330 335 325 330 335
Ile Gly Gly Phe Ala Ser Thr Ile His Val Asn Ser Glu Gln Ser Lys Ile Gly Gly Phe Ala Ser Thr Ile His Val Asn Ser Glu Gln Ser Lys
340 345 350 340 345 350
Leu Ala Lys Leu Phe Phe Leu Thr Ser Thr Asp Glu Gln Glu Arg Gln Leu Ala Lys Leu Phe Phe Leu Thr Ser Thr Asp Glu Gln Glu Arg Gln
355 360 365 355 360 365
Leu Ser Asp Gly Tyr Ile Lys Gln Tyr Ala Leu Ala Gln Ala Met Leu Leu Ser Asp Gly Tyr Ile Lys Gln Tyr Ala Leu Ala Gln Ala Met Leu
370 375 380 370 375 380
Glu Met Gln Leu Val Ser Gln Glu Gln Val Tyr Tyr Cys Ser Leu Ser Glu Met Gln Leu Val Ser Gln Glu Gln Val Tyr Tyr Cys Ser Leu Ser
385 390 395 400 385 390 395 400
Ser Ser
<210> 58<210> 58
<211> 401<211> 401
<212> Белок<212> Protein
<213> Actinobacillus capsulatus<213> Actinobacillus capsulatus
<400> 58<400> 58
Met Glu Arg Ile Pro Gln Leu Gln Ala Val Asp Ile Tyr Ile Asp Phe Met Glu Arg Ile Pro Gln Leu Gln Ala Val Asp Ile Tyr Ile Asp Phe
1 5 10 15 1 5 10 15
Ala Thr Ile Pro Ser Leu Ser Tyr Phe Leu His Phe Leu Lys His Lys Ala Thr Ile Pro Ser Leu Ser Tyr Phe Leu His Phe Leu Lys His Lys
20 25 30 20 25 30
His Asp His Gln Arg Leu Arg Leu Phe Ser Leu Ala Arg Phe Glu Met His Asp His Gln Arg Leu Arg Leu Phe Ser Leu Ala Arg Phe Glu Met
35 40 45 35 40 45
Pro Gln Thr Val Ile Glu Gln Tyr Glu Gly Ile Ile Gln Phe Ser Arg Pro Gln Thr Val Ile Glu Gln Tyr Glu Gly Ile Ile Gln Phe Ser Arg
50 55 60 50 55 60
Asn Val Glu His Asn Val Glu Gln Leu Leu Glu Gln Leu Gln Thr Ile Asn Val Glu His Asn Val Glu Gln Leu Leu Glu Gln Leu Gln Thr Ile
65 70 75 80 65 70 75 80
Leu Ser Gln Glu Gly Lys Gln Phe Glu Leu His Leu His Leu Asn Leu Leu Ser Gln Glu Gly Lys Gln Phe Glu Leu His Leu His Leu Asn Leu
85 90 95 85 90 95
Phe His Ser Phe Glu Met Phe Leu Asn Leu Ser Pro Thr Tyr Thr Lys Phe His Ser Phe Glu Met Phe Leu Asn Leu Ser Pro Thr Tyr Thr Lys
100 105 110 100 105 110
Tyr Lys Glu Lys Ile Ser Lys Ile Val Leu His Leu Tyr Asp Asp Gly Tyr Lys Glu Lys Ile Ser Lys Ile Val Leu His Leu Tyr Asp Asp Gly
115 120 125 115 120 125
Ser Glu Gly Val Met Lys Gln Tyr Gln Leu Gln Gln Ser Asn Ser Leu Ser Glu Gly Val Met Lys Gln Tyr Gln Leu Gln Gln Ser Asn Ser Leu
130 135 140 130 135 140
Ala Gln Asp Leu Ala Ser Thr Lys Ala Ser Leu Val Ser Leu Phe Lys Ala Gln Asp Leu Ala Ser Thr Lys Ala Ser Leu Val Ser Leu Phe Lys
145 150 155 160 145 150 155 160
Asn Gly Glu Gly Ala Phe Ser Gln Ile Asp Leu Ile Arg Tyr Val Trp Asn Gly Glu Gly Ala Phe Ser Gln Ile Asp Leu Ile Arg Tyr Val Trp
165 170 175 165 170 175
Asn Ala Val Leu Glu Thr His Tyr Tyr Leu Leu Ser Asp His Phe Leu Asn Ala Val Leu Glu Thr His Tyr Tyr Leu Leu Ser Asp His Phe Leu
180 185 190 180 185 190
Ala His Glu Lys Leu Gln Pro Leu Lys Ile Glu Leu Gly His Tyr Gln Ala His Glu Lys Leu Gln Pro Leu Lys Ile Glu Leu Gly His Tyr Gln
195 200 205 195 200 205
Leu Leu Asn Leu Ser Ala Tyr Gln Tyr Leu Ser Ser Glu Asp Leu Leu Leu Leu Asn Leu Ser Ala Tyr Gln Tyr Leu Ser Ser Glu Asp Leu Leu
210 215 220 210 215 220
Trp Leu Lys Gln Ile Leu Lys Ile Asp Ala Glu Leu Glu Ser Leu Met Trp Leu Lys Gln Ile Leu Lys Ile Asp Ala Glu Leu Glu Ser Leu Met
225 230 235 240 225 230 235 240
His Lys Leu Thr Thr Gln Pro Val Tyr Phe Phe Ser Gly Thr Thr Phe His Lys Leu Thr Thr Gln Pro Val Tyr Phe Phe Ser Gly Thr Thr Phe
245 250 255 245 250 255
Phe Asn Ile Ser Phe Glu Asp Lys Gln Arg Leu Ala Asn Ile His Ala Phe Asn Ile Ser Phe Glu Asp Lys Gln Arg Leu Ala Asn Ile His Ala
260 265 270 260 265 270
Ile Leu Ile Arg Glu His Leu Asp Pro Asn Ser Gln Leu Phe Ile Gly Ile Leu Ile Arg Glu His Leu Asp Pro Asn Ser Gln Leu Phe Ile Gly
275 280 285 275 280 285
Glu Pro Tyr Leu Phe Val Phe Lys Gly His Pro Asn Ser Pro Glu Ile Glu Pro Tyr Leu Phe Val Phe Lys Gly His Pro Asn Ser Pro Glu Ile
290 295 300 290 295 300
Asn Gln Ala Leu Arg Glu Tyr Tyr Pro Asn Ala Ile Phe Leu Pro Glu Asn Gln Ala Leu Arg Glu Tyr Tyr Pro Asn Ala Ile Phe Leu Pro Glu
305 310 315 320 305 310 315 320
Asn Ile Pro Phe Glu Ile Leu Thr Leu Leu Gly Phe Ser Pro Gln Lys Asn Ile Pro Phe Glu Ile Leu Thr Leu Leu Gly Phe Ser Pro Gln Lys
325 330 335 325 330 335
Ile Gly Gly Phe Ala Ser Thr Ile His Val Asn Ser Glu Gln Ser Lys Ile Gly Gly Phe Ala Ser Thr Ile His Val Asn Ser Glu Gln Ser Lys
340 345 350 340 345 350
Leu Ala Lys Leu Phe Phe Leu Thr Ser Thr Asp Glu Gln Glu Arg Asn Leu Ala Lys Leu Phe Phe Leu Thr Ser Thr Asp Glu Gln Glu Arg Asn
355 360 365 355 360 365
Arg Ser Asp Gly Tyr Ile Lys Gln Tyr Ala Leu Ala Gln Ala Met Leu Arg Ser Asp Gly Tyr Ile Lys Gln Tyr Ala Leu Ala Gln Ala Met Leu
370 375 380 370 375 380
Glu Met Gln Leu Val Ser Gln Glu Gln Val Tyr Tyr Cys Ser Leu Ser Glu Met Gln Leu Val Ser Gln Glu Gln Val Tyr Tyr Cys Ser Leu Ser
385 390 395 400 385 390 395 400
Ser Ser
<210> 59<210> 59
<211> 311<211> 311
<212> Белок<212> Protein
<213> Haemophilus somnus<213>Haemophilus somnus
<400> 59<400> 59
Met Phe Arg Glu Asp Asn Met Asn Leu Ile Ile Cys Cys Thr Pro Leu Met Phe Arg Glu Asp Asn Met Asn Leu Ile Ile Cys Cys Thr Pro Leu
1 5 10 15 1 5 10 15
Gln Val Ile Ile Ala Glu Lys Ile Ile Glu Arg Tyr Pro Glu Gln Lys Gln Val Ile Ile Ala Glu Lys Ile Ile Glu Arg Tyr Pro Glu Gln Lys
20 25 30 20 25 30
Phe Tyr Gly Val Met Leu Glu Ser Phe Tyr Asn Asp Lys Phe Asp Phe Phe Tyr Gly Val Met Leu Glu Ser Phe Tyr Asn Asp Lys Phe Asp Phe
35 40 45 35 40 45
Tyr Glu Asn Lys Leu Lys His Leu Cys His Glu Phe Phe Cys Ile Lys Tyr Glu Asn Lys Leu Lys His Leu Cys His Glu Phe Phe Cys Ile Lys
50 55 60 50 55 60
Ile Ala Arg Phe Lys Leu Glu Arg Tyr Lys Asn Leu Leu Ser Leu Leu Ile Ala Arg Phe Lys Leu Glu Arg Tyr Lys Asn Leu Leu Ser Leu Leu
65 70 75 80 65 70 75 80
Lys Ile Lys Asn Lys Thr Phe Asp Arg Val Phe Leu Ala Asn Ile Glu Lys Ile Lys Asn Lys Thr Phe Asp Arg Val Phe Leu Ala Asn Ile Glu
85 90 95 85 90 95
Lys Arg Tyr Ile His Ile Ile Leu Ser Asn Ile Phe Phe Lys Glu Leu Lys Arg Tyr Ile His Ile Ile Leu Ser Asn Ile Phe Phe Lys Glu Leu
100 105 110 100 105 110
Tyr Thr Phe Asp Asp Gly Thr Ala Asn Ile Ala Pro Asn Ser His Leu Tyr Thr Phe Asp Asp Gly Thr Ala Asn Ile Ala Pro Asn Ser His Leu
115 120 125 115 120 125
Tyr Gln Glu Tyr Asp His Ser Leu Lys Lys Arg Ile Thr Asp Ile Leu Tyr Gln Glu Tyr Asp His Ser Leu Lys Lys Arg Ile Thr Asp Ile Leu
130 135 140 130 135 140
Leu Pro Asn His Tyr Asn Ser Asn Lys Val Lys Asn Ile Ser Lys Leu Leu Pro Asn His Tyr Asn Ser Asn Lys Val Lys Asn Ile Ser Lys Leu
145 150 155 160 145 150 155 160
His Tyr Ser Ile Tyr Arg Cys Lys Asn Asn Ile Ile Asp Asn Ile Glu His Tyr Ser Ile Tyr Arg Cys Lys Asn Asn Ile Ile Asp Asn Ile Glu
165 170 175 165 170 175
Tyr Met Pro Leu Phe Asn Leu Glu Lys Lys Tyr Thr Ala Gln Asp Lys Tyr Met Pro Leu Phe Asn Leu Glu Lys Lys Tyr Thr Ala Gln Asp Lys
180 185 190 180 185 190
Ser Ile Ser Ile Leu Leu Gly Gln Pro Ile Phe Tyr Asp Glu Glu Lys Ser Ile Ser Ile Leu Leu Gly Gln Pro Ile Phe Tyr Asp Glu Glu Lys
195 200 205 195 200 205
Asn Ile Arg Leu Ile Lys Glu Val Ile Ala Lys Phe Lys Ile Asp Tyr Asn Ile Arg Leu Ile Lys Glu Val Ile Ala Lys Phe Lys Ile Asp Tyr
210 215 220 210 215 220
Tyr Phe Pro His Pro Arg Glu Asp Tyr Tyr Ile Asp Asn Val Ser Tyr Tyr Phe Pro His Pro Arg Glu Asp Tyr Tyr Ile Asp Asn Val Ser Tyr
225 230 235 240 225 230 235 240
Ile Lys Thr Pro Leu Ile Phe Glu Glu Phe Tyr Ala Glu Arg Ser Ile Ile Lys Thr Pro Leu Ile Phe Glu Glu Phe Tyr Ala Glu Arg Ser Ile
245 250 255 245 250 255
Glu Asn Ser Ile Lys Ile Tyr Thr Phe Phe Ser Ser Ala Val Leu Asn Glu Asn Ser Ile Lys Ile Tyr Thr Phe Phe Ser Ser Ala Val Leu Asn
260 265 270 260 265 270
Ile Val Thr Lys Glu Asn Ile Asp Arg Ile Tyr Ala Leu Lys Pro Lys Ile Val Thr Lys Glu Asn Ile Asp Arg Ile Tyr Ala Leu Lys Pro Lys
275 280 285 275 280 285
Leu Thr Glu Lys Ala Tyr Leu Asp Cys Tyr Asp Ile Leu Lys Asp Phe Leu Thr Glu Lys Ala Tyr Leu Asp Cys Tyr Asp Ile Leu Lys Asp Phe
290 295 300 290 295 300
Gly Ile Lys Val Ile Asp Ile Gly Ile Lys Val Ile Asp Ile
305 310 305 310
<210> 60<210> 60
<211> 399<211> 399
<212> Белок<212> Protein
<213> Haemophilus ducreyi<213> Haemophilus ducreyi
<400> 60<400> 60
Met Leu Ile Gln Gln Asn Leu Glu Ile Tyr Leu Asp Tyr Ala Thr Ile Met Leu Ile Gln Gln Asn Leu Glu Ile Tyr Leu Asp Tyr Ala Thr Ile
1 5 10 15 1 5 10 15
Pro Ser Leu Ala Cys Phe Met His Phe Ile Gln His Lys Asp Asp Val Pro Ser Leu Ala Cys Phe Met His Phe Ile Gln His Lys Asp Asp Val
20 25 30 20 25 30
Asp Ser Ile Arg Leu Phe Gly Leu Ala Arg Phe Asp Ile Pro Gln Ser Asp Ser Ile Arg Leu Phe Gly Leu Ala Arg Phe Asp Ile Pro Gln Ser
35 40 45 35 40 45
Ile Ile Asp Arg Tyr Pro Ala Asn His Leu Phe Tyr His Asn Ile Asp Ile Ile Asp Arg Tyr Pro Ala Asn His Leu Phe Tyr His Asn Ile Asp
50 55 60 50 55 60
Asn Arg Asp Leu Thr Ala Val Leu Asn Gln Leu Ala Asp Ile Leu Ala Asn Arg Asp Leu Thr Ala Val Leu Asn Gln Leu Ala Asp Ile Leu Ala
65 70 75 80 65 70 75 80
Gln Glu Asn Lys Arg Phe Gln Ile Asn Leu His Leu Asn Leu Phe His Gln Glu Asn Lys Arg Phe Gln Ile Asn Leu His Leu Asn Leu Phe His
85 90 95 85 90 95
Ser Ile Asp Leu Phe Phe Ala Ile Tyr Pro Ile Tyr Gln Gln Tyr Gln Ser Ile Asp Leu Phe Phe Ala Ile Tyr Pro Ile Tyr Gln Gln Tyr Gln
100 105 110 100 105 110
His Lys Ile Ser Thr Ile Gln Leu Gln Leu Tyr Asp Asp Gly Ser Glu His Lys Ile Ser Thr Ile Gln Leu Gln Leu Tyr Asp Asp Gly Ser Glu
115 120 125 115 120 125
Gly Ile Val Thr Gln His Ser Leu Cys Lys Ile Ala Asp Leu Glu Gln Gly Ile Val Thr Gln His Ser Leu Cys Lys Ile Ala Asp Leu Glu Gln
130 135 140 130 135 140
Leu Ile Leu Gln His Lys Asn Val Leu Leu Glu Leu Leu Thr Lys Gly Leu Ile Leu Gln His Lys Asn Val Leu Leu Glu Leu Leu Thr Lys Gly
145 150 155 160 145 150 155 160
Thr Ala Asn Val Pro Asn Pro Thr Leu Leu Arg Tyr Leu Trp Asn Asn Thr Ala Asn Val Pro Asn Pro Thr Leu Leu Arg Tyr Leu Trp Asn Asn
165 170 175 165 170 175
Ile Ile Asp Ser Gln Phe His Leu Ile Ser Asp His Phe Leu Gln His Ile Ile Asp Ser Gln Phe His Leu Ile Ser Asp His Phe Leu Gln His
180 185 190 180 185 190
Pro Lys Leu Gln Pro Leu Lys Arg Leu Leu Lys Arg Tyr Thr Ile Leu Pro Lys Leu Gln Pro Leu Lys Arg Leu Leu Lys Arg Tyr Thr Ile Leu
195 200 205 195 200 205
Asp Phe Thr Cys Tyr Pro Arg Phe Asn Ala Glu Gln Lys Gln Leu Leu Asp Phe Thr Cys Tyr Pro Arg Phe Asn Ala Glu Gln Lys Gln Leu Leu
210 215 220 210 215 220
Lys Glu Ile Leu His Ile Ser Asn Glu Leu Glu Asn Leu Leu Lys Leu Lys Glu Ile Leu His Ile Ser Asn Glu Leu Glu Asn Leu Leu Lys Leu
225 230 235 240 225 230 235 240
Leu Lys Gln His Asn Thr Phe Leu Phe Thr Gly Thr Thr Ala Phe Asn Leu Lys Gln His Asn Thr Phe Leu Phe Thr Gly Thr Thr Ala Phe Asn
245 250 255 245 250 255
Leu Asp Gln Glu Lys Leu Asp Leu Leu Thr Gln Leu His Ile Leu Leu Leu Asp Gln Glu Lys Leu Asp Leu Leu Thr Gln Leu His Ile Leu Leu
260 265 270 260 265 270
Leu Asn Glu His Gln Asn Pro His Ser Thr His Tyr Ile Gly Asn Asn Leu Asn Glu His Gln Asn Pro His Ser Thr His Tyr Ile Gly Asn Asn
275 280 285 275 280 285
Tyr Leu Leu Leu Ile Lys Gly His Ala Asn Ser Pro Ala Leu Asn His Tyr Leu Leu Leu Ile Lys Gly His Ala Asn Ser Pro Ala Leu Asn His
290 295 300 290 295 300
Thr Leu Ala Leu His Phe Pro Asp Ala Ile Phe Leu Pro Ala Asn Ile Thr Leu Ala Leu His Phe Pro Asp Ala Ile Phe Leu Pro Ala Asn Ile
305 310 315 320 305 310 315 320
Pro Phe Glu Ile Phe Ala Met Leu Gly Phe Thr Pro Asn Lys Met Gly Pro Phe Glu Ile Phe Ala Met Leu Gly Phe Thr Pro Asn Lys Met Gly
325 330 335 325 330 335
Gly Phe Ala Ser Thr Ser Tyr Ile Asn Tyr Pro Thr Glu Asn Ile Asn Gly Phe Ala Ser Thr Ser Tyr Ile Asn Tyr Pro Thr Glu Asn Ile Asn
340 345 350 340 345 350
His Leu Phe Phe Leu Thr Ser Asp Gln Pro Ser Ile Arg Thr Lys Trp His Leu Phe Phe Leu Thr Ser Asp Gln Pro Ser Ile Arg Thr Lys Trp
355 360 365 355 360 365
Leu Asp Tyr Glu Lys Gln Phe Gly Leu Met Tyr Ser Leu Leu Ala Met Leu Asp Tyr Glu Lys Gln Phe Gly Leu Met Tyr Ser Leu Leu Ala Met
370 375 380 370 375 380
Gln Lys Ile Asn Glu Asp Gln Ala Phe Met Cys Thr Ile His Asn Gln Lys Ile Asn Glu Asp Gln Ala Phe Met Cys Thr Ile His Asn
385 390 395 385 390 395
<210> 61<210> 61
<211> 497<211> 497
<212> Белок<212> Protein
<213> Photobacterium leiognathi<213> Photobacterium leiognathi
<400> 61<400> 61
Met Cys Asn Asp Asn Gln Asn Thr Val Asp Val Val Val Ser Thr Val Met Cys Asn Asp Asn Gln Asn Thr Val Asp Val Val Val Ser Thr Val
1 5 10 15 1 5 10 15
Asn Asp Asn Val Ile Glu Asn Asn Thr Tyr Gln Val Lys Pro Ile Asp Asn Asp Asn Val Ile Glu Asn Asn Thr Tyr Gln Val Lys Pro Ile Asp
20 25 30 20 25 30
Thr Pro Thr Thr Phe Asp Ser Tyr Ser Trp Ile Gln Thr Cys Gly Thr Thr Pro Thr Thr Phe Asp Ser Tyr Ser Trp Ile Gln Thr Cys Gly Thr
35 40 45 35 40 45
Pro Ile Leu Lys Asp Asp Glu Lys Tyr Ser Leu Ser Phe Asp Phe Val Pro Ile Leu Lys Asp Asp Glu Lys Tyr Ser Leu Ser Phe Asp Phe Val
50 55 60 50 55 60
Ala Pro Glu Leu Asp Gln Asp Glu Lys Phe Cys Phe Glu Phe Thr Gly Ala Pro Glu Leu Asp Gln Asp Glu Lys Phe Cys Phe Glu Phe Thr Gly
65 70 75 80 65 70 75 80
Asp Val Asp Gly Lys Arg Tyr Val Thr Gln Thr Asn Leu Thr Val Val Asp Val Asp Gly Lys Arg Tyr Val Thr Gln Thr Asn Leu Thr Val Val
85 90 95 85 90 95
Ala Pro Thr Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Ser Leu Ala Pro Thr Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Ser Leu
100 105 110 100 105 110
Gln Gln Leu Met Lys Ile Ile Gln Gln Lys Asn Glu Tyr Ser Gln Asn Gln Gln Leu Met Lys Ile Ile Gln Gln Lys Asn Glu Tyr Ser Gln Asn
115 120 125 115 120 125
Glu Arg Phe Ile Ser Trp Gly Arg Ile Gly Leu Thr Glu Asp Asn Ala Glu Arg Phe Ile Ser Trp Gly Arg Ile Gly Leu Thr Glu Asp Asn Ala
130 135 140 130 135 140
Glu Lys Leu Asn Ala His Ile Tyr Pro Leu Ala Gly Asn Asn Thr Ser Glu Lys Leu Asn Ala His Ile Tyr Pro Leu Ala Gly Asn Asn Thr Ser
145 150 155 160 145 150 155 160
Gln Glu Leu Val Asp Ala Val Ile Asp Tyr Ala Asp Ser Lys Asn Arg Gln Glu Leu Val Asp Ala Val Ile Asp Tyr Ala Asp Ser Lys Asn Arg
165 170 175 165 170 175
Leu Asn Leu Glu Leu Asn Thr Asn Thr Ala His Ser Phe Pro Asn Leu Leu Asn Leu Glu Leu Asn Thr Asn Thr Ala His Ser Phe Pro Asn Leu
180 185 190 180 185 190
Ala Pro Ile Leu Arg Ile Ile Ser Ser Lys Ser Asn Ile Leu Ile Ser Ala Pro Ile Leu Arg Ile Ile Ser Ser Lys Ser Asn Ile Leu Ile Ser
195 200 205 195 200 205
Asn Ile Asn Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Asn Leu Tyr Asn Ile Asn Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Asn Leu Tyr
210 215 220 210 215 220
Asn Trp Lys Asp Thr Glu Asp Lys Ser Val Lys Leu Ser Asp Ser Phe Asn Trp Lys Asp Thr Glu Asp Lys Ser Val Lys Leu Ser Asp Ser Phe
225 230 235 240 225 230 235 240
Leu Val Leu Lys Asp Tyr Phe Asn Gly Ile Ser Ser Glu Lys Pro Ser Leu Val Leu Lys Asp Tyr Phe Asn Gly Ile Ser Ser Glu Lys Pro Ser
245 250 255 245 250 255
Gly Ile Tyr Gly Arg Tyr Asn Trp His Gln Leu Tyr Asn Thr Ser Tyr Gly Ile Tyr Gly Arg Tyr Asn Trp His Gln Leu Tyr Asn Thr Ser Tyr
260 265 270 260 265 270
Tyr Phe Leu Arg Lys Asp Tyr Leu Thr Val Glu Pro Gln Leu His Asp Tyr Phe Leu Arg Lys Asp Tyr Leu Thr Val Glu Pro Gln Leu His Asp
275 280 285 275 280 285
Leu Arg Glu Tyr Leu Gly Gly Ser Leu Lys Gln Met Ser Trp Asp Gly Leu Arg Glu Tyr Leu Gly Gly Ser Leu Lys Gln Met Ser Trp Asp Gly
290 295 300 290 295 300
Phe Ser Gln Leu Ser Lys Gly Asp Lys Glu Leu Phe Leu Asn Ile Val Phe Ser Gln Leu Ser Lys Gly Asp Lys Glu Leu Phe Leu Asn Ile Val
305 310 315 320 305 310 315 320
Gly Phe Asp Gln Glu Lys Leu Gln Gln Glu Tyr Gln Gln Ser Glu Leu Gly Phe Asp Gln Glu Lys Leu Gln Gln Glu Tyr Gln Gln Ser Glu Leu
325 330 335 325 330 335
Pro Asn Phe Val Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Pro Asn Phe Val Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr
340 345 350 340 345 350
Lys Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Val Asn Asn Ala Ile Lys Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Val Asn Asn Ala Ile
355 360 365 355 360 365
Asn Glu Thr Ser Pro Tyr Tyr Leu Gly Arg Glu His Asp Leu Phe Phe Asn Glu Thr Ser Pro Tyr Tyr Leu Gly Arg Glu His Asp Leu Phe Phe
370 375 380 370 375 380
Lys Gly His Pro Arg Gly Gly Ile Ile Asn Asp Ile Ile Leu Gly Ser Lys Gly His Pro Arg Gly Gly Ile Ile Asn Asp Ile Ile Leu Gly Ser
385 390 395 400 385 390 395 400
Phe Asn Asn Met Ile Asp Ile Pro Ala Lys Val Ser Phe Glu Val Leu Phe Asn Asn Met Ile Asp Ile Pro Ala Lys Val Ser Phe Glu Val Leu
405 410 415 405 410 415
Met Met Thr Gly Met Leu Pro Asp Thr Val Gly Gly Ile Ala Ser Ser Met Met Thr Gly Met Leu Pro Asp Thr Val Gly Gly Ile Ala Ser Ser
420 425 430 420 425 430
Leu Tyr Phe Ser Ile Pro Ala Glu Lys Val Ser Phe Ile Val Phe Thr Leu Tyr Phe Ser Ile Pro Ala Glu Lys Val Ser Phe Ile Val Phe Thr
435 440 445 435 440 445
Ser Ser Asp Thr Ile Thr Asp Arg Glu Asp Ala Leu Lys Ser Pro Leu Ser Ser Asp Thr Ile Thr Asp Arg Glu Asp Ala Leu Lys Ser Pro Leu
450 455 460 450 455 460
Val Gln Val Met Met Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Val Gln Val Met Met Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu
465 470 475 480 465 470 475 480
Phe Trp Ser Asp Leu Pro Asp Cys Ser Ser Gly Val Cys Ile Ala Gln Phe Trp Ser Asp Leu Pro Asp Cys Ser Ser Gly Val Cys Ile Ala Gln
485 490 495 485 490 495
Tyr Tyr
<210> 62<210> 62
<211> 498<211> 498
<212> Белок<212> Protein
<213> Photobacterium sp.<213> Photobacterium sp.
<400> 62<400> 62
Met Ser Glu Glu Asn Thr Gln Ser Ile Ile Lys Asn Asp Ile Asn Lys Met Ser Glu Glu Asn Thr Gln Ser Ile Ile Lys Asn Asp Ile Asn Lys
1 5 10 15 1 5 10 15
Thr Ile Ile Asp Glu Glu Tyr Val Asn Leu Glu Pro Ile Asn Gln Ser Thr Ile Ile Asp Glu Glu Tyr Val Asn Leu Glu Pro Ile Asn Gln Ser
20 25 30 20 25 30
Asn Ile Ser Phe Thr Lys His Ser Trp Val Gln Thr Cys Gly Thr Gln Asn Ile Ser Phe Thr Lys His Ser Trp Val Gln Thr Cys Gly Thr Gln
35 40 45 35 40 45
Gln Leu Leu Thr Glu Gln Asn Lys Glu Ser Ile Ser Leu Ser Val Val Gln Leu Leu Thr Glu Gln Asn Lys Glu Ser Ile Ser Leu Ser Val Val
50 55 60 50 55 60
Ala Pro Arg Leu Asp Asp Asp Glu Lys Tyr Cys Phe Asp Phe Asn Gly Ala Pro Arg Leu Asp Asp Asp Glu Lys Tyr Cys Phe Asp Phe Asn Gly
65 70 75 80 65 70 75 80
Val Ser Asn Lys Gly Glu Lys Tyr Ile Thr Lys Val Thr Leu Asn Val Val Ser Asn Lys Gly Glu Lys Tyr Ile Thr Lys Val Thr Leu Asn Val
85 90 95 85 90 95
Val Ala Pro Ser Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Thr Val Ala Pro Ser Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Thr
100 105 110 100 105 110
Leu Gln Gln Leu Met Asp Ile Ile Lys Ser Glu Glu Glu Asn Pro Thr Leu Gln Gln Leu Met Asp Ile Ile Lys Ser Glu Glu Glu Asn Pro Thr
115 120 125 115 120 125
Ala Gln Arg Tyr Ile Ala Trp Gly Arg Ile Val Pro Thr Asp Glu Gln Ala Gln Arg Tyr Ile Ala Trp Gly Arg Ile Val Pro Thr Asp Glu Gln
130 135 140 130 135 140
Met Lys Glu Leu Asn Ile Thr Ser Phe Ala Leu Ile Asn Asn His Thr Met Lys Glu Leu Asn Ile Thr Ser Phe Ala Leu Ile Asn Asn His Thr
145 150 155 160 145 150 155 160
Pro Ala Asp Leu Val Gln Glu Ile Val Lys Gln Ala Gln Thr Lys His Pro Ala Asp Leu Val Gln Glu Ile Val Lys Gln Ala Gln Thr Lys His
165 170 175 165 170 175
Arg Leu Asn Val Lys Leu Ser Ser Asn Thr Ala His Ser Phe Asp Asn Arg Leu Asn Val Lys Leu Ser Ser Asn Thr Ala His Ser Phe Asp Asn
180 185 190 180 185 190
Leu Val Pro Ile Leu Lys Glu Leu Asn Ser Phe Asn Asn Val Thr Val Leu Val Pro Ile Leu Lys Glu Leu Asn Ser Phe Asn Asn Val Thr Val
195 200 205 195 200 205
Thr Asn Ile Asp Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Asn Leu Thr Asn Ile Asp Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Asn Leu
210 215 220 210 215 220
Tyr Asn Trp Arg Asp Thr Leu Asn Lys Thr Asp Asn Leu Lys Ile Gly Tyr Asn Trp Arg Asp Thr Leu Asn Lys Thr Asp Asn Leu Lys Ile Gly
225 230 235 240 225 230 235 240
Lys Asp Tyr Leu Glu Asp Val Ile Asn Gly Ile Asn Glu Asp Thr Ser Lys Asp Tyr Leu Glu Asp Val Ile Asn Gly Ile Asn Glu Asp Thr Ser
245 250 255 245 250 255
Asn Thr Gly Thr Ser Ser Val Tyr Asn Trp Gln Lys Leu Tyr Pro Ala Asn Thr Gly Thr Ser Ser Val Tyr Asn Trp Gln Lys Leu Tyr Pro Ala
260 265 270 260 265 270
Asn Tyr His Phe Leu Arg Lys Asp Tyr Leu Thr Leu Glu Pro Ser Leu Asn Tyr His Phe Leu Arg Lys Asp Tyr Leu Thr Leu Glu Pro Ser Leu
275 280 285 275 280 285
His Glu Leu Arg Asp Tyr Ile Gly Asp Ser Leu Lys Gln Met Gln Trp His Glu Leu Arg Asp Tyr Ile Gly Asp Ser Leu Lys Gln Met Gln Trp
290 295 300 290 295 300
Asp Gly Phe Lys Lys Phe Asn Ser Lys Gln Gln Glu Leu Phe Leu Ser Asp Gly Phe Lys Lys Phe Asn Ser Lys Gln Gln Glu Leu Phe Leu Ser
305 310 315 320 305 310 315 320
Ile Val Asn Phe Asp Lys Gln Lys Leu Gln Asn Glu Tyr Asn Ser Ser Ile Val Asn Phe Asp Lys Gln Lys Leu Gln Asn Glu Tyr Asn Ser Ser
325 330 335 325 330 335
Asn Leu Pro Asn Phe Val Phe Thr Gly Thr Thr Val Trp Ala Gly Asn Asn Leu Pro Asn Phe Val Phe Thr Gly Thr Thr Val Trp Ala Gly Asn
340 345 350 340 345 350
His Glu Arg Glu Tyr Tyr Ala Lys Gln Gln Ile Asn Val Ile Asn Asn His Glu Arg Glu Tyr Tyr Ala Lys Gln Gln Ile Asn Val Ile Asn Asn
355 360 365 355 360 365
Ala Ile Asn Glu Ser Ser Pro His Tyr Leu Gly Asn Ser Tyr Asp Leu Ala Ile Asn Glu Ser Ser Pro His Tyr Leu Gly Asn Ser Tyr Asp Leu
370 375 380 370 375 380
Phe Phe Lys Gly His Pro Gly Gly Gly Ile Ile Asn Thr Leu Ile Met Phe Phe Lys Gly His Pro Gly Gly Gly Ile Ile Asn Thr Leu Ile Met
385 390 395 400 385 390 395 400
Gln Asn Tyr Pro Ser Met Val Asp Ile Pro Ser Lys Ile Ser Phe Glu Gln Asn Tyr Pro Ser Met Val Asp Ile Pro Ser Lys Ile Ser Phe Glu
405 410 415 405 410 415
Val Leu Met Met Thr Asp Met Leu Pro Asp Ala Val Ala Gly Ile Ala Val Leu Met Met Thr Asp Met Leu Pro Asp Ala Val Ala Gly Ile Ala
420 425 430 420 425 430
Ser Ser Leu Tyr Phe Thr Ile Pro Ala Glu Lys Ile Lys Phe Ile Val Ser Ser Leu Tyr Phe Thr Ile Pro Ala Glu Lys Ile Lys Phe Ile Val
435 440 445 435 440 445
Phe Thr Ser Thr Glu Thr Ile Thr Asp Arg Glu Thr Ala Leu Arg Ser Phe Thr Ser Thr Glu Thr Ile Thr Asp Arg Glu Thr Ala Leu Arg Ser
450 455 460 450 455 460
Pro Leu Val Gln Val Met Ile Lys Leu Gly Ile Val Lys Glu Glu Asn Pro Leu Val Gln Val Met Ile Lys Leu Gly Ile Val Lys Glu Glu Asn
465 470 475 480 465 470 475 480
Val Leu Phe Trp Ala Asp Leu Pro Asn Cys Glu Thr Gly Val Cys Ile Val Leu Phe Trp Ala Asp Leu Pro Asn Cys Glu Thr Gly Val Cys Ile
485 490 495 485 490 495
Ala Val Ala Val
<210> 63<210> 63
<211> 482<211> 482
<212> Белок<212> Protein
<213> Photobacterium leiognathi<213> Photobacterium leiognathi
<400> 63<400> 63
Met Asn Asp Asn Gln Asn Thr Val Asp Val Val Val Ser Thr Val Asn Met Asn Asp Asn Gln Asn Thr Val Asp Val Val Val Ser Thr Val Asn
1 5 10 15 1 5 10 15
Asp Asn Val Ile Glu Asn Asn Thr Tyr Gln Val Lys Pro Ile Asp Thr Asp Asn Val Ile Glu Asn Asn Thr Tyr Gln Val Lys Pro Ile Asp Thr
20 25 30 20 25 30
Pro Thr Thr Phe Asp Ser Tyr Ser Trp Ile Gln Thr Cys Gly Thr Pro Pro Thr Thr Phe Asp Ser Tyr Ser Trp Ile Gln Thr Cys Gly Thr Pro
35 40 45 35 40 45
Ile Leu Lys Asp Asp Glu Lys Tyr Ser Leu Ser Phe Asp Phe Val Ala Ile Leu Lys Asp Asp Glu Lys Tyr Ser Leu Ser Phe Asp Phe Val Ala
50 55 60 50 55 60
Pro Glu Leu Asp Gln Asp Glu Lys Phe Cys Phe Glu Phe Thr Gly Asp Pro Glu Leu Asp Gln Asp Glu Lys Phe Cys Phe Glu Phe Thr Gly Asp
65 70 75 80 65 70 75 80
Val Asp Gly Lys Arg Tyr Val Thr Gln Thr Asn Leu Thr Val Val Ala Val Asp Gly Lys Arg Tyr Val Thr Gln Thr Asn Leu Thr Val Val Ala
85 90 95 85 90 95
Pro Thr Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Ser Leu Gln Pro Thr Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Ser Leu Gln
100 105 110 100 105 110
Gln Leu Met Lys Ile Ile Gln Gln Lys Asn Glu Tyr Ser Gln Asn Glu Gln Leu Met Lys Ile Ile Gln Gln Lys Asn Glu Tyr Ser Gln Asn Glu
115 120 125 115 120 125
Arg Phe Ile Ser Trp Gly Arg Ile Arg Leu Thr Glu Asp Asn Ala Glu Arg Phe Ile Ser Trp Gly Arg Ile Arg Leu Thr Glu Asp Asn Ala Glu
130 135 140 130 135 140
Lys Leu Asn Ala His Ile Tyr Pro Leu Ala Gly Asn Asn Thr Ser Gln Lys Leu Asn Ala His Ile Tyr Pro Leu Ala Gly Asn Asn Thr Ser Gln
145 150 155 160 145 150 155 160
Glu Leu Val Asp Ala Val Ile Asp Tyr Ala Asp Ser Lys Asn Arg Leu Glu Leu Val Asp Ala Val Ile Asp Tyr Ala Asp Ser Lys Asn Arg Leu
165 170 175 165 170 175
Asn Leu Glu Leu Asn Thr Asn Thr Gly His Ser Phe Arg Asn Ile Ala Asn Leu Glu Leu Asn Thr Asn Thr Gly His Ser Phe Arg Asn Ile Ala
180 185 190 180 185 190
Pro Ile Leu Arg Ala Thr Ser Ser Lys Asn Asn Ile Leu Ile Ser Asn Pro Ile Leu Arg Ala Thr Ser Ser Lys Asn Asn Ile Leu Ile Ser Asn
195 200 205 195 200 205
Ile Asn Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Ser Leu Tyr Asn Ile Asn Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Ser Leu Tyr Asn
210 215 220 210 215 220
Trp Lys Asp Thr Asp Asn Lys Ser Gln Lys Leu Ser Asp Ser Phe Leu Trp Lys Asp Thr Asp Asn Lys Ser Gln Lys Leu Ser Asp Ser Phe Leu
225 230 235 240 225 230 235 240
Val Leu Lys Asp Tyr Leu Asn Gly Ile Ser Ser Glu Lys Pro Asn Gly Val Leu Lys Asp Tyr Leu Asn Gly Ile Ser Ser Glu Lys Pro Asn Gly
245 250 255 245 250 255
Ile Tyr Ser Ile Tyr Asn Trp His Gln Leu Tyr His Ser Ser Tyr Tyr Ile Tyr Ser Ile Tyr Asn Trp His Gln Leu Tyr His Ser Ser Tyr Tyr
260 265 270 260 265 270
Phe Leu Arg Lys Asp Tyr Leu Thr Val Glu Thr Lys Leu His Asp Leu Phe Leu Arg Lys Asp Tyr Leu Thr Val Glu Thr Lys Leu His Asp Leu
275 280 285 275 280 285
Arg Glu Tyr Leu Gly Gly Ser Leu Lys Gln Met Ser Trp Asp Thr Phe Arg Glu Tyr Leu Gly Gly Ser Leu Lys Gln Met Ser Trp Asp Thr Phe
290 295 300 290 295 300
Ser Gln Leu Ser Lys Gly Asp Lys Glu Leu Phe Leu Asn Ile Val Gly Ser Gln Leu Ser Lys Gly Asp Lys Glu Leu Phe Leu Asn Ile Val Gly
305 310 315 320 305 310 315 320
Phe Asp Gln Glu Lys Leu Gln Gln Glu Tyr Gln Gln Ser Glu Leu Pro Phe Asp Gln Glu Lys Leu Gln Gln Glu Tyr Gln Gln Ser Glu Leu Pro
325 330 335 325 330 335
Asn Phe Val Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys Asn Phe Val Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys
340 345 350 340 345 350
Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Val Asn Asn Ala Ile Asn Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Val Asn Asn Ala Ile Asn
355 360 365 355 360 365
Glu Thr Ser Pro Tyr Tyr Leu Gly Arg Glu His Asp Leu Phe Phe Lys Glu Thr Ser Pro Tyr Tyr Leu Gly Arg Glu His Asp Leu Phe Phe Lys
370 375 380 370 375 380
Gly His Pro Arg Gly Gly Ile Ile Asn Asp Ile Ile Leu Gly Ser Phe Gly His Pro Arg Gly Gly Ile Ile Asn Asp Ile Ile Leu Gly Ser Phe
385 390 395 400 385 390 395 400
Asn Asn Met Ile Asp Ile Pro Ala Lys Val Ser Phe Glu Val Leu Met Asn Asn Met Ile Asp Ile Pro Ala Lys Val Ser Phe Glu Val Leu Met
405 410 415 405 410 415
Met Thr Gly Met Leu Pro Asp Thr Val Gly Gly Ile Ala Ser Ser Leu Met Thr Gly Met Leu Pro Asp Thr Val Gly Gly Ile Ala Ser Ser Leu
420 425 430 420 425 430
Tyr Phe Ser Ile Pro Ala Glu Lys Val Ser Phe Ile Val Phe Thr Ser Tyr Phe Ser Ile Pro Ala Glu Lys Val Ser Phe Ile Val Phe Thr Ser
435 440 445 435 440 445
Ser Asp Thr Ile Thr Asp Arg Glu Asp Ala Leu Lys Ser Pro Leu Val Ser Asp Thr Ile Thr Asp Arg Glu Asp Ala Leu Lys Ser Pro Leu Val
450 455 460 450 455 460
Gln Val Met Met Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe Gln Val Met Met Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe
465 470 475 480 465 470 475 480
Trp Cys Trp Cys
<210> 64<210> 64
<211> 675<211> 675
<212> Белок<212> Protein
<213> Photobacterium damsela<213> Photobacterium damsela
<400> 64<400> 64
Met Lys Lys Ile Leu Thr Val Leu Ser Ile Phe Ile Leu Ser Ala Cys Met Lys Lys Ile Leu Thr Val Leu Ser Ile Phe Ile Leu Ser Ala Cys
1 5 10 15 1 5 10 15
Asn Ser Asp Asn Thr Ser Leu Lys Glu Thr Val Ser Ser Asn Ser Ala Asn Ser Asp Asn Thr Ser Leu Lys Glu Thr Val Ser Ser Asn Ser Ala
20 25 30 20 25 30
Asp Val Val Glu Thr Glu Thr Tyr Gln Leu Thr Pro Ile Asp Ala Pro Asp Val Val Glu Thr Glu Thr Tyr Gln Leu Thr Pro Ile Asp Ala Pro
35 40 45 35 40 45
Ser Ser Phe Leu Ser His Ser Trp Glu Gln Thr Cys Gly Thr Pro Ile Ser Ser Phe Leu Ser His Ser Trp Glu Gln Thr Cys Gly Thr Pro Ile
50 55 60 50 55 60
Leu Asn Glu Ser Asp Lys Gln Ala Ile Ser Phe Asp Phe Val Ala Pro Leu Asn Glu Ser Asp Lys Gln Ala Ile Ser Phe Asp Phe Val Ala Pro
65 70 75 80 65 70 75 80
Glu Leu Lys Gln Asp Glu Lys Tyr Cys Phe Thr Phe Lys Gly Ile Thr Glu Leu Lys Gln Asp Glu Lys Tyr Cys Phe Thr Phe Lys Gly Ile Thr
85 90 95 85 90 95
Gly Asp His Arg Tyr Ile Thr Asn Thr Thr Leu Thr Val Val Ala Pro Gly Asp His Arg Tyr Ile Thr Asn Thr Thr Leu Thr Val Val Ala Pro
100 105 110 100 105 110
Thr Leu Glu Val Tyr Ile Asp His Ala Ser Leu Pro Ser Leu Gln Gln Thr Leu Glu Val Tyr Ile Asp His Ala Ser Leu Pro Ser Leu Gln Gln
115 120 125 115 120 125
Leu Ile His Ile Ile Gln Ala Lys Asp Glu Tyr Pro Ser Asn Gln Arg Leu Ile His Ile Ile Gln Ala Lys Asp Glu Tyr Pro Ser Asn Gln Arg
130 135 140 130 135 140
Phe Val Ser Trp Lys Arg Val Thr Val Asp Ala Asp Asn Ala Asn Lys Phe Val Ser Trp Lys Arg Val Thr Val Asp Ala Asp Asn Ala Asn Lys
145 150 155 160 145 150 155 160
Leu Asn Ile His Thr Tyr Pro Leu Lys Gly Asn Asn Thr Ser Pro Glu Leu Asn Ile His Thr Tyr Pro Leu Lys Gly Asn Asn Thr Ser Pro Glu
165 170 175 165 170 175
Met Val Ala Ala Ile Asp Glu Tyr Ala Gln Ser Lys Asn Arg Leu Asn Met Val Ala Ala Ile Asp Glu Tyr Ala Gln Ser Lys Asn Arg Leu Asn
180 185 190 180 185 190
Ile Glu Phe Tyr Thr Asn Thr Ala His Val Phe Asn Asn Leu Pro Pro Ile Glu Phe Tyr Thr Asn Thr Ala His Val Phe Asn Asn Leu Pro Pro
195 200 205 195 200 205
Ile Ile Gln Pro Leu Tyr Asn Asn Glu Lys Val Lys Ile Ser His Ile Ile Ile Gln Pro Leu Tyr Asn Asn Glu Lys Val Lys Ile Ser His Ile
210 215 220 210 215 220
Ser Leu Tyr Asp Asp Gly Ser Ser Glu Tyr Val Ser Leu Tyr Gln Trp Ser Leu Tyr Asp Asp Gly Ser Ser Glu Tyr Val Ser Leu Tyr Gln Trp
225 230 235 240 225 230 235 240
Lys Asp Thr Pro Asn Lys Ile Glu Thr Leu Glu Gly Glu Val Ser Leu Lys Asp Thr Pro Asn Lys Ile Glu Thr Leu Glu Gly Glu Val Ser Leu
245 250 255 245 250 255
Leu Ala Asn Tyr Leu Ala Gly Thr Ser Pro Asp Ala Pro Lys Gly Met Leu Ala Asn Tyr Leu Ala Gly Thr Ser Pro Asp Ala Pro Lys Gly Met
260 265 270 260 265 270
Gly Asn Arg Tyr Asn Trp His Lys Leu Tyr Asp Thr Asp Tyr Tyr Phe Gly Asn Arg Tyr Asn Trp His Lys Leu Tyr Asp Thr Asp Tyr Tyr Phe
275 280 285 275 280 285
Leu Arg Glu Asp Tyr Leu Asp Val Glu Ala Asn Leu His Asp Leu Arg Leu Arg Glu Asp Tyr Leu Asp Val Glu Ala Asn Leu His Asp Leu Arg
290 295 300 290 295 300
Asp Tyr Leu Gly Ser Ser Ala Lys Gln Met Pro Trp Asp Glu Phe Ala Asp Tyr Leu Gly Ser Ser Ala Lys Gln Met Pro Trp Asp Glu Phe Ala
305 310 315 320 305 310 315 320
Lys Leu Ser Asp Ser Gln Gln Thr Leu Phe Leu Asp Ile Val Gly Phe Lys Leu Ser Asp Ser Gln Gln Thr Leu Phe Leu Asp Ile Val Gly Phe
325 330 335 325 330 335
Asp Lys Glu Gln Leu Gln Gln Gln Tyr Ser Gln Ser Pro Leu Pro Asn Asp Lys Glu Gln Leu Gln Gln Gln Tyr Ser Gln Ser Pro Leu Pro Asn
340 345 350 340 345 350
Phe Ile Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys Glu Phe Ile Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys Glu
355 360 365 355 360 365
Tyr Tyr Ala Gln Gln Gln Val Asn Val Ile Asn Asn Ala Ile Asn Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Ile Asn Asn Ala Ile Asn Glu
370 375 380 370 375 380
Thr Ser Pro Tyr Tyr Leu Gly Lys Asp Tyr Asp Leu Phe Phe Lys Gly Thr Ser Pro Tyr Tyr Leu Gly Lys Asp Tyr Asp Leu Phe Phe Lys Gly
385 390 395 400 385 390 395 400
His Pro Ala Gly Gly Val Ile Asn Asp Ile Ile Leu Gly Ser Phe Pro His Pro Ala Gly Gly Val Ile Asn Asp Ile Ile Leu Gly Ser Phe Pro
405 410 415 405 410 415
Asp Met Ile Asn Ile Pro Ala Lys Ile Ser Phe Glu Val Leu Met Met Asp Met Ile Asn Ile Pro Ala Lys Ile Ser Phe Glu Val Leu Met Met
420 425 430 420 425 430
Thr Asp Met Leu Pro Asp Thr Val Ala Gly Ile Ala Ser Ser Leu Tyr Thr Asp Met Leu Pro Asp Thr Val Ala Gly Ile Ala Ser Ser Leu Tyr
435 440 445 435 440 445
Phe Thr Ile Pro Ala Asp Lys Val Asn Phe Ile Val Phe Thr Ser Ser Phe Thr Ile Pro Ala Asp Lys Val Asn Phe Ile Val Phe Thr Ser Ser
450 455 460 450 455 460
Asp Thr Ile Thr Asp Arg Glu Glu Ala Leu Lys Ser Pro Leu Val Gln Asp Thr Ile Thr Asp Arg Glu Glu Ala Leu Lys Ser Pro Leu Val Gln
465 470 475 480 465 470 475 480
Val Met Leu Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe Trp Val Met Leu Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe Trp
485 490 495 485 490 495
Ala Asp His Lys Val Asn Ser Met Glu Val Ala Ile Asp Glu Ala Cys Ala Asp His Lys Val Asn Ser Met Glu Val Ala Ile Asp Glu Ala Cys
500 505 510 500 505 510
Thr Arg Ile Ile Ala Lys Arg Gln Pro Thr Ala Ser Asp Leu Arg Leu Thr Arg Ile Ile Ala Lys Arg Gln Pro Thr Ala Ser Asp Leu Arg Leu
515 520 525 515 520 525
Val Ile Ala Ile Ile Lys Thr Ile Thr Asp Leu Glu Arg Ile Gly Asp Val Ile Ala Ile Ile Lys Thr Ile Thr Asp Leu Glu Arg Ile Gly Asp
530 535 540 530 535 540
Val Ala Glu Ser Ile Ala Lys Val Ala Leu Glu Ser Phe Ser Asn Lys Val Ala Glu Ser Ile Ala Lys Val Ala Leu Glu Ser Phe Ser Asn Lys
545 550 555 560 545 550 555 560
Gln Tyr Asn Leu Leu Val Ser Leu Glu Ser Leu Gly Gln His Thr Val Gln Tyr Asn Leu Leu Val Ser Leu Glu Ser Leu Gly Gln His Thr Val
565 570 575 565 570 575
Arg Met Leu His Glu Val Leu Asp Ala Phe Ala Arg Met Asp Val Lys Arg Met Leu His Glu Val Leu Asp Ala Phe Ala Arg Met Asp Val Lys
580 585 590 580 585 590
Ala Ala Ile Glu Val Tyr Gln Glu Asp Asp Arg Ile Asp Gln Glu Tyr Ala Ala Ile Glu Val Tyr Gln Glu Asp Asp Arg Ile Asp Gln Glu Tyr
595 600 605 595 600 605
Glu Ser Ile Val Arg Gln Leu Met Ala His Met Met Glu Asp Pro Ser Glu Ser Ile Val Arg Gln Leu Met Ala His Met Met Glu Asp Pro Ser
610 615 620 610 615 620
Ser Ile Pro Asn Val Met Lys Val Met Trp Ala Ala Arg Ser Ile Glu Ser Ile Pro Asn Val Met Lys Val Met Trp Ala Ala Arg Ser Ile Glu
625 630 635 640 625 630 635 640
Arg Val Gly Asp Arg Cys Gln Asn Ile Cys Glu Tyr Ile Ile Tyr Phe Arg Val Gly Asp Arg Cys Gln Asn Ile Cys Glu Tyr Ile Ile Tyr Phe
645 650 655 645 650 655
Val Lys Gly Lys Asp Val Arg His Thr Lys Pro Asp Asp Phe Gly Thr Val Lys Gly Lys Asp Val Arg His Thr Lys Pro Asp Asp Phe Gly Thr
660 665 670 660 665 670
Met Leu Asp Met Leu Asp
675 675
<210> 65<210> 65
<211> 510<211> 510
<212> Белок<212> Protein
<213> Photobacterium damsela<213> Photobacterium damsela
<400> 65<400> 65
Met Lys Lys Ile Leu Thr Val Leu Ser Ile Phe Ile Leu Ser Ala Cys Met Lys Lys Ile Leu Thr Val Leu Ser Ile Phe Ile Leu Ser Ala Cys
1 5 10 15 1 5 10 15
Asn Ser Asp Asn Thr Ser Leu Lys Glu Thr Val Ser Ser Asn Ser Ala Asn Ser Asp Asn Thr Ser Leu Lys Glu Thr Val Ser Ser Asn Ser Ala
20 25 30 20 25 30
Asp Val Val Glu Thr Glu Thr Tyr Gln Leu Thr Pro Ile Asp Ala Pro Asp Val Val Glu Thr Glu Thr Tyr Gln Leu Thr Pro Ile Asp Ala Pro
35 40 45 35 40 45
Ser Ser Phe Leu Ser His Ser Trp Glu Gln Thr Cys Gly Thr Pro Ile Ser Ser Phe Leu Ser His Ser Trp Glu Gln Thr Cys Gly Thr Pro Ile
50 55 60 50 55 60
Leu Asn Glu Ser Asp Lys Gln Ala Ile Ser Phe Asp Phe Val Ala Pro Leu Asn Glu Ser Asp Lys Gln Ala Ile Ser Phe Asp Phe Val Ala Pro
65 70 75 80 65 70 75 80
Glu Leu Lys Gln Asp Glu Lys Tyr Cys Phe Thr Phe Lys Gly Ile Thr Glu Leu Lys Gln Asp Glu Lys Tyr Cys Phe Thr Phe Lys Gly Ile Thr
85 90 95 85 90 95
Gly Asp His Arg Tyr Ile Thr Asn Thr Thr Leu Thr Val Val Ala Pro Gly Asp His Arg Tyr Ile Thr Asn Thr Thr Leu Thr Val Val Ala Pro
100 105 110 100 105 110
Thr Leu Glu Val Tyr Ile Asp His Ala Ser Leu Pro Ser Leu Gln Gln Thr Leu Glu Val Tyr Ile Asp His Ala Ser Leu Pro Ser Leu Gln Gln
115 120 125 115 120 125
Leu Ile His Ile Ile Gln Ala Lys Asp Glu Tyr Pro Ser Asn Gln Arg Leu Ile His Ile Ile Gln Ala Lys Asp Glu Tyr Pro Ser Asn Gln Arg
130 135 140 130 135 140
Phe Val Ser Trp Lys Arg Val Thr Val Asp Ala Asp Asn Ala Asn Lys Phe Val Ser Trp Lys Arg Val Thr Val Asp Ala Asp Asn Ala Asn Lys
145 150 155 160 145 150 155 160
Leu Asn Ile His Thr Tyr Pro Leu Lys Gly Asn Asn Thr Ser Pro Glu Leu Asn Ile His Thr Tyr Pro Leu Lys Gly Asn Asn Thr Ser Pro Glu
165 170 175 165 170 175
Met Val Ala Ala Ile Asp Glu Tyr Ala Gln Ser Lys Asn Arg Leu Asn Met Val Ala Ala Ile Asp Glu Tyr Ala Gln Ser Lys Asn Arg Leu Asn
180 185 190 180 185 190
Ile Glu Phe Tyr Thr Asn Thr Ala His Val Phe Asn Asn Leu Pro Pro Ile Glu Phe Tyr Thr Asn Thr Ala His Val Phe Asn Asn Leu Pro Pro
195 200 205 195 200 205
Ile Ile Gln Pro Leu Tyr Asn Asn Glu Lys Val Lys Ile Ser His Ile Ile Ile Gln Pro Leu Tyr Asn Asn Glu Lys Val Lys Ile Ser His Ile
210 215 220 210 215 220
Ser Leu Tyr Asp Asp Gly Ser Ser Glu Tyr Val Ser Leu Tyr Gln Trp Ser Leu Tyr Asp Asp Gly Ser Ser Glu Tyr Val Ser Leu Tyr Gln Trp
225 230 235 240 225 230 235 240
Lys Asp Thr Pro Asn Lys Ile Glu Thr Leu Glu Gly Glu Val Ser Leu Lys Asp Thr Pro Asn Lys Ile Glu Thr Leu Glu Gly Glu Val Ser Leu
245 250 255 245 250 255
Leu Ala Asn Tyr Leu Ala Gly Thr Ser Pro Asp Ala Pro Lys Gly Met Leu Ala Asn Tyr Leu Ala Gly Thr Ser Pro Asp Ala Pro Lys Gly Met
260 265 270 260 265 270
Gly Asn Arg Tyr Asn Trp His Lys Leu Tyr Asp Thr Asp Tyr Tyr Phe Gly Asn Arg Tyr Asn Trp His Lys Leu Tyr Asp Thr Asp Tyr Tyr Phe
275 280 285 275 280 285
Leu Arg Glu Asp Tyr Leu Asp Val Glu Ala Asn Leu His Asp Leu Arg Leu Arg Glu Asp Tyr Leu Asp Val Glu Ala Asn Leu His Asp Leu Arg
290 295 300 290 295 300
Asp Tyr Leu Gly Ser Ser Ala Lys Gln Met Pro Trp Asp Glu Phe Ala Asp Tyr Leu Gly Ser Ser Ala Lys Gln Met Pro Trp Asp Glu Phe Ala
305 310 315 320 305 310 315 320
Lys Leu Ser Asp Ser Gln Gln Thr Leu Phe Leu Asp Ile Val Gly Phe Lys Leu Ser Asp Ser Gln Gln Thr Leu Phe Leu Asp Ile Val Gly Phe
325 330 335 325 330 335
Asp Lys Glu Gln Leu Gln Gln Gln Tyr Ser Gln Ser Pro Leu Pro Asn Asp Lys Glu Gln Leu Gln Gln Gln Tyr Ser Gln Ser Pro Leu Pro Asn
340 345 350 340 345 350
Phe Ile Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys Glu Phe Ile Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys Glu
355 360 365 355 360 365
Tyr Tyr Ala Gln Gln Gln Val Asn Val Ile Asn Asn Ala Ile Asn Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Ile Asn Asn Ala Ile Asn Glu
370 375 380 370 375 380
Thr Ser Pro Tyr Tyr Leu Gly Lys Asp Tyr Asp Leu Phe Phe Lys Gly Thr Ser Pro Tyr Tyr Leu Gly Lys Asp Tyr Asp Leu Phe Phe Lys Gly
385 390 395 400 385 390 395 400
His Pro Ala Gly Gly Val Ile Asn Asp Ile Ile Leu Gly Ser Phe Pro His Pro Ala Gly Gly Val Ile Asn Asp Ile Ile Leu Gly Ser Phe Pro
405 410 415 405 410 415
Asp Met Ile Asn Ile Pro Ala Lys Ile Ser Phe Glu Val Leu Met Met Asp Met Ile Asn Ile Pro Ala Lys Ile Ser Phe Glu Val Leu Met Met
420 425 430 420 425 430
Thr Asp Met Leu Pro Asp Thr Val Ala Gly Ile Ala Ser Ser Leu Tyr Thr Asp Met Leu Pro Asp Thr Val Ala Gly Ile Ala Ser Ser Leu Tyr
435 440 445 435 440 445
Phe Thr Ile Pro Ala Asp Lys Val Asn Phe Ile Val Phe Thr Ser Ser Phe Thr Ile Pro Ala Asp Lys Val Asn Phe Ile Val Phe Thr Ser Ser
450 455 460 450 455 460
Asp Thr Ile Thr Asp Arg Glu Glu Ala Leu Lys Ser Pro Leu Val Gln Asp Thr Ile Thr Asp Arg Glu Glu Ala Leu Lys Ser Pro Leu Val Gln
465 470 475 480 465 470 475 480
Val Met Leu Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe Trp Val Met Leu Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe Trp
485 490 495 485 490 495
Ala Asp Leu Pro Asp Cys Ser Ser Gly Val Cys Ile Asp Lys Ala Asp Leu Pro Asp Cys Ser Ser Gly Val Cys Ile Asp Lys
500 505 510 500 505 510
<210> 66<210> 66
<211> 422<211> 422
<212> Белок<212> Protein
<213> Heliobacter acinonychis<213> Heliobacter acinonychis
<400> 66<400> 66
Met Gly Thr Ile Lys Lys Pro Leu Ile Ile Ala Gly Asn Gly Pro Ser Met Gly Thr Ile Lys Lys Pro Leu Ile Ile Ala Gly Asn Gly Pro Ser
1 5 10 15 1 5 10 15
Ile Lys Asp Leu Asp Tyr Ala Leu Phe Pro Lys Asp Phe Asp Val Phe Ile Lys Asp Leu Asp Tyr Ala Leu Phe Pro Lys Asp Phe Asp Val Phe
20 25 30 20 25 30
Arg Cys Asn Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Arg Glu Arg Cys Asn Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Arg Glu
35 40 45 35 40 45
Ile Lys Gly Val Phe Phe Asn Pro Cys Val Leu Ser Ser Gln Met Gln Ile Lys Gly Val Phe Phe Asn Pro Cys Val Leu Ser Ser Gln Met Gln
50 55 60 50 55 60
Thr Val Gln Tyr Leu Met Asp Asn Gly Glu Tyr Ser Ile Glu Arg Phe Thr Val Gln Tyr Leu Met Asp Asn Gly Glu Tyr Ser Ile Glu Arg Phe
65 70 75 80 65 70 75 80
Phe Cys Ser Val Ser Thr Asp Arg His Asp Phe Asp Gly Asp Tyr Gln Phe Cys Ser Val Ser Thr Asp Arg His Asp Phe Asp Gly Asp Tyr Gln
85 90 95 85 90 95
Thr Ile Leu Pro Val Asp Gly Tyr Leu Lys Ala His Tyr Pro Phe Val Thr Ile Leu Pro Val Asp Gly Tyr Leu Lys Ala His Tyr Pro Phe Val
100 105 110 100 105 110
Cys Asp Thr Phe Ser Leu Phe Lys Gly His Glu Glu Ile Leu Lys His Cys Asp Thr Phe Ser Leu Phe Lys Gly His Glu Glu Ile Leu Lys His
115 120 125 115 120 125
Val Lys Tyr His Leu Lys Thr Tyr Ser Lys Glu Leu Ser Ala Gly Val Val Lys Tyr His Leu Lys Thr Tyr Ser Lys Glu Leu Ser Ala Gly Val
130 135 140 130 135 140
Leu Met Leu Leu Ser Ala Val Val Leu Gly Tyr Lys Glu Ile Tyr Leu Leu Met Leu Leu Ser Ala Val Val Leu Gly Tyr Lys Glu Ile Tyr Leu
145 150 155 160 145 150 155 160
Val Gly Ile Asp Phe Gly Ala Ser Ser Trp Gly His Phe Tyr Asp Glu Val Gly Ile Asp Phe Gly Ala Ser Ser Trp Gly His Phe Tyr Asp Glu
165 170 175 165 170 175
Ser Gln Ser Gln His Phe Ser Asn His Met Ala Asp Cys His Asn Ile Ser Gln Ser Gln His Phe Ser Asn His Met Ala Asp Cys His Asn Ile
180 185 190 180 185 190
Tyr Tyr Asp Met Leu Thr Ile Cys Leu Cys Gln Lys Tyr Ala Lys Leu Tyr Tyr Asp Met Leu Thr Ile Cys Leu Cys Gln Lys Tyr Ala Lys Leu
195 200 205 195 200 205
Tyr Ala Leu Ala Pro Asn Ser Pro Leu Ser His Leu Leu Thr Leu Asn Tyr Ala Leu Ala Pro Asn Ser Pro Leu Ser His Leu Leu Thr Leu Asn
210 215 220 210 215 220
Pro Gln Ala Lys Tyr Pro Phe Glu Leu Leu Asp Lys Pro Ile Gly Tyr Pro Gln Ala Lys Tyr Pro Phe Glu Leu Leu Asp Lys Pro Ile Gly Tyr
225 230 235 240 225 230 235 240
Thr Ser Asp Leu Ile Ile Ser Ser Pro Leu Glu Glu Lys Leu Leu Glu Thr Ser Asp Leu Ile Ile Ser Ser Pro Leu Glu Glu Lys Leu Leu Glu
245 250 255 245 250 255
Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu
260 265 270 260 265 270
Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys
275 280 285 275 280 285
Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu
290 295 300 290 295 300
Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile
305 310 315 320 305 310 315 320
Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu
325 330 335 325 330 335
Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu
340 345 350 340 345 350
Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys
355 360 365 355 360 365
Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu
370 375 380 370 375 380
Leu Ala Ser Arg Leu Asn Asn Ile Leu Arg Lys Ile Lys Arg Lys Ile Leu Ala Ser Arg Leu Asn Asn Ile Leu Arg Lys Ile Lys Arg Lys Ile
385 390 395 400 385 390 395 400
Leu Pro Phe Phe Trp Gly Gly Gly Val Thr Pro Thr Leu Lys Val Ser Leu Pro Phe Phe Trp Gly Gly Gly Val Thr Pro Thr Leu Lys Val Ser
405 410 415 405 410 415
Phe Arg Trp Gly Ala Ala Phe Arg Trp Gly Ala Ala
420 420
<210> 67<210> 67
<211> 609<211> 609
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 67<400> 67
Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile
1 5 10 15 1 5 10 15
Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala
20 25 30 20 25 30
Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg
35 40 45 35 40 45
Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu
50 55 60 50 55 60
His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu
65 70 75 80 65 70 75 80
Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val
85 90 95 85 90 95
Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu
100 105 110 100 105 110
Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile
115 120 125 115 120 125
Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu
130 135 140 130 135 140
Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val
145 150 155 160 145 150 155 160
Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly
165 170 175 165 170 175
Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser
180 185 190 180 185 190
Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu
195 200 205 195 200 205
Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp
210 215 220 210 215 220
Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln
225 230 235 240 225 230 235 240
Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu
245 250 255 245 250 255
Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile
260 265 270 260 265 270
Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu
275 280 285 275 280 285
Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser
290 295 300 290 295 300
Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly
305 310 315 320 305 310 315 320
Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser
325 330 335 325 330 335
Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu
340 345 350 340 345 350
Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr
355 360 365 355 360 365
Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg
370 375 380 370 375 380
Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val
385 390 395 400 385 390 395 400
Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu
405 410 415 405 410 415
Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His
420 425 430 420 425 430
Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met
435 440 445 435 440 445
Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp
450 455 460 450 455 460
Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala
465 470 475 480 465 470 475 480
Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu
485 490 495 485 490 495
Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp
500 505 510 500 505 510
Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu
515 520 525 515 520 525
Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu
530 535 540 530 535 540
Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met
545 550 555 560 545 550 555 560
His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe
565 570 575 565 570 575
Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys
580 585 590 580 585 590
Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val
595 600 605 595 600 605
Glu Glu
<210> 68<210> 68
<211> 1830<211> 1830
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 68<400> 68
atgtgtggaa ttgttggcgc gatcgcgcaa cgtgatgtag cagaaatcct tcttgaaggt 60atgtgtggaa ttgttggcgc gatcgcgcaa cgtgatgtag cagaaatcct tcttgaaggt 60
ttacgtcgtc tggaataccg cggatatgac tctgccggtc tggccgttgt tgatgcagaa 120ttacgtcgtc tggaataccg cggatatgac tctgccggtc tggccgttgt tgatgcagaa 120
ggtcatatga cccgcctgcg tcgcctcggt aaagtccaga tgctggcaca ggcagcggaa 180ggtcatatga cccgcctgcg tcgcctcggt aaagtccaga tgctggcaca ggcagcggaa 180
gaacatcctc tgcatggcgg cactggtatt gctcacactc gctgggcgac ccacggtgaa 240gaacatcctc tgcatggcgg cactggtatt gctcacactc gctgggcgac ccacggtgaa 240
ccttcagaag tgaatgcgca tccgcatgtt tctgaacaca ttgtggtggt gcataacggc 300ccttcagaag tgaatgcgca tccgcatgtt tctgaacaca ttgtggtggt gcataacggc 300
atcatcgaaa accatgaacc gctgcgtgaa gagctaaaag cgcgtggcta taccttcgtt 360atcatcgaaa accatgaacc gctgcgtgaa gagctaaaag cgcgtggcta taccttcgtt 360
tctgaaaccg acaccgaagt gattgcccat ctggtgaact gggagctgaa acaaggcggg 420tctgaaaccg acaccgaagt gattgcccat ctggtgaact gggagctgaa acaaggcggg 420
actctgcgtg aggccgttct gcgtgctatc ccgcagctgc gtggtgcgta cggtacagtg 480actctgcgtg aggccgttct gcgtgctatc ccgcagctgc gtggtgcgta cggtacagtg 480
atcatggact cccgtcaccc ggataccctg ctggcggcac gttctggtag tccgctggtg 540atcatggact cccgtcaccc ggataccctg ctggcggcac gttctggtag tccgctggtg 540
attggcctgg ggatgggcga aaactttatc gcttctgacc agctggcgct gttgccggtg 600attggcctgg ggatgggcga aaactttatc gcttctgacc agctggcgct gttgccggtg 600
acccgtcgct ttatcttcct tgaagagggc gatattgcgg aaatcactcg ccgttcggta 660acccgtcgct ttatcttcct tgaagagggc gatattgcgg aaatcactcg ccgttcggta 660
aacatcttcg ataaaactgg cgcggaagta aaacgtcagg atatcgaatc caatctgcaa 720aacatcttcg ataaaactgg cgcggaagta aaacgtcagg atatcgaatc caatctgcaa 720
tatgacgcgg gcgataaagg catttaccgt cactacatgc agaaagagat ctacgaacag 780tatgacgcgg gcgataaagg catttaccgt cactacatgc agaaagagat ctacgaacag 780
ccgaacgcga tcaaaaacac ccttaccgga cgcatcagcc acggtcaggt tgatttaagc 840ccgaacgcga tcaaaaacac ccttaccgga cgcatcagcc acggtcaggt tgatttaagc 840
gagctgggac cgaacgccga cgaactgctg tcgaaggttg agcatattca gatcctcgcc 900gagctgggac cgaacgccga cgaactgctg tcgaaggttg agcatattca gatcctcgcc 900
tgtggtactt cttataactc cggtatggtt tcccgctact ggtttgaatc gctagcaggt 960tgtggtactt cttataactc cggtatggtt tcccgctact ggtttgaatc gctagcaggt 960
attccgtgcg acgtcgaaat cgcctctgaa ttccgctatc gcaaatctgc cgtgcgtcgt 1020attccgtgcg acgtcgaaat cgcctctgaa ttccgctatc gcaaatctgc cgtgcgtcgt 1020
aacagcctga tgatcacctt gtcacagtct ggcgaaaccg cggataccct ggctggcctg 1080aacagcctga tgatcacctt gtcacagtct ggcgaaaccg cggataccct ggctggcctg 1080
cgtctgtcga aagagctggg ttaccttggt tcactggcaa tctgtaacgt tccgggttct 1140cgtctgtcga aagagctggg ttaccttggt tcactggcaa tctgtaacgt tccgggttct 1140
tctctggtgc gcgaatccga tctggcgcta atgaccaacg cgggtacaga aatcggcgtg 1200tctctggtgc gcgaatccga tctggcgcta atgaccaacg cgggtacaga aatcggcgtg 1200
gcatccacta aagcattcac cactcagtta actgtgctgt tgatgctggt ggcgaagctg 1260gcatccacta aagcattcac cactcagtta actgtgctgt tgatgctggt ggcgaagctg 1260
tctcgcctga aaggtctgga tgcctccatt gaacatgaca tcgtgcatgg tctgcaggcg 1320tctcgcctga aaggtctgga tgcctccatt gaacatgaca tcgtgcatgg tctgcaggcg 1320
ctgccgagcc gtattgagca gatgctgtct caggacaaac gcattgaagc gctggcagaa 1380ctgccgagcc gtattgagca gatgctgtct caggacaaac gcattgaagc gctggcagaa 1380
gatttctctg acaaacatca cgcgctgttc ctgggccgtg gcgatcagta cccaatcgcg 1440gatttctctg acaaacatca cgcgctgttc ctgggccgtg gcgatcagta cccaatcgcg 1440
ctggaaggcg cattgaagtt gaaagagatc tcttacattc acgctgaagc ctacgctgct 1500ctggaaggcg cattgaagtt gaaagagatc tcttacattc acgctgaagc ctacgctgct 1500
ggcgaactga aacacggtcc gctggcgcta attgatgccg atatgccggt tattgttgtt 1560ggcgaactga aacacggtcc gctggcgcta attgatgccg atatgccggt tattgttgtt 1560
gcaccgaaca acgaattgct ggaaaaactg aaatccaaca ttgaagaagt tcgcgcgcgt 1620gcaccgaaca acgaattgct ggaaaaactg aaatccaaca ttgaagaagt tcgcgcgcgt 1620
ggcggtcagt tgtatgtctt cgccgatcag gatgcgggtt ttgtaagtag cgataacatg 1680ggcggtcagt tgtatgtctt cgccgatcag gatgcgggtt ttgtaagtag cgataacatg 1680
cacatcatcg agatgccgca tgtggaagag gtgattgcac cgatcttcta caccgttccg 1740cacatcatcg agatgccgca tgtggaagag gtgattgcac cgatcttcta caccgttccg 1740
ctgcagctgc tggcttacca tgtcgcgctg atcaaaggca ccgacgttga ccagccgcgt 1800ctgcagctgc tggcttacca tgtcgcgctg atcaaaggca ccgacgttga ccagccgcgt 1800
aacctggcaa aatcggttac ggttgagtaa 1830aacctggcaa aatcggttac ggttgagtaa 1830
<210> 69<210> 69
<211> 609<211> 609
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 69<400> 69
Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Lys Ile Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Lys Ile
1 5 10 15 1 5 10 15
Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala
20 25 30 20 25 30
Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg
35 40 45 35 40 45
Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu
50 55 60 50 55 60
His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu
65 70 75 80 65 70 75 80
Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val
85 90 95 85 90 95
Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu
100 105 110 100 105 110
Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile
115 120 125 115 120 125
Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu
130 135 140 130 135 140
Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val
145 150 155 160 145 150 155 160
Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly
165 170 175 165 170 175
Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser
180 185 190 180 185 190
Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu
195 200 205 195 200 205
Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp
210 215 220 210 215 220
Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln
225 230 235 240 225 230 235 240
Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu
245 250 255 245 250 255
Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile
260 265 270 260 265 270
Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu
275 280 285 275 280 285
Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser
290 295 300 290 295 300
Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly
305 310 315 320 305 310 315 320
Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser
325 330 335 325 330 335
Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu
340 345 350 340 345 350
Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr
355 360 365 355 360 365
Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg
370 375 380 370 375 380
Glu Ser Val Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val Glu Ser Val Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val
385 390 395 400 385 390 395 400
Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu
405 410 415 405 410 415
Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His
420 425 430 420 425 430
Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met
435 440 445 435 440 445
Leu Pro Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp Leu Pro Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp
450 455 460 450 455 460
Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala
465 470 475 480 465 470 475 480
Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu
485 490 495 485 490 495
Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp
500 505 510 500 505 510
Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Gly Leu Leu Glu Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Gly Leu Leu Glu
515 520 525 515 520 525
Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu
530 535 540 530 535 540
Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met
545 550 555 560 545 550 555 560
His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe
565 570 575 565 570 575
Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys
580 585 590 580 585 590
Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val
595 600 605 595 600 605
Glu Glu
<210> 70<210> 70
<211> 1830<211> 1830
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 70<400> 70
atgtgcggta tcgttggtgc tatcgcacag cgtgatgtag cgaaaatcct cctggaaggt 60atgtgcggta tcgttggtgc tatcgcacag cgtgatgtag cgaaaatcct cctggaaggt 60
ctgcgtcgtc tcgaataccg tggttacgac tctgccggtc tggcagtagt ggatgcagaa 120ctgcgtcgtc tcgaataccg tggttacgac tctgccggtc tggcagtagt ggatgcagaa 120
ggtcacatga ctcgtctgcg tcgtctgggt aaagtgcaga tgctcgcgca ggcggcggaa 180ggtcacatga ctcgtctgcg tcgtctgggt aaagtgcaga tgctcgcgca ggcggcggaa 180
gaacacccac tccacggtgg tacgggtatc gcacacactc gttgggcaac ccacggtgaa 240gaacacccac tccacggtgg tacgggtatc gcacacactc gttgggcaac ccacggtgaa 240
ccgtctgagg tcaacgcaca cccgcatgtt agcgagcaca tcgtagtcgt tcacaacggt 300ccgtctgagg tcaacgcaca cccgcatgtt agcgagcaca tcgtagtcgt tcacaacggt 300
atcatcgaga accacgaacc actccgtgag gaactcaaag cccgtggtta caccttcgta 360atcatcgaga accacgaacc actccgtgag gaactcaaag cccgtggtta caccttcgta 360
agcgaaaccg acacggaagt tatcgcccac ctcgttaact gggaactcaa acagggtggt 420agcgaaaccg acacggaagt tatcgcccac ctcgttaact gggaactcaa acagggtggt 420
actctgcgtg aagcagttct gcgtgccatt ccacagctgc gtggtgcata cggtaccgtg 480actctgcgtg aagcagttct gcgtgccatt ccacagctgc gtggtgcata cggtaccgtg 480
atcatggact ctcgtcatcc ggataccctg ctcgccgcac gttctggttc tccactcgtt 540atcatggact ctcgtcatcc ggataccctg ctcgccgcac gttctggttc tccactcgtt 540
atcggtctgg gtatgggtga gaacttcatc gcctctgatc agctggccct gctcccagtt 600atcggtctgg gtatgggtga gaacttcatc gcctctgatc agctggccct gctcccagtt 600
acccgtcgct tcatcttcct ggaagagggt gacatcgccg aaatcacccg tcgttccgtt 660acccgtcgct tcatcttcct ggaagagggt gacatcgccg aaatcacccg tcgttccgtt 660
aacatcttcg acaaaacggg tgcggaagtt aaacgtcagg acatcgagtc taacctgcag 720aacatcttcg acaaaacggg tgcggaagtt aaacgtcagg acatcgagtc taacctgcag 720
tatgacgctg gtgacaaagg catctaccgt cactacatgc agaaagagat ctacgaacag 780tatgacgctg gtgacaaagg catctaccgt cactacatgc agaaagagat ctacgaacag 780
ccgaacgcga tcaaaaacac cctgaccggt cgtatctctc acggtcaggt tgacctgtct 840ccgaacgcga tcaaaaacac cctgaccggt cgtatctctc acggtcaggt tgacctgtct 840
gagctgggtc caaacgcgga cgaactcctg tccaaagtcg agcacatcca gatcctggct 900gagctgggtc caaacgcgga cgaactcctg tccaaagtcg agcacatcca gatcctggct 900
tgtggtacct cttacaactc cggtatggtt tctcgttact ggttcgaatc tctggcaggt 960tgtggtacct cttacaactc cggtatggtt tctcgttact ggttcgaatc tctggcaggt 960
atcccatgcg acgttgaaat cgcctccgaa ttccgttatc gtaaatctgc ggtacgtcgt 1020atcccatgcg acgttgaaat cgcctccgaa ttccgttatc gtaaatctgc ggtacgtcgt 1020
aactccctca tgatcaccct gtctcagtct ggtgaaaccg ctgatactct ggcaggtctg 1080aactccctca tgatcaccct gtctcagtct ggtgaaaccg ctgatactct ggcaggtctg 1080
cgtctcagca aagaactggg ttacctgggt tctctggcca tctgcaacgt tccgggttct 1140cgtctcagca aagaactggg ttacctgggt tctctggcca tctgcaacgt tccgggttct 1140
agcctggttc gtgagtctgt gctggctctg atgaccaacg cgggtacgga gatcggtgtt 1200agcctggttc gtgagtctgt gctggctctg atgaccaacg cgggtacgga gatcggtgtt 1200
gcctctacca aagcgttcac tacccagctc actgtcctgc tgatgctggt tgccaaactg 1260gcctctacca aagcgttcac tacccagctc actgtcctgc tgatgctggt tgccaaactg 1260
tctcgtctca aaggcctcga cgctagcatc gaacacgaca tcgtacacgg tctgcaggcc 1320tctcgtctca aaggcctcga cgctagcatc gaacacgaca tcgtacacgg tctgcaggcc 1320
ctcccatctc gtatcgagca gatgctgccg caggacaaac gtatcgaagc actggcagaa 1380ctcccatctc gtatcgagca gatgctgccg caggacaaac gtatcgaagc actggcagaa 1380
gacttcagcg acaaacacca cgcgctgttt ctgggtcgtg gtgaccagta cccaattgcg 1440gacttcagcg acaaacacca cgcgctgttt ctgggtcgtg gtgaccagta cccaattgcg 1440
ctggaaggtg ccctgaaact gaaagagatc agctacatcc atgcagaggc atacgcagcg 1500ctggaaggtg ccctgaaact gaaagagatc agctacatcc atgcagaggc atacgcagcg 1500
ggtgagctga aacatggtcc actggccctg atcgacgcag atatgccggt tattgtggtt 1560ggtgagctga aacatggtcc actggccctg atcgacgcag atatgccggt tattgtggtt 1560
gctccgaaca acggcctgct ggagaaactg aaatccaaca tcgaggaagt acgtgcgcgt 1620gctccgaaca acggcctgct ggagaaactg aaatccaaca tcgaggaagt acgtgcgcgt 1620
ggtggtcagc tgtacgtgtt tgctgaccag gacgcgggtt tcgtttccag cgacaacatg 1680ggtggtcagc tgtacgtgtt tgctgaccag gacgcgggtt tcgtttccag cgacaacatg 1680
cacatcatcg aaatgccgca tgttgaagag gtaatcgcgc caatcttcta caccgtaccg 1740cacatcatcg aaatgccgca tgttgaagag gtaatcgcgc caatcttcta caccgtaccg 1740
ctgcagctgc tggcgtacca tgtagccctg atcaaaggta cggacgttga ccagccgcgt 1800ctgcagctgc tggcgtacca tgtagccctg atcaaaggta cggacgttga ccagccgcgt 1800
aacctggcga aatccgtgac cgtggaataa 1830aacctggcga aatccgtgac cgtggaataa 1830
<210> 71<210> 71
<211> 445<211> 445
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 71<400> 71
Met Ser Asn Arg Lys Tyr Phe Gly Thr Asp Gly Ile Arg Gly Arg Val Met Ser Asn Arg Lys Tyr Phe Gly Thr Asp Gly Ile Arg Gly Arg Val
1 5 10 15 1 5 10 15
Gly Asp Ala Pro Ile Thr Pro Asp Phe Val Leu Lys Leu Gly Trp Ala Gly Asp Ala Pro Ile Thr Pro Asp Phe Val Leu Lys Leu Gly Trp Ala
20 25 30 20 25 30
Ala Gly Lys Val Leu Ala Arg His Gly Ser Arg Lys Ile Ile Ile Gly Ala Gly Lys Val Leu Ala Arg His Gly Ser Arg Lys Ile Ile Ile Gly
35 40 45 35 40 45
Lys Asp Thr Arg Ile Ser Gly Tyr Met Leu Glu Ser Ala Leu Glu Ala Lys Asp Thr Arg Ile Ser Gly Tyr Met Leu Glu Ser Ala Leu Glu Ala
50 55 60 50 55 60
Gly Leu Ala Ala Ala Gly Leu Ser Ala Leu Phe Thr Gly Pro Met Pro Gly Leu Ala Ala Ala Gly Leu Ser Ala Leu Phe Thr Gly Pro Met Pro
65 70 75 80 65 70 75 80
Thr Pro Ala Val Ala Tyr Leu Thr Arg Thr Phe Arg Ala Glu Ala Gly Thr Pro Ala Val Ala Tyr Leu Thr Arg Thr Phe Arg Ala Glu Ala Gly
85 90 95 85 90 95
Ile Val Ile Ser Ala Ser His Asn Pro Phe Tyr Asp Asn Gly Ile Lys Ile Val Ile Ser Ala Ser His Asn Pro Phe Tyr Asp Asn Gly Ile Lys
100 105 110 100 105 110
Phe Phe Ser Ile Asp Gly Thr Lys Leu Pro Asp Ala Val Glu Glu Ala Phe Phe Ser Ile Asp Gly Thr Lys Leu Pro Asp Ala Val Glu Glu Ala
115 120 125 115 120 125
Ile Glu Ala Glu Met Glu Lys Glu Ile Ser Cys Val Asp Ser Ala Glu Ile Glu Ala Glu Met Glu Lys Glu Ile Ser Cys Val Asp Ser Ala Glu
130 135 140 130 135 140
Leu Gly Lys Ala Ser Arg Ile Val Asp Ala Ala Gly Arg Tyr Ile Glu Leu Gly Lys Ala Ser Arg Ile Val Asp Ala Ala Gly Arg Tyr Ile Glu
145 150 155 160 145 150 155 160
Phe Cys Lys Ala Thr Phe Pro Asn Glu Leu Ser Leu Ser Glu Leu Lys Phe Cys Lys Ala Thr Phe Pro Asn Glu Leu Ser Leu Ser Glu Leu Lys
165 170 175 165 170 175
Ile Val Val Asp Cys Ala Asn Gly Ala Thr Tyr His Ile Ala Pro Asn Ile Val Val Asp Cys Ala Asn Gly Ala Thr Tyr His Ile Ala Pro Asn
180 185 190 180 185 190
Val Leu Arg Glu Leu Gly Ala Asn Val Ile Ala Ile Gly Cys Glu Pro Val Leu Arg Glu Leu Gly Ala Asn Val Ile Ala Ile Gly Cys Glu Pro
195 200 205 195 200 205
Asn Gly Val Asn Ile Asn Ala Glu Val Gly Ala Thr Asp Val Arg Ala Asn Gly Val Asn Ile Asn Ala Glu Val Gly Ala Thr Asp Val Arg Ala
210 215 220 210 215 220
Leu Gln Ala Arg Val Leu Ala Glu Lys Ala Asp Leu Gly Ile Ala Phe Leu Gln Ala Arg Val Leu Ala Glu Lys Ala Asp Leu Gly Ile Ala Phe
225 230 235 240 225 230 235 240
Asp Gly Asp Gly Asp Arg Val Ile Met Val Asp His Glu Gly Asn Lys Asp Gly Asp Gly Asp Arg Val Ile Met Val Asp His Glu Gly Asn Lys
245 250 255 245 250 255
Val Asp Gly Asp Gln Ile Met Tyr Ile Ile Ala Arg Glu Gly Leu Arg Val Asp Gly Asp Gln Ile Met Tyr Ile Ile Ala Arg Glu Gly Leu Arg
260 265 270 260 265 270
Gln Gly Gln Leu Arg Gly Gly Ala Val Gly Thr Leu Met Ser Asn Met Gln Gly Gln Leu Arg Gly Gly Ala Val Gly Thr Leu Met Ser Asn Met
275 280 285 275 280 285
Gly Leu Glu Leu Ala Leu Lys Gln Leu Gly Ile Pro Phe Ala Arg Ala Gly Leu Glu Leu Ala Leu Lys Gln Leu Gly Ile Pro Phe Ala Arg Ala
290 295 300 290 295 300
Lys Val Gly Asp Arg Tyr Val Leu Glu Lys Met Gln Glu Lys Gly Trp Lys Val Gly Asp Arg Tyr Val Leu Glu Lys Met Gln Glu Lys Gly Trp
305 310 315 320 305 310 315 320
Arg Ile Gly Ala Glu Asn Ser Gly His Val Ile Leu Leu Asp Lys Thr Arg Ile Gly Ala Glu Asn Ser Gly His Val Ile Leu Leu Asp Lys Thr
325 330 335 325 330 335
Thr Thr Gly Asp Gly Ile Val Ala Gly Leu Gln Val Leu Ala Ala Met Thr Thr Gly Asp Gly Ile Val Ala Gly Leu Gln Val Leu Ala Ala Met
340 345 350 340 345 350
Ala Arg Asn His Met Ser Leu His Asp Leu Cys Ser Gly Met Lys Met Ala Arg Asn His Met Ser Leu His Asp Leu Cys Ser Gly Met Lys Met
355 360 365 355 360 365
Phe Pro Gln Ile Leu Val Asn Val Arg Tyr Thr Ala Gly Ser Gly Asp Phe Pro Gln Ile Leu Val Asn Val Arg Tyr Thr Ala Gly Ser Gly Asp
370 375 380 370 375 380
Pro Leu Glu His Glu Ser Val Lys Ala Val Thr Ala Glu Val Glu Ala Pro Leu Glu His Glu Ser Val Lys Ala Val Thr Ala Glu Val Glu Ala
385 390 395 400 385 390 395 400
Ala Leu Gly Asn Arg Gly Arg Val Leu Leu Arg Lys Ser Gly Thr Glu Ala Leu Gly Asn Arg Gly Arg Val Leu Leu Arg Lys Ser Gly Thr Glu
405 410 415 405 410 415
Pro Leu Ile Arg Val Met Val Glu Gly Glu Asp Glu Ala Gln Val Thr Pro Leu Ile Arg Val Met Val Glu Gly Glu Asp Glu Ala Gln Val Thr
420 425 430 420 425 430
Glu Phe Ala His Arg Ile Ala Asp Ala Val Lys Ala Val Glu Phe Ala His Arg Ile Ala Asp Ala Val Lys Ala Val
435 440 445 435 440 445
<210> 72<210> 72
<211> 1338<211> 1338
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 72<400> 72
atgagtaatc gtaaatattt cggtaccgat gggattcgtg gtcgtgtagg ggatgcgccg 60atgagtaatc gtaaatattt cggtaccgat gggattcgtg gtcgtgtagg ggatgcgccg 60
atcacacctg attttgtgct taagctgggt tgggccgcgg gtaaagtgct ggcgcgccac 120atcacacctg attttgtgct taagctgggt tgggccgcgg gtaaagtgct ggcgcgccac 120
ggctcccgta agattattat tggtaaagac acgcgtattt ctggctatat gctggagtca 180ggctcccgta agattattat tggtaaagac acgcgtattt ctggctatat gctggagtca 180
gcactggaag cgggtctggc ggcagcgggc ctttccgcac tcttcactgg cccgatgcca 240gcactggaag cgggtctggc ggcagcgggc ctttccgcac tcttcactgg cccgatgcca 240
acaccggccg tggcttatct gacgcgtacc ttccgcgcag aggccggaat tgtgatatct 300acaccggccg tggcttatct gacgcgtacc ttccgcgcag aggccggaat tgtgatatct 300
gcatcgcata acccgttcta cgataatggc attaaattct tctctatcga cggcaccaaa 360gcatcgcata acccgttcta cgataatggc attaaattct tctctatcga cggcaccaaa 360
ctgccggatg cggtagaaga ggccatcgaa gcggaaatgg aaaaggagat cagctgcgtt 420ctgccggatg cggtagaaga ggccatcgaa gcggaaatgg aaaaggagat cagctgcgtt 420
gattcggcag aactgggtaa agccagccgt atcgttgatg ccgcgggtcg ctatatcgag 480gattcggcag aactgggtaa agccagccgt atcgttgatg ccgcgggtcg ctatatcgag 480
ttttgcaaag ccacgttccc gaacgaactt agcctcagtg aactgaagat tgtggtggat 540ttttgcaaag ccacgttccc gaacgaactt agcctcagtg aactgaagat tgtggtggat 540
tgtgcaaacg gtgcgactta tcacatcgcg ccgaacgtgc tgcgcgaact gggggcgaac 600tgtgcaaacg gtgcgactta tcacatcgcg ccgaacgtgc tgcgcgaact gggggcgaac 600
gttatcgcta tcggttgtga gccaaacggt gtaaacatca atgccgaagt gggggctacc 660gttatcgcta tcggttgtga gccaaacggt gtaaacatca atgccgaagt gggggctacc 660
gacgttcgcg cgctccaggc tcgtgtgctg gctgaaaaag cggatctcgg tattgccttc 720gacgttcgcg cgctccaggc tcgtgtgctg gctgaaaaag cggatctcgg tattgccttc 720
gacggcgatg gcgatcgcgt gattatggtt gaccatgaag gcaataaagt cgatggcgat 780gacggcgatg gcgatcgcgt gattatggtt gaccatgaag gcaataaagt cgatggcgat 780
cagatcatgt atatcatcgc gcgtgaaggt cttcgtcagg gccagctgcg tggtggcgct 840cagatcatgt atatcatcgc gcgtgaaggt cttcgtcagg gccagctgcg tggtggcgct 840
gtgggtacat tgatgagcaa catggggctt gaactggcgc tgaaacagtt aggaattcca 900gtgggtacat tgatgagcaa catggggctt gaactggcgc tgaaacagtt aggaattcca 900
tttgcgcgcg cgaaagtggg tgaccgctac gtactggaaa aaatgcagga gaaaggctgg 960tttgcgcgcg cgaaagtggg tgaccgctac gtactggaaa aaatgcagga gaaaggctgg 960
cgtatcggtg cagagaattc cggtcatgtg atcctgctgg ataaaactac taccggtgac 1020cgtatcggtg cagagaattc cggtcatgtg atcctgctgg ataaaactac taccggtgac 1020
ggcatcgttg ctggcttgca ggtgctggcg gcgatggcac gtaaccatat gagcctgcac 1080ggcatcgttg ctggcttgca ggtgctggcg gcgatggcac gtaaccatat gagcctgcac 1080
gacctttgca gcggcatgaa aatgttcccg cagattctgg ttaacgtacg ttacaccgca 1140gacctttgca gcggcatgaa aatgttcccg cagattctgg ttaacgtacg ttacaccgca 1140
ggtagcggcg atccacttga gcatgagtca gttaaagccg tgaccgcaga ggttgaagct 1200ggtagcggcg atccacttga gcatgagtca gttaaagccg tgaccgcaga ggttgaagct 1200
gcgctgggca accgtggacg cgtgttgctg cgtaaatccg gcaccgaacc gttaattcgc 1260gcgctgggca accgtggacg cgtgttgctg cgtaaatccg gcaccgaacc gttaattcgc 1260
gtgatggtgg aaggcgaaga cgaagcgcag gtgactgaat ttgcacaccg catcgccgat 1320gtgatggtgg aaggcgaaga cgaagcgcag gtgactgaat ttgcacaccg catcgccgat 1320
gcagtaaaag ccgtttaa 1338gcagtaaaag ccgtttaa 1338
<210> 73<210> 73
<211> 456<211> 456
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 73<400> 73
Met Leu Asn Asn Ala Met Ser Val Val Ile Leu Ala Ala Gly Lys Gly Met Leu Asn Asn Ala Met Ser Val Val Ile Leu Ala Ala Gly Lys Gly
1 5 10 15 1 5 10 15
Thr Arg Met Tyr Ser Asp Leu Pro Lys Val Leu His Thr Leu Ala Gly Thr Arg Met Tyr Ser Asp Leu Pro Lys Val Leu His Thr Leu Ala Gly
20 25 30 20 25 30
Lys Ala Met Val Gln His Val Ile Asp Ala Ala Asn Glu Leu Gly Ala Lys Ala Met Val Gln His Val Ile Asp Ala Ala Asn Glu Leu Gly Ala
35 40 45 35 40 45
Ala His Val His Leu Val Tyr Gly His Gly Gly Asp Leu Leu Lys Gln Ala His Val His Leu Val Tyr Gly His Gly Gly Asp Leu Leu Lys Gln
50 55 60 50 55 60
Ala Leu Lys Asp Asp Asn Leu Asn Trp Val Leu Gln Ala Glu Gln Leu Ala Leu Lys Asp Asp Asn Leu Asn Trp Val Leu Gln Ala Glu Gln Leu
65 70 75 80 65 70 75 80
Gly Thr Gly His Ala Met Gln Gln Ala Ala Pro Phe Phe Ala Asp Asp Gly Thr Gly His Ala Met Gln Gln Ala Ala Pro Phe Phe Ala Asp Asp
85 90 95 85 90 95
Glu Asp Ile Leu Met Leu Tyr Gly Asp Val Pro Leu Ile Ser Val Glu Glu Asp Ile Leu Met Leu Tyr Gly Asp Val Pro Leu Ile Ser Val Glu
100 105 110 100 105 110
Thr Leu Gln Arg Leu Arg Asp Ala Lys Pro Gln Gly Gly Ile Gly Leu Thr Leu Gln Arg Leu Arg Asp Ala Lys Pro Gln Gly Gly Ile Gly Leu
115 120 125 115 120 125
Leu Thr Val Lys Leu Asp Asp Pro Thr Gly Tyr Gly Arg Ile Thr Arg Leu Thr Val Lys Leu Asp Asp Pro Thr Gly Tyr Gly Arg Ile Thr Arg
130 135 140 130 135 140
Glu Asn Gly Lys Val Thr Gly Ile Val Glu His Lys Asp Ala Thr Asp Glu Asn Gly Lys Val Thr Gly Ile Val Glu His Lys Asp Ala Thr Asp
145 150 155 160 145 150 155 160
Glu Gln Arg Gln Ile Gln Glu Ile Asn Thr Gly Ile Leu Ile Ala Asn Glu Gln Arg Gln Ile Gln Glu Ile Asn Thr Gly Ile Leu Ile Ala Asn
165 170 175 165 170 175
Gly Ala Asp Met Lys Arg Trp Leu Ala Lys Leu Thr Asn Asn Asn Ala Gly Ala Asp Met Lys Arg Trp Leu Ala Lys Leu Thr Asn Asn Asn Ala
180 185 190 180 185 190
Gln Gly Glu Tyr Tyr Ile Thr Asp Ile Ile Ala Leu Ala Tyr Gln Glu Gln Gly Glu Tyr Tyr Ile Thr Asp Ile Ile Ala Leu Ala Tyr Gln Glu
195 200 205 195 200 205
Gly Arg Glu Ile Val Ala Val His Pro Gln Arg Leu Ser Glu Val Glu Gly Arg Glu Ile Val Ala Val His Pro Gln Arg Leu Ser Glu Val Glu
210 215 220 210 215 220
Gly Val Asn Asn Arg Leu Gln Leu Ser Arg Leu Glu Arg Val Tyr Gln Gly Val Asn Asn Arg Leu Gln Leu Ser Arg Leu Glu Arg Val Tyr Gln
225 230 235 240 225 230 235 240
Ser Glu Gln Ala Glu Lys Leu Leu Leu Ala Gly Val Met Leu Arg Asp Ser Glu Gln Ala Glu Lys Leu Leu Leu Ala Gly Val Met Leu Arg Asp
245 250 255 245 250 255
Pro Ala Arg Phe Asp Leu Arg Gly Thr Leu Thr His Gly Arg Asp Val Pro Ala Arg Phe Asp Leu Arg Gly Thr Leu Thr His Gly Arg Asp Val
260 265 270 260 265 270
Glu Ile Asp Thr Asn Val Ile Ile Glu Gly Asn Val Thr Leu Gly His Glu Ile Asp Thr Asn Val Ile Ile Glu Gly Asn Val Thr Leu Gly His
275 280 285 275 280 285
Arg Val Lys Ile Gly Thr Gly Cys Val Ile Lys Asn Ser Val Ile Gly Arg Val Lys Ile Gly Thr Gly Cys Val Ile Lys Asn Ser Val Ile Gly
290 295 300 290 295 300
Asp Asp Cys Glu Ile Ser Pro Tyr Thr Val Val Glu Asp Ala Asn Leu Asp Asp Cys Glu Ile Ser Pro Tyr Thr Val Val Glu Asp Ala Asn Leu
305 310 315 320 305 310 315 320
Ala Ala Ala Cys Thr Ile Gly Pro Phe Ala Arg Leu Arg Pro Gly Ala Ala Ala Ala Cys Thr Ile Gly Pro Phe Ala Arg Leu Arg Pro Gly Ala
325 330 335 325 330 335
Glu Leu Leu Glu Gly Ala His Val Gly Asn Phe Val Glu Met Lys Lys Glu Leu Leu Glu Gly Ala His Val Gly Asn Phe Val Glu Met Lys Lys
340 345 350 340 345 350
Ala Arg Leu Gly Lys Gly Ser Lys Ala Gly His Leu Thr Tyr Leu Gly Ala Arg Leu Gly Lys Gly Ser Lys Ala Gly His Leu Thr Tyr Leu Gly
355 360 365 355 360 365
Asp Ala Glu Ile Gly Asp Asn Val Asn Ile Gly Ala Gly Thr Ile Thr Asp Ala Glu Ile Gly Asp Asn Val Asn Ile Gly Ala Gly Thr Ile Thr
370 375 380 370 375 380
Cys Asn Tyr Asp Gly Ala Asn Lys Phe Lys Thr Ile Ile Gly Asp Asp Cys Asn Tyr Asp Gly Ala Asn Lys Phe Lys Thr Ile Ile Gly Asp Asp
385 390 395 400 385 390 395 400
Val Phe Val Gly Ser Asp Thr Gln Leu Val Ala Pro Val Thr Val Gly Val Phe Val Gly Ser Asp Thr Gln Leu Val Ala Pro Val Thr Val Gly
405 410 415 405 410 415
Lys Gly Ala Thr Ile Ala Ala Gly Thr Thr Val Thr Arg Asn Val Gly Lys Gly Ala Thr Ile Ala Ala Gly Thr Thr Val Thr Arg Asn Val Gly
420 425 430 420 425 430
Glu Asn Ala Leu Ala Ile Ser Arg Val Pro Gln Thr Gln Lys Glu Gly Glu Asn Ala Leu Ala Ile Ser Arg Val Pro Gln Thr Gln Lys Glu Gly
435 440 445 435 440 445
Trp Arg Arg Pro Val Lys Lys Lys Trp Arg Arg Pro Val Lys Lys Lys
450 455 450 455
<210> 74<210> 74
<211> 1371<211> 1371
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 74<400> 74
atgttgaata atgctatgag cgtagtgatc cttgccgcag gcaaaggcac gcgcatgtat 60atgttgaata atgctatgag cgtagtgatc cttgccgcag gcaaaggcac gcgcatgtat 60
tccgatcttc cgaaagtgct gcataccctt gccgggaaag cgatggttca gcatgtcatt 120tccgatcttc cgaaagtgct gcataccctt gccgggaaag cgatggttca gcatgtcatt 120
gatgctgcga atgaattagg cgcagcgcac gttcacctgg tgtacggtca cggcggcgat 180gatgctgcga atgaattagg cgcagcgcac gttcacctgg tgtacggtca cggcggcgat 180
ctgctaaaac aggcgctgaa agacgacaac cttaactggg tgcttcaggc agagcagctg 240ctgctaaaac aggcgctgaa agacgacaac cttaactggg tgcttcaggc agagcagctg 240
ggtacgggtc atgcaatgca gcaggccgca cctttctttg ccgatgatga agacatttta 300ggtacgggtc atgcaatgca gcaggccgca cctttctttg ccgatgatga agacatttta 300
atgctctacg gcgacgtgcc gctgatctct gtcgaaacac tccagcgtct gcgtgatgct 360atgctctacg gcgacgtgcc gctgatctct gtcgaaacac tccagcgtct gcgtgatgct 360
aaaccgcagg gtggcattgg tctgctgacg gtgaaactgg atgatccgac cggttatgga 420aaaccgcagg gtggcattgg tctgctgacg gtgaaactgg atgatccgac cggttatgga 420
cgtatcaccc gtgaaaacgg caaagttacc ggcattgttg agcacaaaga tgccaccgac 480cgtatcaccc gtgaaaacgg caaagttacc ggcattgttg agcacaaaga tgccaccgac 480
gagcagcgtc agattcagga gatcaacacc ggcattctga ttgccaacgg cgcagatatg 540gagcagcgtc agattcagga gatcaacacc ggcattctga ttgccaacgg cgcagatatg 540
aaacgctggc tggcgaagct gaccaacaat aatgctcagg gcgaatacta catcaccgac 600aaacgctggc tggcgaagct gaccaacaat aatgctcagg gcgaatacta catcaccgac 600
attattgcgc tggcgtatca ggaagggcgt gaaatcgtcg ccgttcatcc gcaacgttta 660attattgcgc tggcgtatca ggaagggcgt gaaatcgtcg ccgttcatcc gcaacgttta 660
agcgaagtag aaggcgtgaa taaccgcctg caactctccc gtctggagcg tgtttatcag 720agcgaagtag aaggcgtgaa taaccgcctg caactctccc gtctggagcg tgtttatcag 720
tccgaacagg ctgaaaaact gctgttagca ggcgttatgc tgcgcgatcc agcgcgtttt 780tccgaacagg ctgaaaaact gctgttagca ggcgttatgc tgcgcgatcc agcgcgtttt 780
gatctgcgtg gtacgctaac tcacgggcgc gatgttgaaa ttgatactaa cgttatcatc 840gatctgcgtg gtacgctaac tcacgggcgc gatgttgaaa ttgatactaa cgttatcatc 840
gagggcaacg tgactctcgg tcatcgcgtg aaaattggca ccggttgcgt gattaaaaac 900gagggcaacg tgactctcgg tcatcgcgtg aaaattggca ccggttgcgt gattaaaaac 900
agcgtgattg gcgatgattg cgaaatcagt ccgtataccg ttgtggaaga tgcgaatctg 960agcgtgattg gcgatgattg cgaaatcagt ccgtataccg ttgtggaaga tgcgaatctg 960
gcagcggcct gtaccattgg cccgtttgcc cgtttgcgtc ctggtgctga gttgctggaa 1020gcagcggcct gtaccattgg cccgtttgcc cgtttgcgtc ctggtgctga gttgctggaa 1020
ggtgctcacg tcggtaactt cgttgagatg aaaaaagcgc gtctgggtaa aggctcgaaa 1080ggtgctcacg tcggtaactt cgttgagatg aaaaaagcgc gtctgggtaa aggctcgaaa 1080
gctggtcatc tgacttacct gggcgatgcg gaaattggcg ataacgttaa catcggcgcg 1140gctggtcatc tgacttacct gggcgatgcg gaaattggcg ataacgttaa catcggcgcg 1140
ggaaccatta cctgcaacta cgatggtgcg aataaattta agaccattat cggcgacgat 1200ggaaccatta cctgcaacta cgatggtgcg aataaattta agaccattat cggcgacgat 1200
gtgtttgttg gttccgacac tcagctggtg gccccggtaa cagtaggcaa aggcgcgacc 1260gtgtttgttg gttccgacac tcagctggtg gccccggtaa cagtaggcaa aggcgcgacc 1260
attgctgcgg gtacaactgt gacgcgtaat gtcggcgaaa atgcattagc tatcagccgt 1320attgctgcgg gtacaactgt gacgcgtaat gtcggcgaaa atgcattagc tatcagccgt 1320
gtgccgcaga ctcagaaaga aggctggcgt cgtccggtaa agaaaaagtg a 1371gtgccgcaga ctcagaaaga aggctggcgt cgtccggtaa agaaaaagtg a 1371
<210> 75<210> 75
<211> 391<211> 391
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 75<400> 75
Met Lys Lys Ile Leu Tyr Val Thr Gly Ser Arg Ala Glu Tyr Gly Ile Met Lys Lys Ile Leu Tyr Val Thr Gly Ser Arg Ala Glu Tyr Gly Ile
1 5 10 15 1 5 10 15
Val Arg Arg Leu Leu Thr Met Leu Arg Glu Thr Pro Glu Ile Gln Leu Val Arg Arg Leu Leu Thr Met Leu Arg Glu Thr Pro Glu Ile Gln Leu
20 25 30 20 25 30
Asp Leu Ala Val Thr Gly Met His Cys Asp Asn Ala Tyr Gly Asn Thr Asp Leu Ala Val Thr Gly Met His Cys Asp Asn Ala Tyr Gly Asn Thr
35 40 45 35 40 45
Ile His Ile Ile Glu Gln Asp Asn Phe Asn Ile Ile Lys Val Val Asp Ile His Ile Ile Glu Gln Asp Asn Phe Asn Ile Ile Lys Val Val Asp
50 55 60 50 55 60
Ile Asn Ile Asn Thr Thr Ser His Thr His Ile Leu His Ser Met Ser Ile Asn Ile Asn Thr Thr Ser His Thr His Ile Leu His Ser Met Ser
65 70 75 80 65 70 75 80
Val Cys Leu Asn Ser Phe Gly Asp Phe Phe Ser Asn Asn Thr Tyr Asp Val Cys Leu Asn Ser Phe Gly Asp Phe Phe Ser Asn Asn Thr Tyr Asp
85 90 95 85 90 95
Ala Val Met Val Leu Gly Asp Arg Tyr Glu Ile Phe Ser Val Ala Ile Ala Val Met Val Leu Gly Asp Arg Tyr Glu Ile Phe Ser Val Ala Ile
100 105 110 100 105 110
Ala Ala Ser Met His Asn Ile Pro Leu Ile His Ile His Gly Gly Glu Ala Ala Ser Met His Asn Ile Pro Leu Ile His Ile His Gly Gly Glu
115 120 125 115 120 125
Lys Thr Leu Ala Asn Tyr Asp Glu Phe Ile Arg His Ser Ile Thr Lys Lys Thr Leu Ala Asn Tyr Asp Glu Phe Ile Arg His Ser Ile Thr Lys
130 135 140 130 135 140
Met Ser Lys Leu His Leu Thr Ser Thr Glu Glu Tyr Lys Lys Arg Val Met Ser Lys Leu His Leu Thr Ser Thr Glu Glu Tyr Lys Lys Arg Val
145 150 155 160 145 150 155 160
Ile Gln Leu Gly Glu Lys Pro Gly Ser Val Phe Asn Ile Gly Ser Leu Ile Gln Leu Gly Glu Lys Pro Gly Ser Val Phe Asn Ile Gly Ser Leu
165 170 175 165 170 175
Gly Ala Glu Asn Ala Leu Ser Leu His Leu Pro Asn Lys Gln Glu Leu Gly Ala Glu Asn Ala Leu Ser Leu His Leu Pro Asn Lys Gln Glu Leu
180 185 190 180 185 190
Glu Leu Lys Tyr Gly Ser Leu Leu Lys Arg Tyr Phe Val Val Val Phe Glu Leu Lys Tyr Gly Ser Leu Leu Lys Arg Tyr Phe Val Val Val Phe
195 200 205 195 200 205
His Pro Glu Thr Leu Ser Thr Gln Ser Val Asn Asp Gln Ile Asp Glu His Pro Glu Thr Leu Ser Thr Gln Ser Val Asn Asp Gln Ile Asp Glu
210 215 220 210 215 220
Leu Leu Ser Ala Ile Ser Phe Phe Lys Asn Thr His Asp Phe Ile Phe Leu Leu Ser Ala Ile Ser Phe Phe Lys Asn Thr His Asp Phe Ile Phe
225 230 235 240 225 230 235 240
Ile Gly Ser Asn Ala Asp Thr Gly Ser Asp Ile Ile Gln Arg Lys Val Ile Gly Ser Asn Ala Asp Thr Gly Ser Asp Ile Ile Gln Arg Lys Val
245 250 255 245 250 255
Lys Tyr Phe Cys Lys Glu Tyr Lys Phe Arg Tyr Leu Ile Ser Ile Arg Lys Tyr Phe Cys Lys Glu Tyr Lys Phe Arg Tyr Leu Ile Ser Ile Arg
260 265 270 260 265 270
Ser Glu Asp Tyr Leu Ala Met Ile Lys Tyr Ser Cys Gly Leu Ile Gly Ser Glu Asp Tyr Leu Ala Met Ile Lys Tyr Ser Cys Gly Leu Ile Gly
275 280 285 275 280 285
Asn Ser Ser Ser Gly Leu Ile Glu Val Pro Ser Leu Lys Val Ala Thr Asn Ser Ser Ser Gly Leu Ile Glu Val Pro Ser Leu Lys Val Ala Thr
290 295 300 290 295 300
Ile Asn Ile Gly Asp Arg Gln Lys Gly Arg Val Arg Gly Ala Ser Val Ile Asn Ile Gly Asp Arg Gln Lys Gly Arg Val Arg Gly Ala Ser Val
305 310 315 320 305 310 315 320
Ile Asp Val Pro Val Glu Lys Asn Ala Ile Val Arg Gly Ile Asn Ile Ile Asp Val Pro Val Glu Lys Asn Ala Ile Val Arg Gly Ile Asn Ile
325 330 335 325 330 335
Ser Gln Asp Glu Lys Phe Ile Ser Val Val Gln Ser Ser Ser Asn Pro Ser Gln Asp Glu Lys Phe Ile Ser Val Val Gln Ser Ser Ser Asn Pro
340 345 350 340 345 350
Tyr Phe Lys Glu Asn Ala Leu Ile Asn Ala Val Arg Ile Ile Lys Asp Tyr Phe Lys Glu Asn Ala Leu Ile Asn Ala Val Arg Ile Ile Lys Asp
355 360 365 355 360 365
Phe Ile Lys Ser Lys Asn Lys Asp Tyr Lys Asp Phe Tyr Asp Ile Pro Phe Ile Lys Ser Lys Asn Lys Asp Tyr Lys Asp Phe Tyr Asp Ile Pro
370 375 380 370 375 380
Glu Cys Thr Thr Ser Tyr Asp Glu Cys Thr Thr Ser Tyr Asp
385 390 385 390
<210> 76<210> 76
<211> 1176<211> 1176
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 76<400> 76
atgaaaaaaa tattatacgt aactggatct agagctgaat atggaatagt tcggagactt 60atgaaaaaaa tattatacgt aactggatct agagctgaat atggaatagt tcggagactt 60
ttgacaatgc taagagaaac tccagaaata cagcttgatt tggcagttac aggaatgcat 120ttgacaatgc taagagaaac tccagaaata cagcttgatt tggcagttac aggaatgcat 120
tgtgataatg cgtatggaaa tacaatacat attatagaac aagataattt taatattatc 180tgtgataatg cgtatggaaa tacaatacat attatagaac aagataattt taatattatc 180
aaggttgtgg atataaatat caatacaact tcacatactc acattctcca ttcaatgagt 240aaggttgtgg atataaatat caatacaact tcacatactc acattctcca ttcaatgagt 240
gtttgcctca attcgtttgg tgattttttt tcaaataaca catatgatgc ggttatggtt 300gtttgcctca attcgtttgg tgattttttt tcaaataaca catatgatgc ggttatggtt 300
ttaggcgata gatatgaaat attttcagtc gctatcgcag catcaatgca taatattcca 360ttaggcgata gatatgaaat attttcagtc gctatcgcag catcaatgca taatattcca 360
ttaattcata ttcatggtgg tgaaaagaca ttagctaatt atgatgagtt tattaggcat 420ttaattcata ttcatggtgg tgaaaagaca ttagctaatt atgatgagtt tattaggcat 420
tcaattacta aaatgagtaa actccatctt acttctacag aagagtataa aaaacgagta 480tcaattacta aaatgagtaa actccatctt acttctacag aagagtataa aaaacgagta 480
attcaactag gtgaaaagcc tggtagtgtg tttaatattg gttctcttgg tgcagaaaat 540attcaactag gtgaaaagcc tggtagtgtg tttaatattg gttctcttgg tgcagaaaat 540
gctctttcat tgcatttacc aaataagcag gagttggaac taaaatatgg ttcactgtta 600gctctttcat tgcatttacc aaataagcag gagttggaac taaaatatgg ttcactgtta 600
aaacggtact ttgttgtagt attccatcct gaaacacttt ccacgcagtc ggttaatgat 660aaacggtact ttgttgtagt attccatcct gaaacacttt ccacgcagtc ggttaatgat 660
caaatagatg agttattgtc agcgatttct ttttttaaaa atactcacga ctttattttt 720caaatagatg agttattgtc agcgatttct ttttttaaaa atactcacga ctttattttt 720
attggcagta acgctgacac tggttctgat ataattcaga gaaaagtaaa atatttttgc 780attggcagta acgctgacac tggttctgat ataattcaga gaaaagtaaa atatttttgc 780
aaagagtata agttcagata tttgatttct attcgttcag aagattattt ggcaatgatt 840aaagagtata agttcagata tttgatttct attcgttcag aagattattt ggcaatgatt 840
aaatactctt gtgggctaat tgggaactcc tcctctggtt taattgaggt tccatcttta 900aaatactctt gtgggctaat tgggaactcc tcctctggtt taattgaggt tccatcttta 900
aaagttgcaa caattaacat tggtgatagg cagaaaggcc gtgttcgtgg agccagtgta 960aaagttgcaa caattaacat tggtgatagg cagaaaggcc gtgttcgtgg agccagtgta 960
atagatgtac ccgttgaaaa aaatgcaatc gtcagaggga taaatatatc tcaagatgaa 1020atagatgtac ccgttgaaaa aaatgcaatc gtcagaggga taaatatatc tcaagatgaa 1020
aaatttatta gtgttgtaca gtcatctagt aatccttatt ttaaagaaaa tgctttaatt 1080aaatttatta gtgttgtaca gtcatctagt aatccttatt ttaaagaaaa tgctttaatt 1080
aatgctgtta gaattattaa ggattttatt aaatcaaaaa ataaagatta caaagatttt 1140aatgctgtta gaattattaa ggattttatt aaatcaaaaa ataaagatta caaagatttt 1140
tatgacatcc cggaatgtac caccagttat gactag 1176tatgacatcc cggaatgtac caccagttat gactag 1176
<210> 77<210> 77
<211> 159<211> 159
<212> Белок<212> Protein
<213> Saccharomyces cerevisiae<213> Saccharomyces cerevisiae
<400> 77<400> 77
Met Ser Leu Pro Asp Gly Phe Tyr Ile Arg Arg Met Glu Glu Gly Asp Met Ser Leu Pro Asp Gly Phe Tyr Ile Arg Arg Met Glu Glu Gly Asp
1 5 10 15 1 5 10 15
Leu Glu Gln Val Thr Glu Thr Leu Lys Val Leu Thr Thr Val Gly Thr Leu Glu Gln Val Thr Glu Thr Leu Lys Val Leu Thr Thr Val Gly Thr
20 25 30 20 25 30
Ile Thr Pro Glu Ser Phe Ser Lys Leu Ile Lys Tyr Trp Asn Glu Ala Ile Thr Pro Glu Ser Phe Ser Lys Leu Ile Lys Tyr Trp Asn Glu Ala
35 40 45 35 40 45
Thr Val Trp Asn Asp Asn Glu Asp Lys Lys Ile Met Gln Tyr Asn Pro Thr Val Trp Asn Asp Asn Glu Asp Lys Lys Ile Met Gln Tyr Asn Pro
50 55 60 50 55 60
Met Val Ile Val Asp Lys Arg Thr Glu Thr Val Ala Ala Thr Gly Asn Met Val Ile Val Asp Lys Arg Thr Glu Thr Val Ala Ala Thr Gly Asn
65 70 75 80 65 70 75 80
Ile Ile Ile Glu Arg Lys Ile Ile His Glu Leu Gly Leu Cys Gly His Ile Ile Ile Glu Arg Lys Ile Ile His Glu Leu Gly Leu Cys Gly His
85 90 95 85 90 95
Ile Glu Asp Ile Ala Val Asn Ser Lys Tyr Gln Gly Gln Gly Leu Gly Ile Glu Asp Ile Ala Val Asn Ser Lys Tyr Gln Gly Gln Gly Leu Gly
100 105 110 100 105 110
Lys Leu Leu Ile Asp Gln Leu Val Thr Ile Gly Phe Asp Tyr Gly Cys Lys Leu Leu Ile Asp Gln Leu Val Thr Ile Gly Phe Asp Tyr Gly Cys
115 120 125 115 120 125
Tyr Lys Ile Ile Leu Asp Cys Asp Glu Lys Asn Val Lys Phe Tyr Glu Tyr Lys Ile Ile Leu Asp Cys Asp Glu Lys Asn Val Lys Phe Tyr Glu
130 135 140 130 135 140
Lys Cys Gly Phe Ser Asn Ala Gly Val Glu Met Gln Ile Arg Lys Lys Cys Gly Phe Ser Asn Ala Gly Val Glu Met Gln Ile Arg Lys
145 150 155 145 150 155
<210> 78<210> 78
<211> 480<211> 480
<212> ДНК<212> DNA
<213> Saccharomyces cerevisiae<213> Saccharomyces cerevisiae
<400> 78<400> 78
atgagcttac ccgatggatt ttatataagg cgaatggaag agggggattt ggaacaggtc 60atgagcttac ccgatggatt ttatataagg cgaatggaag agggggattt ggaacaggtc 60
actgagacgc taaaggtttt gaccaccgtg ggcactatta cccccgaatc cttcagcaaa 120actgagacgc taaaggtttt gaccaccgtg ggcactatta cccccgaatc cttcagcaaa 120
ctcataaaat actggaatga agccacagta tggaatgata acgaagataa aaaaataatg 180ctcataaaat actggaatga agccacagta tggaatgata acgaagataa aaaaataatg 180
caatataacc ccatggtgat tgtggacaag cgcaccgaga cggttgccgc tacggggaat 240caatataacc ccatggtgat tgtggacaag cgcaccgaga cggttgccgc tacggggaat 240
atcatcatcg aaagaaagat cattcatgaa ctggggctat gtggccacat cgaggacatt 300atcatcatcg aaagaaagat cattcatgaa ctggggctat gtggccacat cgaggacatt 300
gcagtaaact ccaagtatca gggccaaggt ttgggcaagc tcttgattga tcaattggta 360gcagtaaact ccaagtatca gggccaaggt ttgggcaagc tcttgattga tcaattggta 360
actatcggct ttgactacgg ttgttataag attattttag attgcgatga gaaaaatgtc 420actatcggct ttgactacgg ttgttataag attattttag attgcgatga gaaaaatgtc 420
aaattctatg aaaaatgtgg gtttagcaac gcaggcgtgg aaatgcaaat tagaaaatag 480aaattctatg aaaaatgtgg gtttagcaac gcaggcgtgg aaatgcaaat tagaaaatag 480
<210> 79<210> 79
<211> 188<211> 188
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 79<400> 79
Met Tyr Glu Arg Tyr Ala Gly Leu Ile Phe Asp Met Asp Gly Thr Ile Met Tyr Glu Arg Tyr Ala Gly Leu Ile Phe Asp Met Asp Gly Thr Ile
1 5 10 15 1 5 10 15
Leu Asp Thr Glu Pro Thr His Arg Lys Ala Trp Arg Glu Val Leu Gly Leu Asp Thr Glu Pro Thr His Arg Lys Ala Trp Arg Glu Val Leu Gly
20 25 30 20 25 30
His Tyr Gly Leu Gln Tyr Asp Ile Gln Ala Met Ile Ala Leu Asn Gly His Tyr Gly Leu Gln Tyr Asp Ile Gln Ala Met Ile Ala Leu Asn Gly
35 40 45 35 40 45
Ser Pro Thr Trp Arg Ile Ala Gln Ala Ile Ile Glu Leu Asn Gln Ala Ser Pro Thr Trp Arg Ile Ala Gln Ala Ile Ile Glu Leu Asn Gln Ala
50 55 60 50 55 60
Asp Leu Asp Pro His Ala Leu Ala Arg Glu Lys Thr Glu Ala Val Arg Asp Leu Asp Pro His Ala Leu Ala Arg Glu Lys Thr Glu Ala Val Arg
65 70 75 80 65 70 75 80
Ser Met Leu Leu Asp Ser Val Glu Pro Leu Pro Leu Val Asp Val Val Ser Met Leu Leu Asp Ser Val Glu Pro Leu Pro Leu Val Asp Val Val
85 90 95 85 90 95
Lys Ser Trp His Gly Arg Arg Pro Met Ala Val Gly Thr Gly Ser Glu Lys Ser Trp His Gly Arg Arg Pro Met Ala Val Gly Thr Gly Ser Glu
100 105 110 100 105 110
Ser Ala Ile Ala Glu Ala Leu Leu Ala His Leu Gly Leu Arg His Tyr Ser Ala Ile Ala Glu Ala Leu Leu Ala His Leu Gly Leu Arg His Tyr
115 120 125 115 120 125
Phe Asp Ala Val Val Ala Ala Asp His Val Lys His His Lys Pro Ala Phe Asp Ala Val Val Ala Ala Asp His Val Lys His His Lys Pro Ala
130 135 140 130 135 140
Pro Asp Thr Phe Leu Leu Cys Ala Gln Arg Met Gly Val Gln Pro Thr Pro Asp Thr Phe Leu Leu Cys Ala Gln Arg Met Gly Val Gln Pro Thr
145 150 155 160 145 150 155 160
Gln Cys Val Val Phe Glu Asp Ala Asp Phe Gly Ile Gln Ala Ala Arg Gln Cys Val Val Phe Glu Asp Ala Asp Phe Gly Ile Gln Ala Ala Arg
165 170 175 165 170 175
Ala Ala Gly Met Asp Ala Val Asp Val Arg Leu Leu Ala Ala Gly Met Asp Ala Val Asp Val Arg Leu Leu
180 185 180 185
<210> 80<210> 80
<211> 199<211> 199
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 80<400> 80
Met Leu Tyr Ile Phe Asp Leu Gly Asn Val Ile Val Asp Ile Asp Phe Met Leu Tyr Ile Phe Asp Leu Gly Asn Val Ile Val Asp Ile Asp Phe
1 5 10 15 1 5 10 15
Asn Arg Val Leu Gly Ala Trp Ser Asp Leu Thr Arg Ile Pro Leu Ala Asn Arg Val Leu Gly Ala Trp Ser Asp Leu Thr Arg Ile Pro Leu Ala
20 25 30 20 25 30
Ser Leu Lys Lys Ser Phe His Met Gly Glu Ala Phe His Gln His Glu Ser Leu Lys Lys Ser Phe His Met Gly Glu Ala Phe His Gln His Glu
35 40 45 35 40 45
Arg Gly Glu Ile Ser Asp Glu Ala Phe Ala Glu Ala Leu Cys His Glu Arg Gly Glu Ile Ser Asp Glu Ala Phe Ala Glu Ala Leu Cys His Glu
50 55 60 50 55 60
Met Ala Leu Pro Leu Ser Tyr Glu Gln Phe Ser His Gly Trp Gln Ala Met Ala Leu Pro Leu Ser Tyr Glu Gln Phe Ser His Gly Trp Gln Ala
65 70 75 80 65 70 75 80
Val Phe Val Ala Leu Arg Pro Glu Val Ile Ala Ile Met His Lys Leu Val Phe Val Ala Leu Arg Pro Glu Val Ile Ala Ile Met His Lys Leu
85 90 95 85 90 95
Arg Glu Gln Gly His Arg Val Val Val Leu Ser Asn Thr Asn Arg Leu Arg Glu Gln Gly His Arg Val Val Val Leu Ser Asn Thr Asn Arg Leu
100 105 110 100 105 110
His Thr Thr Phe Trp Pro Glu Glu Tyr Pro Glu Ile Arg Asp Ala Ala His Thr Thr Phe Trp Pro Glu Glu Tyr Pro Glu Ile Arg Asp Ala Ala
115 120 125 115 120 125
Asp His Ile Tyr Leu Ser Gln Asp Leu Gly Met Arg Lys Pro Glu Ala Asp His Ile Tyr Leu Ser Gln Asp Leu Gly Met Arg Lys Pro Glu Ala
130 135 140 130 135 140
Arg Ile Tyr Gln His Val Leu Gln Ala Glu Gly Phe Ser Pro Ser Asp Arg Ile Tyr Gln His Val Leu Gln Ala Glu Gly Phe Ser Pro Ser Asp
145 150 155 160 145 150 155 160
Thr Val Phe Phe Asp Asp Asn Ala Asp Asn Ile Glu Gly Ala Asn Gln Thr Val Phe Phe Asp Asp Asn Ala Asp Asn Ile Glu Gly Ala Asn Gln
165 170 175 165 170 175
Leu Gly Ile Thr Ser Ile Leu Val Lys Asp Lys Thr Thr Ile Pro Asp Leu Gly Ile Thr Ser Ile Leu Val Lys Asp Lys Thr Thr Ile Pro Asp
180 185 190 180 185 190
Tyr Phe Ala Lys Val Leu Cys Tyr Phe Ala Lys Val Leu Cys
195 195
<210> 81<210> 81
<211> 567<211> 567
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 81<400> 81
atgtacgagc gttatgcagg tttaattttt gatatggatg gcacaatcct ggatacggag 60atgtacgagc gttatgcagg tttaattttt gatatggatg gcacaatcct ggatacggag 60
cctacgcacc gtaaagcgtg gcgcgaagta ttagggcact acggtcttca gtacgatatt 120cctacgcacc gtaaagcgtg gcgcgaagta ttagggcact acggtcttca gtacgatatt 120
caggcgatga ttgcgcttaa tggatcgccc acctggcgta ttgctcaggc aattattgag 180caggcgatga ttgcgcttaa tggatcgccc acctggcgta ttgctcaggc aattattgag 180
ctgaatcagg ccgatctcga cccgcatgcg ttagcgcgtg aaaaaacaga agcagtaaga 240ctgaatcagg ccgatctcga cccgcatgcg ttagcgcgtg aaaaaacaga agcagtaaga 240
agtatgctgc tggatagcgt cgaaccgctt cctcttgttg atgtggtgaa aagttggcat 300agtatgctgc tggatagcgt cgaaccgctt cctcttgttg atgtggtgaa aagttggcat 300
ggtcgtcgcc caatggctgt aggaacgggg agtgaaagcg ccatcgctga ggcattgctg 360ggtcgtcgcc caatggctgt aggaacgggg agtgaaagcg ccatcgctga ggcattgctg 360
gcgcacctgg gattacgcca ttattttgac gccgtcgtcg ctgccgatca cgtcaaacac 420gcgcacctgg gattacgcca ttattttgac gccgtcgtcg ctgccgatca cgtcaaacac 420
cataaacccg cgccagacac atttttgttg tgcgcgcagc gtatgggcgt gcaaccgacg 480cataaacccg cgccagacac atttttgttg tgcgcgcagc gtatgggcgt gcaaccgacg 480
cagtgtgtgg tctttgaaga tgccgatttc ggtattcagg cggcccgtgc agcaggcatg 540cagtgtgtgg tctttgaaga tgccgatttc ggtattcagg cggcccgtgc agcaggcatg 540
gacgccgtgg atgttcgctt gctgtga 567gacgccgtgg atgttcgctt gctgtga 567
<210> 82<210> 82
<211> 600<211> 600
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 82<400> 82
atgctctata tctttgattt aggtaatgtg attgtcgata tcgactttaa ccgtgtgctg 60atgctctata tctttgattt aggtaatgtg attgtcgata tcgactttaa ccgtgtgctg 60
ggagcctgga gcgatttaac gcgtattccg ctggcatcgc ttaagaagag ttttcatatg 120ggagcctgga gcgatttaac gcgtattccg ctggcatcgc ttaagaagag ttttcatatg 120
ggggaggcgt ttcatcagca tgagcgtggg gaaattagcg acgaagcgtt cgcagaggcg 180ggggaggcgt ttcatcagca tgagcgtggg gaaattagcg acgaagcgtt cgcagaggcg 180
ctgtgtcatg agatggctct accgctaagc tacgagcagt tctctcacgg ctggcaggcg 240ctgtgtcatg agatggctct accgctaagc tacgagcagt tctctcacgg ctggcaggcg 240
gtgtttgttg cgctgcgccc ggaagtgatc gccatcatgc ataaactgcg tgagcagggg 300gtgtttgttg cgctgcgccc ggaagtgatc gccatcatgc ataaactgcg tgagcagggg 300
catcgcgtgg tggtgctttc caataccaac cgcctgcata ccaccttctg gccggaagaa 360catcgcgtgg tggtgctttc caataccaac cgcctgcata ccaccttctg gccggaagaa 360
tacccggaaa ttcgtgatgc tgctgaccat atctatctgt cgcaagatct ggggatgcgc 420tacccggaaa ttcgtgatgc tgctgaccat atctatctgt cgcaagatct ggggatgcgc 420
aaacctgaag cacgaattta ccagcatgtt ttgcaggcgg aaggtttttc acccagcgat 480aaacctgaag cacgaattta ccagcatgtt ttgcaggcgg aaggtttttc acccagcgat 480
acggtctttt tcgacgataa cgccgataat atagaaggag ccaatcagct gggcattacc 540acggtctttt tcgacgataa cgccgataat atagaaggag ccaatcagct gggcattacc 540
agtattctgg tgaaagataa aaccaccatc ccggactatt tcgcgaaggt gttatgctaa 600agtattctgg tgaaagataa aaccaccatc ccggactatt tcgcgaaggt gttatgctaa 600
<210> 83<210> 83
<211> 421<211> 421
<212> Белок<212> Protein
<213> Bacteroides ovatus<213> Bacteroides ovatus
<400> 83<400> 83
Met Asp Ser Lys Asn Asn Ile Gly His Ser Ala Asp Ile Ser Leu Thr Met Asp Ser Lys Asn Asn Ile Gly His Ser Ala Asp Ile Ser Leu Thr
1 5 10 15 1 5 10 15
Ala Glu Leu Pro Ile Pro Ile Tyr Asn Gly Asn Thr Ile Met Asp Phe Ala Glu Leu Pro Ile Pro Ile Tyr Asn Gly Asn Thr Ile Met Asp Phe
20 25 30 20 25 30
Lys Lys Leu Ala Ser Leu Tyr Lys Asp Glu Leu Leu Asp Asn Val Leu Lys Lys Leu Ala Ser Leu Tyr Lys Asp Glu Leu Leu Asp Asn Val Leu
35 40 45 35 40 45
Pro Phe Trp Leu Glu His Ser Gln Asp His Glu Tyr Gly Gly Tyr Phe Pro Phe Trp Leu Glu His Ser Gln Asp His Glu Tyr Gly Gly Tyr Phe
50 55 60 50 55 60
Thr Cys Leu Asp Arg Glu Gly Lys Val Phe Asp Thr Asp Lys Phe Ile Thr Cys Leu Asp Arg Glu Gly Lys Val Phe Asp Thr Asp Lys Phe Ile
65 70 75 80 65 70 75 80
Trp Leu Gln Ser Arg Glu Val Trp Met Phe Ser Met Leu Tyr Asn Lys Trp Leu Gln Ser Arg Glu Val Trp Met Phe Ser Met Leu Tyr Asn Lys
85 90 95 85 90 95
Val Glu Lys Arg Gln Glu Trp Leu Asp Cys Ala Ile Gln Gly Gly Glu Val Glu Lys Arg Gln Glu Trp Leu Asp Cys Ala Ile Gln Gly Gly Glu
100 105 110 100 105 110
Phe Leu Lys Lys Tyr Gly His Asp Gly Asn Tyr Asn Trp Tyr Phe Ser Phe Leu Lys Lys Tyr Gly His Asp Gly Asn Tyr Asn Trp Tyr Phe Ser
115 120 125 115 120 125
Leu Asp Arg Ser Gly Arg Pro Leu Val Glu Pro Tyr Asn Ile Phe Ser Leu Asp Arg Ser Gly Arg Pro Leu Val Glu Pro Tyr Asn Ile Phe Ser
130 135 140 130 135 140
Tyr Thr Phe Ala Thr Met Ala Phe Gly Gln Leu Ser Leu Thr Thr Gly Tyr Thr Phe Ala Thr Met Ala Phe Gly Gln Leu Ser Leu Thr Thr Gly
145 150 155 160 145 150 155 160
Asn Gln Glu Tyr Ala Asp Ile Ala Lys Lys Thr Phe Asp Ile Ile Leu Asn Gln Glu Tyr Ala Asp Ile Ala Lys Lys Thr Phe Asp Ile Ile Leu
165 170 175 165 170 175
Ser Lys Val Asp Asn Pro Lys Gly Arg Trp Asn Lys Leu His Pro Gly Ser Lys Val Asp Asn Pro Lys Gly Arg Trp Asn Lys Leu His Pro Gly
180 185 190 180 185 190
Thr Arg Asn Leu Lys Asn Phe Ala Leu Pro Met Ile Leu Cys Asn Leu Thr Arg Asn Leu Lys Asn Phe Ala Leu Pro Met Ile Leu Cys Asn Leu
195 200 205 195 200 205
Ala Leu Glu Ile Glu His Leu Leu Asp Glu Thr Tyr Leu Arg Glu Thr Ala Leu Glu Ile Glu His Leu Leu Asp Glu Thr Tyr Leu Arg Glu Thr
210 215 220 210 215 220
Met Asp Thr Cys Ile His Glu Val Met Glu Val Phe Tyr Arg Pro Glu Met Asp Thr Cys Ile His Glu Val Met Glu Val Phe Tyr Arg Pro Glu
225 230 235 240 225 230 235 240
Leu Gly Gly Ile Ile Val Glu Asn Val Asp Ile Asp Gly Asn Leu Val Leu Gly Gly Ile Ile Val Glu Asn Val Asp Ile Asp Gly Asn Leu Val
245 250 255 245 250 255
Asp Cys Phe Glu Gly Arg Gln Val Thr Pro Gly His Ala Ile Glu Ala Asp Cys Phe Glu Gly Arg Gln Val Thr Pro Gly His Ala Ile Glu Ala
260 265 270 260 265 270
Met Trp Phe Ile Met Asp Leu Gly Lys Arg Leu Asn Arg Pro Glu Leu Met Trp Phe Ile Met Asp Leu Gly Lys Arg Leu Asn Arg Pro Glu Leu
275 280 285 275 280 285
Ile Glu Lys Ala Lys Glu Thr Thr Leu Thr Met Leu Asn Tyr Gly Trp Ile Glu Lys Ala Lys Glu Thr Thr Leu Thr Met Leu Asn Tyr Gly Trp
290 295 300 290 295 300
Asp Lys Gln Tyr Gly Gly Ile Tyr Tyr Phe Met Asp Arg Asn Gly Cys Asp Lys Gln Tyr Gly Gly Ile Tyr Tyr Phe Met Asp Arg Asn Gly Cys
305 310 315 320 305 310 315 320
Pro Pro Gln Gln Leu Glu Trp Asp Gln Lys Leu Trp Trp Val His Ile Pro Pro Gln Gln Leu Glu Trp Asp Gln Lys Leu Trp Trp Val His Ile
325 330 335 325 330 335
Glu Thr Leu Ile Ser Leu Leu Lys Gly Tyr Gln Leu Thr Gly Asp Lys Glu Thr Leu Ile Ser Leu Leu Lys Gly Tyr Gln Leu Thr Gly Asp Lys
340 345 350 340 345 350
Lys Cys Leu Glu Trp Phe Glu Lys Val His Asp Tyr Thr Trp Glu His Lys Cys Leu Glu Trp Phe Glu Lys Val His Asp Tyr Thr Trp Glu His
355 360 365 355 360 365
Phe Lys Asp Lys Glu Tyr Pro Glu Trp Tyr Gly Tyr Leu Asn Arg Arg Phe Lys Asp Lys Glu Tyr Pro Glu Trp Tyr Gly Tyr Leu Asn Arg Arg
370 375 380 370 375 380
Gly Glu Val Leu Leu Pro Leu Lys Gly Gly Lys Trp Lys Gly Cys Phe Gly Glu Val Leu Leu Pro Leu Lys Gly Gly Lys Trp Lys Gly Cys Phe
385 390 395 400 385 390 395 400
His Val Pro Arg Gly Leu Tyr Gln Cys Trp Lys Thr Leu Glu Glu Ile His Val Pro Arg Gly Leu Tyr Gln Cys Trp Lys Thr Leu Glu Glu Ile
405 410 415 405 410 415
Lys Asn Ile Val Ser Lys Asn Ile Val Ser
420 420
<210> 84<210> 84
<211> 391<211> 391
<212> Белок<212> Protein
<213> Synechocystis sp.<213> Synechocystis sp.
<400> 84<400> 84
Met Ile Ala His Arg Arg Gln Glu Leu Ala Gln Gln Tyr Tyr Gln Ala Met Ile Ala His Arg Arg Gln Glu Leu Ala Gln Gln Tyr Tyr Gln Ala
1 5 10 15 1 5 10 15
Leu His Gln Asp Val Leu Pro Phe Trp Glu Lys Tyr Ser Leu Asp Arg Leu His Gln Asp Val Leu Pro Phe Trp Glu Lys Tyr Ser Leu Asp Arg
20 25 30 20 25 30
Gln Gly Gly Gly Tyr Phe Thr Cys Leu Asp Arg Lys Gly Gln Val Phe Gln Gly Gly Gly Tyr Phe Thr Cys Leu Asp Arg Lys Gly Gln Val Phe
35 40 45 35 40 45
Asp Thr Asp Lys Phe Ile Trp Leu Gln Asn Arg Gln Val Trp Gln Phe Asp Thr Asp Lys Phe Ile Trp Leu Gln Asn Arg Gln Val Trp Gln Phe
50 55 60 50 55 60
Ala Val Phe Tyr Asn Arg Leu Glu Pro Lys Pro Gln Trp Leu Glu Ile Ala Val Phe Tyr Asn Arg Leu Glu Pro Lys Pro Gln Trp Leu Glu Ile
65 70 75 80 65 70 75 80
Ala Arg His Gly Ala Asp Phe Leu Ala Arg His Gly Arg Asp Gln Asp Ala Arg His Gly Ala Asp Phe Leu Ala Arg His Gly Arg Asp Gln Asp
85 90 95 85 90 95
Gly Asn Trp Tyr Phe Ala Leu Asp Gln Glu Gly Lys Pro Leu Arg Gln Gly Asn Trp Tyr Phe Ala Leu Asp Gln Glu Gly Lys Pro Leu Arg Gln
100 105 110 100 105 110
Pro Tyr Asn Val Phe Ser Asp Cys Phe Ala Ala Met Ala Phe Ser Gln Pro Tyr Asn Val Phe Ser Asp Cys Phe Ala Ala Met Ala Phe Ser Gln
115 120 125 115 120 125
Tyr Ala Leu Ala Ser Gly Ala Gln Glu Ala Lys Ala Ile Ala Leu Gln Tyr Ala Leu Ala Ser Gly Ala Gln Glu Ala Lys Ala Ile Ala Leu Gln
130 135 140 130 135 140
Ala Tyr Asn Asn Val Leu Arg Arg Gln His Asn Pro Lys Gly Gln Tyr Ala Tyr Asn Asn Val Leu Arg Arg Gln His Asn Pro Lys Gly Gln Tyr
145 150 155 160 145 150 155 160
Glu Lys Ser Tyr Pro Gly Thr Arg Pro Leu Lys Ser Leu Ala Val Pro Glu Lys Ser Tyr Pro Gly Thr Arg Pro Leu Lys Ser Leu Ala Val Pro
165 170 175 165 170 175
Met Ile Leu Ala Asn Leu Thr Leu Glu Met Glu Trp Leu Leu Pro Pro Met Ile Leu Ala Asn Leu Thr Leu Glu Met Glu Trp Leu Leu Pro Pro
180 185 190 180 185 190
Thr Thr Val Glu Glu Val Leu Ala Gln Thr Val Arg Glu Val Met Thr Thr Thr Val Glu Glu Val Leu Ala Gln Thr Val Arg Glu Val Met Thr
195 200 205 195 200 205
Asp Phe Leu Asp Pro Glu Ile Gly Leu Met Arg Glu Ala Val Thr Pro Asp Phe Leu Asp Pro Glu Ile Gly Leu Met Arg Glu Ala Val Thr Pro
210 215 220 210 215 220
Thr Gly Glu Phe Val Asp Ser Phe Glu Gly Arg Leu Leu Asn Pro Gly Thr Gly Glu Phe Val Asp Ser Phe Glu Gly Arg Leu Leu Asn Pro Gly
225 230 235 240 225 230 235 240
His Gly Ile Glu Ala Met Trp Phe Met Met Asp Ile Ala Gln Arg Ser His Gly Ile Glu Ala Met Trp Phe Met Met Asp Ile Ala Gln Arg Ser
245 250 255 245 250 255
Gly Asp Arg Gln Leu Gln Glu Gln Ala Ile Ala Val Val Leu Asn Thr Gly Asp Arg Gln Leu Gln Glu Gln Ala Ile Ala Val Val Leu Asn Thr
260 265 270 260 265 270
Leu Glu Tyr Ala Trp Asp Glu Glu Phe Gly Gly Ile Phe Tyr Phe Leu Leu Glu Tyr Ala Trp Asp Glu Glu Phe Gly Gly Ile Phe Tyr Phe Leu
275 280 285 275 280 285
Asp Arg Gln Gly His Pro Pro Gln Gln Leu Glu Trp Asp Gln Lys Leu Asp Arg Gln Gly His Pro Pro Gln Gln Leu Glu Trp Asp Gln Lys Leu
290 295 300 290 295 300
Trp Trp Val His Leu Glu Thr Leu Val Ala Leu Ala Lys Gly His Gln Trp Trp Val His Leu Glu Thr Leu Val Ala Leu Ala Lys Gly His Gln
305 310 315 320 305 310 315 320
Ala Thr Gly Gln Glu Lys Cys Trp Gln Trp Phe Glu Arg Val His Asp Ala Thr Gly Gln Glu Lys Cys Trp Gln Trp Phe Glu Arg Val His Asp
325 330 335 325 330 335
Tyr Ala Trp Ser His Phe Ala Asp Pro Glu Tyr Gly Glu Trp Phe Gly Tyr Ala Trp Ser His Phe Ala Asp Pro Glu Tyr Gly Glu Trp Phe Gly
340 345 350 340 345 350
Tyr Leu Asn Arg Arg Gly Glu Val Leu Leu Asn Leu Lys Gly Gly Lys Tyr Leu Asn Arg Arg Gly Glu Val Leu Leu Asn Leu Lys Gly Gly Lys
355 360 365 355 360 365
Trp Lys Gly Cys Phe His Val Pro Arg Ala Leu Trp Leu Cys Ala Glu Trp Lys Gly Cys Phe His Val Pro Arg Ala Leu Trp Leu Cys Ala Glu
370 375 380 370 375 380
Thr Leu Gln Leu Pro Val Ser Thr Leu Gln Leu Pro Val Ser
385 390 385 390
<210> 85<210> 85
<211> 1266<211> 1266
<212> ДНК<212> DNA
<213> Bacteroides ovatus<213> Bacteroides ovatus
<400> 85<400> 85
atggatagta agaataacat tggtcattca gcagacatct ctttaactgc tgaattaccc 60atggatagta agaataacat tggtcattca gcagacatct ctttaactgc tgaattaccc 60
ataccaatct ataatggaaa tacgattatg gatttcaaaa aactggcaag tctgtacaag 120ataccaatct ataatggaaa tacgattatg gatttcaaaa aactggcaag tctgtacaag 120
gatgagctcc tggacaacgt ccttcctttc tggcttgaac attcacaaga ccatgagtat 180gatgagctcc tggacaacgt ccttcctttc tggcttgaac attcacaaga ccatgagtat 180
ggtggttact tcacctgtct ggaccgtgaa ggaaaagtat tcgatacgga taagtttatt 240ggtggttact tcacctgtct ggaccgtgaa ggaaaagtat tcgatacgga taagtttatt 240
tggctgcaaa gtcgtgaggt atggatgttc tccatgcttt acaacaaagt ggagaaacgt 300tggctgcaaa gtcgtgaggt atggatgttc tccatgcttt acaacaaagt ggagaaacgt 300
caggaatggc tagactgtgc cattcagggt ggcgaatttc taaaaaaata tggacatgac 360caggaatggc tagactgtgc cattcagggt ggcgaatttc taaaaaaata tggacatgac 360
ggcaattata actggtattt ttccctcgac cgttcgggta gaccattggt agaaccgtac 420ggcaattata actggtattt ttccctcgac cgttcgggta gaccattggt agaaccgtac 420
aatatattct cgtatacatt cgctaccatg gctttcggac agttgagcct tacaaccggt 480aatatattct cgtatacatt cgctaccatg gctttcggac agttgagcct tacaaccggt 480
aatcaggaat atgcggacat tgccaagaaa actttcgata taatcctttc caaagtggat 540aatcaggaat atgcggacat tgccaagaaa actttcgata taatcctttc caaagtggat 540
aatccgaaag ggagatggaa taagcttcat ccgggtaccc gtaatctgaa gaactttgcc 600aatccgaaag ggagatggaa taagcttcat ccgggtaccc gtaatctgaa gaactttgcc 600
ttgccaatga tcctctgtaa cttggcactg gagatagagc atttattgga tgaaacgtat 660ttgccaatga tcctctgtaa cttggcactg gagatagagc atttattgga tgaaacgtat 660
ctgcgggaaa caatggatac ttgtatccat gaagtgatgg aagttttcta tcgtcctgaa 720ctgcgggaaa caatggatac ttgtatccat gaagtgatgg aagttttcta tcgtcctgaa 720
ctcggaggta tcattgttga aaacgtggac atagacggta atttggtcga ttgttttgaa 780ctcggaggta tcattgttga aaacgtggac atagacggta atttggtcga ttgttttgaa 780
ggccgtcagg tgaccccggg acatgccatt gaagcgatgt ggtttatcat ggatctaggc 840ggccgtcagg tgaccccggg acatgccatt gaagcgatgt ggtttatcat ggatctaggc 840
aagcgtctga atcgtccgga attgatagag aaagccaaag agactactct cacgatgctt 900aagcgtctga atcgtccgga attgatagag aaagccaaag agactactct cacgatgctt 900
aattatggct gggacaagca atatggaggt atctactatt ttatggatcg taacggttgt 960aattatggct gggacaagca atatggaggt atctactatt ttatggatcg taacggttgt 960
cctccccaac aattggagtg ggaccagaaa ctctggtggg tccatatcga aacgcttatt 1020cctccccaac aattggagtg ggaccagaaa ctctggtggg tccatatcga aacgcttatt 1020
tccctgctga aaggctatca attgacggga gacaaaaaat gcttggaatg gtttgaaaag 1080tccctgctga aaggctatca attgacggga gacaaaaaat gcttggaatg gtttgaaaag 1080
gtacatgact acacttggga gcatttcaag gataaagaat atcctgaatg gtatggctac 1140gtacatgact acacttggga gcatttcaag gataaagaat atcctgaatg gtatggctac 1140
ttgaaccgaa gaggcgaagt attgctacca ctcaaaggag gaaaatggaa aggatgcttc 1200ttgaaccgaa gaggcgaagt attgctacca ctcaaaggag gaaaatggaa aggatgcttc 1200
catgtgccaa gaggactgta tcagtgctgg aaaacattag aagaaataaa aaatatagta 1260catgtgccaa gaggactgta tcagtgctgg aaaacattag aagaaataaa aaatatagta 1260
tcctaa 1266tcctaa 1266
<210> 86<210> 86
<211> 1176<211> 1176
<212> ДНК<212> DNA
<213> Synechocystis sp.<213> Synechocystis sp.
<400> 86<400> 86
atgattgccc atcgccgtca ggagttagcc cagcaatatt accaggcttt acaccaggac 60atgattgccc atcgccgtca ggagttagcc cagcaatatt accaggcttt acaccaggac 60
gtattgccct tttgggaaaa atattccctc gatcgccagg ggggcggtta ctttacctgc 120gtattgccct tttgggaaaa atattccctc gatcgccagg ggggcggtta ctttacctgc 120
ttagaccgta aaggccaggt ttttgacaca gataaattca tttggttaca aaaccgtcag 180ttagaccgta aaggccaggt ttttgacaca gataaattca tttggttaca aaaccgtcag 180
gtatggcagt ttgccgtttt ctacaaccgt ttggaaccaa aaccccaatg gttagaaatt 240gtatggcagt ttgccgtttt ctacaaccgt ttggaaccaa aaccccaatg gttagaaatt 240
gcccgccatg gtgctgattt tttagctcgc cacggccgag atcaagacgg taattggtat 300gcccgccatg gtgctgattt tttagctcgc cacggccgag atcaagacgg taattggtat 300
tttgctttgg atcaggaagg caaacccctg cgtcaaccct ataacgtttt ttccgattgc 360tttgctttgg atcaggaagg caaacccctg cgtcaaccct ataacgtttt ttccgattgc 360
ttcgccgcca tggcctttag tcaatatgcc ttagccagtg gggcgcagga agctaaagcc 420ttcgccgcca tggcctttag tcaatatgcc ttagccagtg gggcgcagga agctaaagcc 420
attgccctgc aggcctacaa taacgtccta cgccgtcagc acaatcccaa aggtcaatac 480attgccctgc aggcctacaa taacgtccta cgccgtcagc acaatcccaa aggtcaatac 480
gagaagtcct atccaggtac tagacccctc aaatccctgg cggtgccgat gattttagcc 540gagaagtcct atccaggtac tagacccctc aaatccctgg cggtgccgat gattttagcc 540
aacctcaccc tggagatgga atggttatta ccgcctacta ccgtggaaga ggtgttggcc 600aacctcaccc tggagatgga atggttatta ccgcctacta ccgtggaaga ggtgttggcc 600
caaaccgtca gagaagtgat gacggatttc ctcgacccag aaataggatt aatgcgggaa 660caaaccgtca gagaagtgat gacggatttc ctcgacccag aaataggatt aatgcgggaa 660
gcggtgaccc ccacaggaga atttgttgat agttttgaag ggcggttgct caacccagga 720gcggtgaccc ccacaggaga atttgttgat agttttgaag ggcggttgct caacccagga 720
cacggcattg aagccatgtg gttcatgatg gacattgccc aacgctccgg cgatcgccag 780cacggcattg aagccatgtg gttcatgatg gacattgccc aacgctccgg cgatcgccag 780
ttacaggagc aagccattgc agtggtgttg aacaccctgg aatatgcctg ggatgaagaa 840ttacaggagc aagccattgc agtggtgttg aacaccctgg aatatgcctg ggatgaagaa 840
tttggtggca tattttattt ccttgatcgc cagggccacc ctccccaaca actggaatgg 900tttggtggca tattttattt ccttgatcgc cagggccacc ctccccaaca actggaatgg 900
gaccaaaagc tctggtgggt acatttggaa accctggttg ccctagccaa gggccaccaa 960gaccaaaagc tctggtgggt acatttggaa accctggttg ccctagccaa gggccaccaa 960
gccactggcc aagaaaaatg ttggcaatgg tttgagcggg tccatgatta cgcctggagt 1020gccactggcc aagaaaaatg ttggcaatgg tttgagcggg tccatgatta cgcctggagt 1020
catttcgccg atcctgagta tggggaatgg tttggctacc tgaatcgccg gggagaggtg 1080catttcgccg atcctgagta tggggaatgg tttggctacc tgaatcgccg gggagaggtg 1080
ttactcaacc taaaaggggg gaaatggaaa gggtgcttcc acgtgccccg agctctgtgg 1140ttactcaacc taaaaggggg gaaatggaaa gggtgcttcc acgtgccccg agctctgtgg 1140
ctctgtgcgg aaactctcca acttccggtt agttaa 1176ctctgtgcgg aaactctcca acttccggtt agttaa 1176
<210> 87<210> 87
<211> 229<211> 229
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 87<400> 87
Met Ser Leu Leu Ala Gln Leu Asp Gln Lys Ile Ala Ala Asn Gly Gly Met Ser Leu Leu Ala Gln Leu Asp Gln Lys Ile Ala Ala Asn Gly Gly
1 5 10 15 1 5 10 15
Leu Ile Val Ser Cys Gln Pro Val Pro Asp Ser Pro Leu Asp Lys Pro Leu Ile Val Ser Cys Gln Pro Val Pro Asp Ser Pro Leu Asp Lys Pro
20 25 30 20 25 30
Glu Ile Val Ala Ala Met Ala Leu Ala Ala Glu Gln Ala Gly Ala Val Glu Ile Val Ala Ala Met Ala Leu Ala Ala Glu Gln Ala Gly Ala Val
35 40 45 35 40 45
Ala Ile Arg Ile Glu Gly Val Ala Asn Leu Gln Ala Thr Arg Ala Val Ala Ile Arg Ile Glu Gly Val Ala Asn Leu Gln Ala Thr Arg Ala Val
50 55 60 50 55 60
Val Ser Val Pro Ile Ile Gly Ile Val Lys Arg Asp Leu Glu Asp Ser Val Ser Val Pro Ile Ile Gly Ile Val Lys Arg Asp Leu Glu Asp Ser
65 70 75 80 65 70 75 80
Pro Val Arg Ile Thr Ala Tyr Ile Glu Asp Val Asp Ala Leu Ala Gln Pro Val Arg Ile Thr Ala Tyr Ile Glu Asp Val Asp Ala Leu Ala Gln
85 90 95 85 90 95
Ala Gly Ala Asp Ile Ile Ala Ile Asp Gly Thr Asp Arg Pro Arg Pro Ala Gly Ala Asp Ile Ile Ala Ile Asp Gly Thr Asp Arg Pro Arg Pro
100 105 110 100 105 110
Val Pro Val Glu Thr Leu Leu Ala Arg Ile His His His Gly Leu Leu Val Pro Val Glu Thr Leu Leu Ala Arg Ile His His His Gly Leu Leu
115 120 125 115 120 125
Ala Met Thr Asp Cys Ser Thr Pro Glu Asp Gly Leu Ala Cys Gln Lys Ala Met Thr Asp Cys Ser Thr Pro Glu Asp Gly Leu Ala Cys Gln Lys
130 135 140 130 135 140
Leu Gly Ala Glu Ile Ile Gly Thr Thr Leu Ser Gly Tyr Thr Thr Pro Leu Gly Ala Glu Ile Ile Gly Thr Thr Leu Ser Gly Tyr Thr Thr Pro
145 150 155 160 145 150 155 160
Glu Thr Pro Glu Glu Pro Asp Leu Ala Leu Val Lys Thr Leu Ser Asp Glu Thr Pro Glu Glu Pro Asp Leu Ala Leu Val Lys Thr Leu Ser Asp
165 170 175 165 170 175
Ala Gly Cys Arg Val Ile Ala Glu Gly Arg Tyr Asn Thr Pro Ala Gln Ala Gly Cys Arg Val Ile Ala Glu Gly Arg Tyr Asn Thr Pro Ala Gln
180 185 190 180 185 190
Ala Ala Asp Ala Met Arg His Gly Ala Trp Ala Val Thr Val Gly Ser Ala Ala Asp Ala Met Arg His Gly Ala Trp Ala Val Thr Val Gly Ser
195 200 205 195 200 205
Ala Ile Thr Arg Leu Glu His Ile Cys Gln Trp Tyr Asn Thr Ala Met Ala Ile Thr Arg Leu Glu His Ile Cys Gln Trp Tyr Asn Thr Ala Met
210 215 220 210 215 220
Lys Lys Ala Val Leu Lys Lys Ala Val Leu
225 225
<210> 88<210> 88
<211> 690<211> 690
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 88<400> 88
atgtcgttac ttgcacaact ggatcaaaaa atcgctgcta acggtggcct gattgtctcc 60atgtcgttac ttgcacaact ggatcaaaaa atcgctgcta acggtggcct gattgtctcc 60
tgccagccgg ttccggacag cccgctcgat aaacccgaaa tcgtcgccgc catggcatta 120tgccagccgg ttccggacag cccgctcgat aaacccgaaa tcgtcgccgc catggcatta 120
gcggcagaac aggcgggcgc ggttgccatt cgcattgaag gtgtggcaaa tctgcaagcc 180gcggcagaac aggcgggcgc ggttgccatt cgcattgaag gtgtggcaaa tctgcaagcc 180
acgcgtgcgg tggtgagcgt gccgattatt ggaattgtga aacgcgatct ggaggattct 240acgcgtgcgg tggtgagcgt gccgattatt ggaattgtga aacgcgatct ggaggattct 240
ccggtacgca tcacggccta tattgaagat gttgatgcgc tggcgcaggc gggcgcggac 300ccggtacgca tcacggccta tattgaagat gttgatgcgc tggcgcaggc gggcgcggac 300
attatcgcca ttgacggcac cgaccgcccg cgtccggtgc ctgttgaaac gctgctggca 360attatcgcca ttgacggcac cgaccgcccg cgtccggtgc ctgttgaaac gctgctggca 360
cgtattcacc atcacggttt actggcgatg accgactgct caacgccgga agacggcctg 420cgtattcacc atcacggttt actggcgatg accgactgct caacgccgga agacggcctg 420
gcatgccaaa agctgggagc cgaaattatt ggcactacgc tttctggcta taccacgcct 480gcatgccaaa agctgggagc cgaaattatt ggcactacgc tttctggcta taccacgcct 480
gaaacgccag aagagccgga tctggcgctg gtgaaaacgt tgagcgacgc cggatgtcgg 540gaaacgccag aagagccgga tctggcgctg gtgaaaacgt tgagcgacgc cggatgtcgg 540
gtgattgccg aagggcgtta caacacgcct gctcaggcgg cggatgcgat gcgccacggc 600gtgattgccg aagggcgtta caacacgcct gctcaggcgg cggatgcgat gcgccacggc 600
gcgtgggcgg tgacggtcgg ttctgcaatc acgcgtcttg agcacatttg tcagtggtac 660gcgtgggcgg tgacggtcgg ttctgcaatc acgcgtcttg agcacatttg tcagtggtac 660
aacacagcga tgaaaaaggc ggtgctatga 690aacacagcga tgaaaaaggc ggtgctatga 690
<210> 89<210> 89
<211> 346<211> 346
<212> Белок<212> Protein
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 89<400> 89
Met Lys Glu Ile Lys Ile Gln Asn Ile Ile Ile Ser Glu Glu Lys Ala Met Lys Glu Ile Lys Ile Gln Asn Ile Ile Ile Ser Glu Glu Lys Ala
1 5 10 15 1 5 10 15
Pro Leu Val Val Pro Glu Ile Gly Ile Asn His Asn Gly Ser Leu Glu Pro Leu Val Val Pro Glu Ile Gly Ile Asn His Asn Gly Ser Leu Glu
20 25 30 20 25 30
Leu Ala Lys Ile Met Val Asp Ala Ala Phe Ser Ala Gly Ala Lys Ile Leu Ala Lys Ile Met Val Asp Ala Ala Phe Ser Ala Gly Ala Lys Ile
35 40 45 35 40 45
Ile Lys His Gln Thr His Ile Val Glu Asp Glu Met Ser Lys Ala Ala Ile Lys His Gln Thr His Ile Val Glu Asp Glu Met Ser Lys Ala Ala
50 55 60 50 55 60
Lys Lys Val Ile Pro Gly Asn Ala Lys Ile Ser Ile Tyr Glu Ile Met Lys Lys Val Ile Pro Gly Asn Ala Lys Ile Ser Ile Tyr Glu Ile Met
65 70 75 80 65 70 75 80
Gln Lys Cys Ala Leu Asp Tyr Lys Asp Glu Leu Ala Leu Lys Glu Tyr Gln Lys Cys Ala Leu Asp Tyr Lys Asp Glu Leu Ala Leu Lys Glu Tyr
85 90 95 85 90 95
Thr Glu Lys Leu Gly Leu Val Tyr Leu Ser Thr Pro Phe Ser Arg Ala Thr Glu Lys Leu Gly Leu Val Tyr Leu Ser Thr Pro Phe Ser Arg Ala
100 105 110 100 105 110
Gly Ala Asn Arg Leu Glu Asp Met Gly Val Ser Ala Phe Lys Ile Gly Gly Ala Asn Arg Leu Glu Asp Met Gly Val Ser Ala Phe Lys Ile Gly
115 120 125 115 120 125
Ser Gly Glu Cys Asn Asn Tyr Pro Leu Ile Lys His Ile Ala Ala Phe Ser Gly Glu Cys Asn Asn Tyr Pro Leu Ile Lys His Ile Ala Ala Phe
130 135 140 130 135 140
Lys Lys Pro Met Ile Val Ser Thr Gly Met Asn Ser Ile Glu Ser Ile Lys Lys Pro Met Ile Val Ser Thr Gly Met Asn Ser Ile Glu Ser Ile
145 150 155 160 145 150 155 160
Lys Pro Thr Val Lys Ile Leu Leu Asp Asn Glu Ile Pro Phe Val Leu Lys Pro Thr Val Lys Ile Leu Leu Asp Asn Glu Ile Pro Phe Val Leu
165 170 175 165 170 175
Met His Thr Thr Asn Leu Tyr Pro Thr Pro His Asn Leu Val Arg Leu Met His Thr Thr Asn Leu Tyr Pro Thr Pro His Asn Leu Val Arg Leu
180 185 190 180 185 190
Asn Ala Met Leu Glu Leu Lys Lys Glu Phe Ser Cys Met Val Gly Leu Asn Ala Met Leu Glu Leu Lys Lys Glu Phe Ser Cys Met Val Gly Leu
195 200 205 195 200 205
Ser Asp His Thr Thr Asp Asn Leu Ala Cys Leu Gly Ala Val Val Leu Ser Asp His Thr Thr Asp Asn Leu Ala Cys Leu Gly Ala Val Val Leu
210 215 220 210 215 220
Gly Ala Cys Val Leu Glu Arg His Phe Thr Asp Ser Met His Arg Ser Gly Ala Cys Val Leu Glu Arg His Phe Thr Asp Ser Met His Arg Ser
225 230 235 240 225 230 235 240
Gly Pro Asp Ile Val Cys Ser Met Asp Thr Lys Ala Leu Lys Glu Leu Gly Pro Asp Ile Val Cys Ser Met Asp Thr Lys Ala Leu Lys Glu Leu
245 250 255 245 250 255
Ile Ile Gln Ser Glu Gln Met Ala Ile Ile Arg Gly Asn Asn Glu Ser Ile Ile Gln Ser Glu Gln Met Ala Ile Ile Arg Gly Asn Asn Glu Ser
260 265 270 260 265 270
Lys Lys Ala Ala Lys Gln Glu Gln Val Thr Ile Asp Phe Ala Phe Ala Lys Lys Ala Ala Lys Gln Glu Gln Val Thr Ile Asp Phe Ala Phe Ala
275 280 285 275 280 285
Ser Val Val Ser Ile Lys Asp Ile Lys Lys Gly Glu Val Leu Ser Met Ser Val Val Ser Ile Lys Asp Ile Lys Lys Gly Glu Val Leu Ser Met
290 295 300 290 295 300
Asp Asn Ile Trp Val Lys Arg Pro Gly Leu Gly Gly Ile Ser Ala Ala Asp Asn Ile Trp Val Lys Arg Pro Gly Leu Gly Gly Ile Ser Ala Ala
305 310 315 320 305 310 315 320
Glu Phe Glu Asn Ile Leu Gly Lys Lys Ala Leu Arg Asp Ile Glu Asn Glu Phe Glu Asn Ile Leu Gly Lys Lys Ala Leu Arg Asp Ile Glu Asn
325 330 335 325 330 335
Asp Ala Gln Leu Ser Tyr Glu Asp Phe Ala Asp Ala Gln Leu Ser Tyr Glu Asp Phe Ala
340 345 340 345
<210> 90<210> 90
<211> 1041<211> 1041
<212> ДНК<212> DNA
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 90<400> 90
atgaaagaaa taaaaataca aaatataatc ataagtgaag aaaaagcacc cttagtcgtg 60atgaaagaaa taaaaataca aaatataatc ataagtgaag aaaaagcacc cttagtcgtg 60
cctgaaatag gcattaatca taatggcagt ttagaactag ctaaaattat ggtagatgca 120cctgaaatag gcattaatca taatggcagt ttagaactag ctaaaattat ggtagatgca 120
gcctttagcg caggtgctaa gattataaag catcaaaccc acatcgttga agatgagatg 180gcctttagcg caggtgctaa gattataaag catcaaaccc acatcgttga agatgagatg 180
agtaaggccg ctaaaaaagt aattcctggt aatgcaaaaa taagcattta tgagattatg 240agtaaggccg ctaaaaaagt aattcctggt aatgcaaaaa taagcattta tgagattatg 240
caaaaatgtg ctttagatta taaagatgag ctagcactta aagaatacac agaaaaatta 300caaaaatgtg ctttagatta taaagatgag ctagcactta aagaatacac agaaaaatta 300
ggtcttgttt atcttagcac acctttttct cgtgcaggtg caaaccgctt agaagatatg 360ggtcttgttt atcttagcac acctttttct cgtgcaggtg caaaccgctt agaagatatg 360
ggagttagtg cttttaagat tggttcaggt gagtgtaata attatccgct tattaaacac 420ggagttagtg cttttaagat tggttcaggt gagtgtaata attatccgct tattaaacac 420
atagcagcct ttaaaaagcc tatgatagtt agcacaggaa tgaatagtat tgaaagtata 480atagcagcct ttaaaaagcc tatgatagtt agcacaggaa tgaatagtat tgaaagtata 480
aaaccaactg taaaaatctt attagacaat gaaattccct ttgttttaat gcactcgacc 540aaaccaactg taaaaatctt attagacaat gaaattccct ttgttttaat gcactcgacc 540
aatctttacc caaccccgca taatcttgta agattaaacg ctatgcttga attaaaaaaa 600aatctttacc caaccccgca taatcttgta agattaaacg ctatgcttga attaaaaaaa 600
gaattttctt gcatggtagg cttaagcgac cacacaacag ataatcttgc gtgtttaggt 660gaattttctt gcatggtagg cttaagcgac cacacaacag ataatcttgc gtgtttaggt 660
gcggttgcac ttggtgcttg tgtgcttgaa agacatttta ctgatagtat gcatagaagt 720gcggttgcac ttggtgcttg tgtgcttgaa agacatttta ctgatagtat gcatagaagt 720
ggccctgata tagtttgttc tatggataca aaggctttaa aagagctaat tatccaaagt 780ggccctgata tagtttgttc tatggataca aaggctttaa aagagctaat tatccaaagt 780
gagcaaatgg ctataatgaa aggaaataat gaaagcaaaa aagcagctaa gcaagaacaa 840gagcaaatgg ctataatgaa aggaaataat gaaagcaaaa aagcagctaa gcaagaacaa 840
gttacaattg attttgcctt tgcaagcgta gttagcatta aagatattaa aaaaggcgaa 900gttacaattg attttgcctt tgcaagcgta gttagcatta aagatattaa aaaaggcgaa 900
gttttatcta tggacaatat ctgggttaaa agacctggac ttggtggaat tagtgcggct 960gttttatcta tggacaatat ctgggttaaa agacctggac ttggtggaat tagtgcggct 960
gaatttgaaa atattttagg caaaaaagca ttaagagata tagaaaatga tactcagtta 1020gaatttgaaa atattttagg caaaaaagca ttaagagata tagaaaatga tactcagtta 1020
agctatgagg attttgcgtg a 1041agctatgagg attttgcgtg a 1041
<210> 91<210> 91
<211> 221<211> 221
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 91<400> 91
Met Ser Leu Ala Ile Ile Pro Ala Arg Gly Gly Ser Lys Gly Ile Lys Met Ser Leu Ala Ile Ile Pro Ala Arg Gly Gly Ser Lys Gly Ile Lys
1 5 10 15 1 5 10 15
Asn Lys Asn Leu Val Leu Leu Asn Asn Lys Pro Leu Ile Tyr Tyr Thr Asn Lys Asn Leu Val Leu Leu Asn Asn Lys Pro Leu Ile Tyr Tyr Thr
20 25 30 20 25 30
Ile Lys Ala Ala Leu Asn Ala Lys Ser Ile Ser Lys Val Val Val Ser Ile Lys Ala Ala Leu Asn Ala Lys Ser Ile Ser Lys Val Val Val Ser
35 40 45 35 40 45
Ser Asp Ser Asp Glu Ile Leu Asn Tyr Ala Lys Ser Gln Asn Val Asp Ser Asp Ser Asp Glu Ile Leu Asn Tyr Ala Lys Ser Gln Asn Val Asp
50 55 60 50 55 60
Ile Leu Lys Arg Pro Ile Ser Leu Ala Gln Asp Asp Thr Thr Ser Asp Ile Leu Lys Arg Pro Ile Ser Leu Ala Gln Asp Asp Thr Thr Ser Asp
65 70 75 80 65 70 75 80
Lys Val Leu Leu His Ala Leu Lys Phe Tyr Lys Asp Tyr Glu Asp Val Lys Val Leu Leu His Ala Leu Lys Phe Tyr Lys Asp Tyr Glu Asp Val
85 90 95 85 90 95
Val Phe Leu Gln Pro Thr Ser Pro Leu Arg Thr Asn Ile His Ile Asn Val Phe Leu Gln Pro Thr Ser Pro Leu Arg Thr Asn Ile His Ile Asn
100 105 110 100 105 110
Glu Ala Phe Asn Leu Tyr Lys Asn Ser Asn Ala Asn Ala Leu Ile Ser Glu Ala Phe Asn Leu Tyr Lys Asn Ser Asn Ala Asn Ala Leu Ile Ser
115 120 125 115 120 125
Val Ser Glu Cys Asp Asn Lys Ile Leu Lys Ala Phe Val Cys Asn Asp Val Ser Glu Cys Asp Asn Lys Ile Leu Lys Ala Phe Val Cys Asn Asp
130 135 140 130 135 140
Cys Gly Asp Leu Ala Gly Ile Cys Asn Asp Glu Tyr Pro Phe Met Pro Cys Gly Asp Leu Ala Gly Ile Cys Asn Asp Glu Tyr Pro Phe Met Pro
145 150 155 160 145 150 155 160
Arg Gln Lys Leu Pro Lys Thr Tyr Met Ser Asn Gly Ala Ile Tyr Ile Arg Gln Lys Leu Pro Lys Thr Tyr Met Ser Asn Gly Ala Ile Tyr Ile
165 170 175 165 170 175
Leu Lys Ile Lys Glu Phe Leu Asn Asn Pro Ser Phe Leu Gln Ser Lys Leu Lys Ile Lys Glu Phe Leu Asn Asn Pro Ser Phe Leu Gln Ser Lys
180 185 190 180 185 190
Thr Lys His Phe Leu Met Asp Glu Ser Ser Ser Leu Asp Ile Asp Cys Thr Lys His Phe Leu Met Asp Glu Ser Ser Ser Leu Asp Ile Asp Cys
195 200 205 195 200 205
Leu Glu Asp Leu Lys Lys Val Glu Gln Ile Trp Lys Lys Leu Glu Asp Leu Lys Lys Val Glu Gln Ile Trp Lys Lys
210 215 220 210 215 220
<210> 92<210> 92
<211> 666<211> 666
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 92<400> 92
atgagcctgg ccattatccc ggcacgtggc ggttctaaag gcatcaaaaa caaaaacctg 60atgagcctgg ccattatccc ggcacgtggc ggttctaaag gcatcaaaaa caaaaacctg 60
gttctgctga acaataaacc gctgatttat tacaccatca aagcggccct gaacgccaaa 120gttctgctga acaataaacc gctgatttat tacaccatca aagcggccct gaacgccaaa 120
agtattagca aagtggttgt gagctctgat tctgatgaaa tcctgaacta cgcaaaaagt 180agtattagca aagtggttgt gagctctgat tctgatgaaa tcctgaacta cgcaaaaagt 180
cagaacgttg atatcctgaa acgtccgatc agtctggcac aggatgatac cacgagcgat 240cagaacgttg atatcctgaa acgtccgatc agtctggcac aggatgatac cacgagcgat 240
aaagtgctgc tgcatgcgct gaaattctac aaagattacg aagatgttgt gttcctgcag 300aaagtgctgc tgcatgcgct gaaattctac aaagattacg aagatgttgt gttcctgcag 300
ccgaccagcc cgctgcgtac gaatattcac atcaacgaag cgttcaacct gtacaaaaac 360ccgaccagcc cgctgcgtac gaatattcac atcaacgaag cgttcaacct gtacaaaaac 360
agcaacgcaa acgcgctgat ttctgttagt gaatgcgata acaaaatcct gaaagcgttt 420agcaacgcaa acgcgctgat ttctgttagt gaatgcgata acaaaatcct gaaagcgttt 420
gtgtgcaatg attgtggcga tctggccggt atttgtaacg atgaataccc gttcatgccg 480gtgtgcaatg attgtggcga tctggccggt atttgtaacg atgaataccc gttcatgccg 480
cgccagaaac tgccgaaaac ctatatgagc aatggtgcca tctacatcct gaaaatcaaa 540cgccagaaac tgccgaaaac ctatatgagc aatggtgcca tctacatcct gaaaatcaaa 540
gaattcctga acaacccgag cttcctgcag tctaaaacga aacatttcct gatggatgaa 600gaattcctga acaacccgag cttcctgcag tctaaaacga aacatttcct gatggatgaa 600
agtagctctc tggatattga ttgcctggaa gatctgaaaa aagtggaaca gatctggaaa 660agtagctctc tggatattga ttgcctggaa gatctgaaaa aagtggaaca gatctggaaa 660
aaataa 666aaataa 666
<210> 93<210> 93
<211> 417<211> 417
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 93<400> 93
Met Tyr Tyr Leu Lys Asn Thr Asn Phe Trp Met Phe Gly Leu Phe Phe Met Tyr Tyr Leu Lys Asn Thr Asn Phe Trp Met Phe Gly Leu Phe Phe
1 5 10 15 1 5 10 15
Phe Phe Tyr Phe Phe Ile Met Gly Ala Tyr Phe Pro Phe Phe Pro Ile Phe Phe Tyr Phe Phe Ile Met Gly Ala Tyr Phe Pro Phe Phe Pro Ile
20 25 30 20 25 30
Trp Leu His Asp Ile Asn His Ile Ser Lys Ser Asp Thr Gly Ile Ile Trp Leu His Asp Ile Asn His Ile Ser Lys Ser Asp Thr Gly Ile Ile
35 40 45 35 40 45
Phe Ala Ala Ile Ser Leu Phe Ser Leu Leu Phe Gln Pro Leu Phe Gly Phe Ala Ala Ile Ser Leu Phe Ser Leu Leu Phe Gln Pro Leu Phe Gly
50 55 60 50 55 60
Leu Leu Ser Asp Lys Leu Gly Leu Arg Lys Tyr Leu Leu Trp Ile Ile Leu Leu Ser Asp Lys Leu Gly Leu Arg Lys Tyr Leu Leu Trp Ile Ile
65 70 75 80 65 70 75 80
Thr Gly Met Leu Val Met Phe Ala Pro Phe Phe Ile Phe Ile Phe Gly Thr Gly Met Leu Val Met Phe Ala Pro Phe Phe Ile Phe Ile Phe Gly
85 90 95 85 90 95
Pro Leu Leu Gln Tyr Asn Ile Leu Val Gly Ser Ile Val Gly Gly Ile Pro Leu Leu Gln Tyr Asn Ile Leu Val Gly Ser Ile Val Gly Gly Ile
100 105 110 100 105 110
Tyr Leu Gly Phe Cys Phe Asn Ala Gly Ala Pro Ala Val Glu Ala Phe Tyr Leu Gly Phe Cys Phe Asn Ala Gly Ala Pro Ala Val Glu Ala Phe
115 120 125 115 120 125
Ile Glu Lys Val Ser Arg Arg Ser Asn Phe Glu Phe Gly Arg Ala Arg Ile Glu Lys Val Ser Arg Arg Ser Asn Phe Glu Phe Gly Arg Ala Arg
130 135 140 130 135 140
Met Phe Gly Cys Val Gly Trp Ala Leu Cys Ala Ser Ile Val Gly Ile Met Phe Gly Cys Val Gly Trp Ala Leu Cys Ala Ser Ile Val Gly Ile
145 150 155 160 145 150 155 160
Met Phe Thr Ile Asn Asn Gln Phe Val Phe Trp Leu Gly Ser Gly Cys Met Phe Thr Ile Asn Asn Gln Phe Val Phe Trp Leu Gly Ser Gly Cys
165 170 175 165 170 175
Ala Leu Ile Leu Ala Val Leu Leu Phe Phe Ala Lys Thr Asp Ala Pro Ala Leu Ile Leu Ala Val Leu Leu Phe Phe Ala Lys Thr Asp Ala Pro
180 185 190 180 185 190
Ser Ser Ala Thr Val Ala Asn Ala Val Gly Ala Asn His Ser Ala Phe Ser Ser Ala Thr Val Ala Asn Ala Val Gly Ala Asn His Ser Ala Phe
195 200 205 195 200 205
Ser Leu Lys Leu Ala Leu Glu Leu Phe Arg Gln Pro Lys Leu Trp Phe Ser Leu Lys Leu Ala Leu Glu Leu Phe Arg Gln Pro Lys Leu Trp Phe
210 215 220 210 215 220
Leu Ser Leu Tyr Val Ile Gly Val Ser Cys Thr Tyr Asp Val Phe Asp Leu Ser Leu Tyr Val Ile Gly Val Ser Cys Thr Tyr Asp Val Phe Asp
225 230 235 240 225 230 235 240
Gln Gln Phe Ala Asn Phe Phe Thr Ser Phe Phe Ala Thr Gly Glu Gln Gln Gln Phe Ala Asn Phe Phe Thr Ser Phe Phe Ala Thr Gly Glu Gln
245 250 255 245 250 255
Gly Thr Arg Val Phe Gly Tyr Val Thr Thr Met Gly Glu Leu Leu Asn Gly Thr Arg Val Phe Gly Tyr Val Thr Thr Met Gly Glu Leu Leu Asn
260 265 270 260 265 270
Ala Ser Ile Met Phe Phe Ala Pro Leu Ile Ile Asn Arg Ile Gly Gly Ala Ser Ile Met Phe Phe Ala Pro Leu Ile Ile Asn Arg Ile Gly Gly
275 280 285 275 280 285
Lys Asn Ala Leu Leu Leu Ala Gly Thr Ile Met Ser Val Arg Ile Ile Lys Asn Ala Leu Leu Leu Ala Gly Thr Ile Met Ser Val Arg Ile Ile
290 295 300 290 295 300
Gly Ser Ser Phe Ala Thr Ser Ala Leu Glu Val Val Ile Leu Lys Thr Gly Ser Ser Phe Ala Thr Ser Ala Leu Glu Val Val Ile Leu Lys Thr
305 310 315 320 305 310 315 320
Leu His Met Phe Glu Val Pro Phe Leu Leu Val Gly Cys Phe Lys Tyr Leu His Met Phe Glu Val Pro Phe Leu Leu Val Gly Cys Phe Lys Tyr
325 330 335 325 330 335
Ile Thr Ser Gln Phe Glu Val Arg Phe Ser Ala Thr Ile Tyr Leu Val Ile Thr Ser Gln Phe Glu Val Arg Phe Ser Ala Thr Ile Tyr Leu Val
340 345 350 340 345 350
Cys Phe Cys Phe Phe Lys Gln Leu Ala Met Ile Phe Met Ser Val Leu Cys Phe Cys Phe Phe Lys Gln Leu Ala Met Ile Phe Met Ser Val Leu
355 360 365 355 360 365
Ala Gly Asn Met Tyr Glu Ser Ile Gly Phe Gln Gly Ala Tyr Leu Val Ala Gly Asn Met Tyr Glu Ser Ile Gly Phe Gln Gly Ala Tyr Leu Val
370 375 380 370 375 380
Leu Gly Leu Val Ala Leu Gly Phe Thr Leu Ile Ser Val Phe Thr Leu Leu Gly Leu Val Ala Leu Gly Phe Thr Leu Ile Ser Val Phe Thr Leu
385 390 395 400 385 390 395 400
Ser Gly Pro Gly Pro Leu Ser Leu Leu Arg Arg Gln Val Asn Glu Val Ser Gly Pro Gly Pro Leu Ser Leu Leu Arg Arg Gln Val Asn Glu Val
405 410 415 405 410 415
Ala Ala
<210> 94<210> 94
<211> 1254<211> 1254
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 94<400> 94
atgtactatt taaaaaacac aaacttttgg atgttcggtt tattcttttt cttttacttt 60atgtactatt taaaaaacac aaacttttgg atgttcggtt tattcttttt ctttttttt 60
tttatcatgg gagcctactt cccgtttttc ccgatttggc tacatgacat caaccatatc 120tttatcatgg gagcctactt cccgtttttc ccgatttggc tacatgacat caaccatatc 120
agcaaaagtg atacgggtat tatttttgcc gctatttctc tgttctcgct attattccaa 180agcaaaagtg atacgggtat tatttttgcc gctatttctc tgttctcgct attattccaa 180
ccgctgtttg gtctgctttc tgacaaactc gggctgcgca aatacctgct gtggattatt 240ccgctgtttg gtctgctttc tgacaaactc gggctgcgca aatacctgct gtggattatt 240
accggcatgt tagtgatgtt tgcgccgttc tttattttta tcttcgggcc actgttacaa 300accggcatgt tagtgatgtt tgcgccgttc tttattttta tcttcgggcc actgttacaa 300
tacaacattt tagtaggatc gattgttggt ggtatttatc taggcttttg ttttaacgcc 360tacaacattt tagtaggatc gattgttggt ggtatttatc taggcttttg ttttaacgcc 360
ggtgcgccag cagtagaggc atttattgag aaagtcagcc gtcgcagtaa tttcgaattt 420ggtgcgccag cagtagaggc atttattgag aaagtcagcc gtcgcagtaa tttcgaattt 420
ggtcgcgcgc ggatgtttgg ctgtgttggc tgggcgctgt gtgcctcgat tgtcggcatc 480ggtcgcgcgc ggatgtttgg ctgtgttggc tgggcgctgt gtgcctcgat tgtcggcatc 480
atgttcacca tcaataatca gtttgttttc tggctgggct ctggctgtgc actcatcctc 540atgttcacca tcaataatca gtttgttttc tggctgggct ctggctgtgc actcatcctc 540
gccgttttac tctttttcgc caaaacggat gcgccctctt ctgccacggt tgccaatgcg 600gccgttttac tctttttcgc caaaacggat gcgccctctt ctgccacggt tgccaatgcg 600
gtaggtgcca accattcggc atttagcctt aagctggcac tggaactgtt cagacagcca 660gtaggtgcca accattcggc atttagcctt aagctggcac tggaactgtt cagacagcca 660
aaactgtggt ttttgtcact gtatgttatt ggcgtttcct gcacctacga tgtttttgac 720aaactgtggt ttttgtcact gtatgttatt ggcgtttcct gcacctacga tgtttttgac 720
caacagtttg ctaatttctt tacttcgttc tttgctaccg gtgaacaggg tacgcgggta 780caacagtttg ctaatttctt tacttcgttc tttgctaccg gtgaacaggg tacgcgggta 780
tttggctacg taacgacaat gggcgaatta cttaacgcct cgattatgtt ctttgcgcca 840tttggctacg taacgacaat gggcgaatta cttaacgcct cgattatgtt ctttgcgcca 840
ctgatcatta atcgcatcgg tgggaaaaac gccctgctgc tggctggcac tattatgtct 900ctgatcatta atcgcatcgg tgggaaaaac gccctgctgc tggctggcac tattatgtct 900
gtacgtatta ttggctcatc gttcgccacc tcagcgctgg aagtggttat tctgaaaacg 960gtacgtatta ttggctcatc gttcgccacc tcagcgctgg aagtggttat tctgaaaacg 960
ctgcatatgt ttgaagtacc gttcctgctg gtgggctgct ttaaatatat taccagccag 1020ctgcatatgt ttgaagtacc gttcctgctg gtgggctgct ttaaatatat taccagccag 1020
tttgaagtgc gtttttcagc gacgatttat ctggtctgtt tctgcttctt taagcaactg 1080tttgaagtgc gtttttcagc gacgatttat ctggtctgtt tctgcttctt taagcaactg 1080
gcgatgattt ttatgtctgt actggcgggc aatatgtatg aaagcatcgg tttccagggc 1140gcgatgattt ttatgtctgt actggcgggc aatatgtatg aaagcatcgg tttccagggc 1140
gcttatctgg tgctgggtct ggtggcgctg ggcttcacct taatttccgt gttcacgctt 1200gcttatctgg tgctgggtct ggtggcgctg ggcttcacct taatttccgt gttcacgctt 1200
agcggccccg gtccgctttc tctactgcgt cgtcaggtga atgaagtcgc ttaa 1254agcggccccg gtccgctttc tctactgcgt cgtcaggtga atgaagtcgc ttaa 1254
<210> 95<210> 95
<211> 1024<211> 1024
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 95<400> 95
Met Thr Met Ile Thr Asp Ser Leu Ala Val Val Leu Gln Arg Arg Asp Met Thr Met Ile Thr Asp Ser Leu Ala Val Val Leu Gln Arg Arg Asp
1 5 10 15 1 5 10 15
Trp Glu Asn Pro Gly Val Thr Gln Leu Asn Arg Leu Ala Ala His Pro Trp Glu Asn Pro Gly Val Thr Gln Leu Asn Arg Leu Ala Ala His Pro
20 25 30 20 25 30
Pro Phe Ala Ser Trp Arg Asn Ser Glu Glu Ala Arg Thr Asp Arg Pro Pro Phe Ala Ser Trp Arg Asn Ser Glu Glu Ala Arg Thr Asp Arg Pro
35 40 45 35 40 45
Ser Gln Gln Leu Arg Ser Leu Asn Gly Glu Trp Arg Phe Ala Trp Phe Ser Gln Gln Leu Arg Ser Leu Asn Gly Glu Trp Arg Phe Ala Trp Phe
50 55 60 50 55 60
Pro Ala Pro Glu Ala Val Pro Glu Ser Trp Leu Glu Cys Asp Leu Pro Pro Ala Pro Glu Ala Val Pro Glu Ser Trp Leu Glu Cys Asp Leu Pro
65 70 75 80 65 70 75 80
Glu Ala Asp Thr Val Val Val Pro Ser Asn Trp Gln Met His Gly Tyr Glu Ala Asp Thr Val Val Val Pro Ser Asn Trp Gln Met His Gly Tyr
85 90 95 85 90 95
Asp Ala Pro Ile Tyr Thr Asn Val Thr Tyr Pro Ile Thr Val Asn Pro Asp Ala Pro Ile Tyr Thr Asn Val Thr Tyr Pro Ile Thr Val Asn Pro
100 105 110 100 105 110
Pro Phe Val Pro Thr Glu Asn Pro Thr Gly Cys Tyr Ser Leu Thr Phe Pro Phe Val Pro Thr Glu Asn Pro Thr Gly Cys Tyr Ser Leu Thr Phe
115 120 125 115 120 125
Asn Val Asp Glu Ser Trp Leu Gln Glu Gly Gln Thr Arg Ile Ile Phe Asn Val Asp Glu Ser Trp Leu Gln Glu Gly Gln Thr Arg Ile Ile Phe
130 135 140 130 135 140
Asp Gly Val Asn Ser Ala Phe His Leu Trp Cys Asn Gly Arg Trp Val Asp Gly Val Asn Ser Ala Phe His Leu Trp Cys Asn Gly Arg Trp Val
145 150 155 160 145 150 155 160
Gly Tyr Gly Gln Asp Ser Arg Leu Pro Ser Glu Phe Asp Leu Ser Ala Gly Tyr Gly Gln Asp Ser Arg Leu Pro Ser Glu Phe Asp Leu Ser Ala
165 170 175 165 170 175
Phe Leu Arg Ala Gly Glu Asn Arg Leu Ala Val Met Val Leu Arg Trp Phe Leu Arg Ala Gly Glu Asn Arg Leu Ala Val Met Val Leu Arg Trp
180 185 190 180 185 190
Ser Asp Gly Ser Tyr Leu Glu Asp Gln Asp Met Trp Arg Met Ser Gly Ser Asp Gly Ser Tyr Leu Glu Asp Gln Asp Met Trp Arg Met Ser Gly
195 200 205 195 200 205
Ile Phe Arg Asp Val Ser Leu Leu His Lys Pro Thr Thr Gln Ile Ser Ile Phe Arg Asp Val Ser Leu Leu His Lys Pro Thr Thr Gln Ile Ser
210 215 220 210 215 220
Asp Phe His Val Ala Thr Arg Phe Asn Asp Asp Phe Ser Arg Ala Val Asp Phe His Val Ala Thr Arg Phe Asn Asp Asp Phe Ser Arg Ala Val
225 230 235 240 225 230 235 240
Leu Glu Ala Glu Val Gln Met Cys Gly Glu Leu Arg Asp Tyr Leu Arg Leu Glu Ala Glu Val Gln Met Cys Gly Glu Leu Arg Asp Tyr Leu Arg
245 250 255 245 250 255
Val Thr Val Ser Leu Trp Gln Gly Glu Thr Gln Val Ala Ser Gly Thr Val Thr Val Ser Leu Trp Gln Gly Glu Thr Gln Val Ala Ser Gly Thr
260 265 270 260 265 270
Ala Pro Phe Gly Gly Glu Ile Ile Asp Glu Arg Gly Gly Tyr Ala Asp Ala Pro Phe Gly Gly Glu Ile Ile Asp Glu Arg Gly Gly Tyr Ala Asp
275 280 285 275 280 285
Arg Val Thr Leu Arg Leu Asn Val Glu Asn Pro Lys Leu Trp Ser Ala Arg Val Thr Leu Arg Leu Asn Val Glu Asn Pro Lys Leu Trp Ser Ala
290 295 300 290 295 300
Glu Ile Pro Asn Leu Tyr Arg Ala Val Val Glu Leu His Thr Ala Asp Glu Ile Pro Asn Leu Tyr Arg Ala Val Val Glu Leu His Thr Ala Asp
305 310 315 320 305 310 315 320
Gly Thr Leu Ile Glu Ala Glu Ala Cys Asp Val Gly Phe Arg Glu Val Gly Thr Leu Ile Glu Ala Glu Ala Cys Asp Val Gly Phe Arg Glu Val
325 330 335 325 330 335
Arg Ile Glu Asn Gly Leu Leu Leu Leu Asn Gly Lys Pro Leu Leu Ile Arg Ile Glu Asn Gly Leu Leu Leu Leu Asn Gly Lys Pro Leu Leu Ile
340 345 350 340 345 350
Arg Gly Val Asn Arg His Glu His His Pro Leu His Gly Gln Val Met Arg Gly Val Asn Arg His Glu His His Pro Leu His Gly Gln Val Met
355 360 365 355 360 365
Asp Glu Gln Thr Met Val Gln Asp Ile Leu Leu Met Lys Gln Asn Asn Asp Glu Gln Thr Met Val Gln Asp Ile Leu Leu Met Lys Gln Asn Asn
370 375 380 370 375 380
Phe Asn Ala Val Arg Cys Ser His Tyr Pro Asn His Pro Leu Trp Tyr Phe Asn Ala Val Arg Cys Ser His Tyr Pro Asn His Pro Leu Trp Tyr
385 390 395 400 385 390 395 400
Thr Leu Cys Asp Arg Tyr Gly Leu Tyr Val Val Asp Glu Ala Asn Ile Thr Leu Cys Asp Arg Tyr Gly Leu Tyr Val Val Asp Glu Ala Asn Ile
405 410 415 405 410 415
Glu Thr His Gly Met Val Pro Met Asn Arg Leu Thr Asp Asp Pro Arg Glu Thr His Gly Met Val Pro Met Asn Arg Leu Thr Asp Asp Pro Arg
420 425 430 420 425 430
Trp Leu Pro Ala Met Ser Glu Arg Val Thr Arg Met Val Gln Arg Asp Trp Leu Pro Ala Met Ser Glu Arg Val Thr Arg Met Val Gln Arg Asp
435 440 445 435 440 445
Arg Asn His Pro Ser Val Ile Ile Trp Ser Leu Gly Asn Glu Ser Gly Arg Asn His Pro Ser Val Ile Ile Trp Ser Leu Gly Asn Glu Ser Gly
450 455 460 450 455 460
His Gly Ala Asn His Asp Ala Leu Tyr Arg Trp Ile Lys Ser Val Asp His Gly Ala Asn His Asp Ala Leu Tyr Arg Trp Ile Lys Ser Val Asp
465 470 475 480 465 470 475 480
Pro Ser Arg Pro Val Gln Tyr Glu Gly Gly Gly Ala Asp Thr Thr Ala Pro Ser Arg Pro Val Gln Tyr Glu Gly Gly Gly Ala Asp Thr Thr Ala
485 490 495 485 490 495
Thr Asp Ile Ile Cys Pro Met Tyr Ala Arg Val Asp Glu Asp Gln Pro Thr Asp Ile Ile Cys Pro Met Tyr Ala Arg Val Asp Glu Asp Gln Pro
500 505 510 500 505 510
Phe Pro Ala Val Pro Lys Trp Ser Ile Lys Lys Trp Leu Ser Leu Pro Phe Pro Ala Val Pro Lys Trp Ser Ile Lys Lys Trp Leu Ser Leu Pro
515 520 525 515 520 525
Gly Glu Thr Arg Pro Leu Ile Leu Cys Glu Tyr Ala His Ala Met Gly Gly Glu Thr Arg Pro Leu Ile Leu Cys Glu Tyr Ala His Ala Met Gly
530 535 540 530 535 540
Asn Ser Leu Gly Gly Phe Ala Lys Tyr Trp Gln Ala Phe Arg Gln Tyr Asn Ser Leu Gly Gly Phe Ala Lys Tyr Trp Gln Ala Phe Arg Gln Tyr
545 550 555 560 545 550 555 560
Pro Arg Leu Gln Gly Gly Phe Val Trp Asp Trp Val Asp Gln Ser Leu Pro Arg Leu Gln Gly Gly Phe Val Trp Asp Trp Val Asp Gln Ser Leu
565 570 575 565 570 575
Ile Lys Tyr Asp Glu Asn Gly Asn Pro Trp Ser Ala Tyr Gly Gly Asp Ile Lys Tyr Asp Glu Asn Gly Asn Pro Trp Ser Ala Tyr Gly Gly Asp
580 585 590 580 585 590
Phe Gly Asp Thr Pro Asn Asp Arg Gln Phe Cys Met Asn Gly Leu Val Phe Gly Asp Thr Pro Asn Asp Arg Gln Phe Cys Met Asn Gly Leu Val
595 600 605 595 600 605
Phe Ala Asp Arg Thr Pro His Pro Ala Leu Thr Glu Ala Lys His Gln Phe Ala Asp Arg Thr Pro His Pro Ala Leu Thr Glu Ala Lys His Gln
610 615 620 610 615 620
Gln Gln Phe Phe Gln Phe Arg Leu Ser Gly Gln Thr Ile Glu Val Thr Gln Gln Phe Phe Gln Phe Arg Leu Ser Gly Gln Thr Ile Glu Val Thr
625 630 635 640 625 630 635 640
Ser Glu Tyr Leu Phe Arg His Ser Asp Asn Glu Leu Leu His Trp Met Ser Glu Tyr Leu Phe Arg His Ser Asp Asn Glu Leu Leu His Trp Met
645 650 655 645 650 655
Val Ala Leu Asp Gly Lys Pro Leu Ala Ser Gly Glu Val Pro Leu Asp Val Ala Leu Asp Gly Lys Pro Leu Ala Ser Gly Glu Val Pro Leu Asp
660 665 670 660 665 670
Val Ala Pro Gln Gly Lys Gln Leu Ile Glu Leu Pro Glu Leu Pro Gln Val Ala Pro Gln Gly Lys Gln Leu Ile Glu Leu Pro Glu Leu Pro Gln
675 680 685 675 680 685
Pro Glu Ser Ala Gly Gln Leu Trp Leu Thr Val Arg Val Val Gln Pro Pro Glu Ser Ala Gly Gln Leu Trp Leu Thr Val Arg Val Val Gln Pro
690 695 700 690 695 700
Asn Ala Thr Ala Trp Ser Glu Ala Gly His Ile Ser Ala Trp Gln Gln Asn Ala Thr Ala Trp Ser Glu Ala Gly His Ile Ser Ala Trp Gln Gln
705 710 715 720 705 710 715 720
Trp Arg Leu Ala Glu Asn Leu Ser Val Thr Leu Pro Ala Ala Ser His Trp Arg Leu Ala Glu Asn Leu Ser Val Thr Leu Pro Ala Ala Ser His
725 730 735 725 730 735
Ala Ile Pro His Leu Thr Thr Ser Glu Met Asp Phe Cys Ile Glu Leu Ala Ile Pro His Leu Thr Thr Ser Glu Met Asp Phe Cys Ile Glu Leu
740 745 750 740 745 750
Gly Asn Lys Arg Trp Gln Phe Asn Arg Gln Ser Gly Phe Leu Ser Gln Gly Asn Lys Arg Trp Gln Phe Asn Arg Gln Ser Gly Phe Leu Ser Gln
755 760 765 755 760 765
Met Trp Ile Gly Asp Lys Lys Gln Leu Leu Thr Pro Leu Arg Asp Gln Met Trp Ile Gly Asp Lys Lys Gln Leu Leu Thr Pro Leu Arg Asp Gln
770 775 780 770 775 780
Phe Thr Arg Ala Pro Leu Asp Asn Asp Ile Gly Val Ser Glu Ala Thr Phe Thr Arg Ala Pro Leu Asp Asn Asp Ile Gly Val Ser Glu Ala Thr
785 790 795 800 785 790 795 800
Arg Ile Asp Pro Asn Ala Trp Val Glu Arg Trp Lys Ala Ala Gly His Arg Ile Asp Pro Asn Ala Trp Val Glu Arg Trp Lys Ala Ala Gly His
805 810 815 805 810 815
Tyr Gln Ala Glu Ala Ala Leu Leu Gln Cys Thr Ala Asp Thr Leu Ala Tyr Gln Ala Glu Ala Ala Leu Leu Gln Cys Thr Ala Asp Thr Leu Ala
820 825 830 820 825 830
Asp Ala Val Leu Ile Thr Thr Ala His Ala Trp Gln His Gln Gly Lys Asp Ala Val Leu Ile Thr Thr Ala His Ala Trp Gln His Gln Gly Lys
835 840 845 835 840 845
Thr Leu Phe Ile Ser Arg Lys Thr Tyr Arg Ile Asp Gly Ser Gly Gln Thr Leu Phe Ile Ser Arg Lys Thr Tyr Arg Ile Asp Gly Ser Gly Gln
850 855 860 850 855 860
Met Ala Ile Thr Val Asp Val Glu Val Ala Ser Asp Thr Pro His Pro Met Ala Ile Thr Val Asp Val Glu Val Ala Ser Asp Thr Pro His Pro
865 870 875 880 865 870 875 880
Ala Arg Ile Gly Leu Asn Cys Gln Leu Ala Gln Val Ala Glu Arg Val Ala Arg Ile Gly Leu Asn Cys Gln Leu Ala Gln Val Ala Glu Arg Val
885 890 895 885 890 895
Asn Trp Leu Gly Leu Gly Pro Gln Glu Asn Tyr Pro Asp Arg Leu Thr Asn Trp Leu Gly Leu Gly Pro Gln Glu Asn Tyr Pro Asp Arg Leu Thr
900 905 910 900 905 910
Ala Ala Cys Phe Asp Arg Trp Asp Leu Pro Leu Ser Asp Met Tyr Thr Ala Ala Cys Phe Asp Arg Trp Asp Leu Pro Leu Ser Asp Met Tyr Thr
915 920 925 915 920 925
Pro Tyr Val Phe Pro Ser Glu Asn Gly Leu Arg Cys Gly Thr Arg Glu Pro Tyr Val Phe Pro Ser Glu Asn Gly Leu Arg Cys Gly Thr Arg Glu
930 935 940 930 935 940
Leu Asn Tyr Gly Pro His Gln Trp Arg Gly Asp Phe Gln Phe Asn Ile Leu Asn Tyr Gly Pro His Gln Trp Arg Gly Asp Phe Gln Phe Asn Ile
945 950 955 960 945 950 955 960
Ser Arg Tyr Ser Gln Gln Gln Leu Met Glu Thr Ser His Arg His Leu Ser Arg Tyr Ser Gln Gln Gln Leu Met Glu Thr Ser His Arg His Leu
965 970 975 965 970 975
Leu His Ala Glu Glu Gly Thr Trp Leu Asn Ile Asp Gly Phe His Met Leu His Ala Glu Glu Gly Thr Trp Leu Asn Ile Asp Gly Phe His Met
980 985 990 980 985 990
Gly Ile Gly Gly Asp Asp Ser Trp Ser Pro Ser Val Ser Ala Glu Phe Gly Ile Gly Gly Asp Asp Ser Trp Ser Pro Ser Val Ser Ala Glu Phe
995 1000 1005 995 1000 1005
Gln Leu Ser Ala Gly Arg Tyr His Tyr Gln Leu Val Trp Cys Gln Gln Leu Ser Ala Gly Arg Tyr His Tyr Gln Leu Val Trp Cys Gln
1010 1015 1020 1010 1015 1020
Lys Lys
<210> 96<210> 96
<211> 3075<211> 3075
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 96<400> 96
atgaccatga ttacggattc actggccgtc gttttacaac gtcgtgactg ggaaaaccct 60atgaccatga ttacggattc actggccgtc gttttacaac gtcgtgactg ggaaaaccct 60
ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc 120ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc 120
gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc 180gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc 180
tttgcctggt ttccggcacc agaagcggtg ccggaaagct ggctggagtg cgatcttcct 240tttgcctggt ttccggcacc agaagcggtg ccggaaagct ggctggagtg cgatcttcct 240
gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc acggttacga tgcgcccatc 300gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc acggttacga tgcgcccatc 300
tacaccaacg tgacctatcc cattacggtc aatccgccgt ttgttcccac ggagaatccg 360tacaccaacg tgacctatcc cattacggtc aatccgccgt ttgttcccac ggagaatccg 360
acgggttgtt actcgctcac atttaatgtt gatgaaagct ggctacagga aggccagacg 420acgggttgtt actcgctcac atttaatgtt gatgaaagct ggctacagga aggccagacg 420
cgaattattt ttgatggcgt taactcggcg tttcatctgt ggtgcaacgg gcgctgggtc 480cgaattattt ttgatggcgt taactcggcg tttcatctgt ggtgcaacgg gcgctgggtc 480
ggttacggcc aggacagtcg tttgccgtct gaatttgacc tgagcgcatt tttacgcgcc 540ggttacggcc aggacagtcg tttgccgtct gaatttgacc tgagcgcatt tttacgcgcc 540
ggagaaaacc gcctcgcggt gatggtgctg cgctggagtg acggcagtta tctggaagat 600ggagaaaacc gcctcgcggt gatggtgctg cgctggagtg acggcagtta tctggaagat 600
caggatatgt ggcggatgag cggcattttc cgtgacgtct cgttgctgca taaaccgact 660caggatatgt ggcggatgag cggcattttc cgtgacgtct cgttgctgca taaaccgact 660
acacaaatca gcgatttcca tgttgccact cgctttaatg atgatttcag ccgcgctgta 720acacaaatca gcgatttcca tgttgccact cgctttaatg atgatttcag ccgcgctgta 720
ctggaggctg aagttcagat gtgcggcgag ttgcgtgact acctacgggt aacagtttct 780ctggaggctg aagttcagat gtgcggcgag ttgcgtgact acctacgggt aacagtttct 780
ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc ctttcggcgg tgaaattatc 840ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc ctttcggcgg tgaaattatc 840
gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc tgaacgtcga aaacccgaaa 900gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc tgaacgtcga aaacccgaaa 900
ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg ttgaactgca caccgccgac 960ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg ttgaactgca caccgccgac 960
ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc gcgaggtgcg gattgaaaat 1020ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc gcgaggtgcg gattgaaaat 1020
ggtctgctgc tgctgaacgg caagccgttg ctgattcgag gcgttaaccg tcacgagcat 1080ggtctgctgc tgctgaacgg caagccgttg ctgattcgag gcgttaaccg tcacgagcat 1080
catcctctgc atggtcaggt catggatgag cagacgatgg tgcaggatat cctgctgatg 1140catcctctgc atggtcaggt catggatgag cagacgatgg tgcaggatat cctgctgatg 1140
aagcagaaca actttaacgc cgtgcgctgt tcgcattatc cgaaccatcc gctgtggtac 1200aagcagaaca actttaacgc cgtgcgctgt tcgcattatc cgaaccatcc gctgtggtac 1200
acgctgtgcg accgctacgg cctgtatgtg gtggatgaag ccaatattga aacccacggc 1260acgctgtgcg accgctacgg cctgtatgtg gtggatgaag ccaatattga aacccacggc 1260
atggtgccaa tgaatcgtct gaccgatgat ccgcgctggc taccggcgat gagcgaacgc 1320atggtgccaa tgaatcgtct gaccgatgat ccgcgctggc taccggcgat gagcgaacgc 1320
gtaacgcgaa tggtgcagcg cgatcgtaat cacccgagtg tgatcatctg gtcgctgggg 1380gtaacgcgaa tggtgcagcg cgatcgtaat cacccgagtg tgatcatctg gtcgctgggg 1380
aatgaatcag gccacggcgc taatcacgac gcgctgtatc gctggatcaa atctgtcgat 1440aatgaatcag gccacggcgc taatcacgac gcgctgtatc gctggatcaa atctgtcgat 1440
ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca ccacggccac cgatattatt 1500ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca ccacggccac cgatattatt 1500
tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc cggctgtgcc gaaatggtcc 1560tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc cggctgtgcc gaaatggtcc 1560
atcaaaaaat ggctttcgct acctggagag acgcgcccgc tgatcctttg cgaatacgcc 1620atcaaaaaat ggctttcgct acctggagag acgcgcccgc tgatcctttg cgaatacgcc 1620
cacgcgatgg gtaacagtct tggcggtttc gctaaatact ggcaggcgtt tcgtcagtat 1680cacgcgatgg gtaacagtct tggcggtttc gctaaatact ggcaggcgtt tcgtcagtat 1680
ccccgtttac agggcggctt cgtctgggac tgggtggatc agtcgctgat taaatatgat 1740ccccgtttac agggcggctt cgtctgggac tgggtggatc agtcgctgat taaatatgat 1740
gaaaacggca acccgtggtc ggcttacggc ggtgattttg gcgatacgcc gaacgatcgc 1800gaaaacggca acccgtggtc ggcttacggc ggtgattttg gcgatacgcc gaacgatcgc 1800
cagttctgta tgaacggtct ggtctttgcc gaccgcacgc cgcatccagc gctgacggaa 1860cagttctgta tgaacggtct ggtctttgcc gaccgcacgc cgcatccagc gctgacggaa 1860
gcaaaacacc agcagcagtt tttccagttc cgtttatccg ggcaaaccat cgaagtgacc 1920gcaaaacacc agcagcagtt tttccagttc cgtttatccg ggcaaaccat cgaagtgacc 1920
agcgaatacc tgttccgtca tagcgataac gagctcctgc actggatggt ggcgctggat 1980agcgaatacc tgttccgtca tagcgataac gagctcctgc actggatggt ggcgctggat 1980
ggtaagccgc tggcaagcgg tgaagtgcct ctggatgtcg ctccacaagg taaacagttg 2040ggtaagccgc tggcaagcgg tgaagtgcct ctggatgtcg ctccacaagg taaacagttg 2040
attgaactgc ctgaactacc gcagccggag agcgccgggc aactctggct cacagtacgc 2100attgaactgc ctgaactacc gcagccggag agcgccgggc aactctggct cacagtacgc 2100
gtagtgcaac cgaacgcgac cgcatggtca gaagccggac acatcagcgc ctggcagcag 2160gtagtgcaac cgaacgcgac cgcatggtca gaagccggac acatcagcgc ctggcagcag 2160
tggcgtctgg ctgaaaacct cagcgtgaca ctccccgccg cgtcccacgc catcccgcat 2220tggcgtctgg ctgaaaacct cagcgtgaca ctccccgccg cgtcccacgc catcccgcat 2220
ctgaccacca gcgaaatgga tttttgcatc gagctgggta ataagcgttg gcaatttaac 2280ctgaccacca gcgaaatgga tttttgcatc gagctgggta ataagcgttg gcaatttaac 2280
cgccagtcag gctttctttc acagatgtgg attggcgata aaaaacaact gctgacgccg 2340cgccagtcag gctttctttc acagatgtgg attggcgata aaaaacaact gctgacgccg 2340
ctgcgcgatc agttcacccg tgcaccgctg gataacgaca ttggcgtaag tgaagcgacc 2400ctgcgcgatc agttcacccg tgcaccgctg gataacgaca ttggcgtaag tgaagcgacc 2400
cgcattgacc ctaacgcctg ggtcgaacgc tggaaggcgg cgggccatta ccaggccgaa 2460cgcattgacc ctaacgcctg ggtcgaacgc tggaaggcgg cgggccatta ccaggccgaa 2460
gcagcgttgt tgcagtgcac ggcagataca cttgctgatg cggtgctgat tacgaccgct 2520gcagcgttgt tgcagtgcac ggcagataca cttgctgatg cggtgctgat tacgaccgct 2520
cacgcgtggc agcatcaggg gaaaacctta tttatcagcc ggaaaaccta ccggattgat 2580cacgcgtggc agcatcaggg gaaaacctta tttatcagcc ggaaaaccta ccggattgat 2580
ggtagtggtc aaatggcgat taccgttgat gttgaagtgg cgagcgatac accgcatccg 2640ggtagtggtc aaatggcgat taccgttgat gttgaagtgg cgagcgatac accgcatccg 2640
gcgcggattg gcctgaactg ccagctggcg caggtagcag agcgggtaaa ctggctcgga 2700gcgcggattg gcctgaactg ccagctggcg caggtagcag agcgggtaaa ctggctcgga 2700
ttagggccgc aagaaaacta tcccgaccgc cttactgccg cctgttttga ccgctgggat 2760ttagggccgc aagaaaacta tcccgaccgc cttactgccg cctgttttga ccgctgggat 2760
ctgccattgt cagacatgta taccccgtac gtcttcccga gcgaaaacgg tctgcgctgc 2820ctgccattgt cagacatgta taccccgtac gtcttcccga gcgaaaacgg tctgcgctgc 2820
gggacgcgcg aattgaatta tggcccacac cagtggcgcg gcgacttcca gttcaacatc 2880gggacgcgcg aattgaatta tggcccacac cagtggcgcg gcgacttcca gttcaacatc 2880
agccgctaca gtcaacagca actgatggaa accagccatc gccatctgct gcacgcggaa 2940agccgctaca gtcaacagca actgatggaa accagccatc gccatctgct gcacgcggaa 2940
gaaggcacat ggctgaatat cgacggtttc catatgggga ttggtggcga cgactcctgg 3000gaaggcacat ggctgaatat cgacggtttc catatgggga ttggtggcga cgactcctgg 3000
agcccgtcag tatcggcgga attccagctg agcgccggtc gctaccatta ccagttggtc 3060agcccgtcag tatcggcgga attccagctg agcgccggtc gctaccatta ccagttggtc 3060
tggtgtcaaa aataa 3075tggtgtcaaa aataa 3075
<210> 97<210> 97
<211> 3123<211> 3123
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета<223> Expression Cassette
<400> 97<400> 97
ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60
cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120
cgacaaaaat ctagaaataa ttttgtttaa ctttaagaag gagatataca aatgatcgct 180cgacaaaaat ctagaaataa ttttgtttaa ctttaagaag gagatataca aatgatcgct 180
caccgtcgtc aggaactggc tcaacagtat tatcaggctc tgcaccaaga tgtgctgccg 240caccgtcgtc aggaactggc tcaacagtat tatcaggctc tgcaccaaga tgtgctgccg 240
ttctgggaaa agtattcgct ggatcgtcaa ggcggtggct attttacctg cctggaccgc 300ttctgggaaa agtattcgct ggatcgtcaa ggcggtggct attttacctg cctggaccgc 300
aagggtcagg tttttgatac ggacaagttc atttggctgc aaaaccgtca agtgtggcaa 360aagggtcagg tttttgatac ggacaagttc atttggctgc aaaaccgtca agtgtggcaa 360
tttgcggttt tctacaatcg cctggaaccg aaaccgcagt ggctggaaat cgctcgtcat 420tttgcggttt tctacaatcg cctggaaccg aaaccgcagt ggctggaaat cgctcgtcat 420
ggtgcggatt ttctggcacg tcacggtcgt gatcaggacg gtaactggta tttcgccctg 480ggtgcggatt ttctggcacg tcacggtcgt gatcaggacg gtaactggta tttcgccctg 480
gatcaggaag gcaaaccgct gcgccaaccg tacaatgtgt tttccgactg tttcgcggcg 540gatcaggaag gcaaaccgct gcgccaaccg tacaatgtgt tttccgactg tttcgcggcg 540
atggcgttta gccagtatgc actggcttct ggtgctcaag aagcgaaggc cattgcactg 600atggcgttta gccagtatgc actggcttct ggtgctcaag aagcgaaggc cattgcactg 600
caagcgtata acaatgttct gcgtcgccag cataacccga aaggtcaata tgaaaagagt 660caagcgtata acaatgttct gcgtcgccag cataacccga aaggtcaata tgaaaagagt 660
tacccgggta cccgtccgct gaaatccctg gcagtgccga tgatcctggc taatctgacg 720tacccgggta cccgtccgct gaaatccctg gcagtgccga tgatcctggc taatctgacg 720
ctggaaatgg aatggctgct gccgccgacc acggtcgaag aagtgctggc ccagaccgtt 780ctggaaatgg aatggctgct gccgccgacc acggtcgaag aagtgctggc ccagaccgtt 780
cgtgaagtca tgacggattt tctggacccg gaaattggcc tgatgcgcga agcagttacc 840cgtgaagtca tgacggattt tctggacccg gaaattggcc tgatgcgcga agcagttacc 840
ccgacgggtg aatttgtcga ttcattcgaa ggccgcctgc tgaacccggg tcatggcatt 900ccgacgggtg aatttgtcga ttcattcgaa ggccgcctgc tgaacccggg tcatggcatt 900
gaagcgatgt ggtttatgat ggatattgcc cagcgttcgg gtgaccgcca gctgcaagaa 960gaagcgatgt ggtttatgat ggatattgcc cagcgttcgg gtgaccgcca gctgcaagaa 960
caggctattg cggtggttct gaataccctg gaatatgcat gggatgaaga atttggtggc 1020caggctattg cggtggttct gaataccctg gaatatgcat gggatgaaga atttggtggc 1020
atcttttact tcctggaccg tcaaggtcac ccgccgcagc aactggaatg ggatcagaaa 1080atcttttact tcctggaccg tcaaggtcac ccgccgcagc aactggaatg ggatcagaaa 1080
ctgtggtggg tccatctgga aaccctggtg gccctggcaa aaggtcacca ggcgacgggc 1140ctgtggtggg tccatctgga aaccctggtg gccctggcaa aaggtcacca ggcgacgggc 1140
caagaaaagt gctggcagtg gtttgaacgc gtgcatgatt atgcatggag ccactttgct 1200caagaaaagt gctggcagtg gtttgaacgc gtgcatgatt atgcatggag ccactttgct 1200
gacccggaat atggtgaatg gttcggctac ctgaaccgtc gcggtgaagt gctgctgaat 1260gacccggaat atggtgaatg gttcggctac ctgaaccgtc gcggtgaagt gctgctgaat 1260
ctgaaaggtg gcaaatggaa gggctgcttc cacgttccgc gtgcgctgtg gctgtgtgcc 1320ctgaaaggtg gcaaatggaa gggctgcttc cacgttccgc gtgcgctgtg gctgtgtgcc 1320
gaaaccctgc aactgccggt ctcttaataa tcgaaggaga tacaacatga gcttacccga 1380gaaaccctgc aactgccggt ctcttaataa tcgaaggaga tacaacatga gcttacccga 1380
tggattttat ataaggcgaa tggaagaggg ggatttggaa caggtcactg agacgctaaa 1440tggattttat ataaggcgaa tggaagaggg ggatttggaa caggtcactg agacgctaaa 1440
ggttttgacc accgtgggca ctattacccc cgaatccttc agcaaactca taaaatactg 1500ggttttgacc accgtgggca ctattacccc cgaatccttc agcaaactca taaaatactg 1500
gaatgaagcc acagtatgga atgataacga agataaaaaa ataatgcaat ataaccccat 1560gaatgaagcc acagtatgga atgataacga agataaaaaa ataatgcaat ataaccccat 1560
ggtgattgtg gacaagcgca ccgagacggt tgccgctacg gggaatatca tcatcgaaag 1620ggtgattgtg gacaagcgca ccgagacggt tgccgctacg gggaatatca tcatcgaaag 1620
aaagatcatt catgaactgg ggctatgtgg ccacatcgag gacattgcag taaactccaa 1680aaagatcatt catgaactgg ggctatgtgg ccacatcgag gacattgcag taaactccaa 1680
gtatcagggc caaggtttgg gcaagctctt gattgatcaa ttggtaacta tcggctttga 1740gtatcagggc caaggtttgg gcaagctctt gattgatcaa ttggtaacta tcggctttga 1740
ctacggttgt tataagatta ttttagattg cgatgagaaa aatgtcaaat tctatgaaaa 1800ctacggttgt tataagatta ttttagattg cgatgagaaa aatgtcaaat tctatgaaaa 1800
atgtgggttt agcaacgcag gcgtggaaat gcaaattaga aaatagaata actagcataa 1860atgtgggttt agcaacgcag gcgtggaaat gcaaattaga aaatagaata actagcataa 1860
acccccttgg ggcctctaaa cgggtcttga ggggtttttt gctgaaacca atttgcctgg 1920acccccttgg ggcctctaaa cgggtcttga ggggtttttt gctgaaacca atttgcctgg 1920
cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag 1980cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag 1980
cgccgatggt agtgtggggt ctccccatgc gagagtaggg aactgccagg catcaaataa 2040cgccgatggt agtgtggggt ctccccatgc gagagtaggg aactgccagg catcaaataa 2040
aacgaaaggc tcagtcgaaa gactgggcct ttcgggatcc aggccggcct gttaagacgg 2100aacgaaaggc tcagtcgaaa gactgggcct ttcgggatcc aggccggcct gttaagacgg 2100
ccagtgaatt cgagctcggt acctaccgtt cgtataatgt atgctatacg aagttatcga 2160ccagtgaatt cgagctcggt acctaccgtt cgtataatgt atgctatacg aagttatcga 2160
gctctagaga atgatcccct cattaggcca cacgttcaag tgcagcgcac accgtggaaa 2220gctctagaga atgatcccct cattaggcca cacgttcaag tgcagcgcac accgtggaaa 2220
cggatgaagg cacgaaccca gttgacataa gcctgttcgg ttcgtaaact gtaatgcaag 2280cggatgaagg cacgaaccca gttgacataa gcctgttcgg ttcgtaaact gtaatgcaag 2280
tagcgtatgc gctcacgcaa ctggtccaga accttgaccg aacgcagcgg tggtaacggc 2340tagcgtatgc gctcacgcaa ctggtccaga accttgaccg aacgcagcgg tggtaacggc 2340
gcagtggcgg ttttcatggc ttgttatgac tgtttttttg tacagtctat gcctcgggca 2400gcagtggcgg ttttcatggc ttgttatgac tgtttttttg tacagtctat gcctcgggca 2400
tccaagcagc aagcgcgtta cgccgtgggt cgatgtttga tgttatggag cagcaacgat 2460tccaagcagc aagcgcgtta cgccgtgggt cgatgtttga tgttatggag cagcaacgat 2460
gttacgcagc agcaacgatg ttacgcagca gggcagtcgc cctaaaacaa agttaggtgg 2520gttacgcagc agcaacgatg ttacgcagca gggcagtcgc cctaaaacaa agttaggtgg 2520
ctcaagtatg ggcatcattc gcacatgtag gctcggccct gaccaagtca aatccatgcg 2580ctcaagtatg ggcatcattc gcacatgtag gctcggccct gaccaagtca aatccatgcg 2580
ggctgctctt gatcttttcg gtcgtgagtt cggagacgta gccacctact cccaacatca 2640ggctgctctt gatcttttcg gtcgtgagtt cggagacgta gccacctact cccaacatca 2640
gccggactcc gattacctcg ggaacttgct ccgtagtaag acattcatcg cgcttgctgc 2700gccggactcc gattacctcg ggaacttgct ccgtagtaag acattcatcg cgcttgctgc 2700
cttcgaccaa gaagcggttg ttggcgctct cgcggcttac gttctgccca ggtttgagca 2760cttcgaccaa gaagcggttg ttggcgctct cgcggcttac gttctgccca ggtttgagca 2760
gccgcgtagt gagatctata tctatgatct cgcagtctcc ggcgagcacc ggaggcaggg 2820gccgcgtagt gagatctata tctatgatct cgcagtctcc ggcgagcacc ggaggcaggg 2820
cattgccacc gcgctcatca atctcctcaa gcatgaggcc aacgcgcttg gtgcttatgt 2880cattgccacc gcgctcatca atctcctcaa gcatgaggcc aacgcgcttg gtgcttatgt 2880
gatctacgtg caagcagatt acggtgacga tcccgcagtg gctctctata caaagttggg 2940gatctacgtg caagcagatt acggtgacga tcccgcagtg gctctctata caaagttggg 2940
catacgggaa gaagtgatgc actttgatat cgacccaagt accgccacct aacaattcgt 3000catacgggaa gaagtgatgc actttgatat cgacccaagt accgccacct aacaattcgt 3000
tcaagccgag atcgtagaat ttcgacgacc tgcagccaag cataacttcg tataatgtat 3060tcaagccgag atcgtagaat ttcgacgacc tgcagccaag cataacttcg tataatgtat 3060
gctatacgaa cggtaggatc ctctagagtc gacctgcagg catgagatgt gtataagaga 3120gctatacgaa cggtaggatc ctctagagtc gacctgcagg catgagatgt gtataagaga 3120
cag 3123cag 3123
<210> 98<210> 98
<211> 2965<211> 2965
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета<223> Expression Cassette
<400> 98<400> 98
ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60
cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120
cgacaaaaat ctagaaataa ttttgtttaa ctttaagaag gagatataca aatgaaagaa 180cgacaaaaat ctagaaataa ttttgtttaa ctttaagaag gagatataca aatgaaagaa 180
atcaaaatcc agaacatcat catcagcgaa gaaaaagcgc cgctggttgt gccggaaatc 240atcaaaatcc agaacatcat catcagcgaa gaaaaagcgc cgctggttgt gccggaaatc 240
ggcattaacc ataatggtag tctggaactg gcaaaaatca tggtggatgc ggcctttagc 300ggcattaacc ataatggtag tctggaactg gcaaaaatca tggtggatgc ggcctttagc 300
gccggtgcaa aaatcattaa acatcagacc cacattgtgg aagatgaaat gtctaaagca 360gccggtgcaa aaatcattaa acatcagacc cacattgtgg aagatgaaat gtctaaagca 360
gcgaaaaaag ttatcccggg caacgcgaaa atcagtatct acgaaatcat gcagaaatgc 420gcgaaaaaag ttatcccggg caacgcgaaa atcagtatct acgaaatcat gcagaaatgc 420
gcgctggatt acaaagatga actggccctg aaagaatata ccgaaaaact gggtctggtg 480gcgctggatt acaaagatga actggccctg aaagaatata ccgaaaaact gggtctggtg 480
tacctgtcta ccccgtttag tcgtgcgggt gcaaaccgtc tggaagatat gggtgttagt 540tacctgtcta ccccgtttag tcgtgcgggt gcaaaccgtc tggaagatat gggtgttagt 540
gcgttcaaaa tcggcagcgg tgaatgtaac aattatccgc tgatcaaaca tattgccgca 600gcgttcaaaa tcggcagcgg tgaatgtaac aattatccgc tgatcaaaca tattgccgca 600
tttaaaaaac cgatgattgt tagcaccggc atgaatagca tcgaatctat taaaccgacg 660tttaaaaaac cgatgattgt tagcaccggc atgaatagca tcgaatctat taaaccgacg 660
gtgaaaatcc tgctggataa cgaaattccg tttgttctga tgcataccac gaatctgtac 720gtgaaaatcc tgctggataa cgaaattccg tttgttctga tgcataccac gaatctgtac 720
ccgaccccgc acaacctggt gcgtctgaat gccatgctgg aactgaaaaa agaattctct 780ccgaccccgc acaacctggt gcgtctgaat gccatgctgg aactgaaaaa agaattctct 780
tgcatggttg gtctgagtga tcacaccacg gataatctgg catgcctggg tgcagtggtt 840tgcatggttg gtctgagtga tcacaccacg gataatctgg catgcctggg tgcagtggtt 840
ctgggtgcgt gtgtgctgga acgtcatttc accgatagca tgcaccgctc tggtccggat 900ctgggtgcgt gtgtgctgga acgtcatttc accgatagca tgcaccgctc tggtccggat 900
attgtttgta gtatggatac gaaagcactg aaagaactga tcattcagag cgaacagatg 960attgtttgta gtatggatac gaaagcactg aaagaactga tcattcagag cgaacagatg 960
gcgatcattc gcggcaacaa tgaatctaaa aaagcggcca aacaggaaca ggtgaccatc 1020gcgatcattc gcggcaacaa tgaatctaaa aaagcggcca aacaggaaca ggtgaccatc 1020
gattttgcat tcgcgagtgt ggttagcatc aaagatatca aaaaaggcga agtgctgagc 1080gattttgcat tcgcgagtgt ggttagcatc aaagatatca aaaaaggcga agtgctgagc 1080
atggataata tttgggttaa acgtccgggt ctgggcggta tctctgcagc ggaatttgaa 1140atggataata tttgggttaa acgtccgggt ctgggcggta tctctgcagc ggaatttgaa 1140
aacattctgg gcaaaaaagc actgcgcgat attgaaaatg atgcgcagct gtcttatgaa 1200aacattctgg gcaaaaaagc actgcgcgat attgaaaatg atgcgcagct gtcttatgaa 1200
gatttcgcct aataaatcga tactagcata accccttggg gcctctaaac gcgtcgacac 1260gatttcgcct aataaatcga tactagcata accccttggg gcctctaaac gcgtcgacac 1260
gcaaaaaggc catccgtcag gatggccttc tgcttaattt gatgcctggc agtttatggc 1320gcaaaaaggc catccgtcag gatggccttc tgcttaattt gatgcctggc agtttatggc 1320
gggcgtcctg cccgccaccc tccgggccgt tgcttcgcaa cgttcaaatc cgctcccggc 1380gggcgtcctg cccgccaccc tccggggccgt tgcttcgcaa cgttcaaatc cgctcccggc 1380
ggatttgtcc tactcaggag agcgttcacc gacaaacaac agataaaacg aaaggcccag 1440ggatttgtcc tactcaggag agcgttcacc gacaaacaac agataaaacg aaaggcccag 1440
tctttcgact gagcctttcg ttttatttga tgcctggcag ttccctactc tcgcatgggg 1500tctttcgact gagcctttcg ttttatttga tgcctggcag ttccctactc tcgcatgggg 1500
agaccccaca ctaccatccg gtatcgataa gcttgatggc gaaaggggga tgtgctgcaa 1560agaccccaca ctaccatccg gtatcgataa gcttgatggc gaaaggggga tgtgctgcaa 1560
ggcgattaag ttgggtaacg ccagggtttt cccagtcacg acgttgtaaa acgacggcca 1620ggcgattaag ttgggtaacg cccaggtttt cccagtcacg acgttgtaaa acgacggcca 1620
gtgaattcga gctcggtacc taccgttcgt ataatgtatg ctatacgaag ttatcgagct 1680gtgaattcga gctcggtacc taccgttcgt ataatgtatg ctatacgaag ttatcgagct 1680
ctagagaatg atcccctccc tcacgctgcc gcaagcactc agggcgcaag ggctgctaaa 1740ctagagaatg atcccctccc tcacgctgcc gcaagcactc agggcgcaag ggctgctaaa 1740
ggaagcggaa cacgtagaaa gccagtccgc agaaacggtg ctgaccccgg atgaatgtca 1800ggaagcggaa cacgtagaaa gccagtccgc agaaacggtg ctgaccccgg atgaatgtca 1800
gctactgggc tatctggaca agggaaaacg caagcgcaaa gagaaagcag gtagcttgca 1860gctactgggc tatctggaca agggaaaacg caagcgcaaa gagaaagcag gtagcttgca 1860
gtgggcttac atggcgatag ctagactggg cggttttatg gacagcaagc gaaccggaat 1920gtgggcttac atggcgatag ctagactggg cggttttatg gacagcaagc gaaccggaat 1920
tgccagctgg ggcgccctct ggtaaggttg ggaagccctg caaagtaaac tggatggctt 1980tgccagctgg ggcgccctct ggtaaggttg ggaagccctg caaagtaaac tggatggctt 1980
tcttgccgcc aaggatctga tggcgcaggg gatcaagatc tgatcaagag acaggatgag 2040tcttgccgcc aaggatctga tggcgcaggg gatcaagatc tgatcaagag acaggatgag 2040
gatcgtttcg catgattgaa caagatggat tgcacgcagg ttctccggcc gcttgggtgg 2100gatcgtttcg catgattgaa caagatggat tgcacgcagg ttctccggcc gcttgggtgg 2100
agaggctatt cggctatgac tgggcacaac agacaatcgg ctgctctgat gccgccgtgt 2160agaggctatt cggctatgac tgggcacaac agacaatcgg ctgctctgat gccgccgtgt 2160
tccggctgtc agcgcagggg cgcccggttc tttttgtcaa gaccgacctg tccggtgccc 2220tccggctgtc agcgcagggg cgcccggttc tttttgtcaa gaccgacctg tccggtgccc 2220
tgaatgaact gcaggacgag gcagcgcggc tatcgtggct ggccacgacg ggcgttcctt 2280tgaatgaact gcaggacgag gcagcgcggc tatcgtggct ggccacgacg ggcgttcctt 2280
gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga ctggctgcta ttgggcgaag 2340gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga ctggctgcta ttgggcgaag 2340
tgccggggca ggatctcctg tcatctcacc ttgctcctgc cgagaaagta tccatcatgg 2400tgccggggca ggatctcctg tcatctcacc ttgctcctgc cgagaaagta tccatcatgg 2400
ctgatgcaat gcggcggctg catacgcttg atccggctac ctgcccattc gaccaccaag 2460ctgatgcaat gcggcggctg catacgcttg atccggctac ctgcccattc gaccaccaag 2460
cgaaacatcg catcgagcga gcacgtactc ggatggaagc cggtcttgtc gatcaggatg 2520cgaaacatcg catcgagcga gcacgtactc ggatggaagc cggtcttgtc gatcaggatg 2520
atctggacga agagcatcag gggctcgcgc cagccgaact gttcgccagg ctcaaggcgc 2580atctggacga agagcatcag gggctcgcgc cagccgaact gttcgccagg ctcaaggcgc 2580
gcatgcccga cggcgaggat ctcgtcgtga cccatggcga tgcctgcttg ccgaatatca 2640gcatgcccga cggcgaggat ctcgtcgtga cccatggcga tgcctgcttg ccgaatatca 2640
tggtggaaaa tggccgcttt tctggattca tcgactgtgg ccggctgggt gtggcggacc 2700tggtggaaaa tggccgcttt tctggattca tcgactgtgg ccggctgggt gtggcggacc 2700
gctatcagga catagcgttg gctacccgtg atattgctga agagcttggc ggcgaatggg 2760gctatcagga catagcgttg gctacccgtg atattgctga agagcttggc ggcgaatggg 2760
ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga ttcgcagcgc atcgccttct 2820ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga ttcgcagcgc atcgccttct 2820
atcgccttct tgacgagttc ttctgagcgg gactctggga atttcgacga cctgcagcca 2880atcgccttct tgacgagttc ttctgagcgg gactctggga atttcgacga cctgcagcca 2880
agcataactt cgtataatgt atgctatacg aacggtagga tcctctagag tcgacctgca 2940agcataactt cgtataatgt atgctatacg aacggtagga tcctctagag tcgacctgca 2940
ggcatgagat gtgtataaga gacag 2965ggcatgagat gtgtataaga gacag 2965
<210> 99<210> 99
<211> 3904<211> 3904
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета<223> Expression Cassette
<400> 99<400> 99
ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60
cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120
cgacaaaaat ctagaaataa ttttgtttaa ctttaagaag gagatataca aatgatcgct 180cgacaaaaat ctagaaataa ttttgtttaa ctttaagaag gagatataca aatgatcgct 180
caccgtcgtc aggaactggc tcaacagtat tatcaggctc tgcaccaaga tgtgctgccg 240caccgtcgtc aggaactggc tcaacagtat tatcaggctc tgcaccaaga tgtgctgccg 240
ttctgggaaa agtattcgct ggatcgtcaa ggcggtggct attttacctg cctggaccgc 300ttctgggaaa agtattcgct ggatcgtcaa ggcggtggct attttacctg cctggaccgc 300
aagggtcagg tttttgatac ggacaagttc atttggctgc aaaaccgtca agtgtggcaa 360aagggtcagg tttttgatac ggacaagttc atttggctgc aaaaccgtca agtgtggcaa 360
tttgcggttt tctacaatcg cctggaaccg aaaccgcagt ggctggaaat cgctcgtcat 420tttgcggttt tctacaatcg cctggaaccg aaaccgcagt ggctggaaat cgctcgtcat 420
ggtgcggatt ttctggcacg tcacggtcgt gatcaggacg gtaactggta tttcgccctg 480ggtgcggatt ttctggcacg tcacggtcgt gatcaggacg gtaactggta tttcgccctg 480
gatcaggaag gcaaaccgct gcgccaaccg tacaatgtgt tttccgactg tttcgcggcg 540gatcaggaag gcaaaccgct gcgccaaccg tacaatgtgt tttccgactg tttcgcggcg 540
atggcgttta gccagtatgc actggcttct ggtgctcaag aagcgaaggc cattgcactg 600atggcgttta gccagtatgc actggcttct ggtgctcaag aagcgaaggc cattgcactg 600
caagcgtata acaatgttct gcgtcgccag cataacccga aaggtcaata tgaaaagagt 660caagcgtata acaatgttct gcgtcgccag cataacccga aaggtcaata tgaaaagagt 660
tacccgggta cccgtccgct gaaatccctg gcagtgccga tgatcctggc taatctgacg 720tacccgggta cccgtccgct gaaatccctg gcagtgccga tgatcctggc taatctgacg 720
ctggaaatgg aatggctgct gccgccgacc acggtcgaag aagtgctggc ccagaccgtt 780ctggaaatgg aatggctgct gccgccgacc acggtcgaag aagtgctggc ccagaccgtt 780
cgtgaagtca tgacggattt tctggacccg gaaattggcc tgatgcgcga agcagttacc 840cgtgaagtca tgacggattt tctggacccg gaaattggcc tgatgcgcga agcagttacc 840
ccgacgggtg aatttgtcga ttcattcgaa ggccgcctgc tgaacccggg tcatggcatt 900ccgacgggtg aatttgtcga ttcattcgaa ggccgcctgc tgaacccggg tcatggcatt 900
gaagcgatgt ggtttatgat ggatattgcc cagcgttcgg gtgaccgcca gctgcaagaa 960gaagcgatgt ggtttatgat ggatattgcc cagcgttcgg gtgaccgcca gctgcaagaa 960
caggctattg cggtggttct gaataccctg gaatatgcat gggatgaaga atttggtggc 1020caggctattg cggtggttct gaataccctg gaatatgcat gggatgaaga atttggtggc 1020
atcttttact tcctggaccg tcaaggtcac ccgccgcagc aactggaatg ggatcagaaa 1080atcttttact tcctggaccg tcaaggtcac ccgccgcagc aactggaatg ggatcagaaa 1080
ctgtggtggg tccatctgga aaccctggtg gccctggcaa aaggtcacca ggcgacgggc 1140ctgtggtggg tccatctgga aaccctggtg gccctggcaa aaggtcacca ggcgacgggc 1140
caagaaaagt gctggcagtg gtttgaacgc gtgcatgatt atgcatggag ccactttgct 1200caagaaaagt gctggcagtg gtttgaacgc gtgcatgatt atgcatggag ccactttgct 1200
gacccggaat atggtgaatg gttcggctac ctgaaccgtc gcggtgaagt gctgctgaat 1260gacccggaat atggtgaatg gttcggctac ctgaaccgtc gcggtgaagt gctgctgaat 1260
ctgaaaggtg gcaaatggaa gggctgcttc cacgttccgc gtgcgctgtg gctgtgtgcc 1320ctgaaaggtg gcaaatggaa gggctgcttc cacgttccgc gtgcgctgtg gctgtgtgcc 1320
gaaaccctgc aactgccggt ctcttaattt cgtcgacaca caggaaacat attaaaaatt 1380gaaaccctgc aactgccggt ctcttaattt cgtcgacaca caggaaacat attaaaaatt 1380
aaaacctgca ggagtttaaa cgcggccgcg atatcgttgt aaaacgacgg ccagtgcaag 1440aaaacctgca ggagtttaaa cgcggccgcg atatcgttgt aaaacgacgg ccagtgcaag 1440
aatcataaaa aatttatttg ctttcaggaa aatttttctg tataatagat tcataaattt 1500aatcataaaa aatttatttg ctttcaggaa aatttttctg tataatagat tcataaattt 1500
gagagaggag tttttgtgag cggataacaa ttccccatct tagtatatta gttaagtata 1560gagagaggag tttttgtgag cggataacaa ttccccatct tagtatatta gttaagtata 1560
aatacacaag gagatataca tatgaaagaa atcaaaatcc agaacatcat catcagcgaa 1620aatacacaag gagatataca tatgaaagaa atcaaaatcc agaacatcat catcagcgaa 1620
gaaaaagcgc cgctggttgt gccggaaatc ggcattaacc ataatggtag tctggaactg 1680gaaaaagcgc cgctggttgt gccggaaatc ggcattaacc ataatggtag tctggaactg 1680
gcaaaaatca tggtggatgc ggcctttagc gccggtgcaa aaatcattaa acatcagacc 1740gcaaaaatca tggtggatgc ggcctttagc gccggtgcaa aaatcattaa acatcagacc 1740
cacattgtgg aagatgaaat gtctaaagca gcgaaaaaag ttatcccggg caacgcgaaa 1800cacattgtgg aagatgaaat gtctaaagca gcgaaaaaag ttatcccggg caacgcgaaa 1800
atcagtatct acgaaatcat gcagaaatgc gcgctggatt acaaagatga actggccctg 1860atcagtatct acgaaatcat gcagaaatgc gcgctggatt acaaagatga actggccctg 1860
aaagaatata ccgaaaaact gggtctggtg tacctgtcta ccccgtttag tcgtgcgggt 1920aaagaatata ccgaaaaact gggtctggtg tacctgtcta ccccgtttag tcgtgcgggt 1920
gcaaaccgtc tggaagatat gggtgttagt gcgttcaaaa tcggcagcgg tgaatgtaac 1980gcaaaccgtc tggaagatat gggtgttagt gcgttcaaaa tcggcagcgg tgaatgtaac 1980
aattatccgc tgatcaaaca tattgccgca tttaaaaaac cgatgattgt tagcaccggc 2040aattatccgc tgatcaaaca tattgccgca tttaaaaaac cgatgattgt tagcaccggc 2040
atgaatagca tcgaatctat taaaccgacg gtgaaaatcc tgctggataa cgaaattccg 2100atgaatagca tcgaatctat taaaccgacg gtgaaaatcc tgctggataa cgaaattccg 2100
tttgttctga tgcataccac gaatctgtac ccgaccccgc acaacctggt gcgtctgaat 2160tttgttctga tgcataccac gaatctgtac ccgaccccgc acaacctggt gcgtctgaat 2160
gccatgctgg aactgaaaaa agaattctct tgcatggttg gtctgagtga tcacaccacg 2220gccatgctgg aactgaaaaa agaattctct tgcatggttg gtctgagtga tcacaccacg 2220
gataatctgg catgcctggg tgcagtggtt ctgggtgcgt gtgtgctgga acgtcatttc 2280gataatctgg catgcctggg tgcagtggtt ctgggtgcgt gtgtgctgga acgtcatttc 2280
accgatagca tgcaccgctc tggtccggat attgtttgta gtatggatac gaaagcactg 2340accgatagca tgcaccgctc tggtccggat attgtttgta gtatggatac gaaagcactg 2340
aaagaactga tcattcagag cgaacagatg gcgatcattc gcggcaacaa tgaatctaaa 2400aaagaactga tcattcagag cgaacagatg gcgatcattc gcggcaacaa tgaatctaaa 2400
aaagcggcca aacaggaaca ggtgaccatc gattttgcat tcgcgagtgt ggttagcatc 2460aaagcggcca aacaggaaca ggtgaccatc gattttgcat tcgcgagtgt ggttagcatc 2460
aaagatatca aaaaaggcga agtgctgagc atggataata tttgggttaa acgtccgggt 2520aaagatatca aaaaaggcga agtgctgagc atggataata tttgggttaa acgtccgggt 2520
ctgggcggta tctctgcagc ggaatttgaa aacattctgg gcaaaaaagc actgcgcgat 2580ctgggcggta tctctgcagc ggaatttgaa aacattctgg gcaaaaaagc actgcgcgat 2580
attgaaaatg atgcgcagct gtcttatgaa gatttcgcct aaaataacta gcataacccc 2640attgaaaatg atgcgcagct gtcttatgaa gatttcgcct aaaataacta gcataacccc 2640
ttggggcctc taaacgggtc ttgaggggtt ttttgctgaa accaatttgc ctggcggcag 2700ttggggcctc taaacgggtc ttgaggggtt ttttgctgaa accaatttgc ctggcggcag 2700
tagcgcggtg gtcccacctg accccatgcc gaactcagaa gtgaaacgcc gtagcgccga 2760tagcgcggtg gtcccacctg accccatgcc gaactcagaa gtgaaacgcc gtagcgccga 2760
tggtagtgtg gggtctcccc atgcgagagt agggaactgc caggcatcaa ataaaacgaa 2820tggtagtgtg gggtctcccc atgcgagagt agggaactgc caggcatcaa ataaaacgaa 2820
aggctcagtc gaaagactgg gcctttcggg atccaggccg gcctgttaac gaattaatct 2880aggctcagtc gaaagactgg gcctttcggg atccaggccg gcctgttaac gaattaatct 2880
tccgcggcgg tatcgataag cttgatatcg aattccgaag ttcctattct ctagaaagta 2940tccgcggcgg tatcgataag cttgatatcg aattccgaag ttcctattct ctagaaagta 2940
taggaacttc aggtctgaag aggagtttac gtccagccaa gctagcttgg ctgcaggtcg 3000taggaacttc aggtctgaag aggagtttac gtccagccaa gctagcttgg ctgcaggtcg 3000
tcgaaattct accgggtagg ggaggcgctt ttcccaaggc agtctggagc atgcgcttta 3060tcgaaattct accgggtagg ggaggcgctt ttcccaaggc agtctggagc atgcgcttta 3060
gcagccccgc tgggcacttg gcgctacaca agtggcctct ggcctcgcac acattccaca 3120gcagccccgc tgggcacttg gcgctacaca agtggcctct ggcctcgcac acattccaca 3120
tccaccggta ggcgccaacc ggctccgttc tttggtggcc ccttcgcgcc accttctact 3180tccaccggta ggcgccaacc ggctccgttc tttggtggcc ccttcgcgcc accttctact 3180
cctcccctag tcaggaagtt cccccccgcc ccgcagctcg cgtcgtgcag gacgtgacaa 3240cctcccctag tcaggaagtt cccccccgcc ccgcagctcg cgtcgtgcag gacgtgacaa 3240
atggaagtag cacgtctcac tagtctcgtg cagatggaca gcaccgctga gcaatggaag 3300atggaagtag cacgtctcac tagtctcgtg cagatggaca gcaccgctga gcaatggaag 3300
cgggtaggcc tttggggcag cggccaatag cagctttgct ccttcgcttt ctgggctcag 3360cgggtaggcc tttggggcag cggccaatag cagctttgct ccttcgcttt ctgggctcag 3360
gggcgggctc agggggcggg gcgggcgccc gaaggtcctc cggaggcccg gcattctgca 3420gggcgggctc agggggcggg gcgggcgccc gaaggtcctc cggaggcccg gcattctgca 3420
cgcttcaaaa gcgcacgtct gccgcgctgt tctcctcttc ctcatctccg ggcctttcga 3480cgcttcaaaa gcgcacgtct gccgcgctgt tctcctcttc ctcatctccg ggcctttcga 3480
cctgcagcct gttgacaatt aatcatcggc atagtatatc ggcatagtat aatacgacaa 3540cctgcagcct gttgacaatt aatcatcggc atagtatatc ggcatagtat aatacgacaa 3540
ggtgaggaac taaaccatgg gtcaaagtag cgatgaagcc aacgctcccg ttgcagggca 3600ggtgaggaac taaaccatgg gtcaaagtag cgatgaagcc aacgctcccg ttgcagggca 3600
gtttgcgctt cccctgagtg ccacctttgg cttaggggat cgcgtacgca agaaatctgg 3660gtttgcgctt cccctgagtg ccacctttgg cttaggggat cgcgtacgca agaaatctgg 3660
tgccgcttgg cagggtcaag tcgtcggttg gtattgcaca aaactcactc ctgaaggcta 3720tgccgcttgg cagggtcaag tcgtcggttg gtattgcaca aaactcactc ctgaaggcta 3720
tgcggtcgag tccgaatccc acccaggctc agtgcaaatt tatcctgtgg ctgcacttga 3780tgcggtcgag tccgaatccc acccaggctc agtgcaaatt tatcctgtgg ctgcacttga 3780
acgtgtggcc taatgagggg atcaattctc tagagctcgc tgatcagaag ttcctattct 3840acgtgtggcc taatgagggg atcaattctc tagagctcgc tgatcagaag ttcctattct 3840
ctagaaagta taggaacttc gatggcgcct catccctgaa gccaaagatg tgtataagag 3900ctagaaagta taggaacttc gatggcgcct catccctgaa gccaaagatg tgtataagag 3900
acag 3904acag 3904
<210> 100<210> 100
<211> 3793<211> 3793
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета<223> Expression Cassette
<400> 100<400> 100
ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60
cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120
cgacaaaaat ctagaaataa ttttgtttgg cgtcgagaag gagatagaaa atgtgcggta 180cgacaaaaat ctagaaataa ttttgtttgg cgtcgagaag gagatagaaa atgtgcggta 180
tcgttggtgc tatcgcacag cgtgatgtag cgaaaatcct cctggaaggt ctgcgtcgtc 240tcgttggtgc tatcgcacag cgtgatgtag cgaaaatcct cctggaaggt ctgcgtcgtc 240
tcgaataccg tggttacgac tctgccggtc tggcagtagt ggatgcagaa ggtcacatga 300tcgaataccg tggttacgac tctgccggtc tggcagtagt ggatgcagaa ggtcacatga 300
ctcgtctgcg tcgtctgggt aaagtgcaga tgctcgcgca ggcggcggaa gaacacccac 360ctcgtctgcg tcgtctgggt aaagtgcaga tgctcgcgca ggcggcggaa gaacacccac 360
tccacggtgg tacgggtatc gcacacactc gttgggcaac ccacggtgaa ccgtctgagg 420tccacggtgg tacgggtatc gcacacactc gttgggcaac ccacggtgaa ccgtctgagg 420
tcaacgcaca cccgcatgtt agcgagcaca tcgtagtcgt tcacaacggt atcatcgaga 480tcaacgcaca cccgcatgtt agcgagcaca tcgtagtcgt tcacaacggt atcatcgaga 480
accacgaacc actccgtgag gaactcaaag cccgtggtta caccttcgta agcgaaaccg 540accacgaacc actccgtgag gaactcaaag cccgtggtta caccttcgta agcgaaaccg 540
acacggaagt tatcgcccac ctcgttaact gggaactcaa acagggtggt actctgcgtg 600acacggaagt tatcgcccac ctcgttaact gggaactcaa acagggtggt actctgcgtg 600
aagcagttct gcgtgccatt ccacagctgc gtggtgcata cggtaccgtg atcatggact 660aagcagttct gcgtgccatt ccacagctgc gtggtgcata cggtaccgtg atcatggact 660
ctcgtcatcc ggataccctg ctcgccgcac gttctggttc tccactcgtt atcggtctgg 720ctcgtcatcc ggataccctg ctcgccgcac gttctggttc tccactcgtt atcggtctgg 720
gtatgggtga gaacttcatc gcctctgatc agctggccct gctcccagtt acccgtcgct 780gtatgggtga gaacttcatc gcctctgatc agctggccct gctcccagtt acccgtcgct 780
tcatcttcct ggaagagggt gacatcgccg aaatcacccg tcgttccgtt aacatcttcg 840tcatcttcct ggaagagggt gacatcgccg aaatcacccg tcgttccgtt aacatcttcg 840
acaaaacggg tgcggaagtt aaacgtcagg acatcgagtc taacctgcag tatgacgctg 900acaaaacggg tgcggaagtt aaacgtcagg acatcgagtc taacctgcag tatgacgctg 900
gtgacaaagg catctaccgt cactacatgc agaaagagat ctacgaacag ccgaacgcga 960gtgacaaagg catctaccgt cactacatgc agaaagagat ctacgaacag ccgaacgcga 960
tcaaaaacac cctgaccggt cgtatctctc acggtcaggt tgacctgtct gagctgggtc 1020tcaaaaacac cctgaccggt cgtatctctc acggtcaggt tgacctgtct gagctgggtc 1020
caaacgcgga cgaactcctg tccaaagtcg agcacatcca gatcctggct tgtggtacct 1080caaacgcgga cgaactcctg tccaaagtcg agcacatcca gatcctggct tgtggtacct 1080
cttacaactc cggtatggtt tctcgttact ggttcgaatc tctggcaggt atcccatgcg 1140cttacaactc cggtatggtt tctcgttact ggttcgaatc tctggcaggt atcccatgcg 1140
acgttgaaat cgcctccgaa ttccgttatc gtaaatctgc ggtacgtcgt aactccctca 1200acgttgaaat cgcctccgaa ttccgttatc gtaaatctgc ggtacgtcgt aactccctca 1200
tgatcaccct gtctcagtct ggtgaaaccg ctgatactct ggcaggtctg cgtctcagca 1260tgatcaccct gtctcagtct ggtgaaaccg ctgatactct ggcaggtctg cgtctcagca 1260
aagaactggg ttacctgggt tctctggcca tctgcaacgt tccgggttct agcctggttc 1320aagaactggg ttacctgggt tctctggcca tctgcaacgt tccgggttct agcctggttc 1320
gtgagtctgt gctggctctg atgaccaacg cgggtacgga gatcggtgtt gcctctacca 1380gtgagtctgt gctggctctg atgaccaacg cgggtacgga gatcggtgtt gcctctacca 1380
aagcgttcac tacccagctc actgtcctgc tgatgctggt tgccaaactg tctcgtctca 1440aagcgttcac tacccagctc actngtcctgc tgatgctggt tgccaaactg tctcgtctca 1440
aaggcctcga cgctagcatc gaacacgaca tcgtacacgg tctgcaggcc ctcccatctc 1500aaggcctcga cgctagcatc gaacacgaca tcgtacacgg tctgcaggcc ctcccatctc 1500
gtatcgagca gatgctgccg caggacaaac gtatcgaagc actggcagaa gacttcagcg 1560gtatcgagca gatgctgccg caggacaaac gtatcgaagc actggcagaa gacttcagcg 1560
acaaacacca cgcgctgttt ctgggtcgtg gtgaccagta cccaattgcg ctggaaggtg 1620acaaacacca cgcgctgttt ctgggtcgtg gtgaccagta cccaattgcg ctggaaggtg 1620
ccctgaaact gaaagagatc agctacatcc atgcagaggc atacgcagcg ggtgagctga 1680ccctgaaact gaaagagatc agctacatcc atgcagaggc atacgcagcg ggtgagctga 1680
aacatggtcc actggccctg atcgacgcag atatgccggt tattgtggtt gctccgaaca 1740aacatggtcc actggccctg atcgacgcag atatgccggt tattgtggtt gctccgaaca 1740
acggcctgct ggagaaactg aaatccaaca tcgaggaagt acgtgcgcgt ggtggtcagc 1800acggcctgct ggagaaactg aaatccaaca tcgaggaagt acgtgcgcgt ggtggtcagc 1800
tgtacgtgtt tgctgaccag gacgcgggtt tcgtttccag cgacaacatg cacatcatcg 1860tgtacgtgtt tgctgaccag gacgcgggtt tcgtttccag cgacaacatg cacatcatcg 1860
aaatgccgca tgttgaagag gtaatcgcgc caatcttcta caccgtaccg ctgcagctgc 1920aaatgccgca tgttgaagag gtaatcgcgc caatcttcta caccgtaccg ctgcagctgc 1920
tggcgtacca tgtagccctg atcaaaggta cggacgttga ccagccgcgt aacctggcga 1980tggcgtacca tgtagccctg atcaaaggta cggacgttga ccagccgcgt aacctggcga 1980
aatccgtgac cgtggaataa cgaaggagat agaaccatga gcttacccga tggattttat 2040aatccgtgac cgtggaataa cgaaggagat agaaccatga gcttacccga tggattttat 2040
ataaggcgaa tggaagaggg ggatttggaa caggtcactg agacgctaaa ggttttgacc 2100ataaggcgaa tggaagaggg ggatttggaa caggtcactg agacgctaaa ggttttgacc 2100
accgtgggca ctattacccc cgaatccttc agcaaactca taaaatactg gaatgaagcc 2160accgtgggca ctattacccc cgaatccttc agcaaactca taaaatactg gaatgaagcc 2160
acagtatgga atgataacga agataaaaaa ataatgcaat ataaccccat ggtgattgtg 2220acagtatgga atgataacga agataaaaaa ataatgcaat ataaccccat ggtgattgtg 2220
gacaagcgca ccgagacggt tgccgctacg gggaatatca tcatcgaaag aaagatcatt 2280gacaagcgca ccgagacggt tgccgctacg gggaatatca tcatcgaaag aaagatcatt 2280
catgaactgg ggctatgtgg ccacatcgag gacattgcag taaactccaa gtatcagggc 2340catgaactgg ggctatgtgg ccacatcgag gacattgcag taaactccaa gtatcagggc 2340
caaggtttgg gcaagctctt gattgatcaa ttggtaacta tcggctttga ctacggttgt 2400caaggtttgg gcaagctctt gattgatcaa ttggtaacta tcggctttga ctacggttgt 2400
tataagatta ttttagattg cgatgagaaa aatgtcaaat tctatgaaaa atgtgggttt 2460tataagatta ttttagattg cgatgagaaa aatgtcaaat tctatgaaaa atgtgggttt 2460
agcaacgcag gcgtggaaat gcaaattaga aaatagcatc cgtatcggaa acactagcat 2520agcaacgcag gcgtggaaat gcaaattaga aaatagcatc cgtatcggaa acactagcat 2520
aaccccttgg ggcctctaaa cgggtcttga ggggtttttt gctgaaacca atttgcctgg 2580aaccccttgg ggcctctaaa cgggtcttga ggggtttttt gctgaaacca atttgcctgg 2580
cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag 2640cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag 2640
cgccgatggt agtgtggggt ctccccatgc gagagtaggg aactgccagg catcaaataa 2700cgccgatggt agtgtggggt ctccccatgc gagagtaggg aactgccagg catcaaataa 2700
aacgaaaggc tcagtcgaaa gactgggcct ttcgcttcca caactttgta taataaagtt 2760aacgaaaggc tcagtcgaaa gactgggcct ttcgcttcca caactttgta taataaagtt 2760
gtccccacgg ccagtgaatt cgagctcggt acctaccgtt cgtataatgt atgctatacg 2820gtccccacgg ccagtgaatt cgagctcggt acctaccgtt cgtataatgt atgctatacg 2820
aagttatcga gctctagaga atgatcccct cattaggcca cacgttcaag tgcagcgcac 2880aagttatcga gctctagaga atgatcccct cattaggcca cacgttcaag tgcagcgcac 2880
accgtggaaa cggatgaagg cacgaaccca gttgacataa gcctgttcgg ttcgtaaact 2940accgtggaaa cggatgaagg cacgaaccca gttgacataa gcctgttcgg ttcgtaaact 2940
gtaatgcaag tagcgtatgc gctcacgcaa ctggtccaga accttgaccg aacgcagcgg 3000gtaatgcaag tagcgtatgc gctcacgcaa ctggtccaga accttgaccg aacgcagcgg 3000
tggtaacggc gcagtggcgg ttttcatggc ttgttatgac tgtttttttg tacagtctat 3060tggtaacggc gcagtggcgg ttttcatggc ttgttatgac tgtttttttg tacagtctat 3060
gcctcgggca tccaagcagc aagcgcgtta cgccgtgggt cgatgtttga tgttatggag 3120gcctcgggca tccaagcagc aagcgcgtta cgccgtgggt cgatgtttga tgttatggag 3120
cagcaacgat gttacgcagc agcaacgatg ttacgcagca gggcagtcgc cctaaaacaa 3180cagcaacgat gttacgcagc agcaacgatg ttacgcagca gggcagtcgc cctaaaacaa 3180
agttaggtgg ctcaagtatg ggcatcattc gcacatgtag gctcggccct gaccaagtca 3240agttaggtgg ctcaagtatg ggcatcattc gcacatgtag gctcggccct gaccaagtca 3240
aatccatgcg ggctgctctt gatcttttcg gtcgtgagtt cggagacgta gccacctact 3300aatccatgcg ggctgctctt gatcttttcg gtcgtgagtt cggagacgta gccacctact 3300
cccaacatca gccggactcc gattacctcg ggaacttgct ccgtagtaag acattcatcg 3360cccaacatca gccggactcc gattacctcg ggaacttgct ccgtagtaag acattcatcg 3360
cgcttgctgc cttcgaccaa gaagcggttg ttggcgctct cgcggcttac gttctgccca 3420cgcttgctgc cttcgaccaa gaagcggttg ttggcgctct cgcggcttac gttctgccca 3420
ggtttgagca gccgcgtagt gagatctata tctatgatct cgcagtctcc ggcgagcacc 3480ggtttgagca gccgcgtagt gagatctata tctatgatct cgcagtctcc ggcgagcacc 3480
ggaggcaggg cattgccacc gcgctcatca atctcctcaa gcatgaggcc aacgcgcttg 3540ggaggcaggg cattgccacc gcgctcatca atctcctcaa gcatgaggcc aacgcgcttg 3540
gtgcttatgt gatctacgtg caagcagatt acggtgacga tcccgcagtg gctctctata 3600gtgcttatgt gatctacgtg caagcagatt acggtgacga tcccgcagtg gctctctata 3600
caaagttggg catacgggaa gaagtgatgc actttgatat cgacccaagt accgccacct 3660caaagttggg catacgggaa gaagtgatgc actttgatat cgacccaagt accgccacct 3660
aacaattcgt tcaagccgag atcgtagaat ttcgacgacc tgcagccaag cataacttcg 3720aacaattcgt tcaagccgag atcgtagaat ttcgacgacc tgcagccaag cataacttcg 3720
tataatgtat gctatacgaa cggtaggatc ctctagagtc gacctgcagg catgagatgt 3780tataatgtat gctatacgaa cggtaggatc ctctagagtc gacctgcagg catgagatgt 3780
gtataagaga cag 3793gtataagaga cag 3793
<210> 101<210> 101
<211> 3847<211> 3847
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета<223> Expression Cassette
<400> 101<400> 101
ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60
cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120
cgacaaaaat ctagaaataa ttttgtttgg cgtcgagaag gagatagaac catgtccaac 180cgacaaaaat ctagaaataa ttttgtttgg cgtcgagaag gagatagaac catgtccaac 180
aatggctcgt caccgctggt gctttggtat aaccaactcg gcatgaatga tgtagacagg 240aatggctcgt caccgctggt gctttggtat aaccaactcg gcatgaatga tgtagacagg 240
gttgggggca aaaatgcctc cctgggtgaa atgattacta acctttccgg aatgggtgtt 300gttgggggca aaaatgcctc cctgggtgaa atgattacta acctttccgg aatgggtgtt 300
tccgttccga atggtttcgc cacaaccgcc gacgcgttta accagtttct ggaccaaagc 360tccgttccga atggtttcgc cacaaccgcc gacgcgttta accagtttct ggaccaaagc 360
ggcgtaaacc agcgcattta tgaactgctg gataaaacgg atattgacga tgttactcag 420ggcgtaaacc agcgcattta tgaactgctg gataaaacgg atattgacga tgttactcag 420
cttgcgaaag cgggcgcgca aatccgccag tggattatcg acactccctt ccagcctgag 480cttgcgaaag cgggcgcgca aatccgccag tggattatcg acactccctt ccagcctgag 480
ctggaaaacg ccatcagcga agcctatgca cagctttctg ccgatgacga aaacgcctct 540ctggaaaacg ccatcagcga agcctatgca cagctttctg ccgatgacga aaacgcctct 540
tttgcggtgc gctcctccgc caccgcagaa gatatgccgg acgcttcttt tgccggtcag 600tttgcggtgc gctcctccgc caccgcagaa gatatgccgg acgcttcttt tgccggtcag 600
caggaaacct tcctcaacgt tcagggtttt gacgccgttc tcgtggcagt gaaacatgta 660caggaaacct tcctcaacgt tcagggtttt gacgccgttc tcgtggcagt gaaacatgta 660
tttgcttctc tgtttaacga tcgcgccatc tcttatcgtg tgcaccaggg ttacgatcac 720tttgcttctc tgtttaacga tcgcgccatc tcttatcgtg tgcaccaggg ttacgatcac 720
cgtggtgtgg cgctctccgc cggtgttcaa cggatggtgc gctctgacct cgcatcatct 780cgtggtgtgg cgctctccgc cggtgttcaa cggatggtgc gctctgacct cgcatcatct 780
ggcgtgatgt tctccattga taccgaatcc ggctttgacc aggtggtgtt tatcacttcc 840ggcgtgatgt tctccattga taccgaatcc ggctttgacc aggtggtgtt tatcacttcc 840
gcatggggcc ttggtgagat ggtcgtgcag ggtgcggtta acccggatga gttttacgtg 900gcatggggcc ttggtgagat ggtcgtgcag ggtgcggtta acccggatga gttttacgtg 900
cataaaccga cactggcggc gaatcgcccg gctatcgtgc gccgcaccat ggggtcgaaa 960cataaaccga cactggcggc gaatcgcccg gctatcgtgc gccgcaccat ggggtcgaaa 960
aaaatccgca tggtttacgc gccgacccag gagcacggca agcaggttaa aatcgaagac 1020aaaatccgca tggtttacgc gccgacccag gagcacggca agcaggttaa aatcgaagac 1020
gtaccgcagg aacagcgtga catcttctcg ctgaccaacg aagaagtgca ggaactggca 1080gtaccgcagg aacagcgtga catcttctcg ctgaccaacg aagaagtgca ggaactggca 1080
aaacaggccg tacaaattga gaaacactac ggtcgcccga tggatattga gtgggcgaaa 1140aaacaggccg tacaaattga gaaacactac ggtcgcccga tggatattga gtgggcgaaa 1140
gatggccaca ccggtaaact gttcattgtg caggcgcgtc cggaaaccgt gcgctcacgc 1200gatggccaca ccggtaaact gttcattgtg caggcgcgtc cggaaaccgt gcgctcacgc 1200
ggtcaggtca tggagcgtta tacgctgcat tcacagggta agattatcgc cgaaggccgt 1260ggtcaggtca tggagcgtta tacgctgcat tcacagggta agattatcgc cgaaggccgt 1260
gctatcggtc atcgcatcgg tgcgggtccg gtgaaagtca tccatgatat cagcgaaatg 1320gctatcggtc atcgcatcgg tgcgggtccg gtgaaagtca tccatgatat cagcgaaatg 1320
aaccgcatcg aacctggtga cgtgctggtc actgacatga ccgacccgga ctgggaaccg 1380aaccgcatcg aacctggtga cgtgctggtc actgacatga ccgacccgga ctgggaaccg 1380
atcatgaaga aagcatctgc catcgtcacc aaccgtggcg gtcgtacctg tcacgcggcg 1440atcatgaaga aagcatctgc catcgtcacc aaccgtggcg gtcgtacctg tcacgcggcg 1440
atcatcgctc gtgaactggg cattccggcg gtagtgggct gtggtgatgc aacagaacgg 1500atcatcgctc gtgaactggg cattccggcg gtagtgggct gtggtgatgc aacagaacgg 1500
atgaaagacg gtgagaacgt cactgtttct tgtgccgaag gtgataccgg ttacgtctat 1560atgaaagacg gtgagaacgt cactgtttct tgtgccgaag gtgataccgg ttacgtctat 1560
gcggagttgc tggaatttag cgtgaaaagc tccagcgtag aaacgatgcc ggatctgccg 1620gcggagttgc tggaatttag cgtgaaaagc tccagcgtag aaacgatgcc ggatctgccg 1620
ttgaaagtga tgatgaacgt cggtaacccg gaccgagctt tcgacttcgc ctgtctgccg 1680ttgaaagtga tgatgaacgt cggtaacccg gaccgagctt tcgacttcgc ctgtctgccg 1680
aacgaaggcg tgggacttgc gcgtctggaa tttatcatca accgtatgat tggcgtccac 1740aacgaaggcg tgggacttgc gcgtctggaa tttatcatca accgtatgat tggcgtccac 1740
ccacgcgcac tgcttgagtt tgacgatcag gaaccgcagt tgcaaaacga aatccgcgag 1800ccacgcgcac tgcttgagtt tgacgatcag gaaccgcagt tgcaaaacga aatccgcgag 1800
atgatgaaag gttttgattc tccgcgtgaa ttttacgttg gtcgtctgac tgaagggatc 1860atgatgaaag gttttgattc tccgcgtgaa ttttacgttg gtcgtctgac tgaagggatc 1860
gcgacgctgg gtgccgcgtt ttatccgaag cgcgtcattg tccgtctctc tgattttaaa 1920gcgacgctgg gtgccgcgtt ttatccgaag cgcgtcattg tccgtctctc tgattttaaa 1920
tcgaacgaat atgccaacct ggtcggtggt gagcgttacg agccagatga agagaacccg 1980tcgaacgaat atgccaacct ggtcggtggt gagcgttacg agccagatga agagaacccg 1980
atgctcggct tccgtggcgc gggacgctat atttccgaca gcttccgcga ctgtttcgcg 2040atgctcggct tccgtggcgc gggacgctat atttccgaca gcttccgcga ctgtttcgcg 2040
ctggagtgcg aagcagtgaa acgtgtgcgc aacgacatgg ggctgaccaa cgttgagatc 2100ctggagtgcg aagcagtgaa acgtgtgcgc aacgacatgg ggctgaccaa cgttgagatc 2100
atgatcccgt tcgtgcgaac cgtagatcag gcgaaagcgg tggttgagga actggcgcgt 2160atgatcccgt tcgtgcgaac cgtagatcag gcgaaagcgg tggttgagga actggcgcgt 2160
caggggctga aacgtggtga gaacgggctg aaaatcatca tgatgtgtga aatcccgtcc 2220caggggctga aacgtggtga gaacgggctg aaaatcatca tgatgtgtga aatcccgtcc 2220
aacgccttgc tggccgagca gttcctcgaa tatttcgacg gcttctcaat tggctcaaac 2280aacgccttgc tggccgagca gttcctcgaa tatttcgacg gcttctcaat tggctcaaac 2280
gacatgacgc agctggcgct cggtctggat cgtgactccg gcgtggtgtc tgaactgttc 2340gacatgacgc agctggcgct cggtctggat cgtgactccg gcgtggtgtc tgaactgttc 2340
gatgagcgca acgatgcggt gaaagcactg ctgtcgatgg cgattcgtgc cgcgaagaaa 2400gatgagcgca acgatgcggt gaaagcactg ctgtcgatgg cgattcgtgc cgcgaagaaa 2400
cagggcaaat atgtcgggat ttgcggtcag ggtccgtccg accacgaaga ctttgccgca 2460cagggcaaat atgtcgggat ttgcggtcag ggtccgtccg accacgaaga ctttgccgca 2460
tggttgatgg aagaggggat cgatagcctg tctctgaacc cggacaccgt ggtgcaaacc 2520tggttgatgg aagaggggat cgatagcctg tctctgaacc cggacaccgt ggtgcaaacc 2520
tggttaagcc tggctgaact gaagaaataa catccgtatc ggaaacacta gcataacccc 2580tggttaagcc tggctgaact gaagaaataa catccgtatc ggaaacacta gcataacccc 2580
ttggggcctc taaacgggtc ttgaggggtt ttttgctgaa accaatttgc ctggcggcag 2640ttggggcctc taaacgggtc ttgaggggtt ttttgctgaa accaatttgc ctggcggcag 2640
tagcgcggtg gtcccacctg accccatgcc gaactcagaa gtgaaacgcc gtagcgccga 2700tagcgcggtg gtcccacctg accccatgcc gaactcagaa gtgaaacgcc gtagcgccga 2700
tggtagtgtg gggtctcccc atgcgagagt agggaactgc caggcatcaa ataaaacgaa 2760tggtagtgtg gggtctcccc atgcgagagt agggaactgc caggcatcaa ataaaacgaa 2760
aggctcagtc gaaagactgg gcctttcgct tccacaactt tgtataataa agttgtcccc 2820aggctcagtc gaaagactgg gcctttcgct tccacaactt tgtataataa agttgtcccc 2820
acggccagtg aattcgagct cggtacctac cgttcgtata atgtatgcta tacgaagtta 2880acggccagtg aattcgagct cggtacctac cgttcgtata atgtatgcta tacgaagtta 2880
tcgagctcta gagaatgatc ccctcattag gccacacgtt caagtgcagc gcacaccgtg 2940tcgagctcta gagaatgatc ccctcattag gccacacgtt caagtgcagc gcacaccgtg 2940
gaaacggatg aaggcacgaa cccagttgac ataagcctgt tcggttcgta aactgtaatg 3000gaaacggatg aaggcacgaa cccagttgac ataagcctgt tcggttcgta aactgtaatg 3000
caagtagcgt atgcgctcac gcaactggtc cagaaccttg accgaacgca gcggtggtaa 3060caagtagcgt atgcgctcac gcaactggtc cagaaccttg accgaacgca gcggtggtaa 3060
cggcgcagtg gcggttttca tggcttgtta tgactgtttt tttgtacagt ctatgcctcg 3120cggcgcagtg gcggttttca tggcttgtta tgactgtttt tttgtacagt ctatgcctcg 3120
ggcatccaag cagcaagcgc gttacgccgt gggtcgatgt ttgatgttat ggagcagcaa 3180ggcatccaag cagcaagcgc gttacgccgt gggtcgatgt ttgatgttat ggagcagcaa 3180
cgatgttacg cagcagcaac gatgttacgc agcagggcag tcgccctaaa acaaagttag 3240cgatgttacg cagcagcaac gatgttacgc agcagggcag tcgccctaaa acaaagttag 3240
gtggctcaag tatgggcatc attcgcacat gtaggctcgg ccctgaccaa gtcaaatcca 3300gtggctcaag tatgggcatc attcgcacat gtaggctcgg ccctgaccaa gtcaaatcca 3300
tgcgggctgc tcttgatctt ttcggtcgtg agttcggaga cgtagccacc tactcccaac 3360tgcgggctgc tcttgatctt ttcggtcgtg agttcggaga cgtagccacc tactcccaac 3360
atcagccgga ctccgattac ctcgggaact tgctccgtag taagacattc atcgcgcttg 3420atcagccgga ctccgattac ctcgggaact tgctccgtag taagacattc atcgcgcttg 3420
ctgccttcga ccaagaagcg gttgttggcg ctctcgcggc ttacgttctg cccaggtttg 3480ctgccttcga ccaagaagcg gttgttggcg ctctcgcggc ttacgttctg cccaggtttg 3480
agcagccgcg tagtgagatc tatatctatg atctcgcagt ctccggcgag caccggaggc 3540agcagccgcg tagtgagatc tatatctatg atctcgcagt ctccggcgag caccggaggc 3540
agggcattgc caccgcgctc atcaatctcc tcaagcatga ggccaacgcg cttggtgctt 3600agggcattgc caccgcgctc atcaatctcc tcaagcatga ggccaacgcg cttggtgctt 3600
atgtgatcta cgtgcaagca gattacggtg acgatcccgc agtggctctc tatacaaagt 3660atgtgatcta cgtgcaagca gattacggtg acgatcccgc agtggctctc tatacaaagt 3660
tgggcatacg ggaagaagtg atgcactttg atatcgaccc aagtaccgcc acctaacaat 3720tgggcatacg ggaagaagtg atgcactttg atatcgaccc aagtaccgcc acctaacaat 3720
tcgttcaagc cgagatcgta gaatttcgac gacctgcagc caagcataac ttcgtataat 3780tcgttcaagc cgagatcgta gaatttcgac gacctgcagc caagcataac ttcgtataat 3780
gtatgctata cgaacggtag gatcctctag agtcgacctg caggcatgag atgtgtataa 3840gtatgctata cgaacggtag gatcctctag agtcgacctg caggcatgag atgtgtataa 3840
gagacag 3847gagacag 3847
<210> 102<210> 102
<211> 5554<211> 5554
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Плазмида<223> Plasmid
<400> 102<400> 102
catcgattta ttatgacaac ttgacggcta catcattcac tttttcttca caaccggcac 60catcgattta ttatgacaac ttgacggcta catcattcac tttttcttca caaccggcac 60
ggaactcgct cgggctggcc ccggtgcatt ttttaaatac ccgcgagaaa tagagttgat 120ggaactcgct cgggctggcc ccggtgcatt ttttaaatac ccgcgagaaa tagagttgat 120
cgtcaaaacc aacattgcga ccgacggtgg cgataggcat ccgggtggtg ctcaaaagca 180cgtcaaaacc aacattgcga ccgacggtgg cgataggcat ccgggtggtg ctcaaaagca 180
gcttcgcctg gctgatacgt tggtcctcgc gccagcttaa gacgctaatc cctaactgct 240gcttcgcctg gctgatacgt tggtcctcgc gccagcttaa gacgctaatc cctaactgct 240
ggcggaaaag atgtgacaga cgcgacggcg acaagcaaac atgctgtgcg acgctggcga 300ggcggaaaag atgtgacaga cgcgacggcg acaagcaaac atgctgtgcg acgctggcga 300
tatcaaaatt gctgtctgcc aggtgatcgc tgatgtactg acaagcctcg cgtacccgat 360tatcaaaatt gctgtctgcc aggtgatcgc tgatgtactg acaagcctcg cgtacccgat 360
tatccatcgg tggatggagc gactcgttaa tcgcttccat gcgccgcagt aacaattgct 420tatccatcgg tggatggagc gactcgttaa tcgcttccat gcgccgcagt aacaattgct 420
caagcagatt tatcgccagc agctccgaat agcgcccttc cccttgcccg gcgttaatga 480caagcagatt tatcgccagc agctccgaat agcgcccttc cccttgcccg gcgttaatga 480
tttgcccaaa caggtcgctg aaatgcggct ggtgcgcttc atccgggcga aagaaccccg 540tttgcccaaa caggtcgctg aaatgcggct ggtgcgcttc atccgggcga aagaaccccg 540
tattggcaaa tattgacggc cagttaagcc attcatgcca gtaggcgcgc ggacgaaagt 600tattggcaaa tattgacggc cagttaagcc attcatgcca gtaggcgcgc ggacgaaagt 600
aaacccactg gtgataccat tcgcgagcct ccggatgacg accgtagtga tgaatctctc 660aaacccactg gtgataccat tcgcgagcct ccggatgacg accgtagtga tgaatctctc 660
ctggcgggaa cagcaaaata tcacccggtc ggcaaacaaa ttctcgtccc tgatttttca 720ctggcgggaa cagcaaaata tcacccggtc ggcaaacaaa ttctcgtccc tgatttttca 720
ccaccccctg accgcgaatg gtgagattga gaatataacc tttcattccc agcggtcggt 780ccaccccctg accgcgaatg gtgagattga gaatataacc tttcattccc agcggtcggt 780
cgataaaaaa atcgagataa ccgttggcct caatcggcgt taaacccgcc accagatggg 840cgataaaaaa atcgagataa ccgttggcct caatcggcgt taaacccgcc accagatggg 840
cattaaacga gtatcccggc agcaggggat cattttgcgc ttcagccata cttttcatac 900cattaaacga gtatcccggc agcaggggat cattttgcgc ttcagccata cttttcatac 900
tcccgccatt cagagaagaa accaattgtc catattgcat cagacattgc cgtcactgcg 960tcccgccatt cagagaagaa accaattgtc catattgcat cagacattgc cgtcactgcg 960
tcttttactg gctcttctcg ctaaccaaac cggtaacccc gcttattaaa agcattctgt 1020tcttttactg gctcttctcg ctaaccaaac cggtaacccc gcttattaaa agcattctgt 1020
aacaaagcgg gaccaaagcc atgacaaaaa cgcgtaacaa aagtgtctat aatcacggca 1080aacaaagcgg gaccaaagcc atgacaaaaa cgcgtaacaa aagtgtctat aatcacggca 1080
gaaaagtcca cattgattat ttgcacggcg tcacactttg ctatgccata gcatttttat 1140gaaaagtcca cattgattat ttgcacggcg tcacactttg ctatgccata gcatttttat 1140
ccataagatt agcggatcct acctgacgct ttttatcgca actctctact gtttctccat 1200ccataagatt agcggatcct acctgacgct ttttatcgca actctctact gtttctccat 1200
acccgttttt ttgggaattc gagctctaag gaggttataa aaaatgtcta atctgctgac 1260acccgttttt ttgggaattc gagctctaag gaggttataa aaaatgtcta atctgctgac 1260
ggtccaccaa aacctgccgg ctctgccggt cgatgctacc tctgatgaag ttcgcaaaaa 1320ggtccaccaa aacctgccgg ctctgccggt cgatgctacc tctgatgaag ttcgcaaaaa 1320
cctgatggat atgtttcgtg atcgccaggc attcagcgaa catacctgga aaatgctgct 1380cctgatggat atgtttcgtg atcgccaggc attcagcgaa catacctgga aaatgctgct 1380
gtccgtgtgc cgttcatggg cggcctggtg taaactgaac aatcgcaaat ggtttccggc 1440gtccgtgtgc cgttcatggg cggcctggtg taaactgaac aatcgcaaat ggtttccggc 1440
ggaaccggaa gatgtccgtg actatctgct gtacctgcag gcccgcggtc tggcagttaa 1500ggaaccggaa gatgtccgtg actatctgct gtacctgcag gcccgcggtc tggcagttaa 1500
aacgatccag caacatctgg gccaactgaa tatgctgcac cgtcgctccg gtctgccgcg 1560aacgatccag caacatctgg gccaactgaa tatgctgcac cgtcgctccg gtctgccgcg 1560
tccgagcgat tctaatgcgg tgtcactggt tatgcgtcgc attcgtaaag aaaacgtgga 1620tccgagcgat tctaatgcgg tgtcactggt tatgcgtcgc attcgtaaag aaaacgtgga 1620
tgcaggcgaa cgcgctaaac aggcactggc ttttgaacgt accgatttcg accaagttcg 1680tgcaggcgaa cgcgctaaac aggcactggc ttttgaacgt accgatttcg accaagttcg 1680
ctcgctgatg gaaaacagcg atcgttgcca ggacatccgc aatctggcgt tcctgggtat 1740ctcgctgatg gaaaacagcg atcgttgcca ggacatccgc aatctggcgt tcctgggtat 1740
tgcctataac accctgctgc gcattgcaga aatcgctcgt attcgcgtga aagatatcag 1800tgcctataac accctgctgc gcattgcaga aatcgctcgt attcgcgtga aagatatcag 1800
ccgtacggac ggcggtcgca tgctgattca catcggccgt accaaaacgc tggtctctac 1860ccgtacggac ggcggtcgca tgctgattca catcggccgt accaaaacgc tggtctctac 1860
cgcaggcgtg gaaaaagctc tgagtctggg tgtgacgaaa ctggttgaac gctggattag 1920cgcaggcgtg gaaaaagctc tgagtctggg tgtgacgaaa ctggttgaac gctggattag 1920
tgtctccggc gtggcggatg acccgaacaa ttacctgttt tgtcgtgttc gcaaaaatgg 1980tgtctccggc gtggcggatg acccgaacaa ttacctgttt tgtcgtgttc gcaaaaatgg 1980
tgtcgcagct ccgtcagcca cctcgcagct gagcacgcgt gcactggaag gcatcttcga 2040tgtcgcagct ccgtcagcca cctcgcagct gagcacgcgt gcactggaag gcatcttcga 2040
agctacccat cgcctgattt atggcgccaa agatgactcg ggtcaacgtt acctggcgtg 2100agctacccat cgcctgattt atggcgccaa agatgactcg ggtcaacgtt acctggcgtg 2100
gtctggtcac agtgcacgtg ttggtgccgc acgtgatatg gcccgtgccg gtgtttccat 2160gtctggtcac agtgcacgtg ttggtgccgc acgtgatatg gcccgtgccg gtgtttccat 2160
cccggaaatt atgcaggcag gcggttggac caacgttaat atcgtcatga actatattcg 2220cccggaaatt atgcaggcag gcggttggac caacgttaat atcgtcatga actatattcg 2220
caatctggac tcggaaacgg gtgctatggt tcgcctgctg gaagacggtg actaatgagt 2280caatctggac tcggaaacgg gtgctatggt tcgcctgctg gaagacggtg actaatgagt 2280
gccggagttc atcgaaaaaa tggacgaggc actggctgaa attggttttg tatttgggga 2340gccggagttc atcgaaaaaa tggacgaggc actggctgaa attggttttg tatttgggga 2340
gcaatggcga tgacgcatcc tcacgataat atccgggtag gcgcaatcac tttcgtctac 2400gcaatggcga tgacgcatcc tcacgataat atccgggtag gcgcaatcac tttcgtctac 2400
tccgttacaa agcgaggctg ggtatttccc ggcctttctg ttatccgaaa tccactgaaa 2460tccgttacaa agcgaggctg ggtatttccc ggcctttctg ttatccgaaa tccactgaaa 2460
gcacagcggc tggctgagga gataaataat aaacgagggg ctgtatgcac aaagcatctt 2520gcacagcggc tggctgagga gataaataat aaacgagggg ctgtatgcac aaagcatctt 2520
ctgttgagtt aagaacgagt atcgagatgg cacatagcct tgctcaaatt ggaatcaggt 2580ctgttgagtt aagaacgagt atcgagatgg cacatagcct tgctcaaatt ggaatcaggt 2580
ttgtgccaat accagtagaa acagacgaag aatccatggg tatggacagt tttccctttg 2640ttgtgccaat accagtagaa agacgaag aatccatggg tatggacagt tttccctttg 2640
atatgtaacg gtgaacagtt gttctacttt tgtttgttag tcttgatgct tcactgatag 2700atatgtaacg gtgaacagtt gttctacttt tgtttgttag tcttgatgct tcactgatag 2700
atacaagagc cataagaacc tcagatcctt ccgtatttag ccagtatgtt ctctagtgtg 2760atacaagagc cataagaacc tcagatcctt ccgtatttag ccagtatgtt ctctagtgtg 2760
gttcgttgtt tttgcgtgag ccatgagaac gaaccattga gatcatactt actttgcatg 2820gttcgttgtt tttgcgtgag ccatgagaac gaaccattga gatcatactt actttgcatg 2820
tcactcaaaa attttgcctc aaaactggtg agctgaattt ttgcagttaa agcatcgtgt 2880tcactcaaaa attttgcctc aaaactggtg agctgaattt ttgcagttaa agcatcgtgt 2880
agtgtttttc ttagtccgtt acgtaggtag gaatctgatg taatggttgt tggtattttg 2940agtgtttttc ttagtccgtt acgtaggtag gaatctgatg taatggttgt tggtattttg 2940
tcaccattca tttttatctg gttgttctca agttcggtta cgagatccat ttgtctatct 3000tcaccattca tttttatctg gttgttctca agttcggtta cgagatccat ttgtctatct 3000
agttcaactt ggaaaatcaa cgtatcagtc gggcggcctc gcttatcaac caccaatttc 3060agttcaactt ggaaaatcaa cgtatcagtc gggcggcctc gcttatcaac caccaatttc 3060
atattgctgt aagtgtttaa atctttactt attggtttca aaacccattg gttaagcctt 3120atattgctgt aagtgtttaa atctttactt attggtttca aaacccattg gttaagcctt 3120
ttaaactcat ggtagttatt ttcaagcatt aacatgaact taaattcatc aaggctaatc 3180ttaaactcat ggtagttatt ttcaagcatt aacatgaact taaattcatc aaggctaatc 3180
tctatatttg ccttgtgagt tttcttttgt gttagttctt ttaataacca ctcataaatc 3240tctatatttg ccttgtgagt tttcttttgt gttagttctt ttaataacca ctcataaatc 3240
ctcatagagt atttgttttc aaaagactta acatgttcca gattatattt tatgaatttt 3300ctcatagagt atttgttttc aaaagactta acatgttcca gattatattt tatgaatttt 3300
tttaactgga aaagataagg caatatctct tcactaaaaa ctaattctaa tttttcgctt 3360tttaactgga aaagataagg caatatctct tcactaaaaa ctaattctaa tttttcgctt 3360
gagaacttgg catagtttgt ccactggaaa atctcaaagc ctttaaccaa aggattcctg 3420gagaacttgg catagtttgt ccactggaaa atctcaaagc ctttaaccaa aggattcctg 3420
atttccacag ttctcgtcat cagctctctg gttgctttag ctaatacacc ataagcattt 3480atttccacag ttctcgtcat cagctctctg gttgctttag ctaatacacc ataagcattt 3480
tccctactga tgttcatcat ctgagcgtat tggttataag tgaacgatac cgtccgttct 3540tccctactga tgttcatcat ctgagcgtat tggttataag tgaacgatac cgtccgttct 3540
ttccttgtag ggttttcaat cgtggggttg agtagtgcca cacagcataa aattagcttg 3600ttccttgtag ggttttcaat cgtggggttg agtagtgcca cacagcataa aattagcttg 3600
gtttcatgct ccgttaagtc atagcgacta atcgctagtt catttgcttt gaaaacaact 3660gtttcatgct ccgttaagtc atagcgacta atcgctagtt catttgcttt gaaaacaact 3660
aattcagaca tacatctcaa ttggtctagg tgattttaat cactatacca attgagatgg 3720aattcagaca tacatctcaa ttggtctagg tgattttaat cactatacca attgagatgg 3720
gctagtcaat gataattact agtccttttc ctttgagttg tgggtatctg taaattctgc 3780gctagtcaat gataattact agtccttttc ctttgagttg tgggtatctg taaattctgc 3780
tagacctttg ctggaaaact tgtaaattct gctagaccct ctgtaaattc cgctagacct 3840tagacctttg ctggaaaact tgtaaattct gctagaccct ctgtaaattc cgctagacct 3840
ttgtgtgttt tttttgttta tattcaagtg gttataattt atagaataaa gaaagaataa 3900ttgtgtgttt tttttgttta tattcaagtg gttataattt atagaataaa gaaagaataa 3900
aaaaagataa aaagaataga tcccagccct gtgtataact cactacttta gtcagttccg 3960aaaaagataa aaagaataga tcccagccct gtgtataact cactacttta gtcagttccg 3960
cagtattaca aaaggatgtc gcaaacgctg tttgctcctc tacaaaacag accttaaaac 4020cagtattaca aaaggatgtc gcaaacgctg tttgctcctc tacaaaacag accttaaaac 4020
cctaaaggct taagtagcac cctcgcaagc tcggttgcgg ccgcaatcgg gcaaatcgct 4080cctaaaggct taagtagcac cctcgcaagc tcggttgcgg ccgcaatcgg gcaaatcgct 4080
gaatattcct tttgtctccg accatcaggc acctgagtcg ctgtcttttt cgtgacattc 4140gaatattcct tttgtctccg accatcaggc acctgagtcg ctgtcttttt cgtgacattc 4140
agttcgctgc gctcacggct ctggcagtga atgggggtaa atggcactac aggcgccttt 4200agttcgctgc gctcacggct ctggcagtga atgggggtaa atggcactac aggcgccttt 4200
tatggattca tgcaaggaaa ctacccataa tacaagaaaa gcccgtcacg ggcttctcag 4260tatggattca tgcaaggaaa ctacccataa tacaagaaaa gcccgtcacg ggcttctcag 4260
ggcgttttat ggcgggtctg ctatgtggtg ctatctgact ttttgctgtt cagcagttcc 4320ggcgttttat ggcgggtctg ctatgtggtg ctatctgact ttttgctgtt cagcagttcc 4320
tgccctctga ttttccagtc tgaccacttc ggattatccc gtgacaggtc attcagactg 4380tgccctctga ttttccagtc tgaccacttc ggattatccc gtgacaggtc attcagactg 4380
gctaatgcac ccagtaaggc agcggtatca tcaacggggt ctgacgctca gtggaacgaa 4440gctaatgcac ccagtaaggc agcggtatca tcaacggggt ctgacgctca gtggaacgaa 4440
aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt 4500aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt 4500
ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac 4560ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac 4560
agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc 4620agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc 4620
atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt accatctggc 4680atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt accatctggc 4680
cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata 4740cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata 4740
aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc 4800aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc 4800
cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc 4860cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc 4860
aacgttgttg ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca 4920aacgttgttg ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca 4920
ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa 4980ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa 4980
gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca 5040gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca 5040
ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt 5100ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt 5100
tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt 5160tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt 5160
tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg 5220tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg 5220
ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga 5280ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga 5280
tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc 5340tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc 5340
agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg 5400agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg 5400
acacggaaat gttgaatact catactcttc ctttttcaat attattgaag catttatcag 5460acacggaaat gttgaatact catactcttc ctttttcaat attattgaag catttatcag 5460
ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg 5520ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg 5520
gttccgcgca catttccccg aaaagtgcca cctg 5554gttccgcgca catttccccg aaaagtgcca cctg 5554
<210> 103<210> 103
<211> 3415<211> 3415
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета<223> Expression Cassette
<400> 103<400> 103
ccggccagat gattaattcc taatttttgt tgacactcta tcattgatag agttatttta 60ccggccagat gattaattcc taatttttgt tgacactcta tcattgatag agttatttta 60
ccactcccta tcagtgatag agaaaagtga aatgaatagt tcgacaaaaa tctagaaata 120ccactcccta tcagtgatag agaaaagtga aatgaatagt tcgacaaaaa tctagaaata 120
attttgttta actttaagaa ggagatatac aaatgaacaa cgacaactcc acgaccacca 180attttgttta actttaagaa ggagatatac aaatgaacaa cgacaactcc acgaccacca 180
acaataacgc tattgaaatc tatgtggatc gtgcgaccct gccgacgatc cagcaaatga 240acaataacgc tattgaaatc tatgtggatc gtgcgaccct gccgacgatc cagcaaatga 240
ccaaaattgt tagccagaaa acgtctaaca aaaaactgat ctcatggtcg cgctacccga 300ccaaaattgt tagccagaaa acgtctaaca aaaaactgat ctcatggtcg cgctacccga 300
ttaccgataa aagcctgctg aagaaaatta acgcggaatt tttcaaagaa caatttgaac 360ttaccgataa aagcctgctg aagaaaatta acgcggaatt tttcaaagaa caatttgaac 360
tgacggaaag cctgaaaaac atcatcctgt ctgaaaacat cgataacctg atcattcatg 420tgacggaaag cctgaaaaac atcatcctgt ctgaaaacat cgataacctg atcattcatg 420
gcaataccct gtggagtatt gatgtggttg acattatcaa agaagtcaac ctgctgggca 480gcaataccct gtggagtatt gatgtggttg acattatcaa agaagtcaac ctgctgggca 480
aaaatattcc gatcgaactg cacttttatg atgacggttc cgccgaatac gttcgtatct 540aaaatattcc gatcgaactg cacttttatg atgacggttc cgccgaatac gttcgtatct 540
acgaatttag taaactgccg gaatccgaac agaaatacaa aaccagcctg tctaaaaaca 600acgaatttag taaactgccg gaatccgaac agaaatacaa aaccagcctg tctaaaaaca 600
acatcaaatt ctcaatcgat ggcaccgact cgttcaaaaa cacgatcgaa aacatctacg 660acatcaaatt ctcaatcgat ggcaccgact cgttcaaaaa cacgatcgaa aacatctacg 660
gtttcagcca actgtatccg accacgtacc acatgctgcg tgcagatatc ttcgacacca 720gtttcagcca actgtatccg accacgtacc acatgctgcg tgcagatatc ttcgacacca 720
cgctgaaaat taacccgctg cgcgaactgc tgtcaaacaa catcaaacag atgaaatggg 780cgctgaaaat taacccgctg cgcgaactgc tgtcaaacaa catcaaacag atgaaatggg 780
attacttcaa agacttcaac tacaaacaaa aagatatctt ttactcactg accaacttca 840attacttcaa agacttcaac tacaaacaaa aagatatctt ttactcactg accaacttca 840
acccgaaaga aatccaggaa gacttcaaca aaaactcgaa caaaaacttc atcttcatcg 900acccgaaaga aatccaggaa gacttcaaca aaaactcgaa caaaaacttc atcttcatcg 900
gcagtaactc cgcgaccgcc acggcagaag aacaaatcaa tattatcagc gaagcgaaga 960gcagtaactc cgcgaccgcc acggcagaag aacaaatcaa tattatcagc gaagcgaaga 960
aagaaaacag cagcattatc accaattcaa tttcggatta tgacctgttt ttcaaaggtc 1020aagaaaacag cagcattatc accaattcaa tttcggatta tgacctgttt ttcaaaggtc 1020
atccgtctgc cacgtttaac gaacagatta tcaatgcaca cgatatgatc gaaatcaaca 1080atccgtctgc cacgtttaac gaacagatta tcaatgcaca cgatatgatc gaaatcaaca 1080
acaaaatccc gttcgaagct ctgatcatga ccggcattct gccggatgcc gttggcggta 1140acaaaatccc gttcgaagct ctgatcatga ccggcattct gccggatgcc gttggcggta 1140
tgggtagttc cgtctttttc agtatcccga aagaagtcaa aaacaaattc gtgttctata 1200tgggtagttc cgtctttttc agtatcccga aagaagtcaa aaacaaattc gtgttctata 1200
aaagtggtac ggatatcgaa aataactccc tgattcaggt gatgctgaaa ctgaatctga 1260aaagtggtac ggatatcgaa aataactccc tgattcaggt gatgctgaaa ctgaatctga 1260
ttaaccgcga taatattaaa ctgatctctg acatttaatt tcgtcgacac acaggaaaca 1320ttaaccgcga taatattaaa ctgatctctg acatttaatt tcgtcgacac acaggaaaca 1320
tattaaaaat taaaacctgc aggagtttaa acgcggccgc gatatcgttg taaaacgacg 1380tattaaaaat taaaacctgc aggagtttaa acgcggccgc gatatcgttg taaaacgacg 1380
gccagtgcaa gaatcataaa aaatttattt gctttcagga aaatttttct gtataataga 1440gccagtgcaa gaatcataaa aaatttattt gctttcagga aaatttttct gtataataga 1440
ttcataaatt tgagagagga gtttttgtga gcggataaca attccccatc ttagtatatt 1500ttcataaatt tgagagagga gtttttgtga gcggataaca attccccatc ttagtatatt 1500
agttaagtat aaatacacaa ggagatatac atatgagcct ggccattatc ccggcacgtg 1560agttaagtat aaatacacaa ggagatatac atatgagcct ggccattatc ccggcacgtg 1560
gcggttctaa aggcatcaaa aacaaaaacc tggttctgct gaacaataaa ccgctgattt 1620gcggttctaa aggcatcaaa aacaaaaacc tggttctgct gaacaataaa ccgctgattt 1620
attacaccat caaagcggcc ctgaacgcca aaagtattag caaagtggtt gtgagctctg 1680attacaccat caaagcggcc ctgaacgcca aaagtattag caaagtggtt gtgagctctg 1680
attctgatga aatcctgaac tacgcaaaaa gtcagaacgt tgatatcctg aaacgtccga 1740attctgatga aatcctgaac tacgcaaaaa gtcagaacgt tgatatcctg aaacgtccga 1740
tcagtctggc acaggatgat accacgagcg ataaagtgct gctgcatgcg ctgaaattct 1800tcagtctggc acaggatgat accacgagcg ataaagtgct gctgcatgcg ctgaaattct 1800
acaaagatta cgaagatgtt gtgttcctgc agccgaccag cccgctgcgt acgaatattc 1860acaaagatta cgaagatgtt gtgttcctgc agccgaccag cccgctgcgt acgaatattc 1860
acatcaacga agcgttcaac ctgtacaaaa acagcaacgc aaacgcgctg atttctgtta 1920acatcaacga agcgttcaac ctgtacaaaa acagcaacgc aaacgcgctg atttctgtta 1920
gtgaatgcga taacaaaatc ctgaaagcgt ttgtgtgcaa tgattgtggc gatctggccg 1980gtgaatgcga taacaaaatc ctgaaagcgt ttgtgtgcaa tgattgtggc gatctggccg 1980
gtatttgtaa cgatgaatac ccgttcatgc cgcgccagaa actgccgaaa acctatatga 2040gtatttgtaa cgatgaatac ccgttcatgc cgcgccagaa actgccgaaa acctatatga 2040
gcaatggtgc catctacatc ctgaaaatca aagaattcct gaacaacccg agcttcctgc 2100gcaatggtgc catctacatc ctgaaaatca aagaattcct gaacaacccg agcttcctgc 2100
agtctaaaac gaaacatttc ctgatggatg aaagtagctc tctggatatt gattgcctgg 2160agtctaaaac gaaacatttc ctgatggatg aaagtagctc tctggatatt gattgcctgg 2160
aagatctgaa aaaagtggaa cagatctgga aaaaataaaa tactgaaacc aatttgcctg 2220aagatctgaa aaaagtggaa cagatctgga aaaaataaaa tactgaaacc aatttgcctg 2220
gcggcagtag cgcggtggtc ccacctgacc ccatgccgaa ctcagaagtg aaacgccgta 2280gcggcagtag cgcggtggtc ccacctgacc ccatgccgaa ctcagaagtg aaacgccgta 2280
gcgccgatgg tagtgtgggg tctccccatg cgagagtagg gaactgccag gcatcaaata 2340gcgccgatgg tagtgtgggg tctccccatg cgagagtagg gaactgccag gcatcaaata 2340
aaacgaaagg ctcagtcgaa agactgggcc tttcgcttcc acaactttgt ataataaagt 2400aaacgaaagg ctcagtcgaa agactgggcc tttcgcttcc acaactttgt ataataaagt 2400
tgtccccacg gccagtgaat tcgagctcgg tacctaccgt tcgtataatg tatgctatac 2460tgtccccacg gccagtgaat tcgagctcgg tacctaccgt tcgtataatg tatgctatac 2460
gaagttatcg agctctagag aatgatcccc tcattaggcc acacgttcaa gtgcagcgca 2520gaagttatcg agctctagag aatgatcccc tcattaggcc acacgttcaa gtgcagcgca 2520
caccgtggaa acggatgaag gcacgaaccc agttgacata agcctgttcg gttcgtaaac 2580caccgtggaa acggatgaag gcacgaaccc agttgacata agcctgttcg gttcgtaaac 2580
tgtaatgcaa gtagcgtatg cgctcacgca actggtccag aaccttgacc gaacgcagcg 2640tgtaatgcaa gtagcgtatg cgctcacgca actggtccag aaccttgacc gaacgcagcg 2640
gtggtaacgg cgcagtggcg gttttcatgg cttgttatga ctgttttttt gtacagtcta 2700gtggtaacgg cgcagtggcg gttttcatgg cttgttatga ctgttttttt gtacagtcta 2700
tgcctcgggc atccaagcag caagcgcgtt acgccgtggg tcgatgtttg atgttatgga 2760tgcctcgggc atccaagcag caagcgcgtt acgccgtggg tcgatgtttg atgttatgga 2760
gcagcaacga tgttacgcag cagcaacgat gttacgcagc agggcagtcg ccctaaaaca 2820gcagcaacga tgttacgcag cagcaacgat gttacgcagc agggcagtcg ccctaaaaca 2820
aagttaggtg gctcaagtat gggcatcatt cgcacatgta ggctcggccc tgaccaagtc 2880aagttaggtg gctcaagtat gggcatcatt cgcacatgta ggctcggccc tgaccaagtc 2880
aaatccatgc gggctgctct tgatcttttc ggtcgtgagt tcggagacgt agccacctac 2940aaatccatgc gggctgctct tgatcttttc ggtcgtgagt tcggagacgt agccacctac 2940
tcccaacatc agccggactc cgattacctc gggaacttgc tccgtagtaa gacattcatc 3000tcccaacatc agccggactc cgattacctc gggaacttgc tccgtagtaa gacattcatc 3000
gcgcttgctg ccttcgacca agaagcggtt gttggcgctc tcgcggctta cgttctgccc 3060gcgcttgctg ccttcgacca agaagcggtt gttggcgctc tcgcggctta cgttctgccc 3060
aggtttgagc agccgcgtag tgagatctat atctatgatc tcgcagtctc cggcgagcac 3120aggtttgagc agccgcgtag tgagatctat atctatgatc tcgcagtctc cggcgagcac 3120
cggaggcagg gcattgccac cgcgctcatc aatctcctca agcatgaggc caacgcgctt 3180cggaggcagg gcattgccac cgcgctcatc aatctcctca agcatgaggc caacgcgctt 3180
ggtgcttatg tgatctacgt gcaagcagat tacggtgacg atcccgcagt ggctctctat 3240ggtgcttatg tgatctacgt gcaagcagat tacggtgacg atcccgcagt ggctctctat 3240
acaaagttgg gcatacggga agaagtgatg cactttgata tcgacccaag taccgccacc 3300acaaagttgg gcatacggga agaagtgatg cactttgata tcgacccaag taccgccacc 3300
taacaattcg ttcaagccga gatcgtagaa tttcgacgac ctgcagccaa gcataacttc 3360taacaattcg ttcaagccga gatcgtagaa tttcgacgac ctgcagccaa gcataacttc 3360
gtataatgta tgctatacga acggtaggat cctctagagt cgacctgcag gcatg 3415gtataatgta tgctatacga acggtaggat cctctagagt cgacctgcag gcatg 3415
<210> 104<210> 104
<211> 3763<211> 3763
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета<223> Expression Cassette
<400> 104<400> 104
ccggccagat gattaattcc taatttttgt tgacactcta tcattgatag agttatttta 60ccggccagat gattaattcc taatttttgt tgacactcta tcattgatag agttatttta 60
ccactcccta tcagtgatag agaaaagtga aatgaatagt tcgacaaaaa tctagaaata 120ccactcccta tcagtgatag agaaaagtga aatgaatagt tcgacaaaaa tctagaaata 120
attttgttta actttaagaa ggagatatac aaatgtgtaa cgataatcaa aatacggtcg 180attttgttta actttaagaa ggagatatac aaatgtgtaa cgataatcaa aatacggtcg 180
atgttgttgt gagcaccgtt aacgataacg tcatcgaaaa caacacgtac caagttaaac 240atgttgttgt gagcaccgtt aacgataacg tcatcgaaaa caacacgtac caagttaaac 240
cgatcgatac cccgaccacg tttgacagtt actcctggat tcagacgtgc ggcaccccga 300cgatcgatac cccgaccacg tttgacagtt actcctggat tcagacgtgc ggcaccccga 300
tcctgaaaga tgacgaaaaa tattcactgt cgtttgattt cgtcgccccg gaactggatc 360tcctgaaaga tgacgaaaaa tattcactgt cgtttgattt cgtcgccccg gaactggatc 360
aggacgaaaa attctgtttc gaatttaccg gcgatgttga cggtaaacgt tatgtcacgc 420aggacgaaaa attctgtttc gaatttaccg gcgatgttga cggtaaacgt tatgtcacgc 420
agaccaacct gacggtggtt gcaccgaccc tggaagttta cgtcgatcat gctagtctgc 480agaccaacct gacggtggtt gcaccgaccc tggaagttta cgtcgatcat gctagtctgc 480
cgtccctgca gcaactgatg aaaatcatcc agcagaaaaa cgaatactca cagaatgaac 540cgtccctgca gcaactgatg aaaatcatcc agcagaaaaa cgaatactca cagaatgaac 540
gtttcatttc gtggggccgc atcggtctga cggaagataa cgcggaaaaa ctgaatgccc 600gtttcatttc gtggggccgc atcggtctga cggaagataa cgcggaaaaa ctgaatgccc 600
atatttatcc gctggcaggc aacaatacct cacaggaact ggtggatgca gtgatcgatt 660atatttatcc gctggcaggc aacaatacct cacaggaact ggtggatgca gtgatcgatt 660
acgctgactc gaaaaaccgt ctgaatctgg aactgaacac gaataccgcg cacagctttc 720acgctgactc gaaaaaccgt ctgaatctgg aactgaacac gaataccgcg cacagctttc 720
cgaacctggc cccgattctg cgcattatca gctctaaaag caacatcctg atctctaaca 780cgaacctggc cccgattctg cgcattatca gctctaaaag caacatcctg atctctaaca 780
tcaacctgta cgatgacggc agtgctgaat atgtgaacct gtacaattgg aaagataccg 840tcaacctgta cgatgacggc agtgctgaat atgtgaacct gtacaattgg aaagataccg 840
aagacaaatc cgtgaaactg agcgattctt tcctggttct gaaagactac tttaacggta 900aagacaaatc cgtgaaactg agcgattctt tcctggttct gaaagactac tttaacggta 900
ttagttccga aaaaccgagc ggcatctatg gtcgctacaa ctggcatcaa ctgtataata 960ttagttccga aaaaccgagc ggcatctatg gtcgctacaa ctggcatcaa ctgtataata 960
cgtcttatta cttcctgcgt aaagattacc tgaccgttga accgcagctg cacgacctgc 1020cgtcttatta cttcctgcgt aaagattacc tgaccgttga accgcagctg cacgacctgc 1020
gcgaatatct gggcggtagt ctgaaacaaa tgtcctggga tggcttttca cagctgtcga 1080gcgaatatct gggcggtagt ctgaaacaaa tgtcctggga tggcttttca cagctgtcga 1080
aaggtgacaa agaactgttc ctgaacattg tcggctttga tcaggaaaaa ctgcagcaag 1140aaggtgacaa agaactgttc ctgaacattg tcggctttga tcaggaaaaa ctgcagcaag 1140
aataccagca atcagaactg ccgaatttcg tgtttacggg caccacgacc tgggcaggcg 1200aataccagca atcagaactg ccgaatttcg tgtttacggg caccacgacc tgggcaggcg 1200
gtgaaaccaa agaatattac gctcagcaac aggtgaacgt cgtgaacaat gcgattaatg 1260gtgaaaccaa agaatattac gctcagcaac aggtgaacgt cgtgaacaat gcgattaatg 1260
aaaccagccc gtattacctg ggccgtgaac atgacctgtt tttcaaaggt cacccgcgcg 1320aaaccagccc gtattacctg ggccgtgaac atgacctgtt tttcaaaggt cacccgcgcg 1320
gcggtattat caatgatatt atcctgggca gtttcaacaa tatgattgac atcccggcca 1380gcggtattat caatgatatt atcctgggca gtttcaacaa tatgattgac atcccggcca 1380
aagtgtcctt tgaagttctg atgatgacgg gtatgctgcc ggataccgtg ggcggtattg 1440aagtgtcctt tgaagttctg atgatgacgg gtatgctgcc ggataccgtg ggcggtattg 1440
cgtcatcgct gtattttagc atcccggccg aaaaagtctc tttcattgtg tttaccagct 1500cgtcatcgct gtattttagc atcccggccg aaaaagtctc tttcattgtg tttaccagct 1500
ctgatacgat caccgatcgt gaagacgcgc tgaaatctcc gctggtgcag gttatgatga 1560ctgatacgat caccgatcgt gaagacgcgc tgaaatctcc gctggtgcag gttatgatga 1560
ccctgggcat tgttaaagaa aaagatgtgc tgttctggtc ggatctgccg gattgttcct 1620ccctgggcat tgttaaagaa aaagatgtgc tgttctggtc ggatctgccg gattgttcct 1620
cgggtgtttg tattgctcag tattaatttc gtcgacacac aggaaacata ttaaaaatta 1680cgggtgtttg tattgctcag tattaatttc gtcgacacac aggaaacata ttaaaaatta 1680
aaacctgcag gagtttaaac gcggccgcga tatcgttgta aaacgacggc cagtgcaaga 1740aaacctgcag gagtttaaac gcggccgcga tatcgttgta aaacgacggc cagtgcaaga 1740
atcataaaaa atttatttgc tttcaggaaa atttttctgt ataatagatt cataaatttg 1800atcataaaaa atttatttgc tttcaggaaa atttttctgt ataatagatt cataaatttg 1800
agagaggagt ttttgtgagc ggataacaat tccccatctt agtatattag ttaagtataa 1860agagaggagt ttttgtgagc ggataacaat tccccatctt agtatattag ttaagtataa 1860
atacacaagg agatatacat atgagcctgg ccattatccc ggcacgtggc ggttctaaag 1920atacacaagg agatatacat atgagcctgg ccattatccc ggcacgtggc ggttctaaag 1920
gcatcaaaaa caaaaacctg gttctgctga acaataaacc gctgatttat tacaccatca 1980gcatcaaaaa caaaaacctg gttctgctga acaataaacc gctgatttat tacaccatca 1980
aagcggccct gaacgccaaa agtattagca aagtggttgt gagctctgat tctgatgaaa 2040aagcggccct gaacgccaaa agtattagca aagtggttgt gagctctgat tctgatgaaa 2040
tcctgaacta cgcaaaaagt cagaacgttg atatcctgaa acgtccgatc agtctggcac 2100tcctgaacta cgcaaaaagt cagaacgttg atatcctgaa acgtccgatc agtctggcac 2100
aggatgatac cacgagcgat aaagtgctgc tgcatgcgct gaaattctac aaagattacg 2160aggatgatac cacgagcgat aaagtgctgc tgcatgcgct gaaattctac aaagattacg 2160
aagatgttgt gttcctgcag ccgaccagcc cgctgcgtac gaatattcac atcaacgaag 2220aagatgttgt gttcctgcag ccgaccagcc cgctgcgtac gaatattcac atcaacgaag 2220
cgttcaacct gtacaaaaac agcaacgcaa acgcgctgat ttctgttagt gaatgcgata 2280cgttcaacct gtacaaaaac agcaacgcaa acgcgctgat ttctgttagt gaatgcgata 2280
acaaaatcct gaaagcgttt gtgtgcaatg attgtggcga tctggccggt atttgtaacg 2340acaaaatcct gaaagcgttt gtgtgcaatg attgtggcga tctggccggt atttgtaacg 2340
atgaataccc gttcatgccg cgccagaaac tgccgaaaac ctatatgagc aatggtgcca 2400atgaataccc gttcatgccg cgccagaaac tgccgaaaac ctatatgagc aatggtgcca 2400
tctacatcct gaaaatcaaa gaattcctga acaacccgag cttcctgcag tctaaaacga 2460tctacatcct gaaaatcaaa gaattcctga acaacccgag cttcctgcag tctaaaacga 2460
aacatttcct gatggatgaa agtagctctc tggatattga ttgcctggaa gatctgaaaa 2520aacatttcct gatggatgaa agtagctctc tggatattga ttgcctggaa gatctgaaaa 2520
aagtggaaca gatctggaaa aaataaaata ctgaaaccaa tttgcctggc ggcagtagcg 2580aagtggaaca gatctggaaa aaataaaata ctgaaaccaa tttgcctggc ggcagtagcg 2580
cggtggtccc acctgacccc atgccgaact cagaagtgaa acgccgtagc gccgatggta 2640cggtggtccc acctgacccc atgccgaact cagaagtgaa acgccgtagc gccgatggta 2640
gtgtggggtc tccccatgcg agagtaggga actgccaggc atcaaataaa acgaaaggct 2700gtgtggggtc tccccatgcg agagtaggga actgccaggc atcaaataaa acgaaaggct 2700
cagtcgaaag actgggcctt tcgcttccac aactttgtat aataaagttg tccccacggc 2760cagtcgaaag actgggcctt tcgcttccac aactttgtat aataaagttg tccccacggc 2760
cagtgaattc gagctcggta cctaccgttc gtataatgta tgctatacga agttatcgag 2820cagtgaattc gagctcggta cctaccgttc gtataatgta tgctatacga agttatcgag 2820
ctctagagaa tgatcccctc attaggccac acgttcaagt gcagcgcaca ccgtggaaac 2880ctctagagaa tgatcccctc attaggccac acgttcaagt gcagcgcaca ccgtggaaac 2880
ggatgaaggc acgaacccag ttgacataag cctgttcggt tcgtaaactg taatgcaagt 2940ggatgaaggc acgaacccag ttgacataag cctgttcggt tcgtaaactg taatgcaagt 2940
agcgtatgcg ctcacgcaac tggtccagaa ccttgaccga acgcagcggt ggtaacggcg 3000agcgtatgcg ctcacgcaac tggtccagaa ccttgaccga acgcagcggt ggtaacggcg 3000
cagtggcggt tttcatggct tgttatgact gtttttttgt acagtctatg cctcgggcat 3060cagtggcggt tttcatggct tgttatgact gtttttttgt acagtctatg cctcgggcat 3060
ccaagcagca agcgcgttac gccgtgggtc gatgtttgat gttatggagc agcaacgatg 3120ccaagcagca agcgcgttac gccgtgggtc gatgtttgat gttatggagc agcaacgatg 3120
ttacgcagca gcaacgatgt tacgcagcag ggcagtcgcc ctaaaacaaa gttaggtggc 3180ttacgcagca gcaacgatgt tacgcagcag ggcagtcgcc ctaaaacaaa gttaggtggc 3180
tcaagtatgg gcatcattcg cacatgtagg ctcggccctg accaagtcaa atccatgcgg 3240tcaagtatgg gcatcattcg cacatgtagg ctcggccctg accaagtcaa atccatgcgg 3240
gctgctcttg atcttttcgg tcgtgagttc ggagacgtag ccacctactc ccaacatcag 3300gctgctcttg atcttttcgg tcgtgagttc ggagacgtag ccacctactc ccaacatcag 3300
ccggactccg attacctcgg gaacttgctc cgtagtaaga cattcatcgc gcttgctgcc 3360ccggactccg attacctcgg gaacttgctc cgtagtaaga cattcatcgc gcttgctgcc 3360
ttcgaccaag aagcggttgt tggcgctctc gcggcttacg ttctgcccag gtttgagcag 3420ttcgaccaag aagcggttgt tggcgctctc gcggcttacg ttctgcccag gtttgagcag 3420
ccgcgtagtg agatctatat ctatgatctc gcagtctccg gcgagcaccg gaggcagggc 3480ccgcgtagtg agatctatat ctatgatctc gcagtctccg gcgagcaccg gaggcagggc 3480
attgccaccg cgctcatcaa tctcctcaag catgaggcca acgcgcttgg tgcttatgtg 3540attgccaccg cgctcatcaa tctcctcaag catgaggcca acgcgcttgg tgcttatgtg 3540
atctacgtgc aagcagatta cggtgacgat cccgcagtgg ctctctatac aaagttgggc 3600atctacgtgc aagcagatta cggtgacgat cccgcagtgg ctctctatac aaagttgggc 3600
atacgggaag aagtgatgca ctttgatatc gacccaagta ccgccaccta acaattcgtt 3660atacgggaag aagtgatgca ctttgatatc gacccaagta ccgccaccta acaattcgtt 3660
caagccgaga tcgtagaatt tcgacgacct gcagccaagc ataacttcgt ataatgtatg 3720caagccgaga tcgtagaatt tcgacgacct gcagccaagc ataacttcgt ataatgtatg 3720
ctatacgaac ggtaggatcc tctagagtcg acctgcaggc atg 3763ctatacgaac ggtaggatcc tctagagtcg acctgcaggc atg 3763
ПЕРЕЧЕНЬ ПОСЛЕДОВАТЕЛЬНОСТЕЙ LIST OF SEQUENCES
<110> Jennewein Biotechnologie GmbH<110> Jennewein Biotechnologie GmbH
<120> Получение сиалилированных сахаридов<120> Preparation of sialylated saccharides
<130> P 1802 WO<130>P 1802 WO
<160> 104 <160> 104
<170> PatentIn версия 3.5<170> PatentIn version 3.5
<210> 1<210> 1
<211> 1410<211> 1410
<212> ДНК<212> DNA
<213> Campylobacter coli<213> Campylobacter coli
<400> 1<400> 1
atgcaaaacg tcattatcgc tggtaacggt ccgagcctgc aatcaatcaa ctatcaacgc 60atgcaaaacg tcattatcgc tggtaacggt ccgagcctgc aatcaatcaa ctatcaacgc 60
ctgccgaaag aatacgacat cttccgctgc aaccagttct acttcgaaga taaatactac 120ctgccgaaag aatacgacat cttccgctgc aaccagttct acttcgaaga taaatactac 120
ctgggcaaaa acatcaaagc ggcctttttc aatccgtatc cgttcctgca gcaataccat 180ctgggcaaaa acatcaaagc ggcctttttc aatccgtatc cgttcctgca gcaataccat 180
accgcgaaac agctggtgtt caacaacgaa tacaaaatcg aaaacatctt ttgtagcacg 240accgcgaaac agctggtgtt caacaacgaa tacaaaatcg aaaacatctt ttgtagcacg 240
ttcaatctgc cgttcatcga aaaagataac ttcatcaaca aattttacga tttctttccg 300ttcaatctgc cgttcatcga aaaagataac ttcatcaaca aattttacga tttctttccg 300
gacgctaaac tgggtcacaa aatcatcgaa aacctgaaag aattttacgc gtacatcaaa 360gacgctaaac tgggtcacaa aatcatcgaa aacctgaaag aattttacgc gtacatcaaa 360
tacaacgaaa tctacctgaa caaacgtatt accagcggca tctatatgtg cgcaattgct 420tacaacgaaa tctacctgaa caaacgtatt accagcggca tctatatgtg cgcaattgct 420
atcgcgctgg gttataaaaa catttacctg tgtggcatcg atttctatga aggtgaaacg 480atcgcgctgg gttataaaaa catttacctg tgtggcatcg atttctatga aggtgaaacg 480
atctacccgt tcaaagccat gtctaaaaac attaagaaaa tttttccgtg gatcaaagat 540atctacccgt tcaaagccat gtctaaaaac attaagaaaa tttttccgtg gatcaaagat 540
ttcaacccga gtaacttcca ttccaaagaa tacgacatcg aaatcctgaa actgctggaa 600ttcaacccga gtaacttcca ttccaaagaa tacgacatcg aaatcctgaa actgctggaa 600
tcaatctaca aagttaacat ctacgcactg tgcgataact cggccctggc aaattacttc 660tcaatctaca aagttaacat ctacgcactg tgcgataact cggccctggc aaattacttc 660
ccgctgctgg tgaacaccga caattcattt gttctggaaa acaaatcgga tgactgtatc 720ccgctgctgg tgaacaccga caattcattt gttctggaaa acaaatcgga tgactgtatc 720
aacgatatcc tgctgaccaa caatacgccg ggcattaact tctataaaag ccagatccaa 780aacgatatcc tgctgaccaa caatacgccg ggcattaact tctataaaag ccagatccaa 780
gtcaacaata ccgaaattct gctgctgaac tttcagaata tgatcagcgc caaagaaaac 840gtcaacaata ccgaaattct gctgctgaac tttcagaata tgatcagcgc caaagaaaac 840
gaaatttcta acctgaacaa aatcctgcaa gactcataca aaaccatcaa cacgaaagaa 900gaaatttcta acctgaacaa aatcctgcaa gactcataca aaaccatcaa cacgaaagaa 900
aacgaaatta gtaatctgaa taaaatcctg caggattcct ataaaacgat taataccaaa 960aacgaaatta gtaatctgaa taaaatcctg caggattcct ataaaacgat taataccaaa 960
gaaaatgaaa tttcgaatct gaacaaaatc ctgcaggata aagacaaact gctgatcgtt 1020gaaaatgaaa tttcgaatct gaacaaaatc ctgcaggata aagacaaact gctgatcgtt 1020
aaagaaaacc tgctgaattt caaaagccgt catggtaaag ccaaatttcg cattcagaac 1080aaagaaaacc tgctgaattt caaaagccgt catggtaaag ccaaatttcg cattcagaac 1080
caactgtctt ataaactggg ccaggcaatg atggtcaata gcaaatctct gctgggttat 1140caactgtctt ataaactggg ccaggcaatg atggtcaata gcaaatctct gctgggttat 1140
atccgtatgc cgtttgtgct gagttacatc aaagacaaac acaaacagga acaaaaaatc 1200atccgtatgc cgtttgtgct gagttacatc aaagacaaac acaaacagga acaaaaaatc 1200
tatcaggaaa aaattaagaa agatccgagc ctgaccctgc cgccgctgga agattatccg 1260tatcaggaaa aaattaagaa agatccgagc ctgaccctgc cgccgctgga agattatccg 1260
gactacaaag aagctctgaa agaaaaagaa tgcctgacct atcgcctggg ccagacgctg 1320gactacaaag aagctctgaa agaaaaagaa tgcctgacct atcgcctggg cgacgctg 1320
attaaagcgg atcaagaatg gtacaaaggt ggctatgtga aaatgtggtt cgaaatcaaa 1380attaaagcgg atcaagaatg gtacaaaggt ggctatgtga aaatgtggtt cgaaatcaaa 1380
aaactgaaga aagaatacaa aaagaaataa 1410aaactgaaga aagaatacaa aaagaaataa 1410
<210> 2<210> 2
<211> 1146<211> 1146
<212> ДНК<212> DNA
<213> Vibrio sp.<213> Vibrio sp.
<400> 2<400> 2
atgaacaacg acaactccac gaccaccaac aataacgcta ttgaaatcta tgtggatcgt 60atgaacaacg acaactccac gaccaccaac aataacgcta ttgaaatcta tgtggatcgt 60
gcgaccctgc cgacgatcca gcaaatgacc aaaattgtta gccagaaaac gtctaacaaa 120gcgaccctgc cgacgatcca gcaaatgacc aaaattgtta gccagaaaac gtctaacaaa 120
aaactgatct catggtcgcg ctacccgatt accgataaaa gcctgctgaa gaaaattaac 180aaactgatct catggtcgcg ctacccgatt accgataaaa gcctgctgaa gaaaattaac 180
gcggaatttt tcaaagaaca atttgaactg acggaaagcc tgaaaaacat catcctgtct 240gcggaatttt tcaaagaaca atttgaactg acggaaagcc tgaaaaacat catcctgtct 240
gaaaacatcg ataacctgat cattcatggc aataccctgt ggagtattga tgtggttgac 300gaaaacatcg ataacctgat cattcatggc aataccctgt ggagtattga tgtggttgac 300
attatcaaag aagtcaacct gctgggcaaa aatattccga tcgaactgca cttttatgat 360attatcaaag aagtcaacct gctgggcaaa aatattccga tcgaactgca cttttatgat 360
gacggttccg ccgaatacgt tcgtatctac gaatttagta aactgccgga atccgaacag 420gacggttccg ccgaatacgt tcgtatctac gaatttagta aactgccgga atccgaacag 420
aaatacaaaa ccagcctgtc taaaaacaac atcaaattct caatcgatgg caccgactcg 480aaatacaaaa ccagcctgtc taaaaacaac atcaaattct caatcgatgg caccgactcg 480
ttcaaaaaca cgatcgaaaa catctacggt ttcagccaac tgtatccgac cacgtaccac 540ttcaaaaaca cgatcgaaaa catctacggt ttcagccaac tgtatccgac cacgtaccac 540
atgctgcgtg cagatatctt cgacaccacg ctgaaaatta acccgctgcg cgaactgctg 600atgctgcgtg cagatatctt cgacaccacg ctgaaaatta acccgctgcg cgaactgctg 600
tcaaacaaca tcaaacagat gaaatgggat tacttcaaag acttcaacta caaacaaaaa 660tcaaacaaca tcaaacagat gaaatgggat tacttcaaag acttcaacta caaacaaaaa 660
gatatctttt actcactgac caacttcaac ccgaaagaaa tccaggaaga cttcaacaaa 720gatatctttt actcactgac caacttcaac ccgaaagaaa tccaggaaga cttcaacaaa 720
aactcgaaca aaaacttcat cttcatcggc agtaactccg cgaccgccac ggcagaagaa 780aactcgaaca aaaacttcat cttcatcggc agtaactccg cgaccgccac ggcagaagaa 780
caaatcaata ttatcagcga agcgaagaaa gaaaacagca gcattatcac caattcaatt 840caaatcaata ttatcagcga agcgaagaaa gaaaacagca gcattatcac caattcaatt 840
tcggattatg acctgttttt caaaggtcat ccgtctgcca cgtttaacga acagattatc 900tcggattatg acctgttttt caaaggtcat ccgtctgcca cgtttaacga acagattatc 900
aatgcacacg atatgatcga aatcaacaac aaaatcccgt tcgaagctct gatcatgacc 960aatgcacacg atatgatcga aatcaacaac aaaatcccgt tcgaagctct gatcatgacc 960
ggcattctgc cggatgccgt tggcggtatg ggtagttccg tctttttcag tatcccgaaa 1020ggcattctgc cggatgccgt tggcggtatg ggtagttccg tctttttcag tatcccgaaa 1020
gaagtcaaaa acaaattcgt gttctataaa agtggtacgg atatcgaaaa taactccctg 1080gaagtcaaaa acaaattcgt gttctataaa agtggtacgg atatcgaaaa taactccctg 1080
attcaggtga tgctgaaact gaatctgatt aaccgcgata atattaaact gatctctgac 1140attcaggtga tgctgaaact gaatctgatt aaccgcgata atattaaact gatctctgac 1140
atttaa 1146atttaa 1146
<210> 3<210> 3
<211> 1173<211> 1173
<212> ДНК<212> DNA
<213> Photobacterium sp.<213> Photobacterium sp.
<400> 3<400> 3
atgggctgta atagcgactc caaccacaac aactccgacg gcaacatcac caaaaacaaa 60atgggctgta atagcgactc caaccacaac aactccgacg gcaacatcac caaaaacaaa 60
acgatcgaag tttatgtcga tcgtgcaacc ctgccgacga ttcagcaaat gacccagatc 120acgatcgaag tttatgtcga tcgtgcaacc ctgccgacga ttcagcaaat gacccagatc 120
atcaacgaaa atagcaacaa caaaaaactg atttcatggt cgcgctaccc gatcaatgat 180atcaacgaaa atagcaacaa caaaaaactg atttcatggt cgcgctaccc gatcaatgat 180
gaagaactgc tggaatcaat taacggctcg tttttcaaaa acaactctga actgatcaaa 240gaagaactgc tggaatcaat taacggctcg tttttcaaaa acaactctga actgatcaaa 240
agtctggatt ccatgattct gaccaatgac attaagaaag tgatcatcaa cggtaacacg 300agtctggatt ccatgattct gaccaatgac attaagaaag tgatcatcaa cggtaacacg 300
ctgtgggcgg ccgatgtggt taacatcatc aaatcaatcg aagcgttcgg caagaaaacc 360ctgtgggcgg ccgatgtggt taacatcatc aaatcaatcg aagcgttcgg caagaaaacc 360
gaaatcgaac tgaactttta tgatgacggt tcggccgaat atgtgcgtct gtacgacttt 420gaaatcgaac tgaactttta tgatgacggt tcggccgaat atgtgcgtct gtacgacttt 420
agcaaactgc cggaatctga acaggaatac aaaattagcc tgtctaaaga taacattctg 480agcaaactgc cggaatctga acaggaatac aaaattagcc tgtctaaaga taacattctg 480
agcagcatca acggcaccca gccgttcgaa aacgtcgtgg aaaacatcta cggtttcagt 540agcagcatca acggcaccca gccgttcgaa aacgtcgtgg aaaacatcta cggtttcagt 540
caactgtacc cgaccacgta ccacatgctg cgtgccgata tctttgaaac caatctgccg 600caactgtacc cgaccacgta ccacatgctg cgtgccgata tctttgaaac caatctgccg 600
ctgcgcagtc tgaaaggcgt tctgtccaac aacatcaaac agatgaaatg ggattacttc 660ctgcgcagtc tgaaaggcgt tctgtccaac aacatcaaac agatgaaatg ggattacttc 660
aaaaccttca acagccagca aaaagacaaa ttctacaact tcacgggttt taacccggat 720aaaaccttca acagccagca aaaagacaaa ttctacaact tcacgggttt taacccggat 720
gaaattatgg aacaatacaa agcaagcccg aacaaaaatt ttatcttcgt cggcaccaat 780gaaattatgg aacaatacaa agcaagcccg aacaaaaatt ttatcttcgt cggcaccaat 780
tctggcaccg caacggctga acagcaaatt gatatcctga ccgaagctaa aaacccgaac 840tctggcaccg caacggctga acagcaaatt gatatcctga ccgaagctaa aaacccgaac 840
agcccgatta tcacgaaatc gatccagggc ttcgacctgt ttttcaaagg tcatccgtct 900agcccgatta tcacgaaatc gatccagggc ttcgacctgt ttttcaaagg tcatccgtct 900
gcaacctaca acaaacaaat catcgatgct cacaacatga tcgaaatcta caacaaaatc 960gcaacctaca acaaacaaat catcgatgct cacaacatga tcgaaatcta caacaaaatc 960
ccgttcgaag cgctgatcat gaccgatgcc ctgccggatg cggtgggcgg tatgggcagc 1020ccgttcgaag cgctgatcat gaccgatgcc ctgccggatg cggtgggcgg tatgggcagc 1020
agcgtgtttt tcagcctgcc gaataccgtg gaaaacaaat tcattttcta taaatccgat 1080agcgtgtttt tcagcctgcc gaataccgtg gaaaacaaat tcattttcta taaatccgat 1080
acggacattg aaaacaatgc cctgatccag gttatgattg aactgaatat cgtgaaccgt 1140acggacattg aaaacaatgc cctgatccag gttatgattg aactgaatat cgtgaaccgt 1140
aatgatgtga aactgatctc ggacctgcaa taa 1173aatgatgtga aactgatctc ggacctgcaa taa 1173
<210> 4<210> 4
<211> 1167<211> 1167
<212> ДНК<212> DNA
<213> Pasteurella multocida<213> Pasteurella multocida
<400> 4<400> 4
atgaaaacga ttaccctgta tctggacccg gcgtccctgc cggcactgaa ccaactgatg 60atgaaaacga ttaccctgta tctggacccg gcgtccctgc cggcactgaa ccaactgatg 60
gattttacgc agaacaatga agacaaaacc catccgcgta tctttggcct gtctcgcttc 120gattttacgc agaacaatga agacaaaacc catccgcgta tctttggcct gtctcgcttc 120
aaaattccgg ataacattat cacccaatat cagaatatcc actttgttga actgaaagac 180aaaattccgg ataacattat cacccaatat cagaatatcc actttgttga actgaaagac 180
aatcgtccga cggaagccct gttcaccatt ctggatcagt acccgggtaa cattgaactg 240aatcgtccga cggaagccct gttcaccatt ctggatcagt acccggggtaa cattgaactg 240
gacatccatc tgaatattgc tcacagcgtc cagctgattc gtccgatcct ggcgtatcgc 300gacatccatc tgaatattgc tcacagcgtc cagctgattc gtccgatcct ggcgtatcgc 300
tttaaacatc tggatcgtgt gtccatccag cgcctgaacc tgtatgatga cggctcaatg 360tttaaacatc tggatcgtgt gtccatccag cgcctgaacc tgtatgatga cggctcaatg 360
gaatacgttg atctggaaaa agaagaaaac aaagacatct cggcagaaat taaacaagct 420gaatacgttg atctggaaaa agaagaaaac aaagacatct cggcagaaat taaacaagct 420
gaaaaacagc tgagccatta tctgctgacg ggtaaaatca aattcgataa cccgaccatt 480gaaaaacagc tgagccatta tctgctgacg ggtaaaatca aattcgataa cccgaccatt 480
gcgcgctacg tttggcagtc tgcctttccg gtcaaatatc acttcctgag tacggactac 540gcgcgctacg tttggcagtc tgcctttccg gtcaaatatc acttcctgag tacggactac 540
tttgaaaaag cagaatttct gcaaccgctg aaagaatatc tggcggaaaa ttaccagaaa 600tttgaaaaag cagaatttct gcaaccgctg aaagaatatc tggcggaaaa ttaccagaaa 600
atggattgga cggcctatca gcaactgacc ccggaacagc aagcatttta cctgaccctg 660atggattgga cggcctatca gcaactgacc ccggaacagc aagcatttta cctgaccctg 660
gttggcttca acgacgaagt caaacagagt ctggaagtgc agcaagcgaa atttattttc 720gttggcttca acgacgaagt caaacagagt ctggaagtgc agcaagcgaa atttattttc 720
acgggcacca cgacctggga aggtaatacc gatgttcgtg aatattacgc ccagcaacag 780acgggcacca cgacctggga aggtaatacc gatgttcgtg aatattacgc ccagcaacag 780
ctgaacctgc tgaatcattt tacccaggcg ggcggcgacc tgtttattgg tgaccattac 840ctgaacctgc tgaatcattt tacccaggcg ggcggcgacc tgtttattgg tgaccattac 840
aaaatttact tcaaaggtca cccgcgcggc ggtgaaatca acgattacat cctgaacaac 900aaaatttact tcaaaggtca cccgcgcggc ggtgaaatca acgattacat cctgaacaac 900
gcaaaaaaca tcacgaatat cccggctaat atctctttcg aagtgctgat gatgaccggc 960gcaaaaaaca tcacgaatat cccggctaat atctctttcg aagtgctgat gatgaccggc 960
ctgctgccgg ataaagtcgg cggtgtggct agctctctgt acttcagtct gccgaaagaa 1020ctgctgccgg ataaagtcgg cggtgtggct agctctctgt acttcagtct gccgaaagaa 1020
aaaattagtc acatcatctt caccagcaac aaacaggtca aatcaaaaga agatgccctg 1080aaaattagtc acatcatctt caccagcaac aaacaggtca aatcaaaaga agatgccctg 1080
aacaatccgt acgtgaaagt tatgcgtcgc ctgggtatta tcgatgaatc gcaagtgatc 1140aacaatccgt acgtgaaagt tatgcgtcgc ctgggtatta tcgatgaatc gcaagtgatc 1140
ttttgggaca gcctgaaaca gctgtaa 1167ttttgggaca gcctgaaaca gctgtaa 1167
<210> 5<210> 5
<211> 1116<211> 1116
<212> ДНК<212> DNA
<213> Neisseria meningitidis<213> Neisseria meningitidis
<400> 5<400> 5
atgggcctga aaaaagcctg cctgaccgtg ctgtgtctga tcgtgttttg cttcggcatc 60atgggcctga aaaaagcctg cctgaccgtg ctgtgtctga tcgtgttttg cttcggcatc 60
ttttatacgt tcgatcgtgt gaaccagggt gaacgcaatg cagttagtct gctgaaagaa 120ttttatacgt tcgatcgtgt gaaccagggt gaacgcaatg cagttagtct gctgaaagaa 120
aaactgttta acgaagaagg cgaaccggtg aatctgatct tctgttacac cattctgcaa 180aaactgttta acgaagaagg cgaaccggtg aatctgatct tctgttacac cattctgcaa 180
atgaaagttg ccgaacgtat tatggcacag catccgggtg aacgctttta tgtggttctg 240atgaaagttg ccgaacgtat tatggcacag catccgggtg aacgctttta tgtggttctg 240
atgagcgaaa accgtaacga aaaatacgat tactacttca accagatcaa agataaagcg 300atgagcgaaa accgtaacga aaaatacgat tactacttca accagatcaa agataaagcg 300
gaacgcgcct atttctttca cctgccgtac ggcctgaaca aaagttttaa tttcattccg 360gaacgcgcct atttctttca cctgccgtac ggcctgaaca aaagttttaa tttcattccg 360
acgatggcgg aactgaaagt gaaaagcatg ctgctgccga aagttaaacg tatctatctg 420acgatggcgg aactgaaagt gaaaagcatg ctgctgccga aagttaaacg tatctatctg 420
gcaagcctgg aaaaagtgtc tattgcggcc tttctgagca cctacccgga tgcggaaatc 480gcaagcctgg aaaaagtgtc tattgcggcc tttctgagca cctacccgga tgcggaaatc 480
aaaaccttcg atgatggcac gggtaatctg attcagagct ctagttatct gggcgatgaa 540aaaaccttcg atgatggcac gggtaatctg attcagagct ctagttatct gggcgatgaa 540
ttttctgtta acggtacgat caaacgtaat ttcgcccgca tgatgatcgg tgattggtct 600ttttctgtta acggtacgat caaacgtaat ttcgcccgca tgatgatcgg tgattggtct 600
attgcgaaaa cccgcaacgc cagtgatgaa cattacacga tcttcaaagg cctgaaaaac 660attgcgaaaa cccgcaacgc cagtgatgaa cattacacga tcttcaaagg cctgaaaaac 660
atcatggatg atggtcgtcg caaaatgacc tacctgccgc tgttcgatgc gtctgaactg 720atcatggatg atggtcgtcg caaaatgacc tacctgccgc tgttcgatgc gtctgaactg 720
aaaacgggcg atgaaaccgg cggtacggtg cgtattctgc tgggtagccc ggataaagaa 780aaaacgggcg atgaaaccgg cggtacggtg cgtattctgc tgggtagccc ggataaagaa 780
atgaaagaaa tctctgaaaa agcagcgaaa aacttcaaaa tccagtatgt tgccccgcac 840atgaaagaaa tctctgaaaa agcagcgaaa aacttcaaaa tccagtatgt tgccccgcac 840
ccgcgtcaga cctacggcct gagtggtgtg accacgctga acagcccgta tgttattgaa 900ccgcgtcaga cctacggcct gagtggtgtg accacgctga acagcccgta tgttattgaa 900
gattacatcc tgcgtgaaat taagaaaaac ccgcataccc gctatgaaat ctacacgttt 960gattacatcc tgcgtgaaat taagaaaaac ccgcataccc gctatgaaat ctacacgttt 960
ttcagcggcg ccgcactgac catgaaagat tttccgaacg tgcacgttta tgcactgaaa 1020ttcagcggcg ccgcactgac catgaaagat tttccgaacg tgcacgttta tgcactgaaa 1020
ccggcgtctc tgccggaaga ttattggctg aaaccggtgt acgcgctgtt tacccagagt 1080ccggcgtctc tgccggaaga ttattggctg aaaccggtgt acgcgctgtt tacccagagt 1080
ggtattccga tcctgacgtt cgatgataaa aattaa 1116ggtattccga tcctgacgtt cgatgataaa aattaa 1116
<210> 6<210> 6
<211> 852<211> 852
<212> ДНК<212> DNA
<213> Pasteurella multocida<213> Pasteurella multocida
<400> 6<400> 6
atggataaat ttgcagaaca tgaaattccg aaagcagtga tcgttgctgg caacggtgaa 60atggataaat ttgcagaaca tgaaattccg aaagcagtga tcgttgctgg caacggtgaa 60
agtctgtccc agattgatta tcgtctgctg ccgaaaaact acgacgtctt ccgttgcaac 120agtctgtccc agattgatta tcgtctgctg ccgaaaaact acgacgtctt ccgttgcaac 120
caattctact tcgaagaacg ctacttcctg ggcaataaaa tcaaagccgt gtttttcacc 180caattctact tcgaagaacg ctacttcctg ggcaataaaa tcaaagccgt gtttttcacc 180
ccgggtgttt ttctggaaca gtattacacg ctgtatcatc tgaaacgcaa caatgaatac 240ccgggtgttt ttctggaaca gtattacacg ctgtatcatc tgaaacgcaa caatgaatac 240
tttgtcgata acgtgattct gagctctttc aatcacccga ccgtggacct ggaaaaatca 300tttgtcgata acgtgattct gagctctttc aatcacccga ccgtggacct ggaaaaatca 300
cagaaaatcc aagcactgtt catcgatgtt atcaacggct acgaaaaata cctgtcgaaa 360cagaaaatcc aagcactgtt catcgatgtt atcaacggct acgaaaaata cctgtcgaaa 360
ctgaccgctt tcgatgttta tctgcgttac aaagaactgt atgaaaatca gcgcattacg 420ctgaccgctt tcgatgttta tctgcgttac aaagaactgt atgaaaatca gcgcattacg 420
agcggtgttt acatgtgcgc tgtcgcgatc gccatgggct ataccgatat ttacctgacg 480agcggtgttt acatgtgcgc tgtcgcgatc gccatgggct ataccgatat ttacctgacg 480
ggtatcgact tttatcaagc gtctgaagaa aactacgcct tcgataacaa aaaaccgaat 540ggtatcgact tttatcaagc gtctgaagaa aactacgcct tcgataacaa aaaaccgaat 540
attatccgtc tgctgccgga ctttcgcaaa gaaaaaaccc tgttcagcta tcattctaaa 600attatccgtc tgctgccgga ctttcgcaaa gaaaaaaccc tgttcagcta tcattctaaa 600
gatattgacc tggaagcgct gtcatttctg cagcaacatt accacgtgaa cttctactca 660gatattgacc tggaagcgct gtcatttctg cagcaacatt accacgtgaa cttctactca 660
atctcgccga tgagtccgct gtccaaacat tttccgatcc cgacggttga agatgactgt 720atctcgccga tgagtccgct gtccaaacat tttccgatcc cgacggttga agatgactgt 720
gaaaccacgt tcgtcgcccc gctgaaagaa aactatatta atgacatcct gctgccgccg 780gaaaccacgt tcgtcgcccc gctgaaagaa aactatatta atgacatcct gctgccgccg 780
cactttgtct atgaaaaact gggcgtggat aaactggcgg ccgcactgga acatcaccat 840cactttgtct atgaaaaact gggcgtggat aaactggcgg ccgcactgga acatcaccat 840
caccatcact aa 852caccatcact aa 852
<210> 7<210> 7
<211> 1158<211> 1158
<212> ДНК<212> DNA
<213> Pasteurella dagmatis<213> Pasteurella dagmatis
<400> 7<400> 7
atgaccattt acctggaccc ggcgtctctg ccgaccctga accaactgat gcattttacg 60atgaccattt acctggaccc ggcgtctctg ccgaccctga accaactgat gcattttacg 60
aaagaaagcg aagacaaaga aaccgcacgt atttttggct tctctcgctt taaactgccg 120aaagaaagcg aagacaaaga aaccgcacgt atttttggct tctctcgctt taaactgccg 120
gaaaaaatca cggaacagta caacaacatc catttcgtgg aaatcaaaaa caatcgtccg 180gaaaaaatca cggaacagta caacaacatc catttcgtgg aaatcaaaaa caatcgtccg 180
acggaagata ttttcaccat cctggaccag tacccggaaa aactggaact ggatctgcat 240acggaagata ttttcaccat cctggaccag tacccggaaa aactggaact ggatctgcat 240
ctgaacattg cacacagcat ccagctgttt catccgattc tgcaatatcg tttcaaacac 300ctgaacattg cacacagcat ccagctgttt catccgattc tgcaatatcg tttcaaacac 300
ccggatcgca ttagtatcaa atccctgaac ctgtatgatg acggcaccat ggaatacgtt 360ccggatcgca ttagtatcaa atccctgaac ctgtatgatg acggcaccat ggaatacgtt 360
gatctggaaa aagaagaaaa caaagacatc aaaagtgcga tcaaaaaagc cgaaaaacag 420gatctggaaa aagaagaaaa caaagacatc aaaagtgcga tcaaaaaagc cgaaaaacag 420
ctgtccgatt atctgctgac gggtaaaatt aactttgaca atccgaccct ggcacgctac 480ctgtccgatt atctgctgac gggtaaaatt aactttgaca atccgaccct ggcacgctac 480
gtttggcagt cacaatatcc ggtcaaatac catttcctgt cgacggaata ttttgaaaaa 540gtttggcagt cacaatatcc ggtcaaatac catttcctgt cgacggaata ttttgaaaaa 540
gctgaattcc tgcagccgct gaaaacctat ctggcgggca aataccaaaa aatggattgg 600gctgaattcc tgcagccgct gaaaacctat ctggcgggca aataccaaaa aatggattgg 600
tcagcctatg aaaaactgtc gccggaacag caaacgtttt acctgaaact ggtcggtttc 660tcagcctatg aaaaactgtc gccggaacag caaacgtttt acctgaaact ggtcggtttc 660
agtgatgaaa ccaaacagct gtttcacacg gaacaaacca aatttatttt cacgggcacc 720agtgatgaaa ccaaacagct gtttcacacg gaacaaacca aatttatttt cacgggcacc 720
acgacctggg agggtaacac cgatatccgt gaatattacg cgaaacagca actgaatctg 780acgacctggg agggtaacac cgatatccgt gaatattacg cgaaacagca actgaatctg 780
ctgaaacatt ttacccacag cgaaggcgac ctgtttatcg gtgaccagta caaaatctac 840ctgaaacatt ttacccacag cgaaggcgac ctgtttatcg gtgaccagta caaaatctac 840
ttcaaaggcc atccgcgcgg cggtgatatt aacgactata tcctgaaaca cgcaaaagat 900ttcaaaggcc atccgcgcgg cggtgatatt aacgactata tcctgaaaca cgcaaaagat 900
attacgaaca tcccggctaa tattagcttc gaaatcctga tgatgaccgg tctgctgccg 960attacgaaca tcccggctaa tattagcttc gaaatcctga tgatgaccgg tctgctgccg 960
gacaaagtcg gcggtgtggc gagctctctg tacttctctc tgccgaaaga aaaaatcagc 1020gacaaagtcg gcggtgtggc gagctctctg tacttctctc tgccgaaaga aaaaatcagc 1020
cacattatct tcacctctaa caagaaaatt aaaaacaaag aagatgccct gaatgacccg 1080cacattatct tcacctctaa caagaaaatt aaaaacaaag aagatgccct gaatgacccg 1080
tacgtgcgtg ttatgctgcg tctgggtatg attgacaaaa gccaaattat cttctgggat 1140tacgtgcgtg ttatgctgcg tctgggtatg attgacaaaa gccaaattat cttctgggat 1140
tctctgaaac aactgtaa 1158tctctgaaac aactgtaa 1158
<210> 8<210> 8
<211> 1173<211> 1173
<212> ДНК<212> DNA
<213> Photobacterium phosphoreum<213> Photobacterium phosphoreum
<400> 8<400> 8
atgggctgta actccgatag caaacacaat aacagtgatg gcaatattac caaaaacaaa 60atgggctgta actccgatag caaacacaat aacagtgatg gcaatattac caaaaacaaa 60
acgatcgaag tctatgtgga ccgtgcgacc ctgccgacga ttcagcaaat gacccagatc 120acgatcgaag tctatgtgga ccgtgcgacc ctgccgacga ttcagcaaat gacccagatc 120
atcaacgaaa atagcaacaa caaaaaactg atttcatggt cgcgttaccc gatcaatgat 180atcaacgaaa atagcaacaa caaaaaactg atttcatggt cgcgttaccc gatcaatgat 180
gaaacgctgc tggaatcaat taatggctcg tttttcaaaa accgcccgga actgatcaaa 240gaaacgctgc tggaatcaat taatggctcg tttttcaaaa accgcccgga actgatcaaa 240
agtctggatt ccatgattct gaccaacgaa attaagaaag tgatcatcaa cggtaacacg 300agtctggatt ccatgattct gaccaacgaa attaagaaag tgatcatcaa cggtaacacg 300
ctgtgggcag ttgacgtggt taatattatc aaaagcattg aagctctggg caagaaaacc 360ctgtgggcag ttgacgtggt taatattatc aaaagcattg aagctctggg caagaaaacc 360
gaaatcgaac tgaacttcta tgatgacggt tctgcggaat atgtgcgtct gtacgatttt 420gaaatcgaac tgaacttcta tgatgacggt tctgcggaat atgtgcgtct gtacgatttt 420
agccgcctgc cggaatctga acaggaatac aaaattagcc tgtctaaaga taacattcag 480agccgcctgc cggaatctga acaggaatac aaaattagcc tgtctaaaga taacattcag 480
agcagcatca acggcaccca accgttcgac aacagcatcg aaaacatcta cggtttctct 540agcagcatca acggcaccca accgttcgac aacagcatcg aaaacatcta cggtttctct 540
cagctgtatc cgaccacgta ccacatgctg cgtgccgata tctttgaaac caatctgccg 600cagctgtatc cgaccacgta ccacatgctg cgtgccgata tctttgaaac caatctgccg 600
ctgacgagtc tgaaacgcgt tatctccaac aacatcaaac agatgaaatg ggattacttc 660ctgacgagtc tgaaacgcgt tatctccaac aacatcaaac agatgaaatg ggattacttc 660
accacgttca attcccagca gaaaaacaaa ttttacaact tcaccggctt caacccggaa 720accacgttca attcccagca gaaaaacaaa ttttacaact tcaccggctt caacccggaa 720
aaaatcaaag aacaatacaa agcgagtccg cacgaaaatt ttattttcat tggcaccaac 780aaaatcaaag aacaatacaa agcgagtccg cacgaaaatt ttattttcat tggcaccaac 780
tccggcaccg ccaccgcaga acagcaaatt gatatcctga ccgaagccaa aaaaccggac 840tccggcaccg ccaccgcaga acagcaaatt gatatcctga ccgaagccaa aaaaccggac 840
tcaccgatta tcaccaacag cattcagggc ctggacctgt ttttcaaagg tcatccgtct 900tcaccgatta tcaccaacag cattcagggc ctggacctgt ttttcaaagg tcatccgtct 900
gcgacctata accagcaaat tatcgacgcc cacaacatga tcgaaatcta caacaaaatc 960gcgacctata accagcaaat tatcgacgcc cacaacatga tcgaaatcta caacaaaatc 960
ccgttcgaag cactgatcat gaccgatgca ctgccggacg ctgttggcgg tatgggtagt 1020ccgttcgaag cactgatcat gaccgatgca ctgccggacg ctgttggcgg tatgggtagt 1020
tccgtctttt tctcactgcc gaataccgtc gaaaacaaat tcattttcta taaatcggat 1080tccgtctttt tctcactgcc gaataccgtc gaaaacaaat tcattttcta taaatcggat 1080
acggacattg aaaacaatgc tctgatccag gttatgatcg aactgaatat cgtgaaccgc 1140acggacattg aaaacaatgc tctgatccag gttatgatcg aactgaatat cgtgaaccgc 1140
aatgatgtga aactgattag tgacctgcaa taa 1173aatgatgtga aactgattag tgacctgcaa taa 1173
<210> 9<210> 9
<211> 1254<211> 1254
<212> ДНК<212> DNA
<213> Avibacterium paragallinarum<213> Avibacterium paragallinarum
<400> 9<400> 9
atgcgtaaaa tcatcacctt cttcagcctg ttcttctcga tctcagcgtg gtgtcaaaaa 60atgcgtaaaa tcatcacctt cttcagcctg ttcttctcga tctcagcgtg gtgtcaaaaa 60
atggaaatct acctggacta tgcgtcgctg ccgagcctga acatgatcct gaacctggtt 120atggaaatct acctggacta tgcgtcgctg ccgagcctga acatgatcct gaacctggtt 120
gaaaacaaaa acaacgaaaa agtcgaacgt attatcggct tcgaacgctt tgatttcaac 180gaaaacaaaa acaacgaaaa agtcgaacgt attatcggct tcgaacgctt tgatttcaac 180
aaagaaattc tgaatagctt ctctaaagaa cgtatcgaat ttagtaaagt ctccattctg 240aaagaaattc tgaatagctt ctctaaagaa cgtatcgaat ttagtaaagt ctccattctg 240
gatatcaaag aattttcaga caaactgtac ctgaacattg aaaaatcgga tacgccggtg 300gatatcaaag aattttcaga caaactgtac ctgaacattg aaaaatcgga tacgccggtg 300
gacctgatta tccataccaa tctggatcac tcagttcgtt cgctgctgag catctttaaa 360gacctgatta tccataccaa tctggatcac tcagttcgtt cgctgctgag catctttaaa 360
accctgagtc cgctgttcca taaaatcaac atcgaaaaac tgtacctgta cgatgacggc 420accctgagtc cgctgttcca taaaatcaac atcgaaaaac tgtacctgta cgatgacggc 420
agcggtaact atgttgatct gtaccagcac cgccaagaaa atatttctgc gattctgatc 480agcggtaact atgttgatct gtaccagcac cgccaagaaa atatttctgc gattctgatc 480
gaagcccaga aaaaactgaa agacgcgctg gaaaatcgtg aaacggatac cgacaaactg 540gaagcccaga aaaaactgaa agacgcgctg gaaaatcgtg aaacggatac cgacaaactg 540
catagcctga cgcgctatac ctggcacaaa atctttccga cggaatatat cctgctgcgt 600catagcctga cgcgctatac ctggcacaaa atctttccga cggaatatat cctgctgcgt 600
ccggattacc tggatattga cgaaaaaatg caaccgctga aacatttcct gagcgatacc 660ccggattacc tggatattga cgaaaaaatg caaccgctga aacatttcct gagcgatacc 660
atcgtgtcta tggacctgtc tcgctttagt catttctcca aaaaccagaa agaactgttt 720atcgtgtcta tggacctgtc tcgctttagt catttctcca aaaaccagaa agaactgttt 720
ctgaaaatca cgcacttcga tcaaaacatc ttcaacgaac tgaacatcgg caccaaaaac 780ctgaaaatca cgcacttcga tcaaaacatc ttcaacgaac tgaacatcgg caccaaaaac 780
aaagaataca aaacgttcat cttcaccggc accacgacct gggaaaaaga taagaaaaaa 840aaagaataca aaacgttcat cttcaccggc accacgacct gggaaaaaga taagaaaaaa 840
cgtctgaaca acgcgaaact gcagacggaa attctggaat cttttatcaa accgaacggc 900cgtctgaaca acgcgaaact gcagacggaa attctggaat cttttatcaa accgaacggc 900
aaattctacc tgggtaacga tatcaaaatc tttttcaaag gccacccgaa aggtgatgac 960aaattctacc tgggtaacga tatcaaaatc tttttcaaag gccacccgaa aggtgatgac 960
attaacgact acattatccg caaaaccggc gcagaaaaaa ttccggctaa catcccgttt 1020attaacgact acattatccg caaaaccggc gcagaaaaaa ttccggctaa catcccgttt 1020
gaagttctga tgatgacgaa tagtctgccg gattatgtcg gcggtattat gagtaccgtg 1080gaagttctga tgatgacgaa tagtctgccg gattatgtcg gcggtattat gagtaccgtg 1080
tacttttccc tgccgccgaa aaatattgat aaagtggttt tcctgggttc cgaaaaaatc 1140tacttttccc tgccgccgaa aaatattgat aaagtggttt tcctgggttc cgaaaaaatc 1140
aaaaacgaaa acgacgccaa atcacagacc ctgtcgaaac tgatgctgat gctgaacgtc 1200aaaaacgaaa acgacgccaa atcacagacc ctgtcgaaac tgatgctgat gctgaacgtc 1200
atcacgccgg aacagatttt ctttgaagaa atgccgaacc cgattaactt ttaa 1254atcacgccgg aacagatttt ctttgaagaa atgccgaacc cgattaactt ttaa 1254
<210> 10<210> 10
<211> 1293<211> 1293
<212> ДНК<212> DNA
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 10<400> 10
atgacccgca cccgtatgga aaacgaactg attgtgagca aaaacatgca gaacattatt 60atgacccgca cccgtatgga aaacgaactg attgtgagca aaaacatgca gaacattatt 60
atcgccggta acggtccgag cctgaaaaat attaactata aacgtctgcc gcgcgaatac 120atcgccggta acggtccgag cctgaaaaat attaactata aacgtctgcc gcgcgaatac 120
gatgtgttcc gttgcaacca gttctacttc gaagacaaat actacctggg caagaaaatt 180gatgtgttcc gttgcaacca gttctacttc gaagacaaat actacctggg caagaaaatt 180
aaagccgtgt ttttcaatcc gggcgtgttt ctgcaacaat atcataccgc aaaacagctg 240aaagccgtgt ttttcaatcc gggcgtgttt ctgcaacaat atcataccgc aaaacagctg 240
attctgaaaa acgaatacga aatcaaaaac atcttttgta gcaccttcaa tctgccgttt 300attctgaaaa acgaatacga aatcaaaaac atcttttgta gcaccttcaa tctgccgttt 300
atcgaatcta acgatttcct gcaccaattt tataactttt tcccggacgc taaactgggc 360atcgaatcta acgatttcct gcaccaattt tataactttt tcccggacgc taaactgggc 360
tacgaagtca tcgaaaacct gaaagaattt tacgcgtaca tcaaatacaa cgaaatctac 420tacgaagtca tcgaaaacct gaaagaattt tacgcgtaca tcaaatacaa cgaaatctac 420
ttcaacaaac gcatcacctc tggcgtgtat atgtgcgcga ttgccatcgc actgggttat 480ttcaacaaac gcatcacctc tggcgtgtat atgtgcgcga ttgccatcgc actgggttat 480
aaaacgattt acctgtgtgg catcgatttc tatgaaggtg acgttattta cccgtttgaa 540aaaacgattt acctgtgtgg catcgatttc tatgaaggtg acgttattta cccgtttgaa 540
gcaatgagta ccaacattaa aacgatcttc ccgggtatca aagatttcaa accgagtaac 600gcaatgagta ccaacattaa aacgatcttc ccgggtatca aagatttcaa accgagtaac 600
tgccattcca aagaatatga catcgaagcg ctgaaactgc tgaaaagcat ctacaaagtt 660tgccattcca aagaatatga catcgaagcg ctgaaactgc tgaaaagcat ctacaaagtt 660
aacatctacg ccctgtgtga tgacagtatt ctggcaaatc atttcccgct gtccattaac 720aacatctacg ccctgtgtga tgacagtatt ctggcaaatc atttcccgct gtccattaac 720
atcaacaaca acttcaccct ggaaaacaaa cacaacaact caatcaacga tattctgctg 780atcaacaaca acttcaccct ggaaaacaaa cacaacaact caatcaacga tattctgctg 780
accgacaata cgccgggcgt ctcgttttat aaaaatcagc tgaaagccga taacaaaatc 840accgacaata cgccgggcgt ctcgttttat aaaaatcagc tgaaagccga taacaaaatc 840
atgctgaact tctacaacat cctgcatagc aaagataacc tgatcaaatt cctgaacaaa 900atgctgaact tctacaacat cctgcatagc aaagataacc tgatcaaatt cctgaacaaa 900
gaaatcgctg ttctgaaaaa acagaccacg caacgtgcta aagcgcgcat tcagaaccac 960gaaatcgctg ttctgaaaaa acagaccacg caacgtgcta aagcgcgcat tcagaaccac 960
ctgagctata aactgggcca agccctgatt atcaatagca aatctgtcct gggtttcctg 1020ctgagctata aactgggcca agccctgatt atcaatagca aatctgtcct gggtttcctg 1020
tctctgccgt ttattatcct gtcaattgtg atctcgcaca aacaggaaca aaaagcgtat 1080tctctgccgt ttattatcct gtcaattgtg atctcgcaca aacaggaaca aaaagcgtat 1080
aaattcaaag tgaagaaaaa cccgaacctg gcactgccgc cgctggaaac ctatccggat 1140aaattcaaag tgaagaaaaa cccgaacctg gcactgccgc cgctggaaac ctatccggat 1140
tacaacgaag ccctgaaaga aaaagaatgc ttcacgtaca aactgggcga agaatttatc 1200tacaacgaag ccctgaaaga aaaagaatgc ttcacgtaca aactgggcga agaatttatc 1200
aaagcaggta aaaactggta tggcgaaggt tacatcaaat ttatcttcaa agatgttccg 1260aaagcaggta aaaactggta tggcgaaggt tacatcaaat ttatcttcaa agatgttccg 1260
cgtctgaaac gtgaatttga aaaaggcgaa taa 1293cgtctgaaac gtgaatttga aaaaggcgaa taa 1293
<210> 11<210> 11
<211> 1188<211> 1188
<212> ДНК<212> DNA
<213> Heliobacter acinonychis<213> Heliobacter acinonychis
<400> 11<400> 11
atgaataaga aaccgctgat tattgctggc aacgggccaa gcatcaaaga cttagattat 60atgaataaga aaccgctgat tattgctggc aacgggccaa gcatcaaaga cttagattat 60
gcgttgttcc cgaaagactt tgatgtattc cgatgtaatc aattctactt cgaggacaaa 120gcgttgttcc cgaaagactt tgatgtattc cgatgtaatc aattctactt cgaggacaaa 120
tactatttag ggcgggaaat aaaaggggtg ttctttaacg cgcacgtctt cgatctccaa 180tactatttag ggcgggaaat aaaaggggtg ttctttaacg cgcacgtctt cgatctccaa 180
atgaagatca ctaaagccat agtcaaaaac ggggaatatc acccggacca catatattgc 240atgaagatca ctaaagccat agtcaaaaac ggggaatatc acccggacca catatattgc 240
acacatgtcg aaccgtacgg ttacgttaac ggaaaccagc aactcatgca agagtacctg 300acacatgtcg aaccgtacgg ttacgttaac ggaaaccagc aactcatgca agagtacctg 300
gaaaaacatt ttgtgggagt ccgaagcacg tacgcatacc tgaaagatct agagccattc 360gaaaaacatt ttgtgggagt ccgaagcacg tacgcatacc tgaaagatct agagccattc 360
tttattctgc acagtaagta tcgcaacttc tacgaccagc acttcacaac gggcatcatg 420tttattctgc acagtaagta tcgcaacttc tacgaccagc acttcacaac gggcatcatg 420
atgctactgg tggccatcca attgggatac aaagaaatat acctgtgcgg aatagacttc 480atgctactgg tggccatcca attgggatac aaagaaatat acctgtgcgg aatagacttc 480
tacgaaaacg gattcggaca tttctacgag aaccaagggg gattctttga agaggatagc 540tacgaaaacg gattcggaca tttctacgag aaccaagggg gattctttga agaggatagc 540
gatccgatgc acgataagaa catagacatc caagcactgg aactggcaaa gaaatacgcg 600gatccgatgc acgataagaa catagacatc caagcactgg aactggcaaa gaaatacgcg 600
aaaatctacg cactggtacc gaacagcgcc ctagtgaaaa tgattccgtt gagcagccaa 660aaaatctacg cactggtacc gaacagcgcc ctagtgaaaa tgattccgtt gagcagccaa 660
aaaggagttc tggaaaaggt gaaggaccgg atcgggttgg gcgagtttaa gagagagaaa 720aaaggagttc tggaaaaggt gaaggaccgg atcgggttgg gcgagtttaa gagagagaaa 720
ttcgggcaaa aagaattgga aagacagaag gaattagaac gacaaaaaga gctcgaacgc 780ttcgggcaaa aagaattgga aagacagaag gaattagaac gacaaaaaga gctcgaacgc 780
caaaaggagc ttgaacgtca aaaggaactt gaacgacaaa aagagttgga gaggcagaaa 840caaaaggagc ttgaacgtca aaaggaactt gaacgacaaa aagagttgga gaggcagaaa 840
gaactcgaac gccaaaaaga attagagaga cagaaggaat tagagcgcca aaaggagctt 900gaactcgaac gccaaaaaga attagagaga cagaaggaat tagagcgcca aaaggagctt 900
gagcgtcaaa aagaattaga gaggcagaag gagttagaaa ggcagaaaga actggagaga 960gagcgtcaaa aagaattaga gaggcagaag gagttagaaa ggcagaaaga actggagaga 960
cagaaagaac tcgaaaggca gaaggagttg gaacgccaaa aagaactaga attagaacga 1020cagaaagaac tcgaaaggca gaaggagttg gaacgccaaa aagaactaga attagaacga 1020
tccttaaaag cacgattgaa agcggtactc gcgagcaaag gcatccgcgg cgacaacctg 1080tccttaaaag cacgattgaa agcggtactc gcgagcaaag gcatccgcgg cgacaacctg 1080
ataatcgtaa gtttaaaaga cacctaccga ctgtttaaag ggggatttgc gttactcttg 1140ataatcgtaa gtttaaaaga cacctaccga ctgtttaaag ggggatttgc gttactcttg 1140
gacctgaagg cgctaaagtc aatcattaaa gcattcctga agagataa 1188gacctgaagg cgctaaagtc aatcattaaa gcattcctga agagataa 1188
<210> 12<210> 12
<211> 783<211> 783
<212> ДНК<212> DNA
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 12<400> 12
atgggcaaaa aagtgattat tgcgggcaac ggcccgagcc tgaaagaaat tgattatagc 60atgggcaaaa aagtgattat tgcgggcaac ggcccgagcc tgaaagaaat tgattatagc 60
cgtctgccga acgattttga tgtgtttcgc tgcaaccagt tttatttcga agataaatat 120cgtctgccga acgattttga tgtgtttcgc tgcaaccagt tttatttcga agataaatat 120
tacctgggca aaaaatgcaa agcggtgttc tataatccga tcctgttctt cgaacagtat 180tacctgggca aaaaatgcaa agcggtgttc tataatccga tcctgttctt cgaacagtat 180
tacaccctga aacatctgat tcagaaccag gaatatgaaa ccgaactgat catgtgcagc 240tacaccctga aacatctgat tcagaaccag gaatatgaaa ccgaactgat catgtgcagc 240
aactataacc aggcgcatct ggaaaacgaa aactttgtga aaaccttcta cgattatttt 300aactataacc aggcgcatct ggaaaacgaa aactttgtga aaaccttcta cgattatttt 300
ccggatgcgc atctgggcta tgattttttc aaacagctga aagatttcaa cgcgtacttc 360ccggatgcgc atctgggcta tgattttttc aaacagctga aagatttcaa cgcgtacttc 360
aaattccacg aaatctattt caaccagcgt attaccagcg gcgtgtatat gtgcgcggtg 420aaattccacg aaatctattt caaccagcgt attaccagcg gcgtgtatat gtgcgcggtg 420
gcgattgcgc tgggctataa agaaatttat ctgagcggca tcgattttta tcagaacggc 480gcgattgcgc tgggctataa agaaatttat ctgagcggca tcgattttta tcagaacggc 480
agcagctatg cgtttgatac caaacagaaa aacctgctga aactggcccc gaactttaaa 540agcagctatg cgtttgatac caaacagaaa aacctgctga aactggcccc gaactttaaa 540
aacgataaca gccactatat tggccatagc aaaaacaccg atatcaaagc gctggaattt 600aacgataaca gccactatat tggccatagc aaaaacaccg atatcaaagc gctggaattt 600
ctggaaaaaa cctataaaat caaactgtat tgcctgtgcc cgaacagcct gctggccaac 660ctggaaaaaa cctataaaat caaactgtat tgcctgtgcc cgaacagcct gctggccaac 660
tttattgaac tggcaccgaa tctgaacagc aacttcatca tccaggaaaa aaacaactat 720tttattgaac tggcaccgaa tctgaacagc aacttcatca tccaggaaaa aaacaactat 720
accaaagata ttctgattcc gagcagcgaa gcgtatggca aattcagcaa aaacatcaac 780accaaagata ttctgattcc gagcagcgaa gcgtatggca aattcagcaa aaacatcaac 780
taa 783taa 783
<210> 13<210> 13
<211> 897<211> 897
<212> ДНК<212> DNA
<213> Streptococcus entericus<213> Streptococcus entericus
<400> 13<400> 13
atgaagaaag tctacttctg ccatacggtc taccatctgc tgattaccct gtgcaaaatt 60atgaagaaag tctacttctg ccatacggtc taccatctgc tgattaccct gtgcaaaatt 60
agcgttgaag aacaagttga aattattgtg ttcgataccg ttagtaatca tgaactgatt 120agcgttgaag aacaagttga aattattgtg ttcgataccg ttagtaatca tgaactgatt 120
gtccagaaaa tccgcgacgt gtttgttaac accacggtgc tgttcgcaga acaaaatacc 180gtccagaaaa tccgcgacgt gtttgttaac accacggtgc tgttcgcaga acaaaatacc 180
gatttttcca ttctggaaat cgatcgcgct acggacattt atgtgttcaa cgactggacc 240gatttttcca ttctggaaat cgatcgcgct acggacattt atgtgttcaa cgactggacc 240
ccgatcggcg cgtatctgcg taaaaacaaa ctgttttacc atctgatcga agatggttat 300ccgatcggcg cgtatctgcg taaaaacaaa ctgttttacc atctgatcga agatggttat 300
aactaccacg aatataacgt ttacgcgaat gccctgacca tgaaacgtcg cctgctgaac 360aactaccacg aatataacgt ttacgcgaat gccctgacca tgaaacgtcg cctgctgaac 360
ttcgtgctgc gtcgcgaaga accgtcaggc ttttcgcgtt atgttcgcag cattgaagtt 420ttcgtgctgc gtcgcgaaga accgtcaggc ttttcgcgtt atgttcgcag cattgaagtt 420
aaccgtgtca aatacctgcc gaatgattgc cgcaaaagca aatgggttga aaaaccgcgt 480aaccgtgtca aatacctgcc gaatgattgc cgcaaaagca aatgggttga aaaaccgcgt 480
tctgccctgt tcgaaaatct ggtcccggaa cataaacaga aaatcatcac gatcttcggc 540tctgccctgt tcgaaaatct ggtcccggaa cataaacaga aaatcatcac gatcttcggc 540
ctggaaaact atcaagatag cctgcgcggt gtcctggtgc tgacccagcc gctggtgcaa 600ctggaaaact atcaagatag cctgcgcggt gtcctggtgc tgacccagcc gctggtgcaa 600
gactactggg atcgcgacat taccacggaa gaagaacagc tggaatttta tcgtcaaatc 660gactactggg atcgcgacat taccacggaa gaagaacagc tggaatttta tcgtcaaatc 660
gtggaatctt acggcgaagg tgaacaggtg tttttcaaaa ttcacccgcg tgataaagtt 720gtggaatctt acggcgaagg tgaacaggtg tttttcaaaa ttcacccgcg tgataaagtt 720
gactatagct ctctgaccaa cgtcattttt ctgaagaaaa acgtcccgat ggaagtgtac 780gactatagct ctctgaccaa cgtcattttt ctgaagaaaa acgtcccgat ggaagtgtac 780
gaactgattg ccgattgtca ttttaccaaa ggtatcacgc acagttccac cgcactggac 840gaactgattg ccgattgtca ttttaccaaa ggtatcacgc acagttccac cgcactggac 840
ttcctgtcct gtgtggataa gaaaatcacc ctgaaacaaa tgaaagcaaa tagttaa 897ttcctgtcct gtgtggataa gaaaatcacc ctgaaacaaa tgaaagcaaa tagttaa 897
<210> 14<210> 14
<211> 888<211> 888
<212> ДНК<212> DNA
<213> Haemophilus ducreyi<213>Haemophilus ducreyi
<400> 14<400> 14
atgaaagaaa tcgccatcat ctccaaccaa cgcatgttct tcctgtactg tctgctgacc 60atgaaagaaa tcgccatcat ctccaaccaa cgcatgttct tcctgtactg tctgctgacc 60
aataaaaatg tcgaagacgt gttcttcatt tttgaaaaag gcgcgatgcc gaacaatctg 120aataaaaatg tcgaagacgt gttcttcatt tttgaaaaag gcgcgatgcc gaacaatctg 120
accagcattt ctcatttcat cgtgctggat cacagtaaat ccgaatgcta tgactttttc 180accagcattt ctcatttcat cgtgctggat cacagtaaat ccgaatgcta tgactttttc 180
tacttcaact tcatcagttg taaatatcgt ctgcgcggcc tggatgttta cggtgcagac 240tacttcaact tcatcagttg taaatatcgt ctgcgcggcc tggatgttta cggtgcagac 240
catatcaaag gcgctaaatt tttcctggaa cgtcaccgct ttttcgtggt tgaagatggt 300catatcaaag gcgctaaatt tttcctggaa cgtcaccgct ttttcgtggt tgaagatggt 300
atgatgaact acagcaaaaa catgtacgca ttctctctgt tccgtacccg caatccggtg 360atgatgaact acagcaaaaa catgtacgca ttctctctgt tccgtacccg caatccggtg 360
attctgccgg gcggttttca tccgaacgtt aaaaccatct tcctgacgaa agataatccg 420attctgccgg gcggttttca tccgaacgtt aaaaccatct tcctgacgaa agataatccg 420
attccggacc agatcgctca caaacgtgaa atcatcaaca tcaaaaccct gtggcaagcg 480attccggacc agatcgctca caaacgtgaa atcatcaaca tcaaaaccct gtggcaagcg 480
aaaaccgcca cggaaaaaac gaaaattctg agctttttcg aaatcgatat gcaggaaatt 540aaaaccgcca cggaaaaaac gaaaattctg agctttttcg aaatcgatat gcaggaaatt 540
tcagttatca aaaaccgctc gtttgtcctg tatacccaac cgctgtcaga agataaactg 600tcagttatca aaaaccgctc gtttgtcctg tatacccaac cgctgtcaga agataaactg 600
ctgacggaag cggaaaaaat tgacatctat cgtaccattc tgacgaaata caaccattcg 660ctgacggaag cggaaaaaat tgacatctat cgtaccattc tgacgaaata caaccattcg 660
cagaccgtta tcaaaccgca cccgcgcgat aaaacggact ataaacaact gtttccggat 720cagaccgtta tcaaaccgca cccgcgcgat aaaacggact ataaacaact gtttccggat 720
gcctatgtca tgaaaggcac ctacccgagt gaactgctga cgctgctggg tgtcaacttc 780gcctatgtca tgaaaggcac ctacccgagt gaactgctga cgctgctggg tgtcaacttc 780
aacaaagtga tcaccctgtt ttccacggcg gtcttcgatt atccgaaaga aaaaatcgac 840aacaaagtga tcaccctgtt ttccacggcg gtcttcgatt atccgaaaga aaaaatcgac 840
ttctacggca ccgcggtgca tccgaaactg ctggatttct ttgactaa 888ttctacggca ccgcggtgca tccgaaactg ctggatttct ttgactaa 888
<210> 15<210> 15
<211> 1467<211> 1467
<212> ДНК<212> DNA
<213> Alistipes sp.<213> Alistipes sp.
<400> 15<400> 15
atggccctgc tgagcggtac cgccgcatgc tcagatgacg aagtctcgca gaacctgatc 60atggccctgc tgagcggtac cgccgcatgc tcagatgacg aagtctcgca gaacctgatc 60
gtgattaatg gcggtgaaca ttttctgagc ctggatggtc tggcccgtgc aggtaaaatt 120gtgattaatg gcggtgaaca ttttctgagc ctggatggtc tggcccgtgc aggtaaaatt 120
agcgtgctgg caccggctcc gtggcgtgtt acgaaagcag ctggtgatac ctggtttcgc 180agcgtgctgg caccggctcc gtggcgtgtt acgaaagcag ctggtgatac ctggtttcgc 180
ctgagcgcaa ccgaaggtcc ggctggttac agcgaagtgg aactgtctct ggatgaaaat 240ctgagcgcaa ccgaaggtcc ggctggttac agcgaagtgg aactgtctct ggatgaaaat 240
ccgggtgccg cacgtagcgc acagctggcg tttgcctgtg gtgatgcgat tgtgccgttc 300ccgggtgccg cacgtagcgc acagctggcg tttgcctgtg gtgatgcgat tgtgccgttc 300
cgcctgagtc aaggcgcact gtccgctggt tatgattcac cggactatta cttttacgtt 360cgcctgagtc aaggcgcact gtccgctggt tatgattcac cggactatta cttttacgtt 360
accttcggca cgatgccgac cctgtatgcc ggtatccatc tgctgagcca cgataaaccg 420accttcggca cgatgccgac cctgtatgcc ggtatccatc tgctgagcca cgataaaccg 420
ggctatgtct tttactcacg ttcgaaaacg tttgacccgg ccgaattccc ggcacgtgct 480ggctatgtct tttactcacg ttcgaaaacg tttgacccgg ccgaattccc ggcacgtgct 480
gaagttacca ccgcagctga tcgtaccgcc gatgcaaccc aggccgaaat ggaagcaatg 540gaagttacca ccgcagctga tcgtaccgcc gatgcaaccc aggccgaaat ggaagcaatg 540
gctcgcgaaa tgaaacgtcg catcctggaa attaactctg cggatccgac cgccgtgttt 600gctcgcgaaa tgaaacgtcg catcctggaa attaactctg cggatccgac cgccgtgttt 600
ggcctgtatg ttgatgacct gcgttgccgc attggctacg attggttcgt ggcgcagggt 660ggcctgtatg ttgatgacct gcgttgccgc attggctacg attggttcgt ggcgcagggt 660
atcgacagtg cccgtgtcaa agtgagcatg ctgtctgatg gcaccggcac gtacaacaat 720atcgacagtg cccgtgtcaa agtgagcatg ctgtctgatg gcaccggcac gtacaacaat 720
ttttataact acttcggtga cgcggccacg gcggaacaaa attgggaaag ttatgcgtcc 780ttttataact acttcggtga cgcggccacg gcggaacaaa attgggaaag ttatgcgtcc 780
gaagttgaag ccctggattg gaatcacggc ggtcgttatc cggaaacccg ctcgctgccg 840gaagttgaag ccctggattg gaatcacggc ggtcgttatc cggaaacccg ctcgctgccg 840
gaatttgaaa gctacacgtg gccgtattac ctgtctaccc gtccggatta tcgcctggtg 900gaatttgaaa gctacacgtg gccgtattac ctgtctaccc gtccggatta tcgcctggtg 900
gttcaggacg gcagtctgct ggaaagctct tgtccgttta ttaccgaaaa actgggtgaa 960gttcaggacg gcagtctgct ggaaagctct tgtccgttta ttaccgaaaa actgggtgaa 960
atggaaatcg aatccattca accgtatgaa atgctgtcag ccctgccgga aagttcccgt 1020atggaaatcg aatccattca accgtatgaa atgctgtcag ccctgccgga aagttcccgt 1020
aaacgctttt atgatatggc aggcttcgat tacgacaaat ttgcagctct gttcgatgcg 1080aaacgctttt atgatatggc aggcttcgat tacgacaaat ttgcagctct gttcgatgcg 1080
tccccgaaga aaaacctgat tatcattggt acctctcatg cggatgatgc cagtgcacgt 1140tccccgaaga aaaacctgat tatcattggt acctctcatg cggatgatgc cagtgcacgt 1140
ctgcagcgtg attacgttgc acgcatcatg gaacagtatg gcgctcaata cgatgtcttt 1200ctgcagcgtg attacgttgc acgcatcatg gaacagtatg gcgctcaata cgatgtcttt 1200
ttcaaaccgc acccggcaga caccacgtca gctggttatg aaacggaatt tccgggcctg 1260ttcaaaccgc acccggcaga caccacgtca gctggttatg aaacggaatt tccgggcctg 1260
accctgctgc cgggtcaaat gccgtttgaa atcttcgttt ggtccctgat tgatcgtgtc 1320accctgctgc cgggtcaaat gccgtttgaa atcttcgttt ggtccctgat tgatcgtgtc 1320
gacatgatcg gcggttatcc gtcaacggtc tttctgaccg ttccggtcga taaagtgcgc 1380gacatgatcg gcggttatcc gtcaacggtc tttctgaccg ttccggtcga taaagtgcgc 1380
tttatttttg ccgcggatgc agcttctctg gtgcgtccgc tgaatatcct gttccgcgat 1440tttatttttg ccgcggatgc agcttctctg gtgcgtccgc tgaatatcct gttccgcgat 1440
gcgaccgacg ttgaatggat gcagtaa 1467gcgaccgacg ttgaatggat gcagtaa 1467
<210> 16<210> 16
<211> 876<211> 876
<212> ДНК<212> DNA
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 16<400> 16
atgaagaaag tgattatcgc cggcaatggt ccgagcctga aagaaattga ttattctcgt 60atgaagaaag tgattatcgc cggcaatggt ccgagcctga aagaaattga ttattctcgt 60
ctgccgaatg atttcgacgt ctttcgctgc aaccagttct actttgaaga caaatattac 120ctgccgaatg atttcgacgt ctttcgctgc aaccagttct actttgaaga caaatattac 120
ctgggcaaaa aatgtaaagc cgtgttttat accccgaact ttttctttga acagtattac 180ctgggcaaaa aatgtaaagc cgtgttttat accccgaact ttttctttga acagtattac 180
acgctgaaac atctgattca gaaccaagaa tatgaaaccg aactgatcat gtgctcaaac 240acgctgaaac atctgattca gaaccaagaa tatgaaaccg aactgatcat gtgctcaaac 240
tacaatcaag cacatctgga aaacgaaaac ttcgtcaaaa cgttctacga ttacttcccg 300tacaatcaag cacatctgga aaacgaaaac ttcgtcaaaa cgttctacga ttacttcccg 300
gacgctcacc tgggttacga tttctttaaa cagctgaaag aattcaacgc gtacttcaaa 360gacgctcacc tgggttacga tttctttaaa cagctgaaag aattcaacgc gtacttcaaa 360
ttccacgaaa tctacttcaa ccaacgtatc acctcaggcg tgtatatgtg tgcggttgcc 420ttccacgaaa tctacttcaa ccaacgtatc acctcaggcg tgtatatgtg tgcggttgcc 420
attgcactgg gttataaaga aatttacctg tcgggcatcg atttttatca gaatggtagc 480attgcactgg gttataaaga aatttacctg tcgggcatcg atttttatca gaatggtagc 480
tcttacgcct tcgacacgaa acaagaaaat ctgctgaaac tggcaccgga ttttaaaaac 540tcttacgcct tcgacacgaa acaagaaaat ctgctgaaac tggcaccgga ttttaaaaac 540
gaccgctcac attatattgg ccactcgaaa aacaccgata tcaaagctct ggaattcctg 600gaccgctcac attatattgg ccactcgaaa aacaccgata tcaaagctct ggaattcctg 600
gaaaaaacgt acaaaatcaa actgtactgc ctgtgtccga atagtctgct ggctaacttt 660gaaaaaacgt acaaaatcaa actgtactgc ctgtgtccga atagtctgct ggctaacttt 660
atcgaactgg cgccgaacct gaattccaac ttcatcatcc aggagaaaaa caactacacc 720atcgaactgg cgccgaacct gaattccaac ttcatcatcc aggagaaaaa caactacacc 720
aaagatatcc tgatcccgag ttccgaagcg tacggcaaat ttagcaaaaa catcaacttc 780aaagatatcc tgatcccgag ttccgaagcg tacggcaaat ttagcaaaaa catcaacttc 780
aagaaaatta aaatcaaaga aaacgtgtat tacaaactga ttaaagatct gctgcgtctg 840aagaaaatta aaatcaaaga aaacgtgtat tacaaactga ttaaagatct gctgcgtctg 840
ccgtctgaca tcaaacatta ttttaaaggt aaataa 876ccgtctgaca tcaaacatta ttttaaaggt aaataa 876
<210> 17<210> 17
<211> 939<211> 939
<212> ДНК<212> DNA
<213> Streptococcus agalactiae<213> Streptococcus agalactiae
<400> 17<400> 17
atgacgaatc gcaaaatcta tgtctgccac accctgtacc atctgctgat ctgcctgtat 60atgacgaatc gcaaaatcta tgtctgccac accctgtacc atctgctgat ctgcctgtat 60
aaagaagaaa tctactcaaa tctggaaatt atcctgagca gcagcattcc ggatgtggac 120aaagaagaaa tctactcaaa tctggaaatt atcctgagca gcagcattcc ggatgtggac 120
aacctggaga aaaaactgaa aagcaaaacc atcaacatcc atattctgga agaatcctca 180aacctggaga aaaaactgaa aagcaaaacc atcaacatcc atattctgga agaatcctca 180
ggcgaatctg aagaactgct gagtgttctg aaagatgcag gtctgtctta cagtaaattc 240ggcgaatctg aagaactgct gagtgttctg aaagatgcag gtctgtctta cagtaaattc 240
gatagcaact gcttcatctt caacgacgct accccgattg gccgtacgct gatcaaacac 300gatagcaact gcttcatctt caacgacgct accccgattg gccgtacgct gatcaaacac 300
ggtatttatt acaatctgat cgaagatggc ctgaactgtt ttacctactc gattttcagc 360ggtatttatt acaatctgat cgaagatggc ctgaactgtt ttacctactc gattttcagc 360
cagaaactgt ggaaatacta cgtgaaaaaa tacatcctgc ataaaattca accgcacggc 420cagaaactgt ggaaatacta cgtgaaaaaa tacatcctgc ataaaattca accgcacggc 420
ttttcccgct actgcctggg tatcgaagtg aacagtctgg ttaatctgcc gaaagatccg 480ttttcccgct actgcctggg tatcgaagtg aacagtctgg ttaatctgcc gaaagatccg 480
cgttacaaaa aattcatcga agtcccgcgc aaagaactgt tcgacaatgt tacggaatac 540cgttacaaaa aattcatcga agtcccgcgc aaagaactgt tcgacaatgt tacggaatac 540
cagaaagaaa tggcgatcaa cctgtttggc gccgtccgtg tgtctattaa atccccgtca 600cagaaagaaa tggcgatcaa cctgtttggc gccgtccgtg tgtctattaa atccccgtca 600
gttctggtcc tgacccagcc gctgtccatc gataaagaat ttatgtcata caacaacaaa 660gttctggtcc tgacccagcc gctgtccatc gataaagaat ttatgtcata caacaacaaa 660
atcgaaacgt cggaagaaca attcaacttc tacaaaagca tcgtgaacga atacatcaac 720atcgaaacgt cggaagaaca attcaacttc tacaaaagca tcgtgaacga atacatcaac 720
aaaggttaca acgtctacct gaaagtgcat ccgcgtgatg tggttgacta ttctaaactg 780aaaggttaca acgtctacct gaaagtgcat ccgcgtgatg tggttgacta ttctaaactg 780
ccggttgaac tgctgccgag taacgtcccg atggaaatta tcgaactgat gctgaccggc 840ccggttgaac tgctgccgag taacgtcccg atggaaatta tcgaactgat gctgaccggc 840
cgctttgaat gcggtattac ccatagcagc accgccctgg atttcctgac ctgtgtggac 900cgctttgaat gcggtattac ccatagcagc accgccctgg atttcctgac ctgtgtggac 900
aagaaaatta cgctggttga tctgaaagac attaaataa 939aagaaaatta cgctggttga tctgaaagac attaaataa 939
<210> 18<210> 18
<211> 1233<211> 1233
<212> ДНК<212> DNA
<213> Bibersteinia trehalosi<213> Bibersteinia trehalosi
<400> 18<400> 18
atggaattct gcaaaatggc aacgacgcaa aaaatctgtg tctacctgga ctatgctacg 60atggaattct gcaaaatggc aacgacgcaa aaaatctgtg tctacctgga ctatgctacg 60
atcccgagcc tgaactacat cctgcacttt gcgcaacatt tcgaagatca ggaaaccatt 120atcccgagcc tgaactacat cctgcacttt gcgcaacatt tcgaagatca ggaaaccatt 120
cgtctgtttg gcctgtcccg cttccacatt ccggaatcag tcatccagcg ctatccgaaa 180cgtctgtttg gcctgtcccg cttccacatt ccggaatcag tcatccagcg ctatccgaaa 180
ggtgtggttc aattttaccc gaaccaggaa aaagacttca gcgcgctgct gctggccctg 240ggtgtggttc aattttaccc gaaccaggaa aaagacttca gcgcgctgct gctggccctg 240
aaaaacatcc tgatcgaagt taaacagcaa cagcgtaaat gcgaaatcga actgcatctg 300aaaaacatcc tgatcgaagt taaacagcaa cagcgtaaat gcgaaatcga actgcatctg 300
aacctgtttc actatcagct gctgctgctg ccgttcctga gtctgtatct ggatacccag 360aacctgtttc actatcagct gctgctgctg ccgttcctga gtctgtatct ggatacccag 360
gactactgtc atctgacgct gaaattttac gatgacggct ctgaagcgat tagtgccctg 420gactactgtc atctgacgct gaaattttac gatgacggct ctgaagcgat tagtgccctg 420
caggaactgg cactggctcc ggatctggcg gcccaaatcc agtttgaaaa acaacagttc 480caggaactgg cactggctcc ggatctggcg gcccaaatcc agtttgaaaa acaacagttc 480
gacgaactgg tcgtgaaaaa atcgtttaaa ctgtcgctgc tgagccgcta tttttggggt 540gacgaactgg tcgtgaaaaa atcgtttaaa ctgtcgctgc tgagccgcta tttttggggt 540
aaactgttcg aaagcgaata catttggttc aatcaagcaa tcctgcagaa agctgaactg 600aaactgttcg aaagcgaata catttggttc aatcaagcaa tcctgcagaa agctgaactg 600
caaattctga aacaggaaat cagctctagt cgtcagatgg attttgcaat ttatcaacag 660caaattctga aacaggaaat cagctctagt cgtcagatgg attttgcaat ttatcaacag 660
atgtccgacg aacaaaaaca gctggtgctg gaaattctga acatcgatct gaataaagtt 720atgtccgacg aacaaaaaca gctggtgctg gaaattctga acatcgatct gaataaagtt 720
gcttacctga aacaactgat ggaaaaccag ccgtcttttc tgttcctggg caccacgctg 780gcttacctga aacaactgat ggaaaaccag ccgtcttttc tgttcctggg caccacgctg 780
tttaatatta cccaggaaac caaaacgtgg ctgatgcaga tgcatgtgga tctgatccaa 840tttaatatta cccaggaaac caaaacgtgg ctgatgcaga tgcatgtgga tctgatccaa 840
cagtattgcc tgccgagcgg ccagtttttc aacaataaag ccggctatct gtgtttttac 900cagtattgcc tgccgagcgg ccagtttttc aacaataaag ccggctatct gtgtttttac 900
aaaggtcacc cgaacgaaaa agaaatgaac caaatgatcc tgtctcagtt caaaaacctg 960aaaggtcacc cgaacgaaaa agaaatgaac caaatgatcc tgtctcagtt caaaaacctg 960
atcgcgctgc cggatgacat tccgctggaa atcctgctgc tgctgggcgt tattccgagt 1020atcgcgctgc cggatgacat tccgctggaa atcctgctgc tgctgggcgt tattccgagt 1020
aaagtcggcg gttttgcatc ctcagctctg tttaacttca ccccggcgca gatcgaaaat 1080aaagtcggcg gttttgcatc ctcagctctg tttaacttca ccccggcgca gatcgaaaat 1080
attatctttt tcacgccgcg ttatttcgaa aaagataatc gcctgcacgc cacgcaatac 1140attatctttt tcacgccgcg ttatttcgaa aaagataatc gcctgcacgc cacgcaatac 1140
cgtctgatgc agggcctgat tgaactgggt tatctggacg ctgaaaaatc tgtgacccac 1200cgtctgatgc agggcctgat tgaactgggt tatctggacg ctgaaaaatc tgtgacccac 1200
tttgaaatca tgcaactgct gacgaaagaa taa 1233tttgaaatca tgcaactgct gacgaaagaa taa 1233
<210> 19<210> 19
<211> 1221<211> 1221
<212> ДНК<212> DNA
<213> Haemophilus parahaemolyticus<213> Haemophilus parahaemolyticus
<400> 19<400> 19
atgaccgaac agtacatcaa aaacgtggaa gtttacctgg attacgcgac catcccgacg 60atgaccgaac agtacatcaa aaacgtggaa gtttacctgg attacgcgac catcccgacg 60
ctgaactact tctaccattt caccgaaaac aaagatgaca tcgccacgat tcgtctgttt 120ctgaactact tctaccattt caccgaaaac aaagatgaca tcgccacgat tcgtctgttt 120
ggcctgggtc gcttcaacat cagtaaatcc atcatcgaaa gctacccgga aggcattatc 180ggcctgggtc gcttcaacat cagtaaatcc atcatcgaaa gctacccgga aggcattatc 180
cgttactgcc cgattatctt tgaagatcaa accgcatttc agcaactgtt cattaccctg 240cgttactgcc cgattatctt tgaagatcaa accgcatttc agcaactgtt cattaccctg 240
ctgacggaag acagtttttg tcagtatcgc tttaacttcc atattaacct gtttcactcc 300ctgacggaag acagtttttg tcagtatcgc tttaacttcc atattaacct gtttcactcc 300
tggaaaatgc tgatcccgct gctgcatatt atctggcagt ttaaacacaa agtcctggat 360tggaaaatgc tgatcccgct gctgcatatt atctggcagt ttaaacacaa agtcctggat 360
attaaactga acttctatga tgacggcagt gaaggtctgg tgacgctgtc caaaatcgaa 420attaaactga acttctatga tgacggcagt gaaggtctgg tgacgctgtc caaaatcgaa 420
cagaactaca gctctgaaat cctgcaaaaa atcatcgata tcgactcaca gtcgttttat 480cagaactaca gctctgaaat cctgcaaaaa atcatcgata tcgactcaca gtcgttttat 480
gcagataaac tgtctttcct ggatgaagac attgctcgtt acctgtggaa cagtctgttt 540gcagataaac tgtctttcct ggatgaagac attgctcgtt acctgtggaa cagtctgttt 540
gaatcccatt attacctgct gaacgacttc ctgctgaaaa acgaaaaact gtcactgctg 600gaatcccatt attacctgct gaacgacttc ctgctgaaaa acgaaaaact gtcactgctg 600
aaaaactcga tcaaatactg ccacatcatg gatctggaac gctacctgca gtttacccaa 660aaaaactcga tcaaatactg ccacatcatg gatctggaac gctacctgca gtttacccaa 660
gaagaaaaag actttttcaa cgaactgctg ggcatcaaca tccagagtct ggaagataaa 720gaagaaaaag actttttcaa cgaactgctg ggcatcaaca tccagagtct ggaagataaa 720
atcaaaatct tccagcagaa gaaaaccttt attttcacgg gtaccacgat cttcagcctg 780atcaaaatct tccagcagaa gaaaaccttt attttcacgg gtaccacgat cttcagcctg 780
ccgaaagaag aagaagaaac cctgtatcgt ctgcatctga acgcaatcct gaattatatt 840ccgaaagaag aagaagaaac cctgtatcgt ctgcatctga acgcaatcct gaattatatt 840
cacccgaacg gcaaatactt tattggcgat ggtttcacgc tggttatcaa aggtcatccg 900cacccgaacg gcaaatactt tattggcgat ggtttcacgc tggttatcaa aggtcatccg 900
caccagaaag aaatgaacag ccgcctggaa aaatcttttg aaaaagctgt catgctgccg 960caccagaaag aaatgaacag ccgcctggaa aaatcttttg aaaaagctgt catgctgccg 960
gataatatcc cgttcgaaat tctgtatctg atcggctgca aaccggacaa aattggcggt 1020gataatatcc cgttcgaaat tctgtatctg atcggctgca aaccggacaa aattggcggt 1020
tttgtgagca cctcttactt cagctgtgat aagaaaaaca ttgcggacct gctgtttatc 1080tttgtgagca cctcttactt cagctgtgat aagaaaaaca ttgcggacct gctgtttatc 1080
tctgcccgtc aagaagaagt tcgcaaaaac gattacctgt ttaacatcca gtaccaactg 1140tctgcccgtc aagaagaagt tcgcaaaaac gattacctgt ttaacatcca gtaccaactg 1140
cgtgacatga tgattaaaac cggttttatc caggaagaaa aaacgcactt ctactcagat 1200cgtgacatga tgattaaaac cggttttatc caggaagaaa aaacgcactt ctactcagat 1200
atcccgatct tcatctcgta a 1221atcccgatct tcatctcgta a 1221
<210> 20<210> 20
<211> 903<211> 903
<212> ДНК<212> DNA
<213> Haemophilus somnus<213> Haemophilus somnus
<400> 20<400> 20
atgaaatata acatcaaaat taaagctatc gtcatcgtgt cgagcctgcg tatgctgctg 60atgaaatata acatcaaaat taaagctatc gtcatcgtgt cgagcctgcg tatgctgctg 60
atcttcctga tgctgaataa ataccacctg gatgaagttc tgtttgtctt caacgaaggc 120atcttcctga tgctgaataa ataccacctg gatgaagttc tgtttgtctt caacgaaggc 120
ttcgaactgc ataaaaaata caaaatcaaa cactatgtgg cgattaaaaa gaaaattacc 180ttcgaactgc ataaaaaata caaaatcaaa cactatgtgg cgattaaaaa gaaaattacc 180
aaattctggc gtctgtacta caaactgtac ttctaccgtt tcaaaattga ccgcatcccg 240aaattctggc gtctgtacta caaactgtac ttctaccgtt tcaaaattga ccgcatcccg 240
gtttatggcg cagatcatct gggttggacc gactattttc tgaaatactt cgatttctac 300gtttatggcg cagatcatct gggttggacc gactattttc tgaaatactt cgatttctac 300
ctgattgaag acggcatcgc taacttctcc ccgaaacgtt acgaaattaa cctgacgcgc 360ctgattgaag acggcatcgc taacttctcc ccgaaacgtt acgaaattaa cctgacgcgc 360
aatatcccgg tctttggttt ccataaaacc gtgaagaaaa tttacctgac gagtctggaa 420aatatcccgg tctttggttt ccataaaacc gtgaagaaaa tttacctgac gagtctggaa 420
aatgttccgt ccgatattcg tcataaagtc gaactgatca gcctggaaca cctgtggaaa 480aatgttccgt ccgatattcg tcataaagtc gaactgatca gcctggaaca cctgtggaaa 480
acccgcacgg cgcaggaaca acacaacatc ctggatttct ttgcctttaa tctggacagc 540acccgcacgg cgcaggaaca acacaacatc ctggatttct ttgcctttaa tctggacagc 540
ctgatctctc tgaaaatgaa aaaatacatc ctgttcaccc agtgcctgtc agaagatcgc 600ctgatctctc tgaaaatgaa aaaatacatc ctgttcaccc agtgcctgtc agaagatcgc 600
gtcatttcgg aacaggaaaa aatcgcgatc taccaacata tcatcaaaaa ctacgatgaa 660gtcatttcgg aacaggaaaa aatcgcgatc taccaacata tcatcaaaaa ctacgatgaa 660
cgtctgctgg ttatcaaacc gcacccgcgc gaaaccacgg actatcagaa atactttgaa 720cgtctgctgg ttatcaaacc gcacccgcgc gaaaccacgg actatcagaa atactttgaa 720
aatgtcttcg tgtaccaaga tgtggttccg agcgaactgt ttgaactgct ggacgtgaac 780aatgtcttcg tgtaccaaga tgtggttccg agcgaactgt ttgaactgct ggacgtgaac 780
ttcgaacgtg ttattaccct gttttctacg gccgtgttca aatatgatcg caatatcgtt 840ttcgaacgtg ttattaccct gttttctacg gccgtgttca aatatgatcg caatatcgtt 840
gacttctacg gtacgcgcat ccacgacaaa atctatcaat ggttcggcga catcaaattc 900gacttctacg gtacgcgcat ccacgacaaa atctatcaat ggttcggcga catcaaattc 900
taa 903taa 903
<210> 21<210> 21
<211> 1146<211> 1146
<212> ДНК<212> DNA
<213> Vibrio harveyi<213> Vibrio harveyi
<400> 21<400> 21
atggattctt cgccggaaaa caccagctct acgctggaaa tttacatcga ttcagcaacc 60atggattctt cgccggaaaa caccagctct acgctggaaa tttacatcga ttcagcaacc 60
ctgccgtcgc tgcagcacat ggtgaaaatt atcgacgaac aaagtggcaa caaaaaactg 120ctgccgtcgc tgcagcacat ggtgaaaatt atcgacgaac aaagtggcaa caaaaaactg 120
atcaactgga aacgttatcc gatcgatgac gaactgctgc tggataaaat caacgctctg 180atcaactgga aacgttatcc gatcgatgac gaactgctgc tggataaaat caacgctctg 180
agcttttctg ataccacgga cctgacccgt tatatggaaa gtattctgct gatcggcgat 240agcttttctg ataccacgga cctgacccgt tatatggaaa gtattctgct gatcggcgat 240
attaaacgcg tggttattaa cggtaatagt ctgtccaact acaatattgt cggcgtgatg 300attaaacgcg tggttattaa cggtaatagt ctgtccaact acaatattgt cggcgtgatg 300
cgctccatca acgccctggg tctggatctg gacgttgaaa tcaattttta tgatgacggt 360cgctccatca acgccctggg tctggatctg gacgttgaaa tcaattttta tgatgacggt 360
tcagcagaat atgtccgtct gtacaacttc tcgcagctgc cggaagctga acgcgaactg 420tcagcagaat atgtccgtct gtacaacttc tcgcagctgc cggaagctga acgcgaactg 420
ctggtgtcaa tgtcgaaaaa caatattctg gcggccgtta acggcatcgg ttcttatgat 480ctggtgtcaa tgtcgaaaaa caatattctg gcggccgtta acggcatcgg ttcttatgat 480
agcggctctc cggaaaatat ttacggtttt gcgcagattt atccggccac ctaccacatg 540agcggctctc cggaaaatat ttacggtttt gcgcagattt atccggccac ctaccacatg 540
ctgcgtgcgg acattttcga tacggacctg gaaatcggcc tgattcgcga tatcctgggt 600ctgcgtgcgg acattttcga tacggacctg gaaatcggcc tgattcgcga tatcctgggt 600
gacaacgtca aacagatgaa atggggccaa tttctgggtt tcaacgaaga acagaaagaa 660gacaacgtca aacagatgaa atggggccaa tttctgggtt tcaacgaaga acagaaagaa 660
ctgttttatc aactgaccag cttcaacccg gataaaatcc aggcgcaata caaagaatct 720ctgttttatc aactgaccag cttcaacccg gataaaatcc aggcgcaata caaagaatct 720
ccgaacaaaa acttcgtttt cgtcggcacc aacagtcgtt ccgcaacggc tgaacagcaa 780ccgaacaaaa acttcgtttt cgtcggcacc aacagtcgtt ccgcaacggc tgaacagcaa 780
atcaacatca tcaaagaagc caaaaaactg gatagcgaaa ttatcccgaa cagcatcgat 840atcaacatca tcaaagaagc caaaaaactg gatagcgaaa ttatcccgaa cagcatcgat 840
ggctatgacc tgtttttcaa aggtcatccg agcgcgacct acaaccagca aattgttgat 900ggctatgacc tgtttttcaa aggtcatccg agcgcgacct acaaccagca aattgttgat 900
gcccacgaca tgaccgaaat ctataatcgc acgccgtttg aagtcctggc aatgacgagt 960gcccacgaca tgaccgaaat ctataatcgc acgccgtttg aagtcctggc aatgacgagt 960
tccctgccgg atgctgtggg cggtatgggc tcatcgctgt ttttctcact gccgaaaacc 1020tccctgccgg atgctgtggg cggtatgggc tcatcgctgt ttttctcact gccgaaaacc 1020
gtggaaacga aattcatttt ctataaaagt ggcaccgata ttgaatccaa tgcgctgatc 1080gtggaaacga aattcatttt ctataaaagt ggcaccgata ttgaatccaa tgcgctgatc 1080
caggttatgc tgaaactggg tatcattacg gacgaaaaag tgcgctttac gacggacatc 1140caggttatgc tgaaactggg tatcattacg gacgaaaaag tgcgctttac gacggacatc 1140
aaataa 1146aaataa 1146
<210> 22<210> 22
<211> 1452<211> 1452
<212> ДНК<212> DNA
<213> Alistipes sp.<213> Alistipes sp.
<400> 22<400> 22
atggccagct gttctgatga cgataaagaa cagacgggtt ttcaaatcga cgatggctct 60atggccagct gttctgatga cgataaagaa cagacggggtt ttcaaatcga cgatggctct 60
ggtttcctga gtctggatgc agctgcgcgt agtggctcca ttgccatcac cgcaaacaat 120ggtttcctga gtctggatgc agctgcgcgt agtggctcca ttgccatcac cgcaaacaat 120
tcatggtcgg tgacgcagga taaagacagc gaatggctga ccctgagcac cacgtctggt 180tcatggtcgg tgacgcagga taaagacagc gaatggctga ccctgagcac cacgtctggt 180
gcagcaggtc gtaccgaaat tggtatcatg ctggaagcga acccgggcga agctcgtaat 240gcagcaggtc gtaccgaaat tggtatcatg ctggaagcga acccgggcga agctcgtaat 240
gcgggtctga cctttaactc tggcggtcgc acgtatccgt tcgtgattac ccagagtgcc 300gcgggtctga cctttaactc tggcggtcgc acgtatccgt tcgtgattac ccagagtgcc 300
catgttacgg cagattttga cgatgctgac cactgctttt atatcacctt tggtaccctg 360catgttacgg cagattttga cgatgctgac cactgctttt atatcacctt tggtaccctg 360
ccgaccctgt atgcaggtct gcatgtgctg tcccacgata aaccgtcata tgtgtttttc 420ccgaccctgt atgcaggtct gcatgtgctg tcccacgata aaccgtcata tgtgtttttc 420
cagcgttccc aaacctttcg cccggaagaa ttcccggccc atgcagaagt tacgattgct 480cagcgttccc aaacctttcg cccggaagaa ttcccggccc atgcagaagt tacgattgct 480
gcggatccgt cagctaatgc gaccgatgaa gacatggaac gtatgcgcac ggccatgaaa 540gcggatccgt cagctaatgc gaccgatgaa gacatggaac gtatgcgcac ggccatgaaa 540
cagcaaattc tgaaaatcaa cgttgaagat ccgaccgcag tttttggcct gtatgtcgac 600cagcaaattc tgaaaatcaa cgttgaagat ccgaccgcag tttttggcct gtatgtcgac 600
gatctgcgtt gtggcattgg ttacgattgg ttcgtcgccc agggtatcga cagtacccgc 660gatctgcgtt gtggcattgg ttacgattgg ttcgtcgccc agggtatcga cagtacccgc 660
gtgaaagtta gtatgctgtc cgatggcacc ggcacgtaca acaacttcta caactacttc 720gtgaaagtta gtatgctgtc cgatggcacc ggcacgtaca acaacttcta caactacttc 720
ggcgatccgg ccaccgcaga acaaaactgg gaaaattacg ccgcacaggt ggaagcgctg 780ggcgatccgg ccaccgcaga acaaaactgg gaaaattacg ccgcacaggt ggaagcgctg 780
gattggcaac acggcggtcg ttttccggaa acccgcatgc cggatggttt tgacttctat 840gattggcaac acggcggtcg ttttccggaa acccgcatgc cggatggttt tgacttctat 840
gaatggccgt attacctggc aacgcgtccg aactaccgcc tggttctgca ggacgatgac 900gaatggccgt attacctggc aacgcgtccg aactaccgcc tggttctgca ggacgatgac 900
ctgctggaag cgacgtctcc gtttatgacc gaacgtctgc agcaaatgcg caccgaatcg 960ctgctggaag cgacgtctcc gtttatgacc gaacgtctgc agcaaatgcg caccgaatcg 960
aaacagccgt atgaactgct ggccagcctg ccggctgaag cccgtcaacg ctttttccgt 1020aaacagccgt atgaactgct ggccagcctg ccggctgaag cccgtcaacg ctttttccgt 1020
atggctggct ttgattacga cgcgtttgct gcgctgttcg atgccagccc gaagaaaaac 1080atggctggct ttgattacga cgcgtttgct gcgctgttcg atgccagccc gaagaaaaac 1080
ctggtcatta tcggcacgtc acatacctcg gaagaaagcg aagcacagca agccgcatat 1140ctggtcatta tcggcacgtc acatacctcg gaagaaagcg aagcacagca agccgcatat 1140
gtggaacgta ttatcggcga ttatggtacc gcctacgaca ttttctttaa accgcacccg 1200gtggaacgta ttatcggcga ttatggtacc gcctacgaca ttttctttaa accgcacccg 1200
gcagatagct ctagttccaa ctacgaagaa cgctttgaag gtctgaccct gctgccgggt 1260gcagatagct ctagttccaa ctacgaagaa cgctttgaag gtctgaccct gctgccgggt 1260
cagatgccgt ttgaaatttt cgtctggtcg ctgctggata aagtggacct gatcggcggt 1320cagatgccgt ttgaaatttt cgtctggtcg ctgctggata aagtggacct gatcggcggt 1320
tattcatcga cggtgtttct gaccgtcccg gtggaaaaaa ccggctttat tttcgctgcg 1380tattcatcga cggtgtttct gaccgtcccg gtggaaaaaa ccggctttat tttcgctgcg 1380
aatgctgaaa gcctgccgcg cccgctgaac gttctgttcc gtaatgcgga acatgtccgc 1440aatgctgaaa gcctgccgcg cccgctgaac gttctgttcc gtaatgcgga acatgtccgc 1440
tggatccagt aa 1452tggatccagt aa 1452
<210> 23<210> 23
<211> 1452<211> 1452
<212> ДНК<212> DNA
<213> Alistipes shahii<213> Alistipes shahii
<400> 23<400> 23
atggacgatg gcaccccgag tgtcagcatc aacggcggca ccgacttcct gagcctggac 60atggacgatg gcaccccgag tgtcagcatc aacggcggca ccgacttcct gagcctggac 60
cacctggcac gcagcggcaa aatcacggtc aacgcaccgg ctccgtggtc tgtgaccctg 120cacctggcac gcagcggcaa aatcacggtc aacgcaccgg ctccgtggtc tgtgaccctg 120
gccccggaaa attacggcca ggatgaaaaa ccggactggc tgaccctgag cgccgaagaa 180gccccggaaa attacggcca ggatgaaaaa ccggactggc tgaccctgag cgccgaagaa 180
ggcccggcag gttatagcga aatcgatgtt acctttgcgg aaaacccggg tccggcccgt 240ggcccggcag gttatagcga aatcgatgtt acctttgcgg aaaacccggg tccggcccgt 240
tccgcatcac tgctgttcag ctgcgatggt aaaaccctgg cctttacggt ttcgcagagc 300tccgcatcac tgctgttcag ctgcgatggt aaaaccctgg cctttacggt ttcgcagagc 300
gcaggcggta cgggtttcga tgctccggac tattactttt atatttcggt cggcaccatg 360gcaggcggta cgggtttcga tgctccggac tattactttt atatttcggt cggcaccatg 360
ccgacgctgt actcgggtct gcatctgctg agccacgata aaccgtctta tgttagttac 420ccgacgctgt actcgggtct gcatctgctg agccacgata aaccgtctta tgttagttac 420
gaacgtgcga gcacctttga tgcggccgaa ttcccggacc gcgcgtttgt ctatccggtg 480gaacgtgcga gcacctttga tgcggccgaa ttcccggacc gcgcgtttgt ctatccggtg 480
gccgatccga ccggtcatgc aaccaacgaa gaactgcgtg cgatgagcga agccatgaaa 540gccgatccga ccggtcatgc aaccaacgaa gaactgcgtg cgatgagcga agccatgaaa 540
cgtcgcatcc tggaaattaa tgcagaagat ccgaccgctg ttttcggtct gtgggtcgat 600cgtcgcatcc tggaaattaa tgcagaagat ccgaccgctg ttttcggtct gtgggtcgat 600
gacctgcgtt gccgcctggg ctacgattgg tttgtggctc aaggtatcga ctctgcgcgc 660gacctgcgtt gccgcctggg ctacgattgg tttgtggctc aaggtatcga ctctgcgcgc 660
gtgaaagtta cgatgctgag tgatggcacc gcgacgtata acaattttca taactacttc 720gtgaaagtta cgatgctgag tgatggcacc gcgacgtata acaattttca taactacttc 720
ggtgacgcag ctaccgccga acagaactgg aatgattatg cggccgaagt tgaagcactg 780ggtgacgcag ctaccgccga acagaactgg aatgattatg cggccgaagt tgaagcactg 780
gactggaatc atggcggtcg ttatccggaa acccgtgccc cggaagaatt cgcctcctac 840gactggaatc atggcggtcg ttatccggaa acccgtgccc cggaagaatt cgcctcctac 840
acctggccgt attacctgtc aacgcgtccg gattatcgcc tgatgctgca aaacagctct 900acctggccgt attacctgtc aacgcgtccg gattatcgcc tgatgctgca aaacagctct 900
ctgatggaaa gttcctgtcc gtttatcgca gatcgcctgg cagctatgaa aatggaatcc 960ctgatggaaa gttcctgtcc gtttatcgca gatcgcctgg cagctatgaa aatggaatcc 960
gtgcagccgt atgaactgct gacggcactg ccggaagctt caaaacagca attctatcgt 1020gtgcagccgt atgaactgct gacggcactg ccggaagctt caaaacagca attctatcgt 1020
atggccaaat ttgattacgc acgctttgct ggcctgttcg acctgtctcc gaagaaaaac 1080atggccaaat ttgattacgc acgctttgct ggcctgttcg acctgtctcc gaagaaaaac 1080
ctgattatca ttggtacctc tcattcatcg gcggccagtg aacagcaaca ggcagcttac 1140ctgattatca ttggtacctc tcattcatcg gcggccagtg aacagcaaca ggcagcttac 1140
gtcgaacgta tcattcaaca gtatggcagt gattacgaca ttttctttaa accgcacccg 1200gtcgaacgta tcattcaaca gtatggcagt gattacgaca ttttctttaa accgcacccg 1200
gcagatagct ctagtgctgg ttatccggac cgctttgaag gtctgaccct gctgccgggt 1260gcagatagct ctagtgctgg ttatccggac cgctttgaag gtctgaccct gctgccgggt 1260
cagatgccgt ttgaaatctt cgtttgggcg ctgctggata aaatcgacat gattggcggt 1320cagatgccgt ttgaaatctt cgtttgggcg ctgctggata aaatcgacat gattggcggt 1320
tatccgtcca ccacgtttat ttcagtgccg ctggataaag ttggctttct gttcgcggcc 1380tatccgtcca ccacgtttat ttcagtgccg ctggataaag ttggctttct gttcgcggcc 1380
gatgccgacg gtctggtccg cccgctgaat atcctgttcc gtgacgctgc aaatgtcgaa 1440gatgccgacg gtctggtccg cccgctgaat atcctgttcc gtgacgctgc aaatgtcgaa 1440
tggattcaat aa 1452tggattcaat aa 1452
<210> 24<210> 24
<211> 1206<211> 1206
<212> ДНК<212> DNA
<213> Actinobacillus suis<213> Actinobacillus suis
<400> 24<400> 24
atggaacgca cgccgcaact gcaagcggtg gacatttaca ttgacttcgc aacgatcccg 60atggaacgca cgccgcaact gcaagcggtg gacatttaca ttgacttcgc aacgatcccg 60
agcctgagct actttctgca ctttctgaaa cataaacacg atgatcagcg tctgcgtctg 120agcctgagct actttctgca ctttctgaaa cataaacacg atgatcagcg tctgcgtctg 120
ttcagcctgg cccgttttga aatgccgcaa accctgattg aacagtatga aggcattatc 180ttcagcctgg cccgttttga aatgccgcaa accctgattg aacagtatga aggcattatc 180
cagttctcgc gcaacgtgga acataatgtt gaaccgctgc tggaacagct gcaaacgatc 240cagttctcgc gcaacgtgga acataatgtt gaaccgctgc tggaacagct gcaaacgatc 240
ctgtcacaag aaggtaaaca gtttgaactg catctgcacc tgaacctgtt tcattcgttc 300ctgtcacaag aaggtaaaca gtttgaactg catctgcacc tgaacctgtt tcattcgttc 300
gaaatgtttc tgaatctgag cccgacctac acgcagtaca aagaaaaaat ctctaaaatc 360gaaatgtttc tgaatctgag cccgacctac acgcagtaca aagaaaaaat ctctaaaatc 360
gttctgcacc tgtatgatga cggcagtgaa ggtgtcatga aacagtacca actgcagaaa 420gttctgcacc tgtatgatga cggcagtgaa ggtgtcatga aacagtacca actgcagaaa 420
agctctagtc tggtgcagga tctggcggcc accaaagcat ctctggttag cctgttcgaa 480agctctagtc tggtgcagga tctggcggcc accaaagcat ctctggttag cctgttcgaa 480
aacggcgaag gttcgtttag ccagattgat ctgatccgtt atgtctggaa tgctgtgctg 540aacggcgaag gttcgtttag ccagattgat ctgatccgtt atgtctggaa tgctgtgctg 540
gaaacccatt attacctgct gtctgatcac tttctgctgg acgaaaaact gcagccgctg 600gaaacccatt attacctgct gtctgatcac tttctgctgg acgaaaaact gcagccgctg 600
aaagcagaac tgggccatta ccaactgctg aacctgagtg cttatcagta cctgtcctca 660aaagcagaac tgggccatta ccaactgctg aacctgagtg cttatcagta cctgtcctca 660
gaagatctgc tgtggctgaa acagattctg aaaatcgaca ccgaactgga aagcctgatg 720gaagatctgc tgtggctgaa acagattctg aaaatcgaca ccgaactgga aagcctgatg 720
caaaaactga cggcgcagcc ggtgtatttc tttagcggta ccacgttttt caacatcagt 780caaaaactga cggcgcagcc ggtgtatttc tttagcggta ccacgttttt caacatcagt 780
ttcgaagata aacaacgtct ggcgaatatc catgccattc tgatccgcga acacctggac 840ttcgaagata aacaacgtct ggcgaatatc catgccattc tgatccgcga acacctggac 840
ccgaactccc agctgtttat tggcgaaccg tacctgtttg tcttcaaagg tcatccgaac 900ccgaactccc agctgtttat tggcgaaccg tacctgtttg tcttcaaagg tcatccgaac 900
tcaccggaaa ttaatcaggc cctgcgtgaa tattacccga acgttatctt cctgccggaa 960tcaccggaaa ttaatcaggc cctgcgtgaa tattacccga acgttatctt cctgccggaa 960
aatattccgt ttgaaatcct gaccctgctg ggcttctccc cgcaaaaaat tggcggtttt 1020aatattccgt ttgaaatcct gaccctgctg ggcttctccc cgcaaaaaat tggcggtttt 1020
gcgtcaacga tccacgttaa ttccgaacag tcaaaactgg ccaaactgtt tttcctgacc 1080gcgtcaacga tccacgttaa ttccgaacag tcaaaactgg ccaaactgtt tttcctgacc 1080
tcgacggatg aacaagaacg ccagctgagc gacggttata ttaaacaata cgcactggct 1140tcgacggatg aacaagaacg ccagctgagc gacggttata ttaaacaata cgcactggct 1140
caggctatgc tggaaatgca actggtctcg caagaacaag tctattactg ctcgctgtcg 1200caggctatgc tggaaatgca actggtctcg caagaacaag tctattactg ctcgctgtcg 1200
tcgtaa 1206tcgtaa 1206
<210> 25<210> 25
<211> 1206<211> 1206
<212> ДНК<212> DNA
<213> Actinobacillus capsulatus<213> Actinobacillus capsulatus
<400> 25<400> 25
atggaacgca tcccgcaact gcaagctgtc gatatttaca ttgacttcgc cacgatcccg 60atggaacgca tcccgcaact gcaagctgtc gatatttaca ttgacttcgc cacgatcccg 60
agcctgtcct actttctgca ctttctgaaa cataaacacg atcatcagcg tctgcgcctg 120agcctgtcct actttctgca ctttctgaaa cataaacacg atcatcagcg tctgcgcctg 120
ttcagcctgg cgcgttttga aatgccgcag accgtcattg aacaatatga aggcattatc 180ttcagcctgg cgcgttttga aatgccgcag accgtcattg aacaatatga aggcattatc 180
cagttctcac gcaacgtgga acacaatgtt gaacaactgc tggaacagct gcaaacgatc 240cagttctcac gcaacgtgga acacaatgtt gaacaactgc tggaacagct gcaaacgatc 240
ctgtcgcagg aaggtaaaca atttgaactg cacctgcatc tgaacctgtt tcacagtttc 300ctgtcgcagg aaggtaaaca atttgaactg cacctgcatc tgaacctgtt tcacagtttc 300
gaaatgtttc tgaatctgtc cccgacctac acgaaataca aagaaaaaat ctcaaaaatc 360gaaatgtttc tgaatctgtc cccgacctac acgaaataca aagaaaaaat ctcaaaaatc 360
gttctgcatc tgtatgatga cggctcggaa ggtgtcatga aacagtacca actgcagcaa 420gttctgcatc tgtatgatga cggctcggaa ggtgtcatga aacagtacca actgcagcaa 420
agtaactccc tggcacagga tctggctagc accaaagcgt cactggtttc gctgttcaaa 480agtaactccc tggcacagga tctggctagc accaaagcgt cactggtttc gctgttcaaa 480
aacggcgaag gtgccttttc tcagattgat ctgatccgtt atgtctggaa tgcagtgctg 540aacggcgaag gtgccttttc tcagattgat ctgatccgtt atgtctggaa tgcagtgctg 540
gaaacccact attacctgct gtcagaccac tttctggccc atgaaaaact gcagccgctg 600gaaacccact attacctgct gtcagaccac tttctggccc atgaaaaact gcagccgctg 600
aaaattgaac tgggccatta ccagctgctg aatctgtctg cctatcaata cctgagctct 660aaaattgaac tgggccatta ccagctgctg aatctgtctg cctatcaata cctgagctct 660
gaagatctgc tgtggctgaa acaaattctg aaaatcgacg cagaactgga aagtctgatg 720gaagatctgc tgtggctgaa acaaattctg aaaatcgacg cagaactgga aagtctgatg 720
cataaactga ccacgcagcc ggtgtatttc tttagcggta ccacgttttt caacatttcg 780cataaactga ccacgcagcc ggtgtatttc tttagcggta ccacgttttt caacatttcg 780
ttcgaagata aacagcgtct ggccaatatc cacgcaattc tgatccgcga acatctggac 840ttcgaagata aacagcgtct ggccaatatc cacgcaattc tgatccgcga acatctggac 840
ccgaacagtc agctgtttat cggcgaaccg tacctgtttg ttttcaaagg tcacccgaac 900ccgaacagtc agctgtttat cggcgaaccg tacctgtttg ttttcaaagg tcacccgaac 900
tccccggaaa ttaatcaggc tctgcgcgaa tattacccga acgcgatctt cctgccggaa 960tccccggaaa ttaatcaggc tctgcgcgaa tattacccga acgcgatctt cctgccggaa 960
aatattccgt ttgaaatcct gaccctgctg ggcttcagcc cgcagaaaat tggcggtttt 1020aatattccgt ttgaaatcct gaccctgctg ggcttcagcc cgcagaaaat tggcggtttt 1020
gcttctacga tccatgtgaa cagcgaacaa tctaaactgg cgaaactgtt tttcctgacc 1080gcttctacga tccatgtgaa cagcgaacaa tctaaactgg cgaaactgtt tttcctgacc 1080
agtacggatg aacaggaacg taatcgctcc gacggttata ttaaacagta cgcgctggcc 1140agtacggatg aacaggaacg taatcgctcc gacggttata ttaaacagta cgcgctggcc 1140
caagcaatgc tggaaatgca actggtctcg caagaacaag tctactactg ctcgctgtcg 1200caagcaatgc tggaaatgca actggtctcg caagaacaag tctactactg ctcgctgtcg 1200
tcgtaa 1206tcgtaa 1206
<210> 26<210> 26
<211> 936<211> 936
<212> ДНК<212> DNA
<213> Haemophilus somnus<213> Haemophilus somnus
<400> 26<400> 26
atgttccgtg aagacaatat gaacctgatt atctgctgta cgccgctgca agtgattatc 60atgttccgtg aagacaatat gaacctgatt atctgctgta cgccgctgca agtgattatc 60
gccgaaaaaa ttatcgaacg ctatccggaa cagaaatttt atggcgttat gctggaatca 120gccgaaaaaa ttatcgaacg ctatccggaa cagaaatttt atggcgttat gctggaatca 120
ttctacaacg ataaattcga cttctacgaa aacaaactga aacatctgtg ccacgaattt 180ttctacaacg ataaattcga cttctacgaa aacaaactga aacatctgtg ccacgaattt 180
ttctgtatca aaatcgcacg tttcaaactg gaacgctata aaaacctgct gtcactgctg 240ttctgtatca aaatcgcacg tttcaaactg gaacgctata aaaacctgct gtcactgctg 240
aaaatcaaaa acaaaacctt cgatcgtgtc ttcctggcta acatcgaaaa acgctacatc 300aaaatcaaaa acaaaacctt cgatcgtgtc ttcctggcta acatcgaaaa acgctacatc 300
catatcatcc tgtcgaacat tttctttaaa gaactgtaca ccttcgatga cggcacggcg 360catatcatcc tgtcgaacat tttctttaaa gaactgtaca ccttcgatga cggcacggcg 360
aacatcgccc cgaatagtca tctgtatcaa gaatacgatc actccctgaa aaaacgtatt 420aacatcgccc cgaatagtca tctgtatcaa gaatacgatc actccctgaa aaaacgtatt 420
accgacatcc tgctgccgaa ccattacaac agcaacaaag tgaaaaacat cagcaaactg 480accgacatcc tgctgccgaa ccattacaac agcaacaaag tgaaaaacat cagcaaactg 480
cactactcta tctaccgctg caaaaacaac atcatcgata acatcgaata catgccgctg 540cactactcta tctaccgctg caaaaacaac atcatcgata acatcgaata catgccgctg 540
tttaacctgg agaaaaaata cacggcacag gataaaagta tttccatcct gctgggtcaa 600tttaacctgg agaaaaaata cacggcacag gataaaagta tttccatcct gctgggtcaa 600
ccgattttct atgacgaaga gaaaaacatt cgtctgatca aagaagtcat cgccaaattc 660ccgattttct atgacgaaga gaaaaacatt cgtctgatca aagaagtcat cgccaaattc 660
aaaatcgatt actacttccc gcacccgcgc gaagattact acatcgacaa cgtgtcttac 720aaaatcgatt actacttccc gcacccgcgc gaagattact acatcgacaa cgtgtcttac 720
atcaaaaccc cgctgatctt tgaagaattt tacgcggaac gttcaatcga aaattcgatc 780atcaaaaccc cgctgatctt tgaagaattt tacgcggaac gttcaatcga aaattcgatc 780
aaaatctata cctttttcag ctctgccgtg ctgaacatcg ttacgaaaga aaatattgat 840aaaatctata cctttttcag ctctgccgtg ctgaacatcg ttacgaaaga aaatattgat 840
cgcatctacg cactgaaacc gaaactgacg gaaaaagcgt atctggattg ttacgacatc 900cgcatctacg cactgaaacc gaaactgacg gaaaaagcgt atctggattg ttacgacatc 900
ctgaaagatt tcggtatcaa agttatcgac atctaa 936ctgaaagatt tcggtatcaa agttatcgac atctaa 936
<210> 27<210> 27
<211> 1200<211> 1200
<212> ДНК<212> DNA
<213> Haemophilus ducreyi<213> Haemophilus ducreyi
<400> 27<400> 27
atgctgattc aacagaacct ggaaatctac ctggactacg caaccatccc gagcctggcc 60atgctgattc aacagaacct ggaaatctac ctggactacg caaccatccc gagcctggcc 60
tgctttatgc acttcattca acacaaagat gacgtcgata gtattcgtct gtttggcctg 120tgctttatgc acttcattca acacaaagat gacgtcgata gtattcgtct gtttggcctg 120
gcacgcttcg atatcccgca gtccattatc gaccgttacc cggctaacca cctgttttat 180gcacgcttcg atatcccgca gtccattatc gaccgttacc cggctaacca cctgttttat 180
cacaacatcg ataatcgcga cctgaccgca gtgctgaacc agctggcgga tattctggcc 240cacaacatcg ataatcgcga cctgaccgca gtgctgaacc agctggcgga tattctggcc 240
caggaaaata aacgttttca aatcaacctg catctgaacc tgtttcacag cattgacctg 300caggaaaata aacgttttca aatcaacctg catctgaacc tgtttcacag cattgacctg 300
tttttcgcta tttatccgat ctaccagcaa tatcagcata aaatttctac catccagctg 360tttttcgcta tttatccgat ctaccagcaa tatcagcata aaatttctac catccagctg 360
caactgtacg atgacggcag cgaaggtatt gttacgcagc attctctgtg caaaattgcg 420caactgtacg atgacggcag cgaaggtatt gttacgcagc attctctgtg caaaattgcg 420
gatctggaac agctgatcct gcaacacaaa aacgtgctgc tggaactgct gaccaaaggc 480gatctggaac agctgatcct gcaacacaaa aacgtgctgc tggaactgct gaccaaaggc 480
acggccaacg ttccgaatcc gaccctgctg cgttatctgt ggaacaatat tatcgattca 540acggccaacg ttccgaatcc gaccctgctg cgttatctgt ggaacaatat tatcgattca 540
cagtttcatc tgatctcgga ccattttctg caacacccga aactgcaacc gctgaaacgt 600cagtttcatc tgatctcgga ccattttctg caacacccga aactgcaacc gctgaaacgt 600
ctgctgaaac gctacaccat tctggatttt acgtgttatc cgcgcttcaa tgccgaacag 660ctgctgaaac gctacaccat tctggatttt acgtgttatc cgcgcttcaa tgccgaacag 660
aaacaactgc tgaaagaaat tctgcatatc tcaaacgaac tggaaaatct gctgaaactg 720aaacaactgc tgaaagaaat tctgcatatc tcaaacgaac tggaaaatct gctgaaactg 720
ctgaaacagc acaacacctt tctgttcacg ggcaccacgg cgtttaatct ggatcaggaa 780ctgaaacagc acaacacctt tctgttcacg ggcaccacgg cgtttaatct ggatcaggaa 780
aaactggacc tgctgaccca actgcatatc ctgctgctga acgaacacca gaatccgcat 840aaactggacc tgctgaccca actngcatatc ctgctgctga acgaacacca gaatccgcat 840
tcaacgcact acattggcaa caattatctg ctgctgatca aaggtcatgc aaactcgccg 900tcaacgcact acattggcaa caattatctg ctgctgatca aaggtcatgc aaactcgccg 900
gctctgaatc ataccctggc gctgcacttt ccggatgcga ttttcctgcc ggccaatatt 960gctctgaatc ataccctggc gctgcacttt ccggatgcga ttttcctgcc ggccaatatt 960
ccgtttgaaa tcttcgcgat gctgggcttt acgccgaaca aaatgggcgg tttcgccagc 1020ccgtttgaaa tcttcgcgat gctgggcttt acgccgaaca aaatgggcgg tttcgccagc 1020
acctcttaca ttaattatcc gacggaaaac atcaatcacc tgtttttcct gaccagtgat 1080acctcttaca ttaattatcc gacggaaaac atcaatcacc tgtttttcct gaccagtgat 1080
cagccgtcca ttcgcacgaa atggctggac tacgaaaaac aatttggtct gatgtattcc 1140cagccgtcca ttcgcacgaa atggctggac tacgaaaaac aatttggtct gatgtattcc 1140
ctgctggcaa tgcagaaaat caacgaagat caggcgttta tgtgcaccat tcacaattaa 1200ctgctggcaa tgcagaaaat caacgaagat caggcgttta tgtgcaccat tcacaattaa 1200
<210> 28<210> 28
<211> 1494<211> 1494
<212> ДНК<212> DNA
<213> Photobacterium leiognathi<213> Photobacterium leiognathi
<400> 28<400> 28
atgtgtaacg ataatcaaaa tacggtcgat gttgttgtga gcaccgttaa cgataacgtc 60atgtgtaacg ataatcaaaa tacggtcgat gttgttgtga gcaccgttaa cgataacgtc 60
atcgaaaaca acacgtacca agttaaaccg atcgataccc cgaccacgtt tgacagttac 120atcgaaaaca acacgtacca agttaaaccg atcgataccc cgaccacgtt tgacagttac 120
tcctggattc agacgtgcgg caccccgatc ctgaaagatg acgaaaaata ttcactgtcg 180tcctggattc agacgtgcgg caccccgatc ctgaaagatg acgaaaaata ttcactgtcg 180
tttgatttcg tcgccccgga actggatcag gacgaaaaat tctgtttcga atttaccggc 240tttgatttcg tcgccccgga actggatcag gacgaaaaat tctgtttcga atttaccggc 240
gatgttgacg gtaaacgtta tgtcacgcag accaacctga cggtggttgc accgaccctg 300gatgttgacg gtaaacgtta tgtcacgcag accaacctga cggtggttgc accgaccctg 300
gaagtttacg tcgatcatgc tagtctgccg tccctgcagc aactgatgaa aatcatccag 360gaagtttacg tcgatcatgc tagtctgccg tccctgcagc aactgatgaa aatcatccag 360
cagaaaaacg aatactcaca gaatgaacgt ttcatttcgt ggggccgcat cggtctgacg 420cagaaaaacg aatactcaca gaatgaacgt ttcatttcgt ggggccgcat cggtctgacg 420
gaagataacg cggaaaaact gaatgcccat atttatccgc tggcaggcaa caatacctca 480gaagataacg cggaaaaact gaatgcccat atttatccgc tggcaggcaa caatacctca 480
caggaactgg tggatgcagt gatcgattac gctgactcga aaaaccgtct gaatctggaa 540caggaactgg tggatgcagt gatcgattac gctgactcga aaaaccgtct gaatctggaa 540
ctgaacacga ataccgcgca cagctttccg aacctggccc cgattctgcg cattatcagc 600ctgaacacga ataccgcgca cagctttccg aacctggccc cgattctgcg cattatcagc 600
tctaaaagca acatcctgat ctctaacatc aacctgtacg atgacggcag tgctgaatat 660tctaaaagca acatcctgat ctctaacatc aacctgtacg atgacggcag tgctgaatat 660
gtgaacctgt acaattggaa agataccgaa gacaaatccg tgaaactgag cgattctttc 720gtgaacctgt acaattggaa agataccgaa gacaaatccg tgaaactgag cgattctttc 720
ctggttctga aagactactt taacggtatt agttccgaaa aaccgagcgg catctatggt 780ctggttctga aagactactt taacggtatt agttccgaaa aaccgagcgg catctatggt 780
cgctacaact ggcatcaact gtataatacg tcttattact tcctgcgtaa agattacctg 840cgctacaact ggcatcaact gtataatacg tcttattact tcctgcgtaa agattacctg 840
accgttgaac cgcagctgca cgacctgcgc gaatatctgg gcggtagtct gaaacaaatg 900accgttgaac cgcagctgca cgacctgcgc gaatatctgg gcggtagtct gaaacaaatg 900
tcctgggatg gcttttcaca gctgtcgaaa ggtgacaaag aactgttcct gaacattgtc 960tcctgggatg gcttttcaca gctgtcgaaa ggtgacaaag aactgttcct gaacattgtc 960
ggctttgatc aggaaaaact gcagcaagaa taccagcaat cagaactgcc gaatttcgtg 1020ggctttgatc aggaaaaact gcagcaagaa taccagcaat cagaactgcc gaatttcgtg 1020
tttacgggca ccacgacctg ggcaggcggt gaaaccaaag aatattacgc tcagcaacag 1080tttacgggca ccacgacctg ggcaggcggt gaaaccaaag aatattacgc tcagcaacag 1080
gtgaacgtcg tgaacaatgc gattaatgaa accagcccgt attacctggg ccgtgaacat 1140gtgaacgtcg tgaacaatgc gattaatgaa accagcccgt attacctggg ccgtgaacat 1140
gacctgtttt tcaaaggtca cccgcgcggc ggtattatca atgatattat cctgggcagt 1200gacctgtttt tcaaaggtca cccgcgcggc ggtattatca atgatattat cctgggcagt 1200
ttcaacaata tgattgacat cccggccaaa gtgtcctttg aagttctgat gatgacgggt 1260ttcaacaata tgattgacat cccggccaaa gtgtcctttg aagttctgat gatgacgggt 1260
atgctgccgg ataccgtggg cggtattgcg tcatcgctgt attttagcat cccggccgaa 1320atgctgccgg ataccgtggg cggtattgcg tcatcgctgt attttagcat cccggccgaa 1320
aaagtctctt tcattgtgtt taccagctct gatacgatca ccgatcgtga agacgcgctg 1380aaagtctctt tcattgtgtt taccagctct gatacgatca ccgatcgtga agacgcgctg 1380
aaatctccgc tggtgcaggt tatgatgacc ctgggcattg ttaaagaaaa agatgtgctg 1440aaatctccgc tggtgcaggt tatgatgacc ctgggcattg ttaaagaaaa agatgtgctg 1440
ttctggtcgg atctgccgga ttgttcctcg ggtgtttgta ttgctcagta ttaa 1494ttctggtcgg atctgccgga ttgttcctcg ggtgtttgta ttgctcagta ttaa 1494
<210> 29<210> 29
<211> 1497<211> 1497
<212> ДНК<212> DNA
<213> Photobacterium sp.<213> Photobacterium sp.
<400> 29<400> 29
atgagtgaag aaaacaccca gtccattatt aaaaacgaca tcaacaaaac catcatcgat 60atgagtgaag aaaacaccca gtccattatt aaaaacgaca tcaacaaaac catcatcgat 60
gaagaatacg ttaacctgga accgatcaac cagtctaaca tcagttttac caaacatagc 120gaagaatacg ttaacctgga accgatcaac cagtctaaca tcagttttac caaacatagc 120
tgggtccaga cctgcggtac gcagcaactg ctgacggaac aaaacaaaga atcaatttcg 180tgggtccaga cctgcggtac gcagcaactg ctgacggaac aaaacaaaga atcaatttcg 180
ctgagcgtgg ttgcgccgcg tctggatgac gatgaaaaat actgtttcga tttcaacggt 240ctgagcgtgg ttgcgccgcg tctggatgac gatgaaaaat actgtttcga tttcaacggt 240
gttagtaata aaggcgaaaa atacatcacc aaagtcacgc tgaatgtcgt ggcaccgtct 300gttagtaata aaggcgaaaa atacatcacc aaagtcacgc tgaatgtcgt ggcaccgtct 300
ctggaagttt atgtggatca tgctagtctg ccgaccctgc aacaactgat ggatattatc 360ctggaagttt atgtggatca tgctagtctg ccgaccctgc aacaactgat ggatattatc 360
aaatcggaag aagaaaaccc gaccgcacag cgttacattg cttggggccg catcgtgccg 420aaatcggaag aagaaaaccc gaccgcacag cgttacattg cttggggccg catcgtgccg 420
acggacgaac agatgaaaga actgaatatt accagctttg cgctgatcaa caatcacacg 480acggacgaac agatgaaaga actgaatatt accagctttg cgctgatcaa caatcacacg 480
ccggccgatc tggttcagga aattgtcaaa caggcgcaaa ccaaacatcg tctgaacgtg 540ccggccgatc tggttcagga aattgtcaaa caggcgcaaa ccaaacatcg tctgaacgtg 540
aaactgagca gcaatacggc ccactcgttt gacaatctgg ttccgattct gaaagaactg 600aaactgagca gcaatacggc ccactcgttt gacaatctgg ttccgattct gaaagaactg 600
aacagcttca acaatgtgac cgttacgaat atcgatctgt atgacgatgg cagcgcggaa 660aacagcttca acaatgtgac cgttacgaat atcgatctgt atgacgatgg cagcgcggaa 660
tatgttaacc tgtacaattg gcgcgacacc ctgaacaaaa cggataatct gaaaattggc 720tatgttaacc tgtacaattg gcgcgacacc ctgaacaaaa cggataatct gaaaattggc 720
aaagactatc tggaagatgt cattaacggt atcaatgaag ataccagcaa caccggcacg 780aaagactatc tggaagatgt cattaacggt atcaatgaag ataccagcaa caccggcacg 780
agttccgtgt acaattggca gaaactgtat ccggctaact accattttct gcgtaaagat 840agttccgtgt acaattggca gaaactgtat ccggctaact accattttct gcgtaaagat 840
tatctgaccc tggaaccgtc cctgcacgaa ctgcgcgact acattggtga ttcactgaaa 900tatctgaccc tggaaccgtc cctgcacgaa ctgcgcgact acattggtga ttcactgaaa 900
cagatgcaat gggacggctt caaaaaattc aactcgaaac agcaagaact gtttctgagc 960cagatgcaat gggacggctt caaaaaattc aactcgaaac agcaagaact gtttctgagc 960
atcgtgaatt tcgataaaca gaaactgcaa aacgaataca attcatcgaa cctgccgaat 1020atcgtgaatt tcgataaaca gaaactgcaa aacgaataca attcatcgaa cctgccgaat 1020
tttgtgttca ccggtaccac ggtttgggca ggcaaccacg aacgcgaata ctacgctaaa 1080tttgtgttca ccggtaccac ggtttgggca ggcaaccacg aacgcgaata ctacgctaaa 1080
cagcaaatca acgttatcaa caacgccatc aacgaaagct ctccgcatta tctgggtaat 1140cagcaaatca acgttatcaa caacgccatc aacgaaagct ctccgcatta tctgggtaat 1140
tcctacgacc tgtttttcaa aggccacccg ggcggtggca ttatcaacac cctgatcatg 1200tcctacgacc tgtttttcaa aggccacccg ggcggtggca ttatcaacac cctgatcatg 1200
cagaattatc cgtcaatggt cgatattccg tccaaaatct catttgaagt gctgatgatg 1260cagaattatc cgtcaatggt cgatattccg tccaaaatct catttgaagt gctgatgatg 1260
accgacatgc tgccggatgc cgtggcaggt attgcgagtt ccctgtactt cacgatcccg 1320accgacatgc tgccggatgc cgtggcaggt attgcgagtt ccctgtactt cacgatcccg 1320
gccgaaaaaa tcaaattcat cgttttcacc tctacggaaa ccattacgga tcgtgaaacc 1380gccgaaaaaa tcaaattcat cgttttcacc tctacggaaa ccattacgga tcgtgaaacc 1380
gccctgcgta gtccgctggt ccaggtgatg attaaactgg gcatcgtgaa agaagaaaat 1440gccctgcgta gtccgctggt ccaggtgatg attaaactgg gcatcgtgaa agaagaaaat 1440
gtgctgttct gggcggacct gccgaattgc gaaacgggtg tctgtattgc tgtctga 1497gtgctgttct gggcggacct gccgaattgc gaaacgggtg tctgtattgc tgtctga 1497
<210> 30<210> 30
<211> 1449<211> 1449
<212> ДНК<212> DNA
<213> Photobacterium leiognathi<213> Photobacterium leiognathi
<400> 30<400> 30
atgaacgata atcaaaatac ggtggacgtg gtggtctcaa ccgtcaacga taacgtgatc 60atgaacgata atcaaaatac ggtggacgtg gtggtctcaa ccgtcaacga taacgtgatc 60
gaaaacaaca cgtaccaagt caaaccgatc gataccccga ccacgttcga ctcatactcg 120gaaaacaaca cgtaccaagt caaaccgatc gataccccga ccacgttcga ctcatactcg 120
tggattcaga cgtgcggcac cccgatcctg aaagatgacg aaaaatatag cctgtctttt 180tggattcaga cgtgcggcac cccgatcctg aaagatgacg aaaaatatag cctgtctttt 180
gatttcgttg ccccggaact ggatcaagac gaaaaattct gtttcgaatt taccggcgat 240gatttcgttg ccccggaact ggatcaagac gaaaaattct gtttcgaatt taccggcgat 240
gtggatggta aacgttatgt gacgcagacc aacctgacgg tggttgcacc gaccctggaa 300gtggatggta aacgttatgt gacgcagacc aacctgacgg tggttgcacc gaccctggaa 300
gtttacgtcg atcatgcttc actgccgtcg ctgcagcaac tgatgaaaat catccagcag 360gtttacgtcg atcatgcttc actgccgtcg ctgcagcaac tgatgaaaat catccagcag 360
aaaaacgaat acagccagaa tgaacgcttt atttcttggg gccgtatccg cctgacggaa 420aaaaacgaat acagccagaa tgaacgcttt atttcttggg gccgtatccg cctgacggaa 420
gataacgcgg aaaaactgaa tgcccatatt tatccgctgg caggcaacaa taccagccag 480gataacgcgg aaaaactgaa tgcccatatt tatccgctgg caggcaacaa taccagccag 480
gaactggtgg acgcagttat cgattacgct gactctaaaa accgtctgaa tctggaactg 540gaactggtgg acgcagttat cgattacgct gactctaaaa accgtctgaa tctggaactg 540
aacacgaata ccggccacag tttccgtaac attgcgccga tcctgcgcgc caccagctct 600aacacgaata ccggccacag tttccgtaac attgcgccga tcctgcgcgc caccagctct 600
aaaaacaaca tcctgatctc caacatcaac ctgtacgatg acggtagtgc tgaatatgtg 660aaaaacaaca tcctgatctc caacatcaac ctgtacgatg acggtagtgc tgaatatgtg 660
tccctgtaca actggaaaga taccgacaat aaatcacaga aactgagtga ttcctttctg 720tccctgtaca actggaaaga taccgacaat aaatcacaga aactgagtga ttcctttctg 720
gttctgaaag actacctgaa tggcatcagt tccgaaaaac cgaacggtat ttatagcatc 780gttctgaaag actacctgaa tggcatcagt tccgaaaaac cgaacggtat ttatagcatc 780
tacaattggc atcagctgta tcactcatcg tattacttcc tgcgtaaaga ttacctgacg 840tacaattggc atcagctgta tcactcatcg tattacttcc tgcgtaaaga ttacctgacg 840
gtggaaacca aactgcacga cctgcgcgaa tatctgggcg gttcactgaa acaaatgtcg 900gtggaaacca aactgcacga cctgcgcgaa tatctgggcg gttcactgaa acaaatgtcg 900
tgggatacct ttagccagct gtctaaaggc gacaaagaac tgttcctgaa cattgttggt 960tgggatacct ttagccagct gtctaaaggc gacaaagaac tgttcctgaa cattgttggt 960
tttgatcagg aaaaactgca gcaagaatac cagcaaagcg aactgccgaa tttcgtcttt 1020tttgatcagg aaaaactgca gcaagaatac cagcaaagcg aactgccgaa tttcgtcttt 1020
acgggcacca cgacctgggc aggcggtgaa accaaagaat attacgctca gcaacaggtg 1080acgggcacca cgacctgggc aggcggtgaa accaaagaat attacgctca gcaacaggtg 1080
aacgtcgtga acaatgcgat taatgaaacc tctccgtatt acctgggccg tgaacatgac 1140aacgtcgtga acaatgcgat taatgaaacc tctccgtatt acctgggccg tgaacatgac 1140
ctgtttttca aaggtcaccc gcgcggcggt attatcaatg atattatcct gggctcattc 1200ctgtttttca aaggtcaccc gcgcggcggt attatcaatg atattatcct gggctcattc 1200
aacaatatga ttgacatccc ggccaaagtt tcgtttgaag tcctgatgat gacgggtatg 1260aacaatatga ttgacatccc ggccaaagtt tcgtttgaag tcctgatgat gacgggtatg 1260
ctgccggata ccgttggcgg tattgcgagc agcctgtatt ttagtatccc ggccgaaaaa 1320ctgccggata ccgttggcgg tattgcgagc agcctgtatt ttagtatccc ggccgaaaaa 1320
gtgtccttca ttgtttttac cagttccgat acgatcaccg atcgcgaaga cgcgctgaaa 1380gtgtccttca ttgtttttac cagttccgat acgatcaccg atcgcgaaga cgcgctgaaa 1380
agtccgctgg tccaagtgat gatgaccctg ggcattgtga aagaaaaaga tgtgctgttc 1440agtccgctgg tccaagtgat gatgaccctg ggcattgtga aagaaaaaga tgtgctgttc 1440
tggtgctaa 1449tggtgctaa 1449
<210> 31<210> 31
<211> 2028<211> 2028
<212> ДНК<212> DNA
<213> Photobacterium damsela<213> Photobacterium damsela
<400> 31<400> 31
atgaaaaaga tcctgaccgt cctgagcatc tttatcctga gcgcctgtaa tagcgacaac 60atgaaaaaga tcctgaccgt cctgagcatc tttatcctga gcgcctgtaa tagcgacaac 60
acctctctga aagaaaccgt ctccagcaac agcgcggatg tggttgaaac ggaaacctat 120acctctctga aagaaaccgt ctccagcaac agcgcggatg tggttgaaac ggaaacctat 120
cagctgaccc cgattgacgc cccgagcagc tttctgagcc attcttggga acagacgtgc 180cagctgaccc cgattgacgc cccgagcagc tttctgagcc attcttggga agacgtgc 180
ggcaccccga tcctgaatga aagtgataaa caagcgattt cctttgactt cgtggccccg 240ggcaccccga tcctgaatga aagtgataaa caagcgattt cctttgactt cgtggccccg 240
gaactgaaac aggatgaaaa atactgtttc acgttcaaag gcatcaccgg tgaccaccgc 300gaactgaaac aggatgaaaa atactgtttc acgttcaaag gcatcaccgg tgaccaccgc 300
tacattacga acaccaccct gaccgttgtg gcaccgacgc tggaagtgta tatcgatcat 360tacattacga acaccaccct gaccgttgtg gcaccgacgc tggaagtgta tatcgatcat 360
gctagtctgc cgagcctgca acaactgatt cacattatcc aggcgaaaga tgaatacccg 420gctagtctgc cgagcctgca acaactgatt cacattatcc aggcgaaaga tgaatacccg 420
tcaaaccaac gctttgtttc gtggaaacgt gttaccgtcg atgcggacaa cgccaataaa 480tcaaaccaac gctttgtttc gtggaaacgt gttaccgtcg atgcggacaa cgccaataaa 480
ctgaatattc atacctatcc gctgaaaggc aacaatacgt caccggaaat ggttgcggcc 540ctgaatattc atacctatcc gctgaaaggc aacaatacgt caccggaaat ggttgcggcc 540
atcgatgaat atgcacaatc gaaaaaccgc ctgaatattg aattttacac gaataccgct 600atcgatgaat atgcacaatc gaaaaaccgc ctgaatattg aattttacac gaataccgct 600
catgtcttca acaatctgcc gccgattatc cagccgctgt acaacaacga aaaagtcaaa 660catgtcttca acaatctgcc gccgattatc cagccgctgt acaacaacga aaaagtcaaa 660
atttcacaca tctcgctgta cgatgacggt agttccgaat atgtgagtct gtaccagtgg 720atttcacaca tctcgctgta cgatgacggt agttccgaat atgtgagtct gtaccagtgg 720
aaagataccc cgaacaaaat tgaaacgctg gaaggcgaag tgagcctgct ggcaaattat 780aaagataccc cgaacaaaat tgaaacgctg gaaggcgaag tgagcctgct ggcaaattat 780
ctggctggca ccagcccgga tgcaccgaaa ggcatgggta accgttataa ttggcataaa 840ctggctggca ccagcccgga tgcaccgaaa ggcatgggta accgttataa ttggcataaa 840
ctgtacgata ccgactatta ctttctgcgc gaagattatc tggacgtgga agcgaacctg 900ctgtacgata ccgactatta ctttctgcgc gaagattatc tggacgtgga agcgaacctg 900
cacgatctgc gtgactacct gggttcatcg gcaaaacaga tgccgtggga tgaatttgct 960cacgatctgc gtgactacct gggttcatcg gcaaaacaga tgccgtggga tgaatttgct 960
aaactgagtg actcccagca aaccctgttt ctggatatcg ttggcttcga caaagaacag 1020aaactgagtg actcccagca aaccctgttt ctggatatcg ttggcttcga caaagaacag 1020
ctgcaacaac agtattcaca atcgccgctg ccgaatttta tttttaccgg caccaccacc 1080ctgcaacaac agtattcaca atcgccgctg ccgaatttta tttttaccgg caccaccacc 1080
tgggcgggcg gtgaaacgaa agaatattac gcccaacagc aagtgaacgt tattaacaat 1140tgggcgggcg gtgaaacgaa agaatattac gcccaacagc aagtgaacgt tattaacaat 1140
gccatcaatg aaaccagccc gtattacctg ggcaaagatt acgacctgtt tttcaaaggt 1200gccatcaatg aaaccagccc gtattacctg ggcaaagatt acgacctgtt tttcaaaggt 1200
catccggcag gcggtgtgat caacgatatt atcctgggca gttttccgga catgattaat 1260catccggcag gcggtgtgat caacgatatt atcctgggca gttttccgga catgattaat 1260
atcccggcta aaatttcctt cgaagtgctg atgatgaccg atatgctgcc ggacacggtt 1320atcccggcta aaatttcctt cgaagtgctg atgatgaccg atatgctgcc ggacacggtt 1320
gcaggtatcg ctagctctct gtattttacc attccggcgg ataaagtgaa ctttatcgtt 1380gcaggtatcg ctagctctct gtattttacc attccggcgg ataaagtgaa ctttatcgtt 1380
ttcacgagtt ccgatacgat taccgaccgt gaagaagccc tgaaaagccc gctggtccag 1440ttcacgagtt ccgatacgat taccgaccgt gaagaagccc tgaaaagccc gctggtccag 1440
gtgatgctga ccctgggcat cgtcaaagaa aaagatgtgc tgttctgggc agaccacaaa 1500gtgatgctga ccctgggcat cgtcaaagaa aaagatgtgc tgttctgggc agaccacaaa 1500
gttaatagca tggaagtcgc gattgatgaa gcctgcaccc gcattatcgc aaaacgtcag 1560gttaatagca tggaagtcgc gattgatgaa gcctgcaccc gcattatcgc aaaacgtcag 1560
ccgacggctt ctgatctgcg cctggtgatt gcgattatca aaacgatcac cgatctggaa 1620ccgacggctt ctgatctgcg cctggtgatt gcgattatca aaacgatcac cgatctggaa 1620
cgtattggcg acgttgccga atctattgcg aaagtcgcgc tggaatcttt ttctaacaaa 1680cgtattggcg acgttgccga atctattgcg aaagtcgcgc tggaatcttt ttctaacaaa 1680
cagtacaatc tgctggttag cctggaatct ctgggtcaac ataccgtgcg catgctgcac 1740cagtacaatc tgctggttag cctggaatct ctgggtcaac ataccgtgcg catgctgcac 1740
gaagttctgg atgcattcgc tcgtatggac gtcaaagcag ctatcgaagt gtatcaggaa 1800gaagttctgg atgcattcgc tcgtatggac gtcaaagcag ctatcgaagt gtatcaggaa 1800
gatgaccgca tcgatcaaga atacgaaagt attgtccgtc agctgatggc ccacatgatg 1860gatgaccgca tcgatcaaga atacgaaagt attgtccgtc agctgatggc ccacatgatg 1860
gaagatccgt catcgattcc gaacgttatg aaagtcatgt gggcggcccg ttccatcgaa 1920gaagatccgt catcgattcc gaacgttatg aaagtcatgt gggcggcccg ttccatcgaa 1920
cgcgttggtg atcgttgcca gaatatttgt gaatacatca tctacttcgt gaaaggcaaa 1980cgcgttggtg atcgttgcca gaatatttgt gaatacatca tctacttcgt gaaaggcaaa 1980
gatgttcgcc acaccaaacc ggatgacttc ggtacgatgc tggactaa 2028gatgttcgcc acaccaaacc ggatgacttc ggtacgatgc tggactaa 2028
<210> 32<210> 32
<211> 1533<211> 1533
<212> ДНК<212> DNA
<213> Photobacterium damsela<213> Photobacterium damsela
<400> 32<400> 32
atgaaaaaga tcctgaccgt cctgagcatc tttatcctga gcgcctgtaa tagcgacaac 60atgaaaaaga tcctgaccgt cctgagcatc tttatcctga gcgcctgtaa tagcgacaac 60
acctctctga aagaaaccgt ctccagcaac agcgcggatg tggttgaaac ggaaacctat 120acctctctga aagaaaccgt ctccagcaac agcgcggatg tggttgaaac ggaaacctat 120
cagctgaccc cgattgacgc cccgagcagc tttctgagcc attcttggga acagacgtgc 180cagctgaccc cgattgacgc cccgagcagc tttctgagcc attcttggga agacgtgc 180
ggcaccccga tcctgaatga aagtgataaa caagcgattt cctttgactt cgtggccccg 240ggcaccccga tcctgaatga aagtgataaa caagcgattt cctttgactt cgtggccccg 240
gaactgaaac aggatgaaaa atactgtttc acgttcaaag gcatcaccgg tgaccaccgc 300gaactgaaac aggatgaaaa atactgtttc acgttcaaag gcatcaccgg tgaccaccgc 300
tacattacga acaccaccct gaccgttgtg gcaccgacgc tggaagtgta tatcgatcat 360tacattacga acaccaccct gaccgttgtg gcaccgacgc tggaagtgta tatcgatcat 360
gctagtctgc cgagcctgca acaactgatt cacattatcc aggcgaaaga tgaatacccg 420gctagtctgc cgagcctgca acaactgatt cacattatcc aggcgaaaga tgaatacccg 420
tcaaaccaac gctttgtttc gtggaaacgt gttaccgtcg atgcggacaa cgccaataaa 480tcaaaccaac gctttgtttc gtggaaacgt gttaccgtcg atgcggacaa cgccaataaa 480
ctgaatattc atacctatcc gctgaaaggc aacaatacgt caccggaaat ggttgcggcc 540ctgaatattc atacctatcc gctgaaaggc aacaatacgt caccggaaat ggttgcggcc 540
atcgatgaat atgcacaatc gaaaaaccgc ctgaatattg aattttacac gaataccgct 600atcgatgaat atgcacaatc gaaaaaccgc ctgaatattg aattttacac gaataccgct 600
catgtcttca acaatctgcc gccgattatc cagccgctgt acaacaacga aaaagtcaaa 660catgtcttca acaatctgcc gccgattatc cagccgctgt acaacaacga aaaagtcaaa 660
atttcacaca tctcgctgta cgatgacggt agttccgaat atgtgagtct gtaccagtgg 720atttcacaca tctcgctgta cgatgacggt agttccgaat atgtgagtct gtaccagtgg 720
aaagataccc cgaacaaaat tgaaacgctg gaaggcgaag tgagcctgct ggcaaattat 780aaagataccc cgaacaaaat tgaaacgctg gaaggcgaag tgagcctgct ggcaaattat 780
ctggctggca ccagcccgga tgcaccgaaa ggcatgggta accgttataa ttggcataaa 840ctggctggca ccagcccgga tgcaccgaaa ggcatgggta accgttataa ttggcataaa 840
ctgtacgata ccgactatta ctttctgcgc gaagattatc tggacgtgga agcgaacctg 900ctgtacgata ccgactatta ctttctgcgc gaagattatc tggacgtgga agcgaacctg 900
cacgatctgc gtgactacct gggttcatcg gcaaaacaga tgccgtggga tgaatttgct 960cacgatctgc gtgactacct gggttcatcg gcaaaacaga tgccgtggga tgaatttgct 960
aaactgagtg actcccagca aaccctgttt ctggatatcg ttggcttcga caaagaacag 1020aaactgagtg actcccagca aaccctgttt ctggatatcg ttggcttcga caaagaacag 1020
ctgcaacaac agtattcaca atcgccgctg ccgaatttta tttttaccgg caccaccacc 1080ctgcaacaac agtattcaca atcgccgctg ccgaatttta tttttaccgg caccaccacc 1080
tgggcgggcg gtgaaacgaa agaatattac gcccaacagc aagtgaacgt tattaacaat 1140tgggcgggcg gtgaaacgaa agaatattac gcccaacagc aagtgaacgt tattaacaat 1140
gccatcaatg aaaccagccc gtattacctg ggcaaagatt acgacctgtt tttcaaaggt 1200gccatcaatg aaaccagccc gtattacctg ggcaaagatt acgacctgtt tttcaaaggt 1200
catccggcag gcggtgtgat caacgatatt atcctgggca gttttccgga catgattaat 1260catccggcag gcggtgtgat caacgatatt atcctgggca gttttccgga catgattaat 1260
atcccggcta aaatttcctt cgaagtgctg atgatgaccg atatgctgcc ggacacggtt 1320atcccggcta aaatttcctt cgaagtgctg atgatgaccg atatgctgcc ggacacggtt 1320
gcaggtatcg ctagctctct gtattttacc attccggcgg ataaagtgaa ctttatcgtt 1380gcaggtatcg ctagctctct gtattttacc attccggcgg ataaagtgaa ctttatcgtt 1380
ttcacgagtt ccgatacgat taccgaccgt gaagaagccc tgaaaagccc gctggtccag 1440ttcacgagtt ccgatacgat taccgaccgt gaagaagccc tgaaaagccc gctggtccag 1440
gtgatgctga ccctgggcat cgtcaaagaa aaagatgtgc tgttctgggc agacctgccg 1500gtgatgctga ccctgggcat cgtcaaagaa aaagatgtgc tgttctgggc agacctgccg 1500
gactgctcgt ctggtgtgtg tatcgacaaa taa 1533gactgctcgt ctggtgtgtg tatcgacaaa taa 1533
<210> 33<210> 33
<211> 1269<211> 1269
<212> ДНК<212> DNA
<213> Heliobacter acinonychis<213> Heliobacter acinonychis
<400> 33<400> 33
atggggacca ttaaaaagcc cttaatcata gcaggaaatg gtccatcaat taaggaccta 60atggggacca ttaaaaagcc cttaatcata gcaggaaatg gtccatcaat taaggaccta 60
gactatgctt tatttccaaa agacttcgat gtctttcgct gcaaccagtt ttacttcgag 120gactatgctt tatttccaaa agacttcgat gtctttcgct gcaaccagtt ttacttcgag 120
gataaatatt acctaggacg cgaaataaaa ggagtgttct ttaacccttg tgtattaagc 180gataaatatt acctaggacg cgaaataaaa ggagtgttct ttaacccttg tgtattaagc 180
agtcaaatgc aaacagtgca ataccttatg gacaatggcg aatatagcat agaacgcttc 240agtcaaatgc aaacagtgca ataccttatg gacaatggcg aatatagcat agaacgcttc 240
ttttgcagtg tttcaacaga tcgccacgat tttgatgggg attaccaaac gattttaccg 300ttttgcagtg tttcaacaga tcgccacgat tttgatgggg attaccaaac gattttaccg 300
gtagacggtt atttaaaagc acactatccg ttcgtctgcg atacattcag cttattcaaa 360gtagacggtt atttaaaagc acactatccg ttcgtctgcg atacattcag cttattcaaa 360
ggtcacgaag aaatcttaaa acacgtgaaa taccacctga aaacgtacag caaagaactt 420ggtcacgaag aaatcttaaa acacgtgaaa taccacctga aaacgtacag caaagaactt 420
agtgcgggtg tcttaatgtt attgagtgca gtggtattag gatacaaaga aatataccta 480agtgcgggtg tcttaatgtt attgagtgca gtggtattag gatacaaaga aatataccta 480
gtaggaatcg acttcggcgc ctcatcttgg gggcacttct atgacgaaag ccaatcccaa 540gtaggaatcg acttcggcgc ctcatcttgg gggcacttct atgacgaaag ccaatcccaa 540
cactttagca atcacatggc agattgtcac aatatctatt acgacatgct gactatttgt 600cactttagca atcacatggc agattgtcac aatatctatt acgacatgct gactatttgt 600
ctctgtcaaa agtatgcaaa attgtacgca ttagcaccca attcaccatt atcacatttg 660ctctgtcaaa agtatgcaaa attgtacgca ttagcaccca attcaccatt atcacatttg 660
cttacactaa atccacaggc caaataccca tttgaactat tagataaacc tatcgggtat 720cttacactaa atccacaggc caaataccca tttgaactat tagataaacc tatcgggtat 720
actagcgacc taattattag tagcccgttg gaagagaagt tgctcgaatt taagaatatc 780actagcgacc taattattag tagcccgttg gaagagaagt tgctcgaatt taagaatatc 780
gaagagaagt tgcttgagtt caaaaacata gaagagaaac tcttagagtt caagaatatt 840gaagagaagt tgcttgagtt caaaaacata gaagagaaac tcttagagtt caagaatatt 840
gaagagaaac tattagaatt taaaaacatc gaggaaaaac ttttggagtt caaaaatata 900gaagagaaac tattagaatt taaaaacatc gaggaaaaac ttttggagtt caaaaatata 900
gaagagaaac tcctagagtt caagaacatt gaggaaaagt tgcttgagtt caaaaatatt 960gaagagaaac tcctagagtt caagaacatt gaggaaaagt tgcttgagtt caaaaatatt 960
gaggaaaagt tgctcgaatt taagaatatc gaggaaaaac ttttggaatt taagaacata 1020gaggaaaagt tgctcgaatt taagaatatc gaggaaaaac ttttggaatt taagaacata 1020
gaagaaaagt tactcgaatt taaaaacatt gaagagaaac tattggaatt taaaaatata 1080gaagaaaagt tactcgaatt taaaaacatt gaagagaaac tattggaatt taaaaatata 1080
gaggaaaagt tacttgagtt caaaaacata gaggaaaagt tacttgaatt taagaacata 1140gaggaaaagt tacttgagtt caaaaacata gaggaaaagt tacttgaatt taagaacata 1140
gaagagaaac ttctcgcaag ccgactgaac aacattctac gtaaaatcaa gcggaaaata 1200gaagagaaac ttctcgcaag ccgactgaac aacattctac gtaaaatcaa gcggaaaata 1200
cttccattct tttggggcgg aggtgtaacc ccaacattaa aagttagttt ccgttgggga 1260cttccattct tttggggcgg aggtgtaacc ccaacattaa aagttagttt ccgttgggga 1260
gctgcataa 1269gctgcataa 1269
<210> 34<210> 34
<211> 469<211> 469
<212> Белок<212> Protein
<213> Campylobacter coli<213> Campylobacter coli
<400> 34<400> 34
Met Gln Asn Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Gln Ser Ile Met Gln Asn Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Gln Ser Ile
1 5 10 15 1 5 10 15
Asn Tyr Gln Arg Leu Pro Lys Glu Tyr Asp Ile Phe Arg Cys Asn Gln Asn Tyr Gln Arg Leu Pro Lys Glu Tyr Asp Ile Phe Arg Cys Asn Gln
20 25 30 20 25 30
Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Asn Ile Lys Ala Ala Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Asn Ile Lys Ala Ala
35 40 45 35 40 45
Phe Phe Asn Pro Tyr Pro Phe Leu Gln Gln Tyr His Thr Ala Lys Gln Phe Phe Asn Pro Tyr Pro Phe Leu Gln Gln Tyr His Thr Ala Lys Gln
50 55 60 50 55 60
Leu Val Phe Asn Asn Glu Tyr Lys Ile Glu Asn Ile Phe Cys Ser Thr Leu Val Phe Asn Asn Glu Tyr Lys Ile Glu Asn Ile Phe Cys Ser Thr
65 70 75 80 65 70 75 80
Phe Asn Leu Pro Phe Ile Glu Lys Asp Asn Phe Ile Asn Lys Phe Tyr Phe Asn Leu Pro Phe Ile Glu Lys Asp Asn Phe Ile Asn Lys Phe Tyr
85 90 95 85 90 95
Asp Phe Phe Pro Asp Ala Lys Leu Gly His Lys Ile Ile Glu Asn Leu Asp Phe Phe Pro Asp Ala Lys Leu Gly His Lys Ile Ile Glu Asn Leu
100 105 110 100 105 110
Lys Glu Phe Tyr Ala Tyr Ile Lys Tyr Asn Glu Ile Tyr Leu Asn Lys Lys Glu Phe Tyr Ala Tyr Ile Lys Tyr Asn Glu Ile Tyr Leu Asn Lys
115 120 125 115 120 125
Arg Ile Thr Ser Gly Ile Tyr Met Cys Ala Ile Ala Ile Ala Leu Gly Arg Ile Thr Ser Gly Ile Tyr Met Cys Ala Ile Ala Ile Ala Leu Gly
130 135 140 130 135 140
Tyr Lys Asn Ile Tyr Leu Cys Gly Ile Asp Phe Tyr Glu Gly Glu Thr Tyr Lys Asn Ile Tyr Leu Cys Gly Ile Asp Phe Tyr Glu Gly Glu Thr
145 150 155 160 145 150 155 160
Ile Tyr Pro Phe Lys Ala Met Ser Lys Asn Ile Lys Lys Ile Phe Pro Ile Tyr Pro Phe Lys Ala Met Ser Lys Asn Ile Lys Lys Ile Phe Pro
165 170 175 165 170 175
Trp Ile Lys Asp Phe Asn Pro Ser Asn Phe His Ser Lys Glu Tyr Asp Trp Ile Lys Asp Phe Asn Pro Ser Asn Phe His Ser Lys Glu Tyr Asp
180 185 190 180 185 190
Ile Glu Ile Leu Lys Leu Leu Glu Ser Ile Tyr Lys Val Asn Ile Tyr Ile Glu Ile Leu Lys Leu Leu Glu Ser Ile Tyr Lys Val Asn Ile Tyr
195 200 205 195 200 205
Ala Leu Cys Asp Asn Ser Ala Leu Ala Asn Tyr Phe Pro Leu Leu Val Ala Leu Cys Asp Asn Ser Ala Leu Ala Asn Tyr Phe Pro Leu Leu Val
210 215 220 210 215 220
Asn Thr Asp Asn Ser Phe Val Leu Glu Asn Lys Ser Asp Asp Cys Ile Asn Thr Asp Asn Ser Phe Val Leu Glu Asn Lys Ser Asp Asp Cys Ile
225 230 235 240 225 230 235 240
Asn Asp Ile Leu Leu Thr Asn Asn Thr Pro Gly Ile Asn Phe Tyr Lys Asn Asp Ile Leu Leu Thr Asn Asn Thr Pro Gly Ile Asn Phe Tyr Lys
245 250 255 245 250 255
Ser Gln Ile Gln Val Asn Asn Thr Glu Ile Leu Leu Leu Asn Phe Gln Ser Gln Ile Gln Val Asn Asn Thr Glu Ile Leu Leu Leu Asn Phe Gln
260 265 270 260 265 270
Asn Met Ile Ser Ala Lys Glu Asn Glu Ile Ser Asn Leu Asn Lys Ile Asn Met Ile Ser Ala Lys Glu Asn Glu Ile Ser Asn Leu Asn Lys Ile
275 280 285 275 280 285
Leu Gln Asp Ser Tyr Lys Thr Ile Asn Thr Lys Glu Asn Glu Ile Ser Leu Gln Asp Ser Tyr Lys Thr Ile Asn Thr Lys Glu Asn Glu Ile Ser
290 295 300 290 295 300
Asn Leu Asn Lys Ile Leu Gln Asp Ser Tyr Lys Thr Ile Asn Thr Lys Asn Leu Asn Lys Ile Leu Gln Asp Ser Tyr Lys Thr Ile Asn Thr Lys
305 310 315 320 305 310 315 320
Glu Asn Glu Ile Ser Asn Leu Asn Lys Ile Leu Gln Asp Lys Asp Lys Glu Asn Glu Ile Ser Asn Leu Asn Lys Ile Leu Gln Asp Lys Asp Lys
325 330 335 325 330 335
Leu Leu Ile Val Lys Glu Asn Leu Leu Asn Phe Lys Ser Arg His Gly Leu Leu Ile Val Lys Glu Asn Leu Leu Asn Phe Lys Ser Arg His Gly
340 345 350 340 345 350
Lys Ala Lys Phe Arg Ile Gln Asn Gln Leu Ser Tyr Lys Leu Gly Gln Lys Ala Lys Phe Arg Ile Gln Asn Gln Leu Ser Tyr Lys Leu Gly Gln
355 360 365 355 360 365
Ala Met Met Val Asn Ser Lys Ser Leu Leu Gly Tyr Ile Arg Met Pro Ala Met Met Val Asn Ser Lys Ser Leu Leu Gly Tyr Ile Arg Met Pro
370 375 380 370 375 380
Phe Val Leu Ser Tyr Ile Lys Asp Lys His Lys Gln Glu Gln Lys Ile Phe Val Leu Ser Tyr Ile Lys Asp Lys His Lys Gln Glu Gln Lys Ile
385 390 395 400 385 390 395 400
Tyr Gln Glu Lys Ile Lys Lys Asp Pro Ser Leu Thr Leu Pro Pro Leu Tyr Gln Glu Lys Ile Lys Lys Asp Pro Ser Leu Thr Leu Pro Pro Leu
405 410 415 405 410 415
Glu Asp Tyr Pro Asp Tyr Lys Glu Ala Leu Lys Glu Lys Glu Cys Leu Glu Asp Tyr Pro Asp Tyr Lys Glu Ala Leu Lys Glu Lys Glu Cys Leu
420 425 430 420 425 430
Thr Tyr Arg Leu Gly Gln Thr Leu Ile Lys Ala Asp Gln Glu Trp Tyr Thr Tyr Arg Leu Gly Gln Thr Leu Ile Lys Ala Asp Gln Glu Trp Tyr
435 440 445 435 440 445
Lys Gly Gly Tyr Val Lys Met Trp Phe Glu Ile Lys Lys Leu Lys Lys Lys Gly Gly Tyr Val Lys Met Trp Phe Glu Ile Lys Lys Leu Lys Lys
450 455 460 450 455 460
Glu Tyr Lys Lys Lys Glu Tyr Lys Lys Lys
465 465
<210> 35<210> 35
<211> 381<211> 381
<212> Белок<212> Protein
<213> Vibrio sp.<213> Vibrio sp.
<400> 35<400> 35
Met Asn Asn Asp Asn Ser Thr Thr Thr Asn Asn Asn Ala Ile Glu Ile Met Asn Asn Asp Asn Ser Thr Thr Thr Asn Asn Asn Ala Ile Glu Ile
1 5 10 15 1 5 10 15
Tyr Val Asp Arg Ala Thr Leu Pro Thr Ile Gln Gln Met Thr Lys Ile Tyr Val Asp Arg Ala Thr Leu Pro Thr Ile Gln Gln Met Thr Lys Ile
20 25 30 20 25 30
Val Ser Gln Lys Thr Ser Asn Lys Lys Leu Ile Ser Trp Ser Arg Tyr Val Ser Gln Lys Thr Ser Asn Lys Lys Leu Ile Ser Trp Ser Arg Tyr
35 40 45 35 40 45
Pro Ile Thr Asp Lys Ser Leu Leu Lys Lys Ile Asn Ala Glu Phe Phe Pro Ile Thr Asp Lys Ser Leu Leu Lys Lys Ile Asn Ala Glu Phe Phe
50 55 60 50 55 60
Lys Glu Gln Phe Glu Leu Thr Glu Ser Leu Lys Asn Ile Ile Leu Ser Lys Glu Gln Phe Glu Leu Thr Glu Ser Leu Lys Asn Ile Ile Leu Ser
65 70 75 80 65 70 75 80
Glu Asn Ile Asp Asn Leu Ile Ile His Gly Asn Thr Leu Trp Ser Ile Glu Asn Ile Asp Asn Leu Ile Ile His Gly Asn Thr Leu Trp Ser Ile
85 90 95 85 90 95
Asp Val Val Asp Ile Ile Lys Glu Val Asn Leu Leu Gly Lys Asn Ile Asp Val Val Asp Ile Ile Lys Glu Val Asn Leu Leu Gly Lys Asn Ile
100 105 110 100 105 110
Pro Ile Glu Leu His Phe Tyr Asp Asp Gly Ser Ala Glu Tyr Val Arg Pro Ile Glu Leu His Phe Tyr Asp Asp Gly Ser Ala Glu Tyr Val Arg
115 120 125 115 120 125
Ile Tyr Glu Phe Ser Lys Leu Pro Glu Ser Glu Gln Lys Tyr Lys Thr Ile Tyr Glu Phe Ser Lys Leu Pro Glu Ser Glu Gln Lys Tyr Lys Thr
130 135 140 130 135 140
Ser Leu Ser Lys Asn Asn Ile Lys Phe Ser Ile Asp Gly Thr Asp Ser Ser Leu Ser Lys Asn Asn Ile Lys Phe Ser Ile Asp Gly Thr Asp Ser
145 150 155 160 145 150 155 160
Phe Lys Asn Thr Ile Glu Asn Ile Tyr Gly Phe Ser Gln Leu Tyr Pro Phe Lys Asn Thr Ile Glu Asn Ile Tyr Gly Phe Ser Gln Leu Tyr Pro
165 170 175 165 170 175
Thr Thr Tyr His Met Leu Arg Ala Asp Ile Phe Asp Thr Thr Leu Lys Thr Thr Tyr His Met Leu Arg Ala Asp Ile Phe Asp Thr Thr Leu Lys
180 185 190 180 185 190
Ile Asn Pro Leu Arg Glu Leu Leu Ser Asn Asn Ile Lys Gln Met Lys Ile Asn Pro Leu Arg Glu Leu Leu Ser Asn Asn Ile Lys Gln Met Lys
195 200 205 195 200 205
Trp Asp Tyr Phe Lys Asp Phe Asn Tyr Lys Gln Lys Asp Ile Phe Tyr Trp Asp Tyr Phe Lys Asp Phe Asn Tyr Lys Gln Lys Asp Ile Phe Tyr
210 215 220 210 215 220
Ser Leu Thr Asn Phe Asn Pro Lys Glu Ile Gln Glu Asp Phe Asn Lys Ser Leu Thr Asn Phe Asn Pro Lys Glu Ile Gln Glu Asp Phe Asn Lys
225 230 235 240 225 230 235 240
Asn Ser Asn Lys Asn Phe Ile Phe Ile Gly Ser Asn Ser Ala Thr Ala Asn Ser Asn Lys Asn Phe Ile Phe Ile Gly Ser Asn Ser Ala Thr Ala
245 250 255 245 250 255
Thr Ala Glu Glu Gln Ile Asn Ile Ile Ser Glu Ala Lys Lys Glu Asn Thr Ala Glu Glu Gln Ile Asn Ile Ile Ser Glu Ala Lys Lys Glu Asn
260 265 270 260 265 270
Ser Ser Ile Ile Thr Asn Ser Ile Ser Asp Tyr Asp Leu Phe Phe Lys Ser Ser Ile Ile Thr Asn Ser Ile Ser Asp Tyr Asp Leu Phe Phe Lys
275 280 285 275 280 285
Gly His Pro Ser Ala Thr Phe Asn Glu Gln Ile Ile Asn Ala His Asp Gly His Pro Ser Ala Thr Phe Asn Glu Gln Ile Ile Asn Ala His Asp
290 295 300 290 295 300
Met Ile Glu Ile Asn Asn Lys Ile Pro Phe Glu Ala Leu Ile Met Thr Met Ile Glu Ile Asn Asn Lys Ile Pro Phe Glu Ala Leu Ile Met Thr
305 310 315 320 305 310 315 320
Gly Ile Leu Pro Asp Ala Val Gly Gly Met Gly Ser Ser Val Phe Phe Gly Ile Leu Pro Asp Ala Val Gly Gly Met Gly Ser Ser Val Phe Phe
325 330 335 325 330 335
Ser Ile Pro Lys Glu Val Lys Asn Lys Phe Val Phe Tyr Lys Ser Gly Ser Ile Pro Lys Glu Val Lys Asn Lys Phe Val Phe Tyr Lys Ser Gly
340 345 350 340 345 350
Thr Asp Ile Glu Asn Asn Ser Leu Ile Gln Val Met Leu Lys Leu Asn Thr Asp Ile Glu Asn Asn Ser Leu Ile Gln Val Met Leu Lys Leu Asn
355 360 365 355 360 365
Leu Ile Asn Arg Asp Asn Ile Lys Leu Ile Ser Asp Ile Leu Ile Asn Arg Asp Asn Ile Lys Leu Ile Ser Asp Ile
370 375 380 370 375 380
<210> 36<210> 36
<211> 390<211> 390
<212> Белок<212> Protein
<213> Photobacterium sp.<213> Photobacterium sp.
<400> 36<400> 36
Met Gly Cys Asn Ser Asp Ser Asn His Asn Asn Ser Asp Gly Asn Ile Met Gly Cys Asn Ser Asp Ser Asn His Asn Asn Ser Asp Gly Asn Ile
1 5 10 15 1 5 10 15
Thr Lys Asn Lys Thr Ile Glu Val Tyr Val Asp Arg Ala Thr Leu Pro Thr Lys Asn Lys Thr Ile Glu Val Tyr Val Asp Arg Ala Thr Leu Pro
20 25 30 20 25 30
Thr Ile Gln Gln Met Thr Gln Ile Ile Asn Glu Asn Ser Asn Asn Lys Thr Ile Gln Gln Met Thr Gln Ile Ile Asn Glu Asn Ser Asn Asn Lys
35 40 45 35 40 45
Lys Leu Ile Ser Trp Ser Arg Tyr Pro Ile Asn Asp Glu Glu Leu Leu Lys Leu Ile Ser Trp Ser Arg Tyr Pro Ile Asn Asp Glu Glu Leu Leu
50 55 60 50 55 60
Glu Ser Ile Asn Gly Ser Phe Phe Lys Asn Asn Ser Glu Leu Ile Lys Glu Ser Ile Asn Gly Ser Phe Phe Lys Asn Asn Ser Glu Leu Ile Lys
65 70 75 80 65 70 75 80
Ser Leu Asp Ser Met Ile Leu Thr Asn Asp Ile Lys Lys Val Ile Ile Ser Leu Asp Ser Met Ile Leu Thr Asn Asp Ile Lys Lys Val Ile Ile
85 90 95 85 90 95
Asn Gly Asn Thr Leu Trp Ala Ala Asp Val Val Asn Ile Ile Lys Ser Asn Gly Asn Thr Leu Trp Ala Ala Asp Val Val Asn Ile Ile Lys Ser
100 105 110 100 105 110
Ile Glu Ala Phe Gly Lys Lys Thr Glu Ile Glu Leu Asn Phe Tyr Asp Ile Glu Ala Phe Gly Lys Lys Thr Glu Ile Glu Leu Asn Phe Tyr Asp
115 120 125 115 120 125
Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr Asp Phe Ser Lys Leu Pro Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr Asp Phe Ser Lys Leu Pro
130 135 140 130 135 140
Glu Ser Glu Gln Glu Tyr Lys Ile Ser Leu Ser Lys Asp Asn Ile Leu Glu Ser Glu Gln Glu Tyr Lys Ile Ser Leu Ser Lys Asp Asn Ile Leu
145 150 155 160 145 150 155 160
Ser Ser Ile Asn Gly Thr Gln Pro Phe Glu Asn Val Val Glu Asn Ile Ser Ser Ile Asn Gly Thr Gln Pro Phe Glu Asn Val Val Glu Asn Ile
165 170 175 165 170 175
Tyr Gly Phe Ser Gln Leu Tyr Pro Thr Thr Tyr His Met Leu Arg Ala Tyr Gly Phe Ser Gln Leu Tyr Pro Thr Thr Tyr His Met Leu Arg Ala
180 185 190 180 185 190
Asp Ile Phe Glu Thr Asn Leu Pro Leu Arg Ser Leu Lys Gly Val Leu Asp Ile Phe Glu Thr Asn Leu Pro Leu Arg Ser Leu Lys Gly Val Leu
195 200 205 195 200 205
Ser Asn Asn Ile Lys Gln Met Lys Trp Asp Tyr Phe Lys Thr Phe Asn Ser Asn Asn Ile Lys Gln Met Lys Trp Asp Tyr Phe Lys Thr Phe Asn
210 215 220 210 215 220
Ser Gln Gln Lys Asp Lys Phe Tyr Asn Phe Thr Gly Phe Asn Pro Asp Ser Gln Gln Lys Asp Lys Phe Tyr Asn Phe Thr Gly Phe Asn Pro Asp
225 230 235 240 225 230 235 240
Glu Ile Met Glu Gln Tyr Lys Ala Ser Pro Asn Lys Asn Phe Ile Phe Glu Ile Met Glu Gln Tyr Lys Ala Ser Pro Asn Lys Asn Phe Ile Phe
245 250 255 245 250 255
Val Gly Thr Asn Ser Gly Thr Ala Thr Ala Glu Gln Gln Ile Asp Ile Val Gly Thr Asn Ser Gly Thr Ala Thr Ala Glu Gln Gln Ile Asp Ile
260 265 270 260 265 270
Leu Thr Glu Ala Lys Asn Pro Asn Ser Pro Ile Ile Thr Lys Ser Ile Leu Thr Glu Ala Lys Asn Pro Asn Ser Pro Ile Ile Thr Lys Ser Ile
275 280 285 275 280 285
Gln Gly Phe Asp Leu Phe Phe Lys Gly His Pro Ser Ala Thr Tyr Asn Gln Gly Phe Asp Leu Phe Phe Lys Gly His Pro Ser Ala Thr Tyr Asn
290 295 300 290 295 300
Lys Gln Ile Ile Asp Ala His Asn Met Ile Glu Ile Tyr Asn Lys Ile Lys Gln Ile Ile Asp Ala His Asn Met Ile Glu Ile Tyr Asn Lys Ile
305 310 315 320 305 310 315 320
Pro Phe Glu Ala Leu Ile Met Thr Asp Ala Leu Pro Asp Ala Val Gly Pro Phe Glu Ala Leu Ile Met Thr Asp Ala Leu Pro Asp Ala Val Gly
325 330 335 325 330 335
Gly Met Gly Ser Ser Val Phe Phe Ser Leu Pro Asn Thr Val Glu Asn Gly Met Gly Ser Ser Val Phe Phe Ser Leu Pro Asn Thr Val Glu Asn
340 345 350 340 345 350
Lys Phe Ile Phe Tyr Lys Ser Asp Thr Asp Ile Glu Asn Asn Ala Leu Lys Phe Ile Phe Tyr Lys Ser Asp Thr Asp Ile Glu Asn Asn Ala Leu
355 360 365 355 360 365
Ile Gln Val Met Ile Glu Leu Asn Ile Val Asn Arg Asn Asp Val Lys Ile Gln Val Met Ile Glu Leu Asn Ile Val Asn Arg Asn Asp Val Lys
370 375 380 370 375 380
Leu Ile Ser Asp Leu Gln Leu Ile Ser Asp Leu Gln
385 390 385 390
<210> 37<210> 37
<211> 388<211> 388
<212> Белок<212> Protein
<213> Pasteurella multocida<213> Pasteurella multocida
<400> 37<400> 37
Met Lys Thr Ile Thr Leu Tyr Leu Asp Pro Ala Ser Leu Pro Ala Leu Met Lys Thr Ile Thr Leu Tyr Leu Asp Pro Ala Ser Leu Pro Ala Leu
1 5 10 15 1 5 10 15
Asn Gln Leu Met Asp Phe Thr Gln Asn Asn Glu Asp Lys Thr His Pro Asn Gln Leu Met Asp Phe Thr Gln Asn Asn Glu Asp Lys Thr His Pro
20 25 30 20 25 30
Arg Ile Phe Gly Leu Ser Arg Phe Lys Ile Pro Asp Asn Ile Ile Thr Arg Ile Phe Gly Leu Ser Arg Phe Lys Ile Pro Asp Asn Ile Ile Thr
35 40 45 35 40 45
Gln Tyr Gln Asn Ile His Phe Val Glu Leu Lys Asp Asn Arg Pro Thr Gln Tyr Gln Asn Ile His Phe Val Glu Leu Lys Asp Asn Arg Pro Thr
50 55 60 50 55 60
Glu Ala Leu Phe Thr Ile Leu Asp Gln Tyr Pro Gly Asn Ile Glu Leu Glu Ala Leu Phe Thr Ile Leu Asp Gln Tyr Pro Gly Asn Ile Glu Leu
65 70 75 80 65 70 75 80
Asp Ile His Leu Asn Ile Ala His Ser Val Gln Leu Ile Arg Pro Ile Asp Ile His Leu Asn Ile Ala His Ser Val Gln Leu Ile Arg Pro Ile
85 90 95 85 90 95
Leu Ala Tyr Arg Phe Lys His Leu Asp Arg Val Ser Ile Gln Arg Leu Leu Ala Tyr Arg Phe Lys His Leu Asp Arg Val Ser Ile Gln Arg Leu
100 105 110 100 105 110
Asn Leu Tyr Asp Asp Gly Ser Met Glu Tyr Val Asp Leu Glu Lys Glu Asn Leu Tyr Asp Asp Gly Ser Met Glu Tyr Val Asp Leu Glu Lys Glu
115 120 125 115 120 125
Glu Asn Lys Asp Ile Ser Ala Glu Ile Lys Gln Ala Glu Lys Gln Leu Glu Asn Lys Asp Ile Ser Ala Glu Ile Lys Gln Ala Glu Lys Gln Leu
130 135 140 130 135 140
Ser His Tyr Leu Leu Thr Gly Lys Ile Lys Phe Asp Asn Pro Thr Ile Ser His Tyr Leu Leu Thr Gly Lys Ile Lys Phe Asp Asn Pro Thr Ile
145 150 155 160 145 150 155 160
Ala Arg Tyr Val Trp Gln Ser Ala Phe Pro Val Lys Tyr His Phe Leu Ala Arg Tyr Val Trp Gln Ser Ala Phe Pro Val Lys Tyr His Phe Leu
165 170 175 165 170 175
Ser Thr Asp Tyr Phe Glu Lys Ala Glu Phe Leu Gln Pro Leu Lys Glu Ser Thr Asp Tyr Phe Glu Lys Ala Glu Phe Leu Gln Pro Leu Lys Glu
180 185 190 180 185 190
Tyr Leu Ala Glu Asn Tyr Gln Lys Met Asp Trp Thr Ala Tyr Gln Gln Tyr Leu Ala Glu Asn Tyr Gln Lys Met Asp Trp Thr Ala Tyr Gln Gln
195 200 205 195 200 205
Leu Thr Pro Glu Gln Gln Ala Phe Tyr Leu Thr Leu Val Gly Phe Asn Leu Thr Pro Glu Gln Gln Ala Phe Tyr Leu Thr Leu Val Gly Phe Asn
210 215 220 210 215 220
Asp Glu Val Lys Gln Ser Leu Glu Val Gln Gln Ala Lys Phe Ile Phe Asp Glu Val Lys Gln Ser Leu Glu Val Gln Gln Ala Lys Phe Ile Phe
225 230 235 240 225 230 235 240
Thr Gly Thr Thr Thr Trp Glu Gly Asn Thr Asp Val Arg Glu Tyr Tyr Thr Gly Thr Thr Thr Trp Glu Gly Asn Thr Asp Val Arg Glu Tyr Tyr
245 250 255 245 250 255
Ala Gln Gln Gln Leu Asn Leu Leu Asn His Phe Thr Gln Ala Gly Gly Ala Gln Gln Gln Leu Asn Leu Leu Asn His Phe Thr Gln Ala Gly Gly
260 265 270 260 265 270
Asp Leu Phe Ile Gly Asp His Tyr Lys Ile Tyr Phe Lys Gly His Pro Asp Leu Phe Ile Gly Asp His Tyr Lys Ile Tyr Phe Lys Gly His Pro
275 280 285 275 280 285
Arg Gly Gly Glu Ile Asn Asp Tyr Ile Leu Asn Asn Ala Lys Asn Ile Arg Gly Gly Glu Ile Asn Asp Tyr Ile Leu Asn Asn Ala Lys Asn Ile
290 295 300 290 295 300
Thr Asn Ile Pro Ala Asn Ile Ser Phe Glu Val Leu Met Met Thr Gly Thr Asn Ile Pro Ala Asn Ile Ser Phe Glu Val Leu Met Met Thr Gly
305 310 315 320 305 310 315 320
Leu Leu Pro Asp Lys Val Gly Gly Val Ala Ser Ser Leu Tyr Phe Ser Leu Leu Pro Asp Lys Val Gly Gly Val Ala Ser Ser Leu Tyr Phe Ser
325 330 335 325 330 335
Leu Pro Lys Glu Lys Ile Ser His Ile Ile Phe Thr Ser Asn Lys Gln Leu Pro Lys Glu Lys Ile Ser His Ile Ile Phe Thr Ser Asn Lys Gln
340 345 350 340 345 350
Val Lys Ser Lys Glu Asp Ala Leu Asn Asn Pro Tyr Val Lys Val Met Val Lys Ser Lys Glu Asp Ala Leu Asn Asn Pro Tyr Val Lys Val Met
355 360 365 355 360 365
Arg Arg Leu Gly Ile Ile Asp Glu Ser Gln Val Ile Phe Trp Asp Ser Arg Arg Leu Gly Ile Ile Asp Glu Ser Gln Val Ile Phe Trp Asp Ser
370 375 380 370 375 380
Leu Lys Gln Leu Leu Lys Gln Leu
385 385
<210> 38<210> 38
<211> 371<211> 371
<212> Белок<212> Protein
<213> Neisseria meningitidis<213> Neisseria meningitidis
<400> 38<400> 38
Met Gly Leu Lys Lys Ala Cys Leu Thr Val Leu Cys Leu Ile Val Phe Met Gly Leu Lys Lys Ala Cys Leu Thr Val Leu Cys Leu Ile Val Phe
1 5 10 15 1 5 10 15
Cys Phe Gly Ile Phe Tyr Thr Phe Asp Arg Val Asn Gln Gly Glu Arg Cys Phe Gly Ile Phe Tyr Thr Phe Asp Arg Val Asn Gln Gly Glu Arg
20 25 30 20 25 30
Asn Ala Val Ser Leu Leu Lys Glu Lys Leu Phe Asn Glu Glu Gly Glu Asn Ala Val Ser Leu Leu Lys Glu Lys Leu Phe Asn Glu Glu Gly Glu
35 40 45 35 40 45
Pro Val Asn Leu Ile Phe Cys Tyr Thr Ile Leu Gln Met Lys Val Ala Pro Val Asn Leu Ile Phe Cys Tyr Thr Ile Leu Gln Met Lys Val Ala
50 55 60 50 55 60
Glu Arg Ile Met Ala Gln His Pro Gly Glu Arg Phe Tyr Val Val Leu Glu Arg Ile Met Ala Gln His Pro Gly Glu Arg Phe Tyr Val Val Leu
65 70 75 80 65 70 75 80
Met Ser Glu Asn Arg Asn Glu Lys Tyr Asp Tyr Tyr Phe Asn Gln Ile Met Ser Glu Asn Arg Asn Glu Lys Tyr Asp Tyr Tyr Phe Asn Gln Ile
85 90 95 85 90 95
Lys Asp Lys Ala Glu Arg Ala Tyr Phe Phe His Leu Pro Tyr Gly Leu Lys Asp Lys Ala Glu Arg Ala Tyr Phe Phe His Leu Pro Tyr Gly Leu
100 105 110 100 105 110
Asn Lys Ser Phe Asn Phe Ile Pro Thr Met Ala Glu Leu Lys Val Lys Asn Lys Ser Phe Asn Phe Ile Pro Thr Met Ala Glu Leu Lys Val Lys
115 120 125 115 120 125
Ser Met Leu Leu Pro Lys Val Lys Arg Ile Tyr Leu Ala Ser Leu Glu Ser Met Leu Leu Pro Lys Val Lys Arg Ile Tyr Leu Ala Ser Leu Glu
130 135 140 130 135 140
Lys Val Ser Ile Ala Ala Phe Leu Ser Thr Tyr Pro Asp Ala Glu Ile Lys Val Ser Ile Ala Ala Phe Leu Ser Thr Tyr Pro Asp Ala Glu Ile
145 150 155 160 145 150 155 160
Lys Thr Phe Asp Asp Gly Thr Gly Asn Leu Ile Gln Ser Ser Ser Tyr Lys Thr Phe Asp Asp Gly Thr Gly Asn Leu Ile Gln Ser Ser Ser Tyr
165 170 175 165 170 175
Leu Gly Asp Glu Phe Ser Val Asn Gly Thr Ile Lys Arg Asn Phe Ala Leu Gly Asp Glu Phe Ser Val Asn Gly Thr Ile Lys Arg Asn Phe Ala
180 185 190 180 185 190
Arg Met Met Ile Gly Asp Trp Ser Ile Ala Lys Thr Arg Asn Ala Ser Arg Met Met Ile Gly Asp Trp Ser Ile Ala Lys Thr Arg Asn Ala Ser
195 200 205 195 200 205
Asp Glu His Tyr Thr Ile Phe Lys Gly Leu Lys Asn Ile Met Asp Asp Asp Glu His Tyr Thr Ile Phe Lys Gly Leu Lys Asn Ile Met Asp Asp
210 215 220 210 215 220
Gly Arg Arg Lys Met Thr Tyr Leu Pro Leu Phe Asp Ala Ser Glu Leu Gly Arg Arg Lys Met Thr Tyr Leu Pro Leu Phe Asp Ala Ser Glu Leu
225 230 235 240 225 230 235 240
Lys Thr Gly Asp Glu Thr Gly Gly Thr Val Arg Ile Leu Leu Gly Ser Lys Thr Gly Asp Glu Thr Gly Gly Thr Val Arg Ile Leu Leu Gly Ser
245 250 255 245 250 255
Pro Asp Lys Glu Met Lys Glu Ile Ser Glu Lys Ala Ala Lys Asn Phe Pro Asp Lys Glu Met Lys Glu Ile Ser Glu Lys Ala Ala Lys Asn Phe
260 265 270 260 265 270
Lys Ile Gln Tyr Val Ala Pro His Pro Arg Gln Thr Tyr Gly Leu Ser Lys Ile Gln Tyr Val Ala Pro His Pro Arg Gln Thr Tyr Gly Leu Ser
275 280 285 275 280 285
Gly Val Thr Thr Leu Asn Ser Pro Tyr Val Ile Glu Asp Tyr Ile Leu Gly Val Thr Thr Leu Asn Ser Pro Tyr Val Ile Glu Asp Tyr Ile Leu
290 295 300 290 295 300
Arg Glu Ile Lys Lys Asn Pro His Thr Arg Tyr Glu Ile Tyr Thr Phe Arg Glu Ile Lys Lys Asn Pro His Thr Arg Tyr Glu Ile Tyr Thr Phe
305 310 315 320 305 310 315 320
Phe Ser Gly Ala Ala Leu Thr Met Lys Asp Phe Pro Asn Val His Val Phe Ser Gly Ala Ala Leu Thr Met Lys Asp Phe Pro Asn Val His Val
325 330 335 325 330 335
Tyr Ala Leu Lys Pro Ala Ser Leu Pro Glu Asp Tyr Trp Leu Lys Pro Tyr Ala Leu Lys Pro Ala Ser Leu Pro Glu Asp Tyr Trp Leu Lys Pro
340 345 350 340 345 350
Val Tyr Ala Leu Phe Thr Gln Ser Gly Ile Pro Ile Leu Thr Phe Asp Val Tyr Ala Leu Phe Thr Gln Ser Gly Ile Pro Ile Leu Thr Phe Asp
355 360 365 355 360 365
Asp Lys Asn Asp Lys Asn
370 370
<210> 39<210> 39
<211> 283<211> 283
<212> Белок<212> Protein
<213> Pasteurella multocida<213> Pasteurella multocida
<400> 39<400> 39
Met Asp Lys Phe Ala Glu His Glu Ile Pro Lys Ala Val Ile Val Ala Met Asp Lys Phe Ala Glu His Glu Ile Pro Lys Ala Val Ile Val Ala
1 5 10 15 1 5 10 15
Gly Asn Gly Glu Ser Leu Ser Gln Ile Asp Tyr Arg Leu Leu Pro Lys Gly Asn Gly Glu Ser Leu Ser Gln Ile Asp Tyr Arg Leu Leu Pro Lys
20 25 30 20 25 30
Asn Tyr Asp Val Phe Arg Cys Asn Gln Phe Tyr Phe Glu Glu Arg Tyr Asn Tyr Asp Val Phe Arg Cys Asn Gln Phe Tyr Phe Glu Glu Arg Tyr
35 40 45 35 40 45
Phe Leu Gly Asn Lys Ile Lys Ala Val Phe Phe Thr Pro Gly Val Phe Phe Leu Gly Asn Lys Ile Lys Ala Val Phe Phe Thr Pro Gly Val Phe
50 55 60 50 55 60
Leu Glu Gln Tyr Tyr Thr Leu Tyr His Leu Lys Arg Asn Asn Glu Tyr Leu Glu Gln Tyr Tyr Thr Leu Tyr His Leu Lys Arg Asn Asn Glu Tyr
65 70 75 80 65 70 75 80
Phe Val Asp Asn Val Ile Leu Ser Ser Phe Asn His Pro Thr Val Asp Phe Val Asp Asn Val Ile Leu Ser Ser Phe Asn His Pro Thr Val Asp
85 90 95 85 90 95
Leu Glu Lys Ser Gln Lys Ile Gln Ala Leu Phe Ile Asp Val Ile Asn Leu Glu Lys Ser Gln Lys Ile Gln Ala Leu Phe Ile Asp Val Ile Asn
100 105 110 100 105 110
Gly Tyr Glu Lys Tyr Leu Ser Lys Leu Thr Ala Phe Asp Val Tyr Leu Gly Tyr Glu Lys Tyr Leu Ser Lys Leu Thr Ala Phe Asp Val Tyr Leu
115 120 125 115 120 125
Arg Tyr Lys Glu Leu Tyr Glu Asn Gln Arg Ile Thr Ser Gly Val Tyr Arg Tyr Lys Glu Leu Tyr Glu Asn Gln Arg Ile Thr Ser Gly Val Tyr
130 135 140 130 135 140
Met Cys Ala Val Ala Ile Ala Met Gly Tyr Thr Asp Ile Tyr Leu Thr Met Cys Ala Val Ala Ile Ala Met Gly Tyr Thr Asp Ile Tyr Leu Thr
145 150 155 160 145 150 155 160
Gly Ile Asp Phe Tyr Gln Ala Ser Glu Glu Asn Tyr Ala Phe Asp Asn Gly Ile Asp Phe Tyr Gln Ala Ser Glu Glu Asn Tyr Ala Phe Asp Asn
165 170 175 165 170 175
Lys Lys Pro Asn Ile Ile Arg Leu Leu Pro Asp Phe Arg Lys Glu Lys Lys Lys Pro Asn Ile Ile Arg Leu Leu Pro Asp Phe Arg Lys Glu Lys
180 185 190 180 185 190
Thr Leu Phe Ser Tyr His Ser Lys Asp Ile Asp Leu Glu Ala Leu Ser Thr Leu Phe Ser Tyr His Ser Lys Asp Ile Asp Leu Glu Ala Leu Ser
195 200 205 195 200 205
Phe Leu Gln Gln His Tyr His Val Asn Phe Tyr Ser Ile Ser Pro Met Phe Leu Gln Gln His Tyr His Val Asn Phe Tyr Ser Ile Ser Pro Met
210 215 220 210 215 220
Ser Pro Leu Ser Lys His Phe Pro Ile Pro Thr Val Glu Asp Asp Cys Ser Pro Leu Ser Lys His Phe Pro Ile Pro Thr Val Glu Asp Asp Cys
225 230 235 240 225 230 235 240
Glu Thr Thr Phe Val Ala Pro Leu Lys Glu Asn Tyr Ile Asn Asp Ile Glu Thr Thr Phe Val Ala Pro Leu Lys Glu Asn Tyr Ile Asn Asp Ile
245 250 255 245 250 255
Leu Leu Pro Pro His Phe Val Tyr Glu Lys Leu Gly Val Asp Lys Leu Leu Leu Pro Pro His Phe Val Tyr Glu Lys Leu Gly Val Asp Lys Leu
260 265 270 260 265 270
Ala Ala Ala Leu Glu His His His His His His Ala Ala Ala Leu Glu His His His His His
275 280 275 280
<210> 40<210> 40
<211> 385<211> 385
<212> Белок<212> Protein
<213> Pasteurella dagmatis<213> Pasteurella dagmatis
<400> 40<400> 40
Met Thr Ile Tyr Leu Asp Pro Ala Ser Leu Pro Thr Leu Asn Gln Leu Met Thr Ile Tyr Leu Asp Pro Ala Ser Leu Pro Thr Leu Asn Gln Leu
1 5 10 15 1 5 10 15
Met His Phe Thr Lys Glu Ser Glu Asp Lys Glu Thr Ala Arg Ile Phe Met His Phe Thr Lys Glu Ser Glu Asp Lys Glu Thr Ala Arg Ile Phe
20 25 30 20 25 30
Gly Phe Ser Arg Phe Lys Leu Pro Glu Lys Ile Thr Glu Gln Tyr Asn Gly Phe Ser Arg Phe Lys Leu Pro Glu Lys Ile Thr Glu Gln Tyr Asn
35 40 45 35 40 45
Asn Ile His Phe Val Glu Ile Lys Asn Asn Arg Pro Thr Glu Asp Ile Asn Ile His Phe Val Glu Ile Lys Asn Asn Arg Pro Thr Glu Asp Ile
50 55 60 50 55 60
Phe Thr Ile Leu Asp Gln Tyr Pro Glu Lys Leu Glu Leu Asp Leu His Phe Thr Ile Leu Asp Gln Tyr Pro Glu Lys Leu Glu Leu Asp Leu His
65 70 75 80 65 70 75 80
Leu Asn Ile Ala His Ser Ile Gln Leu Phe His Pro Ile Leu Gln Tyr Leu Asn Ile Ala His Ser Ile Gln Leu Phe His Pro Ile Leu Gln Tyr
85 90 95 85 90 95
Arg Phe Lys His Pro Asp Arg Ile Ser Ile Lys Ser Leu Asn Leu Tyr Arg Phe Lys His Pro Asp Arg Ile Ser Ile Lys Ser Leu Asn Leu Tyr
100 105 110 100 105 110
Asp Asp Gly Thr Met Glu Tyr Val Asp Leu Glu Lys Glu Glu Asn Lys Asp Asp Gly Thr Met Glu Tyr Val Asp Leu Glu Lys Glu Glu Asn Lys
115 120 125 115 120 125
Asp Ile Lys Ser Ala Ile Lys Lys Ala Glu Lys Gln Leu Ser Asp Tyr Asp Ile Lys Ser Ala Ile Lys Lys Ala Glu Lys Gln Leu Ser Asp Tyr
130 135 140 130 135 140
Leu Leu Thr Gly Lys Ile Asn Phe Asp Asn Pro Thr Leu Ala Arg Tyr Leu Leu Thr Gly Lys Ile Asn Phe Asp Asn Pro Thr Leu Ala Arg Tyr
145 150 155 160 145 150 155 160
Val Trp Gln Ser Gln Tyr Pro Val Lys Tyr His Phe Leu Ser Thr Glu Val Trp Gln Ser Gln Tyr Pro Val Lys Tyr His Phe Leu Ser Thr Glu
165 170 175 165 170 175
Tyr Phe Glu Lys Ala Glu Phe Leu Gln Pro Leu Lys Thr Tyr Leu Ala Tyr Phe Glu Lys Ala Glu Phe Leu Gln Pro Leu Lys Thr Tyr Leu Ala
180 185 190 180 185 190
Gly Lys Tyr Gln Lys Met Asp Trp Ser Ala Tyr Glu Lys Leu Ser Pro Gly Lys Tyr Gln Lys Met Asp Trp Ser Ala Tyr Glu Lys Leu Ser Pro
195 200 205 195 200 205
Glu Gln Gln Thr Phe Tyr Leu Lys Leu Val Gly Phe Ser Asp Glu Thr Glu Gln Gln Thr Phe Tyr Leu Lys Leu Val Gly Phe Ser Asp Glu Thr
210 215 220 210 215 220
Lys Gln Leu Phe His Thr Glu Gln Thr Lys Phe Ile Phe Thr Gly Thr Lys Gln Leu Phe His Thr Glu Gln Thr Lys Phe Ile Phe Thr Gly Thr
225 230 235 240 225 230 235 240
Thr Thr Trp Glu Gly Asn Thr Asp Ile Arg Glu Tyr Tyr Ala Lys Gln Thr Thr Trp Glu Gly Asn Thr Asp Ile Arg Glu Tyr Tyr Ala Lys Gln
245 250 255 245 250 255
Gln Leu Asn Leu Leu Lys His Phe Thr His Ser Glu Gly Asp Leu Phe Gln Leu Asn Leu Leu Lys His Phe Thr His Ser Glu Gly Asp Leu Phe
260 265 270 260 265 270
Ile Gly Asp Gln Tyr Lys Ile Tyr Phe Lys Gly His Pro Arg Gly Gly Ile Gly Asp Gln Tyr Lys Ile Tyr Phe Lys Gly His Pro Arg Gly Gly
275 280 285 275 280 285
Asp Ile Asn Asp Tyr Ile Leu Lys His Ala Lys Asp Ile Thr Asn Ile Asp Ile Asn Asp Tyr Ile Leu Lys His Ala Lys Asp Ile Thr Asn Ile
290 295 300 290 295 300
Pro Ala Asn Ile Ser Phe Glu Ile Leu Met Met Thr Gly Leu Leu Pro Pro Ala Asn Ile Ser Phe Glu Ile Leu Met Met Thr Gly Leu Leu Pro
305 310 315 320 305 310 315 320
Asp Lys Val Gly Gly Val Ala Ser Ser Leu Tyr Phe Ser Leu Pro Lys Asp Lys Val Gly Gly Val Ala Ser Ser Leu Tyr Phe Ser Leu Pro Lys
325 330 335 325 330 335
Glu Lys Ile Ser His Ile Ile Phe Thr Ser Asn Lys Lys Ile Lys Asn Glu Lys Ile Ser His Ile Ile Phe Thr Ser Asn Lys Lys Ile Lys Asn
340 345 350 340 345 350
Lys Glu Asp Ala Leu Asn Asp Pro Tyr Val Arg Val Met Leu Arg Leu Lys Glu Asp Ala Leu Asn Asp Pro Tyr Val Arg Val Met Leu Arg Leu
355 360 365 355 360 365
Gly Met Ile Asp Lys Ser Gln Ile Ile Phe Trp Asp Ser Leu Lys Gln Gly Met Ile Asp Lys Ser Gln Ile Ile Phe Trp Asp Ser Leu Lys Gln
370 375 380 370 375 380
Leu Leu
385 385
<210> 41<210> 41
<211> 390<211> 390
<212> Белок<212> Protein
<213> Photobacterium phosphoreum<213> Photobacterium phosphoreum
<400> 41<400> 41
Met Gly Cys Asn Ser Asp Ser Lys His Asn Asn Ser Asp Gly Asn Ile Met Gly Cys Asn Ser Asp Ser Lys His Asn Asn Ser Asp Gly Asn Ile
1 5 10 15 1 5 10 15
Thr Lys Asn Lys Thr Ile Glu Val Tyr Val Asp Arg Ala Thr Leu Pro Thr Lys Asn Lys Thr Ile Glu Val Tyr Val Asp Arg Ala Thr Leu Pro
20 25 30 20 25 30
Thr Ile Gln Gln Met Thr Gln Ile Ile Asn Glu Asn Ser Asn Asn Lys Thr Ile Gln Gln Met Thr Gln Ile Ile Asn Glu Asn Ser Asn Asn Lys
35 40 45 35 40 45
Lys Leu Ile Ser Trp Ser Arg Tyr Pro Ile Asn Asp Glu Thr Leu Leu Lys Leu Ile Ser Trp Ser Arg Tyr Pro Ile Asn Asp Glu Thr Leu Leu
50 55 60 50 55 60
Glu Ser Ile Asn Gly Ser Phe Phe Lys Asn Arg Pro Glu Leu Ile Lys Glu Ser Ile Asn Gly Ser Phe Phe Lys Asn Arg Pro Glu Leu Ile Lys
65 70 75 80 65 70 75 80
Ser Leu Asp Ser Met Ile Leu Thr Asn Glu Ile Lys Lys Val Ile Ile Ser Leu Asp Ser Met Ile Leu Thr Asn Glu Ile Lys Lys Val Ile Ile
85 90 95 85 90 95
Asn Gly Asn Thr Leu Trp Ala Val Asp Val Val Asn Ile Ile Lys Ser Asn Gly Asn Thr Leu Trp Ala Val Asp Val Val Asn Ile Ile Lys Ser
100 105 110 100 105 110
Ile Glu Ala Leu Gly Lys Lys Thr Glu Ile Glu Leu Asn Phe Tyr Asp Ile Glu Ala Leu Gly Lys Lys Thr Glu Ile Glu Leu Asn Phe Tyr Asp
115 120 125 115 120 125
Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr Asp Phe Ser Arg Leu Pro Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr Asp Phe Ser Arg Leu Pro
130 135 140 130 135 140
Glu Ser Glu Gln Glu Tyr Lys Ile Ser Leu Ser Lys Asp Asn Ile Gln Glu Ser Glu Gln Glu Tyr Lys Ile Ser Leu Ser Lys Asp Asn Ile Gln
145 150 155 160 145 150 155 160
Ser Ser Ile Asn Gly Thr Gln Pro Phe Asp Asn Ser Ile Glu Asn Ile Ser Ser Ile Asn Gly Thr Gln Pro Phe Asp Asn Ser Ile Glu Asn Ile
165 170 175 165 170 175
Tyr Gly Phe Ser Gln Leu Tyr Pro Thr Thr Tyr His Met Leu Arg Ala Tyr Gly Phe Ser Gln Leu Tyr Pro Thr Thr Tyr His Met Leu Arg Ala
180 185 190 180 185 190
Asp Ile Phe Glu Thr Asn Leu Pro Leu Thr Ser Leu Lys Arg Val Ile Asp Ile Phe Glu Thr Asn Leu Pro Leu Thr Ser Leu Lys Arg Val Ile
195 200 205 195 200 205
Ser Asn Asn Ile Lys Gln Met Lys Trp Asp Tyr Phe Thr Thr Phe Asn Ser Asn Asn Ile Lys Gln Met Lys Trp Asp Tyr Phe Thr Thr Phe Asn
210 215 220 210 215 220
Ser Gln Gln Lys Asn Lys Phe Tyr Asn Phe Thr Gly Phe Asn Pro Glu Ser Gln Gln Lys Asn Lys Phe Tyr Asn Phe Thr Gly Phe Asn Pro Glu
225 230 235 240 225 230 235 240
Lys Ile Lys Glu Gln Tyr Lys Ala Ser Pro His Glu Asn Phe Ile Phe Lys Ile Lys Glu Gln Tyr Lys Ala Ser Pro His Glu Asn Phe Ile Phe
245 250 255 245 250 255
Ile Gly Thr Asn Ser Gly Thr Ala Thr Ala Glu Gln Gln Ile Asp Ile Ile Gly Thr Asn Ser Gly Thr Ala Thr Ala Glu Gln Gln Ile Asp Ile
260 265 270 260 265 270
Leu Thr Glu Ala Lys Lys Pro Asp Ser Pro Ile Ile Thr Asn Ser Ile Leu Thr Glu Ala Lys Lys Pro Asp Ser Pro Ile Ile Thr Asn Ser Ile
275 280 285 275 280 285
Gln Gly Leu Asp Leu Phe Phe Lys Gly His Pro Ser Ala Thr Tyr Asn Gln Gly Leu Asp Leu Phe Phe Lys Gly His Pro Ser Ala Thr Tyr Asn
290 295 300 290 295 300
Gln Gln Ile Ile Asp Ala His Asn Met Ile Glu Ile Tyr Asn Lys Ile Gln Gln Ile Ile Asp Ala His Asn Met Ile Glu Ile Tyr Asn Lys Ile
305 310 315 320 305 310 315 320
Pro Phe Glu Ala Leu Ile Met Thr Asp Ala Leu Pro Asp Ala Val Gly Pro Phe Glu Ala Leu Ile Met Thr Asp Ala Leu Pro Asp Ala Val Gly
325 330 335 325 330 335
Gly Met Gly Ser Ser Val Phe Phe Ser Leu Pro Asn Thr Val Glu Asn Gly Met Gly Ser Ser Val Phe Phe Ser Leu Pro Asn Thr Val Glu Asn
340 345 350 340 345 350
Lys Phe Ile Phe Tyr Lys Ser Asp Thr Asp Ile Glu Asn Asn Ala Leu Lys Phe Ile Phe Tyr Lys Ser Asp Thr Asp Ile Glu Asn Asn Ala Leu
355 360 365 355 360 365
Ile Gln Val Met Ile Glu Leu Asn Ile Val Asn Arg Asn Asp Val Lys Ile Gln Val Met Ile Glu Leu Asn Ile Val Asn Arg Asn Asp Val Lys
370 375 380 370 375 380
Leu Ile Ser Asp Leu Gln Leu Ile Ser Asp Leu Gln
385 390 385 390
<210> 42<210> 42
<211> 417<211> 417
<212> Белок<212> Protein
<213> Avibacterium paragallinarum<213> Avibacterium paragallinarum
<400> 42<400> 42
Met Arg Lys Ile Ile Thr Phe Phe Ser Leu Phe Phe Ser Ile Ser Ala Met Arg Lys Ile Ile Thr Phe Phe Ser Leu Phe Phe Ser Ile Ser Ala
1 5 10 15 1 5 10 15
Trp Cys Gln Lys Met Glu Ile Tyr Leu Asp Tyr Ala Ser Leu Pro Ser Trp Cys Gln Lys Met Glu Ile Tyr Leu Asp Tyr Ala Ser Leu Pro Ser
20 25 30 20 25 30
Leu Asn Met Ile Leu Asn Leu Val Glu Asn Lys Asn Asn Glu Lys Val Leu Asn Met Ile Leu Asn Leu Val Glu Asn Lys Asn Asn Glu Lys Val
35 40 45 35 40 45
Glu Arg Ile Ile Gly Phe Glu Arg Phe Asp Phe Asn Lys Glu Ile Leu Glu Arg Ile Ile Gly Phe Glu Arg Phe Asp Phe Asn Lys Glu Ile Leu
50 55 60 50 55 60
Asn Ser Phe Ser Lys Glu Arg Ile Glu Phe Ser Lys Val Ser Ile Leu Asn Ser Phe Ser Lys Glu Arg Ile Glu Phe Ser Lys Val Ser Ile Leu
65 70 75 80 65 70 75 80
Asp Ile Lys Glu Phe Ser Asp Lys Leu Tyr Leu Asn Ile Glu Lys Ser Asp Ile Lys Glu Phe Ser Asp Lys Leu Tyr Leu Asn Ile Glu Lys Ser
85 90 95 85 90 95
Asp Thr Pro Val Asp Leu Ile Ile His Thr Asn Leu Asp His Ser Val Asp Thr Pro Val Asp Leu Ile Ile His Thr Asn Leu Asp His Ser Val
100 105 110 100 105 110
Arg Ser Leu Leu Ser Ile Phe Lys Thr Leu Ser Pro Leu Phe His Lys Arg Ser Leu Leu Ser Ile Phe Lys Thr Leu Ser Pro Leu Phe His Lys
115 120 125 115 120 125
Ile Asn Ile Glu Lys Leu Tyr Leu Tyr Asp Asp Gly Ser Gly Asn Tyr Ile Asn Ile Glu Lys Leu Tyr Leu Tyr Asp Asp Gly Ser Gly Asn Tyr
130 135 140 130 135 140
Val Asp Leu Tyr Gln His Arg Gln Glu Asn Ile Ser Ala Ile Leu Ile Val Asp Leu Tyr Gln His Arg Gln Glu Asn Ile Ser Ala Ile Leu Ile
145 150 155 160 145 150 155 160
Glu Ala Gln Lys Lys Leu Lys Asp Ala Leu Glu Asn Arg Glu Thr Asp Glu Ala Gln Lys Lys Leu Lys Asp Ala Leu Glu Asn Arg Glu Thr Asp
165 170 175 165 170 175
Thr Asp Lys Leu His Ser Leu Thr Arg Tyr Thr Trp His Lys Ile Phe Thr Asp Lys Leu His Ser Leu Thr Arg Tyr Thr Trp His Lys Ile Phe
180 185 190 180 185 190
Pro Thr Glu Tyr Ile Leu Leu Arg Pro Asp Tyr Leu Asp Ile Asp Glu Pro Thr Glu Tyr Ile Leu Leu Arg Pro Asp Tyr Leu Asp Ile Asp Glu
195 200 205 195 200 205
Lys Met Gln Pro Leu Lys His Phe Leu Ser Asp Thr Ile Val Ser Met Lys Met Gln Pro Leu Lys His Phe Leu Ser Asp Thr Ile Val Ser Met
210 215 220 210 215 220
Asp Leu Ser Arg Phe Ser His Phe Ser Lys Asn Gln Lys Glu Leu Phe Asp Leu Ser Arg Phe Ser His Phe Ser Lys Asn Gln Lys Glu Leu Phe
225 230 235 240 225 230 235 240
Leu Lys Ile Thr His Phe Asp Gln Asn Ile Phe Asn Glu Leu Asn Ile Leu Lys Ile Thr His Phe Asp Gln Asn Ile Phe Asn Glu Leu Asn Ile
245 250 255 245 250 255
Gly Thr Lys Asn Lys Glu Tyr Lys Thr Phe Ile Phe Thr Gly Thr Thr Gly Thr Lys Asn Lys Glu Tyr Lys Thr Phe Ile Phe Thr Gly Thr Thr
260 265 270 260 265 270
Thr Trp Glu Lys Asp Lys Lys Lys Arg Leu Asn Asn Ala Lys Leu Gln Thr Trp Glu Lys Asp Lys Lys Lys Arg Leu Asn Asn Ala Lys Leu Gln
275 280 285 275 280 285
Thr Glu Ile Leu Glu Ser Phe Ile Lys Pro Asn Gly Lys Phe Tyr Leu Thr Glu Ile Leu Glu Ser Phe Ile Lys Pro Asn Gly Lys Phe Tyr Leu
290 295 300 290 295 300
Gly Asn Asp Ile Lys Ile Phe Phe Lys Gly His Pro Lys Gly Asp Asp Gly Asn Asp Ile Lys Ile Phe Phe Lys Gly His Pro Lys Gly Asp Asp
305 310 315 320 305 310 315 320
Ile Asn Asp Tyr Ile Ile Arg Lys Thr Gly Ala Glu Lys Ile Pro Ala Ile Asn Asp Tyr Ile Ile Arg Lys Thr Gly Ala Glu Lys Ile Pro Ala
325 330 335 325 330 335
Asn Ile Pro Phe Glu Val Leu Met Met Thr Asn Ser Leu Pro Asp Tyr Asn Ile Pro Phe Glu Val Leu Met Met Thr Asn Ser Leu Pro Asp Tyr
340 345 350 340 345 350
Val Gly Gly Ile Met Ser Thr Val Tyr Phe Ser Leu Pro Pro Lys Asn Val Gly Gly Ile Met Ser Thr Val Tyr Phe Ser Leu Pro Pro Lys Asn
355 360 365 355 360 365
Ile Asp Lys Val Val Phe Leu Gly Ser Glu Lys Ile Lys Asn Glu Asn Ile Asp Lys Val Val Phe Leu Gly Ser Glu Lys Ile Lys Asn Glu Asn
370 375 380 370 375 380
Asp Ala Lys Ser Gln Thr Leu Ser Lys Leu Met Leu Met Leu Asn Val Asp Ala Lys Ser Gln Thr Leu Ser Lys Leu Met Leu Met Leu Asn Val
385 390 395 400 385 390 395 400
Ile Thr Pro Glu Gln Ile Phe Phe Glu Glu Met Pro Asn Pro Ile Asn Ile Thr Pro Glu Gln Ile Phe Phe Glu Glu Met Pro Asn Pro Ile Asn
405 410 415 405 410 415
Phe Phe
<210> 43<210> 43
<211> 430<211> 430
<212> Белок<212> Protein
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 43<400> 43
Met Thr Arg Thr Arg Met Glu Asn Glu Leu Ile Val Ser Lys Asn Met Met Thr Arg Thr Arg Met Glu Asn Glu Leu Ile Val Ser Lys Asn Met
1 5 10 15 1 5 10 15
Gln Asn Ile Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Asn Ile Asn Gln Asn Ile Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Asn Ile Asn
20 25 30 20 25 30
Tyr Lys Arg Leu Pro Arg Glu Tyr Asp Val Phe Arg Cys Asn Gln Phe Tyr Lys Arg Leu Pro Arg Glu Tyr Asp Val Phe Arg Cys Asn Gln Phe
35 40 45 35 40 45
Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Ile Lys Ala Val Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Ile Lys Ala Val Phe
50 55 60 50 55 60
Phe Asn Pro Gly Val Phe Leu Gln Gln Tyr His Thr Ala Lys Gln Leu Phe Asn Pro Gly Val Phe Leu Gln Gln Tyr His Thr Ala Lys Gln Leu
65 70 75 80 65 70 75 80
Ile Leu Lys Asn Glu Tyr Glu Ile Lys Asn Ile Phe Cys Ser Thr Phe Ile Leu Lys Asn Glu Tyr Glu Ile Lys Asn Ile Phe Cys Ser Thr Phe
85 90 95 85 90 95
Asn Leu Pro Phe Ile Glu Ser Asn Asp Phe Leu His Gln Phe Tyr Asn Asn Leu Pro Phe Ile Glu Ser Asn Asp Phe Leu His Gln Phe Tyr Asn
100 105 110 100 105 110
Phe Phe Pro Asp Ala Lys Leu Gly Tyr Glu Val Ile Glu Asn Leu Lys Phe Phe Pro Asp Ala Lys Leu Gly Tyr Glu Val Ile Glu Asn Leu Lys
115 120 125 115 120 125
Glu Phe Tyr Ala Tyr Ile Lys Tyr Asn Glu Ile Tyr Phe Asn Lys Arg Glu Phe Tyr Ala Tyr Ile Lys Tyr Asn Glu Ile Tyr Phe Asn Lys Arg
130 135 140 130 135 140
Ile Thr Ser Gly Val Tyr Met Cys Ala Ile Ala Ile Ala Leu Gly Tyr Ile Thr Ser Gly Val Tyr Met Cys Ala Ile Ala Ile Ala Leu Gly Tyr
145 150 155 160 145 150 155 160
Lys Thr Ile Tyr Leu Cys Gly Ile Asp Phe Tyr Glu Gly Asp Val Ile Lys Thr Ile Tyr Leu Cys Gly Ile Asp Phe Tyr Glu Gly Asp Val Ile
165 170 175 165 170 175
Tyr Pro Phe Glu Ala Met Ser Thr Asn Ile Lys Thr Ile Phe Pro Gly Tyr Pro Phe Glu Ala Met Ser Thr Asn Ile Lys Thr Ile Phe Pro Gly
180 185 190 180 185 190
Ile Lys Asp Phe Lys Pro Ser Asn Cys His Ser Lys Glu Tyr Asp Ile Ile Lys Asp Phe Lys Pro Ser Asn Cys His Ser Lys Glu Tyr Asp Ile
195 200 205 195 200 205
Glu Ala Leu Lys Leu Leu Lys Ser Ile Tyr Lys Val Asn Ile Tyr Ala Glu Ala Leu Lys Leu Leu Lys Ser Ile Tyr Lys Val Asn Ile Tyr Ala
210 215 220 210 215 220
Leu Cys Asp Asp Ser Ile Leu Ala Asn His Phe Pro Leu Ser Ile Asn Leu Cys Asp Asp Ser Ile Leu Ala Asn His Phe Pro Leu Ser Ile Asn
225 230 235 240 225 230 235 240
Ile Asn Asn Asn Phe Thr Leu Glu Asn Lys His Asn Asn Ser Ile Asn Ile Asn Asn Asn Phe Thr Leu Glu Asn Lys His Asn Asn Ser Ile Asn
245 250 255 245 250 255
Asp Ile Leu Leu Thr Asp Asn Thr Pro Gly Val Ser Phe Tyr Lys Asn Asp Ile Leu Leu Thr Asp Asn Thr Pro Gly Val Ser Phe Tyr Lys Asn
260 265 270 260 265 270
Gln Leu Lys Ala Asp Asn Lys Ile Met Leu Asn Phe Tyr Asn Ile Leu Gln Leu Lys Ala Asp Asn Lys Ile Met Leu Asn Phe Tyr Asn Ile Leu
275 280 285 275 280 285
His Ser Lys Asp Asn Leu Ile Lys Phe Leu Asn Lys Glu Ile Ala Val His Ser Lys Asp Asn Leu Ile Lys Phe Leu Asn Lys Glu Ile Ala Val
290 295 300 290 295 300
Leu Lys Lys Gln Thr Thr Gln Arg Ala Lys Ala Arg Ile Gln Asn His Leu Lys Lys Gln Thr Thr Gln Arg Ala Lys Ala Arg Ile Gln Asn His
305 310 315 320 305 310 315 320
Leu Ser Tyr Lys Leu Gly Gln Ala Leu Ile Ile Asn Ser Lys Ser Val Leu Ser Tyr Lys Leu Gly Gln Ala Leu Ile Ile Asn Ser Lys Ser Val
325 330 335 325 330 335
Leu Gly Phe Leu Ser Leu Pro Phe Ile Ile Leu Ser Ile Val Ile Ser Leu Gly Phe Leu Ser Leu Pro Phe Ile Ile Leu Ser Ile Val Ile Ser
340 345 350 340 345 350
His Lys Gln Glu Gln Lys Ala Tyr Lys Phe Lys Val Lys Lys Asn Pro His Lys Gln Glu Gln Lys Ala Tyr Lys Phe Lys Val Lys Lys Asn Pro
355 360 365 355 360 365
Asn Leu Ala Leu Pro Pro Leu Glu Thr Tyr Pro Asp Tyr Asn Glu Ala Asn Leu Ala Leu Pro Pro Leu Glu Thr Tyr Pro Asp Tyr Asn Glu Ala
370 375 380 370 375 380
Leu Lys Glu Lys Glu Cys Phe Thr Tyr Lys Leu Gly Glu Glu Phe Ile Leu Lys Glu Lys Glu Cys Phe Thr Tyr Lys Leu Gly Glu Glu Phe Ile
385 390 395 400 385 390 395 400
Lys Ala Gly Lys Asn Trp Tyr Gly Glu Gly Tyr Ile Lys Phe Ile Phe Lys Ala Gly Lys Asn Trp Tyr Gly Glu Gly Tyr Ile Lys Phe Ile Phe
405 410 415 405 410 415
Lys Asp Val Pro Arg Leu Lys Arg Glu Phe Glu Lys Gly Glu Lys Asp Val Pro Arg Leu Lys Arg Glu Phe Glu Lys Gly Glu
420 425 430 420 425 430
<210> 44<210> 44
<211> 395<211> 395
<212> Белок<212> Protein
<213> Heliobacter acinonychis<213> Heliobacter acinonychis
<400> 44<400> 44
Met Asn Lys Lys Pro Leu Ile Ile Ala Gly Asn Gly Pro Ser Ile Lys Met Asn Lys Lys Pro Leu Ile Ile Ala Gly Asn Gly Pro Ser Ile Lys
1 5 10 15 1 5 10 15
Asp Leu Asp Tyr Ala Leu Phe Pro Lys Asp Phe Asp Val Phe Arg Cys Asp Leu Asp Tyr Ala Leu Phe Pro Lys Asp Phe Asp Val Phe Arg Cys
20 25 30 20 25 30
Asn Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Arg Glu Ile Lys Asn Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Arg Glu Ile Lys
35 40 45 35 40 45
Gly Val Phe Phe Asn Ala His Val Phe Asp Leu Gln Met Lys Ile Thr Gly Val Phe Phe Asn Ala His Val Phe Asp Leu Gln Met Lys Ile Thr
50 55 60 50 55 60
Lys Ala Ile Val Lys Asn Gly Glu Tyr His Pro Asp His Ile Tyr Cys Lys Ala Ile Val Lys Asn Gly Glu Tyr His Pro Asp His Ile Tyr Cys
65 70 75 80 65 70 75 80
Thr His Val Glu Pro Tyr Gly Tyr Val Asn Gly Asn Gln Gln Leu Met Thr His Val Glu Pro Tyr Gly Tyr Val Asn Gly Asn Gln Gln Leu Met
85 90 95 85 90 95
Gln Glu Tyr Leu Glu Lys His Phe Val Gly Val Arg Ser Thr Tyr Ala Gln Glu Tyr Leu Glu Lys His Phe Val Gly Val Arg Ser Thr Tyr Ala
100 105 110 100 105 110
Tyr Leu Lys Asp Leu Glu Pro Phe Phe Ile Leu His Ser Lys Tyr Arg Tyr Leu Lys Asp Leu Glu Pro Phe Phe Ile Leu His Ser Lys Tyr Arg
115 120 125 115 120 125
Asn Phe Tyr Asp Gln His Phe Thr Thr Gly Ile Met Met Leu Leu Val Asn Phe Tyr Asp Gln His Phe Thr Thr Gly Ile Met Met Leu Leu Val
130 135 140 130 135 140
Ala Ile Gln Leu Gly Tyr Lys Glu Ile Tyr Leu Cys Gly Ile Asp Phe Ala Ile Gln Leu Gly Tyr Lys Glu Ile Tyr Leu Cys Gly Ile Asp Phe
145 150 155 160 145 150 155 160
Tyr Glu Asn Gly Phe Gly His Phe Tyr Glu Asn Gln Gly Gly Phe Phe Tyr Glu Asn Gly Phe Gly His Phe Tyr Glu Asn Gln Gly Gly Phe Phe
165 170 175 165 170 175
Glu Glu Asp Ser Asp Pro Met His Asp Lys Asn Ile Asp Ile Gln Ala Glu Glu Asp Ser Asp Pro Met His Asp Lys Asn Ile Asp Ile Gln Ala
180 185 190 180 185 190
Leu Glu Leu Ala Lys Lys Tyr Ala Lys Ile Tyr Ala Leu Val Pro Asn Leu Glu Leu Ala Lys Lys Tyr Ala Lys Ile Tyr Ala Leu Val Pro Asn
195 200 205 195 200 205
Ser Ala Leu Val Lys Met Ile Pro Leu Ser Ser Gln Lys Gly Val Leu Ser Ala Leu Val Lys Met Ile Pro Leu Ser Ser Gln Lys Gly Val Leu
210 215 220 210 215 220
Glu Lys Val Lys Asp Arg Ile Gly Leu Gly Glu Phe Lys Arg Glu Lys Glu Lys Val Lys Asp Arg Ile Gly Leu Gly Glu Phe Lys Arg Glu Lys
225 230 235 240 225 230 235 240
Phe Gly Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Phe Gly Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys
245 250 255 245 250 255
Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg
260 265 270 260 265 270
Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu
275 280 285 275 280 285
Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys
290 295 300 290 295 300
Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg
305 310 315 320 305 310 315 320
Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu Glu Arg Gln Lys Glu Leu
325 330 335 325 330 335
Glu Leu Glu Arg Ser Leu Lys Ala Arg Leu Lys Ala Val Leu Ala Ser Glu Leu Glu Arg Ser Leu Lys Ala Arg Leu Lys Ala Val Leu Ala Ser
340 345 350 340 345 350
Lys Gly Ile Arg Gly Asp Asn Leu Ile Ile Val Ser Leu Lys Asp Thr Lys Gly Ile Arg Gly Asp Asn Leu Ile Ile Val Ser Leu Lys Asp Thr
355 360 365 355 360 365
Tyr Arg Leu Phe Lys Gly Gly Phe Ala Leu Leu Leu Asp Leu Lys Ala Tyr Arg Leu Phe Lys Gly Gly Phe Ala Leu Leu Leu Asp Leu Lys Ala
370 375 380 370 375 380
Leu Lys Ser Ile Ile Lys Ala Phe Leu Lys Arg Leu Lys Ser Ile Ile Lys Ala Phe Leu Lys Arg
385 390 395 385 390 395
<210> 45<210> 45
<211> 260<211> 260
<212> Белок<212> Protein
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 45<400> 45
Met Gly Lys Lys Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Glu Met Gly Lys Lys Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Glu
1 5 10 15 1 5 10 15
Ile Asp Tyr Ser Arg Leu Pro Asn Asp Phe Asp Val Phe Arg Cys Asn Ile Asp Tyr Ser Arg Leu Pro Asn Asp Phe Asp Val Phe Arg Cys Asn
20 25 30 20 25 30
Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Cys Lys Ala Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Cys Lys Ala
35 40 45 35 40 45
Val Phe Tyr Asn Pro Ile Leu Phe Phe Glu Gln Tyr Tyr Thr Leu Lys Val Phe Tyr Asn Pro Ile Leu Phe Phe Glu Gln Tyr Tyr Thr Leu Lys
50 55 60 50 55 60
His Leu Ile Gln Asn Gln Glu Tyr Glu Thr Glu Leu Ile Met Cys Ser His Leu Ile Gln Asn Gln Glu Tyr Glu Thr Glu Leu Ile Met Cys Ser
65 70 75 80 65 70 75 80
Asn Tyr Asn Gln Ala His Leu Glu Asn Glu Asn Phe Val Lys Thr Phe Asn Tyr Asn Gln Ala His Leu Glu Asn Glu Asn Phe Val Lys Thr Phe
85 90 95 85 90 95
Tyr Asp Tyr Phe Pro Asp Ala His Leu Gly Tyr Asp Phe Phe Lys Gln Tyr Asp Tyr Phe Pro Asp Ala His Leu Gly Tyr Asp Phe Phe Lys Gln
100 105 110 100 105 110
Leu Lys Asp Phe Asn Ala Tyr Phe Lys Phe His Glu Ile Tyr Phe Asn Leu Lys Asp Phe Asn Ala Tyr Phe Lys Phe His Glu Ile Tyr Phe Asn
115 120 125 115 120 125
Gln Arg Ile Thr Ser Gly Val Tyr Met Cys Ala Val Ala Ile Ala Leu Gln Arg Ile Thr Ser Gly Val Tyr Met Cys Ala Val Ala Ile Ala Leu
130 135 140 130 135 140
Gly Tyr Lys Glu Ile Tyr Leu Ser Gly Ile Asp Phe Tyr Gln Asn Gly Gly Tyr Lys Glu Ile Tyr Leu Ser Gly Ile Asp Phe Tyr Gln Asn Gly
145 150 155 160 145 150 155 160
Ser Ser Tyr Ala Phe Asp Thr Lys Gln Lys Asn Leu Leu Lys Leu Ala Ser Ser Tyr Ala Phe Asp Thr Lys Gln Lys Asn Leu Leu Lys Leu Ala
165 170 175 165 170 175
Pro Asn Phe Lys Asn Asp Asn Ser His Tyr Ile Gly His Ser Lys Asn Pro Asn Phe Lys Asn Asp Asn Ser His Tyr Ile Gly His Ser Lys Asn
180 185 190 180 185 190
Thr Asp Ile Lys Ala Leu Glu Phe Leu Glu Lys Thr Tyr Lys Ile Lys Thr Asp Ile Lys Ala Leu Glu Phe Leu Glu Lys Thr Tyr Lys Ile Lys
195 200 205 195 200 205
Leu Tyr Cys Leu Cys Pro Asn Ser Leu Leu Ala Asn Phe Ile Glu Leu Leu Tyr Cys Leu Cys Pro Asn Ser Leu Leu Ala Asn Phe Ile Glu Leu
210 215 220 210 215 220
Ala Pro Asn Leu Asn Ser Asn Phe Ile Ile Gln Glu Lys Asn Asn Tyr Ala Pro Asn Leu Asn Ser Asn Phe Ile Ile Gln Glu Lys Asn Asn Tyr
225 230 235 240 225 230 235 240
Thr Lys Asp Ile Leu Ile Pro Ser Ser Glu Ala Tyr Gly Lys Phe Ser Thr Lys Asp Ile Leu Ile Pro Ser Ser Glu Ala Tyr Gly Lys Phe Ser
245 250 255 245 250 255
Lys Asn Ile Asn Lys Asn Ile Asn
260 260
<210> 46<210> 46
<211> 298<211> 298
<212> Белок<212> Protein
<213> Streptococcus entericus<213> Streptococcus entericus
<400> 46<400> 46
Met Lys Lys Val Tyr Phe Cys His Thr Val Tyr His Leu Leu Ile Thr Met Lys Lys Val Tyr Phe Cys His Thr Val Tyr His Leu Leu Ile Thr
1 5 10 15 1 5 10 15
Leu Cys Lys Ile Ser Val Glu Glu Gln Val Glu Ile Ile Val Phe Asp Leu Cys Lys Ile Ser Val Glu Glu Gln Val Glu Ile Ile Val Phe Asp
20 25 30 20 25 30
Thr Val Ser Asn His Glu Leu Ile Val Gln Lys Ile Arg Asp Val Phe Thr Val Ser Asn His Glu Leu Ile Val Gln Lys Ile Arg Asp Val Phe
35 40 45 35 40 45
Val Asn Thr Thr Val Leu Phe Ala Glu Gln Asn Thr Asp Phe Ser Ile Val Asn Thr Thr Val Leu Phe Ala Glu Gln Asn Thr Asp Phe Ser Ile
50 55 60 50 55 60
Leu Glu Ile Asp Arg Ala Thr Asp Ile Tyr Val Phe Asn Asp Trp Thr Leu Glu Ile Asp Arg Ala Thr Asp Ile Tyr Val Phe Asn Asp Trp Thr
65 70 75 80 65 70 75 80
Pro Ile Gly Ala Tyr Leu Arg Lys Asn Lys Leu Phe Tyr His Leu Ile Pro Ile Gly Ala Tyr Leu Arg Lys Asn Lys Leu Phe Tyr His Leu Ile
85 90 95 85 90 95
Glu Asp Gly Tyr Asn Tyr His Glu Tyr Asn Val Tyr Ala Asn Ala Leu Glu Asp Gly Tyr Asn Tyr His Glu Tyr Asn Val Tyr Ala Asn Ala Leu
100 105 110 100 105 110
Thr Met Lys Arg Arg Leu Leu Asn Phe Val Leu Arg Arg Glu Glu Pro Thr Met Lys Arg Arg Leu Leu Asn Phe Val Leu Arg Arg Glu Glu Pro
115 120 125 115 120 125
Ser Gly Phe Ser Arg Tyr Val Arg Ser Ile Glu Val Asn Arg Val Lys Ser Gly Phe Ser Arg Tyr Val Arg Ser Ile Glu Val Asn Arg Val Lys
130 135 140 130 135 140
Tyr Leu Pro Asn Asp Cys Arg Lys Ser Lys Trp Val Glu Lys Pro Arg Tyr Leu Pro Asn Asp Cys Arg Lys Ser Lys Trp Val Glu Lys Pro Arg
145 150 155 160 145 150 155 160
Ser Ala Leu Phe Glu Asn Leu Val Pro Glu His Lys Gln Lys Ile Ile Ser Ala Leu Phe Glu Asn Leu Val Pro Glu His Lys Gln Lys Ile Ile
165 170 175 165 170 175
Thr Ile Phe Gly Leu Glu Asn Tyr Gln Asp Ser Leu Arg Gly Val Leu Thr Ile Phe Gly Leu Glu Asn Tyr Gln Asp Ser Leu Arg Gly Val Leu
180 185 190 180 185 190
Val Leu Thr Gln Pro Leu Val Gln Asp Tyr Trp Asp Arg Asp Ile Thr Val Leu Thr Gln Pro Leu Val Gln Asp Tyr Trp Asp Arg Asp Ile Thr
195 200 205 195 200 205
Thr Glu Glu Glu Gln Leu Glu Phe Tyr Arg Gln Ile Val Glu Ser Tyr Thr Glu Glu Glu Gln Leu Glu Phe Tyr Arg Gln Ile Val Glu Ser Tyr
210 215 220 210 215 220
Gly Glu Gly Glu Gln Val Phe Phe Lys Ile His Pro Arg Asp Lys Val Gly Glu Gly Glu Gln Val Phe Phe Lys Ile His Pro Arg Asp Lys Val
225 230 235 240 225 230 235 240
Asp Tyr Ser Ser Leu Thr Asn Val Ile Phe Leu Lys Lys Asn Val Pro Asp Tyr Ser Ser Leu Thr Asn Val Ile Phe Leu Lys Lys Asn Val Pro
245 250 255 245 250 255
Met Glu Val Tyr Glu Leu Ile Ala Asp Cys His Phe Thr Lys Gly Ile Met Glu Val Tyr Glu Leu Ile Ala Asp Cys His Phe Thr Lys Gly Ile
260 265 270 260 265 270
Thr His Ser Ser Thr Ala Leu Asp Phe Leu Ser Cys Val Asp Lys Lys Thr His Ser Ser Thr Ala Leu Asp Phe Leu Ser Cys Val Asp Lys Lys
275 280 285 275 280 285
Ile Thr Leu Lys Gln Met Lys Ala Asn Ser Ile Thr Leu Lys Gln Met Lys Ala Asn Ser
290 295 290 295
<210> 47<210> 47
<211> 295<211> 295
<212> Белок<212> Protein
<213> Haemophilus ducreyi<213> Haemophilus ducreyi
<400> 47<400> 47
Met Lys Glu Ile Ala Ile Ile Ser Asn Gln Arg Met Phe Phe Leu Tyr Met Lys Glu Ile Ala Ile Ile Ser Asn Gln Arg Met Phe Phe Leu Tyr
1 5 10 15 1 5 10 15
Cys Leu Leu Thr Asn Lys Asn Val Glu Asp Val Phe Phe Ile Phe Glu Cys Leu Leu Thr Asn Lys Asn Val Glu Asp Val Phe Phe Ile Phe Glu
20 25 30 20 25 30
Lys Gly Ala Met Pro Asn Asn Leu Thr Ser Ile Ser His Phe Ile Val Lys Gly Ala Met Pro Asn Asn Leu Thr Ser Ile Ser His Phe Ile Val
35 40 45 35 40 45
Leu Asp His Ser Lys Ser Glu Cys Tyr Asp Phe Phe Tyr Phe Asn Phe Leu Asp His Ser Lys Ser Glu Cys Tyr Asp Phe Phe Tyr Phe Asn Phe
50 55 60 50 55 60
Ile Ser Cys Lys Tyr Arg Leu Arg Gly Leu Asp Val Tyr Gly Ala Asp Ile Ser Cys Lys Tyr Arg Leu Arg Gly Leu Asp Val Tyr Gly Ala Asp
65 70 75 80 65 70 75 80
His Ile Lys Gly Ala Lys Phe Phe Leu Glu Arg His Arg Phe Phe Val His Ile Lys Gly Ala Lys Phe Phe Leu Glu Arg His Arg Phe Phe Val
85 90 95 85 90 95
Val Glu Asp Gly Met Met Asn Tyr Ser Lys Asn Met Tyr Ala Phe Ser Val Glu Asp Gly Met Met Asn Tyr Ser Lys Asn Met Tyr Ala Phe Ser
100 105 110 100 105 110
Leu Phe Arg Thr Arg Asn Pro Val Ile Leu Pro Gly Gly Phe His Pro Leu Phe Arg Thr Arg Asn Pro Val Ile Leu Pro Gly Gly Phe His Pro
115 120 125 115 120 125
Asn Val Lys Thr Ile Phe Leu Thr Lys Asp Asn Pro Ile Pro Asp Gln Asn Val Lys Thr Ile Phe Leu Thr Lys Asp Asn Pro Ile Pro Asp Gln
130 135 140 130 135 140
Ile Ala His Lys Arg Glu Ile Ile Asn Ile Lys Thr Leu Trp Gln Ala Ile Ala His Lys Arg Glu Ile Ile Asn Ile Lys Thr Leu Trp Gln Ala
145 150 155 160 145 150 155 160
Lys Thr Ala Thr Glu Lys Thr Lys Ile Leu Ser Phe Phe Glu Ile Asp Lys Thr Ala Thr Glu Lys Thr Lys Ile Leu Ser Phe Phe Glu Ile Asp
165 170 175 165 170 175
Met Gln Glu Ile Ser Val Ile Lys Asn Arg Ser Phe Val Leu Tyr Thr Met Gln Glu Ile Ser Val Ile Lys Asn Arg Ser Phe Val Leu Tyr Thr
180 185 190 180 185 190
Gln Pro Leu Ser Glu Asp Lys Leu Leu Thr Glu Ala Glu Lys Ile Asp Gln Pro Leu Ser Glu Asp Lys Leu Leu Thr Glu Ala Glu Lys Ile Asp
195 200 205 195 200 205
Ile Tyr Arg Thr Ile Leu Thr Lys Tyr Asn His Ser Gln Thr Val Ile Ile Tyr Arg Thr Ile Leu Thr Lys Tyr Asn His Ser Gln Thr Val Ile
210 215 220 210 215 220
Lys Pro His Pro Arg Asp Lys Thr Asp Tyr Lys Gln Leu Phe Pro Asp Lys Pro His Pro Arg Asp Lys Thr Asp Tyr Lys Gln Leu Phe Pro Asp
225 230 235 240 225 230 235 240
Ala Tyr Val Met Lys Gly Thr Tyr Pro Ser Glu Leu Leu Thr Leu Leu Ala Tyr Val Met Lys Gly Thr Tyr Pro Ser Glu Leu Leu Thr Leu Leu
245 250 255 245 250 255
Gly Val Asn Phe Asn Lys Val Ile Thr Leu Phe Ser Thr Ala Val Phe Gly Val Asn Phe Asn Lys Val Ile Thr Leu Phe Ser Thr Ala Val Phe
260 265 270 260 265 270
Asp Tyr Pro Lys Glu Lys Ile Asp Phe Tyr Gly Thr Ala Val His Pro Asp Tyr Pro Lys Glu Lys Ile Asp Phe Tyr Gly Thr Ala Val His Pro
275 280 285 275 280 285
Lys Leu Leu Asp Phe Phe Asp Lys Leu Leu Asp Phe Phe Asp
290 295 290 295
<210> 48<210> 48
<211> 488<211> 488
<212> Белок<212> Protein
<213> Alistipes sp.<213> Alistipes sp.
<400> 48<400> 48
Met Ala Leu Leu Ser Gly Thr Ala Ala Cys Ser Asp Asp Glu Val Ser Met Ala Leu Leu Ser Gly Thr Ala Ala Cys Ser Asp Asp Glu Val Ser
1 5 10 15 1 5 10 15
Gln Asn Leu Ile Val Ile Asn Gly Gly Glu His Phe Leu Ser Leu Asp Gln Asn Leu Ile Val Ile Asn Gly Gly Glu His Phe Leu Ser Leu Asp
20 25 30 20 25 30
Gly Leu Ala Arg Ala Gly Lys Ile Ser Val Leu Ala Pro Ala Pro Trp Gly Leu Ala Arg Ala Gly Lys Ile Ser Val Leu Ala Pro Ala Pro Trp
35 40 45 35 40 45
Arg Val Thr Lys Ala Ala Gly Asp Thr Trp Phe Arg Leu Ser Ala Thr Arg Val Thr Lys Ala Ala Gly Asp Thr Trp Phe Arg Leu Ser Ala Thr
50 55 60 50 55 60
Glu Gly Pro Ala Gly Tyr Ser Glu Val Glu Leu Ser Leu Asp Glu Asn Glu Gly Pro Ala Gly Tyr Ser Glu Val Glu Leu Ser Leu Asp Glu Asn
65 70 75 80 65 70 75 80
Pro Gly Ala Ala Arg Ser Ala Gln Leu Ala Phe Ala Cys Gly Asp Ala Pro Gly Ala Ala Arg Ser Ala Gln Leu Ala Phe Ala Cys Gly Asp Ala
85 90 95 85 90 95
Ile Val Pro Phe Arg Leu Ser Gln Gly Ala Leu Ser Ala Gly Tyr Asp Ile Val Pro Phe Arg Leu Ser Gln Gly Ala Leu Ser Ala Gly Tyr Asp
100 105 110 100 105 110
Ser Pro Asp Tyr Tyr Phe Tyr Val Thr Phe Gly Thr Met Pro Thr Leu Ser Pro Asp Tyr Tyr Phe Tyr Val Thr Phe Gly Thr Met Pro Thr Leu
115 120 125 115 120 125
Tyr Ala Gly Ile His Leu Leu Ser His Asp Lys Pro Gly Tyr Val Phe Tyr Ala Gly Ile His Leu Leu Ser His Asp Lys Pro Gly Tyr Val Phe
130 135 140 130 135 140
Tyr Ser Arg Ser Lys Thr Phe Asp Pro Ala Glu Phe Pro Ala Arg Ala Tyr Ser Arg Ser Lys Thr Phe Asp Pro Ala Glu Phe Pro Ala Arg Ala
145 150 155 160 145 150 155 160
Glu Val Thr Thr Ala Ala Asp Arg Thr Ala Asp Ala Thr Gln Ala Glu Glu Val Thr Thr Ala Ala Asp Arg Thr Ala Asp Ala Thr Gln Ala Glu
165 170 175 165 170 175
Met Glu Ala Met Ala Arg Glu Met Lys Arg Arg Ile Leu Glu Ile Asn Met Glu Ala Met Ala Arg Glu Met Lys Arg Arg Ile Leu Glu Ile Asn
180 185 190 180 185 190
Ser Ala Asp Pro Thr Ala Val Phe Gly Leu Tyr Val Asp Asp Leu Arg Ser Ala Asp Pro Thr Ala Val Phe Gly Leu Tyr Val Asp Asp Leu Arg
195 200 205 195 200 205
Cys Arg Ile Gly Tyr Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Ala Cys Arg Ile Gly Tyr Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Ala
210 215 220 210 215 220
Arg Val Lys Val Ser Met Leu Ser Asp Gly Thr Gly Thr Tyr Asn Asn Arg Val Lys Val Ser Met Leu Ser Asp Gly Thr Gly Thr Tyr Asn Asn
225 230 235 240 225 230 235 240
Phe Tyr Asn Tyr Phe Gly Asp Ala Ala Thr Ala Glu Gln Asn Trp Glu Phe Tyr Asn Tyr Phe Gly Asp Ala Ala Thr Ala Glu Gln Asn Trp Glu
245 250 255 245 250 255
Ser Tyr Ala Ser Glu Val Glu Ala Leu Asp Trp Asn His Gly Gly Arg Ser Tyr Ala Ser Glu Val Glu Ala Leu Asp Trp Asn His Gly Gly Arg
260 265 270 260 265 270
Tyr Pro Glu Thr Arg Ser Leu Pro Glu Phe Glu Ser Tyr Thr Trp Pro Tyr Pro Glu Thr Arg Ser Leu Pro Glu Phe Glu Ser Tyr Thr Trp Pro
275 280 285 275 280 285
Tyr Tyr Leu Ser Thr Arg Pro Asp Tyr Arg Leu Val Val Gln Asp Gly Tyr Tyr Leu Ser Thr Arg Pro Asp Tyr Arg Leu Val Val Gln Asp Gly
290 295 300 290 295 300
Ser Leu Leu Glu Ser Ser Cys Pro Phe Ile Thr Glu Lys Leu Gly Glu Ser Leu Leu Glu Ser Ser Cys Pro Phe Ile Thr Glu Lys Leu Gly Glu
305 310 315 320 305 310 315 320
Met Glu Ile Glu Ser Ile Gln Pro Tyr Glu Met Leu Ser Ala Leu Pro Met Glu Ile Glu Ser Ile Gln Pro Tyr Glu Met Leu Ser Ala Leu Pro
325 330 335 325 330 335
Glu Ser Ser Arg Lys Arg Phe Tyr Asp Met Ala Gly Phe Asp Tyr Asp Glu Ser Ser Arg Lys Arg Phe Tyr Asp Met Ala Gly Phe Asp Tyr Asp
340 345 350 340 345 350
Lys Phe Ala Ala Leu Phe Asp Ala Ser Pro Lys Lys Asn Leu Ile Ile Lys Phe Ala Ala Leu Phe Asp Ala Ser Pro Lys Lys Asn Leu Ile Ile
355 360 365 355 360 365
Ile Gly Thr Ser His Ala Asp Asp Ala Ser Ala Arg Leu Gln Arg Asp Ile Gly Thr Ser His Ala Asp Asp Ala Ser Ala Arg Leu Gln Arg Asp
370 375 380 370 375 380
Tyr Val Ala Arg Ile Met Glu Gln Tyr Gly Ala Gln Tyr Asp Val Phe Tyr Val Ala Arg Ile Met Glu Gln Tyr Gly Ala Gln Tyr Asp Val Phe
385 390 395 400 385 390 395 400
Phe Lys Pro His Pro Ala Asp Thr Thr Ser Ala Gly Tyr Glu Thr Glu Phe Lys Pro His Pro Ala Asp Thr Thr Ser Ala Gly Tyr Glu Thr Glu
405 410 415 405 410 415
Phe Pro Gly Leu Thr Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe Phe Pro Gly Leu Thr Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe
420 425 430 420 425 430
Val Trp Ser Leu Ile Asp Arg Val Asp Met Ile Gly Gly Tyr Pro Ser Val Trp Ser Leu Ile Asp Arg Val Asp Met Ile Gly Gly Tyr Pro Ser
435 440 445 435 440 445
Thr Val Phe Leu Thr Val Pro Val Asp Lys Val Arg Phe Ile Phe Ala Thr Val Phe Leu Thr Val Pro Val Asp Lys Val Arg Phe Ile Phe Ala
450 455 460 450 455 460
Ala Asp Ala Ala Ser Leu Val Arg Pro Leu Asn Ile Leu Phe Arg Asp Ala Asp Ala Ala Ser Leu Val Arg Pro Leu Asn Ile Leu Phe Arg Asp
465 470 475 480 465 470 475 480
Ala Thr Asp Val Glu Trp Met Gln Ala Thr Asp Val Glu Trp Met Gln
485 485
<210> 49<210> 49
<211> 291<211> 291
<212> Белок<212> Protein
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 49<400> 49
Met Lys Lys Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Glu Ile Met Lys Lys Val Ile Ile Ala Gly Asn Gly Pro Ser Leu Lys Glu Ile
1 5 10 15 1 5 10 15
Asp Tyr Ser Arg Leu Pro Asn Asp Phe Asp Val Phe Arg Cys Asn Gln Asp Tyr Ser Arg Leu Pro Asn Asp Phe Asp Val Phe Arg Cys Asn Gln
20 25 30 20 25 30
Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Cys Lys Ala Val Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Lys Lys Cys Lys Ala Val
35 40 45 35 40 45
Phe Tyr Thr Pro Asn Phe Phe Phe Glu Gln Tyr Tyr Thr Leu Lys His Phe Tyr Thr Pro Asn Phe Phe Phe Glu Gln Tyr Tyr Thr Leu Lys His
50 55 60 50 55 60
Leu Ile Gln Asn Gln Glu Tyr Glu Thr Glu Leu Ile Met Cys Ser Asn Leu Ile Gln Asn Gln Glu Tyr Glu Thr Glu Leu Ile Met Cys Ser Asn
65 70 75 80 65 70 75 80
Tyr Asn Gln Ala His Leu Glu Asn Glu Asn Phe Val Lys Thr Phe Tyr Tyr Asn Gln Ala His Leu Glu Asn Glu Asn Phe Val Lys Thr Phe Tyr
85 90 95 85 90 95
Asp Tyr Phe Pro Asp Ala His Leu Gly Tyr Asp Phe Phe Lys Gln Leu Asp Tyr Phe Pro Asp Ala His Leu Gly Tyr Asp Phe Phe Lys Gln Leu
100 105 110 100 105 110
Lys Glu Phe Asn Ala Tyr Phe Lys Phe His Glu Ile Tyr Phe Asn Gln Lys Glu Phe Asn Ala Tyr Phe Lys Phe His Glu Ile Tyr Phe Asn Gln
115 120 125 115 120 125
Arg Ile Thr Ser Gly Val Tyr Met Cys Ala Val Ala Ile Ala Leu Gly Arg Ile Thr Ser Gly Val Tyr Met Cys Ala Val Ala Ile Ala Leu Gly
130 135 140 130 135 140
Tyr Lys Glu Ile Tyr Leu Ser Gly Ile Asp Phe Tyr Gln Asn Gly Ser Tyr Lys Glu Ile Tyr Leu Ser Gly Ile Asp Phe Tyr Gln Asn Gly Ser
145 150 155 160 145 150 155 160
Ser Tyr Ala Phe Asp Thr Lys Gln Glu Asn Leu Leu Lys Leu Ala Pro Ser Tyr Ala Phe Asp Thr Lys Gln Glu Asn Leu Leu Lys Leu Ala Pro
165 170 175 165 170 175
Asp Phe Lys Asn Asp Arg Ser His Tyr Ile Gly His Ser Lys Asn Thr Asp Phe Lys Asn Asp Arg Ser His Tyr Ile Gly His Ser Lys Asn Thr
180 185 190 180 185 190
Asp Ile Lys Ala Leu Glu Phe Leu Glu Lys Thr Tyr Lys Ile Lys Leu Asp Ile Lys Ala Leu Glu Phe Leu Glu Lys Thr Tyr Lys Ile Lys Leu
195 200 205 195 200 205
Tyr Cys Leu Cys Pro Asn Ser Leu Leu Ala Asn Phe Ile Glu Leu Ala Tyr Cys Leu Cys Pro Asn Ser Leu Leu Ala Asn Phe Ile Glu Leu Ala
210 215 220 210 215 220
Pro Asn Leu Asn Ser Asn Phe Ile Ile Gln Glu Lys Asn Asn Tyr Thr Pro Asn Leu Asn Ser Asn Phe Ile Ile Gln Glu Lys Asn Asn Tyr Thr
225 230 235 240 225 230 235 240
Lys Asp Ile Leu Ile Pro Ser Ser Glu Ala Tyr Gly Lys Phe Ser Lys Lys Asp Ile Leu Ile Pro Ser Ser Glu Ala Tyr Gly Lys Phe Ser Lys
245 250 255 245 250 255
Asn Ile Asn Phe Lys Lys Ile Lys Ile Lys Glu Asn Val Tyr Tyr Lys Asn Ile Asn Phe Lys Lys Ile Lys Ile Lys Glu Asn Val Tyr Tyr Lys
260 265 270 260 265 270
Leu Ile Lys Asp Leu Leu Arg Leu Pro Ser Asp Ile Lys His Tyr Phe Leu Ile Lys Asp Leu Leu Arg Leu Pro Ser Asp Ile Lys His Tyr Phe
275 280 285 275 280 285
Lys Gly Lys Lys Gly Lys
290 290
<210> 50<210> 50
<211> 312<211> 312
<212> Белок<212> Protein
<213> Streptococcus agalactiae<213> Streptococcus agalactiae
<400> 50<400> 50
Met Thr Asn Arg Lys Ile Tyr Val Cys His Thr Leu Tyr His Leu Leu Met Thr Asn Arg Lys Ile Tyr Val Cys His Thr Leu Tyr His Leu Leu
1 5 10 15 1 5 10 15
Ile Cys Leu Tyr Lys Glu Glu Ile Tyr Ser Asn Leu Glu Ile Ile Leu Ile Cys Leu Tyr Lys Glu Glu Ile Tyr Ser Asn Leu Glu Ile Ile Leu
20 25 30 20 25 30
Ser Ser Ser Ile Pro Asp Val Asp Asn Leu Glu Lys Lys Leu Lys Ser Ser Ser Ser Ile Pro Asp Val Asp Asn Leu Glu Lys Lys Leu Lys Ser
35 40 45 35 40 45
Lys Thr Ile Asn Ile His Ile Leu Glu Glu Ser Ser Gly Glu Ser Glu Lys Thr Ile Asn Ile His Ile Leu Glu Glu Ser Ser Gly Glu Ser Glu
50 55 60 50 55 60
Glu Leu Leu Ser Val Leu Lys Asp Ala Gly Leu Ser Tyr Ser Lys Phe Glu Leu Leu Ser Val Leu Lys Asp Ala Gly Leu Ser Tyr Ser Lys Phe
65 70 75 80 65 70 75 80
Asp Ser Asn Cys Phe Ile Phe Asn Asp Ala Thr Pro Ile Gly Arg Thr Asp Ser Asn Cys Phe Ile Phe Asn Asp Ala Thr Pro Ile Gly Arg Thr
85 90 95 85 90 95
Leu Ile Lys His Gly Ile Tyr Tyr Asn Leu Ile Glu Asp Gly Leu Asn Leu Ile Lys His Gly Ile Tyr Tyr Asn Leu Ile Glu Asp Gly Leu Asn
100 105 110 100 105 110
Cys Phe Thr Tyr Ser Ile Phe Ser Gln Lys Leu Trp Lys Tyr Tyr Val Cys Phe Thr Tyr Ser Ile Phe Ser Gln Lys Leu Trp Lys Tyr Tyr Val
115 120 125 115 120 125
Lys Lys Tyr Ile Leu His Lys Ile Gln Pro His Gly Phe Ser Arg Tyr Lys Lys Tyr Ile Leu His Lys Ile Gln Pro His Gly Phe Ser Arg Tyr
130 135 140 130 135 140
Cys Leu Gly Ile Glu Val Asn Ser Leu Val Asn Leu Pro Lys Asp Pro Cys Leu Gly Ile Glu Val Asn Ser Leu Val Asn Leu Pro Lys Asp Pro
145 150 155 160 145 150 155 160
Arg Tyr Lys Lys Phe Ile Glu Val Pro Arg Lys Glu Leu Phe Asp Asn Arg Tyr Lys Lys Phe Ile Glu Val Pro Arg Lys Glu Leu Phe Asp Asn
165 170 175 165 170 175
Val Thr Glu Tyr Gln Lys Glu Met Ala Ile Asn Leu Phe Gly Ala Val Val Thr Glu Tyr Gln Lys Glu Met Ala Ile Asn Leu Phe Gly Ala Val
180 185 190 180 185 190
Arg Val Ser Ile Lys Ser Pro Ser Val Leu Val Leu Thr Gln Pro Leu Arg Val Ser Ile Lys Ser Pro Ser Val Leu Val Leu Thr Gln Pro Leu
195 200 205 195 200 205
Ser Ile Asp Lys Glu Phe Met Ser Tyr Asn Asn Lys Ile Glu Thr Ser Ser Ile Asp Lys Glu Phe Met Ser Tyr Asn Asn Lys Ile Glu Thr Ser
210 215 220 210 215 220
Glu Glu Gln Phe Asn Phe Tyr Lys Ser Ile Val Asn Glu Tyr Ile Asn Glu Glu Gln Phe Asn Phe Tyr Lys Ser Ile Val Asn Glu Tyr Ile Asn
225 230 235 240 225 230 235 240
Lys Gly Tyr Asn Val Tyr Leu Lys Val His Pro Arg Asp Val Val Asp Lys Gly Tyr Asn Val Tyr Leu Lys Val His Pro Arg Asp Val Val Asp
245 250 255 245 250 255
Tyr Ser Lys Leu Pro Val Glu Leu Leu Pro Ser Asn Val Pro Met Glu Tyr Ser Lys Leu Pro Val Glu Leu Leu Pro Ser Asn Val Pro Met Glu
260 265 270 260 265 270
Ile Ile Glu Leu Met Leu Thr Gly Arg Phe Glu Cys Gly Ile Thr His Ile Ile Glu Leu Met Leu Thr Gly Arg Phe Glu Cys Gly Ile Thr His
275 280 285 275 280 285
Ser Ser Thr Ala Leu Asp Phe Leu Thr Cys Val Asp Lys Lys Ile Thr Ser Ser Thr Ala Leu Asp Phe Leu Thr Cys Val Asp Lys Lys Ile Thr
290 295 300 290 295 300
Leu Val Asp Leu Lys Asp Ile Lys Leu Val Asp Leu Lys Asp Ile Lys
305 310 305 310
<210> 51<210> 51
<211> 410<211> 410
<212> Белок<212> Protein
<213> Bibersteinia trehalosi<213> Bibersteinia trehalosi
<400> 51<400> 51
Met Glu Phe Cys Lys Met Ala Thr Thr Gln Lys Ile Cys Val Tyr Leu Met Glu Phe Cys Lys Met Ala Thr Thr Gln Lys Ile Cys Val Tyr Leu
1 5 10 15 1 5 10 15
Asp Tyr Ala Thr Ile Pro Ser Leu Asn Tyr Ile Leu His Phe Ala Gln Asp Tyr Ala Thr Ile Pro Ser Leu Asn Tyr Ile Leu His Phe Ala Gln
20 25 30 20 25 30
His Phe Glu Asp Gln Glu Thr Ile Arg Leu Phe Gly Leu Ser Arg Phe His Phe Glu Asp Gln Glu Thr Ile Arg Leu Phe Gly Leu Ser Arg Phe
35 40 45 35 40 45
His Ile Pro Glu Ser Val Ile Gln Arg Tyr Pro Lys Gly Val Val Gln His Ile Pro Glu Ser Val Ile Gln Arg Tyr Pro Lys Gly Val Val Gln
50 55 60 50 55 60
Phe Tyr Pro Asn Gln Glu Lys Asp Phe Ser Ala Leu Leu Leu Ala Leu Phe Tyr Pro Asn Gln Glu Lys Asp Phe Ser Ala Leu Leu Leu Ala Leu
65 70 75 80 65 70 75 80
Lys Asn Ile Leu Ile Glu Val Lys Gln Gln Gln Arg Lys Cys Glu Ile Lys Asn Ile Leu Ile Glu Val Lys Gln Gln Gln Arg Lys Cys Glu Ile
85 90 95 85 90 95
Glu Leu His Leu Asn Leu Phe His Tyr Gln Leu Leu Leu Leu Pro Phe Glu Leu His Leu Asn Leu Phe His Tyr Gln Leu Leu Leu Leu Pro Phe
100 105 110 100 105 110
Leu Ser Leu Tyr Leu Asp Thr Gln Asp Tyr Cys His Leu Thr Leu Lys Leu Ser Leu Tyr Leu Asp Thr Gln Asp Tyr Cys His Leu Thr Leu Lys
115 120 125 115 120 125
Phe Tyr Asp Asp Gly Ser Glu Ala Ile Ser Ala Leu Gln Glu Leu Ala Phe Tyr Asp Asp Gly Ser Glu Ala Ile Ser Ala Leu Gln Glu Leu Ala
130 135 140 130 135 140
Leu Ala Pro Asp Leu Ala Ala Gln Ile Gln Phe Glu Lys Gln Gln Phe Leu Ala Pro Asp Leu Ala Ala Gln Ile Gln Phe Glu Lys Gln Gln Phe
145 150 155 160 145 150 155 160
Asp Glu Leu Val Val Lys Lys Ser Phe Lys Leu Ser Leu Leu Ser Arg Asp Glu Leu Val Val Lys Lys Ser Phe Lys Leu Ser Leu Leu Ser Arg
165 170 175 165 170 175
Tyr Phe Trp Gly Lys Leu Phe Glu Ser Glu Tyr Ile Trp Phe Asn Gln Tyr Phe Trp Gly Lys Leu Phe Glu Ser Glu Tyr Ile Trp Phe Asn Gln
180 185 190 180 185 190
Ala Ile Leu Gln Lys Ala Glu Leu Gln Ile Leu Lys Gln Glu Ile Ser Ala Ile Leu Gln Lys Ala Glu Leu Gln Ile Leu Lys Gln Glu Ile Ser
195 200 205 195 200 205
Ser Ser Arg Gln Met Asp Phe Ala Ile Tyr Gln Gln Met Ser Asp Glu Ser Ser Arg Gln Met Asp Phe Ala Ile Tyr Gln Gln Met Ser Asp Glu
210 215 220 210 215 220
Gln Lys Gln Leu Val Leu Glu Ile Leu Asn Ile Asp Leu Asn Lys Val Gln Lys Gln Leu Val Leu Glu Ile Leu Asn Ile Asp Leu Asn Lys Val
225 230 235 240 225 230 235 240
Ala Tyr Leu Lys Gln Leu Met Glu Asn Gln Pro Ser Phe Leu Phe Leu Ala Tyr Leu Lys Gln Leu Met Glu Asn Gln Pro Ser Phe Leu Phe Leu
245 250 255 245 250 255
Gly Thr Thr Leu Phe Asn Ile Thr Gln Glu Thr Lys Thr Trp Leu Met Gly Thr Thr Leu Phe Asn Ile Thr Gln Glu Thr Lys Thr Trp Leu Met
260 265 270 260 265 270
Gln Met His Val Asp Leu Ile Gln Gln Tyr Cys Leu Pro Ser Gly Gln Gln Met His Val Asp Leu Ile Gln Gln Tyr Cys Leu Pro Ser Gly Gln
275 280 285 275 280 285
Phe Phe Asn Asn Lys Ala Gly Tyr Leu Cys Phe Tyr Lys Gly His Pro Phe Phe Asn Asn Lys Ala Gly Tyr Leu Cys Phe Tyr Lys Gly His Pro
290 295 300 290 295 300
Asn Glu Lys Glu Met Asn Gln Met Ile Leu Ser Gln Phe Lys Asn Leu Asn Glu Lys Glu Met Asn Gln Met Ile Leu Ser Gln Phe Lys Asn Leu
305 310 315 320 305 310 315 320
Ile Ala Leu Pro Asp Asp Ile Pro Leu Glu Ile Leu Leu Leu Leu Gly Ile Ala Leu Pro Asp Asp Ile Pro Leu Glu Ile Leu Leu Leu Leu Gly
325 330 335 325 330 335
Val Ile Pro Ser Lys Val Gly Gly Phe Ala Ser Ser Ala Leu Phe Asn Val Ile Pro Ser Lys Val Gly Gly Phe Ala Ser Ser Ala Leu Phe Asn
340 345 350 340 345 350
Phe Thr Pro Ala Gln Ile Glu Asn Ile Ile Phe Phe Thr Pro Arg Tyr Phe Thr Pro Ala Gln Ile Glu Asn Ile Ile Phe Phe Thr Pro Arg Tyr
355 360 365 355 360 365
Phe Glu Lys Asp Asn Arg Leu His Ala Thr Gln Tyr Arg Leu Met Gln Phe Glu Lys Asp Asn Arg Leu His Ala Thr Gln Tyr Arg Leu Met Gln
370 375 380 370 375 380
Gly Leu Ile Glu Leu Gly Tyr Leu Asp Ala Glu Lys Ser Val Thr His Gly Leu Ile Glu Leu Gly Tyr Leu Asp Ala Glu Lys Ser Val Thr His
385 390 395 400 385 390 395 400
Phe Glu Ile Met Gln Leu Leu Thr Lys Glu Phe Glu Ile Met Gln Leu Leu Thr Lys Glu
405 410 405 410
<210> 52<210> 52
<211> 406<211> 406
<212> Белок<212> Protein
<213> Haemophilus parahaemolyticus<213> Haemophilus parahaemolyticus
<400> 52<400> 52
Met Thr Glu Gln Tyr Ile Lys Asn Val Glu Val Tyr Leu Asp Tyr Ala Met Thr Glu Gln Tyr Ile Lys Asn Val Glu Val Tyr Leu Asp Tyr Ala
1 5 10 15 1 5 10 15
Thr Ile Pro Thr Leu Asn Tyr Phe Tyr His Phe Thr Glu Asn Lys Asp Thr Ile Pro Thr Leu Asn Tyr Phe Tyr His Phe Thr Glu Asn Lys Asp
20 25 30 20 25 30
Asp Ile Ala Thr Ile Arg Leu Phe Gly Leu Gly Arg Phe Asn Ile Ser Asp Ile Ala Thr Ile Arg Leu Phe Gly Leu Gly Arg Phe Asn Ile Ser
35 40 45 35 40 45
Lys Ser Ile Ile Glu Ser Tyr Pro Glu Gly Ile Ile Arg Tyr Cys Pro Lys Ser Ile Ile Glu Ser Tyr Pro Glu Gly Ile Ile Arg Tyr Cys Pro
50 55 60 50 55 60
Ile Ile Phe Glu Asp Gln Thr Ala Phe Gln Gln Leu Phe Ile Thr Leu Ile Ile Phe Glu Asp Gln Thr Ala Phe Gln Gln Leu Phe Ile Thr Leu
65 70 75 80 65 70 75 80
Leu Thr Glu Asp Ser Phe Cys Gln Tyr Arg Phe Asn Phe His Ile Asn Leu Thr Glu Asp Ser Phe Cys Gln Tyr Arg Phe Asn Phe His Ile Asn
85 90 95 85 90 95
Leu Phe His Ser Trp Lys Met Leu Ile Pro Leu Leu His Ile Ile Trp Leu Phe His Ser Trp Lys Met Leu Ile Pro Leu Leu His Ile Ile Trp
100 105 110 100 105 110
Gln Phe Lys His Lys Val Leu Asp Ile Lys Leu Asn Phe Tyr Asp Asp Gln Phe Lys His Lys Val Leu Asp Ile Lys Leu Asn Phe Tyr Asp Asp
115 120 125 115 120 125
Gly Ser Glu Gly Leu Val Thr Leu Ser Lys Ile Glu Gln Asn Tyr Ser Gly Ser Glu Gly Leu Val Thr Leu Ser Lys Ile Glu Gln Asn Tyr Ser
130 135 140 130 135 140
Ser Glu Ile Leu Gln Lys Ile Ile Asp Ile Asp Ser Gln Ser Phe Tyr Ser Glu Ile Leu Gln Lys Ile Ile Asp Ile Asp Ser Gln Ser Phe Tyr
145 150 155 160 145 150 155 160
Ala Asp Lys Leu Ser Phe Leu Asp Glu Asp Ile Ala Arg Tyr Leu Trp Ala Asp Lys Leu Ser Phe Leu Asp Glu Asp Ile Ala Arg Tyr Leu Trp
165 170 175 165 170 175
Asn Ser Leu Phe Glu Ser His Tyr Tyr Leu Leu Asn Asp Phe Leu Leu Asn Ser Leu Phe Glu Ser His Tyr Tyr Leu Leu Asn Asp Phe Leu Leu
180 185 190 180 185 190
Lys Asn Glu Lys Leu Ser Leu Leu Lys Asn Ser Ile Lys Tyr Cys His Lys Asn Glu Lys Leu Ser Leu Leu Lys Asn Ser Ile Lys Tyr Cys His
195 200 205 195 200 205
Ile Met Asp Leu Glu Arg Tyr Leu Gln Phe Thr Gln Glu Glu Lys Asp Ile Met Asp Leu Glu Arg Tyr Leu Gln Phe Thr Gln Glu Glu Lys Asp
210 215 220 210 215 220
Phe Phe Asn Glu Leu Leu Gly Ile Asn Ile Gln Ser Leu Glu Asp Lys Phe Phe Asn Glu Leu Leu Gly Ile Asn Ile Gln Ser Leu Glu Asp Lys
225 230 235 240 225 230 235 240
Ile Lys Ile Phe Gln Gln Lys Lys Thr Phe Ile Phe Thr Gly Thr Thr Ile Lys Ile Phe Gln Gln Lys Lys Thr Phe Ile Phe Thr Gly Thr Thr
245 250 255 245 250 255
Ile Phe Ser Leu Pro Lys Glu Glu Glu Glu Thr Leu Tyr Arg Leu His Ile Phe Ser Leu Pro Lys Glu Glu Glu Glu Thr Leu Tyr Arg Leu His
260 265 270 260 265 270
Leu Asn Ala Ile Leu Asn Tyr Ile His Pro Asn Gly Lys Tyr Phe Ile Leu Asn Ala Ile Leu Asn Tyr Ile His Pro Asn Gly Lys Tyr Phe Ile
275 280 285 275 280 285
Gly Asp Gly Phe Thr Leu Val Ile Lys Gly His Pro His Gln Lys Glu Gly Asp Gly Phe Thr Leu Val Ile Lys Gly His Pro His Gln Lys Glu
290 295 300 290 295 300
Met Asn Ser Arg Leu Glu Lys Ser Phe Glu Lys Ala Val Met Leu Pro Met Asn Ser Arg Leu Glu Lys Ser Phe Glu Lys Ala Val Met Leu Pro
305 310 315 320 305 310 315 320
Asp Asn Ile Pro Phe Glu Ile Leu Tyr Leu Ile Gly Cys Lys Pro Asp Asp Asn Ile Pro Phe Glu Ile Leu Tyr Leu Ile Gly Cys Lys Pro Asp
325 330 335 325 330 335
Lys Ile Gly Gly Phe Val Ser Thr Ser Tyr Phe Ser Cys Asp Lys Lys Lys Ile Gly Gly Phe Val Ser Thr Ser Tyr Phe Ser Cys Asp Lys Lys
340 345 350 340 345 350
Asn Ile Ala Asp Leu Leu Phe Ile Ser Ala Arg Gln Glu Glu Val Arg Asn Ile Ala Asp Leu Leu Phe Ile Ser Ala Arg Gln Glu Glu Val Arg
355 360 365 355 360 365
Lys Asn Asp Tyr Leu Phe Asn Ile Gln Tyr Gln Leu Arg Asp Met Met Lys Asn Asp Tyr Leu Phe Asn Ile Gln Tyr Gln Leu Arg Asp Met Met
370 375 380 370 375 380
Ile Lys Thr Gly Phe Ile Gln Glu Glu Lys Thr His Phe Tyr Ser Asp Ile Lys Thr Gly Phe Ile Gln Glu Glu Lys Thr His Phe Tyr Ser Asp
385 390 395 400 385 390 395 400
Ile Pro Ile Phe Ile Ser Ile Pro Ile Phe Ile Ser
405 405
<210> 53<210> 53
<211> 300<211> 300
<212> Белок<212> Protein
<213> Haemophilus somnus<213> Haemophilus somnus
<400> 53<400> 53
Met Lys Tyr Asn Ile Lys Ile Lys Ala Ile Val Ile Val Ser Ser Leu Met Lys Tyr Asn Ile Lys Ile Lys Ala Ile Val Ile Val Ser Ser Leu
1 5 10 15 1 5 10 15
Arg Met Leu Leu Ile Phe Leu Met Leu Asn Lys Tyr His Leu Asp Glu Arg Met Leu Leu Ile Phe Leu Met Leu Asn Lys Tyr His Leu Asp Glu
20 25 30 20 25 30
Val Leu Phe Val Phe Asn Glu Gly Phe Glu Leu His Lys Lys Tyr Lys Val Leu Phe Val Phe Asn Glu Gly Phe Glu Leu His Lys Lys Tyr Lys
35 40 45 35 40 45
Ile Lys His Tyr Val Ala Ile Lys Lys Lys Ile Thr Lys Phe Trp Arg Ile Lys His Tyr Val Ala Ile Lys Lys Lys Ile Thr Lys Phe Trp Arg
50 55 60 50 55 60
Leu Tyr Tyr Lys Leu Tyr Phe Tyr Arg Phe Lys Ile Asp Arg Ile Pro Leu Tyr Tyr Lys Leu Tyr Phe Tyr Arg Phe Lys Ile Asp Arg Ile Pro
65 70 75 80 65 70 75 80
Val Tyr Gly Ala Asp His Leu Gly Trp Thr Asp Tyr Phe Leu Lys Tyr Val Tyr Gly Ala Asp His Leu Gly Trp Thr Asp Tyr Phe Leu Lys Tyr
85 90 95 85 90 95
Phe Asp Phe Tyr Leu Ile Glu Asp Gly Ile Ala Asn Phe Ser Pro Lys Phe Asp Phe Tyr Leu Ile Glu Asp Gly Ile Ala Asn Phe Ser Pro Lys
100 105 110 100 105 110
Arg Tyr Glu Ile Asn Leu Thr Arg Asn Ile Pro Val Phe Gly Phe His Arg Tyr Glu Ile Asn Leu Thr Arg Asn Ile Pro Val Phe Gly Phe His
115 120 125 115 120 125
Lys Thr Val Lys Lys Ile Tyr Leu Thr Ser Leu Glu Asn Val Pro Ser Lys Thr Val Lys Lys Ile Tyr Leu Thr Ser Leu Glu Asn Val Pro Ser
130 135 140 130 135 140
Asp Ile Arg His Lys Val Glu Leu Ile Ser Leu Glu His Leu Trp Lys Asp Ile Arg His Lys Val Glu Leu Ile Ser Leu Glu His Leu Trp Lys
145 150 155 160 145 150 155 160
Thr Arg Thr Ala Gln Glu Gln His Asn Ile Leu Asp Phe Phe Ala Phe Thr Arg Thr Ala Gln Glu Gln His Asn Ile Leu Asp Phe Phe Ala Phe
165 170 175 165 170 175
Asn Leu Asp Ser Leu Ile Ser Leu Lys Met Lys Lys Tyr Ile Leu Phe Asn Leu Asp Ser Leu Ile Ser Leu Lys Met Lys Lys Tyr Ile Leu Phe
180 185 190 180 185 190
Thr Gln Cys Leu Ser Glu Asp Arg Val Ile Ser Glu Gln Glu Lys Ile Thr Gln Cys Leu Ser Glu Asp Arg Val Ile Ser Glu Gln Glu Lys Ile
195 200 205 195 200 205
Ala Ile Tyr Gln His Ile Ile Lys Asn Tyr Asp Glu Arg Leu Leu Val Ala Ile Tyr Gln His Ile Ile Lys Asn Tyr Asp Glu Arg Leu Leu Val
210 215 220 210 215 220
Ile Lys Pro His Pro Arg Glu Thr Thr Asp Tyr Gln Lys Tyr Phe Glu Ile Lys Pro His Pro Arg Glu Thr Thr Asp Tyr Gln Lys Tyr Phe Glu
225 230 235 240 225 230 235 240
Asn Val Phe Val Tyr Gln Asp Val Val Pro Ser Glu Leu Phe Glu Leu Asn Val Phe Val Tyr Gln Asp Val Val Pro Ser Glu Leu Phe Glu Leu
245 250 255 245 250 255
Leu Asp Val Asn Phe Glu Arg Val Ile Thr Leu Phe Ser Thr Ala Val Leu Asp Val Asn Phe Glu Arg Val Ile Thr Leu Phe Ser Thr Ala Val
260 265 270 260 265 270
Phe Lys Tyr Asp Arg Asn Ile Val Asp Phe Tyr Gly Thr Arg Ile His Phe Lys Tyr Asp Arg Asn Ile Val Asp Phe Tyr Gly Thr Arg Ile His
275 280 285 275 280 285
Asp Lys Ile Tyr Gln Trp Phe Gly Asp Ile Lys Phe Asp Lys Ile Tyr Gln Trp Phe Gly Asp Ile Lys Phe
290 295 300 290 295 300
<210> 54<210> 54
<211> 381<211> 381
<212> Белок<212> Protein
<213> Vibrio harveyi<213> Vibrio harveyi
<400> 54<400> 54
Met Asp Ser Ser Pro Glu Asn Thr Ser Ser Thr Leu Glu Ile Tyr Ile Met Asp Ser Ser Pro Glu Asn Thr Ser Ser Thr Leu Glu Ile Tyr Ile
1 5 10 15 1 5 10 15
Asp Ser Ala Thr Leu Pro Ser Leu Gln His Met Val Lys Ile Ile Asp Asp Ser Ala Thr Leu Pro Ser Leu Gln His Met Val Lys Ile Ile Asp
20 25 30 20 25 30
Glu Gln Ser Gly Asn Lys Lys Leu Ile Asn Trp Lys Arg Tyr Pro Ile Glu Gln Ser Gly Asn Lys Lys Leu Ile Asn Trp Lys Arg Tyr Pro Ile
35 40 45 35 40 45
Asp Asp Glu Leu Leu Leu Asp Lys Ile Asn Ala Leu Ser Phe Ser Asp Asp Asp Glu Leu Leu Leu Asp Lys Ile Asn Ala Leu Ser Phe Ser Asp
50 55 60 50 55 60
Thr Thr Asp Leu Thr Arg Tyr Met Glu Ser Ile Leu Leu Ile Gly Asp Thr Thr Asp Leu Thr Arg Tyr Met Glu Ser Ile Leu Leu Ile Gly Asp
65 70 75 80 65 70 75 80
Ile Lys Arg Val Val Ile Asn Gly Asn Ser Leu Ser Asn Tyr Asn Ile Ile Lys Arg Val Val Ile Asn Gly Asn Ser Leu Ser Asn Tyr Asn Ile
85 90 95 85 90 95
Val Gly Val Met Arg Ser Ile Asn Ala Leu Gly Leu Asp Leu Asp Val Val Gly Val Met Arg Ser Ile Asn Ala Leu Gly Leu Asp Leu Asp Val
100 105 110 100 105 110
Glu Ile Asn Phe Tyr Asp Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr Glu Ile Asn Phe Tyr Asp Asp Gly Ser Ala Glu Tyr Val Arg Leu Tyr
115 120 125 115 120 125
Asn Phe Ser Gln Leu Pro Glu Ala Glu Arg Glu Leu Leu Val Ser Met Asn Phe Ser Gln Leu Pro Glu Ala Glu Arg Glu Leu Leu Val Ser Met
130 135 140 130 135 140
Ser Lys Asn Asn Ile Leu Ala Ala Val Asn Gly Ile Gly Ser Tyr Asp Ser Lys Asn Asn Ile Leu Ala Ala Val Asn Gly Ile Gly Ser Tyr Asp
145 150 155 160 145 150 155 160
Ser Gly Ser Pro Glu Asn Ile Tyr Gly Phe Ala Gln Ile Tyr Pro Ala Ser Gly Ser Pro Glu Asn Ile Tyr Gly Phe Ala Gln Ile Tyr Pro Ala
165 170 175 165 170 175
Thr Tyr His Met Leu Arg Ala Asp Ile Phe Asp Thr Asp Leu Glu Ile Thr Tyr His Met Leu Arg Ala Asp Ile Phe Asp Thr Asp Leu Glu Ile
180 185 190 180 185 190
Gly Leu Ile Arg Asp Ile Leu Gly Asp Asn Val Lys Gln Met Lys Trp Gly Leu Ile Arg Asp Ile Leu Gly Asp Asn Val Lys Gln Met Lys Trp
195 200 205 195 200 205
Gly Gln Phe Leu Gly Phe Asn Glu Glu Gln Lys Glu Leu Phe Tyr Gln Gly Gln Phe Leu Gly Phe Asn Glu Glu Gln Lys Glu Leu Phe Tyr Gln
210 215 220 210 215 220
Leu Thr Ser Phe Asn Pro Asp Lys Ile Gln Ala Gln Tyr Lys Glu Ser Leu Thr Ser Phe Asn Pro Asp Lys Ile Gln Ala Gln Tyr Lys Glu Ser
225 230 235 240 225 230 235 240
Pro Asn Lys Asn Phe Val Phe Val Gly Thr Asn Ser Arg Ser Ala Thr Pro Asn Lys Asn Phe Val Phe Val Gly Thr Asn Ser Arg Ser Ala Thr
245 250 255 245 250 255
Ala Glu Gln Gln Ile Asn Ile Ile Lys Glu Ala Lys Lys Leu Asp Ser Ala Glu Gln Gln Ile Asn Ile Ile Lys Glu Ala Lys Lys Leu Asp Ser
260 265 270 260 265 270
Glu Ile Ile Pro Asn Ser Ile Asp Gly Tyr Asp Leu Phe Phe Lys Gly Glu Ile Ile Pro Asn Ser Ile Asp Gly Tyr Asp Leu Phe Phe Lys Gly
275 280 285 275 280 285
His Pro Ser Ala Thr Tyr Asn Gln Gln Ile Val Asp Ala His Asp Met His Pro Ser Ala Thr Tyr Asn Gln Gln Ile Val Asp Ala His Asp Met
290 295 300 290 295 300
Thr Glu Ile Tyr Asn Arg Thr Pro Phe Glu Val Leu Ala Met Thr Ser Thr Glu Ile Tyr Asn Arg Thr Pro Phe Glu Val Leu Ala Met Thr Ser
305 310 315 320 305 310 315 320
Ser Leu Pro Asp Ala Val Gly Gly Met Gly Ser Ser Leu Phe Phe Ser Ser Leu Pro Asp Ala Val Gly Gly Met Gly Ser Ser Leu Phe Phe Ser
325 330 335 325 330 335
Leu Pro Lys Thr Val Glu Thr Lys Phe Ile Phe Tyr Lys Ser Gly Thr Leu Pro Lys Thr Val Glu Thr Lys Phe Ile Phe Tyr Lys Ser Gly Thr
340 345 350 340 345 350
Asp Ile Glu Ser Asn Ala Leu Ile Gln Val Met Leu Lys Leu Gly Ile Asp Ile Glu Ser Asn Ala Leu Ile Gln Val Met Leu Lys Leu Gly Ile
355 360 365 355 360 365
Ile Thr Asp Glu Lys Val Arg Phe Thr Thr Asp Ile Lys Ile Thr Asp Glu Lys Val Arg Phe Thr Thr Asp Ile Lys
370 375 380 370 375 380
<210> 55<210> 55
<211> 483<211> 483
<212> Белок<212> Protein
<213> Alistipes sp.<213> Alistipes sp.
<400> 55<400> 55
Met Ala Ser Cys Ser Asp Asp Asp Lys Glu Gln Thr Gly Phe Gln Ile Met Ala Ser Cys Ser Asp Asp Asp Lys Glu Gln Thr Gly Phe Gln Ile
1 5 10 15 1 5 10 15
Asp Asp Gly Ser Gly Phe Leu Ser Leu Asp Ala Ala Ala Arg Ser Gly Asp Asp Gly Ser Gly Phe Leu Ser Leu Asp Ala Ala Ala Arg Ser Gly
20 25 30 20 25 30
Ser Ile Ala Ile Thr Ala Asn Asn Ser Trp Ser Val Thr Gln Asp Lys Ser Ile Ala Ile Thr Ala Asn Asn Ser Trp Ser Val Thr Gln Asp Lys
35 40 45 35 40 45
Asp Ser Glu Trp Leu Thr Leu Ser Thr Thr Ser Gly Ala Ala Gly Arg Asp Ser Glu Trp Leu Thr Leu Ser Thr Thr Ser Gly Ala Ala Gly Arg
50 55 60 50 55 60
Thr Glu Ile Gly Ile Met Leu Glu Ala Asn Pro Gly Glu Ala Arg Asn Thr Glu Ile Gly Ile Met Leu Glu Ala Asn Pro Gly Glu Ala Arg Asn
65 70 75 80 65 70 75 80
Ala Gly Leu Thr Phe Asn Ser Gly Gly Arg Thr Tyr Pro Phe Val Ile Ala Gly Leu Thr Phe Asn Ser Gly Gly Arg Thr Tyr Pro Phe Val Ile
85 90 95 85 90 95
Thr Gln Ser Ala His Val Thr Ala Asp Phe Asp Asp Ala Asp His Cys Thr Gln Ser Ala His Val Thr Ala Asp Phe Asp Asp Ala Asp His Cys
100 105 110 100 105 110
Phe Tyr Ile Thr Phe Gly Thr Leu Pro Thr Leu Tyr Ala Gly Leu His Phe Tyr Ile Thr Phe Gly Thr Leu Pro Thr Leu Tyr Ala Gly Leu His
115 120 125 115 120 125
Val Leu Ser His Asp Lys Pro Ser Tyr Val Phe Phe Gln Arg Ser Gln Val Leu Ser His Asp Lys Pro Ser Tyr Val Phe Phe Gln Arg Ser Gln
130 135 140 130 135 140
Thr Phe Arg Pro Glu Glu Phe Pro Ala His Ala Glu Val Thr Ile Ala Thr Phe Arg Pro Glu Glu Phe Pro Ala His Ala Glu Val Thr Ile Ala
145 150 155 160 145 150 155 160
Ala Asp Pro Ser Ala Asn Ala Thr Asp Glu Asp Met Glu Arg Met Arg Ala Asp Pro Ser Ala Asn Ala Thr Asp Glu Asp Met Glu Arg Met Arg
165 170 175 165 170 175
Thr Ala Met Lys Gln Gln Ile Leu Lys Ile Asn Val Glu Asp Pro Thr Thr Ala Met Lys Gln Gln Ile Leu Lys Ile Asn Val Glu Asp Pro Thr
180 185 190 180 185 190
Ala Val Phe Gly Leu Tyr Val Asp Asp Leu Arg Cys Gly Ile Gly Tyr Ala Val Phe Gly Leu Tyr Val Asp Asp Leu Arg Cys Gly Ile Gly Tyr
195 200 205 195 200 205
Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Thr Arg Val Lys Val Ser Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Thr Arg Val Lys Val Ser
210 215 220 210 215 220
Met Leu Ser Asp Gly Thr Gly Thr Tyr Asn Asn Phe Tyr Asn Tyr Phe Met Leu Ser Asp Gly Thr Gly Thr Tyr Asn Asn Phe Tyr Asn Tyr Phe
225 230 235 240 225 230 235 240
Gly Asp Pro Ala Thr Ala Glu Gln Asn Trp Glu Asn Tyr Ala Ala Gln Gly Asp Pro Ala Thr Ala Glu Gln Asn Trp Glu Asn Tyr Ala Ala Gln
245 250 255 245 250 255
Val Glu Ala Leu Asp Trp Gln His Gly Gly Arg Phe Pro Glu Thr Arg Val Glu Ala Leu Asp Trp Gln His Gly Gly Arg Phe Pro Glu Thr Arg
260 265 270 260 265 270
Met Pro Asp Gly Phe Asp Phe Tyr Glu Trp Pro Tyr Tyr Leu Ala Thr Met Pro Asp Gly Phe Asp Phe Tyr Glu Trp Pro Tyr Tyr Leu Ala Thr
275 280 285 275 280 285
Arg Pro Asn Tyr Arg Leu Val Leu Gln Asp Asp Asp Leu Leu Glu Ala Arg Pro Asn Tyr Arg Leu Val Leu Gln Asp Asp Asp Leu Leu Glu Ala
290 295 300 290 295 300
Thr Ser Pro Phe Met Thr Glu Arg Leu Gln Gln Met Arg Thr Glu Ser Thr Ser Pro Phe Met Thr Glu Arg Leu Gln Gln Met Arg Thr Glu Ser
305 310 315 320 305 310 315 320
Lys Gln Pro Tyr Glu Leu Leu Ala Ser Leu Pro Ala Glu Ala Arg Gln Lys Gln Pro Tyr Glu Leu Leu Ala Ser Leu Pro Ala Glu Ala Arg Gln
325 330 335 325 330 335
Arg Phe Phe Arg Met Ala Gly Phe Asp Tyr Asp Ala Phe Ala Ala Leu Arg Phe Phe Arg Met Ala Gly Phe Asp Tyr Asp Ala Phe Ala Ala Leu
340 345 350 340 345 350
Phe Asp Ala Ser Pro Lys Lys Asn Leu Val Ile Ile Gly Thr Ser His Phe Asp Ala Ser Pro Lys Lys Asn Leu Val Ile Ile Gly Thr Ser His
355 360 365 355 360 365
Thr Ser Glu Glu Ser Glu Ala Gln Gln Ala Ala Tyr Val Glu Arg Ile Thr Ser Glu Glu Ser Glu Ala Gln Gln Ala Ala Tyr Val Glu Arg Ile
370 375 380 370 375 380
Ile Gly Asp Tyr Gly Thr Ala Tyr Asp Ile Phe Phe Lys Pro His Pro Ile Gly Asp Tyr Gly Thr Ala Tyr Asp Ile Phe Phe Lys Pro His Pro
385 390 395 400 385 390 395 400
Ala Asp Ser Ser Ser Ser Asn Tyr Glu Glu Arg Phe Glu Gly Leu Thr Ala Asp Ser Ser Ser Ser Asn Tyr Glu Glu Arg Phe Glu Gly Leu Thr
405 410 415 405 410 415
Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe Val Trp Ser Leu Leu Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe Val Trp Ser Leu Leu
420 425 430 420 425 430
Asp Lys Val Asp Leu Ile Gly Gly Tyr Ser Ser Thr Val Phe Leu Thr Asp Lys Val Asp Leu Ile Gly Gly Tyr Ser Ser Thr Val Phe Leu Thr
435 440 445 435 440 445
Val Pro Val Glu Lys Thr Gly Phe Ile Phe Ala Ala Asn Ala Glu Ser Val Pro Val Glu Lys Thr Gly Phe Ile Phe Ala Ala Asn Ala Glu Ser
450 455 460 450 455 460
Leu Pro Arg Pro Leu Asn Val Leu Phe Arg Asn Ala Glu His Val Arg Leu Pro Arg Pro Leu Asn Val Leu Phe Arg Asn Ala Glu His Val Arg
465 470 475 480 465 470 475 480
Trp Ile Gln Trp Ile Gln
<210> 56<210> 56
<211> 483<211> 483
<212> Белок<212> Protein
<213> Alistipes shahii<213> Alistipes shahii
<400> 56<400> 56
Met Asp Asp Gly Thr Pro Ser Val Ser Ile Asn Gly Gly Thr Asp Phe Met Asp Asp Gly Thr Pro Ser Val Ser Ile Asn Gly Gly Thr Asp Phe
1 5 10 15 1 5 10 15
Leu Ser Leu Asp His Leu Ala Arg Ser Gly Lys Ile Thr Val Asn Ala Leu Ser Leu Asp His Leu Ala Arg Ser Gly Lys Ile Thr Val Asn Ala
20 25 30 20 25 30
Pro Ala Pro Trp Ser Val Thr Leu Ala Pro Glu Asn Tyr Gly Gln Asp Pro Ala Pro Trp Ser Val Thr Leu Ala Pro Glu Asn Tyr Gly Gln Asp
35 40 45 35 40 45
Glu Lys Pro Asp Trp Leu Thr Leu Ser Ala Glu Glu Gly Pro Ala Gly Glu Lys Pro Asp Trp Leu Thr Leu Ser Ala Glu Glu Gly Pro Ala Gly
50 55 60 50 55 60
Tyr Ser Glu Ile Asp Val Thr Phe Ala Glu Asn Pro Gly Pro Ala Arg Tyr Ser Glu Ile Asp Val Thr Phe Ala Glu Asn Pro Gly Pro Ala Arg
65 70 75 80 65 70 75 80
Ser Ala Ser Leu Leu Phe Ser Cys Asp Gly Lys Thr Leu Ala Phe Thr Ser Ala Ser Leu Leu Phe Ser Cys Asp Gly Lys Thr Leu Ala Phe Thr
85 90 95 85 90 95
Val Ser Gln Ser Ala Gly Gly Thr Gly Phe Asp Ala Pro Asp Tyr Tyr Val Ser Gln Ser Ala Gly Gly Thr Gly Phe Asp Ala Pro Asp Tyr Tyr
100 105 110 100 105 110
Phe Tyr Ile Ser Val Gly Thr Met Pro Thr Leu Tyr Ser Gly Leu His Phe Tyr Ile Ser Val Gly Thr Met Pro Thr Leu Tyr Ser Gly Leu His
115 120 125 115 120 125
Leu Leu Ser His Asp Lys Pro Ser Tyr Val Ser Tyr Glu Arg Ala Ser Leu Leu Ser His Asp Lys Pro Ser Tyr Val Ser Tyr Glu Arg Ala Ser
130 135 140 130 135 140
Thr Phe Asp Ala Ala Glu Phe Pro Asp Arg Ala Phe Val Tyr Pro Val Thr Phe Asp Ala Ala Glu Phe Pro Asp Arg Ala Phe Val Tyr Pro Val
145 150 155 160 145 150 155 160
Ala Asp Pro Thr Gly His Ala Thr Asn Glu Glu Leu Arg Ala Met Ser Ala Asp Pro Thr Gly His Ala Thr Asn Glu Glu Leu Arg Ala Met Ser
165 170 175 165 170 175
Glu Ala Met Lys Arg Arg Ile Leu Glu Ile Asn Ala Glu Asp Pro Thr Glu Ala Met Lys Arg Arg Ile Leu Glu Ile Asn Ala Glu Asp Pro Thr
180 185 190 180 185 190
Ala Val Phe Gly Leu Trp Val Asp Asp Leu Arg Cys Arg Leu Gly Tyr Ala Val Phe Gly Leu Trp Val Asp Asp Leu Arg Cys Arg Leu Gly Tyr
195 200 205 195 200 205
Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Ala Arg Val Lys Val Thr Asp Trp Phe Val Ala Gln Gly Ile Asp Ser Ala Arg Val Lys Val Thr
210 215 220 210 215 220
Met Leu Ser Asp Gly Thr Ala Thr Tyr Asn Asn Phe His Asn Tyr Phe Met Leu Ser Asp Gly Thr Ala Thr Tyr Asn Asn Phe His Asn Tyr Phe
225 230 235 240 225 230 235 240
Gly Asp Ala Ala Thr Ala Glu Gln Asn Trp Asn Asp Tyr Ala Ala Glu Gly Asp Ala Ala Thr Ala Glu Gln Asn Trp Asn Asp Tyr Ala Ala Glu
245 250 255 245 250 255
Val Glu Ala Leu Asp Trp Asn His Gly Gly Arg Tyr Pro Glu Thr Arg Val Glu Ala Leu Asp Trp Asn His Gly Gly Arg Tyr Pro Glu Thr Arg
260 265 270 260 265 270
Ala Pro Glu Glu Phe Ala Ser Tyr Thr Trp Pro Tyr Tyr Leu Ser Thr Ala Pro Glu Glu Phe Ala Ser Tyr Thr Trp Pro Tyr Tyr Leu Ser Thr
275 280 285 275 280 285
Arg Pro Asp Tyr Arg Leu Met Leu Gln Asn Ser Ser Leu Met Glu Ser Arg Pro Asp Tyr Arg Leu Met Leu Gln Asn Ser Ser Leu Met Glu Ser
290 295 300 290 295 300
Ser Cys Pro Phe Ile Ala Asp Arg Leu Ala Ala Met Lys Met Glu Ser Ser Cys Pro Phe Ile Ala Asp Arg Leu Ala Ala Met Lys Met Glu Ser
305 310 315 320 305 310 315 320
Val Gln Pro Tyr Glu Leu Leu Thr Ala Leu Pro Glu Ala Ser Lys Gln Val Gln Pro Tyr Glu Leu Leu Thr Ala Leu Pro Glu Ala Ser Lys Gln
325 330 335 325 330 335
Gln Phe Tyr Arg Met Ala Lys Phe Asp Tyr Ala Arg Phe Ala Gly Leu Gln Phe Tyr Arg Met Ala Lys Phe Asp Tyr Ala Arg Phe Ala Gly Leu
340 345 350 340 345 350
Phe Asp Leu Ser Pro Lys Lys Asn Leu Ile Ile Ile Gly Thr Ser His Phe Asp Leu Ser Pro Lys Lys Asn Leu Ile Ile Ile Gly Thr Ser His
355 360 365 355 360 365
Ser Ser Ala Ala Ser Glu Gln Gln Gln Ala Ala Tyr Val Glu Arg Ile Ser Ser Ala Ala Ser Glu Gln Gln Gln Ala Ala Tyr Val Glu Arg Ile
370 375 380 370 375 380
Ile Gln Gln Tyr Gly Ser Asp Tyr Asp Ile Phe Phe Lys Pro His Pro Ile Gln Gln Tyr Gly Ser Asp Tyr Asp Ile Phe Phe Lys Pro His Pro
385 390 395 400 385 390 395 400
Ala Asp Ser Ser Ser Ala Gly Tyr Pro Asp Arg Phe Glu Gly Leu Thr Ala Asp Ser Ser Ser Ala Gly Tyr Pro Asp Arg Phe Glu Gly Leu Thr
405 410 415 405 410 415
Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe Val Trp Ala Leu Leu Leu Leu Pro Gly Gln Met Pro Phe Glu Ile Phe Val Trp Ala Leu Leu
420 425 430 420 425 430
Asp Lys Ile Asp Met Ile Gly Gly Tyr Pro Ser Thr Thr Phe Ile Ser Asp Lys Ile Asp Met Ile Gly Gly Tyr Pro Ser Thr Thr Phe Ile Ser
435 440 445 435 440 445
Val Pro Leu Asp Lys Val Gly Phe Leu Phe Ala Ala Asp Ala Asp Gly Val Pro Leu Asp Lys Val Gly Phe Leu Phe Ala Ala Asp Ala Asp Gly
450 455 460 450 455 460
Leu Val Arg Pro Leu Asn Ile Leu Phe Arg Asp Ala Ala Asn Val Glu Leu Val Arg Pro Leu Asn Ile Leu Phe Arg Asp Ala Ala Asn Val Glu
465 470 475 480 465 470 475 480
Trp Ile Gln Trp Ile Gln
<210> 57<210> 57
<211> 401<211> 401
<212> Белок<212> Protein
<213> Actinobacillus suis<213> Actinobacillus suis
<400> 57<400> 57
Met Glu Arg Thr Pro Gln Leu Gln Ala Val Asp Ile Tyr Ile Asp Phe Met Glu Arg Thr Pro Gln Leu Gln Ala Val Asp Ile Tyr Ile Asp Phe
1 5 10 15 1 5 10 15
Ala Thr Ile Pro Ser Leu Ser Tyr Phe Leu His Phe Leu Lys His Lys Ala Thr Ile Pro Ser Leu Ser Tyr Phe Leu His Phe Leu Lys His Lys
20 25 30 20 25 30
His Asp Asp Gln Arg Leu Arg Leu Phe Ser Leu Ala Arg Phe Glu Met His Asp Asp Gln Arg Leu Arg Leu Phe Ser Leu Ala Arg Phe Glu Met
35 40 45 35 40 45
Pro Gln Thr Leu Ile Glu Gln Tyr Glu Gly Ile Ile Gln Phe Ser Arg Pro Gln Thr Leu Ile Glu Gln Tyr Glu Gly Ile Ile Gln Phe Ser Arg
50 55 60 50 55 60
Asn Val Glu His Asn Val Glu Pro Leu Leu Glu Gln Leu Gln Thr Ile Asn Val Glu His Asn Val Glu Pro Leu Leu Glu Gln Leu Gln Thr Ile
65 70 75 80 65 70 75 80
Leu Ser Gln Glu Gly Lys Gln Phe Glu Leu His Leu His Leu Asn Leu Leu Ser Gln Glu Gly Lys Gln Phe Glu Leu His Leu His Leu Asn Leu
85 90 95 85 90 95
Phe His Ser Phe Glu Met Phe Leu Asn Leu Ser Pro Thr Tyr Thr Gln Phe His Ser Phe Glu Met Phe Leu Asn Leu Ser Pro Thr Tyr Thr Gln
100 105 110 100 105 110
Tyr Lys Glu Lys Ile Ser Lys Ile Val Leu His Leu Tyr Asp Asp Gly Tyr Lys Glu Lys Ile Ser Lys Ile Val Leu His Leu Tyr Asp Asp Gly
115 120 125 115 120 125
Ser Glu Gly Val Met Lys Gln Tyr Gln Leu Gln Lys Ser Ser Ser Leu Ser Glu Gly Val Met Lys Gln Tyr Gln Leu Gln Lys Ser Ser Ser Leu
130 135 140 130 135 140
Val Gln Asp Leu Ala Ala Thr Lys Ala Ser Leu Val Ser Leu Phe Glu Val Gln Asp Leu Ala Ala Thr Lys Ala Ser Leu Val Ser Leu Phe Glu
145 150 155 160 145 150 155 160
Asn Gly Glu Gly Ser Phe Ser Gln Ile Asp Leu Ile Arg Tyr Val Trp Asn Gly Glu Gly Ser Phe Ser Gln Ile Asp Leu Ile Arg Tyr Val Trp
165 170 175 165 170 175
Asn Ala Val Leu Glu Thr His Tyr Tyr Leu Leu Ser Asp His Phe Leu Asn Ala Val Leu Glu Thr His Tyr Tyr Leu Leu Ser Asp His Phe Leu
180 185 190 180 185 190
Leu Asp Glu Lys Leu Gln Pro Leu Lys Ala Glu Leu Gly His Tyr Gln Leu Asp Glu Lys Leu Gln Pro Leu Lys Ala Glu Leu Gly His Tyr Gln
195 200 205 195 200 205
Leu Leu Asn Leu Ser Ala Tyr Gln Tyr Leu Ser Ser Glu Asp Leu Leu Leu Leu Asn Leu Ser Ala Tyr Gln Tyr Leu Ser Ser Glu Asp Leu Leu
210 215 220 210 215 220
Trp Leu Lys Gln Ile Leu Lys Ile Asp Thr Glu Leu Glu Ser Leu Met Trp Leu Lys Gln Ile Leu Lys Ile Asp Thr Glu Leu Glu Ser Leu Met
225 230 235 240 225 230 235 240
Gln Lys Leu Thr Ala Gln Pro Val Tyr Phe Phe Ser Gly Thr Thr Phe Gln Lys Leu Thr Ala Gln Pro Val Tyr Phe Phe Ser Gly Thr Thr Phe
245 250 255 245 250 255
Phe Asn Ile Ser Phe Glu Asp Lys Gln Arg Leu Ala Asn Ile His Ala Phe Asn Ile Ser Phe Glu Asp Lys Gln Arg Leu Ala Asn Ile His Ala
260 265 270 260 265 270
Ile Leu Ile Arg Glu His Leu Asp Pro Asn Ser Gln Leu Phe Ile Gly Ile Leu Ile Arg Glu His Leu Asp Pro Asn Ser Gln Leu Phe Ile Gly
275 280 285 275 280 285
Glu Pro Tyr Leu Phe Val Phe Lys Gly His Pro Asn Ser Pro Glu Ile Glu Pro Tyr Leu Phe Val Phe Lys Gly His Pro Asn Ser Pro Glu Ile
290 295 300 290 295 300
Asn Gln Ala Leu Arg Glu Tyr Tyr Pro Asn Val Ile Phe Leu Pro Glu Asn Gln Ala Leu Arg Glu Tyr Tyr Pro Asn Val Ile Phe Leu Pro Glu
305 310 315 320 305 310 315 320
Asn Ile Pro Phe Glu Ile Leu Thr Leu Leu Gly Phe Ser Pro Gln Lys Asn Ile Pro Phe Glu Ile Leu Thr Leu Leu Gly Phe Ser Pro Gln Lys
325 330 335 325 330 335
Ile Gly Gly Phe Ala Ser Thr Ile His Val Asn Ser Glu Gln Ser Lys Ile Gly Gly Phe Ala Ser Thr Ile His Val Asn Ser Glu Gln Ser Lys
340 345 350 340 345 350
Leu Ala Lys Leu Phe Phe Leu Thr Ser Thr Asp Glu Gln Glu Arg Gln Leu Ala Lys Leu Phe Phe Leu Thr Ser Thr Asp Glu Gln Glu Arg Gln
355 360 365 355 360 365
Leu Ser Asp Gly Tyr Ile Lys Gln Tyr Ala Leu Ala Gln Ala Met Leu Leu Ser Asp Gly Tyr Ile Lys Gln Tyr Ala Leu Ala Gln Ala Met Leu
370 375 380 370 375 380
Glu Met Gln Leu Val Ser Gln Glu Gln Val Tyr Tyr Cys Ser Leu Ser Glu Met Gln Leu Val Ser Gln Glu Gln Val Tyr Tyr Cys Ser Leu Ser
385 390 395 400 385 390 395 400
Ser Ser
<210> 58<210> 58
<211> 401<211> 401
<212> Белок<212> Protein
<213> Actinobacillus capsulatus<213> Actinobacillus capsulatus
<400> 58<400> 58
Met Glu Arg Ile Pro Gln Leu Gln Ala Val Asp Ile Tyr Ile Asp Phe Met Glu Arg Ile Pro Gln Leu Gln Ala Val Asp Ile Tyr Ile Asp Phe
1 5 10 15 1 5 10 15
Ala Thr Ile Pro Ser Leu Ser Tyr Phe Leu His Phe Leu Lys His Lys Ala Thr Ile Pro Ser Leu Ser Tyr Phe Leu His Phe Leu Lys His Lys
20 25 30 20 25 30
His Asp His Gln Arg Leu Arg Leu Phe Ser Leu Ala Arg Phe Glu Met His Asp His Gln Arg Leu Arg Leu Phe Ser Leu Ala Arg Phe Glu Met
35 40 45 35 40 45
Pro Gln Thr Val Ile Glu Gln Tyr Glu Gly Ile Ile Gln Phe Ser Arg Pro Gln Thr Val Ile Glu Gln Tyr Glu Gly Ile Ile Gln Phe Ser Arg
50 55 60 50 55 60
Asn Val Glu His Asn Val Glu Gln Leu Leu Glu Gln Leu Gln Thr Ile Asn Val Glu His Asn Val Glu Gln Leu Leu Glu Gln Leu Gln Thr Ile
65 70 75 80 65 70 75 80
Leu Ser Gln Glu Gly Lys Gln Phe Glu Leu His Leu His Leu Asn Leu Leu Ser Gln Glu Gly Lys Gln Phe Glu Leu His Leu His Leu Asn Leu
85 90 95 85 90 95
Phe His Ser Phe Glu Met Phe Leu Asn Leu Ser Pro Thr Tyr Thr Lys Phe His Ser Phe Glu Met Phe Leu Asn Leu Ser Pro Thr Tyr Thr Lys
100 105 110 100 105 110
Tyr Lys Glu Lys Ile Ser Lys Ile Val Leu His Leu Tyr Asp Asp Gly Tyr Lys Glu Lys Ile Ser Lys Ile Val Leu His Leu Tyr Asp Asp Gly
115 120 125 115 120 125
Ser Glu Gly Val Met Lys Gln Tyr Gln Leu Gln Gln Ser Asn Ser Leu Ser Glu Gly Val Met Lys Gln Tyr Gln Leu Gln Gln Ser Asn Ser Leu
130 135 140 130 135 140
Ala Gln Asp Leu Ala Ser Thr Lys Ala Ser Leu Val Ser Leu Phe Lys Ala Gln Asp Leu Ala Ser Thr Lys Ala Ser Leu Val Ser Leu Phe Lys
145 150 155 160 145 150 155 160
Asn Gly Glu Gly Ala Phe Ser Gln Ile Asp Leu Ile Arg Tyr Val Trp Asn Gly Glu Gly Ala Phe Ser Gln Ile Asp Leu Ile Arg Tyr Val Trp
165 170 175 165 170 175
Asn Ala Val Leu Glu Thr His Tyr Tyr Leu Leu Ser Asp His Phe Leu Asn Ala Val Leu Glu Thr His Tyr Tyr Leu Leu Ser Asp His Phe Leu
180 185 190 180 185 190
Ala His Glu Lys Leu Gln Pro Leu Lys Ile Glu Leu Gly His Tyr Gln Ala His Glu Lys Leu Gln Pro Leu Lys Ile Glu Leu Gly His Tyr Gln
195 200 205 195 200 205
Leu Leu Asn Leu Ser Ala Tyr Gln Tyr Leu Ser Ser Glu Asp Leu Leu Leu Leu Asn Leu Ser Ala Tyr Gln Tyr Leu Ser Ser Glu Asp Leu Leu
210 215 220 210 215 220
Trp Leu Lys Gln Ile Leu Lys Ile Asp Ala Glu Leu Glu Ser Leu Met Trp Leu Lys Gln Ile Leu Lys Ile Asp Ala Glu Leu Glu Ser Leu Met
225 230 235 240 225 230 235 240
His Lys Leu Thr Thr Gln Pro Val Tyr Phe Phe Ser Gly Thr Thr Phe His Lys Leu Thr Thr Gln Pro Val Tyr Phe Phe Ser Gly Thr Thr Phe
245 250 255 245 250 255
Phe Asn Ile Ser Phe Glu Asp Lys Gln Arg Leu Ala Asn Ile His Ala Phe Asn Ile Ser Phe Glu Asp Lys Gln Arg Leu Ala Asn Ile His Ala
260 265 270 260 265 270
Ile Leu Ile Arg Glu His Leu Asp Pro Asn Ser Gln Leu Phe Ile Gly Ile Leu Ile Arg Glu His Leu Asp Pro Asn Ser Gln Leu Phe Ile Gly
275 280 285 275 280 285
Glu Pro Tyr Leu Phe Val Phe Lys Gly His Pro Asn Ser Pro Glu Ile Glu Pro Tyr Leu Phe Val Phe Lys Gly His Pro Asn Ser Pro Glu Ile
290 295 300 290 295 300
Asn Gln Ala Leu Arg Glu Tyr Tyr Pro Asn Ala Ile Phe Leu Pro Glu Asn Gln Ala Leu Arg Glu Tyr Tyr Pro Asn Ala Ile Phe Leu Pro Glu
305 310 315 320 305 310 315 320
Asn Ile Pro Phe Glu Ile Leu Thr Leu Leu Gly Phe Ser Pro Gln Lys Asn Ile Pro Phe Glu Ile Leu Thr Leu Leu Gly Phe Ser Pro Gln Lys
325 330 335 325 330 335
Ile Gly Gly Phe Ala Ser Thr Ile His Val Asn Ser Glu Gln Ser Lys Ile Gly Gly Phe Ala Ser Thr Ile His Val Asn Ser Glu Gln Ser Lys
340 345 350 340 345 350
Leu Ala Lys Leu Phe Phe Leu Thr Ser Thr Asp Glu Gln Glu Arg Asn Leu Ala Lys Leu Phe Phe Leu Thr Ser Thr Asp Glu Gln Glu Arg Asn
355 360 365 355 360 365
Arg Ser Asp Gly Tyr Ile Lys Gln Tyr Ala Leu Ala Gln Ala Met Leu Arg Ser Asp Gly Tyr Ile Lys Gln Tyr Ala Leu Ala Gln Ala Met Leu
370 375 380 370 375 380
Glu Met Gln Leu Val Ser Gln Glu Gln Val Tyr Tyr Cys Ser Leu Ser Glu Met Gln Leu Val Ser Gln Glu Gln Val Tyr Tyr Cys Ser Leu Ser
385 390 395 400 385 390 395 400
Ser Ser
<210> 59<210> 59
<211> 311<211> 311
<212> Белок<212> Protein
<213> Haemophilus somnus<213> Haemophilus somnus
<400> 59<400> 59
Met Phe Arg Glu Asp Asn Met Asn Leu Ile Ile Cys Cys Thr Pro Leu Met Phe Arg Glu Asp Asn Met Asn Leu Ile Ile Cys Cys Thr Pro Leu
1 5 10 15 1 5 10 15
Gln Val Ile Ile Ala Glu Lys Ile Ile Glu Arg Tyr Pro Glu Gln Lys Gln Val Ile Ile Ala Glu Lys Ile Ile Glu Arg Tyr Pro Glu Gln Lys
20 25 30 20 25 30
Phe Tyr Gly Val Met Leu Glu Ser Phe Tyr Asn Asp Lys Phe Asp Phe Phe Tyr Gly Val Met Leu Glu Ser Phe Tyr Asn Asp Lys Phe Asp Phe
35 40 45 35 40 45
Tyr Glu Asn Lys Leu Lys His Leu Cys His Glu Phe Phe Cys Ile Lys Tyr Glu Asn Lys Leu Lys His Leu Cys His Glu Phe Phe Cys Ile Lys
50 55 60 50 55 60
Ile Ala Arg Phe Lys Leu Glu Arg Tyr Lys Asn Leu Leu Ser Leu Leu Ile Ala Arg Phe Lys Leu Glu Arg Tyr Lys Asn Leu Leu Ser Leu Leu
65 70 75 80 65 70 75 80
Lys Ile Lys Asn Lys Thr Phe Asp Arg Val Phe Leu Ala Asn Ile Glu Lys Ile Lys Asn Lys Thr Phe Asp Arg Val Phe Leu Ala Asn Ile Glu
85 90 95 85 90 95
Lys Arg Tyr Ile His Ile Ile Leu Ser Asn Ile Phe Phe Lys Glu Leu Lys Arg Tyr Ile His Ile Ile Leu Ser Asn Ile Phe Phe Lys Glu Leu
100 105 110 100 105 110
Tyr Thr Phe Asp Asp Gly Thr Ala Asn Ile Ala Pro Asn Ser His Leu Tyr Thr Phe Asp Asp Gly Thr Ala Asn Ile Ala Pro Asn Ser His Leu
115 120 125 115 120 125
Tyr Gln Glu Tyr Asp His Ser Leu Lys Lys Arg Ile Thr Asp Ile Leu Tyr Gln Glu Tyr Asp His Ser Leu Lys Lys Arg Ile Thr Asp Ile Leu
130 135 140 130 135 140
Leu Pro Asn His Tyr Asn Ser Asn Lys Val Lys Asn Ile Ser Lys Leu Leu Pro Asn His Tyr Asn Ser Asn Lys Val Lys Asn Ile Ser Lys Leu
145 150 155 160 145 150 155 160
His Tyr Ser Ile Tyr Arg Cys Lys Asn Asn Ile Ile Asp Asn Ile Glu His Tyr Ser Ile Tyr Arg Cys Lys Asn Asn Ile Ile Asp Asn Ile Glu
165 170 175 165 170 175
Tyr Met Pro Leu Phe Asn Leu Glu Lys Lys Tyr Thr Ala Gln Asp Lys Tyr Met Pro Leu Phe Asn Leu Glu Lys Lys Tyr Thr Ala Gln Asp Lys
180 185 190 180 185 190
Ser Ile Ser Ile Leu Leu Gly Gln Pro Ile Phe Tyr Asp Glu Glu Lys Ser Ile Ser Ile Leu Leu Gly Gln Pro Ile Phe Tyr Asp Glu Glu Lys
195 200 205 195 200 205
Asn Ile Arg Leu Ile Lys Glu Val Ile Ala Lys Phe Lys Ile Asp Tyr Asn Ile Arg Leu Ile Lys Glu Val Ile Ala Lys Phe Lys Ile Asp Tyr
210 215 220 210 215 220
Tyr Phe Pro His Pro Arg Glu Asp Tyr Tyr Ile Asp Asn Val Ser Tyr Tyr Phe Pro His Pro Arg Glu Asp Tyr Tyr Ile Asp Asn Val Ser Tyr
225 230 235 240 225 230 235 240
Ile Lys Thr Pro Leu Ile Phe Glu Glu Phe Tyr Ala Glu Arg Ser Ile Ile Lys Thr Pro Leu Ile Phe Glu Glu Phe Tyr Ala Glu Arg Ser Ile
245 250 255 245 250 255
Glu Asn Ser Ile Lys Ile Tyr Thr Phe Phe Ser Ser Ala Val Leu Asn Glu Asn Ser Ile Lys Ile Tyr Thr Phe Phe Ser Ser Ala Val Leu Asn
260 265 270 260 265 270
Ile Val Thr Lys Glu Asn Ile Asp Arg Ile Tyr Ala Leu Lys Pro Lys Ile Val Thr Lys Glu Asn Ile Asp Arg Ile Tyr Ala Leu Lys Pro Lys
275 280 285 275 280 285
Leu Thr Glu Lys Ala Tyr Leu Asp Cys Tyr Asp Ile Leu Lys Asp Phe Leu Thr Glu Lys Ala Tyr Leu Asp Cys Tyr Asp Ile Leu Lys Asp Phe
290 295 300 290 295 300
Gly Ile Lys Val Ile Asp Ile Gly Ile Lys Val Ile Asp Ile
305 310 305 310
<210> 60<210> 60
<211> 399<211> 399
<212> Белок<212> Protein
<213> Haemophilus ducreyi<213> Haemophilus ducreyi
<400> 60<400> 60
Met Leu Ile Gln Gln Asn Leu Glu Ile Tyr Leu Asp Tyr Ala Thr Ile Met Leu Ile Gln Gln Asn Leu Glu Ile Tyr Leu Asp Tyr Ala Thr Ile
1 5 10 15 1 5 10 15
Pro Ser Leu Ala Cys Phe Met His Phe Ile Gln His Lys Asp Asp Val Pro Ser Leu Ala Cys Phe Met His Phe Ile Gln His Lys Asp Asp Val
20 25 30 20 25 30
Asp Ser Ile Arg Leu Phe Gly Leu Ala Arg Phe Asp Ile Pro Gln Ser Asp Ser Ile Arg Leu Phe Gly Leu Ala Arg Phe Asp Ile Pro Gln Ser
35 40 45 35 40 45
Ile Ile Asp Arg Tyr Pro Ala Asn His Leu Phe Tyr His Asn Ile Asp Ile Ile Asp Arg Tyr Pro Ala Asn His Leu Phe Tyr His Asn Ile Asp
50 55 60 50 55 60
Asn Arg Asp Leu Thr Ala Val Leu Asn Gln Leu Ala Asp Ile Leu Ala Asn Arg Asp Leu Thr Ala Val Leu Asn Gln Leu Ala Asp Ile Leu Ala
65 70 75 80 65 70 75 80
Gln Glu Asn Lys Arg Phe Gln Ile Asn Leu His Leu Asn Leu Phe His Gln Glu Asn Lys Arg Phe Gln Ile Asn Leu His Leu Asn Leu Phe His
85 90 95 85 90 95
Ser Ile Asp Leu Phe Phe Ala Ile Tyr Pro Ile Tyr Gln Gln Tyr Gln Ser Ile Asp Leu Phe Phe Ala Ile Tyr Pro Ile Tyr Gln Gln Tyr Gln
100 105 110 100 105 110
His Lys Ile Ser Thr Ile Gln Leu Gln Leu Tyr Asp Asp Gly Ser Glu His Lys Ile Ser Thr Ile Gln Leu Gln Leu Tyr Asp Asp Gly Ser Glu
115 120 125 115 120 125
Gly Ile Val Thr Gln His Ser Leu Cys Lys Ile Ala Asp Leu Glu Gln Gly Ile Val Thr Gln His Ser Leu Cys Lys Ile Ala Asp Leu Glu Gln
130 135 140 130 135 140
Leu Ile Leu Gln His Lys Asn Val Leu Leu Glu Leu Leu Thr Lys Gly Leu Ile Leu Gln His Lys Asn Val Leu Leu Glu Leu Leu Thr Lys Gly
145 150 155 160 145 150 155 160
Thr Ala Asn Val Pro Asn Pro Thr Leu Leu Arg Tyr Leu Trp Asn Asn Thr Ala Asn Val Pro Asn Pro Thr Leu Leu Arg Tyr Leu Trp Asn Asn
165 170 175 165 170 175
Ile Ile Asp Ser Gln Phe His Leu Ile Ser Asp His Phe Leu Gln His Ile Ile Asp Ser Gln Phe His Leu Ile Ser Asp His Phe Leu Gln His
180 185 190 180 185 190
Pro Lys Leu Gln Pro Leu Lys Arg Leu Leu Lys Arg Tyr Thr Ile Leu Pro Lys Leu Gln Pro Leu Lys Arg Leu Leu Lys Arg Tyr Thr Ile Leu
195 200 205 195 200 205
Asp Phe Thr Cys Tyr Pro Arg Phe Asn Ala Glu Gln Lys Gln Leu Leu Asp Phe Thr Cys Tyr Pro Arg Phe Asn Ala Glu Gln Lys Gln Leu Leu
210 215 220 210 215 220
Lys Glu Ile Leu His Ile Ser Asn Glu Leu Glu Asn Leu Leu Lys Leu Lys Glu Ile Leu His Ile Ser Asn Glu Leu Glu Asn Leu Leu Lys Leu
225 230 235 240 225 230 235 240
Leu Lys Gln His Asn Thr Phe Leu Phe Thr Gly Thr Thr Ala Phe Asn Leu Lys Gln His Asn Thr Phe Leu Phe Thr Gly Thr Thr Ala Phe Asn
245 250 255 245 250 255
Leu Asp Gln Glu Lys Leu Asp Leu Leu Thr Gln Leu His Ile Leu Leu Leu Asp Gln Glu Lys Leu Asp Leu Leu Thr Gln Leu His Ile Leu Leu
260 265 270 260 265 270
Leu Asn Glu His Gln Asn Pro His Ser Thr His Tyr Ile Gly Asn Asn Leu Asn Glu His Gln Asn Pro His Ser Thr His Tyr Ile Gly Asn Asn
275 280 285 275 280 285
Tyr Leu Leu Leu Ile Lys Gly His Ala Asn Ser Pro Ala Leu Asn His Tyr Leu Leu Leu Ile Lys Gly His Ala Asn Ser Pro Ala Leu Asn His
290 295 300 290 295 300
Thr Leu Ala Leu His Phe Pro Asp Ala Ile Phe Leu Pro Ala Asn Ile Thr Leu Ala Leu His Phe Pro Asp Ala Ile Phe Leu Pro Ala Asn Ile
305 310 315 320 305 310 315 320
Pro Phe Glu Ile Phe Ala Met Leu Gly Phe Thr Pro Asn Lys Met Gly Pro Phe Glu Ile Phe Ala Met Leu Gly Phe Thr Pro Asn Lys Met Gly
325 330 335 325 330 335
Gly Phe Ala Ser Thr Ser Tyr Ile Asn Tyr Pro Thr Glu Asn Ile Asn Gly Phe Ala Ser Thr Ser Tyr Ile Asn Tyr Pro Thr Glu Asn Ile Asn
340 345 350 340 345 350
His Leu Phe Phe Leu Thr Ser Asp Gln Pro Ser Ile Arg Thr Lys Trp His Leu Phe Phe Leu Thr Ser Asp Gln Pro Ser Ile Arg Thr Lys Trp
355 360 365 355 360 365
Leu Asp Tyr Glu Lys Gln Phe Gly Leu Met Tyr Ser Leu Leu Ala Met Leu Asp Tyr Glu Lys Gln Phe Gly Leu Met Tyr Ser Leu Leu Ala Met
370 375 380 370 375 380
Gln Lys Ile Asn Glu Asp Gln Ala Phe Met Cys Thr Ile His Asn Gln Lys Ile Asn Glu Asp Gln Ala Phe Met Cys Thr Ile His Asn
385 390 395 385 390 395
<210> 61<210> 61
<211> 497<211> 497
<212> Белок<212> Protein
<213> Photobacterium leiognathi<213> Photobacterium leiognathi
<400> 61<400> 61
Met Cys Asn Asp Asn Gln Asn Thr Val Asp Val Val Val Ser Thr Val Met Cys Asn Asp Asn Gln Asn Thr Val Asp Val Val Val Ser Thr Val
1 5 10 15 1 5 10 15
Asn Asp Asn Val Ile Glu Asn Asn Thr Tyr Gln Val Lys Pro Ile Asp Asn Asp Asn Val Ile Glu Asn Asn Thr Tyr Gln Val Lys Pro Ile Asp
20 25 30 20 25 30
Thr Pro Thr Thr Phe Asp Ser Tyr Ser Trp Ile Gln Thr Cys Gly Thr Thr Pro Thr Thr Phe Asp Ser Tyr Ser Trp Ile Gln Thr Cys Gly Thr
35 40 45 35 40 45
Pro Ile Leu Lys Asp Asp Glu Lys Tyr Ser Leu Ser Phe Asp Phe Val Pro Ile Leu Lys Asp Asp Glu Lys Tyr Ser Leu Ser Phe Asp Phe Val
50 55 60 50 55 60
Ala Pro Glu Leu Asp Gln Asp Glu Lys Phe Cys Phe Glu Phe Thr Gly Ala Pro Glu Leu Asp Gln Asp Glu Lys Phe Cys Phe Glu Phe Thr Gly
65 70 75 80 65 70 75 80
Asp Val Asp Gly Lys Arg Tyr Val Thr Gln Thr Asn Leu Thr Val Val Asp Val Asp Gly Lys Arg Tyr Val Thr Gln Thr Asn Leu Thr Val Val
85 90 95 85 90 95
Ala Pro Thr Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Ser Leu Ala Pro Thr Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Ser Leu
100 105 110 100 105 110
Gln Gln Leu Met Lys Ile Ile Gln Gln Lys Asn Glu Tyr Ser Gln Asn Gln Gln Leu Met Lys Ile Ile Gln Gln Lys Asn Glu Tyr Ser Gln Asn
115 120 125 115 120 125
Glu Arg Phe Ile Ser Trp Gly Arg Ile Gly Leu Thr Glu Asp Asn Ala Glu Arg Phe Ile Ser Trp Gly Arg Ile Gly Leu Thr Glu Asp Asn Ala
130 135 140 130 135 140
Glu Lys Leu Asn Ala His Ile Tyr Pro Leu Ala Gly Asn Asn Thr Ser Glu Lys Leu Asn Ala His Ile Tyr Pro Leu Ala Gly Asn Asn Thr Ser
145 150 155 160 145 150 155 160
Gln Glu Leu Val Asp Ala Val Ile Asp Tyr Ala Asp Ser Lys Asn Arg Gln Glu Leu Val Asp Ala Val Ile Asp Tyr Ala Asp Ser Lys Asn Arg
165 170 175 165 170 175
Leu Asn Leu Glu Leu Asn Thr Asn Thr Ala His Ser Phe Pro Asn Leu Leu Asn Leu Glu Leu Asn Thr Asn Thr Ala His Ser Phe Pro Asn Leu
180 185 190 180 185 190
Ala Pro Ile Leu Arg Ile Ile Ser Ser Lys Ser Asn Ile Leu Ile Ser Ala Pro Ile Leu Arg Ile Ile Ser Ser Lys Ser Asn Ile Leu Ile Ser
195 200 205 195 200 205
Asn Ile Asn Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Asn Leu Tyr Asn Ile Asn Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Asn Leu Tyr
210 215 220 210 215 220
Asn Trp Lys Asp Thr Glu Asp Lys Ser Val Lys Leu Ser Asp Ser Phe Asn Trp Lys Asp Thr Glu Asp Lys Ser Val Lys Leu Ser Asp Ser Phe
225 230 235 240 225 230 235 240
Leu Val Leu Lys Asp Tyr Phe Asn Gly Ile Ser Ser Glu Lys Pro Ser Leu Val Leu Lys Asp Tyr Phe Asn Gly Ile Ser Ser Glu Lys Pro Ser
245 250 255 245 250 255
Gly Ile Tyr Gly Arg Tyr Asn Trp His Gln Leu Tyr Asn Thr Ser Tyr Gly Ile Tyr Gly Arg Tyr Asn Trp His Gln Leu Tyr Asn Thr Ser Tyr
260 265 270 260 265 270
Tyr Phe Leu Arg Lys Asp Tyr Leu Thr Val Glu Pro Gln Leu His Asp Tyr Phe Leu Arg Lys Asp Tyr Leu Thr Val Glu Pro Gln Leu His Asp
275 280 285 275 280 285
Leu Arg Glu Tyr Leu Gly Gly Ser Leu Lys Gln Met Ser Trp Asp Gly Leu Arg Glu Tyr Leu Gly Gly Ser Leu Lys Gln Met Ser Trp Asp Gly
290 295 300 290 295 300
Phe Ser Gln Leu Ser Lys Gly Asp Lys Glu Leu Phe Leu Asn Ile Val Phe Ser Gln Leu Ser Lys Gly Asp Lys Glu Leu Phe Leu Asn Ile Val
305 310 315 320 305 310 315 320
Gly Phe Asp Gln Glu Lys Leu Gln Gln Glu Tyr Gln Gln Ser Glu Leu Gly Phe Asp Gln Glu Lys Leu Gln Gln Glu Tyr Gln Gln Ser Glu Leu
325 330 335 325 330 335
Pro Asn Phe Val Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Pro Asn Phe Val Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr
340 345 350 340 345 350
Lys Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Val Asn Asn Ala Ile Lys Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Val Asn Asn Ala Ile
355 360 365 355 360 365
Asn Glu Thr Ser Pro Tyr Tyr Leu Gly Arg Glu His Asp Leu Phe Phe Asn Glu Thr Ser Pro Tyr Tyr Leu Gly Arg Glu His Asp Leu Phe Phe
370 375 380 370 375 380
Lys Gly His Pro Arg Gly Gly Ile Ile Asn Asp Ile Ile Leu Gly Ser Lys Gly His Pro Arg Gly Gly Ile Ile Asn Asp Ile Ile Leu Gly Ser
385 390 395 400 385 390 395 400
Phe Asn Asn Met Ile Asp Ile Pro Ala Lys Val Ser Phe Glu Val Leu Phe Asn Asn Met Ile Asp Ile Pro Ala Lys Val Ser Phe Glu Val Leu
405 410 415 405 410 415
Met Met Thr Gly Met Leu Pro Asp Thr Val Gly Gly Ile Ala Ser Ser Met Met Thr Gly Met Leu Pro Asp Thr Val Gly Gly Ile Ala Ser Ser
420 425 430 420 425 430
Leu Tyr Phe Ser Ile Pro Ala Glu Lys Val Ser Phe Ile Val Phe Thr Leu Tyr Phe Ser Ile Pro Ala Glu Lys Val Ser Phe Ile Val Phe Thr
435 440 445 435 440 445
Ser Ser Asp Thr Ile Thr Asp Arg Glu Asp Ala Leu Lys Ser Pro Leu Ser Ser Asp Thr Ile Thr Asp Arg Glu Asp Ala Leu Lys Ser Pro Leu
450 455 460 450 455 460
Val Gln Val Met Met Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Val Gln Val Met Met Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu
465 470 475 480 465 470 475 480
Phe Trp Ser Asp Leu Pro Asp Cys Ser Ser Gly Val Cys Ile Ala Gln Phe Trp Ser Asp Leu Pro Asp Cys Ser Ser Gly Val Cys Ile Ala Gln
485 490 495 485 490 495
Tyr Tyr
<210> 62<210> 62
<211> 498<211> 498
<212> Белок<212> Protein
<213> Photobacterium sp.<213> Photobacterium sp.
<400> 62<400> 62
Met Ser Glu Glu Asn Thr Gln Ser Ile Ile Lys Asn Asp Ile Asn Lys Met Ser Glu Glu Asn Thr Gln Ser Ile Ile Lys Asn Asp Ile Asn Lys
1 5 10 15 1 5 10 15
Thr Ile Ile Asp Glu Glu Tyr Val Asn Leu Glu Pro Ile Asn Gln Ser Thr Ile Ile Asp Glu Glu Tyr Val Asn Leu Glu Pro Ile Asn Gln Ser
20 25 30 20 25 30
Asn Ile Ser Phe Thr Lys His Ser Trp Val Gln Thr Cys Gly Thr Gln Asn Ile Ser Phe Thr Lys His Ser Trp Val Gln Thr Cys Gly Thr Gln
35 40 45 35 40 45
Gln Leu Leu Thr Glu Gln Asn Lys Glu Ser Ile Ser Leu Ser Val Val Gln Leu Leu Thr Glu Gln Asn Lys Glu Ser Ile Ser Leu Ser Val Val
50 55 60 50 55 60
Ala Pro Arg Leu Asp Asp Asp Glu Lys Tyr Cys Phe Asp Phe Asn Gly Ala Pro Arg Leu Asp Asp Asp Glu Lys Tyr Cys Phe Asp Phe Asn Gly
65 70 75 80 65 70 75 80
Val Ser Asn Lys Gly Glu Lys Tyr Ile Thr Lys Val Thr Leu Asn Val Val Ser Asn Lys Gly Glu Lys Tyr Ile Thr Lys Val Thr Leu Asn Val
85 90 95 85 90 95
Val Ala Pro Ser Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Thr Val Ala Pro Ser Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Thr
100 105 110 100 105 110
Leu Gln Gln Leu Met Asp Ile Ile Lys Ser Glu Glu Glu Asn Pro Thr Leu Gln Gln Leu Met Asp Ile Ile Lys Ser Glu Glu Glu Asn Pro Thr
115 120 125 115 120 125
Ala Gln Arg Tyr Ile Ala Trp Gly Arg Ile Val Pro Thr Asp Glu Gln Ala Gln Arg Tyr Ile Ala Trp Gly Arg Ile Val Pro Thr Asp Glu Gln
130 135 140 130 135 140
Met Lys Glu Leu Asn Ile Thr Ser Phe Ala Leu Ile Asn Asn His Thr Met Lys Glu Leu Asn Ile Thr Ser Phe Ala Leu Ile Asn Asn His Thr
145 150 155 160 145 150 155 160
Pro Ala Asp Leu Val Gln Glu Ile Val Lys Gln Ala Gln Thr Lys His Pro Ala Asp Leu Val Gln Glu Ile Val Lys Gln Ala Gln Thr Lys His
165 170 175 165 170 175
Arg Leu Asn Val Lys Leu Ser Ser Asn Thr Ala His Ser Phe Asp Asn Arg Leu Asn Val Lys Leu Ser Ser Asn Thr Ala His Ser Phe Asp Asn
180 185 190 180 185 190
Leu Val Pro Ile Leu Lys Glu Leu Asn Ser Phe Asn Asn Val Thr Val Leu Val Pro Ile Leu Lys Glu Leu Asn Ser Phe Asn Asn Val Thr Val
195 200 205 195 200 205
Thr Asn Ile Asp Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Asn Leu Thr Asn Ile Asp Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Asn Leu
210 215 220 210 215 220
Tyr Asn Trp Arg Asp Thr Leu Asn Lys Thr Asp Asn Leu Lys Ile Gly Tyr Asn Trp Arg Asp Thr Leu Asn Lys Thr Asp Asn Leu Lys Ile Gly
225 230 235 240 225 230 235 240
Lys Asp Tyr Leu Glu Asp Val Ile Asn Gly Ile Asn Glu Asp Thr Ser Lys Asp Tyr Leu Glu Asp Val Ile Asn Gly Ile Asn Glu Asp Thr Ser
245 250 255 245 250 255
Asn Thr Gly Thr Ser Ser Val Tyr Asn Trp Gln Lys Leu Tyr Pro Ala Asn Thr Gly Thr Ser Ser Val Tyr Asn Trp Gln Lys Leu Tyr Pro Ala
260 265 270 260 265 270
Asn Tyr His Phe Leu Arg Lys Asp Tyr Leu Thr Leu Glu Pro Ser Leu Asn Tyr His Phe Leu Arg Lys Asp Tyr Leu Thr Leu Glu Pro Ser Leu
275 280 285 275 280 285
His Glu Leu Arg Asp Tyr Ile Gly Asp Ser Leu Lys Gln Met Gln Trp His Glu Leu Arg Asp Tyr Ile Gly Asp Ser Leu Lys Gln Met Gln Trp
290 295 300 290 295 300
Asp Gly Phe Lys Lys Phe Asn Ser Lys Gln Gln Glu Leu Phe Leu Ser Asp Gly Phe Lys Lys Phe Asn Ser Lys Gln Gln Glu Leu Phe Leu Ser
305 310 315 320 305 310 315 320
Ile Val Asn Phe Asp Lys Gln Lys Leu Gln Asn Glu Tyr Asn Ser Ser Ile Val Asn Phe Asp Lys Gln Lys Leu Gln Asn Glu Tyr Asn Ser Ser
325 330 335 325 330 335
Asn Leu Pro Asn Phe Val Phe Thr Gly Thr Thr Val Trp Ala Gly Asn Asn Leu Pro Asn Phe Val Phe Thr Gly Thr Thr Val Trp Ala Gly Asn
340 345 350 340 345 350
His Glu Arg Glu Tyr Tyr Ala Lys Gln Gln Ile Asn Val Ile Asn Asn His Glu Arg Glu Tyr Tyr Ala Lys Gln Gln Ile Asn Val Ile Asn Asn
355 360 365 355 360 365
Ala Ile Asn Glu Ser Ser Pro His Tyr Leu Gly Asn Ser Tyr Asp Leu Ala Ile Asn Glu Ser Ser Pro His Tyr Leu Gly Asn Ser Tyr Asp Leu
370 375 380 370 375 380
Phe Phe Lys Gly His Pro Gly Gly Gly Ile Ile Asn Thr Leu Ile Met Phe Phe Lys Gly His Pro Gly Gly Gly Ile Ile Asn Thr Leu Ile Met
385 390 395 400 385 390 395 400
Gln Asn Tyr Pro Ser Met Val Asp Ile Pro Ser Lys Ile Ser Phe Glu Gln Asn Tyr Pro Ser Met Val Asp Ile Pro Ser Lys Ile Ser Phe Glu
405 410 415 405 410 415
Val Leu Met Met Thr Asp Met Leu Pro Asp Ala Val Ala Gly Ile Ala Val Leu Met Met Thr Asp Met Leu Pro Asp Ala Val Ala Gly Ile Ala
420 425 430 420 425 430
Ser Ser Leu Tyr Phe Thr Ile Pro Ala Glu Lys Ile Lys Phe Ile Val Ser Ser Leu Tyr Phe Thr Ile Pro Ala Glu Lys Ile Lys Phe Ile Val
435 440 445 435 440 445
Phe Thr Ser Thr Glu Thr Ile Thr Asp Arg Glu Thr Ala Leu Arg Ser Phe Thr Ser Thr Glu Thr Ile Thr Asp Arg Glu Thr Ala Leu Arg Ser
450 455 460 450 455 460
Pro Leu Val Gln Val Met Ile Lys Leu Gly Ile Val Lys Glu Glu Asn Pro Leu Val Gln Val Met Ile Lys Leu Gly Ile Val Lys Glu Glu Asn
465 470 475 480 465 470 475 480
Val Leu Phe Trp Ala Asp Leu Pro Asn Cys Glu Thr Gly Val Cys Ile Val Leu Phe Trp Ala Asp Leu Pro Asn Cys Glu Thr Gly Val Cys Ile
485 490 495 485 490 495
Ala Val Ala Val
<210> 63<210> 63
<211> 482<211> 482
<212> Белок<212> Protein
<213> Photobacterium leiognathi<213> Photobacterium leiognathi
<400> 63<400> 63
Met Asn Asp Asn Gln Asn Thr Val Asp Val Val Val Ser Thr Val Asn Met Asn Asp Asn Gln Asn Thr Val Asp Val Val Val Ser Thr Val Asn
1 5 10 15 1 5 10 15
Asp Asn Val Ile Glu Asn Asn Thr Tyr Gln Val Lys Pro Ile Asp Thr Asp Asn Val Ile Glu Asn Asn Thr Tyr Gln Val Lys Pro Ile Asp Thr
20 25 30 20 25 30
Pro Thr Thr Phe Asp Ser Tyr Ser Trp Ile Gln Thr Cys Gly Thr Pro Pro Thr Thr Phe Asp Ser Tyr Ser Trp Ile Gln Thr Cys Gly Thr Pro
35 40 45 35 40 45
Ile Leu Lys Asp Asp Glu Lys Tyr Ser Leu Ser Phe Asp Phe Val Ala Ile Leu Lys Asp Asp Glu Lys Tyr Ser Leu Ser Phe Asp Phe Val Ala
50 55 60 50 55 60
Pro Glu Leu Asp Gln Asp Glu Lys Phe Cys Phe Glu Phe Thr Gly Asp Pro Glu Leu Asp Gln Asp Glu Lys Phe Cys Phe Glu Phe Thr Gly Asp
65 70 75 80 65 70 75 80
Val Asp Gly Lys Arg Tyr Val Thr Gln Thr Asn Leu Thr Val Val Ala Val Asp Gly Lys Arg Tyr Val Thr Gln Thr Asn Leu Thr Val Val Ala
85 90 95 85 90 95
Pro Thr Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Ser Leu Gln Pro Thr Leu Glu Val Tyr Val Asp His Ala Ser Leu Pro Ser Leu Gln
100 105 110 100 105 110
Gln Leu Met Lys Ile Ile Gln Gln Lys Asn Glu Tyr Ser Gln Asn Glu Gln Leu Met Lys Ile Ile Gln Gln Lys Asn Glu Tyr Ser Gln Asn Glu
115 120 125 115 120 125
Arg Phe Ile Ser Trp Gly Arg Ile Arg Leu Thr Glu Asp Asn Ala Glu Arg Phe Ile Ser Trp Gly Arg Ile Arg Leu Thr Glu Asp Asn Ala Glu
130 135 140 130 135 140
Lys Leu Asn Ala His Ile Tyr Pro Leu Ala Gly Asn Asn Thr Ser Gln Lys Leu Asn Ala His Ile Tyr Pro Leu Ala Gly Asn Asn Thr Ser Gln
145 150 155 160 145 150 155 160
Glu Leu Val Asp Ala Val Ile Asp Tyr Ala Asp Ser Lys Asn Arg Leu Glu Leu Val Asp Ala Val Ile Asp Tyr Ala Asp Ser Lys Asn Arg Leu
165 170 175 165 170 175
Asn Leu Glu Leu Asn Thr Asn Thr Gly His Ser Phe Arg Asn Ile Ala Asn Leu Glu Leu Asn Thr Asn Thr Gly His Ser Phe Arg Asn Ile Ala
180 185 190 180 185 190
Pro Ile Leu Arg Ala Thr Ser Ser Lys Asn Asn Ile Leu Ile Ser Asn Pro Ile Leu Arg Ala Thr Ser Ser Lys Asn Asn Ile Leu Ile Ser Asn
195 200 205 195 200 205
Ile Asn Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Ser Leu Tyr Asn Ile Asn Leu Tyr Asp Asp Gly Ser Ala Glu Tyr Val Ser Leu Tyr Asn
210 215 220 210 215 220
Trp Lys Asp Thr Asp Asn Lys Ser Gln Lys Leu Ser Asp Ser Phe Leu Trp Lys Asp Thr Asp Asn Lys Ser Gln Lys Leu Ser Asp Ser Phe Leu
225 230 235 240 225 230 235 240
Val Leu Lys Asp Tyr Leu Asn Gly Ile Ser Ser Glu Lys Pro Asn Gly Val Leu Lys Asp Tyr Leu Asn Gly Ile Ser Ser Glu Lys Pro Asn Gly
245 250 255 245 250 255
Ile Tyr Ser Ile Tyr Asn Trp His Gln Leu Tyr His Ser Ser Tyr Tyr Ile Tyr Ser Ile Tyr Asn Trp His Gln Leu Tyr His Ser Ser Tyr Tyr
260 265 270 260 265 270
Phe Leu Arg Lys Asp Tyr Leu Thr Val Glu Thr Lys Leu His Asp Leu Phe Leu Arg Lys Asp Tyr Leu Thr Val Glu Thr Lys Leu His Asp Leu
275 280 285 275 280 285
Arg Glu Tyr Leu Gly Gly Ser Leu Lys Gln Met Ser Trp Asp Thr Phe Arg Glu Tyr Leu Gly Gly Ser Leu Lys Gln Met Ser Trp Asp Thr Phe
290 295 300 290 295 300
Ser Gln Leu Ser Lys Gly Asp Lys Glu Leu Phe Leu Asn Ile Val Gly Ser Gln Leu Ser Lys Gly Asp Lys Glu Leu Phe Leu Asn Ile Val Gly
305 310 315 320 305 310 315 320
Phe Asp Gln Glu Lys Leu Gln Gln Glu Tyr Gln Gln Ser Glu Leu Pro Phe Asp Gln Glu Lys Leu Gln Gln Glu Tyr Gln Gln Ser Glu Leu Pro
325 330 335 325 330 335
Asn Phe Val Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys Asn Phe Val Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys
340 345 350 340 345 350
Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Val Asn Asn Ala Ile Asn Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Val Asn Asn Ala Ile Asn
355 360 365 355 360 365
Glu Thr Ser Pro Tyr Tyr Leu Gly Arg Glu His Asp Leu Phe Phe Lys Glu Thr Ser Pro Tyr Tyr Leu Gly Arg Glu His Asp Leu Phe Phe Lys
370 375 380 370 375 380
Gly His Pro Arg Gly Gly Ile Ile Asn Asp Ile Ile Leu Gly Ser Phe Gly His Pro Arg Gly Gly Ile Ile Asn Asp Ile Ile Leu Gly Ser Phe
385 390 395 400 385 390 395 400
Asn Asn Met Ile Asp Ile Pro Ala Lys Val Ser Phe Glu Val Leu Met Asn Asn Met Ile Asp Ile Pro Ala Lys Val Ser Phe Glu Val Leu Met
405 410 415 405 410 415
Met Thr Gly Met Leu Pro Asp Thr Val Gly Gly Ile Ala Ser Ser Leu Met Thr Gly Met Leu Pro Asp Thr Val Gly Gly Ile Ala Ser Ser Leu
420 425 430 420 425 430
Tyr Phe Ser Ile Pro Ala Glu Lys Val Ser Phe Ile Val Phe Thr Ser Tyr Phe Ser Ile Pro Ala Glu Lys Val Ser Phe Ile Val Phe Thr Ser
435 440 445 435 440 445
Ser Asp Thr Ile Thr Asp Arg Glu Asp Ala Leu Lys Ser Pro Leu Val Ser Asp Thr Ile Thr Asp Arg Glu Asp Ala Leu Lys Ser Pro Leu Val
450 455 460 450 455 460
Gln Val Met Met Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe Gln Val Met Met Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe
465 470 475 480 465 470 475 480
Trp Cys Trp Cys
<210> 64<210> 64
<211> 675<211> 675
<212> Белок<212> Protein
<213> Photobacterium damsela<213> Photobacterium damsela
<400> 64<400> 64
Met Lys Lys Ile Leu Thr Val Leu Ser Ile Phe Ile Leu Ser Ala Cys Met Lys Lys Ile Leu Thr Val Leu Ser Ile Phe Ile Leu Ser Ala Cys
1 5 10 15 1 5 10 15
Asn Ser Asp Asn Thr Ser Leu Lys Glu Thr Val Ser Ser Asn Ser Ala Asn Ser Asp Asn Thr Ser Leu Lys Glu Thr Val Ser Ser Asn Ser Ala
20 25 30 20 25 30
Asp Val Val Glu Thr Glu Thr Tyr Gln Leu Thr Pro Ile Asp Ala Pro Asp Val Val Glu Thr Glu Thr Tyr Gln Leu Thr Pro Ile Asp Ala Pro
35 40 45 35 40 45
Ser Ser Phe Leu Ser His Ser Trp Glu Gln Thr Cys Gly Thr Pro Ile Ser Ser Phe Leu Ser His Ser Trp Glu Gln Thr Cys Gly Thr Pro Ile
50 55 60 50 55 60
Leu Asn Glu Ser Asp Lys Gln Ala Ile Ser Phe Asp Phe Val Ala Pro Leu Asn Glu Ser Asp Lys Gln Ala Ile Ser Phe Asp Phe Val Ala Pro
65 70 75 80 65 70 75 80
Glu Leu Lys Gln Asp Glu Lys Tyr Cys Phe Thr Phe Lys Gly Ile Thr Glu Leu Lys Gln Asp Glu Lys Tyr Cys Phe Thr Phe Lys Gly Ile Thr
85 90 95 85 90 95
Gly Asp His Arg Tyr Ile Thr Asn Thr Thr Leu Thr Val Val Ala Pro Gly Asp His Arg Tyr Ile Thr Asn Thr Thr Leu Thr Val Val Ala Pro
100 105 110 100 105 110
Thr Leu Glu Val Tyr Ile Asp His Ala Ser Leu Pro Ser Leu Gln Gln Thr Leu Glu Val Tyr Ile Asp His Ala Ser Leu Pro Ser Leu Gln Gln
115 120 125 115 120 125
Leu Ile His Ile Ile Gln Ala Lys Asp Glu Tyr Pro Ser Asn Gln Arg Leu Ile His Ile Ile Gln Ala Lys Asp Glu Tyr Pro Ser Asn Gln Arg
130 135 140 130 135 140
Phe Val Ser Trp Lys Arg Val Thr Val Asp Ala Asp Asn Ala Asn Lys Phe Val Ser Trp Lys Arg Val Thr Val Asp Ala Asp Asn Ala Asn Lys
145 150 155 160 145 150 155 160
Leu Asn Ile His Thr Tyr Pro Leu Lys Gly Asn Asn Thr Ser Pro Glu Leu Asn Ile His Thr Tyr Pro Leu Lys Gly Asn Asn Thr Ser Pro Glu
165 170 175 165 170 175
Met Val Ala Ala Ile Asp Glu Tyr Ala Gln Ser Lys Asn Arg Leu Asn Met Val Ala Ala Ile Asp Glu Tyr Ala Gln Ser Lys Asn Arg Leu Asn
180 185 190 180 185 190
Ile Glu Phe Tyr Thr Asn Thr Ala His Val Phe Asn Asn Leu Pro Pro Ile Glu Phe Tyr Thr Asn Thr Ala His Val Phe Asn Asn Leu Pro Pro
195 200 205 195 200 205
Ile Ile Gln Pro Leu Tyr Asn Asn Glu Lys Val Lys Ile Ser His Ile Ile Ile Gln Pro Leu Tyr Asn Asn Glu Lys Val Lys Ile Ser His Ile
210 215 220 210 215 220
Ser Leu Tyr Asp Asp Gly Ser Ser Glu Tyr Val Ser Leu Tyr Gln Trp Ser Leu Tyr Asp Asp Gly Ser Ser Glu Tyr Val Ser Leu Tyr Gln Trp
225 230 235 240 225 230 235 240
Lys Asp Thr Pro Asn Lys Ile Glu Thr Leu Glu Gly Glu Val Ser Leu Lys Asp Thr Pro Asn Lys Ile Glu Thr Leu Glu Gly Glu Val Ser Leu
245 250 255 245 250 255
Leu Ala Asn Tyr Leu Ala Gly Thr Ser Pro Asp Ala Pro Lys Gly Met Leu Ala Asn Tyr Leu Ala Gly Thr Ser Pro Asp Ala Pro Lys Gly Met
260 265 270 260 265 270
Gly Asn Arg Tyr Asn Trp His Lys Leu Tyr Asp Thr Asp Tyr Tyr Phe Gly Asn Arg Tyr Asn Trp His Lys Leu Tyr Asp Thr Asp Tyr Tyr Phe
275 280 285 275 280 285
Leu Arg Glu Asp Tyr Leu Asp Val Glu Ala Asn Leu His Asp Leu Arg Leu Arg Glu Asp Tyr Leu Asp Val Glu Ala Asn Leu His Asp Leu Arg
290 295 300 290 295 300
Asp Tyr Leu Gly Ser Ser Ala Lys Gln Met Pro Trp Asp Glu Phe Ala Asp Tyr Leu Gly Ser Ser Ala Lys Gln Met Pro Trp Asp Glu Phe Ala
305 310 315 320 305 310 315 320
Lys Leu Ser Asp Ser Gln Gln Thr Leu Phe Leu Asp Ile Val Gly Phe Lys Leu Ser Asp Ser Gln Gln Thr Leu Phe Leu Asp Ile Val Gly Phe
325 330 335 325 330 335
Asp Lys Glu Gln Leu Gln Gln Gln Tyr Ser Gln Ser Pro Leu Pro Asn Asp Lys Glu Gln Leu Gln Gln Gln Tyr Ser Gln Ser Pro Leu Pro Asn
340 345 350 340 345 350
Phe Ile Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys Glu Phe Ile Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys Glu
355 360 365 355 360 365
Tyr Tyr Ala Gln Gln Gln Val Asn Val Ile Asn Asn Ala Ile Asn Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Ile Asn Asn Ala Ile Asn Glu
370 375 380 370 375 380
Thr Ser Pro Tyr Tyr Leu Gly Lys Asp Tyr Asp Leu Phe Phe Lys Gly Thr Ser Pro Tyr Tyr Leu Gly Lys Asp Tyr Asp Leu Phe Phe Lys Gly
385 390 395 400 385 390 395 400
His Pro Ala Gly Gly Val Ile Asn Asp Ile Ile Leu Gly Ser Phe Pro His Pro Ala Gly Gly Val Ile Asn Asp Ile Ile Leu Gly Ser Phe Pro
405 410 415 405 410 415
Asp Met Ile Asn Ile Pro Ala Lys Ile Ser Phe Glu Val Leu Met Met Asp Met Ile Asn Ile Pro Ala Lys Ile Ser Phe Glu Val Leu Met Met
420 425 430 420 425 430
Thr Asp Met Leu Pro Asp Thr Val Ala Gly Ile Ala Ser Ser Leu Tyr Thr Asp Met Leu Pro Asp Thr Val Ala Gly Ile Ala Ser Ser Leu Tyr
435 440 445 435 440 445
Phe Thr Ile Pro Ala Asp Lys Val Asn Phe Ile Val Phe Thr Ser Ser Phe Thr Ile Pro Ala Asp Lys Val Asn Phe Ile Val Phe Thr Ser Ser
450 455 460 450 455 460
Asp Thr Ile Thr Asp Arg Glu Glu Ala Leu Lys Ser Pro Leu Val Gln Asp Thr Ile Thr Asp Arg Glu Glu Ala Leu Lys Ser Pro Leu Val Gln
465 470 475 480 465 470 475 480
Val Met Leu Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe Trp Val Met Leu Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe Trp
485 490 495 485 490 495
Ala Asp His Lys Val Asn Ser Met Glu Val Ala Ile Asp Glu Ala Cys Ala Asp His Lys Val Asn Ser Met Glu Val Ala Ile Asp Glu Ala Cys
500 505 510 500 505 510
Thr Arg Ile Ile Ala Lys Arg Gln Pro Thr Ala Ser Asp Leu Arg Leu Thr Arg Ile Ile Ala Lys Arg Gln Pro Thr Ala Ser Asp Leu Arg Leu
515 520 525 515 520 525
Val Ile Ala Ile Ile Lys Thr Ile Thr Asp Leu Glu Arg Ile Gly Asp Val Ile Ala Ile Ile Lys Thr Ile Thr Asp Leu Glu Arg Ile Gly Asp
530 535 540 530 535 540
Val Ala Glu Ser Ile Ala Lys Val Ala Leu Glu Ser Phe Ser Asn Lys Val Ala Glu Ser Ile Ala Lys Val Ala Leu Glu Ser Phe Ser Asn Lys
545 550 555 560 545 550 555 560
Gln Tyr Asn Leu Leu Val Ser Leu Glu Ser Leu Gly Gln His Thr Val Gln Tyr Asn Leu Leu Val Ser Leu Glu Ser Leu Gly Gln His Thr Val
565 570 575 565 570 575
Arg Met Leu His Glu Val Leu Asp Ala Phe Ala Arg Met Asp Val Lys Arg Met Leu His Glu Val Leu Asp Ala Phe Ala Arg Met Asp Val Lys
580 585 590 580 585 590
Ala Ala Ile Glu Val Tyr Gln Glu Asp Asp Arg Ile Asp Gln Glu Tyr Ala Ala Ile Glu Val Tyr Gln Glu Asp Asp Arg Ile Asp Gln Glu Tyr
595 600 605 595 600 605
Glu Ser Ile Val Arg Gln Leu Met Ala His Met Met Glu Asp Pro Ser Glu Ser Ile Val Arg Gln Leu Met Ala His Met Met Glu Asp Pro Ser
610 615 620 610 615 620
Ser Ile Pro Asn Val Met Lys Val Met Trp Ala Ala Arg Ser Ile Glu Ser Ile Pro Asn Val Met Lys Val Met Trp Ala Ala Arg Ser Ile Glu
625 630 635 640 625 630 635 640
Arg Val Gly Asp Arg Cys Gln Asn Ile Cys Glu Tyr Ile Ile Tyr Phe Arg Val Gly Asp Arg Cys Gln Asn Ile Cys Glu Tyr Ile Ile Tyr Phe
645 650 655 645 650 655
Val Lys Gly Lys Asp Val Arg His Thr Lys Pro Asp Asp Phe Gly Thr Val Lys Gly Lys Asp Val Arg His Thr Lys Pro Asp Asp Phe Gly Thr
660 665 670 660 665 670
Met Leu Asp Met Leu Asp
675 675
<210> 65<210> 65
<211> 510<211> 510
<212> Белок<212> Protein
<213> Photobacterium damsela<213> Photobacterium damsela
<400> 65<400> 65
Met Lys Lys Ile Leu Thr Val Leu Ser Ile Phe Ile Leu Ser Ala Cys Met Lys Lys Ile Leu Thr Val Leu Ser Ile Phe Ile Leu Ser Ala Cys
1 5 10 15 1 5 10 15
Asn Ser Asp Asn Thr Ser Leu Lys Glu Thr Val Ser Ser Asn Ser Ala Asn Ser Asp Asn Thr Ser Leu Lys Glu Thr Val Ser Ser Asn Ser Ala
20 25 30 20 25 30
Asp Val Val Glu Thr Glu Thr Tyr Gln Leu Thr Pro Ile Asp Ala Pro Asp Val Val Glu Thr Glu Thr Tyr Gln Leu Thr Pro Ile Asp Ala Pro
35 40 45 35 40 45
Ser Ser Phe Leu Ser His Ser Trp Glu Gln Thr Cys Gly Thr Pro Ile Ser Ser Phe Leu Ser His Ser Trp Glu Gln Thr Cys Gly Thr Pro Ile
50 55 60 50 55 60
Leu Asn Glu Ser Asp Lys Gln Ala Ile Ser Phe Asp Phe Val Ala Pro Leu Asn Glu Ser Asp Lys Gln Ala Ile Ser Phe Asp Phe Val Ala Pro
65 70 75 80 65 70 75 80
Glu Leu Lys Gln Asp Glu Lys Tyr Cys Phe Thr Phe Lys Gly Ile Thr Glu Leu Lys Gln Asp Glu Lys Tyr Cys Phe Thr Phe Lys Gly Ile Thr
85 90 95 85 90 95
Gly Asp His Arg Tyr Ile Thr Asn Thr Thr Leu Thr Val Val Ala Pro Gly Asp His Arg Tyr Ile Thr Asn Thr Thr Leu Thr Val Val Ala Pro
100 105 110 100 105 110
Thr Leu Glu Val Tyr Ile Asp His Ala Ser Leu Pro Ser Leu Gln Gln Thr Leu Glu Val Tyr Ile Asp His Ala Ser Leu Pro Ser Leu Gln Gln
115 120 125 115 120 125
Leu Ile His Ile Ile Gln Ala Lys Asp Glu Tyr Pro Ser Asn Gln Arg Leu Ile His Ile Ile Gln Ala Lys Asp Glu Tyr Pro Ser Asn Gln Arg
130 135 140 130 135 140
Phe Val Ser Trp Lys Arg Val Thr Val Asp Ala Asp Asn Ala Asn Lys Phe Val Ser Trp Lys Arg Val Thr Val Asp Ala Asp Asn Ala Asn Lys
145 150 155 160 145 150 155 160
Leu Asn Ile His Thr Tyr Pro Leu Lys Gly Asn Asn Thr Ser Pro Glu Leu Asn Ile His Thr Tyr Pro Leu Lys Gly Asn Asn Thr Ser Pro Glu
165 170 175 165 170 175
Met Val Ala Ala Ile Asp Glu Tyr Ala Gln Ser Lys Asn Arg Leu Asn Met Val Ala Ala Ile Asp Glu Tyr Ala Gln Ser Lys Asn Arg Leu Asn
180 185 190 180 185 190
Ile Glu Phe Tyr Thr Asn Thr Ala His Val Phe Asn Asn Leu Pro Pro Ile Glu Phe Tyr Thr Asn Thr Ala His Val Phe Asn Asn Leu Pro Pro
195 200 205 195 200 205
Ile Ile Gln Pro Leu Tyr Asn Asn Glu Lys Val Lys Ile Ser His Ile Ile Ile Gln Pro Leu Tyr Asn Asn Glu Lys Val Lys Ile Ser His Ile
210 215 220 210 215 220
Ser Leu Tyr Asp Asp Gly Ser Ser Glu Tyr Val Ser Leu Tyr Gln Trp Ser Leu Tyr Asp Asp Gly Ser Ser Glu Tyr Val Ser Leu Tyr Gln Trp
225 230 235 240 225 230 235 240
Lys Asp Thr Pro Asn Lys Ile Glu Thr Leu Glu Gly Glu Val Ser Leu Lys Asp Thr Pro Asn Lys Ile Glu Thr Leu Glu Gly Glu Val Ser Leu
245 250 255 245 250 255
Leu Ala Asn Tyr Leu Ala Gly Thr Ser Pro Asp Ala Pro Lys Gly Met Leu Ala Asn Tyr Leu Ala Gly Thr Ser Pro Asp Ala Pro Lys Gly Met
260 265 270 260 265 270
Gly Asn Arg Tyr Asn Trp His Lys Leu Tyr Asp Thr Asp Tyr Tyr Phe Gly Asn Arg Tyr Asn Trp His Lys Leu Tyr Asp Thr Asp Tyr Tyr Phe
275 280 285 275 280 285
Leu Arg Glu Asp Tyr Leu Asp Val Glu Ala Asn Leu His Asp Leu Arg Leu Arg Glu Asp Tyr Leu Asp Val Glu Ala Asn Leu His Asp Leu Arg
290 295 300 290 295 300
Asp Tyr Leu Gly Ser Ser Ala Lys Gln Met Pro Trp Asp Glu Phe Ala Asp Tyr Leu Gly Ser Ser Ala Lys Gln Met Pro Trp Asp Glu Phe Ala
305 310 315 320 305 310 315 320
Lys Leu Ser Asp Ser Gln Gln Thr Leu Phe Leu Asp Ile Val Gly Phe Lys Leu Ser Asp Ser Gln Gln Thr Leu Phe Leu Asp Ile Val Gly Phe
325 330 335 325 330 335
Asp Lys Glu Gln Leu Gln Gln Gln Tyr Ser Gln Ser Pro Leu Pro Asn Asp Lys Glu Gln Leu Gln Gln Gln Tyr Ser Gln Ser Pro Leu Pro Asn
340 345 350 340 345 350
Phe Ile Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys Glu Phe Ile Phe Thr Gly Thr Thr Thr Trp Ala Gly Gly Glu Thr Lys Glu
355 360 365 355 360 365
Tyr Tyr Ala Gln Gln Gln Val Asn Val Ile Asn Asn Ala Ile Asn Glu Tyr Tyr Ala Gln Gln Gln Val Asn Val Ile Asn Asn Ala Ile Asn Glu
370 375 380 370 375 380
Thr Ser Pro Tyr Tyr Leu Gly Lys Asp Tyr Asp Leu Phe Phe Lys Gly Thr Ser Pro Tyr Tyr Leu Gly Lys Asp Tyr Asp Leu Phe Phe Lys Gly
385 390 395 400 385 390 395 400
His Pro Ala Gly Gly Val Ile Asn Asp Ile Ile Leu Gly Ser Phe Pro His Pro Ala Gly Gly Val Ile Asn Asp Ile Ile Leu Gly Ser Phe Pro
405 410 415 405 410 415
Asp Met Ile Asn Ile Pro Ala Lys Ile Ser Phe Glu Val Leu Met Met Asp Met Ile Asn Ile Pro Ala Lys Ile Ser Phe Glu Val Leu Met Met
420 425 430 420 425 430
Thr Asp Met Leu Pro Asp Thr Val Ala Gly Ile Ala Ser Ser Leu Tyr Thr Asp Met Leu Pro Asp Thr Val Ala Gly Ile Ala Ser Ser Leu Tyr
435 440 445 435 440 445
Phe Thr Ile Pro Ala Asp Lys Val Asn Phe Ile Val Phe Thr Ser Ser Phe Thr Ile Pro Ala Asp Lys Val Asn Phe Ile Val Phe Thr Ser Ser
450 455 460 450 455 460
Asp Thr Ile Thr Asp Arg Glu Glu Ala Leu Lys Ser Pro Leu Val Gln Asp Thr Ile Thr Asp Arg Glu Glu Ala Leu Lys Ser Pro Leu Val Gln
465 470 475 480 465 470 475 480
Val Met Leu Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe Trp Val Met Leu Thr Leu Gly Ile Val Lys Glu Lys Asp Val Leu Phe Trp
485 490 495 485 490 495
Ala Asp Leu Pro Asp Cys Ser Ser Gly Val Cys Ile Asp Lys Ala Asp Leu Pro Asp Cys Ser Ser Gly Val Cys Ile Asp Lys
500 505 510 500 505 510
<210> 66<210> 66
<211> 422<211> 422
<212> Белок<212> Protein
<213> Heliobacter acinonychis<213> Heliobacter acinonychis
<400> 66<400> 66
Met Gly Thr Ile Lys Lys Pro Leu Ile Ile Ala Gly Asn Gly Pro Ser Met Gly Thr Ile Lys Lys Pro Leu Ile Ile Ala Gly Asn Gly Pro Ser
1 5 10 15 1 5 10 15
Ile Lys Asp Leu Asp Tyr Ala Leu Phe Pro Lys Asp Phe Asp Val Phe Ile Lys Asp Leu Asp Tyr Ala Leu Phe Pro Lys Asp Phe Asp Val Phe
20 25 30 20 25 30
Arg Cys Asn Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Arg Glu Arg Cys Asn Gln Phe Tyr Phe Glu Asp Lys Tyr Tyr Leu Gly Arg Glu
35 40 45 35 40 45
Ile Lys Gly Val Phe Phe Asn Pro Cys Val Leu Ser Ser Gln Met Gln Ile Lys Gly Val Phe Phe Asn Pro Cys Val Leu Ser Ser Gln Met Gln
50 55 60 50 55 60
Thr Val Gln Tyr Leu Met Asp Asn Gly Glu Tyr Ser Ile Glu Arg Phe Thr Val Gln Tyr Leu Met Asp Asn Gly Glu Tyr Ser Ile Glu Arg Phe
65 70 75 80 65 70 75 80
Phe Cys Ser Val Ser Thr Asp Arg His Asp Phe Asp Gly Asp Tyr Gln Phe Cys Ser Val Ser Thr Asp Arg His Asp Phe Asp Gly Asp Tyr Gln
85 90 95 85 90 95
Thr Ile Leu Pro Val Asp Gly Tyr Leu Lys Ala His Tyr Pro Phe Val Thr Ile Leu Pro Val Asp Gly Tyr Leu Lys Ala His Tyr Pro Phe Val
100 105 110 100 105 110
Cys Asp Thr Phe Ser Leu Phe Lys Gly His Glu Glu Ile Leu Lys His Cys Asp Thr Phe Ser Leu Phe Lys Gly His Glu Glu Ile Leu Lys His
115 120 125 115 120 125
Val Lys Tyr His Leu Lys Thr Tyr Ser Lys Glu Leu Ser Ala Gly Val Val Lys Tyr His Leu Lys Thr Tyr Ser Lys Glu Leu Ser Ala Gly Val
130 135 140 130 135 140
Leu Met Leu Leu Ser Ala Val Val Leu Gly Tyr Lys Glu Ile Tyr Leu Leu Met Leu Leu Ser Ala Val Val Leu Gly Tyr Lys Glu Ile Tyr Leu
145 150 155 160 145 150 155 160
Val Gly Ile Asp Phe Gly Ala Ser Ser Trp Gly His Phe Tyr Asp Glu Val Gly Ile Asp Phe Gly Ala Ser Ser Trp Gly His Phe Tyr Asp Glu
165 170 175 165 170 175
Ser Gln Ser Gln His Phe Ser Asn His Met Ala Asp Cys His Asn Ile Ser Gln Ser Gln His Phe Ser Asn His Met Ala Asp Cys His Asn Ile
180 185 190 180 185 190
Tyr Tyr Asp Met Leu Thr Ile Cys Leu Cys Gln Lys Tyr Ala Lys Leu Tyr Tyr Asp Met Leu Thr Ile Cys Leu Cys Gln Lys Tyr Ala Lys Leu
195 200 205 195 200 205
Tyr Ala Leu Ala Pro Asn Ser Pro Leu Ser His Leu Leu Thr Leu Asn Tyr Ala Leu Ala Pro Asn Ser Pro Leu Ser His Leu Leu Thr Leu Asn
210 215 220 210 215 220
Pro Gln Ala Lys Tyr Pro Phe Glu Leu Leu Asp Lys Pro Ile Gly Tyr Pro Gln Ala Lys Tyr Pro Phe Glu Leu Leu Asp Lys Pro Ile Gly Tyr
225 230 235 240 225 230 235 240
Thr Ser Asp Leu Ile Ile Ser Ser Pro Leu Glu Glu Lys Leu Leu Glu Thr Ser Asp Leu Ile Ile Ser Ser Pro Leu Glu Glu Lys Leu Leu Glu
245 250 255 245 250 255
Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu
260 265 270 260 265 270
Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys
275 280 285 275 280 285
Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu
290 295 300 290 295 300
Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile
305 310 315 320 305 310 315 320
Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu
325 330 335 325 330 335
Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu
340 345 350 340 345 350
Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys
355 360 365 355 360 365
Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu Asn Ile Glu Glu Lys Leu Leu Glu Phe Lys Asn Ile Glu Glu Lys Leu
370 375 380 370 375 380
Leu Ala Ser Arg Leu Asn Asn Ile Leu Arg Lys Ile Lys Arg Lys Ile Leu Ala Ser Arg Leu Asn Asn Ile Leu Arg Lys Ile Lys Arg Lys Ile
385 390 395 400 385 390 395 400
Leu Pro Phe Phe Trp Gly Gly Gly Val Thr Pro Thr Leu Lys Val Ser Leu Pro Phe Phe Trp Gly Gly Gly Val Thr Pro Thr Leu Lys Val Ser
405 410 415 405 410 415
Phe Arg Trp Gly Ala Ala Phe Arg Trp Gly Ala Ala
420 420
<210> 67<210> 67
<211> 609<211> 609
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 67<400> 67
Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile
1 5 10 15 1 5 10 15
Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala
20 25 30 20 25 30
Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg
35 40 45 35 40 45
Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu
50 55 60 50 55 60
His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu
65 70 75 80 65 70 75 80
Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val
85 90 95 85 90 95
Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu
100 105 110 100 105 110
Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile
115 120 125 115 120 125
Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu
130 135 140 130 135 140
Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val
145 150 155 160 145 150 155 160
Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly
165 170 175 165 170 175
Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser
180 185 190 180 185 190
Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu
195 200 205 195 200 205
Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp
210 215 220 210 215 220
Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln
225 230 235 240 225 230 235 240
Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu
245 250 255 245 250 255
Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile
260 265 270 260 265 270
Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu
275 280 285 275 280 285
Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser
290 295 300 290 295 300
Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly
305 310 315 320 305 310 315 320
Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser
325 330 335 325 330 335
Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu
340 345 350 340 345 350
Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr
355 360 365 355 360 365
Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg
370 375 380 370 375 380
Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val
385 390 395 400 385 390 395 400
Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu
405 410 415 405 410 415
Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His
420 425 430 420 425 430
Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met
435 440 445 435 440 445
Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp
450 455 460 450 455 460
Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala
465 470 475 480 465 470 475 480
Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu
485 490 495 485 490 495
Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp
500 505 510 500 505 510
Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu
515 520 525 515 520 525
Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu
530 535 540 530 535 540
Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met
545 550 555 560 545 550 555 560
His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe
565 570 575 565 570 575
Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys
580 585 590 580 585 590
Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val
595 600 605 595 600 605
Glu Glu
<210> 68<210> 68
<211> 1830<211> 1830
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 68<400> 68
atgtgtggaa ttgttggcgc gatcgcgcaa cgtgatgtag cagaaatcct tcttgaaggt 60atgtgtggaa ttgttggcgc gatcgcgcaa cgtgatgtag cagaaatcct tcttgaaggt 60
ttacgtcgtc tggaataccg cggatatgac tctgccggtc tggccgttgt tgatgcagaa 120ttacgtcgtc tggaataccg cggatatgac tctgccggtc tggccgttgt tgatgcagaa 120
ggtcatatga cccgcctgcg tcgcctcggt aaagtccaga tgctggcaca ggcagcggaa 180ggtcatatga cccgcctgcg tcgcctcggt aaagtccaga tgctggcaca ggcagcggaa 180
gaacatcctc tgcatggcgg cactggtatt gctcacactc gctgggcgac ccacggtgaa 240gaacatcctc tgcatggcgg cactggtatt gctcacactc gctgggcgac ccacggtgaa 240
ccttcagaag tgaatgcgca tccgcatgtt tctgaacaca ttgtggtggt gcataacggc 300ccttcagaag tgaatgcgca tccgcatgtt tctgaacaca ttgtggtggt gcataacggc 300
atcatcgaaa accatgaacc gctgcgtgaa gagctaaaag cgcgtggcta taccttcgtt 360atcatcgaaa accatgaacc gctgcgtgaa gagctaaaag cgcgtggcta taccttcgtt 360
tctgaaaccg acaccgaagt gattgcccat ctggtgaact gggagctgaa acaaggcggg 420tctgaaaccg acaccgaagt gattgcccat ctggtgaact gggagctgaa acaaggcggg 420
actctgcgtg aggccgttct gcgtgctatc ccgcagctgc gtggtgcgta cggtacagtg 480actctgcgtg aggccgttct gcgtgctatc ccgcagctgc gtggtgcgta cggtacagtg 480
atcatggact cccgtcaccc ggataccctg ctggcggcac gttctggtag tccgctggtg 540atcatggact cccgtcaccc ggataccctg ctggcggcac gttctggtag tccgctggtg 540
attggcctgg ggatgggcga aaactttatc gcttctgacc agctggcgct gttgccggtg 600attggcctgg ggatgggcga aaactttatc gcttctgacc agctggcgct gttgccggtg 600
acccgtcgct ttatcttcct tgaagagggc gatattgcgg aaatcactcg ccgttcggta 660acccgtcgct ttatcttcct tgaagagggc gatattgcgg aaatcactcg ccgttcggta 660
aacatcttcg ataaaactgg cgcggaagta aaacgtcagg atatcgaatc caatctgcaa 720aacatcttcg ataaaactgg cgcggaagta aaacgtcagg atatcgaatc caatctgcaa 720
tatgacgcgg gcgataaagg catttaccgt cactacatgc agaaagagat ctacgaacag 780tatgacgcgg gcgataaagg catttaccgt cactacatgc agaaagagat ctacgaacag 780
ccgaacgcga tcaaaaacac ccttaccgga cgcatcagcc acggtcaggt tgatttaagc 840ccgaacgcga tcaaaaacac ccttaccgga cgcatcagcc acggtcaggt tgatttaagc 840
gagctgggac cgaacgccga cgaactgctg tcgaaggttg agcatattca gatcctcgcc 900gagctgggac cgaacgccga cgaactgctg tcgaaggttg agcatattca gatcctcgcc 900
tgtggtactt cttataactc cggtatggtt tcccgctact ggtttgaatc gctagcaggt 960tgtggtactt cttataactc cggtatggtt tcccgctact ggtttgaatc gctagcaggt 960
attccgtgcg acgtcgaaat cgcctctgaa ttccgctatc gcaaatctgc cgtgcgtcgt 1020attccgtgcg acgtcgaaat cgcctctgaa ttccgctatc gcaaatctgc cgtgcgtcgt 1020
aacagcctga tgatcacctt gtcacagtct ggcgaaaccg cggataccct ggctggcctg 1080aacagcctga tgatcacctt gtcacagtct ggcgaaaccg cggataccct ggctggcctg 1080
cgtctgtcga aagagctggg ttaccttggt tcactggcaa tctgtaacgt tccgggttct 1140cgtctgtcga aagagctggg ttaccttggt tcactggcaa tctgtaacgt tccgggttct 1140
tctctggtgc gcgaatccga tctggcgcta atgaccaacg cgggtacaga aatcggcgtg 1200tctctggtgc gcgaatccga tctggcgcta atgaccaacg cgggtacaga aatcggcgtg 1200
gcatccacta aagcattcac cactcagtta actgtgctgt tgatgctggt ggcgaagctg 1260gcatccacta aagcattcac cactcagtta actgtgctgt tgatgctggt ggcgaagctg 1260
tctcgcctga aaggtctgga tgcctccatt gaacatgaca tcgtgcatgg tctgcaggcg 1320tctcgcctga aaggtctgga tgcctccatt gaacatgaca tcgtgcatgg tctgcaggcg 1320
ctgccgagcc gtattgagca gatgctgtct caggacaaac gcattgaagc gctggcagaa 1380ctgccgagcc gtattgagca gatgctgtct caggacaaac gcattgaagc gctggcagaa 1380
gatttctctg acaaacatca cgcgctgttc ctgggccgtg gcgatcagta cccaatcgcg 1440gatttctctg acaaacatca cgcgctgttc ctgggccgtg gcgatcagta cccaatcgcg 1440
ctggaaggcg cattgaagtt gaaagagatc tcttacattc acgctgaagc ctacgctgct 1500ctggaaggcg cattgaagtt gaaagagatc tcttacattc acgctgaagc ctacgctgct 1500
ggcgaactga aacacggtcc gctggcgcta attgatgccg atatgccggt tattgttgtt 1560ggcgaactga aacacggtcc gctggcgcta attgatgccg atatgccggt tattgttgtt 1560
gcaccgaaca acgaattgct ggaaaaactg aaatccaaca ttgaagaagt tcgcgcgcgt 1620gcaccgaaca acgaattgct ggaaaaactg aaatccaaca ttgaagaagt tcgcgcgcgt 1620
ggcggtcagt tgtatgtctt cgccgatcag gatgcgggtt ttgtaagtag cgataacatg 1680ggcggtcagt tgtatgtctt cgccgatcag gatgcgggtt ttgtaagtag cgataacatg 1680
cacatcatcg agatgccgca tgtggaagag gtgattgcac cgatcttcta caccgttccg 1740cacatcatcg agatgccgca tgtggaagag gtgattgcac cgatcttcta caccgttccg 1740
ctgcagctgc tggcttacca tgtcgcgctg atcaaaggca ccgacgttga ccagccgcgt 1800ctgcagctgc tggcttacca tgtcgcgctg atcaaaggca ccgacgttga ccagccgcgt 1800
aacctggcaa aatcggttac ggttgagtaa 1830aacctggcaa aatcggttac ggttgagtaa 1830
<210> 69<210> 69
<211> 609<211> 609
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 69<400> 69
Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Lys Ile Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Lys Ile
1 5 10 15 1 5 10 15
Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala
20 25 30 20 25 30
Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg
35 40 45 35 40 45
Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu
50 55 60 50 55 60
His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu
65 70 75 80 65 70 75 80
Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val
85 90 95 85 90 95
Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu
100 105 110 100 105 110
Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile
115 120 125 115 120 125
Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu
130 135 140 130 135 140
Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val
145 150 155 160 145 150 155 160
Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly
165 170 175 165 170 175
Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser
180 185 190 180 185 190
Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu
195 200 205 195 200 205
Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp
210 215 220 210 215 220
Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln
225 230 235 240 225 230 235 240
Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu
245 250 255 245 250 255
Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile
260 265 270 260 265 270
Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu
275 280 285 275 280 285
Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser
290 295 300 290 295 300
Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly
305 310 315 320 305 310 315 320
Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser
325 330 335 325 330 335
Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu
340 345 350 340 345 350
Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr
355 360 365 355 360 365
Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg
370 375 380 370 375 380
Glu Ser Val Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val Glu Ser Val Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val
385 390 395 400 385 390 395 400
Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu
405 410 415 405 410 415
Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His
420 425 430 420 425 430
Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met
435 440 445 435 440 445
Leu Pro Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp Leu Pro Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp
450 455 460 450 455 460
Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala
465 470 475 480 465 470 475 480
Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu
485 490 495 485 490 495
Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp
500 505 510 500 505 510
Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Gly Leu Leu Glu Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Gly Leu Leu Glu
515 520 525 515 520 525
Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu
530 535 540 530 535 540
Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met
545 550 555 560 545 550 555 560
His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe
565 570 575 565 570 575
Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys
580 585 590 580 585 590
Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val
595 600 605 595 600 605
Glu Glu
<210> 70<210> 70
<211> 1830<211> 1830
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 70<400> 70
atgtgcggta tcgttggtgc tatcgcacag cgtgatgtag cgaaaatcct cctggaaggt 60atgtgcggta tcgttggtgc tatcgcacag cgtgatgtag cgaaaatcct cctggaaggt 60
ctgcgtcgtc tcgaataccg tggttacgac tctgccggtc tggcagtagt ggatgcagaa 120ctgcgtcgtc tcgaataccg tggttacgac tctgccggtc tggcagtagt ggatgcagaa 120
ggtcacatga ctcgtctgcg tcgtctgggt aaagtgcaga tgctcgcgca ggcggcggaa 180ggtcacatga ctcgtctgcg tcgtctgggt aaagtgcaga tgctcgcgca ggcggcggaa 180
gaacacccac tccacggtgg tacgggtatc gcacacactc gttgggcaac ccacggtgaa 240gaacacccac tccacggtgg tacgggtatc gcacacactc gttgggcaac ccacggtgaa 240
ccgtctgagg tcaacgcaca cccgcatgtt agcgagcaca tcgtagtcgt tcacaacggt 300ccgtctgagg tcaacgcaca cccgcatgtt agcgagcaca tcgtagtcgt tcacaacggt 300
atcatcgaga accacgaacc actccgtgag gaactcaaag cccgtggtta caccttcgta 360atcatcgaga accacgaacc actccgtgag gaactcaaag cccgtggtta caccttcgta 360
agcgaaaccg acacggaagt tatcgcccac ctcgttaact gggaactcaa acagggtggt 420agcgaaaccg acacggaagt tatcgcccac ctcgttaact gggaactcaa acagggtggt 420
actctgcgtg aagcagttct gcgtgccatt ccacagctgc gtggtgcata cggtaccgtg 480actctgcgtg aagcagttct gcgtgccatt ccacagctgc gtggtgcata cggtaccgtg 480
atcatggact ctcgtcatcc ggataccctg ctcgccgcac gttctggttc tccactcgtt 540atcatggact ctcgtcatcc ggataccctg ctcgccgcac gttctggttc tccactcgtt 540
atcggtctgg gtatgggtga gaacttcatc gcctctgatc agctggccct gctcccagtt 600atcggtctgg gtatgggtga gaacttcatc gcctctgatc agctggccct gctcccagtt 600
acccgtcgct tcatcttcct ggaagagggt gacatcgccg aaatcacccg tcgttccgtt 660acccgtcgct tcatcttcct ggaagagggt gacatcgccg aaatcacccg tcgttccgtt 660
aacatcttcg acaaaacggg tgcggaagtt aaacgtcagg acatcgagtc taacctgcag 720aacatcttcg acaaaacggg tgcggaagtt aaacgtcagg acatcgagtc taacctgcag 720
tatgacgctg gtgacaaagg catctaccgt cactacatgc agaaagagat ctacgaacag 780tatgacgctg gtgacaaagg catctaccgt cactacatgc agaaagagat ctacgaacag 780
ccgaacgcga tcaaaaacac cctgaccggt cgtatctctc acggtcaggt tgacctgtct 840ccgaacgcga tcaaaaacac cctgaccggt cgtatctctc acggtcaggt tgacctgtct 840
gagctgggtc caaacgcgga cgaactcctg tccaaagtcg agcacatcca gatcctggct 900gagctgggtc caaacgcgga cgaactcctg tccaaagtcg agcacatcca gatcctggct 900
tgtggtacct cttacaactc cggtatggtt tctcgttact ggttcgaatc tctggcaggt 960tgtggtacct cttacaactc cggtatggtt tctcgttact ggttcgaatc tctggcaggt 960
atcccatgcg acgttgaaat cgcctccgaa ttccgttatc gtaaatctgc ggtacgtcgt 1020atcccatgcg acgttgaaat cgcctccgaa ttccgttatc gtaaatctgc ggtacgtcgt 1020
aactccctca tgatcaccct gtctcagtct ggtgaaaccg ctgatactct ggcaggtctg 1080aactccctca tgatcaccct gtctcagtct ggtgaaaccg ctgatactct ggcaggtctg 1080
cgtctcagca aagaactggg ttacctgggt tctctggcca tctgcaacgt tccgggttct 1140cgtctcagca aagaactggg ttacctgggt tctctggcca tctgcaacgt tccgggttct 1140
agcctggttc gtgagtctgt gctggctctg atgaccaacg cgggtacgga gatcggtgtt 1200agcctggttc gtgagtctgt gctggctctg atgaccaacg cgggtacgga gatcggtgtt 1200
gcctctacca aagcgttcac tacccagctc actgtcctgc tgatgctggt tgccaaactg 1260gcctctacca aagcgttcac tacccagctc actngtcctgc tgatgctggt tgccaaactg 1260
tctcgtctca aaggcctcga cgctagcatc gaacacgaca tcgtacacgg tctgcaggcc 1320tctcgtctca aaggcctcga cgctagcatc gaacacgaca tcgtacacgg tctgcaggcc 1320
ctcccatctc gtatcgagca gatgctgccg caggacaaac gtatcgaagc actggcagaa 1380ctcccatctc gtatcgagca gatgctgccg caggacaaac gtatcgaagc actggcagaa 1380
gacttcagcg acaaacacca cgcgctgttt ctgggtcgtg gtgaccagta cccaattgcg 1440gacttcagcg acaaacacca cgcgctgttt ctgggtcgtg gtgaccagta cccaattgcg 1440
ctggaaggtg ccctgaaact gaaagagatc agctacatcc atgcagaggc atacgcagcg 1500ctggaaggtg ccctgaaact gaaagagatc agctacatcc atgcagaggc atacgcagcg 1500
ggtgagctga aacatggtcc actggccctg atcgacgcag atatgccggt tattgtggtt 1560ggtgagctga aacatggtcc actggccctg atcgacgcag atatgccggt tattgtggtt 1560
gctccgaaca acggcctgct ggagaaactg aaatccaaca tcgaggaagt acgtgcgcgt 1620gctccgaaca acggcctgct ggagaaactg aaatccaaca tcgaggaagt acgtgcgcgt 1620
ggtggtcagc tgtacgtgtt tgctgaccag gacgcgggtt tcgtttccag cgacaacatg 1680ggtggtcagc tgtacgtgtt tgctgaccag gacgcgggtt tcgtttccag cgacaacatg 1680
cacatcatcg aaatgccgca tgttgaagag gtaatcgcgc caatcttcta caccgtaccg 1740cacatcatcg aaatgccgca tgttgaagag gtaatcgcgc caatcttcta caccgtaccg 1740
ctgcagctgc tggcgtacca tgtagccctg atcaaaggta cggacgttga ccagccgcgt 1800ctgcagctgc tggcgtacca tgtagccctg atcaaaggta cggacgttga ccagccgcgt 1800
aacctggcga aatccgtgac cgtggaataa 1830aacctggcga aatccgtgac cgtggaataa 1830
<210> 71<210> 71
<211> 445<211> 445
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 71<400> 71
Met Ser Asn Arg Lys Tyr Phe Gly Thr Asp Gly Ile Arg Gly Arg Val Met Ser Asn Arg Lys Tyr Phe Gly Thr Asp Gly Ile Arg Gly Arg Val
1 5 10 15 1 5 10 15
Gly Asp Ala Pro Ile Thr Pro Asp Phe Val Leu Lys Leu Gly Trp Ala Gly Asp Ala Pro Ile Thr Pro Asp Phe Val Leu Lys Leu Gly Trp Ala
20 25 30 20 25 30
Ala Gly Lys Val Leu Ala Arg His Gly Ser Arg Lys Ile Ile Ile Gly Ala Gly Lys Val Leu Ala Arg His Gly Ser Arg Lys Ile Ile Ile Gly
35 40 45 35 40 45
Lys Asp Thr Arg Ile Ser Gly Tyr Met Leu Glu Ser Ala Leu Glu Ala Lys Asp Thr Arg Ile Ser Gly Tyr Met Leu Glu Ser Ala Leu Glu Ala
50 55 60 50 55 60
Gly Leu Ala Ala Ala Gly Leu Ser Ala Leu Phe Thr Gly Pro Met Pro Gly Leu Ala Ala Ala Gly Leu Ser Ala Leu Phe Thr Gly Pro Met Pro
65 70 75 80 65 70 75 80
Thr Pro Ala Val Ala Tyr Leu Thr Arg Thr Phe Arg Ala Glu Ala Gly Thr Pro Ala Val Ala Tyr Leu Thr Arg Thr Phe Arg Ala Glu Ala Gly
85 90 95 85 90 95
Ile Val Ile Ser Ala Ser His Asn Pro Phe Tyr Asp Asn Gly Ile Lys Ile Val Ile Ser Ala Ser His Asn Pro Phe Tyr Asp Asn Gly Ile Lys
100 105 110 100 105 110
Phe Phe Ser Ile Asp Gly Thr Lys Leu Pro Asp Ala Val Glu Glu Ala Phe Phe Ser Ile Asp Gly Thr Lys Leu Pro Asp Ala Val Glu Glu Ala
115 120 125 115 120 125
Ile Glu Ala Glu Met Glu Lys Glu Ile Ser Cys Val Asp Ser Ala Glu Ile Glu Ala Glu Met Glu Lys Glu Ile Ser Cys Val Asp Ser Ala Glu
130 135 140 130 135 140
Leu Gly Lys Ala Ser Arg Ile Val Asp Ala Ala Gly Arg Tyr Ile Glu Leu Gly Lys Ala Ser Arg Ile Val Asp Ala Ala Gly Arg Tyr Ile Glu
145 150 155 160 145 150 155 160
Phe Cys Lys Ala Thr Phe Pro Asn Glu Leu Ser Leu Ser Glu Leu Lys Phe Cys Lys Ala Thr Phe Pro Asn Glu Leu Ser Leu Ser Glu Leu Lys
165 170 175 165 170 175
Ile Val Val Asp Cys Ala Asn Gly Ala Thr Tyr His Ile Ala Pro Asn Ile Val Val Asp Cys Ala Asn Gly Ala Thr Tyr His Ile Ala Pro Asn
180 185 190 180 185 190
Val Leu Arg Glu Leu Gly Ala Asn Val Ile Ala Ile Gly Cys Glu Pro Val Leu Arg Glu Leu Gly Ala Asn Val Ile Ala Ile Gly Cys Glu Pro
195 200 205 195 200 205
Asn Gly Val Asn Ile Asn Ala Glu Val Gly Ala Thr Asp Val Arg Ala Asn Gly Val Asn Ile Asn Ala Glu Val Gly Ala Thr Asp Val Arg Ala
210 215 220 210 215 220
Leu Gln Ala Arg Val Leu Ala Glu Lys Ala Asp Leu Gly Ile Ala Phe Leu Gln Ala Arg Val Leu Ala Glu Lys Ala Asp Leu Gly Ile Ala Phe
225 230 235 240 225 230 235 240
Asp Gly Asp Gly Asp Arg Val Ile Met Val Asp His Glu Gly Asn Lys Asp Gly Asp Gly Asp Arg Val Ile Met Val Asp His Glu Gly Asn Lys
245 250 255 245 250 255
Val Asp Gly Asp Gln Ile Met Tyr Ile Ile Ala Arg Glu Gly Leu Arg Val Asp Gly Asp Gln Ile Met Tyr Ile Ile Ala Arg Glu Gly Leu Arg
260 265 270 260 265 270
Gln Gly Gln Leu Arg Gly Gly Ala Val Gly Thr Leu Met Ser Asn Met Gln Gly Gln Leu Arg Gly Gly Ala Val Gly Thr Leu Met Ser Asn Met
275 280 285 275 280 285
Gly Leu Glu Leu Ala Leu Lys Gln Leu Gly Ile Pro Phe Ala Arg Ala Gly Leu Glu Leu Ala Leu Lys Gln Leu Gly Ile Pro Phe Ala Arg Ala
290 295 300 290 295 300
Lys Val Gly Asp Arg Tyr Val Leu Glu Lys Met Gln Glu Lys Gly Trp Lys Val Gly Asp Arg Tyr Val Leu Glu Lys Met Gln Glu Lys Gly Trp
305 310 315 320 305 310 315 320
Arg Ile Gly Ala Glu Asn Ser Gly His Val Ile Leu Leu Asp Lys Thr Arg Ile Gly Ala Glu Asn Ser Gly His Val Ile Leu Leu Asp Lys Thr
325 330 335 325 330 335
Thr Thr Gly Asp Gly Ile Val Ala Gly Leu Gln Val Leu Ala Ala Met Thr Thr Gly Asp Gly Ile Val Ala Gly Leu Gln Val Leu Ala Ala Met
340 345 350 340 345 350
Ala Arg Asn His Met Ser Leu His Asp Leu Cys Ser Gly Met Lys Met Ala Arg Asn His Met Ser Leu His Asp Leu Cys Ser Gly Met Lys Met
355 360 365 355 360 365
Phe Pro Gln Ile Leu Val Asn Val Arg Tyr Thr Ala Gly Ser Gly Asp Phe Pro Gln Ile Leu Val Asn Val Arg Tyr Thr Ala Gly Ser Gly Asp
370 375 380 370 375 380
Pro Leu Glu His Glu Ser Val Lys Ala Val Thr Ala Glu Val Glu Ala Pro Leu Glu His Glu Ser Val Lys Ala Val Thr Ala Glu Val Glu Ala
385 390 395 400 385 390 395 400
Ala Leu Gly Asn Arg Gly Arg Val Leu Leu Arg Lys Ser Gly Thr Glu Ala Leu Gly Asn Arg Gly Arg Val Leu Leu Arg Lys Ser Gly Thr Glu
405 410 415 405 410 415
Pro Leu Ile Arg Val Met Val Glu Gly Glu Asp Glu Ala Gln Val Thr Pro Leu Ile Arg Val Met Val Glu Gly Glu Asp Glu Ala Gln Val Thr
420 425 430 420 425 430
Glu Phe Ala His Arg Ile Ala Asp Ala Val Lys Ala Val Glu Phe Ala His Arg Ile Ala Asp Ala Val Lys Ala Val
435 440 445 435 440 445
<210> 72<210> 72
<211> 1338<211> 1338
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 72<400> 72
atgagtaatc gtaaatattt cggtaccgat gggattcgtg gtcgtgtagg ggatgcgccg 60atgagtaatc gtaaatattt cggtaccgat gggattcgtg gtcgtgtagg ggatgcgccg 60
atcacacctg attttgtgct taagctgggt tgggccgcgg gtaaagtgct ggcgcgccac 120atcacacctg attttgtgct taagctgggt tgggccgcgg gtaaagtgct ggcgcgccac 120
ggctcccgta agattattat tggtaaagac acgcgtattt ctggctatat gctggagtca 180ggctcccgta agattattat tggtaaagac acgcgtattt ctggctatat gctggagtca 180
gcactggaag cgggtctggc ggcagcgggc ctttccgcac tcttcactgg cccgatgcca 240gcactggaag cgggtctggc ggcagcgggc ctttccgcac tcttcactgg cccgatgcca 240
acaccggccg tggcttatct gacgcgtacc ttccgcgcag aggccggaat tgtgatatct 300acaccggccg tggcttatct gacgcgtacc ttccgcgcag aggccggaat tgtgatatct 300
gcatcgcata acccgttcta cgataatggc attaaattct tctctatcga cggcaccaaa 360gcatcgcata acccgttcta cgataatggc attaaattct tctctatcga cggcaccaaa 360
ctgccggatg cggtagaaga ggccatcgaa gcggaaatgg aaaaggagat cagctgcgtt 420ctgccggatg cggtagaaga ggccatcgaa gcggaaatgg aaaaggagat cagctgcgtt 420
gattcggcag aactgggtaa agccagccgt atcgttgatg ccgcgggtcg ctatatcgag 480gattcggcag aactgggtaa agccagccgt atcgttgatg ccgcgggtcg ctatatcgag 480
ttttgcaaag ccacgttccc gaacgaactt agcctcagtg aactgaagat tgtggtggat 540ttttgcaaag ccacgttccc gaacgaactt agcctcagtg aactgaagat tgtggtggat 540
tgtgcaaacg gtgcgactta tcacatcgcg ccgaacgtgc tgcgcgaact gggggcgaac 600tgtgcaaacg gtgcgactta tcacatcgcg ccgaacgtgc tgcgcgaact gggggcgaac 600
gttatcgcta tcggttgtga gccaaacggt gtaaacatca atgccgaagt gggggctacc 660gttatcgcta tcggttgtga gccaaacggt gtaaacatca atgccgaagt gggggctacc 660
gacgttcgcg cgctccaggc tcgtgtgctg gctgaaaaag cggatctcgg tattgccttc 720gacgttcgcg cgctccaggc tcgtgtgctg gctgaaaaag cggatctcgg tattgccttc 720
gacggcgatg gcgatcgcgt gattatggtt gaccatgaag gcaataaagt cgatggcgat 780gacggcgatg gcgatcgcgt gattatggtt gaccatgaag gcaataaagt cgatggcgat 780
cagatcatgt atatcatcgc gcgtgaaggt cttcgtcagg gccagctgcg tggtggcgct 840cagatcatgt atatcatcgc gcgtgaaggt cttcgtcagg gccagctgcg tggtggcgct 840
gtgggtacat tgatgagcaa catggggctt gaactggcgc tgaaacagtt aggaattcca 900gtgggtacat tgatgagcaa catggggctt gaactggcgc tgaaacagtt aggaattcca 900
tttgcgcgcg cgaaagtggg tgaccgctac gtactggaaa aaatgcagga gaaaggctgg 960tttgcgcgcg cgaaagtggg tgaccgctac gtactggaaa aaatgcagga gaaaggctgg 960
cgtatcggtg cagagaattc cggtcatgtg atcctgctgg ataaaactac taccggtgac 1020cgtatcggtg cagagaattc cggtcatgtg atcctgctgg ataaaactac taccggtgac 1020
ggcatcgttg ctggcttgca ggtgctggcg gcgatggcac gtaaccatat gagcctgcac 1080ggcatcgttg ctggcttgca ggtgctggcg gcgatggcac gtaaccatat gagcctgcac 1080
gacctttgca gcggcatgaa aatgttcccg cagattctgg ttaacgtacg ttacaccgca 1140gacctttgca gcggcatgaa aatgttcccg cagattctgg ttaacgtacg ttacaccgca 1140
ggtagcggcg atccacttga gcatgagtca gttaaagccg tgaccgcaga ggttgaagct 1200ggtagcggcg atccacttga gcatgagtca gttaaagccg tgaccgcaga ggttgaagct 1200
gcgctgggca accgtggacg cgtgttgctg cgtaaatccg gcaccgaacc gttaattcgc 1260gcgctgggca accgtggacg cgtgttgctg cgtaaatccg gcaccgaacc gttaattcgc 1260
gtgatggtgg aaggcgaaga cgaagcgcag gtgactgaat ttgcacaccg catcgccgat 1320gtgatggtgg aaggcgaaga cgaagcgcag gtgactgaat ttgcacaccg catcgccgat 1320
gcagtaaaag ccgtttaa 1338gcagtaaaag ccgtttaa 1338
<210> 73<210> 73
<211> 456<211> 456
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 73<400> 73
Met Leu Asn Asn Ala Met Ser Val Val Ile Leu Ala Ala Gly Lys Gly Met Leu Asn Asn Ala Met Ser Val Val Ile Leu Ala Ala Gly Lys Gly
1 5 10 15 1 5 10 15
Thr Arg Met Tyr Ser Asp Leu Pro Lys Val Leu His Thr Leu Ala Gly Thr Arg Met Tyr Ser Asp Leu Pro Lys Val Leu His Thr Leu Ala Gly
20 25 30 20 25 30
Lys Ala Met Val Gln His Val Ile Asp Ala Ala Asn Glu Leu Gly Ala Lys Ala Met Val Gln His Val Ile Asp Ala Ala Asn Glu Leu Gly Ala
35 40 45 35 40 45
Ala His Val His Leu Val Tyr Gly His Gly Gly Asp Leu Leu Lys Gln Ala His Val His Leu Val Tyr Gly His Gly Gly Asp Leu Leu Lys Gln
50 55 60 50 55 60
Ala Leu Lys Asp Asp Asn Leu Asn Trp Val Leu Gln Ala Glu Gln Leu Ala Leu Lys Asp Asp Asn Leu Asn Trp Val Leu Gln Ala Glu Gln Leu
65 70 75 80 65 70 75 80
Gly Thr Gly His Ala Met Gln Gln Ala Ala Pro Phe Phe Ala Asp Asp Gly Thr Gly His Ala Met Gln Gln Ala Ala Pro Phe Phe Ala Asp Asp
85 90 95 85 90 95
Glu Asp Ile Leu Met Leu Tyr Gly Asp Val Pro Leu Ile Ser Val Glu Glu Asp Ile Leu Met Leu Tyr Gly Asp Val Pro Leu Ile Ser Val Glu
100 105 110 100 105 110
Thr Leu Gln Arg Leu Arg Asp Ala Lys Pro Gln Gly Gly Ile Gly Leu Thr Leu Gln Arg Leu Arg Asp Ala Lys Pro Gln Gly Gly Ile Gly Leu
115 120 125 115 120 125
Leu Thr Val Lys Leu Asp Asp Pro Thr Gly Tyr Gly Arg Ile Thr Arg Leu Thr Val Lys Leu Asp Asp Pro Thr Gly Tyr Gly Arg Ile Thr Arg
130 135 140 130 135 140
Glu Asn Gly Lys Val Thr Gly Ile Val Glu His Lys Asp Ala Thr Asp Glu Asn Gly Lys Val Thr Gly Ile Val Glu His Lys Asp Ala Thr Asp
145 150 155 160 145 150 155 160
Glu Gln Arg Gln Ile Gln Glu Ile Asn Thr Gly Ile Leu Ile Ala Asn Glu Gln Arg Gln Ile Gln Glu Ile Asn Thr Gly Ile Leu Ile Ala Asn
165 170 175 165 170 175
Gly Ala Asp Met Lys Arg Trp Leu Ala Lys Leu Thr Asn Asn Asn Ala Gly Ala Asp Met Lys Arg Trp Leu Ala Lys Leu Thr Asn Asn Asn Ala
180 185 190 180 185 190
Gln Gly Glu Tyr Tyr Ile Thr Asp Ile Ile Ala Leu Ala Tyr Gln Glu Gln Gly Glu Tyr Tyr Ile Thr Asp Ile Ile Ala Leu Ala Tyr Gln Glu
195 200 205 195 200 205
Gly Arg Glu Ile Val Ala Val His Pro Gln Arg Leu Ser Glu Val Glu Gly Arg Glu Ile Val Ala Val His Pro Gln Arg Leu Ser Glu Val Glu
210 215 220 210 215 220
Gly Val Asn Asn Arg Leu Gln Leu Ser Arg Leu Glu Arg Val Tyr Gln Gly Val Asn Asn Arg Leu Gln Leu Ser Arg Leu Glu Arg Val Tyr Gln
225 230 235 240 225 230 235 240
Ser Glu Gln Ala Glu Lys Leu Leu Leu Ala Gly Val Met Leu Arg Asp Ser Glu Gln Ala Glu Lys Leu Leu Leu Ala Gly Val Met Leu Arg Asp
245 250 255 245 250 255
Pro Ala Arg Phe Asp Leu Arg Gly Thr Leu Thr His Gly Arg Asp Val Pro Ala Arg Phe Asp Leu Arg Gly Thr Leu Thr His Gly Arg Asp Val
260 265 270 260 265 270
Glu Ile Asp Thr Asn Val Ile Ile Glu Gly Asn Val Thr Leu Gly His Glu Ile Asp Thr Asn Val Ile Ile Glu Gly Asn Val Thr Leu Gly His
275 280 285 275 280 285
Arg Val Lys Ile Gly Thr Gly Cys Val Ile Lys Asn Ser Val Ile Gly Arg Val Lys Ile Gly Thr Gly Cys Val Ile Lys Asn Ser Val Ile Gly
290 295 300 290 295 300
Asp Asp Cys Glu Ile Ser Pro Tyr Thr Val Val Glu Asp Ala Asn Leu Asp Asp Cys Glu Ile Ser Pro Tyr Thr Val Val Glu Asp Ala Asn Leu
305 310 315 320 305 310 315 320
Ala Ala Ala Cys Thr Ile Gly Pro Phe Ala Arg Leu Arg Pro Gly Ala Ala Ala Ala Cys Thr Ile Gly Pro Phe Ala Arg Leu Arg Pro Gly Ala
325 330 335 325 330 335
Glu Leu Leu Glu Gly Ala His Val Gly Asn Phe Val Glu Met Lys Lys Glu Leu Leu Glu Gly Ala His Val Gly Asn Phe Val Glu Met Lys Lys
340 345 350 340 345 350
Ala Arg Leu Gly Lys Gly Ser Lys Ala Gly His Leu Thr Tyr Leu Gly Ala Arg Leu Gly Lys Gly Ser Lys Ala Gly His Leu Thr Tyr Leu Gly
355 360 365 355 360 365
Asp Ala Glu Ile Gly Asp Asn Val Asn Ile Gly Ala Gly Thr Ile Thr Asp Ala Glu Ile Gly Asp Asn Val Asn Ile Gly Ala Gly Thr Ile Thr
370 375 380 370 375 380
Cys Asn Tyr Asp Gly Ala Asn Lys Phe Lys Thr Ile Ile Gly Asp Asp Cys Asn Tyr Asp Gly Ala Asn Lys Phe Lys Thr Ile Ile Gly Asp Asp
385 390 395 400 385 390 395 400
Val Phe Val Gly Ser Asp Thr Gln Leu Val Ala Pro Val Thr Val Gly Val Phe Val Gly Ser Asp Thr Gln Leu Val Ala Pro Val Thr Val Gly
405 410 415 405 410 415
Lys Gly Ala Thr Ile Ala Ala Gly Thr Thr Val Thr Arg Asn Val Gly Lys Gly Ala Thr Ile Ala Ala Gly Thr Thr Val Thr Arg Asn Val Gly
420 425 430 420 425 430
Glu Asn Ala Leu Ala Ile Ser Arg Val Pro Gln Thr Gln Lys Glu Gly Glu Asn Ala Leu Ala Ile Ser Arg Val Pro Gln Thr Gln Lys Glu Gly
435 440 445 435 440 445
Trp Arg Arg Pro Val Lys Lys Lys Trp Arg Arg Pro Val Lys Lys Lys
450 455 450 455
<210> 74<210> 74
<211> 1371<211> 1371
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 74<400> 74
atgttgaata atgctatgag cgtagtgatc cttgccgcag gcaaaggcac gcgcatgtat 60atgttgaata atgctatgag cgtagtgatc cttgccgcag gcaaaggcac gcgcatgtat 60
tccgatcttc cgaaagtgct gcataccctt gccgggaaag cgatggttca gcatgtcatt 120tccgatcttc cgaaagtgct gcataccctt gccgggaaag cgatggttca gcatgtcatt 120
gatgctgcga atgaattagg cgcagcgcac gttcacctgg tgtacggtca cggcggcgat 180gatgctgcga atgaattagg cgcagcgcac gttcacctgg tgtacggtca cggcggcgat 180
ctgctaaaac aggcgctgaa agacgacaac cttaactggg tgcttcaggc agagcagctg 240ctgctaaaac aggcgctgaa agacgacaac cttaactggg tgcttcaggc agagcagctg 240
ggtacgggtc atgcaatgca gcaggccgca cctttctttg ccgatgatga agacatttta 300ggtacgggtc atgcaatgca gcaggccgca cctttctttg ccgatgatga agacatttta 300
atgctctacg gcgacgtgcc gctgatctct gtcgaaacac tccagcgtct gcgtgatgct 360atgctctacg gcgacgtgcc gctgatctct gtcgaaacac tccagcgtct gcgtgatgct 360
aaaccgcagg gtggcattgg tctgctgacg gtgaaactgg atgatccgac cggttatgga 420aaaccgcagg gtggcattgg tctgctgacg gtgaaactgg atgatccgac cggttatgga 420
cgtatcaccc gtgaaaacgg caaagttacc ggcattgttg agcacaaaga tgccaccgac 480cgtatcaccc gtgaaaacgg caaagttacc ggcattgttg agcacaaaga tgccaccgac 480
gagcagcgtc agattcagga gatcaacacc ggcattctga ttgccaacgg cgcagatatg 540gagcagcgtc agattcagga gatcaacacc ggcattctga ttgccaacgg cgcagatatg 540
aaacgctggc tggcgaagct gaccaacaat aatgctcagg gcgaatacta catcaccgac 600aaacgctggc tggcgaagct gaccaacaat aatgctcagg gcgaatacta catcaccgac 600
attattgcgc tggcgtatca ggaagggcgt gaaatcgtcg ccgttcatcc gcaacgttta 660attattgcgc tggcgtatca ggaagggcgt gaaatcgtcg ccgttcatcc gcaacgttta 660
agcgaagtag aaggcgtgaa taaccgcctg caactctccc gtctggagcg tgtttatcag 720agcgaagtag aaggcgtgaa taaccgcctg caactctccc gtctggagcg tgtttatcag 720
tccgaacagg ctgaaaaact gctgttagca ggcgttatgc tgcgcgatcc agcgcgtttt 780tccgaacagg ctgaaaaact gctgttagca ggcgttatgc tgcgcgatcc agcgcgtttt 780
gatctgcgtg gtacgctaac tcacgggcgc gatgttgaaa ttgatactaa cgttatcatc 840gatctgcgtg gtacgctaac tcacgggcgc gatgttgaaa ttgatactaa cgttatcatc 840
gagggcaacg tgactctcgg tcatcgcgtg aaaattggca ccggttgcgt gattaaaaac 900gagggcaacg tgactctcgg tcatcgcgtg aaaattggca ccggttgcgt gattaaaaac 900
agcgtgattg gcgatgattg cgaaatcagt ccgtataccg ttgtggaaga tgcgaatctg 960agcgtgattg gcgatgattg cgaaatcagt ccgtataccg ttgtggaaga tgcgaatctg 960
gcagcggcct gtaccattgg cccgtttgcc cgtttgcgtc ctggtgctga gttgctggaa 1020gcagcggcct gtaccattgg cccgtttgcc cgtttgcgtc ctggtgctga gttgctggaa 1020
ggtgctcacg tcggtaactt cgttgagatg aaaaaagcgc gtctgggtaa aggctcgaaa 1080ggtgctcacg tcggtaactt cgttgagatg aaaaaagcgc gtctgggtaa aggctcgaaa 1080
gctggtcatc tgacttacct gggcgatgcg gaaattggcg ataacgttaa catcggcgcg 1140gctggtcatc tgacttacct gggcgatgcg gaaattggcg ataacgttaa catcggcgcg 1140
ggaaccatta cctgcaacta cgatggtgcg aataaattta agaccattat cggcgacgat 1200ggaaccatta cctgcaacta cgatggtgcg aataaattta agaccattat cggcgacgat 1200
gtgtttgttg gttccgacac tcagctggtg gccccggtaa cagtaggcaa aggcgcgacc 1260gtgtttgttg gttccgacac tcagctggtg gccccggtaa cagtaggcaa aggcgcgacc 1260
attgctgcgg gtacaactgt gacgcgtaat gtcggcgaaa atgcattagc tatcagccgt 1320attgctgcgg gtacaactgt gacgcgtaat gtcggcgaaa atgcattagc tatcagccgt 1320
gtgccgcaga ctcagaaaga aggctggcgt cgtccggtaa agaaaaagtg a 1371gtgccgcaga ctcagaaaga aggctggcgt cgtccggtaa agaaaaagtg a 1371
<210> 75<210> 75
<211> 391<211> 391
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 75<400> 75
Met Lys Lys Ile Leu Tyr Val Thr Gly Ser Arg Ala Glu Tyr Gly Ile Met Lys Lys Ile Leu Tyr Val Thr Gly Ser Arg Ala Glu Tyr Gly Ile
1 5 10 15 1 5 10 15
Val Arg Arg Leu Leu Thr Met Leu Arg Glu Thr Pro Glu Ile Gln Leu Val Arg Arg Leu Leu Thr Met Leu Arg Glu Thr Pro Glu Ile Gln Leu
20 25 30 20 25 30
Asp Leu Ala Val Thr Gly Met His Cys Asp Asn Ala Tyr Gly Asn Thr Asp Leu Ala Val Thr Gly Met His Cys Asp Asn Ala Tyr Gly Asn Thr
35 40 45 35 40 45
Ile His Ile Ile Glu Gln Asp Asn Phe Asn Ile Ile Lys Val Val Asp Ile His Ile Ile Glu Gln Asp Asn Phe Asn Ile Ile Lys Val Val Asp
50 55 60 50 55 60
Ile Asn Ile Asn Thr Thr Ser His Thr His Ile Leu His Ser Met Ser Ile Asn Ile Asn Thr Thr Ser His Thr His Ile Leu His Ser Met Ser
65 70 75 80 65 70 75 80
Val Cys Leu Asn Ser Phe Gly Asp Phe Phe Ser Asn Asn Thr Tyr Asp Val Cys Leu Asn Ser Phe Gly Asp Phe Phe Ser Asn Asn Thr Tyr Asp
85 90 95 85 90 95
Ala Val Met Val Leu Gly Asp Arg Tyr Glu Ile Phe Ser Val Ala Ile Ala Val Met Val Leu Gly Asp Arg Tyr Glu Ile Phe Ser Val Ala Ile
100 105 110 100 105 110
Ala Ala Ser Met His Asn Ile Pro Leu Ile His Ile His Gly Gly Glu Ala Ala Ser Met His Asn Ile Pro Leu Ile His Ile His Gly Gly Glu
115 120 125 115 120 125
Lys Thr Leu Ala Asn Tyr Asp Glu Phe Ile Arg His Ser Ile Thr Lys Lys Thr Leu Ala Asn Tyr Asp Glu Phe Ile Arg His Ser Ile Thr Lys
130 135 140 130 135 140
Met Ser Lys Leu His Leu Thr Ser Thr Glu Glu Tyr Lys Lys Arg Val Met Ser Lys Leu His Leu Thr Ser Thr Glu Glu Tyr Lys Lys Arg Val
145 150 155 160 145 150 155 160
Ile Gln Leu Gly Glu Lys Pro Gly Ser Val Phe Asn Ile Gly Ser Leu Ile Gln Leu Gly Glu Lys Pro Gly Ser Val Phe Asn Ile Gly Ser Leu
165 170 175 165 170 175
Gly Ala Glu Asn Ala Leu Ser Leu His Leu Pro Asn Lys Gln Glu Leu Gly Ala Glu Asn Ala Leu Ser Leu His Leu Pro Asn Lys Gln Glu Leu
180 185 190 180 185 190
Glu Leu Lys Tyr Gly Ser Leu Leu Lys Arg Tyr Phe Val Val Val Phe Glu Leu Lys Tyr Gly Ser Leu Leu Lys Arg Tyr Phe Val Val Val Phe
195 200 205 195 200 205
His Pro Glu Thr Leu Ser Thr Gln Ser Val Asn Asp Gln Ile Asp Glu His Pro Glu Thr Leu Ser Thr Gln Ser Val Asn Asp Gln Ile Asp Glu
210 215 220 210 215 220
Leu Leu Ser Ala Ile Ser Phe Phe Lys Asn Thr His Asp Phe Ile Phe Leu Leu Ser Ala Ile Ser Phe Phe Lys Asn Thr His Asp Phe Ile Phe
225 230 235 240 225 230 235 240
Ile Gly Ser Asn Ala Asp Thr Gly Ser Asp Ile Ile Gln Arg Lys Val Ile Gly Ser Asn Ala Asp Thr Gly Ser Asp Ile Ile Gln Arg Lys Val
245 250 255 245 250 255
Lys Tyr Phe Cys Lys Glu Tyr Lys Phe Arg Tyr Leu Ile Ser Ile Arg Lys Tyr Phe Cys Lys Glu Tyr Lys Phe Arg Tyr Leu Ile Ser Ile Arg
260 265 270 260 265 270
Ser Glu Asp Tyr Leu Ala Met Ile Lys Tyr Ser Cys Gly Leu Ile Gly Ser Glu Asp Tyr Leu Ala Met Ile Lys Tyr Ser Cys Gly Leu Ile Gly
275 280 285 275 280 285
Asn Ser Ser Ser Gly Leu Ile Glu Val Pro Ser Leu Lys Val Ala Thr Asn Ser Ser Ser Gly Leu Ile Glu Val Pro Ser Leu Lys Val Ala Thr
290 295 300 290 295 300
Ile Asn Ile Gly Asp Arg Gln Lys Gly Arg Val Arg Gly Ala Ser Val Ile Asn Ile Gly Asp Arg Gln Lys Gly Arg Val Arg Gly Ala Ser Val
305 310 315 320 305 310 315 320
Ile Asp Val Pro Val Glu Lys Asn Ala Ile Val Arg Gly Ile Asn Ile Ile Asp Val Pro Val Glu Lys Asn Ala Ile Val Arg Gly Ile Asn Ile
325 330 335 325 330 335
Ser Gln Asp Glu Lys Phe Ile Ser Val Val Gln Ser Ser Ser Asn Pro Ser Gln Asp Glu Lys Phe Ile Ser Val Val Gln Ser Ser Ser Asn Pro
340 345 350 340 345 350
Tyr Phe Lys Glu Asn Ala Leu Ile Asn Ala Val Arg Ile Ile Lys Asp Tyr Phe Lys Glu Asn Ala Leu Ile Asn Ala Val Arg Ile Ile Lys Asp
355 360 365 355 360 365
Phe Ile Lys Ser Lys Asn Lys Asp Tyr Lys Asp Phe Tyr Asp Ile Pro Phe Ile Lys Ser Lys Asn Lys Asp Tyr Lys Asp Phe Tyr Asp Ile Pro
370 375 380 370 375 380
Glu Cys Thr Thr Ser Tyr Asp Glu Cys Thr Thr Ser Tyr Asp
385 390 385 390
<210> 76<210> 76
<211> 1176<211> 1176
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 76<400> 76
atgaaaaaaa tattatacgt aactggatct agagctgaat atggaatagt tcggagactt 60atgaaaaaaa tattatacgt aactggatct agagctgaat atggaatagt tcggagactt 60
ttgacaatgc taagagaaac tccagaaata cagcttgatt tggcagttac aggaatgcat 120ttgacaatgc taagagaaac tccagaaata cagcttgatt tggcagttac aggaatgcat 120
tgtgataatg cgtatggaaa tacaatacat attatagaac aagataattt taatattatc 180tgtgataatg cgtatggaaa tacaatacat attatagaac aagataattt taatattatc 180
aaggttgtgg atataaatat caatacaact tcacatactc acattctcca ttcaatgagt 240aaggttgtgg atataaatat caatacaact tcacatactc acattctcca ttcaatgagt 240
gtttgcctca attcgtttgg tgattttttt tcaaataaca catatgatgc ggttatggtt 300gtttgcctca attcgtttgg tgattttttt tcaaataaca catatgatgc ggttatggtt 300
ttaggcgata gatatgaaat attttcagtc gctatcgcag catcaatgca taatattcca 360ttaggcgata gatatgaaat attttcagtc gctatcgcag catcaatgca taatattcca 360
ttaattcata ttcatggtgg tgaaaagaca ttagctaatt atgatgagtt tattaggcat 420ttaattcata ttcatggtgg tgaaaagaca ttagctaatt atgatgagtt tattaggcat 420
tcaattacta aaatgagtaa actccatctt acttctacag aagagtataa aaaacgagta 480tcaattacta aaatgagtaa actccatctt acttctacag aagagtataa aaaacgagta 480
attcaactag gtgaaaagcc tggtagtgtg tttaatattg gttctcttgg tgcagaaaat 540attcaactag gtgaaaagcc tggtagtgtg tttaatattg gttctcttgg tgcagaaaat 540
gctctttcat tgcatttacc aaataagcag gagttggaac taaaatatgg ttcactgtta 600gctctttcat tgcatttacc aaataagcag gagttggaac taaaatatgg ttcactgtta 600
aaacggtact ttgttgtagt attccatcct gaaacacttt ccacgcagtc ggttaatgat 660aaacggtact ttgttgtagt attccatcct gaaacacttt ccacgcagtc ggttaatgat 660
caaatagatg agttattgtc agcgatttct ttttttaaaa atactcacga ctttattttt 720caaatagatg agttattgtc agcgatttct ttttttaaaa atactcacga ctttattttt 720
attggcagta acgctgacac tggttctgat ataattcaga gaaaagtaaa atatttttgc 780attggcagta acgctgacac tggttctgat ataattcaga gaaaagtaaa atatttttgc 780
aaagagtata agttcagata tttgatttct attcgttcag aagattattt ggcaatgatt 840aaagagtata agttcagata tttgatttct attcgttcag aagattattt ggcaatgatt 840
aaatactctt gtgggctaat tgggaactcc tcctctggtt taattgaggt tccatcttta 900aaatactctt gtgggctaat tgggaactcc tcctctggtt taattgaggt tccatcttta 900
aaagttgcaa caattaacat tggtgatagg cagaaaggcc gtgttcgtgg agccagtgta 960aaagttgcaa caattaacat tggtgatagg cagaaaggcc gtgttcgtgg agccagtgta 960
atagatgtac ccgttgaaaa aaatgcaatc gtcagaggga taaatatatc tcaagatgaa 1020atagatgtac ccgttgaaaa aaatgcaatc gtcagaggga taaatatatc tcaagatgaa 1020
aaatttatta gtgttgtaca gtcatctagt aatccttatt ttaaagaaaa tgctttaatt 1080aaatttatta gtgttgtaca gtcatctagt aatccttatt ttaaagaaaa tgctttaatt 1080
aatgctgtta gaattattaa ggattttatt aaatcaaaaa ataaagatta caaagatttt 1140aatgctgtta gaattattaa ggattttatt aaatcaaaaa ataaagatta caaagatttt 1140
tatgacatcc cggaatgtac caccagttat gactag 1176tatgacatcc cggaatgtac caccagttat gactag 1176
<210> 77<210> 77
<211> 159<211> 159
<212> Белок<212> Protein
<213> Saccharomyces cerevisiae<213> Saccharomyces cerevisiae
<400> 77<400> 77
Met Ser Leu Pro Asp Gly Phe Tyr Ile Arg Arg Met Glu Glu Gly Asp Met Ser Leu Pro Asp Gly Phe Tyr Ile Arg Arg Met Glu Glu Gly Asp
1 5 10 15 1 5 10 15
Leu Glu Gln Val Thr Glu Thr Leu Lys Val Leu Thr Thr Val Gly Thr Leu Glu Gln Val Thr Glu Thr Leu Lys Val Leu Thr Thr Val Gly Thr
20 25 30 20 25 30
Ile Thr Pro Glu Ser Phe Ser Lys Leu Ile Lys Tyr Trp Asn Glu Ala Ile Thr Pro Glu Ser Phe Ser Lys Leu Ile Lys Tyr Trp Asn Glu Ala
35 40 45 35 40 45
Thr Val Trp Asn Asp Asn Glu Asp Lys Lys Ile Met Gln Tyr Asn Pro Thr Val Trp Asn Asp Asn Glu Asp Lys Lys Ile Met Gln Tyr Asn Pro
50 55 60 50 55 60
Met Val Ile Val Asp Lys Arg Thr Glu Thr Val Ala Ala Thr Gly Asn Met Val Ile Val Asp Lys Arg Thr Glu Thr Val Ala Ala Thr Gly Asn
65 70 75 80 65 70 75 80
Ile Ile Ile Glu Arg Lys Ile Ile His Glu Leu Gly Leu Cys Gly His Ile Ile Ile Glu Arg Lys Ile Ile His Glu Leu Gly Leu Cys Gly His
85 90 95 85 90 95
Ile Glu Asp Ile Ala Val Asn Ser Lys Tyr Gln Gly Gln Gly Leu Gly Ile Glu Asp Ile Ala Val Asn Ser Lys Tyr Gln Gly Gln Gly Leu Gly
100 105 110 100 105 110
Lys Leu Leu Ile Asp Gln Leu Val Thr Ile Gly Phe Asp Tyr Gly Cys Lys Leu Leu Ile Asp Gln Leu Val Thr Ile Gly Phe Asp Tyr Gly Cys
115 120 125 115 120 125
Tyr Lys Ile Ile Leu Asp Cys Asp Glu Lys Asn Val Lys Phe Tyr Glu Tyr Lys Ile Ile Leu Asp Cys Asp Glu Lys Asn Val Lys Phe Tyr Glu
130 135 140 130 135 140
Lys Cys Gly Phe Ser Asn Ala Gly Val Glu Met Gln Ile Arg Lys Lys Cys Gly Phe Ser Asn Ala Gly Val Glu Met Gln Ile Arg Lys
145 150 155 145 150 155
<210> 78<210> 78
<211> 480<211> 480
<212> ДНК<212> DNA
<213> Saccharomyces cerevisiae<213> Saccharomyces cerevisiae
<400> 78<400> 78
atgagcttac ccgatggatt ttatataagg cgaatggaag agggggattt ggaacaggtc 60atgagcttac ccgatggatt ttatataagg cgaatggaag agggggattt ggaacaggtc 60
actgagacgc taaaggtttt gaccaccgtg ggcactatta cccccgaatc cttcagcaaa 120actgagacgc taaaggtttt gaccaccgtg ggcactatta cccccgaatc cttcagcaaa 120
ctcataaaat actggaatga agccacagta tggaatgata acgaagataa aaaaataatg 180ctcataaaat actggaatga agccacagta tggaatgata acgaagataa aaaaataatg 180
caatataacc ccatggtgat tgtggacaag cgcaccgaga cggttgccgc tacggggaat 240caatataacc ccatggtgat tgtggacaag cgcaccgaga cggttgccgc tacggggaat 240
atcatcatcg aaagaaagat cattcatgaa ctggggctat gtggccacat cgaggacatt 300atcatcatcg aaagaaagat cattcatgaa ctggggctat gtggccacat cgaggacatt 300
gcagtaaact ccaagtatca gggccaaggt ttgggcaagc tcttgattga tcaattggta 360gcagtaaact ccaagtatca gggccaaggt ttgggcaagc tcttgattga tcaattggta 360
actatcggct ttgactacgg ttgttataag attattttag attgcgatga gaaaaatgtc 420actatcggct ttgactacgg ttgttataag attattttag attgcgatga gaaaaatgtc 420
aaattctatg aaaaatgtgg gtttagcaac gcaggcgtgg aaatgcaaat tagaaaatag 480aaattctatg aaaaatgtgg gtttagcaac gcaggcgtgg aaatgcaaat tagaaaatag 480
<210> 79<210> 79
<211> 188<211> 188
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 79<400> 79
Met Tyr Glu Arg Tyr Ala Gly Leu Ile Phe Asp Met Asp Gly Thr Ile Met Tyr Glu Arg Tyr Ala Gly Leu Ile Phe Asp Met Asp Gly Thr Ile
1 5 10 15 1 5 10 15
Leu Asp Thr Glu Pro Thr His Arg Lys Ala Trp Arg Glu Val Leu Gly Leu Asp Thr Glu Pro Thr His Arg Lys Ala Trp Arg Glu Val Leu Gly
20 25 30 20 25 30
His Tyr Gly Leu Gln Tyr Asp Ile Gln Ala Met Ile Ala Leu Asn Gly His Tyr Gly Leu Gln Tyr Asp Ile Gln Ala Met Ile Ala Leu Asn Gly
35 40 45 35 40 45
Ser Pro Thr Trp Arg Ile Ala Gln Ala Ile Ile Glu Leu Asn Gln Ala Ser Pro Thr Trp Arg Ile Ala Gln Ala Ile Ile Glu Leu Asn Gln Ala
50 55 60 50 55 60
Asp Leu Asp Pro His Ala Leu Ala Arg Glu Lys Thr Glu Ala Val Arg Asp Leu Asp Pro His Ala Leu Ala Arg Glu Lys Thr Glu Ala Val Arg
65 70 75 80 65 70 75 80
Ser Met Leu Leu Asp Ser Val Glu Pro Leu Pro Leu Val Asp Val Val Ser Met Leu Leu Asp Ser Val Glu Pro Leu Pro Leu Val Asp Val Val
85 90 95 85 90 95
Lys Ser Trp His Gly Arg Arg Pro Met Ala Val Gly Thr Gly Ser Glu Lys Ser Trp His Gly Arg Arg Pro Met Ala Val Gly Thr Gly Ser Glu
100 105 110 100 105 110
Ser Ala Ile Ala Glu Ala Leu Leu Ala His Leu Gly Leu Arg His Tyr Ser Ala Ile Ala Glu Ala Leu Leu Ala His Leu Gly Leu Arg His Tyr
115 120 125 115 120 125
Phe Asp Ala Val Val Ala Ala Asp His Val Lys His His Lys Pro Ala Phe Asp Ala Val Val Ala Ala Asp His Val Lys His His Lys Pro Ala
130 135 140 130 135 140
Pro Asp Thr Phe Leu Leu Cys Ala Gln Arg Met Gly Val Gln Pro Thr Pro Asp Thr Phe Leu Leu Cys Ala Gln Arg Met Gly Val Gln Pro Thr
145 150 155 160 145 150 155 160
Gln Cys Val Val Phe Glu Asp Ala Asp Phe Gly Ile Gln Ala Ala Arg Gln Cys Val Val Phe Glu Asp Ala Asp Phe Gly Ile Gln Ala Ala Arg
165 170 175 165 170 175
Ala Ala Gly Met Asp Ala Val Asp Val Arg Leu Leu Ala Ala Gly Met Asp Ala Val Asp Val Arg Leu Leu
180 185 180 185
<210> 80<210> 80
<211> 199<211> 199
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 80<400> 80
Met Leu Tyr Ile Phe Asp Leu Gly Asn Val Ile Val Asp Ile Asp Phe Met Leu Tyr Ile Phe Asp Leu Gly Asn Val Ile Val Asp Ile Asp Phe
1 5 10 15 1 5 10 15
Asn Arg Val Leu Gly Ala Trp Ser Asp Leu Thr Arg Ile Pro Leu Ala Asn Arg Val Leu Gly Ala Trp Ser Asp Leu Thr Arg Ile Pro Leu Ala
20 25 30 20 25 30
Ser Leu Lys Lys Ser Phe His Met Gly Glu Ala Phe His Gln His Glu Ser Leu Lys Lys Ser Phe His Met Gly Glu Ala Phe His Gln His Glu
35 40 45 35 40 45
Arg Gly Glu Ile Ser Asp Glu Ala Phe Ala Glu Ala Leu Cys His Glu Arg Gly Glu Ile Ser Asp Glu Ala Phe Ala Glu Ala Leu Cys His Glu
50 55 60 50 55 60
Met Ala Leu Pro Leu Ser Tyr Glu Gln Phe Ser His Gly Trp Gln Ala Met Ala Leu Pro Leu Ser Tyr Glu Gln Phe Ser His Gly Trp Gln Ala
65 70 75 80 65 70 75 80
Val Phe Val Ala Leu Arg Pro Glu Val Ile Ala Ile Met His Lys Leu Val Phe Val Ala Leu Arg Pro Glu Val Ile Ala Ile Met His Lys Leu
85 90 95 85 90 95
Arg Glu Gln Gly His Arg Val Val Val Leu Ser Asn Thr Asn Arg Leu Arg Glu Gln Gly His Arg Val Val Val Leu Ser Asn Thr Asn Arg Leu
100 105 110 100 105 110
His Thr Thr Phe Trp Pro Glu Glu Tyr Pro Glu Ile Arg Asp Ala Ala His Thr Thr Phe Trp Pro Glu Glu Tyr Pro Glu Ile Arg Asp Ala Ala
115 120 125 115 120 125
Asp His Ile Tyr Leu Ser Gln Asp Leu Gly Met Arg Lys Pro Glu Ala Asp His Ile Tyr Leu Ser Gln Asp Leu Gly Met Arg Lys Pro Glu Ala
130 135 140 130 135 140
Arg Ile Tyr Gln His Val Leu Gln Ala Glu Gly Phe Ser Pro Ser Asp Arg Ile Tyr Gln His Val Leu Gln Ala Glu Gly Phe Ser Pro Ser Asp
145 150 155 160 145 150 155 160
Thr Val Phe Phe Asp Asp Asn Ala Asp Asn Ile Glu Gly Ala Asn Gln Thr Val Phe Phe Asp Asp Asn Ala Asp Asn Ile Glu Gly Ala Asn Gln
165 170 175 165 170 175
Leu Gly Ile Thr Ser Ile Leu Val Lys Asp Lys Thr Thr Ile Pro Asp Leu Gly Ile Thr Ser Ile Leu Val Lys Asp Lys Thr Thr Ile Pro Asp
180 185 190 180 185 190
Tyr Phe Ala Lys Val Leu Cys Tyr Phe Ala Lys Val Leu Cys
195 195
<210> 81<210> 81
<211> 567<211> 567
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 81<400> 81
atgtacgagc gttatgcagg tttaattttt gatatggatg gcacaatcct ggatacggag 60atgtacgagc gttatgcagg tttaattttt gatatggatg gcacaatcct ggatacggag 60
cctacgcacc gtaaagcgtg gcgcgaagta ttagggcact acggtcttca gtacgatatt 120cctacgcacc gtaaagcgtg gcgcgaagta ttagggcact acggtcttca gtacgatatt 120
caggcgatga ttgcgcttaa tggatcgccc acctggcgta ttgctcaggc aattattgag 180caggcgatga ttgcgcttaa tggatcgccc acctggcgta ttgctcaggc aattattgag 180
ctgaatcagg ccgatctcga cccgcatgcg ttagcgcgtg aaaaaacaga agcagtaaga 240ctgaatcagg ccgatctcga cccgcatgcg ttagcgcgtg aaaaaacaga agcagtaaga 240
agtatgctgc tggatagcgt cgaaccgctt cctcttgttg atgtggtgaa aagttggcat 300agtatgctgc tggatagcgt cgaaccgctt cctcttgttg atgtggtgaa aagttggcat 300
ggtcgtcgcc caatggctgt aggaacgggg agtgaaagcg ccatcgctga ggcattgctg 360ggtcgtcgcc caatggctgt aggaacgggg agtgaaagcg ccatcgctga ggcattgctg 360
gcgcacctgg gattacgcca ttattttgac gccgtcgtcg ctgccgatca cgtcaaacac 420gcgcacctgg gattacgcca ttattttgac gccgtcgtcg ctgccgatca cgtcaaacac 420
cataaacccg cgccagacac atttttgttg tgcgcgcagc gtatgggcgt gcaaccgacg 480cataaacccg cgccagacac atttttgttg tgcgcgcagc gtatgggcgt gcaaccgacg 480
cagtgtgtgg tctttgaaga tgccgatttc ggtattcagg cggcccgtgc agcaggcatg 540cagtgtgtgg tctttgaaga tgccgatttc ggtattcagg cggcccgtgc agcaggcatg 540
gacgccgtgg atgttcgctt gctgtga 567gacgccgtgg atgttcgctt gctgtga 567
<210> 82<210> 82
<211> 600<211> 600
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 82<400> 82
atgctctata tctttgattt aggtaatgtg attgtcgata tcgactttaa ccgtgtgctg 60atgctctata tctttgattt aggtaatgtg attgtcgata tcgactttaa ccgtgtgctg 60
ggagcctgga gcgatttaac gcgtattccg ctggcatcgc ttaagaagag ttttcatatg 120ggagcctgga gcgatttaac gcgtattccg ctggcatcgc ttaagaagag ttttcatatg 120
ggggaggcgt ttcatcagca tgagcgtggg gaaattagcg acgaagcgtt cgcagaggcg 180ggggaggcgt ttcatcagca tgagcgtggg gaaattagcg acgaagcgtt cgcagaggcg 180
ctgtgtcatg agatggctct accgctaagc tacgagcagt tctctcacgg ctggcaggcg 240ctgtgtcatg agatggctct accgctaagc tacgagcagt tctctcacgg ctggcaggcg 240
gtgtttgttg cgctgcgccc ggaagtgatc gccatcatgc ataaactgcg tgagcagggg 300gtgtttgttg cgctgcgccc ggaagtgatc gccatcatgc ataaactgcg tgagcagggg 300
catcgcgtgg tggtgctttc caataccaac cgcctgcata ccaccttctg gccggaagaa 360catcgcgtgg tggtgctttc caataccaac cgcctgcata ccaccttctg gccggaagaa 360
tacccggaaa ttcgtgatgc tgctgaccat atctatctgt cgcaagatct ggggatgcgc 420tacccggaaa ttcgtgatgc tgctgaccat atctatctgt cgcaagatct ggggatgcgc 420
aaacctgaag cacgaattta ccagcatgtt ttgcaggcgg aaggtttttc acccagcgat 480aaacctgaag cacgaattta ccagcatgtt ttgcaggcgg aaggtttttc acccagcgat 480
acggtctttt tcgacgataa cgccgataat atagaaggag ccaatcagct gggcattacc 540acggtctttt tcgacgataa cgccgataat atagaaggag ccaatcagct gggcattacc 540
agtattctgg tgaaagataa aaccaccatc ccggactatt tcgcgaaggt gttatgctaa 600agtattctgg tgaaagataa aaccaccatc ccggactatt tcgcgaaggt gttatgctaa 600
<210> 83<210> 83
<211> 421<211> 421
<212> Белок<212> Protein
<213> Bacteroides ovatus<213> Bacteroides ovatus
<400> 83<400> 83
Met Asp Ser Lys Asn Asn Ile Gly His Ser Ala Asp Ile Ser Leu Thr Met Asp Ser Lys Asn Asn Ile Gly His Ser Ala Asp Ile Ser Leu Thr
1 5 10 15 1 5 10 15
Ala Glu Leu Pro Ile Pro Ile Tyr Asn Gly Asn Thr Ile Met Asp Phe Ala Glu Leu Pro Ile Pro Ile Tyr Asn Gly Asn Thr Ile Met Asp Phe
20 25 30 20 25 30
Lys Lys Leu Ala Ser Leu Tyr Lys Asp Glu Leu Leu Asp Asn Val Leu Lys Lys Leu Ala Ser Leu Tyr Lys Asp Glu Leu Leu Asp Asn Val Leu
35 40 45 35 40 45
Pro Phe Trp Leu Glu His Ser Gln Asp His Glu Tyr Gly Gly Tyr Phe Pro Phe Trp Leu Glu His Ser Gln Asp His Glu Tyr Gly Gly Tyr Phe
50 55 60 50 55 60
Thr Cys Leu Asp Arg Glu Gly Lys Val Phe Asp Thr Asp Lys Phe Ile Thr Cys Leu Asp Arg Glu Gly Lys Val Phe Asp Thr Asp Lys Phe Ile
65 70 75 80 65 70 75 80
Trp Leu Gln Ser Arg Glu Val Trp Met Phe Ser Met Leu Tyr Asn Lys Trp Leu Gln Ser Arg Glu Val Trp Met Phe Ser Met Leu Tyr Asn Lys
85 90 95 85 90 95
Val Glu Lys Arg Gln Glu Trp Leu Asp Cys Ala Ile Gln Gly Gly Glu Val Glu Lys Arg Gln Glu Trp Leu Asp Cys Ala Ile Gln Gly Gly Glu
100 105 110 100 105 110
Phe Leu Lys Lys Tyr Gly His Asp Gly Asn Tyr Asn Trp Tyr Phe Ser Phe Leu Lys Lys Tyr Gly His Asp Gly Asn Tyr Asn Trp Tyr Phe Ser
115 120 125 115 120 125
Leu Asp Arg Ser Gly Arg Pro Leu Val Glu Pro Tyr Asn Ile Phe Ser Leu Asp Arg Ser Gly Arg Pro Leu Val Glu Pro Tyr Asn Ile Phe Ser
130 135 140 130 135 140
Tyr Thr Phe Ala Thr Met Ala Phe Gly Gln Leu Ser Leu Thr Thr Gly Tyr Thr Phe Ala Thr Met Ala Phe Gly Gln Leu Ser Leu Thr Thr Gly
145 150 155 160 145 150 155 160
Asn Gln Glu Tyr Ala Asp Ile Ala Lys Lys Thr Phe Asp Ile Ile Leu Asn Gln Glu Tyr Ala Asp Ile Ala Lys Lys Thr Phe Asp Ile Ile Leu
165 170 175 165 170 175
Ser Lys Val Asp Asn Pro Lys Gly Arg Trp Asn Lys Leu His Pro Gly Ser Lys Val Asp Asn Pro Lys Gly Arg Trp Asn Lys Leu His Pro Gly
180 185 190 180 185 190
Thr Arg Asn Leu Lys Asn Phe Ala Leu Pro Met Ile Leu Cys Asn Leu Thr Arg Asn Leu Lys Asn Phe Ala Leu Pro Met Ile Leu Cys Asn Leu
195 200 205 195 200 205
Ala Leu Glu Ile Glu His Leu Leu Asp Glu Thr Tyr Leu Arg Glu Thr Ala Leu Glu Ile Glu His Leu Leu Asp Glu Thr Tyr Leu Arg Glu Thr
210 215 220 210 215 220
Met Asp Thr Cys Ile His Glu Val Met Glu Val Phe Tyr Arg Pro Glu Met Asp Thr Cys Ile His Glu Val Met Glu Val Phe Tyr Arg Pro Glu
225 230 235 240 225 230 235 240
Leu Gly Gly Ile Ile Val Glu Asn Val Asp Ile Asp Gly Asn Leu Val Leu Gly Gly Ile Ile Val Glu Asn Val Asp Ile Asp Gly Asn Leu Val
245 250 255 245 250 255
Asp Cys Phe Glu Gly Arg Gln Val Thr Pro Gly His Ala Ile Glu Ala Asp Cys Phe Glu Gly Arg Gln Val Thr Pro Gly His Ala Ile Glu Ala
260 265 270 260 265 270
Met Trp Phe Ile Met Asp Leu Gly Lys Arg Leu Asn Arg Pro Glu Leu Met Trp Phe Ile Met Asp Leu Gly Lys Arg Leu Asn Arg Pro Glu Leu
275 280 285 275 280 285
Ile Glu Lys Ala Lys Glu Thr Thr Leu Thr Met Leu Asn Tyr Gly Trp Ile Glu Lys Ala Lys Glu Thr Thr Leu Thr Met Leu Asn Tyr Gly Trp
290 295 300 290 295 300
Asp Lys Gln Tyr Gly Gly Ile Tyr Tyr Phe Met Asp Arg Asn Gly Cys Asp Lys Gln Tyr Gly Gly Ile Tyr Tyr Phe Met Asp Arg Asn Gly Cys
305 310 315 320 305 310 315 320
Pro Pro Gln Gln Leu Glu Trp Asp Gln Lys Leu Trp Trp Val His Ile Pro Pro Gln Gln Leu Glu Trp Asp Gln Lys Leu Trp Trp Val His Ile
325 330 335 325 330 335
Glu Thr Leu Ile Ser Leu Leu Lys Gly Tyr Gln Leu Thr Gly Asp Lys Glu Thr Leu Ile Ser Leu Leu Lys Gly Tyr Gln Leu Thr Gly Asp Lys
340 345 350 340 345 350
Lys Cys Leu Glu Trp Phe Glu Lys Val His Asp Tyr Thr Trp Glu His Lys Cys Leu Glu Trp Phe Glu Lys Val His Asp Tyr Thr Trp Glu His
355 360 365 355 360 365
Phe Lys Asp Lys Glu Tyr Pro Glu Trp Tyr Gly Tyr Leu Asn Arg Arg Phe Lys Asp Lys Glu Tyr Pro Glu Trp Tyr Gly Tyr Leu Asn Arg Arg
370 375 380 370 375 380
Gly Glu Val Leu Leu Pro Leu Lys Gly Gly Lys Trp Lys Gly Cys Phe Gly Glu Val Leu Leu Pro Leu Lys Gly Gly Lys Trp Lys Gly Cys Phe
385 390 395 400 385 390 395 400
His Val Pro Arg Gly Leu Tyr Gln Cys Trp Lys Thr Leu Glu Glu Ile His Val Pro Arg Gly Leu Tyr Gln Cys Trp Lys Thr Leu Glu Glu Ile
405 410 415 405 410 415
Lys Asn Ile Val Ser Lys Asn Ile Val Ser
420 420
<210> 84<210> 84
<211> 391<211> 391
<212> Белок<212> Protein
<213> Synechocystis sp.<213> Synechocystis sp.
<400> 84<400> 84
Met Ile Ala His Arg Arg Gln Glu Leu Ala Gln Gln Tyr Tyr Gln Ala Met Ile Ala His Arg Arg Gln Glu Leu Ala Gln Gln Tyr Tyr Gln Ala
1 5 10 15 1 5 10 15
Leu His Gln Asp Val Leu Pro Phe Trp Glu Lys Tyr Ser Leu Asp Arg Leu His Gln Asp Val Leu Pro Phe Trp Glu Lys Tyr Ser Leu Asp Arg
20 25 30 20 25 30
Gln Gly Gly Gly Tyr Phe Thr Cys Leu Asp Arg Lys Gly Gln Val Phe Gln Gly Gly Gly Tyr Phe Thr Cys Leu Asp Arg Lys Gly Gln Val Phe
35 40 45 35 40 45
Asp Thr Asp Lys Phe Ile Trp Leu Gln Asn Arg Gln Val Trp Gln Phe Asp Thr Asp Lys Phe Ile Trp Leu Gln Asn Arg Gln Val Trp Gln Phe
50 55 60 50 55 60
Ala Val Phe Tyr Asn Arg Leu Glu Pro Lys Pro Gln Trp Leu Glu Ile Ala Val Phe Tyr Asn Arg Leu Glu Pro Lys Pro Gln Trp Leu Glu Ile
65 70 75 80 65 70 75 80
Ala Arg His Gly Ala Asp Phe Leu Ala Arg His Gly Arg Asp Gln Asp Ala Arg His Gly Ala Asp Phe Leu Ala Arg His Gly Arg Asp Gln Asp
85 90 95 85 90 95
Gly Asn Trp Tyr Phe Ala Leu Asp Gln Glu Gly Lys Pro Leu Arg Gln Gly Asn Trp Tyr Phe Ala Leu Asp Gln Glu Gly Lys Pro Leu Arg Gln
100 105 110 100 105 110
Pro Tyr Asn Val Phe Ser Asp Cys Phe Ala Ala Met Ala Phe Ser Gln Pro Tyr Asn Val Phe Ser Asp Cys Phe Ala Ala Met Ala Phe Ser Gln
115 120 125 115 120 125
Tyr Ala Leu Ala Ser Gly Ala Gln Glu Ala Lys Ala Ile Ala Leu Gln Tyr Ala Leu Ala Ser Gly Ala Gln Glu Ala Lys Ala Ile Ala Leu Gln
130 135 140 130 135 140
Ala Tyr Asn Asn Val Leu Arg Arg Gln His Asn Pro Lys Gly Gln Tyr Ala Tyr Asn Asn Val Leu Arg Arg Gln His Asn Pro Lys Gly Gln Tyr
145 150 155 160 145 150 155 160
Glu Lys Ser Tyr Pro Gly Thr Arg Pro Leu Lys Ser Leu Ala Val Pro Glu Lys Ser Tyr Pro Gly Thr Arg Pro Leu Lys Ser Leu Ala Val Pro
165 170 175 165 170 175
Met Ile Leu Ala Asn Leu Thr Leu Glu Met Glu Trp Leu Leu Pro Pro Met Ile Leu Ala Asn Leu Thr Leu Glu Met Glu Trp Leu Leu Pro Pro
180 185 190 180 185 190
Thr Thr Val Glu Glu Val Leu Ala Gln Thr Val Arg Glu Val Met Thr Thr Thr Val Glu Glu Val Leu Ala Gln Thr Val Arg Glu Val Met Thr
195 200 205 195 200 205
Asp Phe Leu Asp Pro Glu Ile Gly Leu Met Arg Glu Ala Val Thr Pro Asp Phe Leu Asp Pro Glu Ile Gly Leu Met Arg Glu Ala Val Thr Pro
210 215 220 210 215 220
Thr Gly Glu Phe Val Asp Ser Phe Glu Gly Arg Leu Leu Asn Pro Gly Thr Gly Glu Phe Val Asp Ser Phe Glu Gly Arg Leu Leu Asn Pro Gly
225 230 235 240 225 230 235 240
His Gly Ile Glu Ala Met Trp Phe Met Met Asp Ile Ala Gln Arg Ser His Gly Ile Glu Ala Met Trp Phe Met Met Asp Ile Ala Gln Arg Ser
245 250 255 245 250 255
Gly Asp Arg Gln Leu Gln Glu Gln Ala Ile Ala Val Val Leu Asn Thr Gly Asp Arg Gln Leu Gln Glu Gln Ala Ile Ala Val Val Leu Asn Thr
260 265 270 260 265 270
Leu Glu Tyr Ala Trp Asp Glu Glu Phe Gly Gly Ile Phe Tyr Phe Leu Leu Glu Tyr Ala Trp Asp Glu Glu Phe Gly Gly Ile Phe Tyr Phe Leu
275 280 285 275 280 285
Asp Arg Gln Gly His Pro Pro Gln Gln Leu Glu Trp Asp Gln Lys Leu Asp Arg Gln Gly His Pro Pro Gln Gln Leu Glu Trp Asp Gln Lys Leu
290 295 300 290 295 300
Trp Trp Val His Leu Glu Thr Leu Val Ala Leu Ala Lys Gly His Gln Trp Trp Val His Leu Glu Thr Leu Val Ala Leu Ala Lys Gly His Gln
305 310 315 320 305 310 315 320
Ala Thr Gly Gln Glu Lys Cys Trp Gln Trp Phe Glu Arg Val His Asp Ala Thr Gly Gln Glu Lys Cys Trp Gln Trp Phe Glu Arg Val His Asp
325 330 335 325 330 335
Tyr Ala Trp Ser His Phe Ala Asp Pro Glu Tyr Gly Glu Trp Phe Gly Tyr Ala Trp Ser His Phe Ala Asp Pro Glu Tyr Gly Glu Trp Phe Gly
340 345 350 340 345 350
Tyr Leu Asn Arg Arg Gly Glu Val Leu Leu Asn Leu Lys Gly Gly Lys Tyr Leu Asn Arg Arg Gly Glu Val Leu Leu Asn Leu Lys Gly Gly Lys
355 360 365 355 360 365
Trp Lys Gly Cys Phe His Val Pro Arg Ala Leu Trp Leu Cys Ala Glu Trp Lys Gly Cys Phe His Val Pro Arg Ala Leu Trp Leu Cys Ala Glu
370 375 380 370 375 380
Thr Leu Gln Leu Pro Val Ser Thr Leu Gln Leu Pro Val Ser
385 390 385 390
<210> 85<210> 85
<211> 1266<211> 1266
<212> ДНК<212> DNA
<213> Bacteroides ovatus<213> Bacteroides ovatus
<400> 85<400> 85
atggatagta agaataacat tggtcattca gcagacatct ctttaactgc tgaattaccc 60atggatagta agaataacat tggtcattca gcagacatct ctttaactgc tgaattaccc 60
ataccaatct ataatggaaa tacgattatg gatttcaaaa aactggcaag tctgtacaag 120ataccaatct ataatggaaa tacgattatg gatttcaaaa aactggcaag tctgtacaag 120
gatgagctcc tggacaacgt ccttcctttc tggcttgaac attcacaaga ccatgagtat 180gatgagctcc tggacaacgt ccttcctttc tggcttgaac attcacaaga ccatgagtat 180
ggtggttact tcacctgtct ggaccgtgaa ggaaaagtat tcgatacgga taagtttatt 240ggtggttact tcacctgtct ggaccgtgaa ggaaaagtat tcgatacgga taagtttatt 240
tggctgcaaa gtcgtgaggt atggatgttc tccatgcttt acaacaaagt ggagaaacgt 300tggctgcaaa gtcgtgaggt atggatgttc tccatgcttt acaacaaagt ggagaaacgt 300
caggaatggc tagactgtgc cattcagggt ggcgaatttc taaaaaaata tggacatgac 360caggaatggc tagactgtgc cattcagggt ggcgaatttc taaaaaaata tggacatgac 360
ggcaattata actggtattt ttccctcgac cgttcgggta gaccattggt agaaccgtac 420ggcaattata actggtattt ttccctcgac cgttcgggta gaccattggt agaaccgtac 420
aatatattct cgtatacatt cgctaccatg gctttcggac agttgagcct tacaaccggt 480aatatattct cgtatacatt cgctaccatg gctttcggac agttgagcct tacaaccggt 480
aatcaggaat atgcggacat tgccaagaaa actttcgata taatcctttc caaagtggat 540aatcaggaat atgcggacat tgccaagaaa actttcgata taatcctttc caaagtggat 540
aatccgaaag ggagatggaa taagcttcat ccgggtaccc gtaatctgaa gaactttgcc 600aatccgaaag ggagatggaa taagcttcat ccgggtaccc gtaatctgaa gaactttgcc 600
ttgccaatga tcctctgtaa cttggcactg gagatagagc atttattgga tgaaacgtat 660ttgccaatga tcctctgtaa cttggcactg gagatagagc atttattgga tgaaacgtat 660
ctgcgggaaa caatggatac ttgtatccat gaagtgatgg aagttttcta tcgtcctgaa 720ctgcgggaaa caatggatac ttgtatccat gaagtgatgg aagttttcta tcgtcctgaa 720
ctcggaggta tcattgttga aaacgtggac atagacggta atttggtcga ttgttttgaa 780ctcggaggta tcattgttga aaacgtggac atagacggta atttggtcga ttgttttgaa 780
ggccgtcagg tgaccccggg acatgccatt gaagcgatgt ggtttatcat ggatctaggc 840ggccgtcagg tgaccccggg acatgccatt gaagcgatgt ggtttatcat ggatctaggc 840
aagcgtctga atcgtccgga attgatagag aaagccaaag agactactct cacgatgctt 900aagcgtctga atcgtccgga attgatagag aaagccaaag agactactct cacgatgctt 900
aattatggct gggacaagca atatggaggt atctactatt ttatggatcg taacggttgt 960aattatggct gggacaagca atatggaggt atctactatt ttatggatcg taacggttgt 960
cctccccaac aattggagtg ggaccagaaa ctctggtggg tccatatcga aacgcttatt 1020cctccccaac aattggagtg ggaccagaaa ctctggtggg tccatatcga aacgcttatt 1020
tccctgctga aaggctatca attgacggga gacaaaaaat gcttggaatg gtttgaaaag 1080tccctgctga aaggctatca attgacggga gacaaaaaat gcttggaatg gtttgaaaag 1080
gtacatgact acacttggga gcatttcaag gataaagaat atcctgaatg gtatggctac 1140gtacatgact acacttggga gcatttcaag gataaagaat atcctgaatg gtatggctac 1140
ttgaaccgaa gaggcgaagt attgctacca ctcaaaggag gaaaatggaa aggatgcttc 1200ttgaaccgaa gaggcgaagt attgctacca ctcaaaggag gaaaatggaa aggatgcttc 1200
catgtgccaa gaggactgta tcagtgctgg aaaacattag aagaaataaa aaatatagta 1260catgtgccaa gaggactgta tcagtgctgg aaaacattag aagaaataaa aaatatagta 1260
tcctaa 1266tcctaa 1266
<210> 86<210> 86
<211> 1176<211> 1176
<212> ДНК<212> DNA
<213> Synechocystis sp.<213> Synechocystis sp.
<400> 86<400> 86
atgattgccc atcgccgtca ggagttagcc cagcaatatt accaggcttt acaccaggac 60atgattgccc atcgccgtca ggagttagcc cagcaatatt accaggcttt acaccaggac 60
gtattgccct tttgggaaaa atattccctc gatcgccagg ggggcggtta ctttacctgc 120gtattgccct tttgggaaaa atattccctc gatcgccagg ggggcggtta ctttacctgc 120
ttagaccgta aaggccaggt ttttgacaca gataaattca tttggttaca aaaccgtcag 180ttagaccgta aaggccaggt ttttgacaca gataaattca tttggttaca aaaccgtcag 180
gtatggcagt ttgccgtttt ctacaaccgt ttggaaccaa aaccccaatg gttagaaatt 240gtatggcagt ttgccgtttt ctacaaccgt ttggaaccaa aaccccaatg gttagaaatt 240
gcccgccatg gtgctgattt tttagctcgc cacggccgag atcaagacgg taattggtat 300gcccgccatg gtgctgattt tttagctcgc cacggccgag atcaagacgg taattggtat 300
tttgctttgg atcaggaagg caaacccctg cgtcaaccct ataacgtttt ttccgattgc 360tttgctttgg atcaggaagg caaacccctg cgtcaaccct ataacgtttt ttccgattgc 360
ttcgccgcca tggcctttag tcaatatgcc ttagccagtg gggcgcagga agctaaagcc 420ttcgccgcca tggcctttag tcaatatgcc ttagccagtg gggcgcagga agctaaagcc 420
attgccctgc aggcctacaa taacgtccta cgccgtcagc acaatcccaa aggtcaatac 480attgccctgc aggcctacaa taacgtccta cgccgtcagc acaatcccaa aggtcaatac 480
gagaagtcct atccaggtac tagacccctc aaatccctgg cggtgccgat gattttagcc 540gagaagtcct atccaggtac tagacccctc aaatccctgg cggtgccgat gattttagcc 540
aacctcaccc tggagatgga atggttatta ccgcctacta ccgtggaaga ggtgttggcc 600aacctcaccc tggagatgga atggttatta ccgcctacta ccgtggaaga ggtgttggcc 600
caaaccgtca gagaagtgat gacggatttc ctcgacccag aaataggatt aatgcgggaa 660caaaccgtca gagaagtgat gacggatttc ctcgacccag aaataggatt aatgcgggaa 660
gcggtgaccc ccacaggaga atttgttgat agttttgaag ggcggttgct caacccagga 720gcggtgaccc ccacaggaga atttgttgat agttttgaag ggcggttgct caacccagga 720
cacggcattg aagccatgtg gttcatgatg gacattgccc aacgctccgg cgatcgccag 780cacggcattg aagccatgtg gttcatgatg gacattgccc aacgctccgg cgatcgccag 780
ttacaggagc aagccattgc agtggtgttg aacaccctgg aatatgcctg ggatgaagaa 840ttacaggagc aagccattgc agtggtgttg aacaccctgg aatatgcctg ggatgaagaa 840
tttggtggca tattttattt ccttgatcgc cagggccacc ctccccaaca actggaatgg 900tttggtggca tattttattt ccttgatcgc cagggccacc ctccccaaca actggaatgg 900
gaccaaaagc tctggtgggt acatttggaa accctggttg ccctagccaa gggccaccaa 960gaccaaaagc tctggtgggt acatttggaa accctggttg ccctagccaa gggccaccaa 960
gccactggcc aagaaaaatg ttggcaatgg tttgagcggg tccatgatta cgcctggagt 1020gccactggcc aagaaaaatg ttggcaatgg tttgagcggg tccatgatta cgcctggagt 1020
catttcgccg atcctgagta tggggaatgg tttggctacc tgaatcgccg gggagaggtg 1080catttcgccg atcctgagta tggggaatgg tttggctacc tgaatcgccg gggagaggtg 1080
ttactcaacc taaaaggggg gaaatggaaa gggtgcttcc acgtgccccg agctctgtgg 1140ttactcaacc taaaaggggg gaaatggaaa gggtgcttcc acgtgccccg agctctgtgg 1140
ctctgtgcgg aaactctcca acttccggtt agttaa 1176ctctgtgcgg aaactctcca acttccggtt agttaa 1176
<210> 87<210> 87
<211> 229<211> 229
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 87<400> 87
Met Ser Leu Leu Ala Gln Leu Asp Gln Lys Ile Ala Ala Asn Gly Gly Met Ser Leu Leu Ala Gln Leu Asp Gln Lys Ile Ala Ala Asn Gly Gly
1 5 10 15 1 5 10 15
Leu Ile Val Ser Cys Gln Pro Val Pro Asp Ser Pro Leu Asp Lys Pro Leu Ile Val Ser Cys Gln Pro Val Pro Asp Ser Pro Leu Asp Lys Pro
20 25 30 20 25 30
Glu Ile Val Ala Ala Met Ala Leu Ala Ala Glu Gln Ala Gly Ala Val Glu Ile Val Ala Ala Met Ala Leu Ala Ala Glu Gln Ala Gly Ala Val
35 40 45 35 40 45
Ala Ile Arg Ile Glu Gly Val Ala Asn Leu Gln Ala Thr Arg Ala Val Ala Ile Arg Ile Glu Gly Val Ala Asn Leu Gln Ala Thr Arg Ala Val
50 55 60 50 55 60
Val Ser Val Pro Ile Ile Gly Ile Val Lys Arg Asp Leu Glu Asp Ser Val Ser Val Pro Ile Ile Gly Ile Val Lys Arg Asp Leu Glu Asp Ser
65 70 75 80 65 70 75 80
Pro Val Arg Ile Thr Ala Tyr Ile Glu Asp Val Asp Ala Leu Ala Gln Pro Val Arg Ile Thr Ala Tyr Ile Glu Asp Val Asp Ala Leu Ala Gln
85 90 95 85 90 95
Ala Gly Ala Asp Ile Ile Ala Ile Asp Gly Thr Asp Arg Pro Arg Pro Ala Gly Ala Asp Ile Ile Ala Ile Asp Gly Thr Asp Arg Pro Arg Pro
100 105 110 100 105 110
Val Pro Val Glu Thr Leu Leu Ala Arg Ile His His His Gly Leu Leu Val Pro Val Glu Thr Leu Leu Ala Arg Ile His His His Gly Leu Leu
115 120 125 115 120 125
Ala Met Thr Asp Cys Ser Thr Pro Glu Asp Gly Leu Ala Cys Gln Lys Ala Met Thr Asp Cys Ser Thr Pro Glu Asp Gly Leu Ala Cys Gln Lys
130 135 140 130 135 140
Leu Gly Ala Glu Ile Ile Gly Thr Thr Leu Ser Gly Tyr Thr Thr Pro Leu Gly Ala Glu Ile Ile Gly Thr Thr Leu Ser Gly Tyr Thr Thr Pro
145 150 155 160 145 150 155 160
Glu Thr Pro Glu Glu Pro Asp Leu Ala Leu Val Lys Thr Leu Ser Asp Glu Thr Pro Glu Glu Pro Asp Leu Ala Leu Val Lys Thr Leu Ser Asp
165 170 175 165 170 175
Ala Gly Cys Arg Val Ile Ala Glu Gly Arg Tyr Asn Thr Pro Ala Gln Ala Gly Cys Arg Val Ile Ala Glu Gly Arg Tyr Asn Thr Pro Ala Gln
180 185 190 180 185 190
Ala Ala Asp Ala Met Arg His Gly Ala Trp Ala Val Thr Val Gly Ser Ala Ala Asp Ala Met Arg His Gly Ala Trp Ala Val Thr Val Gly Ser
195 200 205 195 200 205
Ala Ile Thr Arg Leu Glu His Ile Cys Gln Trp Tyr Asn Thr Ala Met Ala Ile Thr Arg Leu Glu His Ile Cys Gln Trp Tyr Asn Thr Ala Met
210 215 220 210 215 220
Lys Lys Ala Val Leu Lys Lys Ala Val Leu
225 225
<210> 88<210> 88
<211> 690<211> 690
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 88<400> 88
atgtcgttac ttgcacaact ggatcaaaaa atcgctgcta acggtggcct gattgtctcc 60atgtcgttac ttgcacaact ggatcaaaaa atcgctgcta acggtggcct gattgtctcc 60
tgccagccgg ttccggacag cccgctcgat aaacccgaaa tcgtcgccgc catggcatta 120tgccagccgg ttccggacag cccgctcgat aaacccgaaa tcgtcgccgc catggcatta 120
gcggcagaac aggcgggcgc ggttgccatt cgcattgaag gtgtggcaaa tctgcaagcc 180gcggcagaac aggcgggcgc ggttgccatt cgcattgaag gtgtggcaaa tctgcaagcc 180
acgcgtgcgg tggtgagcgt gccgattatt ggaattgtga aacgcgatct ggaggattct 240acgcgtgcgg tggtgagcgt gccgattatt ggaattgtga aacgcgatct ggaggattct 240
ccggtacgca tcacggccta tattgaagat gttgatgcgc tggcgcaggc gggcgcggac 300ccggtacgca tcacggccta tattgaagat gttgatgcgc tggcgcaggc gggcgcggac 300
attatcgcca ttgacggcac cgaccgcccg cgtccggtgc ctgttgaaac gctgctggca 360attatcgcca ttgacggcac cgaccgcccg cgtccggtgc ctgttgaaac gctgctggca 360
cgtattcacc atcacggttt actggcgatg accgactgct caacgccgga agacggcctg 420cgtattcacc atcacggttt actggcgatg accgactgct caacgccgga agacggcctg 420
gcatgccaaa agctgggagc cgaaattatt ggcactacgc tttctggcta taccacgcct 480gcatgccaaa agctgggagc cgaaattatt ggcactacgc tttctggcta taccacgcct 480
gaaacgccag aagagccgga tctggcgctg gtgaaaacgt tgagcgacgc cggatgtcgg 540gaaacgccag aagagccgga tctggcgctg gtgaaaacgt tgagcgacgc cggatgtcgg 540
gtgattgccg aagggcgtta caacacgcct gctcaggcgg cggatgcgat gcgccacggc 600gtgattgccg aagggcgtta caacacgcct gctcaggcgg cggatgcgat gcgccacggc 600
gcgtgggcgg tgacggtcgg ttctgcaatc acgcgtcttg agcacatttg tcagtggtac 660gcgtgggcgg tgacggtcgg ttctgcaatc acgcgtcttg agcacatttg tcagtggtac 660
aacacagcga tgaaaaaggc ggtgctatga 690aacacagcga tgaaaaaggc ggtgctatga 690
<210> 89<210> 89
<211> 346<211> 346
<212> Белок<212> Protein
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 89<400> 89
Met Lys Glu Ile Lys Ile Gln Asn Ile Ile Ile Ser Glu Glu Lys Ala Met Lys Glu Ile Lys Ile Gln Asn Ile Ile Ile Ser Glu Glu Lys Ala
1 5 10 15 1 5 10 15
Pro Leu Val Val Pro Glu Ile Gly Ile Asn His Asn Gly Ser Leu Glu Pro Leu Val Val Pro Glu Ile Gly Ile Asn His Asn Gly Ser Leu Glu
20 25 30 20 25 30
Leu Ala Lys Ile Met Val Asp Ala Ala Phe Ser Ala Gly Ala Lys Ile Leu Ala Lys Ile Met Val Asp Ala Ala Phe Ser Ala Gly Ala Lys Ile
35 40 45 35 40 45
Ile Lys His Gln Thr His Ile Val Glu Asp Glu Met Ser Lys Ala Ala Ile Lys His Gln Thr His Ile Val Glu Asp Glu Met Ser Lys Ala Ala
50 55 60 50 55 60
Lys Lys Val Ile Pro Gly Asn Ala Lys Ile Ser Ile Tyr Glu Ile Met Lys Lys Val Ile Pro Gly Asn Ala Lys Ile Ser Ile Tyr Glu Ile Met
65 70 75 80 65 70 75 80
Gln Lys Cys Ala Leu Asp Tyr Lys Asp Glu Leu Ala Leu Lys Glu Tyr Gln Lys Cys Ala Leu Asp Tyr Lys Asp Glu Leu Ala Leu Lys Glu Tyr
85 90 95 85 90 95
Thr Glu Lys Leu Gly Leu Val Tyr Leu Ser Thr Pro Phe Ser Arg Ala Thr Glu Lys Leu Gly Leu Val Tyr Leu Ser Thr Pro Phe Ser Arg Ala
100 105 110 100 105 110
Gly Ala Asn Arg Leu Glu Asp Met Gly Val Ser Ala Phe Lys Ile Gly Gly Ala Asn Arg Leu Glu Asp Met Gly Val Ser Ala Phe Lys Ile Gly
115 120 125 115 120 125
Ser Gly Glu Cys Asn Asn Tyr Pro Leu Ile Lys His Ile Ala Ala Phe Ser Gly Glu Cys Asn Asn Tyr Pro Leu Ile Lys His Ile Ala Ala Phe
130 135 140 130 135 140
Lys Lys Pro Met Ile Val Ser Thr Gly Met Asn Ser Ile Glu Ser Ile Lys Lys Pro Met Ile Val Ser Thr Gly Met Asn Ser Ile Glu Ser Ile
145 150 155 160 145 150 155 160
Lys Pro Thr Val Lys Ile Leu Leu Asp Asn Glu Ile Pro Phe Val Leu Lys Pro Thr Val Lys Ile Leu Leu Asp Asn Glu Ile Pro Phe Val Leu
165 170 175 165 170 175
Met His Thr Thr Asn Leu Tyr Pro Thr Pro His Asn Leu Val Arg Leu Met His Thr Thr Asn Leu Tyr Pro Thr Pro His Asn Leu Val Arg Leu
180 185 190 180 185 190
Asn Ala Met Leu Glu Leu Lys Lys Glu Phe Ser Cys Met Val Gly Leu Asn Ala Met Leu Glu Leu Lys Lys Glu Phe Ser Cys Met Val Gly Leu
195 200 205 195 200 205
Ser Asp His Thr Thr Asp Asn Leu Ala Cys Leu Gly Ala Val Val Leu Ser Asp His Thr Thr Asp Asn Leu Ala Cys Leu Gly Ala Val Val Leu
210 215 220 210 215 220
Gly Ala Cys Val Leu Glu Arg His Phe Thr Asp Ser Met His Arg Ser Gly Ala Cys Val Leu Glu Arg His Phe Thr Asp Ser Met His Arg Ser
225 230 235 240 225 230 235 240
Gly Pro Asp Ile Val Cys Ser Met Asp Thr Lys Ala Leu Lys Glu Leu Gly Pro Asp Ile Val Cys Ser Met Asp Thr Lys Ala Leu Lys Glu Leu
245 250 255 245 250 255
Ile Ile Gln Ser Glu Gln Met Ala Ile Ile Arg Gly Asn Asn Glu Ser Ile Ile Gln Ser Glu Gln Met Ala Ile Ile Arg Gly Asn Asn Glu Ser
260 265 270 260 265 270
Lys Lys Ala Ala Lys Gln Glu Gln Val Thr Ile Asp Phe Ala Phe Ala Lys Lys Ala Ala Lys Gln Glu Gln Val Thr Ile Asp Phe Ala Phe Ala
275 280 285 275 280 285
Ser Val Val Ser Ile Lys Asp Ile Lys Lys Gly Glu Val Leu Ser Met Ser Val Val Ser Ile Lys Asp Ile Lys Lys Gly Glu Val Leu Ser Met
290 295 300 290 295 300
Asp Asn Ile Trp Val Lys Arg Pro Gly Leu Gly Gly Ile Ser Ala Ala Asp Asn Ile Trp Val Lys Arg Pro Gly Leu Gly Gly Ile Ser Ala Ala
305 310 315 320 305 310 315 320
Glu Phe Glu Asn Ile Leu Gly Lys Lys Ala Leu Arg Asp Ile Glu Asn Glu Phe Glu Asn Ile Leu Gly Lys Lys Ala Leu Arg Asp Ile Glu Asn
325 330 335 325 330 335
Asp Ala Gln Leu Ser Tyr Glu Asp Phe Ala Asp Ala Gln Leu Ser Tyr Glu Asp Phe Ala
340 345 340 345
<210> 90<210> 90
<211> 1041<211> 1041
<212> ДНК<212> DNA
<213> Campylobacter jejuni<213> Campylobacter jejuni
<400> 90<400> 90
atgaaagaaa taaaaataca aaatataatc ataagtgaag aaaaagcacc cttagtcgtg 60atgaaagaaa taaaaataca aaatataatc ataagtgaag aaaaagcacc cttagtcgtg 60
cctgaaatag gcattaatca taatggcagt ttagaactag ctaaaattat ggtagatgca 120cctgaaatag gcattaatca taatggcagt ttagaactag ctaaaattat ggtagatgca 120
gcctttagcg caggtgctaa gattataaag catcaaaccc acatcgttga agatgagatg 180gcctttagcg caggtgctaa gattataaag catcaaaccc acatcgttga agatgagatg 180
agtaaggccg ctaaaaaagt aattcctggt aatgcaaaaa taagcattta tgagattatg 240agtaaggccg ctaaaaaagt aattcctggt aatgcaaaaa taagcattta tgagattatg 240
caaaaatgtg ctttagatta taaagatgag ctagcactta aagaatacac agaaaaatta 300caaaaatgtg ctttagatta taaagatgag ctagcactta aagaatacac agaaaaatta 300
ggtcttgttt atcttagcac acctttttct cgtgcaggtg caaaccgctt agaagatatg 360ggtcttgttt atcttagcac acctttttct cgtgcaggtg caaaccgctt agaagatatg 360
ggagttagtg cttttaagat tggttcaggt gagtgtaata attatccgct tattaaacac 420ggagttagtg cttttaagat tggttcaggt gagtgtaata attatccgct tattaaacac 420
atagcagcct ttaaaaagcc tatgatagtt agcacaggaa tgaatagtat tgaaagtata 480atagcagcct ttaaaaagcc tatgatagtt agcacaggaa tgaatagtat tgaaagtata 480
aaaccaactg taaaaatctt attagacaat gaaattccct ttgttttaat gcactcgacc 540aaaccaactg taaaaatctt attagacaat gaaattccct ttgttttaat gcactcgacc 540
aatctttacc caaccccgca taatcttgta agattaaacg ctatgcttga attaaaaaaa 600aatctttacc caaccccgca taatcttgta agattaaacg ctatgcttga attaaaaaaa 600
gaattttctt gcatggtagg cttaagcgac cacacaacag ataatcttgc gtgtttaggt 660gaattttctt gcatggtagg cttaagcgac cacacaacag ataatcttgc gtgtttaggt 660
gcggttgcac ttggtgcttg tgtgcttgaa agacatttta ctgatagtat gcatagaagt 720gcggttgcac ttggtgcttg tgtgcttgaa agacatttta ctgatagtat gcatagaagt 720
ggccctgata tagtttgttc tatggataca aaggctttaa aagagctaat tatccaaagt 780ggccctgata tagtttgttc tatggataca aaggctttaa aagagctaat tatccaaagt 780
gagcaaatgg ctataatgaa aggaaataat gaaagcaaaa aagcagctaa gcaagaacaa 840gagcaaatgg ctataatgaa aggaaataat gaaagcaaaa aagcagctaa gcaagaacaa 840
gttacaattg attttgcctt tgcaagcgta gttagcatta aagatattaa aaaaggcgaa 900gttacaattg attttgcctt tgcaagcgta gttagcatta aagatattaa aaaaggcgaa 900
gttttatcta tggacaatat ctgggttaaa agacctggac ttggtggaat tagtgcggct 960gttttatcta tggacaatat ctgggttaaa agacctggac ttggtggaat tagtgcggct 960
gaatttgaaa atattttagg caaaaaagca ttaagagata tagaaaatga tactcagtta 1020gaatttgaaa atattttagg caaaaaagca ttaagagata tagaaaatga tactcagtta 1020
agctatgagg attttgcgtg a 1041agctatgagg attttgcgtg a 1041
<210> 91<210> 91
<211> 221<211> 221
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 91<400> 91
Met Ser Leu Ala Ile Ile Pro Ala Arg Gly Gly Ser Lys Gly Ile Lys Met Ser Leu Ala Ile Ile Pro Ala Arg Gly Gly Ser Lys Gly Ile Lys
1 5 10 15 1 5 10 15
Asn Lys Asn Leu Val Leu Leu Asn Asn Lys Pro Leu Ile Tyr Tyr Thr Asn Lys Asn Leu Val Leu Leu Asn Asn Lys Pro Leu Ile Tyr Tyr Thr
20 25 30 20 25 30
Ile Lys Ala Ala Leu Asn Ala Lys Ser Ile Ser Lys Val Val Val Ser Ile Lys Ala Ala Leu Asn Ala Lys Ser Ile Ser Lys Val Val Val Ser
35 40 45 35 40 45
Ser Asp Ser Asp Glu Ile Leu Asn Tyr Ala Lys Ser Gln Asn Val Asp Ser Asp Ser Asp Glu Ile Leu Asn Tyr Ala Lys Ser Gln Asn Val Asp
50 55 60 50 55 60
Ile Leu Lys Arg Pro Ile Ser Leu Ala Gln Asp Asp Thr Thr Ser Asp Ile Leu Lys Arg Pro Ile Ser Leu Ala Gln Asp Asp Thr Thr Ser Asp
65 70 75 80 65 70 75 80
Lys Val Leu Leu His Ala Leu Lys Phe Tyr Lys Asp Tyr Glu Asp Val Lys Val Leu Leu His Ala Leu Lys Phe Tyr Lys Asp Tyr Glu Asp Val
85 90 95 85 90 95
Val Phe Leu Gln Pro Thr Ser Pro Leu Arg Thr Asn Ile His Ile Asn Val Phe Leu Gln Pro Thr Ser Pro Leu Arg Thr Asn Ile His Ile Asn
100 105 110 100 105 110
Glu Ala Phe Asn Leu Tyr Lys Asn Ser Asn Ala Asn Ala Leu Ile Ser Glu Ala Phe Asn Leu Tyr Lys Asn Ser Asn Ala Asn Ala Leu Ile Ser
115 120 125 115 120 125
Val Ser Glu Cys Asp Asn Lys Ile Leu Lys Ala Phe Val Cys Asn Asp Val Ser Glu Cys Asp Asn Lys Ile Leu Lys Ala Phe Val Cys Asn Asp
130 135 140 130 135 140
Cys Gly Asp Leu Ala Gly Ile Cys Asn Asp Glu Tyr Pro Phe Met Pro Cys Gly Asp Leu Ala Gly Ile Cys Asn Asp Glu Tyr Pro Phe Met Pro
145 150 155 160 145 150 155 160
Arg Gln Lys Leu Pro Lys Thr Tyr Met Ser Asn Gly Ala Ile Tyr Ile Arg Gln Lys Leu Pro Lys Thr Tyr Met Ser Asn Gly Ala Ile Tyr Ile
165 170 175 165 170 175
Leu Lys Ile Lys Glu Phe Leu Asn Asn Pro Ser Phe Leu Gln Ser Lys Leu Lys Ile Lys Glu Phe Leu Asn Asn Pro Ser Phe Leu Gln Ser Lys
180 185 190 180 185 190
Thr Lys His Phe Leu Met Asp Glu Ser Ser Ser Leu Asp Ile Asp Cys Thr Lys His Phe Leu Met Asp Glu Ser Ser Ser Leu Asp Ile Asp Cys
195 200 205 195 200 205
Leu Glu Asp Leu Lys Lys Val Glu Gln Ile Trp Lys Lys Leu Glu Asp Leu Lys Lys Val Glu Gln Ile Trp Lys Lys
210 215 220 210 215 220
<210> 92<210> 92
<211> 666<211> 666
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 92<400> 92
atgagcctgg ccattatccc ggcacgtggc ggttctaaag gcatcaaaaa caaaaacctg 60atgagcctgg ccattatccc ggcacgtggc ggttctaaag gcatcaaaaa caaaaacctg 60
gttctgctga acaataaacc gctgatttat tacaccatca aagcggccct gaacgccaaa 120gttctgctga acaataaacc gctgatttat tacaccatca aagcggccct gaacgccaaa 120
agtattagca aagtggttgt gagctctgat tctgatgaaa tcctgaacta cgcaaaaagt 180agtattagca aagtggttgt gagctctgat tctgatgaaa tcctgaacta cgcaaaaagt 180
cagaacgttg atatcctgaa acgtccgatc agtctggcac aggatgatac cacgagcgat 240cagaacgttg atatcctgaa acgtccgatc agtctggcac aggatgatac cacgagcgat 240
aaagtgctgc tgcatgcgct gaaattctac aaagattacg aagatgttgt gttcctgcag 300aaagtgctgc tgcatgcgct gaaattctac aaagattacg aagatgttgt gttcctgcag 300
ccgaccagcc cgctgcgtac gaatattcac atcaacgaag cgttcaacct gtacaaaaac 360ccgaccagcc cgctgcgtac gaatattcac atcaacgaag cgttcaacct gtacaaaaac 360
agcaacgcaa acgcgctgat ttctgttagt gaatgcgata acaaaatcct gaaagcgttt 420agcaacgcaa acgcgctgat ttctgttagt gaatgcgata acaaaatcct gaaagcgttt 420
gtgtgcaatg attgtggcga tctggccggt atttgtaacg atgaataccc gttcatgccg 480gtgtgcaatg attgtggcga tctggccggt atttgtaacg atgaataccc gttcatgccg 480
cgccagaaac tgccgaaaac ctatatgagc aatggtgcca tctacatcct gaaaatcaaa 540cgccagaaac tgccgaaaac ctatatgagc aatggtgcca tctacatcct gaaaatcaaa 540
gaattcctga acaacccgag cttcctgcag tctaaaacga aacatttcct gatggatgaa 600gaattcctga acaacccgag cttcctgcag tctaaaacga aacatttcct gatggatgaa 600
agtagctctc tggatattga ttgcctggaa gatctgaaaa aagtggaaca gatctggaaa 660agtagctctc tggatattga ttgcctggaa gatctgaaaa aagtggaaca gatctggaaa 660
aaataa 666aaataa 666
<210> 93<210> 93
<211> 417<211> 417
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 93<400> 93
Met Tyr Tyr Leu Lys Asn Thr Asn Phe Trp Met Phe Gly Leu Phe Phe Met Tyr Tyr Leu Lys Asn Thr Asn Phe Trp Met Phe Gly Leu Phe Phe
1 5 10 15 1 5 10 15
Phe Phe Tyr Phe Phe Ile Met Gly Ala Tyr Phe Pro Phe Phe Pro Ile Phe Phe Tyr Phe Phe Ile Met Gly Ala Tyr Phe Pro Phe Phe Pro Ile
20 25 30 20 25 30
Trp Leu His Asp Ile Asn His Ile Ser Lys Ser Asp Thr Gly Ile Ile Trp Leu His Asp Ile Asn His Ile Ser Lys Ser Asp Thr Gly Ile Ile
35 40 45 35 40 45
Phe Ala Ala Ile Ser Leu Phe Ser Leu Leu Phe Gln Pro Leu Phe Gly Phe Ala Ala Ile Ser Leu Phe Ser Leu Leu Phe Gln Pro Leu Phe Gly
50 55 60 50 55 60
Leu Leu Ser Asp Lys Leu Gly Leu Arg Lys Tyr Leu Leu Trp Ile Ile Leu Leu Ser Asp Lys Leu Gly Leu Arg Lys Tyr Leu Leu Trp Ile Ile
65 70 75 80 65 70 75 80
Thr Gly Met Leu Val Met Phe Ala Pro Phe Phe Ile Phe Ile Phe Gly Thr Gly Met Leu Val Met Phe Ala Pro Phe Phe Ile Phe Ile Phe Gly
85 90 95 85 90 95
Pro Leu Leu Gln Tyr Asn Ile Leu Val Gly Ser Ile Val Gly Gly Ile Pro Leu Leu Gln Tyr Asn Ile Leu Val Gly Ser Ile Val Gly Gly Ile
100 105 110 100 105 110
Tyr Leu Gly Phe Cys Phe Asn Ala Gly Ala Pro Ala Val Glu Ala Phe Tyr Leu Gly Phe Cys Phe Asn Ala Gly Ala Pro Ala Val Glu Ala Phe
115 120 125 115 120 125
Ile Glu Lys Val Ser Arg Arg Ser Asn Phe Glu Phe Gly Arg Ala Arg Ile Glu Lys Val Ser Arg Arg Ser Asn Phe Glu Phe Gly Arg Ala Arg
130 135 140 130 135 140
Met Phe Gly Cys Val Gly Trp Ala Leu Cys Ala Ser Ile Val Gly Ile Met Phe Gly Cys Val Gly Trp Ala Leu Cys Ala Ser Ile Val Gly Ile
145 150 155 160 145 150 155 160
Met Phe Thr Ile Asn Asn Gln Phe Val Phe Trp Leu Gly Ser Gly Cys Met Phe Thr Ile Asn Asn Gln Phe Val Phe Trp Leu Gly Ser Gly Cys
165 170 175 165 170 175
Ala Leu Ile Leu Ala Val Leu Leu Phe Phe Ala Lys Thr Asp Ala Pro Ala Leu Ile Leu Ala Val Leu Leu Phe Phe Ala Lys Thr Asp Ala Pro
180 185 190 180 185 190
Ser Ser Ala Thr Val Ala Asn Ala Val Gly Ala Asn His Ser Ala Phe Ser Ser Ala Thr Val Ala Asn Ala Val Gly Ala Asn His Ser Ala Phe
195 200 205 195 200 205
Ser Leu Lys Leu Ala Leu Glu Leu Phe Arg Gln Pro Lys Leu Trp Phe Ser Leu Lys Leu Ala Leu Glu Leu Phe Arg Gln Pro Lys Leu Trp Phe
210 215 220 210 215 220
Leu Ser Leu Tyr Val Ile Gly Val Ser Cys Thr Tyr Asp Val Phe Asp Leu Ser Leu Tyr Val Ile Gly Val Ser Cys Thr Tyr Asp Val Phe Asp
225 230 235 240 225 230 235 240
Gln Gln Phe Ala Asn Phe Phe Thr Ser Phe Phe Ala Thr Gly Glu Gln Gln Gln Phe Ala Asn Phe Phe Thr Ser Phe Phe Ala Thr Gly Glu Gln
245 250 255 245 250 255
Gly Thr Arg Val Phe Gly Tyr Val Thr Thr Met Gly Glu Leu Leu Asn Gly Thr Arg Val Phe Gly Tyr Val Thr Thr Met Gly Glu Leu Leu Asn
260 265 270 260 265 270
Ala Ser Ile Met Phe Phe Ala Pro Leu Ile Ile Asn Arg Ile Gly Gly Ala Ser Ile Met Phe Phe Ala Pro Leu Ile Ile Asn Arg Ile Gly Gly
275 280 285 275 280 285
Lys Asn Ala Leu Leu Leu Ala Gly Thr Ile Met Ser Val Arg Ile Ile Lys Asn Ala Leu Leu Leu Ala Gly Thr Ile Met Ser Val Arg Ile Ile
290 295 300 290 295 300
Gly Ser Ser Phe Ala Thr Ser Ala Leu Glu Val Val Ile Leu Lys Thr Gly Ser Ser Phe Ala Thr Ser Ala Leu Glu Val Val Ile Leu Lys Thr
305 310 315 320 305 310 315 320
Leu His Met Phe Glu Val Pro Phe Leu Leu Val Gly Cys Phe Lys Tyr Leu His Met Phe Glu Val Pro Phe Leu Leu Val Gly Cys Phe Lys Tyr
325 330 335 325 330 335
Ile Thr Ser Gln Phe Glu Val Arg Phe Ser Ala Thr Ile Tyr Leu Val Ile Thr Ser Gln Phe Glu Val Arg Phe Ser Ala Thr Ile Tyr Leu Val
340 345 350 340 345 350
Cys Phe Cys Phe Phe Lys Gln Leu Ala Met Ile Phe Met Ser Val Leu Cys Phe Cys Phe Phe Lys Gln Leu Ala Met Ile Phe Met Ser Val Leu
355 360 365 355 360 365
Ala Gly Asn Met Tyr Glu Ser Ile Gly Phe Gln Gly Ala Tyr Leu Val Ala Gly Asn Met Tyr Glu Ser Ile Gly Phe Gln Gly Ala Tyr Leu Val
370 375 380 370 375 380
Leu Gly Leu Val Ala Leu Gly Phe Thr Leu Ile Ser Val Phe Thr Leu Leu Gly Leu Val Ala Leu Gly Phe Thr Leu Ile Ser Val Phe Thr Leu
385 390 395 400 385 390 395 400
Ser Gly Pro Gly Pro Leu Ser Leu Leu Arg Arg Gln Val Asn Glu Val Ser Gly Pro Gly Pro Leu Ser Leu Leu Arg Arg Gln Val Asn Glu Val
405 410 415 405 410 415
Ala Ala
<210> 94<210> 94
<211> 1254<211> 1254
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 94<400> 94
atgtactatt taaaaaacac aaacttttgg atgttcggtt tattcttttt cttttacttt 60atgtactatt taaaaaacac aaacttttgg atgttcggtt tattcttttt ctttttttt 60
tttatcatgg gagcctactt cccgtttttc ccgatttggc tacatgacat caaccatatc 120tttatcatgg gagcctactt cccgtttttc ccgatttggc tacatgacat caaccatatc 120
agcaaaagtg atacgggtat tatttttgcc gctatttctc tgttctcgct attattccaa 180agcaaaagtg atacgggtat tatttttgcc gctatttctc tgttctcgct attattccaa 180
ccgctgtttg gtctgctttc tgacaaactc gggctgcgca aatacctgct gtggattatt 240ccgctgtttg gtctgctttc tgacaaactc gggctgcgca aatacctgct gtggattatt 240
accggcatgt tagtgatgtt tgcgccgttc tttattttta tcttcgggcc actgttacaa 300accggcatgt tagtgatgtt tgcgccgttc tttattttta tcttcgggcc actgttacaa 300
tacaacattt tagtaggatc gattgttggt ggtatttatc taggcttttg ttttaacgcc 360tacaacattt tagtaggatc gattgttggt ggtatttatc taggcttttg ttttaacgcc 360
ggtgcgccag cagtagaggc atttattgag aaagtcagcc gtcgcagtaa tttcgaattt 420ggtgcgccag cagtagaggc atttattgag aaagtcagcc gtcgcagtaa tttcgaattt 420
ggtcgcgcgc ggatgtttgg ctgtgttggc tgggcgctgt gtgcctcgat tgtcggcatc 480ggtcgcgcgc ggatgtttgg ctgtgttggc tgggcgctgt gtgcctcgat tgtcggcatc 480
atgttcacca tcaataatca gtttgttttc tggctgggct ctggctgtgc actcatcctc 540atgttcacca tcaataatca gtttgttttc tggctgggct ctggctgtgc actcatcctc 540
gccgttttac tctttttcgc caaaacggat gcgccctctt ctgccacggt tgccaatgcg 600gccgttttac tctttttcgc caaaacggat gcgccctctt ctgccacggt tgccaatgcg 600
gtaggtgcca accattcggc atttagcctt aagctggcac tggaactgtt cagacagcca 660gtaggtgcca accattcggc atttagcctt aagctggcac tggaactgtt cagacagcca 660
aaactgtggt ttttgtcact gtatgttatt ggcgtttcct gcacctacga tgtttttgac 720aaactgtggt ttttgtcact gtatgttatt ggcgtttcct gcacctacga tgtttttgac 720
caacagtttg ctaatttctt tacttcgttc tttgctaccg gtgaacaggg tacgcgggta 780caacagtttg ctaatttctt tacttcgttc tttgctaccg gtgaacaggg tacgcgggta 780
tttggctacg taacgacaat gggcgaatta cttaacgcct cgattatgtt ctttgcgcca 840tttggctacg taacgacaat gggcgaatta cttaacgcct cgattatgtt ctttgcgcca 840
ctgatcatta atcgcatcgg tgggaaaaac gccctgctgc tggctggcac tattatgtct 900ctgatcatta atcgcatcgg tgggaaaaac gccctgctgc tggctggcac tattatgtct 900
gtacgtatta ttggctcatc gttcgccacc tcagcgctgg aagtggttat tctgaaaacg 960gtacgtatta ttggctcatc gttcgccacc tcagcgctgg aagtggttat tctgaaaacg 960
ctgcatatgt ttgaagtacc gttcctgctg gtgggctgct ttaaatatat taccagccag 1020ctgcatatgt ttgaagtacc gttcctgctg gtgggctgct ttaaatatat taccagccag 1020
tttgaagtgc gtttttcagc gacgatttat ctggtctgtt tctgcttctt taagcaactg 1080tttgaagtgc gtttttcagc gacgatttat ctggtctgtt tctgcttctt taagcaactg 1080
gcgatgattt ttatgtctgt actggcgggc aatatgtatg aaagcatcgg tttccagggc 1140gcgatgattt ttatgtctgt actggcgggc aatatgtatg aaagcatcgg tttccagggc 1140
gcttatctgg tgctgggtct ggtggcgctg ggcttcacct taatttccgt gttcacgctt 1200gcttatctgg tgctgggtct ggtggcgctg ggcttcacct taatttccgt gttcacgctt 1200
agcggccccg gtccgctttc tctactgcgt cgtcaggtga atgaagtcgc ttaa 1254agcggccccg gtccgctttc tctactgcgt cgtcaggtga atgaagtcgc ttaa 1254
<210> 95<210> 95
<211> 1024<211> 1024
<212> Белок<212> Protein
<213> Escherichia coli<213> Escherichia coli
<400> 95<400> 95
Met Thr Met Ile Thr Asp Ser Leu Ala Val Val Leu Gln Arg Arg Asp Met Thr Met Ile Thr Asp Ser Leu Ala Val Val Leu Gln Arg Arg Asp
1 5 10 15 1 5 10 15
Trp Glu Asn Pro Gly Val Thr Gln Leu Asn Arg Leu Ala Ala His Pro Trp Glu Asn Pro Gly Val Thr Gln Leu Asn Arg Leu Ala Ala His Pro
20 25 30 20 25 30
Pro Phe Ala Ser Trp Arg Asn Ser Glu Glu Ala Arg Thr Asp Arg Pro Pro Phe Ala Ser Trp Arg Asn Ser Glu Glu Ala Arg Thr Asp Arg Pro
35 40 45 35 40 45
Ser Gln Gln Leu Arg Ser Leu Asn Gly Glu Trp Arg Phe Ala Trp Phe Ser Gln Gln Leu Arg Ser Leu Asn Gly Glu Trp Arg Phe Ala Trp Phe
50 55 60 50 55 60
Pro Ala Pro Glu Ala Val Pro Glu Ser Trp Leu Glu Cys Asp Leu Pro Pro Ala Pro Glu Ala Val Pro Glu Ser Trp Leu Glu Cys Asp Leu Pro
65 70 75 80 65 70 75 80
Glu Ala Asp Thr Val Val Val Pro Ser Asn Trp Gln Met His Gly Tyr Glu Ala Asp Thr Val Val Val Pro Ser Asn Trp Gln Met His Gly Tyr
85 90 95 85 90 95
Asp Ala Pro Ile Tyr Thr Asn Val Thr Tyr Pro Ile Thr Val Asn Pro Asp Ala Pro Ile Tyr Thr Asn Val Thr Tyr Pro Ile Thr Val Asn Pro
100 105 110 100 105 110
Pro Phe Val Pro Thr Glu Asn Pro Thr Gly Cys Tyr Ser Leu Thr Phe Pro Phe Val Pro Thr Glu Asn Pro Thr Gly Cys Tyr Ser Leu Thr Phe
115 120 125 115 120 125
Asn Val Asp Glu Ser Trp Leu Gln Glu Gly Gln Thr Arg Ile Ile Phe Asn Val Asp Glu Ser Trp Leu Gln Glu Gly Gln Thr Arg Ile Ile Phe
130 135 140 130 135 140
Asp Gly Val Asn Ser Ala Phe His Leu Trp Cys Asn Gly Arg Trp Val Asp Gly Val Asn Ser Ala Phe His Leu Trp Cys Asn Gly Arg Trp Val
145 150 155 160 145 150 155 160
Gly Tyr Gly Gln Asp Ser Arg Leu Pro Ser Glu Phe Asp Leu Ser Ala Gly Tyr Gly Gln Asp Ser Arg Leu Pro Ser Glu Phe Asp Leu Ser Ala
165 170 175 165 170 175
Phe Leu Arg Ala Gly Glu Asn Arg Leu Ala Val Met Val Leu Arg Trp Phe Leu Arg Ala Gly Glu Asn Arg Leu Ala Val Met Val Leu Arg Trp
180 185 190 180 185 190
Ser Asp Gly Ser Tyr Leu Glu Asp Gln Asp Met Trp Arg Met Ser Gly Ser Asp Gly Ser Tyr Leu Glu Asp Gln Asp Met Trp Arg Met Ser Gly
195 200 205 195 200 205
Ile Phe Arg Asp Val Ser Leu Leu His Lys Pro Thr Thr Gln Ile Ser Ile Phe Arg Asp Val Ser Leu Leu His Lys Pro Thr Thr Gln Ile Ser
210 215 220 210 215 220
Asp Phe His Val Ala Thr Arg Phe Asn Asp Asp Phe Ser Arg Ala Val Asp Phe His Val Ala Thr Arg Phe Asn Asp Asp Phe Ser Arg Ala Val
225 230 235 240 225 230 235 240
Leu Glu Ala Glu Val Gln Met Cys Gly Glu Leu Arg Asp Tyr Leu Arg Leu Glu Ala Glu Val Gln Met Cys Gly Glu Leu Arg Asp Tyr Leu Arg
245 250 255 245 250 255
Val Thr Val Ser Leu Trp Gln Gly Glu Thr Gln Val Ala Ser Gly Thr Val Thr Val Ser Leu Trp Gln Gly Glu Thr Gln Val Ala Ser Gly Thr
260 265 270 260 265 270
Ala Pro Phe Gly Gly Glu Ile Ile Asp Glu Arg Gly Gly Tyr Ala Asp Ala Pro Phe Gly Gly Glu Ile Ile Asp Glu Arg Gly Gly Tyr Ala Asp
275 280 285 275 280 285
Arg Val Thr Leu Arg Leu Asn Val Glu Asn Pro Lys Leu Trp Ser Ala Arg Val Thr Leu Arg Leu Asn Val Glu Asn Pro Lys Leu Trp Ser Ala
290 295 300 290 295 300
Glu Ile Pro Asn Leu Tyr Arg Ala Val Val Glu Leu His Thr Ala Asp Glu Ile Pro Asn Leu Tyr Arg Ala Val Val Glu Leu His Thr Ala Asp
305 310 315 320 305 310 315 320
Gly Thr Leu Ile Glu Ala Glu Ala Cys Asp Val Gly Phe Arg Glu Val Gly Thr Leu Ile Glu Ala Glu Ala Cys Asp Val Gly Phe Arg Glu Val
325 330 335 325 330 335
Arg Ile Glu Asn Gly Leu Leu Leu Leu Asn Gly Lys Pro Leu Leu Ile Arg Ile Glu Asn Gly Leu Leu Leu Leu Asn Gly Lys Pro Leu Leu Ile
340 345 350 340 345 350
Arg Gly Val Asn Arg His Glu His His Pro Leu His Gly Gln Val Met Arg Gly Val Asn Arg His Glu His His Pro Leu His Gly Gln Val Met
355 360 365 355 360 365
Asp Glu Gln Thr Met Val Gln Asp Ile Leu Leu Met Lys Gln Asn Asn Asp Glu Gln Thr Met Val Gln Asp Ile Leu Leu Met Lys Gln Asn Asn
370 375 380 370 375 380
Phe Asn Ala Val Arg Cys Ser His Tyr Pro Asn His Pro Leu Trp Tyr Phe Asn Ala Val Arg Cys Ser His Tyr Pro Asn His Pro Leu Trp Tyr
385 390 395 400 385 390 395 400
Thr Leu Cys Asp Arg Tyr Gly Leu Tyr Val Val Asp Glu Ala Asn Ile Thr Leu Cys Asp Arg Tyr Gly Leu Tyr Val Val Asp Glu Ala Asn Ile
405 410 415 405 410 415
Glu Thr His Gly Met Val Pro Met Asn Arg Leu Thr Asp Asp Pro Arg Glu Thr His Gly Met Val Pro Met Asn Arg Leu Thr Asp Asp Pro Arg
420 425 430 420 425 430
Trp Leu Pro Ala Met Ser Glu Arg Val Thr Arg Met Val Gln Arg Asp Trp Leu Pro Ala Met Ser Glu Arg Val Thr Arg Met Val Gln Arg Asp
435 440 445 435 440 445
Arg Asn His Pro Ser Val Ile Ile Trp Ser Leu Gly Asn Glu Ser Gly Arg Asn His Pro Ser Val Ile Ile Trp Ser Leu Gly Asn Glu Ser Gly
450 455 460 450 455 460
His Gly Ala Asn His Asp Ala Leu Tyr Arg Trp Ile Lys Ser Val Asp His Gly Ala Asn His Asp Ala Leu Tyr Arg Trp Ile Lys Ser Val Asp
465 470 475 480 465 470 475 480
Pro Ser Arg Pro Val Gln Tyr Glu Gly Gly Gly Ala Asp Thr Thr Ala Pro Ser Arg Pro Val Gln Tyr Glu Gly Gly Gly Ala Asp Thr Thr Ala
485 490 495 485 490 495
Thr Asp Ile Ile Cys Pro Met Tyr Ala Arg Val Asp Glu Asp Gln Pro Thr Asp Ile Ile Cys Pro Met Tyr Ala Arg Val Asp Glu Asp Gln Pro
500 505 510 500 505 510
Phe Pro Ala Val Pro Lys Trp Ser Ile Lys Lys Trp Leu Ser Leu Pro Phe Pro Ala Val Pro Lys Trp Ser Ile Lys Lys Trp Leu Ser Leu Pro
515 520 525 515 520 525
Gly Glu Thr Arg Pro Leu Ile Leu Cys Glu Tyr Ala His Ala Met Gly Gly Glu Thr Arg Pro Leu Ile Leu Cys Glu Tyr Ala His Ala Met Gly
530 535 540 530 535 540
Asn Ser Leu Gly Gly Phe Ala Lys Tyr Trp Gln Ala Phe Arg Gln Tyr Asn Ser Leu Gly Gly Phe Ala Lys Tyr Trp Gln Ala Phe Arg Gln Tyr
545 550 555 560 545 550 555 560
Pro Arg Leu Gln Gly Gly Phe Val Trp Asp Trp Val Asp Gln Ser Leu Pro Arg Leu Gln Gly Gly Phe Val Trp Asp Trp Val Asp Gln Ser Leu
565 570 575 565 570 575
Ile Lys Tyr Asp Glu Asn Gly Asn Pro Trp Ser Ala Tyr Gly Gly Asp Ile Lys Tyr Asp Glu Asn Gly Asn Pro Trp Ser Ala Tyr Gly Gly Asp
580 585 590 580 585 590
Phe Gly Asp Thr Pro Asn Asp Arg Gln Phe Cys Met Asn Gly Leu Val Phe Gly Asp Thr Pro Asn Asp Arg Gln Phe Cys Met Asn Gly Leu Val
595 600 605 595 600 605
Phe Ala Asp Arg Thr Pro His Pro Ala Leu Thr Glu Ala Lys His Gln Phe Ala Asp Arg Thr Pro His Pro Ala Leu Thr Glu Ala Lys His Gln
610 615 620 610 615 620
Gln Gln Phe Phe Gln Phe Arg Leu Ser Gly Gln Thr Ile Glu Val Thr Gln Gln Phe Phe Gln Phe Arg Leu Ser Gly Gln Thr Ile Glu Val Thr
625 630 635 640 625 630 635 640
Ser Glu Tyr Leu Phe Arg His Ser Asp Asn Glu Leu Leu His Trp Met Ser Glu Tyr Leu Phe Arg His Ser Asp Asn Glu Leu Leu His Trp Met
645 650 655 645 650 655
Val Ala Leu Asp Gly Lys Pro Leu Ala Ser Gly Glu Val Pro Leu Asp Val Ala Leu Asp Gly Lys Pro Leu Ala Ser Gly Glu Val Pro Leu Asp
660 665 670 660 665 670
Val Ala Pro Gln Gly Lys Gln Leu Ile Glu Leu Pro Glu Leu Pro Gln Val Ala Pro Gln Gly Lys Gln Leu Ile Glu Leu Pro Glu Leu Pro Gln
675 680 685 675 680 685
Pro Glu Ser Ala Gly Gln Leu Trp Leu Thr Val Arg Val Val Gln Pro Pro Glu Ser Ala Gly Gln Leu Trp Leu Thr Val Arg Val Val Gln Pro
690 695 700 690 695 700
Asn Ala Thr Ala Trp Ser Glu Ala Gly His Ile Ser Ala Trp Gln Gln Asn Ala Thr Ala Trp Ser Glu Ala Gly His Ile Ser Ala Trp Gln Gln
705 710 715 720 705 710 715 720
Trp Arg Leu Ala Glu Asn Leu Ser Val Thr Leu Pro Ala Ala Ser His Trp Arg Leu Ala Glu Asn Leu Ser Val Thr Leu Pro Ala Ala Ser His
725 730 735 725 730 735
Ala Ile Pro His Leu Thr Thr Ser Glu Met Asp Phe Cys Ile Glu Leu Ala Ile Pro His Leu Thr Thr Ser Glu Met Asp Phe Cys Ile Glu Leu
740 745 750 740 745 750
Gly Asn Lys Arg Trp Gln Phe Asn Arg Gln Ser Gly Phe Leu Ser Gln Gly Asn Lys Arg Trp Gln Phe Asn Arg Gln Ser Gly Phe Leu Ser Gln
755 760 765 755 760 765
Met Trp Ile Gly Asp Lys Lys Gln Leu Leu Thr Pro Leu Arg Asp Gln Met Trp Ile Gly Asp Lys Lys Gln Leu Leu Thr Pro Leu Arg Asp Gln
770 775 780 770 775 780
Phe Thr Arg Ala Pro Leu Asp Asn Asp Ile Gly Val Ser Glu Ala Thr Phe Thr Arg Ala Pro Leu Asp Asn Asp Ile Gly Val Ser Glu Ala Thr
785 790 795 800 785 790 795 800
Arg Ile Asp Pro Asn Ala Trp Val Glu Arg Trp Lys Ala Ala Gly His Arg Ile Asp Pro Asn Ala Trp Val Glu Arg Trp Lys Ala Ala Gly His
805 810 815 805 810 815
Tyr Gln Ala Glu Ala Ala Leu Leu Gln Cys Thr Ala Asp Thr Leu Ala Tyr Gln Ala Glu Ala Ala Leu Leu Gln Cys Thr Ala Asp Thr Leu Ala
820 825 830 820 825 830
Asp Ala Val Leu Ile Thr Thr Ala His Ala Trp Gln His Gln Gly Lys Asp Ala Val Leu Ile Thr Thr Ala His Ala Trp Gln His Gln Gly Lys
835 840 845 835 840 845
Thr Leu Phe Ile Ser Arg Lys Thr Tyr Arg Ile Asp Gly Ser Gly Gln Thr Leu Phe Ile Ser Arg Lys Thr Tyr Arg Ile Asp Gly Ser Gly Gln
850 855 860 850 855 860
Met Ala Ile Thr Val Asp Val Glu Val Ala Ser Asp Thr Pro His Pro Met Ala Ile Thr Val Asp Val Glu Val Ala Ser Asp Thr Pro His Pro
865 870 875 880 865 870 875 880
Ala Arg Ile Gly Leu Asn Cys Gln Leu Ala Gln Val Ala Glu Arg Val Ala Arg Ile Gly Leu Asn Cys Gln Leu Ala Gln Val Ala Glu Arg Val
885 890 895 885 890 895
Asn Trp Leu Gly Leu Gly Pro Gln Glu Asn Tyr Pro Asp Arg Leu Thr Asn Trp Leu Gly Leu Gly Pro Gln Glu Asn Tyr Pro Asp Arg Leu Thr
900 905 910 900 905 910
Ala Ala Cys Phe Asp Arg Trp Asp Leu Pro Leu Ser Asp Met Tyr Thr Ala Ala Cys Phe Asp Arg Trp Asp Leu Pro Leu Ser Asp Met Tyr Thr
915 920 925 915 920 925
Pro Tyr Val Phe Pro Ser Glu Asn Gly Leu Arg Cys Gly Thr Arg Glu Pro Tyr Val Phe Pro Ser Glu Asn Gly Leu Arg Cys Gly Thr Arg Glu
930 935 940 930 935 940
Leu Asn Tyr Gly Pro His Gln Trp Arg Gly Asp Phe Gln Phe Asn Ile Leu Asn Tyr Gly Pro His Gln Trp Arg Gly Asp Phe Gln Phe Asn Ile
945 950 955 960 945 950 955 960
Ser Arg Tyr Ser Gln Gln Gln Leu Met Glu Thr Ser His Arg His Leu Ser Arg Tyr Ser Gln Gln Gln Leu Met Glu Thr Ser His Arg His Leu
965 970 975 965 970 975
Leu His Ala Glu Glu Gly Thr Trp Leu Asn Ile Asp Gly Phe His Met Leu His Ala Glu Glu Gly Thr Trp Leu Asn Ile Asp Gly Phe His Met
980 985 990 980 985 990
Gly Ile Gly Gly Asp Asp Ser Trp Ser Pro Ser Val Ser Ala Glu Phe Gly Ile Gly Gly Asp Asp Ser Trp Ser Pro Ser Val Ser Ala Glu Phe
995 1000 1005 995 1000 1005
Gln Leu Ser Ala Gly Arg Tyr His Tyr Gln Leu Val Trp Cys Gln Gln Leu Ser Ala Gly Arg Tyr His Tyr Gln Leu Val Trp Cys Gln
1010 1015 1020 1010 1015 1020
Lys Lys
<210> 96<210> 96
<211> 3075<211> 3075
<212> ДНК<212> DNA
<213> Escherichia coli<213> Escherichia coli
<400> 96<400> 96
atgaccatga ttacggattc actggccgtc gttttacaac gtcgtgactg ggaaaaccct 60atgaccatga ttacggattc actggccgtc gttttacaac gtcgtgactg ggaaaaccct 60
ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc 120ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc 120
gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc 180gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc 180
tttgcctggt ttccggcacc agaagcggtg ccggaaagct ggctggagtg cgatcttcct 240tttgcctggt ttccggcacc agaagcggtg ccggaaagct ggctggagtg cgatcttcct 240
gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc acggttacga tgcgcccatc 300gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc acggttacga tgcgcccatc 300
tacaccaacg tgacctatcc cattacggtc aatccgccgt ttgttcccac ggagaatccg 360tacaccaacg tgacctatcc cattacggtc aatccgccgt ttgttcccac ggagaatccg 360
acgggttgtt actcgctcac atttaatgtt gatgaaagct ggctacagga aggccagacg 420acgggttgtt actcgctcac atttaatgtt gatgaaagct ggctacagga aggccagacg 420
cgaattattt ttgatggcgt taactcggcg tttcatctgt ggtgcaacgg gcgctgggtc 480cgaattattt ttgatggcgt taactcggcg tttcatctgt ggtgcaacgg gcgctgggtc 480
ggttacggcc aggacagtcg tttgccgtct gaatttgacc tgagcgcatt tttacgcgcc 540ggttacggcc aggacagtcg tttgccgtct gaatttgacc tgagcgcatt tttacgcgcc 540
ggagaaaacc gcctcgcggt gatggtgctg cgctggagtg acggcagtta tctggaagat 600ggagaaaacc gcctcgcggt gatggtgctg cgctggagtg acggcagtta tctggaagat 600
caggatatgt ggcggatgag cggcattttc cgtgacgtct cgttgctgca taaaccgact 660caggatatgt ggcggatgag cggcattttc cgtgacgtct cgttgctgca taaaccgact 660
acacaaatca gcgatttcca tgttgccact cgctttaatg atgatttcag ccgcgctgta 720acacaaatca gcgatttcca tgttgccact cgctttaatg atgatttcag ccgcgctgta 720
ctggaggctg aagttcagat gtgcggcgag ttgcgtgact acctacgggt aacagtttct 780ctggaggctg aagttcagat gtgcggcgag ttgcgtgact acctacgggt aacagtttct 780
ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc ctttcggcgg tgaaattatc 840ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc ctttcggcgg tgaaattatc 840
gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc tgaacgtcga aaacccgaaa 900gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc tgaacgtcga aaacccgaaa 900
ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg ttgaactgca caccgccgac 960ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg ttgaactgca caccgccgac 960
ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc gcgaggtgcg gattgaaaat 1020ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc gcgaggtgcg gattgaaaat 1020
ggtctgctgc tgctgaacgg caagccgttg ctgattcgag gcgttaaccg tcacgagcat 1080ggtctgctgc tgctgaacgg caagccgttg ctgattcgag gcgttaaccg tcacgagcat 1080
catcctctgc atggtcaggt catggatgag cagacgatgg tgcaggatat cctgctgatg 1140catcctctgc atggtcaggt catggatgag cagacgatgg tgcaggatat cctgctgatg 1140
aagcagaaca actttaacgc cgtgcgctgt tcgcattatc cgaaccatcc gctgtggtac 1200aagcagaaca actttaacgc cgtgcgctgt tcgcattatc cgaaccatcc gctgtggtac 1200
acgctgtgcg accgctacgg cctgtatgtg gtggatgaag ccaatattga aacccacggc 1260acgctgtgcg accgctacgg cctgtatgtg gtggatgaag ccaatattga aacccacggc 1260
atggtgccaa tgaatcgtct gaccgatgat ccgcgctggc taccggcgat gagcgaacgc 1320atggtgccaa tgaatcgtct gaccgatgat ccgcgctggc taccggcgat gagcgaacgc 1320
gtaacgcgaa tggtgcagcg cgatcgtaat cacccgagtg tgatcatctg gtcgctgggg 1380gtaacgcgaa tggtgcagcg cgatcgtaat cacccgagtg tgatcatctg gtcgctgggg 1380
aatgaatcag gccacggcgc taatcacgac gcgctgtatc gctggatcaa atctgtcgat 1440aatgaatcag gccacggcgc taatcacgac gcgctgtatc gctggatcaa atctgtcgat 1440
ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca ccacggccac cgatattatt 1500ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca ccacggccac cgatattatt 1500
tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc cggctgtgcc gaaatggtcc 1560tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc cggctgtgcc gaaatggtcc 1560
atcaaaaaat ggctttcgct acctggagag acgcgcccgc tgatcctttg cgaatacgcc 1620atcaaaaaat ggctttcgct acctggagag acgcgcccgc tgatcctttg cgaatacgcc 1620
cacgcgatgg gtaacagtct tggcggtttc gctaaatact ggcaggcgtt tcgtcagtat 1680cacgcgatgg gtaacagtct tggcggtttc gctaaatact ggcaggcgtt tcgtcagtat 1680
ccccgtttac agggcggctt cgtctgggac tgggtggatc agtcgctgat taaatatgat 1740ccccgtttac agggcggctt cgtctgggac tgggtggatc agtcgctgat taaatatgat 1740
gaaaacggca acccgtggtc ggcttacggc ggtgattttg gcgatacgcc gaacgatcgc 1800gaaaacggca acccgtggtc ggcttacggc ggtgattttg gcgatacgcc gaacgatcgc 1800
cagttctgta tgaacggtct ggtctttgcc gaccgcacgc cgcatccagc gctgacggaa 1860cagttctgta tgaacggtct ggtctttgcc gaccgcacgc cgcatccagc gctgacggaa 1860
gcaaaacacc agcagcagtt tttccagttc cgtttatccg ggcaaaccat cgaagtgacc 1920gcaaaacacc agcagcagtt tttccagttc cgtttatccg ggcaaaccat cgaagtgacc 1920
agcgaatacc tgttccgtca tagcgataac gagctcctgc actggatggt ggcgctggat 1980agcgaatacc tgttccgtca tagcgataac gagctcctgc actggatggt ggcgctggat 1980
ggtaagccgc tggcaagcgg tgaagtgcct ctggatgtcg ctccacaagg taaacagttg 2040ggtaagccgc tggcaagcgg tgaagtgcct ctggatgtcg ctccacaagg taaacagttg 2040
attgaactgc ctgaactacc gcagccggag agcgccgggc aactctggct cacagtacgc 2100attgaactgc ctgaactacc gcagccggag agcgccgggc aactctggct cacagtacgc 2100
gtagtgcaac cgaacgcgac cgcatggtca gaagccggac acatcagcgc ctggcagcag 2160gtagtgcaac cgaacgcgac cgcatggtca gaagccggac acatcagcgc ctggcagcag 2160
tggcgtctgg ctgaaaacct cagcgtgaca ctccccgccg cgtcccacgc catcccgcat 2220tggcgtctgg ctgaaaacct cagcgtgaca ctccccgccg cgtcccacgc catcccgcat 2220
ctgaccacca gcgaaatgga tttttgcatc gagctgggta ataagcgttg gcaatttaac 2280ctgaccacca gcgaaatgga tttttgcatc gagctgggta ataagcgttg gcaatttaac 2280
cgccagtcag gctttctttc acagatgtgg attggcgata aaaaacaact gctgacgccg 2340cgccagtcag gctttctttc acagatgtgg attggcgata aaaaacaact gctgacgccg 2340
ctgcgcgatc agttcacccg tgcaccgctg gataacgaca ttggcgtaag tgaagcgacc 2400ctgcgcgatc agttcacccg tgcaccgctg gataacgaca ttggcgtaag tgaagcgacc 2400
cgcattgacc ctaacgcctg ggtcgaacgc tggaaggcgg cgggccatta ccaggccgaa 2460cgcattgacc ctaacgcctg ggtcgaacgc tggaaggcgg cgggccatta ccaggccgaa 2460
gcagcgttgt tgcagtgcac ggcagataca cttgctgatg cggtgctgat tacgaccgct 2520gcagcgttgt tgcagtgcac ggcagataca cttgctgatg cggtgctgat tacgaccgct 2520
cacgcgtggc agcatcaggg gaaaacctta tttatcagcc ggaaaaccta ccggattgat 2580cacgcgtggc agcatcaggg gaaaacctta tttatcagcc ggaaaaccta ccggattgat 2580
ggtagtggtc aaatggcgat taccgttgat gttgaagtgg cgagcgatac accgcatccg 2640ggtagtggtc aaatggcgat taccgttgat gttgaagtgg cgagcgatac accgcatccg 2640
gcgcggattg gcctgaactg ccagctggcg caggtagcag agcgggtaaa ctggctcgga 2700gcgcggattg gcctgaactg ccagctggcg caggtagcag agcgggtaaa ctggctcgga 2700
ttagggccgc aagaaaacta tcccgaccgc cttactgccg cctgttttga ccgctgggat 2760ttagggccgc aagaaaacta tcccgaccgc cttactgccg cctgttttga ccgctgggat 2760
ctgccattgt cagacatgta taccccgtac gtcttcccga gcgaaaacgg tctgcgctgc 2820ctgccattgt cagacatgta taccccgtac gtcttcccga gcgaaaacgg tctgcgctgc 2820
gggacgcgcg aattgaatta tggcccacac cagtggcgcg gcgacttcca gttcaacatc 2880gggacgcgcg aattgaatta tggcccacac cagtggcgcg gcgacttcca gttcaacatc 2880
agccgctaca gtcaacagca actgatggaa accagccatc gccatctgct gcacgcggaa 2940agccgctaca gtcaacagca actgatggaa accagccatc gccatctgct gcacgcggaa 2940
gaaggcacat ggctgaatat cgacggtttc catatgggga ttggtggcga cgactcctgg 3000gaaggcacat ggctgaatat cgacggtttc catatgggga ttggtggcga cgactcctgg 3000
agcccgtcag tatcggcgga attccagctg agcgccggtc gctaccatta ccagttggtc 3060agcccgtcag tatcggcgga attccagctg agcgccggtc gctaccatta ccagttggtc 3060
tggtgtcaaa aataa 3075tggtgtcaaa aataa 3075
<210> 97<210> 97
<211> 3123<211> 3123
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета<223> Expression Cassette
<400> 97<400> 97
ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60
cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120
cgacaaaaat ctagaaataa ttttgtttaa ctttaagaag gagatataca aatgatcgct 180cgacaaaaat ctagaaataa ttttgtttaa ctttaagaag gagatataca aatgatcgct 180
caccgtcgtc aggaactggc tcaacagtat tatcaggctc tgcaccaaga tgtgctgccg 240caccgtcgtc aggaactggc tcaacagtat tatcaggctc tgcaccaaga tgtgctgccg 240
ttctgggaaa agtattcgct ggatcgtcaa ggcggtggct attttacctg cctggaccgc 300ttctgggaaa agtattcgct ggatcgtcaa ggcggtggct attttacctg cctggaccgc 300
aagggtcagg tttttgatac ggacaagttc atttggctgc aaaaccgtca agtgtggcaa 360aagggtcagg tttttgatac ggacaagttc atttggctgc aaaaccgtca agtgtggcaa 360
tttgcggttt tctacaatcg cctggaaccg aaaccgcagt ggctggaaat cgctcgtcat 420tttgcggttt tctacaatcg cctggaaccg aaaccgcagt ggctggaaat cgctcgtcat 420
ggtgcggatt ttctggcacg tcacggtcgt gatcaggacg gtaactggta tttcgccctg 480ggtgcggatt ttctggcacg tcacggtcgt gatcaggacg gtaactggta tttcgccctg 480
gatcaggaag gcaaaccgct gcgccaaccg tacaatgtgt tttccgactg tttcgcggcg 540gatcaggaag gcaaaccgct gcgccaaccg tacaatgtgt tttccgactg tttcgcggcg 540
atggcgttta gccagtatgc actggcttct ggtgctcaag aagcgaaggc cattgcactg 600atggcgttta gccagtatgc actggcttct ggtgctcaag aagcgaaggc cattgcactg 600
caagcgtata acaatgttct gcgtcgccag cataacccga aaggtcaata tgaaaagagt 660caagcgtata acaatgttct gcgtcgccag cataacccga aaggtcaata tgaaaagagt 660
tacccgggta cccgtccgct gaaatccctg gcagtgccga tgatcctggc taatctgacg 720tacccgggta cccgtccgct gaaatccctg gcagtgccga tgatcctggc taatctgacg 720
ctggaaatgg aatggctgct gccgccgacc acggtcgaag aagtgctggc ccagaccgtt 780ctggaaatgg aatggctgct gccgccgacc acggtcgaag aagtgctggc ccagaccgtt 780
cgtgaagtca tgacggattt tctggacccg gaaattggcc tgatgcgcga agcagttacc 840cgtgaagtca tgacggattt tctggacccg gaaattggcc tgatgcgcga agcagttacc 840
ccgacgggtg aatttgtcga ttcattcgaa ggccgcctgc tgaacccggg tcatggcatt 900ccgacgggtg aatttgtcga ttcattcgaa ggccgcctgc tgaacccggg tcatggcatt 900
gaagcgatgt ggtttatgat ggatattgcc cagcgttcgg gtgaccgcca gctgcaagaa 960gaagcgatgt ggtttatgat ggatattgcc cagcgttcgg gtgaccgcca gctgcaagaa 960
caggctattg cggtggttct gaataccctg gaatatgcat gggatgaaga atttggtggc 1020caggctattg cggtggttct gaataccctg gaatatgcat gggatgaaga atttggtggc 1020
atcttttact tcctggaccg tcaaggtcac ccgccgcagc aactggaatg ggatcagaaa 1080atcttttact tcctggaccg tcaaggtcac ccgccgcagc aactggaatg ggatcagaaa 1080
ctgtggtggg tccatctgga aaccctggtg gccctggcaa aaggtcacca ggcgacgggc 1140ctgtggtggg tccatctgga aaccctggtg gccctggcaa aaggtcacca ggcgacgggc 1140
caagaaaagt gctggcagtg gtttgaacgc gtgcatgatt atgcatggag ccactttgct 1200caagaaaagt gctggcagtg gtttgaacgc gtgcatgatt atgcatggag ccactttgct 1200
gacccggaat atggtgaatg gttcggctac ctgaaccgtc gcggtgaagt gctgctgaat 1260gacccggaat atggtgaatg gttcggctac ctgaaccgtc gcggtgaagt gctgctgaat 1260
ctgaaaggtg gcaaatggaa gggctgcttc cacgttccgc gtgcgctgtg gctgtgtgcc 1320ctgaaaggtg gcaaatggaa gggctgcttc cacgttccgc gtgcgctgtg gctgtgtgcc 1320
gaaaccctgc aactgccggt ctcttaataa tcgaaggaga tacaacatga gcttacccga 1380gaaaccctgc aactgccggt ctcttaataa tcgaaggaga tacaacatga gcttacccga 1380
tggattttat ataaggcgaa tggaagaggg ggatttggaa caggtcactg agacgctaaa 1440tggattttat ataaggcgaa tggaagaggg ggatttggaa caggtcactg agacgctaaa 1440
ggttttgacc accgtgggca ctattacccc cgaatccttc agcaaactca taaaatactg 1500ggttttgacc accgtgggca ctattacccc cgaatccttc agcaaactca taaaatactg 1500
gaatgaagcc acagtatgga atgataacga agataaaaaa ataatgcaat ataaccccat 1560gaatgaagcc acagtatgga atgataacga agataaaaaa ataatgcaat ataaccccat 1560
ggtgattgtg gacaagcgca ccgagacggt tgccgctacg gggaatatca tcatcgaaag 1620ggtgattgtg gacaagcgca ccgagacggt tgccgctacg gggaatatca tcatcgaaag 1620
aaagatcatt catgaactgg ggctatgtgg ccacatcgag gacattgcag taaactccaa 1680aaagatcatt catgaactgg ggctatgtgg ccacatcgag gacattgcag taaactccaa 1680
gtatcagggc caaggtttgg gcaagctctt gattgatcaa ttggtaacta tcggctttga 1740gtatcagggc caaggtttgg gcaagctctt gattgatcaa ttggtaacta tcggctttga 1740
ctacggttgt tataagatta ttttagattg cgatgagaaa aatgtcaaat tctatgaaaa 1800ctacggttgt tataagatta ttttagattg cgatgagaaa aatgtcaaat tctatgaaaa 1800
atgtgggttt agcaacgcag gcgtggaaat gcaaattaga aaatagaata actagcataa 1860atgtgggttt agcaacgcag gcgtggaaat gcaaattaga aaatagaata actagcataa 1860
acccccttgg ggcctctaaa cgggtcttga ggggtttttt gctgaaacca atttgcctgg 1920acccccttgg ggcctctaaa cgggtcttga ggggtttttt gctgaaacca atttgcctgg 1920
cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag 1980cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag 1980
cgccgatggt agtgtggggt ctccccatgc gagagtaggg aactgccagg catcaaataa 2040cgccgatggt agtgtggggt ctccccatgc gagagtaggg aactgccagg catcaaataa 2040
aacgaaaggc tcagtcgaaa gactgggcct ttcgggatcc aggccggcct gttaagacgg 2100aacgaaaggc tcagtcgaaa gactgggcct ttcgggatcc aggccggcct gttaagacgg 2100
ccagtgaatt cgagctcggt acctaccgtt cgtataatgt atgctatacg aagttatcga 2160ccagtgaatt cgagctcggt acctaccgtt cgtataatgt atgctatacg aagttatcga 2160
gctctagaga atgatcccct cattaggcca cacgttcaag tgcagcgcac accgtggaaa 2220gctctagaga atgatcccct cattaggcca cacgttcaag tgcagcgcac accgtggaaa 2220
cggatgaagg cacgaaccca gttgacataa gcctgttcgg ttcgtaaact gtaatgcaag 2280cggatgaagg cacgaaccca gttgacataa gcctgttcgg ttcgtaaact gtaatgcaag 2280
tagcgtatgc gctcacgcaa ctggtccaga accttgaccg aacgcagcgg tggtaacggc 2340tagcgtatgc gctcacgcaa ctggtccaga accttgaccg aacgcagcgg tggtaacggc 2340
gcagtggcgg ttttcatggc ttgttatgac tgtttttttg tacagtctat gcctcgggca 2400gcagtggcgg ttttcatggc ttgttatgac tgtttttttg tacagtctat gcctcgggca 2400
tccaagcagc aagcgcgtta cgccgtgggt cgatgtttga tgttatggag cagcaacgat 2460tccaagcagc aagcgcgtta cgccgtgggt cgatgtttga tgttatggag cagcaacgat 2460
gttacgcagc agcaacgatg ttacgcagca gggcagtcgc cctaaaacaa agttaggtgg 2520gttacgcagc agcaacgatg ttacgcagca gggcagtcgc cctaaaacaa agttaggtgg 2520
ctcaagtatg ggcatcattc gcacatgtag gctcggccct gaccaagtca aatccatgcg 2580ctcaagtatg ggcatcattc gcacatgtag gctcggccct gaccaagtca aatccatgcg 2580
ggctgctctt gatcttttcg gtcgtgagtt cggagacgta gccacctact cccaacatca 2640ggctgctctt gatcttttcg gtcgtgagtt cggagacgta gccacctact cccaacatca 2640
gccggactcc gattacctcg ggaacttgct ccgtagtaag acattcatcg cgcttgctgc 2700gccggactcc gattacctcg ggaacttgct ccgtagtaag acattcatcg cgcttgctgc 2700
cttcgaccaa gaagcggttg ttggcgctct cgcggcttac gttctgccca ggtttgagca 2760cttcgaccaa gaagcggttg ttggcgctct cgcggcttac gttctgccca ggtttgagca 2760
gccgcgtagt gagatctata tctatgatct cgcagtctcc ggcgagcacc ggaggcaggg 2820gccgcgtagt gagatctata tctatgatct cgcagtctcc ggcgagcacc ggaggcaggg 2820
cattgccacc gcgctcatca atctcctcaa gcatgaggcc aacgcgcttg gtgcttatgt 2880cattgccacc gcgctcatca atctcctcaa gcatgaggcc aacgcgcttg gtgcttatgt 2880
gatctacgtg caagcagatt acggtgacga tcccgcagtg gctctctata caaagttggg 2940gatctacgtg caagcagatt acggtgacga tcccgcagtg gctctctata caaagttggg 2940
catacgggaa gaagtgatgc actttgatat cgacccaagt accgccacct aacaattcgt 3000catacgggaa gaagtgatgc actttgatat cgacccaagt accgccacct aacaattcgt 3000
tcaagccgag atcgtagaat ttcgacgacc tgcagccaag cataacttcg tataatgtat 3060tcaagccgag atcgtagaat ttcgacgacc tgcagccaag cataacttcg tataatgtat 3060
gctatacgaa cggtaggatc ctctagagtc gacctgcagg catgagatgt gtataagaga 3120gctatacgaa cggtaggatc ctctagagtc gacctgcagg catgagatgt gtataagaga 3120
cag 3123cag 3123
<210> 98<210> 98
<211> 2965<211> 2965
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета<223> Expression Cassette
<400> 98<400> 98
ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60
cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120
cgacaaaaat ctagaaataa ttttgtttaa ctttaagaag gagatataca aatgaaagaa 180cgacaaaaat ctagaaataa ttttgtttaa ctttaagaag gagatataca aatgaaagaa 180
atcaaaatcc agaacatcat catcagcgaa gaaaaagcgc cgctggttgt gccggaaatc 240atcaaaatcc agaacatcat catcagcgaa gaaaaagcgc cgctggttgt gccggaaatc 240
ggcattaacc ataatggtag tctggaactg gcaaaaatca tggtggatgc ggcctttagc 300ggcattaacc ataatggtag tctggaactg gcaaaaatca tggtggatgc ggcctttagc 300
gccggtgcaa aaatcattaa acatcagacc cacattgtgg aagatgaaat gtctaaagca 360gccggtgcaa aaatcattaa acatcagacc cacattgtgg aagatgaaat gtctaaagca 360
gcgaaaaaag ttatcccggg caacgcgaaa atcagtatct acgaaatcat gcagaaatgc 420gcgaaaaaag ttatcccggg caacgcgaaa atcagtatct acgaaatcat gcagaaatgc 420
gcgctggatt acaaagatga actggccctg aaagaatata ccgaaaaact gggtctggtg 480gcgctggatt acaaagatga actggccctg aaagaatata ccgaaaaact gggtctggtg 480
tacctgtcta ccccgtttag tcgtgcgggt gcaaaccgtc tggaagatat gggtgttagt 540tacctgtcta ccccgtttag tcgtgcgggt gcaaaccgtc tggaagatat gggtgttagt 540
gcgttcaaaa tcggcagcgg tgaatgtaac aattatccgc tgatcaaaca tattgccgca 600gcgttcaaaa tcggcagcgg tgaatgtaac aattatccgc tgatcaaaca tattgccgca 600
tttaaaaaac cgatgattgt tagcaccggc atgaatagca tcgaatctat taaaccgacg 660tttaaaaaac cgatgattgt tagcaccggc atgaatagca tcgaatctat taaaccgacg 660
gtgaaaatcc tgctggataa cgaaattccg tttgttctga tgcataccac gaatctgtac 720gtgaaaatcc tgctggataa cgaaattccg tttgttctga tgcataccac gaatctgtac 720
ccgaccccgc acaacctggt gcgtctgaat gccatgctgg aactgaaaaa agaattctct 780ccgaccccgc acaacctggt gcgtctgaat gccatgctgg aactgaaaaa agaattctct 780
tgcatggttg gtctgagtga tcacaccacg gataatctgg catgcctggg tgcagtggtt 840tgcatggttg gtctgagtga tcacaccacg gataatctgg catgcctggg tgcagtggtt 840
ctgggtgcgt gtgtgctgga acgtcatttc accgatagca tgcaccgctc tggtccggat 900ctgggtgcgt gtgtgctgga acgtcatttc accgatagca tgcaccgctc tggtccggat 900
attgtttgta gtatggatac gaaagcactg aaagaactga tcattcagag cgaacagatg 960attgtttgta gtatggatac gaaagcactg aaagaactga tcattcagag cgaacagatg 960
gcgatcattc gcggcaacaa tgaatctaaa aaagcggcca aacaggaaca ggtgaccatc 1020gcgatcattc gcggcaacaa tgaatctaaa aaagcggcca aacaggaaca ggtgaccatc 1020
gattttgcat tcgcgagtgt ggttagcatc aaagatatca aaaaaggcga agtgctgagc 1080gattttgcat tcgcgagtgt ggttagcatc aaagatatca aaaaaggcga agtgctgagc 1080
atggataata tttgggttaa acgtccgggt ctgggcggta tctctgcagc ggaatttgaa 1140atggataata tttgggttaa acgtccgggt ctgggcggta tctctgcagc ggaatttgaa 1140
aacattctgg gcaaaaaagc actgcgcgat attgaaaatg atgcgcagct gtcttatgaa 1200aacattctgg gcaaaaaagc actgcgcgat attgaaaatg atgcgcagct gtcttatgaa 1200
gatttcgcct aataaatcga tactagcata accccttggg gcctctaaac gcgtcgacac 1260gatttcgcct aataaatcga tactagcata accccttggg gcctctaaac gcgtcgacac 1260
gcaaaaaggc catccgtcag gatggccttc tgcttaattt gatgcctggc agtttatggc 1320gcaaaaaggc catccgtcag gatggccttc tgcttaattt gatgcctggc agtttatggc 1320
gggcgtcctg cccgccaccc tccgggccgt tgcttcgcaa cgttcaaatc cgctcccggc 1380gggcgtcctg cccgccaccc tccggggccgt tgcttcgcaa cgttcaaatc cgctcccggc 1380
ggatttgtcc tactcaggag agcgttcacc gacaaacaac agataaaacg aaaggcccag 1440ggatttgtcc tactcaggag agcgttcacc gacaaacaac agataaaacg aaaggcccag 1440
tctttcgact gagcctttcg ttttatttga tgcctggcag ttccctactc tcgcatgggg 1500tctttcgact gagcctttcg ttttatttga tgcctggcag ttccctactc tcgcatgggg 1500
agaccccaca ctaccatccg gtatcgataa gcttgatggc gaaaggggga tgtgctgcaa 1560agaccccaca ctaccatccg gtatcgataa gcttgatggc gaaaggggga tgtgctgcaa 1560
ggcgattaag ttgggtaacg ccagggtttt cccagtcacg acgttgtaaa acgacggcca 1620ggcgattaag ttgggtaacg cccaggtttt cccagtcacg acgttgtaaa acgacggcca 1620
gtgaattcga gctcggtacc taccgttcgt ataatgtatg ctatacgaag ttatcgagct 1680gtgaattcga gctcggtacc taccgttcgt ataatgtatg ctatacgaag ttatcgagct 1680
ctagagaatg atcccctccc tcacgctgcc gcaagcactc agggcgcaag ggctgctaaa 1740ctagagaatg atcccctccc tcacgctgcc gcaagcactc agggcgcaag ggctgctaaa 1740
ggaagcggaa cacgtagaaa gccagtccgc agaaacggtg ctgaccccgg atgaatgtca 1800ggaagcggaa cacgtagaaa gccagtccgc agaaacggtg ctgaccccgg atgaatgtca 1800
gctactgggc tatctggaca agggaaaacg caagcgcaaa gagaaagcag gtagcttgca 1860gctactgggc tatctggaca agggaaaacg caagcgcaaa gagaaagcag gtagcttgca 1860
gtgggcttac atggcgatag ctagactggg cggttttatg gacagcaagc gaaccggaat 1920gtgggcttac atggcgatag ctagactggg cggttttatg gacagcaagc gaaccggaat 1920
tgccagctgg ggcgccctct ggtaaggttg ggaagccctg caaagtaaac tggatggctt 1980tgccagctgg ggcgccctct ggtaaggttg ggaagccctg caaagtaaac tggatggctt 1980
tcttgccgcc aaggatctga tggcgcaggg gatcaagatc tgatcaagag acaggatgag 2040tcttgccgcc aaggatctga tggcgcaggg gatcaagatc tgatcaagag acaggatgag 2040
gatcgtttcg catgattgaa caagatggat tgcacgcagg ttctccggcc gcttgggtgg 2100gatcgtttcg catgattgaa caagatggat tgcacgcagg ttctccggcc gcttgggtgg 2100
agaggctatt cggctatgac tgggcacaac agacaatcgg ctgctctgat gccgccgtgt 2160agaggctatt cggctatgac tgggcacaac agacaatcgg ctgctctgat gccgccgtgt 2160
tccggctgtc agcgcagggg cgcccggttc tttttgtcaa gaccgacctg tccggtgccc 2220tccggctgtc agcgcagggg cgcccggttc tttttgtcaa gaccgacctg tccggtgccc 2220
tgaatgaact gcaggacgag gcagcgcggc tatcgtggct ggccacgacg ggcgttcctt 2280tgaatgaact gcaggacgag gcagcgcggc tatcgtggct ggccacgacg ggcgttcctt 2280
gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga ctggctgcta ttgggcgaag 2340gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga ctggctgcta ttgggcgaag 2340
tgccggggca ggatctcctg tcatctcacc ttgctcctgc cgagaaagta tccatcatgg 2400tgccggggca ggatctcctg tcatctcacc ttgctcctgc cgagaaagta tccatcatgg 2400
ctgatgcaat gcggcggctg catacgcttg atccggctac ctgcccattc gaccaccaag 2460ctgatgcaat gcggcggctg catacgcttg atccggctac ctgcccattc gaccaccaag 2460
cgaaacatcg catcgagcga gcacgtactc ggatggaagc cggtcttgtc gatcaggatg 2520cgaaacatcg catcgagcga gcacgtactc ggatggaagc cggtcttgtc gatcaggatg 2520
atctggacga agagcatcag gggctcgcgc cagccgaact gttcgccagg ctcaaggcgc 2580atctggacga agagcatcag gggctcgcgc cagccgaact gttcgccagg ctcaaggcgc 2580
gcatgcccga cggcgaggat ctcgtcgtga cccatggcga tgcctgcttg ccgaatatca 2640gcatgcccga cggcgaggat ctcgtcgtga cccatggcga tgcctgcttg ccgaatatca 2640
tggtggaaaa tggccgcttt tctggattca tcgactgtgg ccggctgggt gtggcggacc 2700tggtggaaaa tggccgcttt tctggattca tcgactgtgg ccggctgggt gtggcggacc 2700
gctatcagga catagcgttg gctacccgtg atattgctga agagcttggc ggcgaatggg 2760gctatcagga catagcgttg gctacccgtg atattgctga agagcttggc ggcgaatggg 2760
ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga ttcgcagcgc atcgccttct 2820ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga ttcgcagcgc atcgccttct 2820
atcgccttct tgacgagttc ttctgagcgg gactctggga atttcgacga cctgcagcca 2880atcgccttct tgacgagttc ttctgagcgg gactctggga atttcgacga cctgcagcca 2880
agcataactt cgtataatgt atgctatacg aacggtagga tcctctagag tcgacctgca 2940agcataactt cgtataatgt atgctatacg aacggtagga tcctctagag tcgacctgca 2940
ggcatgagat gtgtataaga gacag 2965ggcatgagat gtgtataaga gacag 2965
<210> 99<210> 99
<211> 3904<211> 3904
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета<223> Expression Cassette
<400> 99<400> 99
ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60
cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120
cgacaaaaat ctagaaataa ttttgtttaa ctttaagaag gagatataca aatgatcgct 180cgacaaaaat ctagaaataa ttttgtttaa ctttaagaag gagatataca aatgatcgct 180
caccgtcgtc aggaactggc tcaacagtat tatcaggctc tgcaccaaga tgtgctgccg 240caccgtcgtc aggaactggc tcaacagtat tatcaggctc tgcaccaaga tgtgctgccg 240
ttctgggaaa agtattcgct ggatcgtcaa ggcggtggct attttacctg cctggaccgc 300ttctgggaaa agtattcgct ggatcgtcaa ggcggtggct attttacctg cctggaccgc 300
aagggtcagg tttttgatac ggacaagttc atttggctgc aaaaccgtca agtgtggcaa 360aagggtcagg tttttgatac ggacaagttc atttggctgc aaaaccgtca agtgtggcaa 360
tttgcggttt tctacaatcg cctggaaccg aaaccgcagt ggctggaaat cgctcgtcat 420tttgcggttt tctacaatcg cctggaaccg aaaccgcagt ggctggaaat cgctcgtcat 420
ggtgcggatt ttctggcacg tcacggtcgt gatcaggacg gtaactggta tttcgccctg 480ggtgcggatt ttctggcacg tcacggtcgt gatcaggacg gtaactggta tttcgccctg 480
gatcaggaag gcaaaccgct gcgccaaccg tacaatgtgt tttccgactg tttcgcggcg 540gatcaggaag gcaaaccgct gcgccaaccg tacaatgtgt tttccgactg tttcgcggcg 540
atggcgttta gccagtatgc actggcttct ggtgctcaag aagcgaaggc cattgcactg 600atggcgttta gccagtatgc actggcttct ggtgctcaag aagcgaaggc cattgcactg 600
caagcgtata acaatgttct gcgtcgccag cataacccga aaggtcaata tgaaaagagt 660caagcgtata acaatgttct gcgtcgccag cataacccga aaggtcaata tgaaaagagt 660
tacccgggta cccgtccgct gaaatccctg gcagtgccga tgatcctggc taatctgacg 720tacccgggta cccgtccgct gaaatccctg gcagtgccga tgatcctggc taatctgacg 720
ctggaaatgg aatggctgct gccgccgacc acggtcgaag aagtgctggc ccagaccgtt 780ctggaaatgg aatggctgct gccgccgacc acggtcgaag aagtgctggc ccagaccgtt 780
cgtgaagtca tgacggattt tctggacccg gaaattggcc tgatgcgcga agcagttacc 840cgtgaagtca tgacggattt tctggacccg gaaattggcc tgatgcgcga agcagttacc 840
ccgacgggtg aatttgtcga ttcattcgaa ggccgcctgc tgaacccggg tcatggcatt 900ccgacgggtg aatttgtcga ttcattcgaa ggccgcctgc tgaacccggg tcatggcatt 900
gaagcgatgt ggtttatgat ggatattgcc cagcgttcgg gtgaccgcca gctgcaagaa 960gaagcgatgt ggtttatgat ggatattgcc cagcgttcgg gtgaccgcca gctgcaagaa 960
caggctattg cggtggttct gaataccctg gaatatgcat gggatgaaga atttggtggc 1020caggctattg cggtggttct gaataccctg gaatatgcat gggatgaaga atttggtggc 1020
atcttttact tcctggaccg tcaaggtcac ccgccgcagc aactggaatg ggatcagaaa 1080atcttttact tcctggaccg tcaaggtcac ccgccgcagc aactggaatg ggatcagaaa 1080
ctgtggtggg tccatctgga aaccctggtg gccctggcaa aaggtcacca ggcgacgggc 1140ctgtggtggg tccatctgga aaccctggtg gccctggcaa aaggtcacca ggcgacgggc 1140
caagaaaagt gctggcagtg gtttgaacgc gtgcatgatt atgcatggag ccactttgct 1200caagaaaagt gctggcagtg gtttgaacgc gtgcatgatt atgcatggag ccactttgct 1200
gacccggaat atggtgaatg gttcggctac ctgaaccgtc gcggtgaagt gctgctgaat 1260gacccggaat atggtgaatg gttcggctac ctgaaccgtc gcggtgaagt gctgctgaat 1260
ctgaaaggtg gcaaatggaa gggctgcttc cacgttccgc gtgcgctgtg gctgtgtgcc 1320ctgaaaggtg gcaaatggaa gggctgcttc cacgttccgc gtgcgctgtg gctgtgtgcc 1320
gaaaccctgc aactgccggt ctcttaattt cgtcgacaca caggaaacat attaaaaatt 1380gaaaccctgc aactgccggt ctcttaattt cgtcgacaca caggaaacat attaaaaatt 1380
aaaacctgca ggagtttaaa cgcggccgcg atatcgttgt aaaacgacgg ccagtgcaag 1440aaaacctgca ggagtttaaa cgcggccgcg atatcgttgt aaaacgacgg ccagtgcaag 1440
aatcataaaa aatttatttg ctttcaggaa aatttttctg tataatagat tcataaattt 1500aatcataaaa aatttatttg ctttcaggaa aatttttctg tataatagat tcataaattt 1500
gagagaggag tttttgtgag cggataacaa ttccccatct tagtatatta gttaagtata 1560gagagaggag tttttgtgag cggataacaa ttccccatct tagtatatta gttaagtata 1560
aatacacaag gagatataca tatgaaagaa atcaaaatcc agaacatcat catcagcgaa 1620aatacacaag gagatataca tatgaaagaa atcaaaatcc agaacatcat catcagcgaa 1620
gaaaaagcgc cgctggttgt gccggaaatc ggcattaacc ataatggtag tctggaactg 1680gaaaaagcgc cgctggttgt gccggaaatc ggcattaacc ataatggtag tctggaactg 1680
gcaaaaatca tggtggatgc ggcctttagc gccggtgcaa aaatcattaa acatcagacc 1740gcaaaaatca tggtggatgc ggcctttagc gccggtgcaa aaatcattaa acatcagacc 1740
cacattgtgg aagatgaaat gtctaaagca gcgaaaaaag ttatcccggg caacgcgaaa 1800cacattgtgg aagatgaaat gtctaaagca gcgaaaaaag ttatcccggg caacgcgaaa 1800
atcagtatct acgaaatcat gcagaaatgc gcgctggatt acaaagatga actggccctg 1860atcagtatct acgaaatcat gcagaaatgc gcgctggatt acaaagatga actggccctg 1860
aaagaatata ccgaaaaact gggtctggtg tacctgtcta ccccgtttag tcgtgcgggt 1920aaagaatata ccgaaaaact gggtctggtg tacctgtcta ccccgtttag tcgtgcgggt 1920
gcaaaccgtc tggaagatat gggtgttagt gcgttcaaaa tcggcagcgg tgaatgtaac 1980gcaaaccgtc tggaagatat gggtgttagt gcgttcaaaa tcggcagcgg tgaatgtaac 1980
aattatccgc tgatcaaaca tattgccgca tttaaaaaac cgatgattgt tagcaccggc 2040aattatccgc tgatcaaaca tattgccgca tttaaaaaac cgatgattgt tagcaccggc 2040
atgaatagca tcgaatctat taaaccgacg gtgaaaatcc tgctggataa cgaaattccg 2100atgaatagca tcgaatctat taaaccgacg gtgaaaatcc tgctggataa cgaaattccg 2100
tttgttctga tgcataccac gaatctgtac ccgaccccgc acaacctggt gcgtctgaat 2160tttgttctga tgcataccac gaatctgtac ccgaccccgc acaacctggt gcgtctgaat 2160
gccatgctgg aactgaaaaa agaattctct tgcatggttg gtctgagtga tcacaccacg 2220gccatgctgg aactgaaaaa agaattctct tgcatggttg gtctgagtga tcacaccacg 2220
gataatctgg catgcctggg tgcagtggtt ctgggtgcgt gtgtgctgga acgtcatttc 2280gataatctgg catgcctggg tgcagtggtt ctgggtgcgt gtgtgctgga acgtcatttc 2280
accgatagca tgcaccgctc tggtccggat attgtttgta gtatggatac gaaagcactg 2340accgatagca tgcaccgctc tggtccggat attgtttgta gtatggatac gaaagcactg 2340
aaagaactga tcattcagag cgaacagatg gcgatcattc gcggcaacaa tgaatctaaa 2400aaagaactga tcattcagag cgaacagatg gcgatcattc gcggcaacaa tgaatctaaa 2400
aaagcggcca aacaggaaca ggtgaccatc gattttgcat tcgcgagtgt ggttagcatc 2460aaagcggcca aacaggaaca ggtgaccatc gattttgcat tcgcgagtgt ggttagcatc 2460
aaagatatca aaaaaggcga agtgctgagc atggataata tttgggttaa acgtccgggt 2520aaagatatca aaaaaggcga agtgctgagc atggataata tttgggttaa acgtccgggt 2520
ctgggcggta tctctgcagc ggaatttgaa aacattctgg gcaaaaaagc actgcgcgat 2580ctgggcggta tctctgcagc ggaatttgaa aacattctgg gcaaaaaagc actgcgcgat 2580
attgaaaatg atgcgcagct gtcttatgaa gatttcgcct aaaataacta gcataacccc 2640attgaaaatg atgcgcagct gtcttatgaa gatttcgcct aaaataacta gcataacccc 2640
ttggggcctc taaacgggtc ttgaggggtt ttttgctgaa accaatttgc ctggcggcag 2700ttggggcctc taaacgggtc ttgaggggtt ttttgctgaa accaatttgc ctggcggcag 2700
tagcgcggtg gtcccacctg accccatgcc gaactcagaa gtgaaacgcc gtagcgccga 2760tagcgcggtg gtcccacctg accccatgcc gaactcagaa gtgaaacgcc gtagcgccga 2760
tggtagtgtg gggtctcccc atgcgagagt agggaactgc caggcatcaa ataaaacgaa 2820tggtagtgtg gggtctcccc atgcgagagt agggaactgc caggcatcaa ataaaacgaa 2820
aggctcagtc gaaagactgg gcctttcggg atccaggccg gcctgttaac gaattaatct 2880aggctcagtc gaaagactgg gcctttcggg atccaggccg gcctgttaac gaattaatct 2880
tccgcggcgg tatcgataag cttgatatcg aattccgaag ttcctattct ctagaaagta 2940tccgcggcgg tatcgataag cttgatatcg aattccgaag ttcctattct ctagaaagta 2940
taggaacttc aggtctgaag aggagtttac gtccagccaa gctagcttgg ctgcaggtcg 3000taggaacttc aggtctgaag aggagtttac gtccagccaa gctagcttgg ctgcaggtcg 3000
tcgaaattct accgggtagg ggaggcgctt ttcccaaggc agtctggagc atgcgcttta 3060tcgaaattct accgggtagg ggaggcgctt ttcccaaggc agtctggagc atgcgcttta 3060
gcagccccgc tgggcacttg gcgctacaca agtggcctct ggcctcgcac acattccaca 3120gcagccccgc tgggcacttg gcgctacaca agtggcctct ggcctcgcac acattccaca 3120
tccaccggta ggcgccaacc ggctccgttc tttggtggcc ccttcgcgcc accttctact 3180tccaccggta ggcgccaacc ggctccgttc tttggtggcc ccttcgcgcc accttctact 3180
cctcccctag tcaggaagtt cccccccgcc ccgcagctcg cgtcgtgcag gacgtgacaa 3240cctcccctag tcaggaagtt cccccccgcc ccgcagctcg cgtcgtgcag gacgtgacaa 3240
atggaagtag cacgtctcac tagtctcgtg cagatggaca gcaccgctga gcaatggaag 3300atggaagtag cacgtctcac tagtctcgtg cagatggaca gcaccgctga gcaatggaag 3300
cgggtaggcc tttggggcag cggccaatag cagctttgct ccttcgcttt ctgggctcag 3360cgggtaggcc tttggggcag cggccaatag cagctttgct ccttcgcttt ctgggctcag 3360
gggcgggctc agggggcggg gcgggcgccc gaaggtcctc cggaggcccg gcattctgca 3420gggcgggctc agggggcggg gcgggcgccc gaaggtcctc cggaggcccg gcattctgca 3420
cgcttcaaaa gcgcacgtct gccgcgctgt tctcctcttc ctcatctccg ggcctttcga 3480cgcttcaaaa gcgcacgtct gccgcgctgt tctcctcttc ctcatctccg ggcctttcga 3480
cctgcagcct gttgacaatt aatcatcggc atagtatatc ggcatagtat aatacgacaa 3540cctgcagcct gttgacaatt aatcatcggc atagtatatc ggcatagtat aatacgacaa 3540
ggtgaggaac taaaccatgg gtcaaagtag cgatgaagcc aacgctcccg ttgcagggca 3600ggtgaggaac taaaccatgg gtcaaagtag cgatgaagcc aacgctcccg ttgcagggca 3600
gtttgcgctt cccctgagtg ccacctttgg cttaggggat cgcgtacgca agaaatctgg 3660gtttgcgctt cccctgagtg ccacctttgg cttaggggat cgcgtacgca agaaatctgg 3660
tgccgcttgg cagggtcaag tcgtcggttg gtattgcaca aaactcactc ctgaaggcta 3720tgccgcttgg cagggtcaag tcgtcggttg gtattgcaca aaactcactc ctgaaggcta 3720
tgcggtcgag tccgaatccc acccaggctc agtgcaaatt tatcctgtgg ctgcacttga 3780tgcggtcgag tccgaatccc acccaggctc agtgcaaatt tatcctgtgg ctgcacttga 3780
acgtgtggcc taatgagggg atcaattctc tagagctcgc tgatcagaag ttcctattct 3840acgtgtggcc taatgagggg atcaattctc tagagctcgc tgatcagaag ttcctattct 3840
ctagaaagta taggaacttc gatggcgcct catccctgaa gccaaagatg tgtataagag 3900ctagaaagta taggaacttc gatggcgcct catccctgaa gccaaagatg tgtataagag 3900
acag 3904acag 3904
<210> 100<210> 100
<211> 3793<211> 3793
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета<223> Expression Cassette
<400> 100<400> 100
ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60
cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120
cgacaaaaat ctagaaataa ttttgtttgg cgtcgagaag gagatagaaa atgtgcggta 180cgacaaaaat ctagaaataa ttttgtttgg cgtcgagaag gagatagaaa atgtgcggta 180
tcgttggtgc tatcgcacag cgtgatgtag cgaaaatcct cctggaaggt ctgcgtcgtc 240tcgttggtgc tatcgcacag cgtgatgtag cgaaaatcct cctggaaggt ctgcgtcgtc 240
tcgaataccg tggttacgac tctgccggtc tggcagtagt ggatgcagaa ggtcacatga 300tcgaataccg tggttacgac tctgccggtc tggcagtagt ggatgcagaa ggtcacatga 300
ctcgtctgcg tcgtctgggt aaagtgcaga tgctcgcgca ggcggcggaa gaacacccac 360ctcgtctgcg tcgtctgggt aaagtgcaga tgctcgcgca ggcggcggaa gaacacccac 360
tccacggtgg tacgggtatc gcacacactc gttgggcaac ccacggtgaa ccgtctgagg 420tccacggtgg tacgggtatc gcacacactc gttgggcaac ccacggtgaa ccgtctgagg 420
tcaacgcaca cccgcatgtt agcgagcaca tcgtagtcgt tcacaacggt atcatcgaga 480tcaacgcaca cccgcatgtt agcgagcaca tcgtagtcgt tcacaacggt atcatcgaga 480
accacgaacc actccgtgag gaactcaaag cccgtggtta caccttcgta agcgaaaccg 540accacgaacc actccgtgag gaactcaaag cccgtggtta caccttcgta agcgaaaccg 540
acacggaagt tatcgcccac ctcgttaact gggaactcaa acagggtggt actctgcgtg 600acacggaagt tatcgcccac ctcgttaact gggaactcaa acagggtggt actctgcgtg 600
aagcagttct gcgtgccatt ccacagctgc gtggtgcata cggtaccgtg atcatggact 660aagcagttct gcgtgccatt ccacagctgc gtggtgcata cggtaccgtg atcatggact 660
ctcgtcatcc ggataccctg ctcgccgcac gttctggttc tccactcgtt atcggtctgg 720ctcgtcatcc ggataccctg ctcgccgcac gttctggttc tccactcgtt atcggtctgg 720
gtatgggtga gaacttcatc gcctctgatc agctggccct gctcccagtt acccgtcgct 780gtatgggtga gaacttcatc gcctctgatc agctggccct gctcccagtt acccgtcgct 780
tcatcttcct ggaagagggt gacatcgccg aaatcacccg tcgttccgtt aacatcttcg 840tcatcttcct ggaagagggt gacatcgccg aaatcacccg tcgttccgtt aacatcttcg 840
acaaaacggg tgcggaagtt aaacgtcagg acatcgagtc taacctgcag tatgacgctg 900acaaaacggg tgcggaagtt aaacgtcagg acatcgagtc taacctgcag tatgacgctg 900
gtgacaaagg catctaccgt cactacatgc agaaagagat ctacgaacag ccgaacgcga 960gtgacaaagg catctaccgt cactacatgc agaaagagat ctacgaacag ccgaacgcga 960
tcaaaaacac cctgaccggt cgtatctctc acggtcaggt tgacctgtct gagctgggtc 1020tcaaaaacac cctgaccggt cgtatctctc acggtcaggt tgacctgtct gagctgggtc 1020
caaacgcgga cgaactcctg tccaaagtcg agcacatcca gatcctggct tgtggtacct 1080caaacgcgga cgaactcctg tccaaagtcg agcacatcca gatcctggct tgtggtacct 1080
cttacaactc cggtatggtt tctcgttact ggttcgaatc tctggcaggt atcccatgcg 1140cttacaactc cggtatggtt tctcgttact ggttcgaatc tctggcaggt atcccatgcg 1140
acgttgaaat cgcctccgaa ttccgttatc gtaaatctgc ggtacgtcgt aactccctca 1200acgttgaaat cgcctccgaa ttccgttatc gtaaatctgc ggtacgtcgt aactccctca 1200
tgatcaccct gtctcagtct ggtgaaaccg ctgatactct ggcaggtctg cgtctcagca 1260tgatcaccct gtctcagtct ggtgaaaccg ctgatactct ggcaggtctg cgtctcagca 1260
aagaactggg ttacctgggt tctctggcca tctgcaacgt tccgggttct agcctggttc 1320aagaactggg ttacctgggt tctctggcca tctgcaacgt tccgggttct agcctggttc 1320
gtgagtctgt gctggctctg atgaccaacg cgggtacgga gatcggtgtt gcctctacca 1380gtgagtctgt gctggctctg atgaccaacg cgggtacgga gatcggtgtt gcctctacca 1380
aagcgttcac tacccagctc actgtcctgc tgatgctggt tgccaaactg tctcgtctca 1440aagcgttcac tacccagctc actgtcctgc tgatgctggt tgccaaactg tctcgtctca 1440
aaggcctcga cgctagcatc gaacacgaca tcgtacacgg tctgcaggcc ctcccatctc 1500aaggcctcga cgctagcatc gaacacgaca tcgtacacgg tctgcaggcc ctcccatctc 1500
gtatcgagca gatgctgccg caggacaaac gtatcgaagc actggcagaa gacttcagcg 1560gtatcgagca gatgctgccg caggacaaac gtatcgaagc actggcagaa gacttcagcg 1560
acaaacacca cgcgctgttt ctgggtcgtg gtgaccagta cccaattgcg ctggaaggtg 1620acaaacacca cgcgctgttt ctgggtcgtg gtgaccagta cccaattgcg ctggaaggtg 1620
ccctgaaact gaaagagatc agctacatcc atgcagaggc atacgcagcg ggtgagctga 1680ccctgaaact gaaagagatc agctacatcc atgcagaggc atacgcagcg ggtgagctga 1680
aacatggtcc actggccctg atcgacgcag atatgccggt tattgtggtt gctccgaaca 1740aacatggtcc actggccctg atcgacgcag atatgccggt tattgtggtt gctccgaaca 1740
acggcctgct ggagaaactg aaatccaaca tcgaggaagt acgtgcgcgt ggtggtcagc 1800acggcctgct ggagaaactg aaatccaaca tcgaggaagt acgtgcgcgt ggtggtcagc 1800
tgtacgtgtt tgctgaccag gacgcgggtt tcgtttccag cgacaacatg cacatcatcg 1860tgtacgtgtt tgctgaccag gacgcgggtt tcgtttccag cgacaacatg cacatcatcg 1860
aaatgccgca tgttgaagag gtaatcgcgc caatcttcta caccgtaccg ctgcagctgc 1920aaatgccgca tgttgaagag gtaatcgcgc caatcttcta caccgtaccg ctgcagctgc 1920
tggcgtacca tgtagccctg atcaaaggta cggacgttga ccagccgcgt aacctggcga 1980tggcgtacca tgtagccctg atcaaaggta cggacgttga ccagccgcgt aacctggcga 1980
aatccgtgac cgtggaataa cgaaggagat agaaccatga gcttacccga tggattttat 2040aatccgtgac cgtggaataa cgaaggagat agaaccatga gcttacccga tggattttat 2040
ataaggcgaa tggaagaggg ggatttggaa caggtcactg agacgctaaa ggttttgacc 2100ataaggcgaa tggaagaggg ggatttggaa caggtcactg agacgctaaa ggttttgacc 2100
accgtgggca ctattacccc cgaatccttc agcaaactca taaaatactg gaatgaagcc 2160accgtgggca ctattacccc cgaatccttc agcaaactca taaaatactg gaatgaagcc 2160
acagtatgga atgataacga agataaaaaa ataatgcaat ataaccccat ggtgattgtg 2220acagtatgga atgataacga agataaaaaa ataatgcaat ataaccccat ggtgattgtg 2220
gacaagcgca ccgagacggt tgccgctacg gggaatatca tcatcgaaag aaagatcatt 2280gacaagcgca ccgagacggt tgccgctacg gggaatatca tcatcgaaag aaagatcatt 2280
catgaactgg ggctatgtgg ccacatcgag gacattgcag taaactccaa gtatcagggc 2340catgaactgg ggctatgtgg ccacatcgag gacattgcag taaactccaa gtatcagggc 2340
caaggtttgg gcaagctctt gattgatcaa ttggtaacta tcggctttga ctacggttgt 2400caaggtttgg gcaagctctt gattgatcaa ttggtaacta tcggctttga ctacggttgt 2400
tataagatta ttttagattg cgatgagaaa aatgtcaaat tctatgaaaa atgtgggttt 2460tataagatta ttttagattg cgatgagaaa aatgtcaaat tctatgaaaa atgtgggttt 2460
agcaacgcag gcgtggaaat gcaaattaga aaatagcatc cgtatcggaa acactagcat 2520agcaacgcag gcgtggaaat gcaaattaga aaatagcatc cgtatcggaa acactagcat 2520
aaccccttgg ggcctctaaa cgggtcttga ggggtttttt gctgaaacca atttgcctgg 2580aaccccttgg ggcctctaaa cgggtcttga ggggtttttt gctgaaacca atttgcctgg 2580
cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag 2640cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag 2640
cgccgatggt agtgtggggt ctccccatgc gagagtaggg aactgccagg catcaaataa 2700cgccgatggt agtgtggggt ctccccatgc gagagtaggg aactgccagg catcaaataa 2700
aacgaaaggc tcagtcgaaa gactgggcct ttcgcttcca caactttgta taataaagtt 2760aacgaaaggc tcagtcgaaa gactgggcct ttcgcttcca caactttgta taataaagtt 2760
gtccccacgg ccagtgaatt cgagctcggt acctaccgtt cgtataatgt atgctatacg 2820gtccccacgg ccagtgaatt cgagctcggt acctaccgtt cgtataatgt atgctatacg 2820
aagttatcga gctctagaga atgatcccct cattaggcca cacgttcaag tgcagcgcac 2880aagttatcga gctctagaga atgatcccct cattaggcca cacgttcaag tgcagcgcac 2880
accgtggaaa cggatgaagg cacgaaccca gttgacataa gcctgttcgg ttcgtaaact 2940accgtggaaa cggatgaagg cacgaaccca gttgacataa gcctgttcgg ttcgtaaact 2940
gtaatgcaag tagcgtatgc gctcacgcaa ctggtccaga accttgaccg aacgcagcgg 3000gtaatgcaag tagcgtatgc gctcacgcaa ctggtccaga accttgaccg aacgcagcgg 3000
tggtaacggc gcagtggcgg ttttcatggc ttgttatgac tgtttttttg tacagtctat 3060tggtaacggc gcagtggcgg ttttcatggc ttgttatgac tgtttttttg tacagtctat 3060
gcctcgggca tccaagcagc aagcgcgtta cgccgtgggt cgatgtttga tgttatggag 3120gcctcgggca tccaagcagc aagcgcgtta cgccgtgggt cgatgtttga tgttatggag 3120
cagcaacgat gttacgcagc agcaacgatg ttacgcagca gggcagtcgc cctaaaacaa 3180cagcaacgat gttacgcagc agcaacgatg ttacgcagca gggcagtcgc cctaaaacaa 3180
agttaggtgg ctcaagtatg ggcatcattc gcacatgtag gctcggccct gaccaagtca 3240agttaggtgg ctcaagtatg ggcatcattc gcacatgtag gctcggccct gaccaagtca 3240
aatccatgcg ggctgctctt gatcttttcg gtcgtgagtt cggagacgta gccacctact 3300aatccatgcg ggctgctctt gatcttttcg gtcgtgagtt cggagacgta gccacctact 3300
cccaacatca gccggactcc gattacctcg ggaacttgct ccgtagtaag acattcatcg 3360cccaacatca gccggactcc gattacctcg ggaacttgct ccgtagtaag acattcatcg 3360
cgcttgctgc cttcgaccaa gaagcggttg ttggcgctct cgcggcttac gttctgccca 3420cgcttgctgc cttcgaccaa gaagcggttg ttggcgctct cgcggcttac gttctgccca 3420
ggtttgagca gccgcgtagt gagatctata tctatgatct cgcagtctcc ggcgagcacc 3480ggtttgagca gccgcgtagt gagatctata tctatgatct cgcagtctcc ggcgagcacc 3480
ggaggcaggg cattgccacc gcgctcatca atctcctcaa gcatgaggcc aacgcgcttg 3540ggaggcaggg cattgccacc gcgctcatca atctcctcaa gcatgaggcc aacgcgcttg 3540
gtgcttatgt gatctacgtg caagcagatt acggtgacga tcccgcagtg gctctctata 3600gtgcttatgt gatctacgtg caagcagatt acggtgacga tcccgcagtg gctctctata 3600
caaagttggg catacgggaa gaagtgatgc actttgatat cgacccaagt accgccacct 3660caaagttggg catacgggaa gaagtgatgc actttgatat cgacccaagt accgccacct 3660
aacaattcgt tcaagccgag atcgtagaat ttcgacgacc tgcagccaag cataacttcg 3720aacaattcgt tcaagccgag atcgtagaat ttcgacgacc tgcagccaag cataacttcg 3720
tataatgtat gctatacgaa cggtaggatc ctctagagtc gacctgcagg catgagatgt 3780tataatgtat gctatacgaa cggtaggatc ctctagagtc gacctgcagg catgagatgt 3780
gtataagaga cag 3793gtataagaga cag 3793
<210> 101<210> 101
<211> 3847<211> 3847
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета<223> Expression Cassette
<400> 101<400> 101
ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60ctgtctctta tacacatctc cggccagatg attaattcct aatttttgtt gacactctat 60
cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120cattgataga gttattttac cactccctat cagtgataga gaaaagtgaa atgaatagtt 120
cgacaaaaat ctagaaataa ttttgtttgg cgtcgagaag gagatagaac catgtccaac 180cgacaaaaat ctagaaataa ttttgtttgg cgtcgagaag gagatagaac catgtccaac 180
aatggctcgt caccgctggt gctttggtat aaccaactcg gcatgaatga tgtagacagg 240aatggctcgt caccgctggt gctttggtat aaccaactcg gcatgaatga tgtagacagg 240
gttgggggca aaaatgcctc cctgggtgaa atgattacta acctttccgg aatgggtgtt 300gttgggggca aaaatgcctc cctgggtgaa atgattacta acctttccgg aatgggtgtt 300
tccgttccga atggtttcgc cacaaccgcc gacgcgttta accagtttct ggaccaaagc 360tccgttccga atggtttcgc cacaaccgcc gacgcgttta accagtttct ggaccaaagc 360
ggcgtaaacc agcgcattta tgaactgctg gataaaacgg atattgacga tgttactcag 420ggcgtaaacc agcgcattta tgaactgctg gataaaacgg atattgacga tgttactcag 420
cttgcgaaag cgggcgcgca aatccgccag tggattatcg acactccctt ccagcctgag 480cttgcgaaag cgggcgcgca aatccgccag tggattatcg acactccctt ccagcctgag 480
ctggaaaacg ccatcagcga agcctatgca cagctttctg ccgatgacga aaacgcctct 540ctggaaaacg ccatcagcga agcctatgca cagctttctg ccgatgacga aaacgcctct 540
tttgcggtgc gctcctccgc caccgcagaa gatatgccgg acgcttcttt tgccggtcag 600tttgcggtgc gctcctccgc caccgcagaa gatatgccgg acgcttcttt tgccggtcag 600
caggaaacct tcctcaacgt tcagggtttt gacgccgttc tcgtggcagt gaaacatgta 660caggaaacct tcctcaacgt tcagggtttt gacgccgttc tcgtggcagt gaaacatgta 660
tttgcttctc tgtttaacga tcgcgccatc tcttatcgtg tgcaccaggg ttacgatcac 720tttgcttctc tgtttaacga tcgcgccatc tcttatcgtg tgcaccaggg ttacgatcac 720
cgtggtgtgg cgctctccgc cggtgttcaa cggatggtgc gctctgacct cgcatcatct 780cgtggtgtgg cgctctccgc cggtgttcaa cggatggtgc gctctgacct cgcatcatct 780
ggcgtgatgt tctccattga taccgaatcc ggctttgacc aggtggtgtt tatcacttcc 840ggcgtgatgt tctccattga taccgaatcc ggctttgacc aggtggtgtt tatcacttcc 840
gcatggggcc ttggtgagat ggtcgtgcag ggtgcggtta acccggatga gttttacgtg 900gcatggggcc ttggtgagat ggtcgtgcag ggtgcggtta acccggatga gttttacgtg 900
cataaaccga cactggcggc gaatcgcccg gctatcgtgc gccgcaccat ggggtcgaaa 960cataaaccga cactggcggc gaatcgcccg gctatcgtgc gccgcaccat ggggtcgaaa 960
aaaatccgca tggtttacgc gccgacccag gagcacggca agcaggttaa aatcgaagac 1020aaaatccgca tggtttacgc gccgacccag gagcacggca agcaggttaa aatcgaagac 1020
gtaccgcagg aacagcgtga catcttctcg ctgaccaacg aagaagtgca ggaactggca 1080gtaccgcagg aacagcgtga catcttctcg ctgaccaacg aagaagtgca ggaactggca 1080
aaacaggccg tacaaattga gaaacactac ggtcgcccga tggatattga gtgggcgaaa 1140aaacaggccg tacaaattga gaaacactac ggtcgcccga tggatattga gtgggcgaaa 1140
gatggccaca ccggtaaact gttcattgtg caggcgcgtc cggaaaccgt gcgctcacgc 1200gatggccaca ccggtaaact gttcattgtg caggcgcgtc cggaaaccgt gcgctcacgc 1200
ggtcaggtca tggagcgtta tacgctgcat tcacagggta agattatcgc cgaaggccgt 1260ggtcaggtca tggagcgtta tacgctgcat tcacagggta agattatcgc cgaaggccgt 1260
gctatcggtc atcgcatcgg tgcgggtccg gtgaaagtca tccatgatat cagcgaaatg 1320gctatcggtc atcgcatcgg tgcgggtccg gtgaaagtca tccatgatat cagcgaaatg 1320
aaccgcatcg aacctggtga cgtgctggtc actgacatga ccgacccgga ctgggaaccg 1380aaccgcatcg aacctggtga cgtgctggtc actgacatga ccgacccgga ctgggaaccg 1380
atcatgaaga aagcatctgc catcgtcacc aaccgtggcg gtcgtacctg tcacgcggcg 1440atcatgaaga aagcatctgc catcgtcacc aaccgtggcg gtcgtacctg tcacgcggcg 1440
atcatcgctc gtgaactggg cattccggcg gtagtgggct gtggtgatgc aacagaacgg 1500atcatcgctc gtgaactggg cattccggcg gtagtgggct gtggtgatgc aacagaacgg 1500
atgaaagacg gtgagaacgt cactgtttct tgtgccgaag gtgataccgg ttacgtctat 1560atgaaagacg gtgagaacgt cactgtttct tgtgccgaag gtgataccgg ttacgtctat 1560
gcggagttgc tggaatttag cgtgaaaagc tccagcgtag aaacgatgcc ggatctgccg 1620gcggagttgc tggaatttag cgtgaaaagc tccagcgtag aaacgatgcc ggatctgccg 1620
ttgaaagtga tgatgaacgt cggtaacccg gaccgagctt tcgacttcgc ctgtctgccg 1680ttgaaagtga tgatgaacgt cggtaacccg gaccgagctt tcgacttcgc ctgtctgccg 1680
aacgaaggcg tgggacttgc gcgtctggaa tttatcatca accgtatgat tggcgtccac 1740aacgaaggcg tgggacttgc gcgtctggaa tttatcatca accgtatgat tggcgtccac 1740
ccacgcgcac tgcttgagtt tgacgatcag gaaccgcagt tgcaaaacga aatccgcgag 1800ccacgcgcac tgcttgagtt tgacgatcag gaaccgcagt tgcaaaacga aatccgcgag 1800
atgatgaaag gttttgattc tccgcgtgaa ttttacgttg gtcgtctgac tgaagggatc 1860atgatgaaag gttttgattc tccgcgtgaa ttttacgttg gtcgtctgac tgaagggatc 1860
gcgacgctgg gtgccgcgtt ttatccgaag cgcgtcattg tccgtctctc tgattttaaa 1920gcgacgctgg gtgccgcgtt ttatccgaag cgcgtcattg tccgtctctc tgattttaaa 1920
tcgaacgaat atgccaacct ggtcggtggt gagcgttacg agccagatga agagaacccg 1980tcgaacgaat atgccaacct ggtcggtggt gagcgttacg agccagatga agagaacccg 1980
atgctcggct tccgtggcgc gggacgctat atttccgaca gcttccgcga ctgtttcgcg 2040atgctcggct tccgtggcgc gggacgctat atttccgaca gcttccgcga ctgtttcgcg 2040
ctggagtgcg aagcagtgaa acgtgtgcgc aacgacatgg ggctgaccaa cgttgagatc 2100ctggagtgcg aagcagtgaa acgtgtgcgc aacgacatgg ggctgaccaa cgttgagatc 2100
atgatcccgt tcgtgcgaac cgtagatcag gcgaaagcgg tggttgagga actggcgcgt 2160atgatcccgt tcgtgcgaac cgtagatcag gcgaaagcgg tggttgagga actggcgcgt 2160
caggggctga aacgtggtga gaacgggctg aaaatcatca tgatgtgtga aatcccgtcc 2220caggggctga aacgtggtga gaacgggctg aaaatcatca tgatgtgtga aatcccgtcc 2220
aacgccttgc tggccgagca gttcctcgaa tatttcgacg gcttctcaat tggctcaaac 2280aacgccttgc tggccgagca gttcctcgaa tatttcgacg gcttctcaat tggctcaaac 2280
gacatgacgc agctggcgct cggtctggat cgtgactccg gcgtggtgtc tgaactgttc 2340gacatgacgc agctggcgct cggtctggat cgtgactccg gcgtggtgtc tgaactgttc 2340
gatgagcgca acgatgcggt gaaagcactg ctgtcgatgg cgattcgtgc cgcgaagaaa 2400gatgagcgca acgatgcggt gaaagcactg ctgtcgatgg cgattcgtgc cgcgaagaaa 2400
cagggcaaat atgtcgggat ttgcggtcag ggtccgtccg accacgaaga ctttgccgca 2460cagggcaaat atgtcgggat ttgcggtcag ggtccgtccg accacgaaga ctttgccgca 2460
tggttgatgg aagaggggat cgatagcctg tctctgaacc cggacaccgt ggtgcaaacc 2520tggttgatgg aagaggggat cgatagcctg tctctgaacc cggacaccgt ggtgcaaacc 2520
tggttaagcc tggctgaact gaagaaataa catccgtatc ggaaacacta gcataacccc 2580tggttaagcc tggctgaact gaagaaataa catccgtatc ggaaacacta gcataacccc 2580
ttggggcctc taaacgggtc ttgaggggtt ttttgctgaa accaatttgc ctggcggcag 2640ttggggcctc taaacgggtc ttgaggggtt ttttgctgaa accaatttgc ctggcggcag 2640
tagcgcggtg gtcccacctg accccatgcc gaactcagaa gtgaaacgcc gtagcgccga 2700tagcgcggtg gtcccacctg accccatgcc gaactcagaa gtgaaacgcc gtagcgccga 2700
tggtagtgtg gggtctcccc atgcgagagt agggaactgc caggcatcaa ataaaacgaa 2760tggtagtgtg gggtctcccc atgcgagagt agggaactgc caggcatcaa ataaaacgaa 2760
aggctcagtc gaaagactgg gcctttcgct tccacaactt tgtataataa agttgtcccc 2820aggctcagtc gaaagactgg gcctttcgct tccacaactt tgtataataa agttgtcccc 2820
acggccagtg aattcgagct cggtacctac cgttcgtata atgtatgcta tacgaagtta 2880acggccagtg aattcgagct cggtacctac cgttcgtata atgtatgcta tacgaagtta 2880
tcgagctcta gagaatgatc ccctcattag gccacacgtt caagtgcagc gcacaccgtg 2940tcgagctcta gagaatgatc ccctcattag gccacacgtt caagtgcagc gcacaccgtg 2940
gaaacggatg aaggcacgaa cccagttgac ataagcctgt tcggttcgta aactgtaatg 3000gaaacggatg aaggcacgaa cccagttgac ataagcctgt tcggttcgta aactgtaatg 3000
caagtagcgt atgcgctcac gcaactggtc cagaaccttg accgaacgca gcggtggtaa 3060caagtagcgt atgcgctcac gcaactggtc cagaaccttg accgaacgca gcggtggtaa 3060
cggcgcagtg gcggttttca tggcttgtta tgactgtttt tttgtacagt ctatgcctcg 3120cggcgcagtg gcggttttca tggcttgtta tgactgtttt tttgtacagt ctatgcctcg 3120
ggcatccaag cagcaagcgc gttacgccgt gggtcgatgt ttgatgttat ggagcagcaa 3180ggcatccaag cagcaagcgc gttacgccgt gggtcgatgt ttgatgttat ggagcagcaa 3180
cgatgttacg cagcagcaac gatgttacgc agcagggcag tcgccctaaa acaaagttag 3240cgatgttacg cagcagcaac gatgttacgc agcagggcag tcgccctaaa acaaagttag 3240
gtggctcaag tatgggcatc attcgcacat gtaggctcgg ccctgaccaa gtcaaatcca 3300gtggctcaag tatgggcatc attcgcacat gtaggctcgg ccctgaccaa gtcaaatcca 3300
tgcgggctgc tcttgatctt ttcggtcgtg agttcggaga cgtagccacc tactcccaac 3360tgcgggctgc tcttgatctt ttcggtcgtg agttcggaga cgtagccacc tactcccaac 3360
atcagccgga ctccgattac ctcgggaact tgctccgtag taagacattc atcgcgcttg 3420atcagccgga ctccgattac ctcgggaact tgctccgtag taagacattc atcgcgcttg 3420
ctgccttcga ccaagaagcg gttgttggcg ctctcgcggc ttacgttctg cccaggtttg 3480ctgccttcga ccaagaagcg gttgttggcg ctctcgcggc ttacgttctg cccaggtttg 3480
agcagccgcg tagtgagatc tatatctatg atctcgcagt ctccggcgag caccggaggc 3540agcagccgcg tagtgagatc tatatctatg atctcgcagt ctccggcgag caccggaggc 3540
agggcattgc caccgcgctc atcaatctcc tcaagcatga ggccaacgcg cttggtgctt 3600agggcattgc caccgcgctc atcaatctcc tcaagcatga ggccaacgcg cttggtgctt 3600
atgtgatcta cgtgcaagca gattacggtg acgatcccgc agtggctctc tatacaaagt 3660atgtgatcta cgtgcaagca gattacggtg acgatcccgc agtggctctc tatacaaagt 3660
tgggcatacg ggaagaagtg atgcactttg atatcgaccc aagtaccgcc acctaacaat 3720tgggcatacg ggaagaagtg atgcactttg atatcgaccc aagtaccgcc acctaacaat 3720
tcgttcaagc cgagatcgta gaatttcgac gacctgcagc caagcataac ttcgtataat 3780tcgttcaagc cgagatcgta gaatttcgac gacctgcagc caagcataac ttcgtataat 3780
gtatgctata cgaacggtag gatcctctag agtcgacctg caggcatgag atgtgtataa 3840gtatgctata cgaacggtag gatcctctag agtcgacctg caggcatgag atgtgtataa 3840
gagacag 3847gagacag 3847
<210> 102<210> 102
<211> 5554<211> 5554
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Плазмида<223> Plasmid
<400> 102<400> 102
catcgattta ttatgacaac ttgacggcta catcattcac tttttcttca caaccggcac 60catcgattta ttatgacaac ttgacggcta catcattcac tttttcttca caaccggcac 60
ggaactcgct cgggctggcc ccggtgcatt ttttaaatac ccgcgagaaa tagagttgat 120ggaactcgct cgggctggcc ccggtgcatt ttttaaatac ccgcgagaaa tagagttgat 120
cgtcaaaacc aacattgcga ccgacggtgg cgataggcat ccgggtggtg ctcaaaagca 180cgtcaaaacc aacattgcga ccgacggtgg cgataggcat ccgggtggtg ctcaaaagca 180
gcttcgcctg gctgatacgt tggtcctcgc gccagcttaa gacgctaatc cctaactgct 240gcttcgcctg gctgatacgt tggtcctcgc gccagcttaa gacgctaatc cctaactgct 240
ggcggaaaag atgtgacaga cgcgacggcg acaagcaaac atgctgtgcg acgctggcga 300ggcggaaaag atgtgacaga cgcgacggcg acaagcaaac atgctgtgcg acgctggcga 300
tatcaaaatt gctgtctgcc aggtgatcgc tgatgtactg acaagcctcg cgtacccgat 360tatcaaaatt gctgtctgcc aggtgatcgc tgatgtactg acaagcctcg cgtacccgat 360
tatccatcgg tggatggagc gactcgttaa tcgcttccat gcgccgcagt aacaattgct 420tatccatcgg tggatggagc gactcgttaa tcgcttccat gcgccgcagt aacaattgct 420
caagcagatt tatcgccagc agctccgaat agcgcccttc cccttgcccg gcgttaatga 480caagcagatt tatcgccagc agctccgaat agcgcccttc cccttgcccg gcgttaatga 480
tttgcccaaa caggtcgctg aaatgcggct ggtgcgcttc atccgggcga aagaaccccg 540tttgcccaaa caggtcgctg aaatgcggct ggtgcgcttc atccgggcga aagaaccccg 540
tattggcaaa tattgacggc cagttaagcc attcatgcca gtaggcgcgc ggacgaaagt 600tattggcaaa tattgacggc cagttaagcc attcatgcca gtaggcgcgc ggacgaaagt 600
aaacccactg gtgataccat tcgcgagcct ccggatgacg accgtagtga tgaatctctc 660aaacccactg gtgataccat tcgcgagcct ccggatgacg accgtagtga tgaatctctc 660
ctggcgggaa cagcaaaata tcacccggtc ggcaaacaaa ttctcgtccc tgatttttca 720ctggcgggaa cagcaaaata tcacccggtc ggcaaacaaa ttctcgtccc tgatttttca 720
ccaccccctg accgcgaatg gtgagattga gaatataacc tttcattccc agcggtcggt 780ccaccccctg accgcgaatg gtgagattga gaatataacc tttcattccc agcggtcggt 780
cgataaaaaa atcgagataa ccgttggcct caatcggcgt taaacccgcc accagatggg 840cgataaaaaa atcgagataa ccgttggcct caatcggcgt taaacccgcc accagatggg 840
cattaaacga gtatcccggc agcaggggat cattttgcgc ttcagccata cttttcatac 900cattaaacga gtatcccggc agcaggggat cattttgcgc ttcagccata cttttcatac 900
tcccgccatt cagagaagaa accaattgtc catattgcat cagacattgc cgtcactgcg 960tcccgccatt cagagaagaa accaattgtc catattgcat cagacattgc cgtcactgcg 960
tcttttactg gctcttctcg ctaaccaaac cggtaacccc gcttattaaa agcattctgt 1020tcttttactg gctcttctcg ctaaccaaac cggtaacccc gcttattaaa agcattctgt 1020
aacaaagcgg gaccaaagcc atgacaaaaa cgcgtaacaa aagtgtctat aatcacggca 1080aacaaagcgg gaccaaagcc atgacaaaaa cgcgtaacaa aagtgtctat aatcacggca 1080
gaaaagtcca cattgattat ttgcacggcg tcacactttg ctatgccata gcatttttat 1140gaaaagtcca cattgattat ttgcacggcg tcacactttg ctatgccata gcatttttat 1140
ccataagatt agcggatcct acctgacgct ttttatcgca actctctact gtttctccat 1200ccataagatt agcggatcct acctgacgct ttttatcgca actctctact gtttctccat 1200
acccgttttt ttgggaattc gagctctaag gaggttataa aaaatgtcta atctgctgac 1260acccgttttt ttgggaattc gagctctaag gaggttataa aaaatgtcta atctgctgac 1260
ggtccaccaa aacctgccgg ctctgccggt cgatgctacc tctgatgaag ttcgcaaaaa 1320ggtccaccaa aacctgccgg ctctgccggt cgatgctacc tctgatgaag ttcgcaaaaa 1320
cctgatggat atgtttcgtg atcgccaggc attcagcgaa catacctgga aaatgctgct 1380cctgatggat atgtttcgtg atcgccaggc attcagcgaa catacctgga aaatgctgct 1380
gtccgtgtgc cgttcatggg cggcctggtg taaactgaac aatcgcaaat ggtttccggc 1440gtccgtgtgc cgttcatggg cggcctggtg taaactgaac aatcgcaaat ggtttccggc 1440
ggaaccggaa gatgtccgtg actatctgct gtacctgcag gcccgcggtc tggcagttaa 1500ggaaccggaa gatgtccgtg actatctgct gtacctgcag gcccgcggtc tggcagttaa 1500
aacgatccag caacatctgg gccaactgaa tatgctgcac cgtcgctccg gtctgccgcg 1560aacgatccag caacatctgg gccaactgaa tatgctgcac cgtcgctccg gtctgccgcg 1560
tccgagcgat tctaatgcgg tgtcactggt tatgcgtcgc attcgtaaag aaaacgtgga 1620tccgagcgat tctaatgcgg tgtcactggt tatgcgtcgc attcgtaaag aaaacgtgga 1620
tgcaggcgaa cgcgctaaac aggcactggc ttttgaacgt accgatttcg accaagttcg 1680tgcaggcgaa cgcgctaaac aggcactggc ttttgaacgt accgatttcg accaagttcg 1680
ctcgctgatg gaaaacagcg atcgttgcca ggacatccgc aatctggcgt tcctgggtat 1740ctcgctgatg gaaaacagcg atcgttgcca ggacatccgc aatctggcgt tcctgggtat 1740
tgcctataac accctgctgc gcattgcaga aatcgctcgt attcgcgtga aagatatcag 1800tgcctataac accctgctgc gcattgcaga aatcgctcgt attcgcgtga aagatatcag 1800
ccgtacggac ggcggtcgca tgctgattca catcggccgt accaaaacgc tggtctctac 1860ccgtacggac ggcggtcgca tgctgattca catcggccgt accaaaacgc tggtctctac 1860
cgcaggcgtg gaaaaagctc tgagtctggg tgtgacgaaa ctggttgaac gctggattag 1920cgcaggcgtg gaaaaagctc tgagtctggg tgtgacgaaa ctggttgaac gctggattag 1920
tgtctccggc gtggcggatg acccgaacaa ttacctgttt tgtcgtgttc gcaaaaatgg 1980tgtctccggc gtggcggatg acccgaacaa ttacctgttt tgtcgtgttc gcaaaaatgg 1980
tgtcgcagct ccgtcagcca cctcgcagct gagcacgcgt gcactggaag gcatcttcga 2040tgtcgcagct ccgtcagcca cctcgcagct gagcacgcgt gcactggaag gcatcttcga 2040
agctacccat cgcctgattt atggcgccaa agatgactcg ggtcaacgtt acctggcgtg 2100agctacccat cgcctgattt atggcgccaa agatgactcg ggtcaacgtt acctggcgtg 2100
gtctggtcac agtgcacgtg ttggtgccgc acgtgatatg gcccgtgccg gtgtttccat 2160gtctggtcac agtgcacgtg ttggtgccgc acgtgatatg gcccgtgccg gtgtttccat 2160
cccggaaatt atgcaggcag gcggttggac caacgttaat atcgtcatga actatattcg 2220cccggaaatt atgcaggcag gcggttggac caacgttaat atcgtcatga actatattcg 2220
caatctggac tcggaaacgg gtgctatggt tcgcctgctg gaagacggtg actaatgagt 2280caatctggac tcggaaacgg gtgctatggt tcgcctgctg gaagacggtg actaatgagt 2280
gccggagttc atcgaaaaaa tggacgaggc actggctgaa attggttttg tatttgggga 2340gccggagttc atcgaaaaaa tggacgaggc actggctgaa attggttttg tatttgggga 2340
gcaatggcga tgacgcatcc tcacgataat atccgggtag gcgcaatcac tttcgtctac 2400gcaatggcga tgacgcatcc tcacgataat atccgggtag gcgcaatcac tttcgtctac 2400
tccgttacaa agcgaggctg ggtatttccc ggcctttctg ttatccgaaa tccactgaaa 2460tccgttacaa agcgaggctg ggtatttccc ggcctttctg ttatccgaaa tccactgaaa 2460
gcacagcggc tggctgagga gataaataat aaacgagggg ctgtatgcac aaagcatctt 2520gcacagcggc tggctgagga gataaataat aaacgagggg ctgtatgcac aaagcatctt 2520
ctgttgagtt aagaacgagt atcgagatgg cacatagcct tgctcaaatt ggaatcaggt 2580ctgttgagtt aagaacgagt atcgagatgg cacatagcct tgctcaaatt ggaatcaggt 2580
ttgtgccaat accagtagaa acagacgaag aatccatggg tatggacagt tttccctttg 2640ttgtgccaat accagtagaa agacgaag aatccatggg tatggacagt tttccctttg 2640
atatgtaacg gtgaacagtt gttctacttt tgtttgttag tcttgatgct tcactgatag 2700atatgtaacg gtgaacagtt gttctacttt tgtttgttag tcttgatgct tcactgatag 2700
atacaagagc cataagaacc tcagatcctt ccgtatttag ccagtatgtt ctctagtgtg 2760atacaagagc cataagaacc tcagatcctt ccgtatttag ccagtatgtt ctctagtgtg 2760
gttcgttgtt tttgcgtgag ccatgagaac gaaccattga gatcatactt actttgcatg 2820gttcgttgtt tttgcgtgag ccatgagaac gaaccattga gatcatactt actttgcatg 2820
tcactcaaaa attttgcctc aaaactggtg agctgaattt ttgcagttaa agcatcgtgt 2880tcactcaaaa attttgcctc aaaactggtg agctgaattt ttgcagttaa agcatcgtgt 2880
agtgtttttc ttagtccgtt acgtaggtag gaatctgatg taatggttgt tggtattttg 2940agtgtttttc ttagtccgtt acgtaggtag gaatctgatg taatggttgt tggtattttg 2940
tcaccattca tttttatctg gttgttctca agttcggtta cgagatccat ttgtctatct 3000tcaccattca tttttatctg gttgttctca agttcggtta cgagatccat ttgtctatct 3000
agttcaactt ggaaaatcaa cgtatcagtc gggcggcctc gcttatcaac caccaatttc 3060agttcaactt ggaaaatcaa cgtatcagtc gggcggcctc gcttatcaac caccaatttc 3060
atattgctgt aagtgtttaa atctttactt attggtttca aaacccattg gttaagcctt 3120atattgctgt aagtgtttaa atctttactt attggtttca aaacccattg gttaagcctt 3120
ttaaactcat ggtagttatt ttcaagcatt aacatgaact taaattcatc aaggctaatc 3180ttaaactcat ggtagttatt ttcaagcatt aacatgaact taaattcatc aaggctaatc 3180
tctatatttg ccttgtgagt tttcttttgt gttagttctt ttaataacca ctcataaatc 3240tctatatttg ccttgtgagt tttcttttgt gttagttctt ttaataacca ctcataaatc 3240
ctcatagagt atttgttttc aaaagactta acatgttcca gattatattt tatgaatttt 3300ctcatagagt atttgttttc aaaagactta acatgttcca gattatattt tatgaatttt 3300
tttaactgga aaagataagg caatatctct tcactaaaaa ctaattctaa tttttcgctt 3360tttaactgga aaagataagg caatatctct tcactaaaaa ctaattctaa tttttcgctt 3360
gagaacttgg catagtttgt ccactggaaa atctcaaagc ctttaaccaa aggattcctg 3420gagaacttgg catagtttgt ccactggaaa atctcaaagc ctttaaccaa aggattcctg 3420
atttccacag ttctcgtcat cagctctctg gttgctttag ctaatacacc ataagcattt 3480atttccacag ttctcgtcat cagctctctg gttgctttag ctaatacacc ataagcattt 3480
tccctactga tgttcatcat ctgagcgtat tggttataag tgaacgatac cgtccgttct 3540tccctactga tgttcatcat ctgagcgtat tggttataag tgaacgatac cgtccgttct 3540
ttccttgtag ggttttcaat cgtggggttg agtagtgcca cacagcataa aattagcttg 3600ttccttgtag ggttttcaat cgtggggttg agtagtgcca cacagcataa aattagcttg 3600
gtttcatgct ccgttaagtc atagcgacta atcgctagtt catttgcttt gaaaacaact 3660gtttcatgct ccgttaagtc atagcgacta atcgctagtt catttgcttt gaaaacaact 3660
aattcagaca tacatctcaa ttggtctagg tgattttaat cactatacca attgagatgg 3720aattcagaca tacatctcaa ttggtctagg tgattttaat cactatacca attgagatgg 3720
gctagtcaat gataattact agtccttttc ctttgagttg tgggtatctg taaattctgc 3780gctagtcaat gataattact agtccttttc ctttgagttg tgggtatctg taaattctgc 3780
tagacctttg ctggaaaact tgtaaattct gctagaccct ctgtaaattc cgctagacct 3840tagacctttg ctggaaaact tgtaaattct gctagaccct ctgtaaattc cgctagacct 3840
ttgtgtgttt tttttgttta tattcaagtg gttataattt atagaataaa gaaagaataa 3900ttgtgtgttt tttttgttta tattcaagtg gttataattt atagaataaa gaaagaataa 3900
aaaaagataa aaagaataga tcccagccct gtgtataact cactacttta gtcagttccg 3960aaaaagataa aaagaataga tcccagccct gtgtataact cactacttta gtcagttccg 3960
cagtattaca aaaggatgtc gcaaacgctg tttgctcctc tacaaaacag accttaaaac 4020cagtattaca aaaggatgtc gcaaacgctg tttgctcctc tacaaaacag accttaaaac 4020
cctaaaggct taagtagcac cctcgcaagc tcggttgcgg ccgcaatcgg gcaaatcgct 4080cctaaaggct taagtagcac cctcgcaagc tcggttgcgg ccgcaatcgg gcaaatcgct 4080
gaatattcct tttgtctccg accatcaggc acctgagtcg ctgtcttttt cgtgacattc 4140gaatattcct tttgtctccg accatcaggc acctgagtcg ctgtcttttt cgtgacattc 4140
agttcgctgc gctcacggct ctggcagtga atgggggtaa atggcactac aggcgccttt 4200agttcgctgc gctcacggct ctggcagtga atgggggtaa atggcactac aggcgccttt 4200
tatggattca tgcaaggaaa ctacccataa tacaagaaaa gcccgtcacg ggcttctcag 4260tatggattca tgcaaggaaa ctacccataa tacaagaaaa gcccgtcacg ggcttctcag 4260
ggcgttttat ggcgggtctg ctatgtggtg ctatctgact ttttgctgtt cagcagttcc 4320ggcgttttat ggcgggtctg ctatgtggtg ctatctgact ttttgctgtt cagcagttcc 4320
tgccctctga ttttccagtc tgaccacttc ggattatccc gtgacaggtc attcagactg 4380tgccctctga ttttccagtc tgaccacttc ggattatccc gtgacaggtc attcagactg 4380
gctaatgcac ccagtaaggc agcggtatca tcaacggggt ctgacgctca gtggaacgaa 4440gctaatgcac ccagtaaggc agcggtatca tcaacggggt ctgacgctca gtggaacgaa 4440
aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt 4500aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt 4500
ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac 4560ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac 4560
agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc 4620agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc 4620
atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt accatctggc 4680atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt accatctggc 4680
cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata 4740cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata 4740
aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc 4800aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc 4800
cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc 4860cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc 4860
aacgttgttg ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca 4920aacgttgttg ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca 4920
ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa 4980ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa 4980
gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca 5040gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca 5040
ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt 5100ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt 5100
tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt 5160tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt 5160
tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg 5220tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg 5220
ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga 5280ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga 5280
tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc 5340tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc 5340
agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg 5400agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg 5400
acacggaaat gttgaatact catactcttc ctttttcaat attattgaag catttatcag 5460acacggaaat gttgaatact catactcttc ctttttcaat attattgaag catttatcag 5460
ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg 5520ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg 5520
gttccgcgca catttccccg aaaagtgcca cctg 5554gttccgcgca catttccccg aaaagtgcca cctg 5554
<210> 103<210> 103
<211> 3415<211> 3415
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета<223> Expression Cassette
<400> 103<400> 103
ccggccagat gattaattcc taatttttgt tgacactcta tcattgatag agttatttta 60ccggccagat gattaattcc taatttttgt tgacactcta tcattgatag agttatttta 60
ccactcccta tcagtgatag agaaaagtga aatgaatagt tcgacaaaaa tctagaaata 120ccactcccta tcagtgatag agaaaagtga aatgaatagt tcgacaaaaa tctagaaata 120
attttgttta actttaagaa ggagatatac aaatgaacaa cgacaactcc acgaccacca 180attttgttta actttaagaa ggagatatac aaatgaacaa cgacaactcc acgaccacca 180
acaataacgc tattgaaatc tatgtggatc gtgcgaccct gccgacgatc cagcaaatga 240acaataacgc tattgaaatc tatgtggatc gtgcgaccct gccgacgatc cagcaaatga 240
ccaaaattgt tagccagaaa acgtctaaca aaaaactgat ctcatggtcg cgctacccga 300ccaaaattgt tagccagaaa acgtctaaca aaaaactgat ctcatggtcg cgctacccga 300
ttaccgataa aagcctgctg aagaaaatta acgcggaatt tttcaaagaa caatttgaac 360ttaccgataa aagcctgctg aagaaaatta acgcggaatt tttcaaagaa caatttgaac 360
tgacggaaag cctgaaaaac atcatcctgt ctgaaaacat cgataacctg atcattcatg 420tgacggaaag cctgaaaaac atcatcctgt ctgaaaacat cgataacctg atcattcatg 420
gcaataccct gtggagtatt gatgtggttg acattatcaa agaagtcaac ctgctgggca 480gcaataccct gtggagtatt gatgtggttg acattatcaa agaagtcaac ctgctgggca 480
aaaatattcc gatcgaactg cacttttatg atgacggttc cgccgaatac gttcgtatct 540aaaatattcc gatcgaactg cacttttatg atgacggttc cgccgaatac gttcgtatct 540
acgaatttag taaactgccg gaatccgaac agaaatacaa aaccagcctg tctaaaaaca 600acgaatttag taaactgccg gaatccgaac agaaatacaa aaccagcctg tctaaaaaca 600
acatcaaatt ctcaatcgat ggcaccgact cgttcaaaaa cacgatcgaa aacatctacg 660acatcaaatt ctcaatcgat ggcaccgact cgttcaaaaa cacgatcgaa aacatctacg 660
gtttcagcca actgtatccg accacgtacc acatgctgcg tgcagatatc ttcgacacca 720gtttcagcca actgtatccg accacgtacc acatgctgcg tgcagatatc ttcgacacca 720
cgctgaaaat taacccgctg cgcgaactgc tgtcaaacaa catcaaacag atgaaatggg 780cgctgaaaat taacccgctg cgcgaactgc tgtcaaacaa catcaaacag atgaaatggg 780
attacttcaa agacttcaac tacaaacaaa aagatatctt ttactcactg accaacttca 840attacttcaa agacttcaac tacaaacaaa aagatatctt ttactcactg accaacttca 840
acccgaaaga aatccaggaa gacttcaaca aaaactcgaa caaaaacttc atcttcatcg 900acccgaaaga aatccaggaa gacttcaaca aaaactcgaa caaaaacttc atcttcatcg 900
gcagtaactc cgcgaccgcc acggcagaag aacaaatcaa tattatcagc gaagcgaaga 960gcagtaactc cgcgaccgcc acggcagaag aacaaatcaa tattatcagc gaagcgaaga 960
aagaaaacag cagcattatc accaattcaa tttcggatta tgacctgttt ttcaaaggtc 1020aagaaaacag cagcattatc accaattcaa tttcggatta tgacctgttt ttcaaaggtc 1020
atccgtctgc cacgtttaac gaacagatta tcaatgcaca cgatatgatc gaaatcaaca 1080atccgtctgc cacgtttaac gaacagatta tcaatgcaca cgatatgatc gaaatcaaca 1080
acaaaatccc gttcgaagct ctgatcatga ccggcattct gccggatgcc gttggcggta 1140acaaaatccc gttcgaagct ctgatcatga ccggcattct gccggatgcc gttggcggta 1140
tgggtagttc cgtctttttc agtatcccga aagaagtcaa aaacaaattc gtgttctata 1200tgggtagttc cgtctttttc agtatcccga aagaagtcaa aaacaaattc gtgttctata 1200
aaagtggtac ggatatcgaa aataactccc tgattcaggt gatgctgaaa ctgaatctga 1260aaagtggtac ggatatcgaa aataactccc tgattcaggt gatgctgaaa ctgaatctga 1260
ttaaccgcga taatattaaa ctgatctctg acatttaatt tcgtcgacac acaggaaaca 1320ttaaccgcga taatattaaa ctgatctctg acatttaatt tcgtcgacac acaggaaaca 1320
tattaaaaat taaaacctgc aggagtttaa acgcggccgc gatatcgttg taaaacgacg 1380tattaaaaat taaaacctgc aggagtttaa acgcggccgc gatatcgttg taaaacgacg 1380
gccagtgcaa gaatcataaa aaatttattt gctttcagga aaatttttct gtataataga 1440gccagtgcaa gaatcataaa aaatttattt gctttcagga aaatttttct gtataataga 1440
ttcataaatt tgagagagga gtttttgtga gcggataaca attccccatc ttagtatatt 1500ttcataaatt tgagagagga gtttttgtga gcggataaca attccccatc ttagtatatt 1500
agttaagtat aaatacacaa ggagatatac atatgagcct ggccattatc ccggcacgtg 1560agttaagtat aaatacacaa ggagatatac atatgagcct ggccattatc ccggcacgtg 1560
gcggttctaa aggcatcaaa aacaaaaacc tggttctgct gaacaataaa ccgctgattt 1620gcggttctaa aggcatcaaa aacaaaaacc tggttctgct gaacaataaa ccgctgattt 1620
attacaccat caaagcggcc ctgaacgcca aaagtattag caaagtggtt gtgagctctg 1680attacaccat caaagcggcc ctgaacgcca aaagtattag caaagtggtt gtgagctctg 1680
attctgatga aatcctgaac tacgcaaaaa gtcagaacgt tgatatcctg aaacgtccga 1740attctgatga aatcctgaac tacgcaaaaa gtcagaacgt tgatatcctg aaacgtccga 1740
tcagtctggc acaggatgat accacgagcg ataaagtgct gctgcatgcg ctgaaattct 1800tcagtctggc acaggatgat accacgagcg ataaagtgct gctgcatgcg ctgaaattct 1800
acaaagatta cgaagatgtt gtgttcctgc agccgaccag cccgctgcgt acgaatattc 1860acaaagatta cgaagatgtt gtgttcctgc agccgaccag cccgctgcgt acgaatattc 1860
acatcaacga agcgttcaac ctgtacaaaa acagcaacgc aaacgcgctg atttctgtta 1920acatcaacga agcgttcaac ctgtacaaaa acagcaacgc aaacgcgctg atttctgtta 1920
gtgaatgcga taacaaaatc ctgaaagcgt ttgtgtgcaa tgattgtggc gatctggccg 1980gtgaatgcga taacaaaatc ctgaaagcgt ttgtgtgcaa tgattgtggc gatctggccg 1980
gtatttgtaa cgatgaatac ccgttcatgc cgcgccagaa actgccgaaa acctatatga 2040gtatttgtaa cgatgaatac ccgttcatgc cgcgccagaa actgccgaaa acctatatga 2040
gcaatggtgc catctacatc ctgaaaatca aagaattcct gaacaacccg agcttcctgc 2100gcaatggtgc catctacatc ctgaaaatca aagaattcct gaacaacccg agcttcctgc 2100
agtctaaaac gaaacatttc ctgatggatg aaagtagctc tctggatatt gattgcctgg 2160agtctaaaac gaaacatttc ctgatggatg aaagtagctc tctggatatt gattgcctgg 2160
aagatctgaa aaaagtggaa cagatctgga aaaaataaaa tactgaaacc aatttgcctg 2220aagatctgaa aaaagtggaa cagatctgga aaaaataaaa tactgaaacc aatttgcctg 2220
gcggcagtag cgcggtggtc ccacctgacc ccatgccgaa ctcagaagtg aaacgccgta 2280gcggcagtag cgcggtggtc ccacctgacc ccatgccgaa ctcagaagtg aaacgccgta 2280
gcgccgatgg tagtgtgggg tctccccatg cgagagtagg gaactgccag gcatcaaata 2340gcgccgatgg tagtgtgggg tctccccatg cgagagtagg gaactgccag gcatcaaata 2340
aaacgaaagg ctcagtcgaa agactgggcc tttcgcttcc acaactttgt ataataaagt 2400aaacgaaagg ctcagtcgaa agactgggcc tttcgcttcc acaactttgt ataataaagt 2400
tgtccccacg gccagtgaat tcgagctcgg tacctaccgt tcgtataatg tatgctatac 2460tgtccccacg gccagtgaat tcgagctcgg tacctaccgt tcgtataatg tatgctatac 2460
gaagttatcg agctctagag aatgatcccc tcattaggcc acacgttcaa gtgcagcgca 2520gaagttatcg agctctagag aatgatcccc tcattaggcc acacgttcaa gtgcagcgca 2520
caccgtggaa acggatgaag gcacgaaccc agttgacata agcctgttcg gttcgtaaac 2580caccgtggaa acggatgaag gcacgaaccc agttgacata agcctgttcg gttcgtaaac 2580
tgtaatgcaa gtagcgtatg cgctcacgca actggtccag aaccttgacc gaacgcagcg 2640tgtaatgcaa gtagcgtatg cgctcacgca actggtccag aaccttgacc gaacgcagcg 2640
gtggtaacgg cgcagtggcg gttttcatgg cttgttatga ctgttttttt gtacagtcta 2700gtggtaacgg cgcagtggcg gttttcatgg cttgttatga ctgttttttt gtacagtcta 2700
tgcctcgggc atccaagcag caagcgcgtt acgccgtggg tcgatgtttg atgttatgga 2760tgcctcgggc atccaagcag caagcgcgtt acgccgtggg tcgatgtttg atgttatgga 2760
gcagcaacga tgttacgcag cagcaacgat gttacgcagc agggcagtcg ccctaaaaca 2820gcagcaacga tgttacgcag cagcaacgat gttacgcagc agggcagtcg ccctaaaaca 2820
aagttaggtg gctcaagtat gggcatcatt cgcacatgta ggctcggccc tgaccaagtc 2880aagttaggtg gctcaagtat gggcatcatt cgcacatgta ggctcggccc tgaccaagtc 2880
aaatccatgc gggctgctct tgatcttttc ggtcgtgagt tcggagacgt agccacctac 2940aaatccatgc gggctgctct tgatcttttc ggtcgtgagt tcggagacgt agccacctac 2940
tcccaacatc agccggactc cgattacctc gggaacttgc tccgtagtaa gacattcatc 3000tcccaacatc agccggactc cgattacctc gggaacttgc tccgtagtaa gacattcatc 3000
gcgcttgctg ccttcgacca agaagcggtt gttggcgctc tcgcggctta cgttctgccc 3060gcgcttgctg ccttcgacca agaagcggtt gttggcgctc tcgcggctta cgttctgccc 3060
aggtttgagc agccgcgtag tgagatctat atctatgatc tcgcagtctc cggcgagcac 3120aggtttgagc agccgcgtag tgagatctat atctatgatc tcgcagtctc cggcgagcac 3120
cggaggcagg gcattgccac cgcgctcatc aatctcctca agcatgaggc caacgcgctt 3180cggaggcagg gcattgccac cgcgctcatc aatctcctca agcatgaggc caacgcgctt 3180
ggtgcttatg tgatctacgt gcaagcagat tacggtgacg atcccgcagt ggctctctat 3240ggtgcttatg tgatctacgt gcaagcagat tacggtgacg atcccgcagt ggctctctat 3240
acaaagttgg gcatacggga agaagtgatg cactttgata tcgacccaag taccgccacc 3300acaaagttgg gcatacggga agaagtgatg cactttgata tcgacccaag taccgccacc 3300
taacaattcg ttcaagccga gatcgtagaa tttcgacgac ctgcagccaa gcataacttc 3360taacaattcg ttcaagccga gatcgtagaa tttcgacgac ctgcagccaa gcataacttc 3360
gtataatgta tgctatacga acggtaggat cctctagagt cgacctgcag gcatg 3415gtataatgta tgctatacga acggtaggat cctctagagt cgacctgcag gcatg 3415
<210> 104<210> 104
<211> 3763<211> 3763
<212> ДНК<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220><220>
<223> Экспрессионная кассета<223> Expression Cassette
<400> 104<400> 104
ccggccagat gattaattcc taatttttgt tgacactcta tcattgatag agttatttta 60ccggccagat gattaattcc taatttttgt tgacactcta tcattgatag agttatttta 60
ccactcccta tcagtgatag agaaaagtga aatgaatagt tcgacaaaaa tctagaaata 120ccactcccta tcagtgatag agaaaagtga aatgaatagt tcgacaaaaa tctagaaata 120
attttgttta actttaagaa ggagatatac aaatgtgtaa cgataatcaa aatacggtcg 180attttgttta actttaagaa ggagatatac aaatgtgtaa cgataatcaa aatacggtcg 180
atgttgttgt gagcaccgtt aacgataacg tcatcgaaaa caacacgtac caagttaaac 240atgttgttgt gagcaccgtt aacgataacg tcatcgaaaa caacacgtac caagttaaac 240
cgatcgatac cccgaccacg tttgacagtt actcctggat tcagacgtgc ggcaccccga 300cgatcgatac cccgaccacg tttgacagtt actcctggat tcagacgtgc ggcaccccga 300
tcctgaaaga tgacgaaaaa tattcactgt cgtttgattt cgtcgccccg gaactggatc 360tcctgaaaga tgacgaaaaa tattcactgt cgtttgattt cgtcgccccg gaactggatc 360
aggacgaaaa attctgtttc gaatttaccg gcgatgttga cggtaaacgt tatgtcacgc 420aggacgaaaa attctgtttc gaatttaccg gcgatgttga cggtaaacgt tatgtcacgc 420
agaccaacct gacggtggtt gcaccgaccc tggaagttta cgtcgatcat gctagtctgc 480agaccaacct gacggtggtt gcaccgaccc tggaagttta cgtcgatcat gctagtctgc 480
cgtccctgca gcaactgatg aaaatcatcc agcagaaaaa cgaatactca cagaatgaac 540cgtccctgca gcaactgatg aaaatcatcc agcagaaaaa cgaatactca cagaatgaac 540
gtttcatttc gtggggccgc atcggtctga cggaagataa cgcggaaaaa ctgaatgccc 600gtttcatttc gtggggccgc atcggtctga cggaagataa cgcggaaaaa ctgaatgccc 600
atatttatcc gctggcaggc aacaatacct cacaggaact ggtggatgca gtgatcgatt 660atatttatcc gctggcaggc aacaatacct cacaggaact ggtggatgca gtgatcgatt 660
acgctgactc gaaaaaccgt ctgaatctgg aactgaacac gaataccgcg cacagctttc 720acgctgactc gaaaaaccgt ctgaatctgg aactgaacac gaataccgcg cacagctttc 720
cgaacctggc cccgattctg cgcattatca gctctaaaag caacatcctg atctctaaca 780cgaacctggc cccgattctg cgcattatca gctctaaaag caacatcctg atctctaaca 780
tcaacctgta cgatgacggc agtgctgaat atgtgaacct gtacaattgg aaagataccg 840tcaacctgta cgatgacggc agtgctgaat atgtgaacct gtacaattgg aaagataccg 840
aagacaaatc cgtgaaactg agcgattctt tcctggttct gaaagactac tttaacggta 900aagacaaatc cgtgaaactg agcgattctt tcctggttct gaaagactac tttaacggta 900
ttagttccga aaaaccgagc ggcatctatg gtcgctacaa ctggcatcaa ctgtataata 960ttagttccga aaaaccgagc ggcatctatg gtcgctacaa ctggcatcaa ctgtataata 960
cgtcttatta cttcctgcgt aaagattacc tgaccgttga accgcagctg cacgacctgc 1020cgtcttatta cttcctgcgt aaagattacc tgaccgttga accgcagctg cacgacctgc 1020
gcgaatatct gggcggtagt ctgaaacaaa tgtcctggga tggcttttca cagctgtcga 1080gcgaatatct gggcggtagt ctgaaacaaa tgtcctggga tggcttttca cagctgtcga 1080
aaggtgacaa agaactgttc ctgaacattg tcggctttga tcaggaaaaa ctgcagcaag 1140aaggtgacaa agaactgttc ctgaacattg tcggctttga tcaggaaaaa ctgcagcaag 1140
aataccagca atcagaactg ccgaatttcg tgtttacggg caccacgacc tgggcaggcg 1200aataccagca atcagaactg ccgaatttcg tgtttacggg caccacgacc tgggcaggcg 1200
gtgaaaccaa agaatattac gctcagcaac aggtgaacgt cgtgaacaat gcgattaatg 1260gtgaaaccaa agaatattac gctcagcaac aggtgaacgt cgtgaacaat gcgattaatg 1260
aaaccagccc gtattacctg ggccgtgaac atgacctgtt tttcaaaggt cacccgcgcg 1320aaaccagccc gtattacctg ggccgtgaac atgacctgtt tttcaaaggt cacccgcgcg 1320
gcggtattat caatgatatt atcctgggca gtttcaacaa tatgattgac atcccggcca 1380gcggtattat caatgatatt atcctgggca gtttcaacaa tatgattgac atcccggcca 1380
aagtgtcctt tgaagttctg atgatgacgg gtatgctgcc ggataccgtg ggcggtattg 1440aagtgtcctt tgaagttctg atgatgacgg gtatgctgcc ggataccgtg ggcggtattg 1440
cgtcatcgct gtattttagc atcccggccg aaaaagtctc tttcattgtg tttaccagct 1500cgtcatcgct gtattttagc atcccggccg aaaaagtctc tttcattgtg tttaccagct 1500
ctgatacgat caccgatcgt gaagacgcgc tgaaatctcc gctggtgcag gttatgatga 1560ctgatacgat caccgatcgt gaagacgcgc tgaaatctcc gctggtgcag gttatgatga 1560
ccctgggcat tgttaaagaa aaagatgtgc tgttctggtc ggatctgccg gattgttcct 1620ccctgggcat tgttaaagaa aaagatgtgc tgttctggtc ggatctgccg gattgttcct 1620
cgggtgtttg tattgctcag tattaatttc gtcgacacac aggaaacata ttaaaaatta 1680cgggtgtttg tattgctcag tattaatttc gtcgacacac aggaaacata ttaaaaatta 1680
aaacctgcag gagtttaaac gcggccgcga tatcgttgta aaacgacggc cagtgcaaga 1740aaacctgcag gagtttaaac gcggccgcga tatcgttgta aaacgacggc cagtgcaaga 1740
atcataaaaa atttatttgc tttcaggaaa atttttctgt ataatagatt cataaatttg 1800atcataaaaa atttatttgc tttcaggaaa atttttctgt ataatagatt cataaatttg 1800
agagaggagt ttttgtgagc ggataacaat tccccatctt agtatattag ttaagtataa 1860agagaggagt ttttgtgagc ggataacaat tccccatctt agtatattag ttaagtataa 1860
atacacaagg agatatacat atgagcctgg ccattatccc ggcacgtggc ggttctaaag 1920atacacaagg agatatacat atgagcctgg ccattatccc ggcacgtggc ggttctaaag 1920
gcatcaaaaa caaaaacctg gttctgctga acaataaacc gctgatttat tacaccatca 1980gcatcaaaaa caaaaacctg gttctgctga acaataaacc gctgatttat tacaccatca 1980
aagcggccct gaacgccaaa agtattagca aagtggttgt gagctctgat tctgatgaaa 2040aagcggccct gaacgccaaa agtattagca aagtggttgt gagctctgat tctgatgaaa 2040
tcctgaacta cgcaaaaagt cagaacgttg atatcctgaa acgtccgatc agtctggcac 2100tcctgaacta cgcaaaaagt cagaacgttg atatcctgaa acgtccgatc agtctggcac 2100
aggatgatac cacgagcgat aaagtgctgc tgcatgcgct gaaattctac aaagattacg 2160aggatgatac cacgagcgat aaagtgctgc tgcatgcgct gaaattctac aaagattacg 2160
aagatgttgt gttcctgcag ccgaccagcc cgctgcgtac gaatattcac atcaacgaag 2220aagatgttgt gttcctgcag ccgaccagcc cgctgcgtac gaatattcac atcaacgaag 2220
cgttcaacct gtacaaaaac agcaacgcaa acgcgctgat ttctgttagt gaatgcgata 2280cgttcaacct gtacaaaaac agcaacgcaa acgcgctgat ttctgttagt gaatgcgata 2280
acaaaatcct gaaagcgttt gtgtgcaatg attgtggcga tctggccggt atttgtaacg 2340acaaaatcct gaaagcgttt gtgtgcaatg attgtggcga tctggccggt atttgtaacg 2340
atgaataccc gttcatgccg cgccagaaac tgccgaaaac ctatatgagc aatggtgcca 2400atgaataccc gttcatgccg cgccagaaac tgccgaaaac ctatatgagc aatggtgcca 2400
tctacatcct gaaaatcaaa gaattcctga acaacccgag cttcctgcag tctaaaacga 2460tctacatcct gaaaatcaaa gaattcctga acaacccgag cttcctgcag tctaaaacga 2460
aacatttcct gatggatgaa agtagctctc tggatattga ttgcctggaa gatctgaaaa 2520aacatttcct gatggatgaa agtagctctc tggatattga ttgcctggaa gatctgaaaa 2520
aagtggaaca gatctggaaa aaataaaata ctgaaaccaa tttgcctggc ggcagtagcg 2580aagtggaaca gatctggaaa aaataaaata ctgaaaccaa tttgcctggc ggcagtagcg 2580
cggtggtccc acctgacccc atgccgaact cagaagtgaa acgccgtagc gccgatggta 2640cggtggtccc acctgacccc atgccgaact cagaagtgaa acgccgtagc gccgatggta 2640
gtgtggggtc tccccatgcg agagtaggga actgccaggc atcaaataaa acgaaaggct 2700gtgtggggtc tccccatgcg agagtaggga actgccaggc atcaaataaa acgaaaggct 2700
cagtcgaaag actgggcctt tcgcttccac aactttgtat aataaagttg tccccacggc 2760cagtcgaaag actgggcctt tcgcttccac aactttgtat aataaagttg tccccacggc 2760
cagtgaattc gagctcggta cctaccgttc gtataatgta tgctatacga agttatcgag 2820cagtgaattc gagctcggta cctaccgttc gtataatgta tgctatacga agttatcgag 2820
ctctagagaa tgatcccctc attaggccac acgttcaagt gcagcgcaca ccgtggaaac 2880ctctagagaa tgatcccctc attaggccac acgttcaagt gcagcgcaca ccgtggaaac 2880
ggatgaaggc acgaacccag ttgacataag cctgttcggt tcgtaaactg taatgcaagt 2940ggatgaaggc acgaacccag ttgacataag cctgttcggt tcgtaaactg taatgcaagt 2940
agcgtatgcg ctcacgcaac tggtccagaa ccttgaccga acgcagcggt ggtaacggcg 3000agcgtatgcg ctcacgcaac tggtccagaa ccttgaccga acgcagcggt ggtaacggcg 3000
cagtggcggt tttcatggct tgttatgact gtttttttgt acagtctatg cctcgggcat 3060cagtggcggt tttcatggct tgttatgact gtttttttgt acagtctatg cctcgggcat 3060
ccaagcagca agcgcgttac gccgtgggtc gatgtttgat gttatggagc agcaacgatg 3120ccaagcagca agcgcgttac gccgtgggtc gatgtttgat gttatggagc agcaacgatg 3120
ttacgcagca gcaacgatgt tacgcagcag ggcagtcgcc ctaaaacaaa gttaggtggc 3180ttacgcagca gcaacgatgt tacgcagcag ggcagtcgcc ctaaaacaaa gttaggtggc 3180
tcaagtatgg gcatcattcg cacatgtagg ctcggccctg accaagtcaa atccatgcgg 3240tcaagtatgg gcatcattcg cacatgtagg ctcggccctg accaagtcaa atccatgcgg 3240
gctgctcttg atcttttcgg tcgtgagttc ggagacgtag ccacctactc ccaacatcag 3300gctgctcttg atcttttcgg tcgtgagttc ggagacgtag ccacctactc ccaacatcag 3300
ccggactccg attacctcgg gaacttgctc cgtagtaaga cattcatcgc gcttgctgcc 3360ccggactccg attacctcgg gaacttgctc cgtagtaaga cattcatcgc gcttgctgcc 3360
ttcgaccaag aagcggttgt tggcgctctc gcggcttacg ttctgcccag gtttgagcag 3420ttcgaccaag aagcggttgt tggcgctctc gcggcttacg ttctgcccag gtttgagcag 3420
ccgcgtagtg agatctatat ctatgatctc gcagtctccg gcgagcaccg gaggcagggc 3480ccgcgtagtg agatctatat ctatgatctc gcagtctccg gcgagcaccg gaggcagggc 3480
attgccaccg cgctcatcaa tctcctcaag catgaggcca acgcgcttgg tgcttatgtg 3540attgccaccg cgctcatcaa tctcctcaag catgaggcca acgcgcttgg tgcttatgtg 3540
atctacgtgc aagcagatta cggtgacgat cccgcagtgg ctctctatac aaagttgggc 3600atctacgtgc aagcagatta cggtgacgat cccgcagtgg ctctctatac aaagttgggc 3600
atacgggaag aagtgatgca ctttgatatc gacccaagta ccgccaccta acaattcgtt 3660atacgggaag aagtgatgca ctttgatatc gacccaagta ccgccaccta acaattcgtt 3660
caagccgaga tcgtagaatt tcgacgacct gcagccaagc ataacttcgt ataatgtatg 3720caagccgaga tcgtagaatt tcgacgacct gcagccaagc ataacttcgt ataatgtatg 3720
ctatacgaac ggtaggatcc tctagagtcg acctgcaggc atg 3763ctatacgaac ggtaggatcc tctagagtcg acctgcaggc atg 3763
<---<---
Claims (39)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP18174643.9 | 2018-05-28 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| RU2020139928A RU2020139928A (en) | 2022-06-28 |
| RU2819876C2 true RU2819876C2 (en) | 2024-05-28 |
Family
ID=
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| RU2430631C2 (en) * | 2006-02-10 | 2011-10-10 | Нестек С.А. | Probiotic oligosaccharides mixture and food product containing it |
| WO2014153253A1 (en) * | 2013-03-14 | 2014-09-25 | Glycosyn LLC | Microorganisms and methods for producing sialylated and n-acetylglucosamine-containing oligosaccharides |
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| RU2430631C2 (en) * | 2006-02-10 | 2011-10-10 | Нестек С.А. | Probiotic oligosaccharides mixture and food product containing it |
| WO2014153253A1 (en) * | 2013-03-14 | 2014-09-25 | Glycosyn LLC | Microorganisms and methods for producing sialylated and n-acetylglucosamine-containing oligosaccharides |
Non-Patent Citations (1)
| Title |
|---|
| NICOLAS FIERFORT, ERIC SAMAIN Genetic engineering of Escherichia coli for the economical production of sialylated oligosaccharides J Biotechnol. 2008 Apr 30;134(3-4):261-5. doi: 10.1016/j.jbiotec.2008.02.010. Epub 2008 Mar 10. * |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| AU2019278599B2 (en) | Fermentative production of sialylated saccharides | |
| KR102726984B1 (en) | Fermentative production of N-acetylneuraminic acid | |
| CN111133112B (en) | Sialyltransferase and its use in producing sialylated oligosaccharides | |
| CN110869508B (en) | Fucosyltransferase and its use in producing fucosylated oligosaccharides | |
| EP3315610A1 (en) | Improved process for the production of fucosylated oligosaccharides | |
| DK181319B1 (en) | Genetically engineered cells and methods comprising use of a sialyltransferase for in vivo synthesis of 3’sl | |
| RU2819876C2 (en) | Enzymatic production of sialylated saccharides | |
| HK40038571A (en) | Fermentative production of sialylated saccharides | |
| RU2809787C2 (en) | Enzymative synthesis of n-acetylneuramic acid | |
| RU2822039C2 (en) | Sialyltransferases and use thereof in producing sialylated oligosaccharides | |
| KR102897437B1 (en) | Sialyltransferase and its use in the production of sialylated oligosaccharides | |
| DK181683B1 (en) | Cells exprssing new sialyltransferases for in vivo synthesis of lst-a, methods using same and constructs encoding said sialyltransferases | |
| HK40029050A (en) | Fermentative production of n-acetylneuraminic acid | |
| DK202200591A1 (en) | New sialyltransferases for in vivo synthesis of lst-c | |
| HK40021260A (en) | Sialyltransferases and their use in producing sialylated oligosaccharides | |
| RU2818835C2 (en) | Fucosyltransferases and their use for obtaining fucosylated oligosaccharides | |
| HK40021260B (en) | Sialyltransferases and their use in producing sialylated oligosaccharides | |
| HK40019749A (en) | Fucosyltransferases and their use in producing fucosylated oligosaccharides | |
| HK40019749B (en) | Fucosyltransferases and their use in producing fucosylated oligosaccharides |