US20020187544A1 - Uropathogenic E. coli D-serine detoxification operon - Google Patents
Uropathogenic E. coli D-serine detoxification operon Download PDFInfo
- Publication number
- US20020187544A1 US20020187544A1 US10/117,417 US11741702A US2002187544A1 US 20020187544 A1 US20020187544 A1 US 20020187544A1 US 11741702 A US11741702 A US 11741702A US 2002187544 A1 US2002187544 A1 US 2002187544A1
- Authority
- US
- United States
- Prior art keywords
- leu
- ala
- gly
- ile
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- MTCFGRXMJLQNBG-UWTATZPHSA-N D-Serine Chemical compound OC[C@@H](N)C(O)=O MTCFGRXMJLQNBG-UWTATZPHSA-N 0.000 title claims abstract description 140
- 229930195711 D-Serine Natural products 0.000 title claims abstract description 140
- 241000588724 Escherichia coli Species 0.000 title claims abstract description 65
- 238000001784 detoxification Methods 0.000 title 1
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 115
- 238000000034 method Methods 0.000 claims abstract description 44
- 230000004044 response Effects 0.000 claims abstract description 8
- 101150059880 dsdA gene Proteins 0.000 claims description 56
- 102000004169 proteins and genes Human genes 0.000 claims description 55
- 230000014509 gene expression Effects 0.000 claims description 50
- 210000002700 urine Anatomy 0.000 claims description 31
- 230000012010 growth Effects 0.000 claims description 29
- 108010055053 3-dehydroshikimate dehydratase Proteins 0.000 claims description 22
- 239000000523 sample Substances 0.000 claims description 19
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 claims description 16
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 claims description 14
- 238000003780 insertion Methods 0.000 claims description 14
- 230000037431 insertion Effects 0.000 claims description 14
- 108020005187 Oligonucleotide Probes Proteins 0.000 claims description 13
- 239000002751 oligonucleotide probe Substances 0.000 claims description 13
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 13
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 12
- 229920001184 polypeptide Polymers 0.000 claims description 11
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 10
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 9
- 229910052799 carbon Inorganic materials 0.000 claims description 9
- 230000004927 fusion Effects 0.000 claims description 9
- 230000002103 transcriptional effect Effects 0.000 claims description 9
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 8
- 238000009739 binding Methods 0.000 claims description 8
- 229910052757 nitrogen Inorganic materials 0.000 claims description 8
- 229910021529 ammonia Inorganic materials 0.000 claims description 7
- 230000003247 decreasing effect Effects 0.000 claims description 7
- 238000009396 hybridization Methods 0.000 claims description 6
- 108091033319 polynucleotide Proteins 0.000 claims description 6
- 102000040430 polynucleotide Human genes 0.000 claims description 6
- 239000002157 polynucleotide Substances 0.000 claims description 6
- 238000010839 reverse transcription Methods 0.000 claims description 6
- 238000006243 chemical reaction Methods 0.000 claims description 5
- 239000007787 solid Substances 0.000 claims description 5
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 claims description 4
- 239000002773 nucleotide Substances 0.000 claims description 3
- 238000000539 two dimensional gel electrophoresis Methods 0.000 claims description 3
- 230000002550 fecal effect Effects 0.000 claims description 2
- 125000003729 nucleotide group Chemical group 0.000 claims description 2
- 230000002829 reductive effect Effects 0.000 claims description 2
- 238000012360 testing method Methods 0.000 claims description 2
- 230000009615 deamination Effects 0.000 claims 2
- 238000006481 deamination reaction Methods 0.000 claims 2
- 230000001580 bacterial effect Effects 0.000 abstract description 13
- 241000282326 Felis catus Species 0.000 description 59
- 108010050848 glycylleucine Proteins 0.000 description 42
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 33
- 208000019206 urinary tract infection Diseases 0.000 description 32
- 239000002609 medium Substances 0.000 description 26
- 241001522750 Escherichia coli CFT073 Species 0.000 description 25
- 239000013612 plasmid Substances 0.000 description 23
- 101100500093 Escherichia coli (strain K12) dsdC gene Proteins 0.000 description 22
- 238000002474 experimental method Methods 0.000 description 19
- 108020004414 DNA Proteins 0.000 description 16
- 210000004027 cell Anatomy 0.000 description 15
- 108010034529 leucyl-lysine Proteins 0.000 description 15
- 238000012217 deletion Methods 0.000 description 14
- 230000037430 deletion Effects 0.000 description 13
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 12
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 12
- 238000004458 analytical method Methods 0.000 description 12
- 230000001105 regulatory effect Effects 0.000 description 12
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 11
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 11
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 11
- 230000000694 effects Effects 0.000 description 11
- 241000894006 Bacteria Species 0.000 description 10
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 10
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 10
- 238000003752 polymerase chain reaction Methods 0.000 description 10
- 238000013518 transcription Methods 0.000 description 10
- 230000035897 transcription Effects 0.000 description 10
- 210000003932 urinary bladder Anatomy 0.000 description 10
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 9
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 9
- 210000003734 kidney Anatomy 0.000 description 9
- 108010085203 methionylmethionine Proteins 0.000 description 9
- 108010073969 valyllysine Proteins 0.000 description 9
- 229920001817 Agar Polymers 0.000 description 8
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 8
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 8
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 8
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 8
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 8
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 8
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 8
- 239000008272 agar Substances 0.000 description 8
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 8
- 208000015181 infectious disease Diseases 0.000 description 8
- 230000035772 mutation Effects 0.000 description 8
- 229960000210 nalidixic acid Drugs 0.000 description 8
- MHWLWQUZZRMNGJ-UHFFFAOYSA-N nalidixic acid Chemical compound C1=C(C)N=C2N(CC)C=C(C(O)=O)C(=O)C2=C1 MHWLWQUZZRMNGJ-UHFFFAOYSA-N 0.000 description 8
- 108010048818 seryl-histidine Proteins 0.000 description 8
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 7
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 7
- 241000880493 Leptailurus serval Species 0.000 description 7
- 241000699666 Mus <mouse, genus> Species 0.000 description 7
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 7
- 238000013459 approach Methods 0.000 description 7
- 108010008355 arginyl-glutamine Proteins 0.000 description 7
- 108010016616 cysteinylglycine Proteins 0.000 description 7
- 239000012634 fragment Substances 0.000 description 7
- 108010078144 glutaminyl-glycine Proteins 0.000 description 7
- 239000011780 sodium chloride Substances 0.000 description 7
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 6
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 6
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 6
- RNAYRCNHRYEBTH-IHRRRGAJSA-N His-Met-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RNAYRCNHRYEBTH-IHRRRGAJSA-N 0.000 description 6
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 6
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 6
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 6
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 6
- 241000699670 Mus sp. Species 0.000 description 6
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 6
- 102000018120 Recombinases Human genes 0.000 description 6
- 108010091086 Recombinases Proteins 0.000 description 6
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 6
- PYXQBKJPHNCTNW-CYDGBPFRSA-N Val-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N PYXQBKJPHNCTNW-CYDGBPFRSA-N 0.000 description 6
- 239000003242 anti bacterial agent Substances 0.000 description 6
- 230000001419 dependent effect Effects 0.000 description 6
- 108010081551 glycylphenylalanine Proteins 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- 108010085325 histidylproline Proteins 0.000 description 6
- 230000006698 induction Effects 0.000 description 6
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 6
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- DVLFYONBTKHTER-UHFFFAOYSA-N 3-(N-morpholino)propanesulfonic acid Chemical compound OS(=O)(=O)CCCN1CCOCC1 DVLFYONBTKHTER-UHFFFAOYSA-N 0.000 description 5
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 5
- 150000008574 D-amino acids Chemical class 0.000 description 5
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 5
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 5
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 5
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 5
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 5
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 5
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 5
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 5
- 206010048709 Urosepsis Diseases 0.000 description 5
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 5
- 108010005233 alanylglutamic acid Proteins 0.000 description 5
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 5
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 5
- 238000003491 array Methods 0.000 description 5
- 238000003556 assay Methods 0.000 description 5
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 5
- 102000005936 beta-Galactosidase Human genes 0.000 description 5
- 108010005774 beta-Galactosidase Proteins 0.000 description 5
- 230000001413 cellular effect Effects 0.000 description 5
- 230000002860 competitive effect Effects 0.000 description 5
- 101150064513 dsdX gene Proteins 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 230000003834 intracellular effect Effects 0.000 description 5
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 5
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 238000010172 mouse model Methods 0.000 description 5
- 108010080629 tryptophan-leucine Proteins 0.000 description 5
- 230000001018 virulence Effects 0.000 description 5
- 239000000304 virulence factor Substances 0.000 description 5
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 4
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 4
- YWWATNIVMOCSAV-UBHSHLNASA-N Ala-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YWWATNIVMOCSAV-UBHSHLNASA-N 0.000 description 4
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 4
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 4
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 4
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 4
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 4
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 4
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 4
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 4
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 4
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 4
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 4
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 4
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 4
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 4
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 4
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 4
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 4
- YHZQOSXDTFRZKU-WDSOQIARSA-N Arg-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 YHZQOSXDTFRZKU-WDSOQIARSA-N 0.000 description 4
- UTSMXMABBPFVJP-SZMVWBNQSA-N Arg-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UTSMXMABBPFVJP-SZMVWBNQSA-N 0.000 description 4
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 4
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 4
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 4
- VOKWBBBXJONREA-DCAQKATOSA-N Asn-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N VOKWBBBXJONREA-DCAQKATOSA-N 0.000 description 4
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 4
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 4
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 4
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 4
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 4
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 4
- YKKHFPGOZXQAGK-QWRGUYRKSA-N Cys-Gly-Tyr Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YKKHFPGOZXQAGK-QWRGUYRKSA-N 0.000 description 4
- NRVQLLDIJJEIIZ-VZFHVOOUSA-N Cys-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N)O NRVQLLDIJJEIIZ-VZFHVOOUSA-N 0.000 description 4
- JAHCWGSVNZXHRR-SVSWQMSJSA-N Cys-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CS)N JAHCWGSVNZXHRR-SVSWQMSJSA-N 0.000 description 4
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 4
- TYRMVTKPOWPZBC-SXNHZJKMSA-N Gln-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N TYRMVTKPOWPZBC-SXNHZJKMSA-N 0.000 description 4
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 4
- YLABFXCRQQMMHS-AVGNSLFASA-N Gln-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YLABFXCRQQMMHS-AVGNSLFASA-N 0.000 description 4
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 4
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 4
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 4
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 4
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 4
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 4
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 4
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 4
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 4
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 4
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 4
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 4
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 4
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 4
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 4
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 4
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 4
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 4
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 4
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 4
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 4
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 4
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 4
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 4
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 4
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 4
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 4
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 4
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 4
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 4
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 4
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 4
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 4
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 4
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 4
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 4
- AMSYMDIIIRJRKZ-HJPIBITLSA-N Ile-His-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AMSYMDIIIRJRKZ-HJPIBITLSA-N 0.000 description 4
- LNJLOZYNZFGJMM-DEQVHRJGSA-N Ile-His-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N LNJLOZYNZFGJMM-DEQVHRJGSA-N 0.000 description 4
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 4
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 4
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 4
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 4
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 4
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 4
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 4
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 4
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 4
- REXAUQBGSGDEJY-IGISWZIWSA-N Ile-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N REXAUQBGSGDEJY-IGISWZIWSA-N 0.000 description 4
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 4
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 4
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 4
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 4
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 4
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 4
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 4
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 4
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 4
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 4
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 4
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 4
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 4
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 4
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 4
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 4
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 4
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 4
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 4
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 4
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 4
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 4
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 4
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 4
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 4
- 108060001084 Luciferase Proteins 0.000 description 4
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 4
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 4
- BPDXWKVZNCKUGG-BZSNNMDCSA-N Lys-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N BPDXWKVZNCKUGG-BZSNNMDCSA-N 0.000 description 4
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 4
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 4
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 4
- 239000007993 MOPS buffer Substances 0.000 description 4
- OLWAOWXIADGIJG-AVGNSLFASA-N Met-Arg-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(O)=O OLWAOWXIADGIJG-AVGNSLFASA-N 0.000 description 4
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 4
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 4
- UZVKFARGHHMQGX-IUCAKERBSA-N Met-Gly-Met Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCSC UZVKFARGHHMQGX-IUCAKERBSA-N 0.000 description 4
- ULLIQRYQNMAAHC-RWMBFGLXSA-N Met-His-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N ULLIQRYQNMAAHC-RWMBFGLXSA-N 0.000 description 4
- BKIFWLQFOOKUCA-DCAQKATOSA-N Met-His-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N BKIFWLQFOOKUCA-DCAQKATOSA-N 0.000 description 4
- ZIIMORLEZLVRIP-SRVKXCTJSA-N Met-Leu-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZIIMORLEZLVRIP-SRVKXCTJSA-N 0.000 description 4
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 4
- FDGAMQVRGORBDV-GUBZILKMSA-N Met-Ser-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCSC FDGAMQVRGORBDV-GUBZILKMSA-N 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 4
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 4
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 4
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 4
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 4
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 4
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 4
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 4
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 4
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 4
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 4
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 4
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 4
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 4
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 4
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 4
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 4
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 4
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 4
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 4
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 4
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 4
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 4
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 4
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 4
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 4
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 4
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 4
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 4
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 4
- YKBUNNNRNZZUID-UFYCRDLUSA-N Tyr-Val-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YKBUNNNRNZZUID-UFYCRDLUSA-N 0.000 description 4
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 4
- SMKXLHVZIFKQRB-GUBZILKMSA-N Val-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N SMKXLHVZIFKQRB-GUBZILKMSA-N 0.000 description 4
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 4
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 4
- MANXHLOVEUHVFD-DCAQKATOSA-N Val-His-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N MANXHLOVEUHVFD-DCAQKATOSA-N 0.000 description 4
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 4
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 4
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 4
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 4
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 4
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 4
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 4
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 4
- 238000005273 aeration Methods 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 229940088710 antibiotic agent Drugs 0.000 description 4
- 108010077245 asparaginyl-proline Proteins 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 230000001332 colony forming effect Effects 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 230000000875 corresponding effect Effects 0.000 description 4
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 4
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 4
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 4
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 4
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 4
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 4
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 4
- 229930027917 kanamycin Natural products 0.000 description 4
- 229960000318 kanamycin Drugs 0.000 description 4
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 4
- 229930182823 kanamycin A Natural products 0.000 description 4
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 4
- 108010005942 methionylglycine Proteins 0.000 description 4
- 238000002493 microarray Methods 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 230000002035 prolonged effect Effects 0.000 description 4
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 4
- 108010015796 prolylisoleucine Proteins 0.000 description 4
- 108010090894 prolylleucine Proteins 0.000 description 4
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- 108010003137 tyrosyltyrosine Proteins 0.000 description 4
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 4
- 230000007923 virulence factor Effects 0.000 description 4
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 3
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 3
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 3
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 3
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 3
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 3
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 3
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 3
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 3
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 3
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 3
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 3
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 3
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 3
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 3
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 3
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 3
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 3
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 3
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 3
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 3
- UWMIZBCTVWVMFI-FXQIFTODSA-N Asp-Ala-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UWMIZBCTVWVMFI-FXQIFTODSA-N 0.000 description 3
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 3
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 3
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 3
- 101100397104 Aspergillus niger ipuA gene Proteins 0.000 description 3
- 241000700199 Cavia porcellus Species 0.000 description 3
- HSAWNMMTZCLTPY-DCAQKATOSA-N Cys-Met-Leu Chemical compound SC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HSAWNMMTZCLTPY-DCAQKATOSA-N 0.000 description 3
- NMWZMKLDGZXRKP-BZSNNMDCSA-N Cys-Phe-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NMWZMKLDGZXRKP-BZSNNMDCSA-N 0.000 description 3
- 206010014896 Enterocolitis haemorrhagic Diseases 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 241001646716 Escherichia coli K-12 Species 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 3
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 3
- GHYJGDCPHMSFEJ-GUBZILKMSA-N Gln-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GHYJGDCPHMSFEJ-GUBZILKMSA-N 0.000 description 3
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 3
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 3
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 3
- XFHMVFKCQSHLKW-HJGDQZAQSA-N Gln-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XFHMVFKCQSHLKW-HJGDQZAQSA-N 0.000 description 3
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 3
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 3
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 3
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 3
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 3
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 3
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 3
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 3
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 3
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 3
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 3
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 3
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 3
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 3
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 3
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 3
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 3
- AAXMRLWFJFDYQO-GUBZILKMSA-N His-Asp-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O AAXMRLWFJFDYQO-GUBZILKMSA-N 0.000 description 3
- RAVLQPXCMRCLKT-KBPBESRZSA-N His-Gly-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RAVLQPXCMRCLKT-KBPBESRZSA-N 0.000 description 3
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 3
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 3
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 3
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 3
- CKONPJHGMIDMJP-IHRRRGAJSA-N His-Val-His Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CKONPJHGMIDMJP-IHRRRGAJSA-N 0.000 description 3
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 3
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 3
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 3
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 3
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 3
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 3
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 3
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 3
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 3
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 3
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 3
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 3
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 3
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 3
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 3
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 3
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 3
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 3
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 3
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 3
- 239000005089 Luciferase Substances 0.000 description 3
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 3
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 3
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 3
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 3
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 3
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 3
- TYEJPFJNAHIKRT-DCAQKATOSA-N Lys-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N TYEJPFJNAHIKRT-DCAQKATOSA-N 0.000 description 3
- INMBONMDMGPADT-AVGNSLFASA-N Lys-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N INMBONMDMGPADT-AVGNSLFASA-N 0.000 description 3
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 3
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 3
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 3
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 3
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 3
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 3
- FENSZYFJQOFSQR-FIRPJDEBSA-N Phe-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FENSZYFJQOFSQR-FIRPJDEBSA-N 0.000 description 3
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 3
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 3
- WECYCNFPGZLOOU-FXQIFTODSA-N Pro-Asn-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O WECYCNFPGZLOOU-FXQIFTODSA-N 0.000 description 3
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 3
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 3
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 3
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 3
- 238000012300 Sequence Analysis Methods 0.000 description 3
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 3
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 3
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 3
- RJHJPZQOMKCSTP-CIUDSAMLSA-N Ser-His-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O RJHJPZQOMKCSTP-CIUDSAMLSA-N 0.000 description 3
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 3
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 3
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 3
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 3
- LFMLXCJYCFZBKE-IHPCNDPISA-N Trp-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N LFMLXCJYCFZBKE-IHPCNDPISA-N 0.000 description 3
- YRBHLWWGSSQICE-IHRRRGAJSA-N Tyr-Asp-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O YRBHLWWGSSQICE-IHRRRGAJSA-N 0.000 description 3
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 3
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 3
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 3
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 3
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 3
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 3
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 3
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 3
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 3
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 3
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 3
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 3
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 3
- 230000004520 agglutination Effects 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- 150000001413 amino acids Chemical group 0.000 description 3
- 230000001174 ascending effect Effects 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 230000003115 biocidal effect Effects 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 230000000741 diarrhetic effect Effects 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 210000003743 erythrocyte Anatomy 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 238000009650 gentamicin protection assay Methods 0.000 description 3
- 108010025306 histidylleucine Proteins 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 230000009545 invasion Effects 0.000 description 3
- 238000002372 labelling Methods 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 238000009630 liquid culture Methods 0.000 description 3
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 238000002703 mutagenesis Methods 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- 108010027345 wheylin-1 peptide Proteins 0.000 description 3
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 2
- CVHJIWVKTFNGHT-ACZMJKKPSA-N Ala-Gln-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N CVHJIWVKTFNGHT-ACZMJKKPSA-N 0.000 description 2
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- 108010076441 Ala-His-His Proteins 0.000 description 2
- ATAKEVCGTRZKLI-UWJYBYFXSA-N Ala-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 ATAKEVCGTRZKLI-UWJYBYFXSA-N 0.000 description 2
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 2
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 2
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 2
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 2
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 2
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 2
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 2
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 2
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 2
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 2
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 2
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 2
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 2
- ZNYKKCADEQAZKA-FXQIFTODSA-N Asn-Ser-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O ZNYKKCADEQAZKA-FXQIFTODSA-N 0.000 description 2
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 2
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 2
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 2
- XMVZMBGFIOQONW-GARJFASQSA-N Cys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)C(=O)O XMVZMBGFIOQONW-GARJFASQSA-N 0.000 description 2
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 2
- VXDXZGYXHIADHF-YJRXYDGGSA-N Cys-Tyr-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VXDXZGYXHIADHF-YJRXYDGGSA-N 0.000 description 2
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 2
- 238000000018 DNA microarray Methods 0.000 description 2
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 2
- RGRMOYQUIJVQQD-SRVKXCTJSA-N Gln-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N RGRMOYQUIJVQQD-SRVKXCTJSA-N 0.000 description 2
- ZFADFBPRMSBPOT-KKUMJFAQSA-N Gln-Arg-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZFADFBPRMSBPOT-KKUMJFAQSA-N 0.000 description 2
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 2
- LKVCNGLNTAPMSZ-JYJNAYRXSA-N Gln-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N LKVCNGLNTAPMSZ-JYJNAYRXSA-N 0.000 description 2
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 2
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 2
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 2
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 2
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 2
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 2
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 2
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 2
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 2
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 2
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 2
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 2
- 108010006464 Hemolysin Proteins Proteins 0.000 description 2
- CIWILNZNBPIHEU-DCAQKATOSA-N His-Arg-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O CIWILNZNBPIHEU-DCAQKATOSA-N 0.000 description 2
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 2
- UJWYPUUXIAKEES-CUJWVEQBSA-N His-Cys-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UJWYPUUXIAKEES-CUJWVEQBSA-N 0.000 description 2
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 2
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 2
- KYFGGRHWLFZXPU-KKUMJFAQSA-N His-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N KYFGGRHWLFZXPU-KKUMJFAQSA-N 0.000 description 2
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 2
- LVQDUPQUJZWKSU-PYJNHQTQSA-N Ile-Arg-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LVQDUPQUJZWKSU-PYJNHQTQSA-N 0.000 description 2
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 2
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 2
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 2
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 2
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- 239000007836 KH2PO4 Substances 0.000 description 2
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 2
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 2
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 2
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 2
- 239000006142 Luria-Bertani Agar Substances 0.000 description 2
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 2
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 2
- ZZHPLPSLBVBWOA-WDSOQIARSA-N Lys-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N ZZHPLPSLBVBWOA-WDSOQIARSA-N 0.000 description 2
- 108090000301 Membrane transport proteins Proteins 0.000 description 2
- 102000003939 Membrane transport proteins Human genes 0.000 description 2
- MYKLINMAGAIRPJ-CIUDSAMLSA-N Met-Gln-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MYKLINMAGAIRPJ-CIUDSAMLSA-N 0.000 description 2
- CNFMPVYIVQUJOO-NHCYSSNCSA-N Met-Val-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O CNFMPVYIVQUJOO-NHCYSSNCSA-N 0.000 description 2
- 241001529936 Murinae Species 0.000 description 2
- HOKKHZGPKSLGJE-GSVOUGTGSA-N N-Methyl-D-aspartic acid Chemical compound CN[C@@H](C(O)=O)CC(O)=O HOKKHZGPKSLGJE-GSVOUGTGSA-N 0.000 description 2
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 2
- 238000000636 Northern blotting Methods 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 2
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 2
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 2
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 2
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 2
- LCTONWCANYUPML-UHFFFAOYSA-M Pyruvate Chemical compound CC(=O)C([O-])=O LCTONWCANYUPML-UHFFFAOYSA-M 0.000 description 2
- 108010079005 RDV peptide Proteins 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 2
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 2
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 2
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 2
- VBPDMBAFBRDZSK-HOUAVDHOSA-N Thr-Asn-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VBPDMBAFBRDZSK-HOUAVDHOSA-N 0.000 description 2
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 2
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 2
- WPSDXXQRIVKBAY-NKIYYHGXSA-N Thr-His-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O WPSDXXQRIVKBAY-NKIYYHGXSA-N 0.000 description 2
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- WFZYXGSAPWKTHR-XEGUGMAKSA-N Trp-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WFZYXGSAPWKTHR-XEGUGMAKSA-N 0.000 description 2
- MDDYTWOFHZFABW-SZMVWBNQSA-N Trp-Gln-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 MDDYTWOFHZFABW-SZMVWBNQSA-N 0.000 description 2
- UPOGHWJJZAZNSW-XIRDDKMYSA-N Trp-His-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O UPOGHWJJZAZNSW-XIRDDKMYSA-N 0.000 description 2
- WLQRIHCMPFHGKP-PMVMPFDFSA-N Trp-Leu-Phe Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=CC=C1 WLQRIHCMPFHGKP-PMVMPFDFSA-N 0.000 description 2
- WMBFONUKQXGLMU-WDSOQIARSA-N Trp-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WMBFONUKQXGLMU-WDSOQIARSA-N 0.000 description 2
- XDQGKIMTRSVSBC-WDSOQIARSA-N Trp-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CNC2=CC=CC=C12 XDQGKIMTRSVSBC-WDSOQIARSA-N 0.000 description 2
- KBKTUNYBNJWFRL-UBHSHLNASA-N Trp-Ser-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 KBKTUNYBNJWFRL-UBHSHLNASA-N 0.000 description 2
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 2
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 2
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 2
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 2
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 2
- ZYVAAYAOTVJBSS-GMVOTWDCSA-N Tyr-Trp-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZYVAAYAOTVJBSS-GMVOTWDCSA-N 0.000 description 2
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 2
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 2
- WBUOKGBHGDPYMH-GUBZILKMSA-N Val-Cys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)C(C)C WBUOKGBHGDPYMH-GUBZILKMSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- YTUABZMPYKCWCQ-XQQFMLRXSA-N Val-His-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N YTUABZMPYKCWCQ-XQQFMLRXSA-N 0.000 description 2
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 2
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 238000000376 autoradiography Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 239000002158 endotoxin Substances 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 210000002919 epithelial cell Anatomy 0.000 description 2
- 229930195712 glutamate Natural products 0.000 description 2
- 239000003228 hemolysin Substances 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 230000036039 immunity Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 229920006008 lipopolysaccharide Polymers 0.000 description 2
- KWGKDLIKAYFUFQ-UHFFFAOYSA-M lithium chloride Chemical compound [Li+].[Cl-] KWGKDLIKAYFUFQ-UHFFFAOYSA-M 0.000 description 2
- 238000004020 luminiscence type Methods 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 2
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 230000000306 recurrent effect Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 229960001153 serine Drugs 0.000 description 2
- 108091035539 telomere Proteins 0.000 description 2
- 102000055501 telomere Human genes 0.000 description 2
- 108091008023 transcriptional regulators Proteins 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- 210000001635 urinary tract Anatomy 0.000 description 2
- 108010062110 water dikinase pyruvate Proteins 0.000 description 2
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- CCUAQNUWXLYFRA-IMJSIDKUSA-N Ala-Asn Chemical compound C[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC(N)=O CCUAQNUWXLYFRA-IMJSIDKUSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- CQJHFKKGZXKZBC-BPNCWPANSA-N Ala-Pro-Tyr Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CQJHFKKGZXKZBC-BPNCWPANSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- OLDOLPWZEMHNIA-PJODQICGSA-N Arg-Ala-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OLDOLPWZEMHNIA-PJODQICGSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 1
- 101710164539 Asc-type amino acid transporter 1 Proteins 0.000 description 1
- FJIRXKVEDFLLOQ-SRVKXCTJSA-N Asn-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N FJIRXKVEDFLLOQ-SRVKXCTJSA-N 0.000 description 1
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- NBKLEMWHDLAUEM-CIUDSAMLSA-N Asp-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N NBKLEMWHDLAUEM-CIUDSAMLSA-N 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 208000003322 Coinfection Diseases 0.000 description 1
- VTJLJQGUMBWHBP-GUBZILKMSA-N Cys-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N VTJLJQGUMBWHBP-GUBZILKMSA-N 0.000 description 1
- KGIHMGPYGXBYJJ-SRVKXCTJSA-N Cys-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CS KGIHMGPYGXBYJJ-SRVKXCTJSA-N 0.000 description 1
- DYDCUQKUCUHJBH-UWTATZPHSA-N D-Cycloserine Chemical compound N[C@@H]1CONC1=O DYDCUQKUCUHJBH-UWTATZPHSA-N 0.000 description 1
- DYDCUQKUCUHJBH-UHFFFAOYSA-N D-Cycloserine Natural products NC1CONC1=O DYDCUQKUCUHJBH-UHFFFAOYSA-N 0.000 description 1
- -1 D-serine Chemical class 0.000 description 1
- 125000000734 D-serino group Chemical group [H]N([H])[C@@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 1
- 238000007399 DNA isolation Methods 0.000 description 1
- 230000008836 DNA modification Effects 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- AHCYMLUZIRLXAA-SHYZEUOFSA-N Deoxyuridine 5'-triphosphate Chemical compound O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(=O)NC(=O)C=C1 AHCYMLUZIRLXAA-SHYZEUOFSA-N 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 101100382042 Escherichia coli (strain K12) btsT gene Proteins 0.000 description 1
- 241000938885 Escherichia coli NU14 Species 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- XUZQMPGBGFQJMY-SRVKXCTJSA-N Gln-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XUZQMPGBGFQJMY-SRVKXCTJSA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- FTTHLXOMDMLKKW-FHWLQOOXSA-N Gln-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTTHLXOMDMLKKW-FHWLQOOXSA-N 0.000 description 1
- JTWZNMUVQWWGOX-SOUVJXGZSA-N Gln-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JTWZNMUVQWWGOX-SOUVJXGZSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 1
- TVTIDSMADMIHEU-KKUMJFAQSA-N His-Cys-Phe Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CS)C(=O)N[C@@H](Cc1ccccc1)C(O)=O TVTIDSMADMIHEU-KKUMJFAQSA-N 0.000 description 1
- LCNNHVQNFNJLGK-AVGNSLFASA-N His-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N LCNNHVQNFNJLGK-AVGNSLFASA-N 0.000 description 1
- HIJIJPFILYPTFR-ACRUOGEOSA-N His-Tyr-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HIJIJPFILYPTFR-ACRUOGEOSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- 239000006154 MacConkey agar Substances 0.000 description 1
- NSGXXVIHCIAISP-CIUDSAMLSA-N Met-Asn-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O NSGXXVIHCIAISP-CIUDSAMLSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- 102000004868 N-Methyl-D-Aspartate Receptors Human genes 0.000 description 1
- 108090001041 N-Methyl-D-Aspartate Receptors Proteins 0.000 description 1
- GXCLVBGFBYZDAG-UHFFFAOYSA-N N-[2-(1H-indol-3-yl)ethyl]-N-methylprop-2-en-1-amine Chemical compound CN(CCC1=CNC2=C1C=CC=C2)CC=C GXCLVBGFBYZDAG-UHFFFAOYSA-N 0.000 description 1
- 206010029803 Nosocomial infection Diseases 0.000 description 1
- 238000010222 PCR analysis Methods 0.000 description 1
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- BELBBZDIHDAJOR-UHFFFAOYSA-N Phenolsulfonephthalein Chemical group C1=CC(O)=CC=C1C1(C=2C=CC(O)=CC=2)C2=CC=CC=C2S(=O)(=O)O1 BELBBZDIHDAJOR-UHFFFAOYSA-N 0.000 description 1
- SZZBUDVXWZZPDH-BQBZGAKWSA-N Pro-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 SZZBUDVXWZZPDH-BQBZGAKWSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- GBUNEGKQPSAMNK-QTKMDUPCSA-N Pro-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2)O GBUNEGKQPSAMNK-QTKMDUPCSA-N 0.000 description 1
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- XQAPEISNMXNKGE-FXQIFTODSA-N Ser-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CS)C(=O)O XQAPEISNMXNKGE-FXQIFTODSA-N 0.000 description 1
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 238000000692 Student's t-test Methods 0.000 description 1
- JZRWCGZRTZMZEH-UHFFFAOYSA-N Thiamine Natural products CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N JZRWCGZRTZMZEH-UHFFFAOYSA-N 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 1
- FDQXPJCLVPFKJW-KJEVXHAQSA-N Thr-Met-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O FDQXPJCLVPFKJW-KJEVXHAQSA-N 0.000 description 1
- 102000008579 Transposases Human genes 0.000 description 1
- 108010020764 Transposases Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 108010073429 Type V Secretion Systems Proteins 0.000 description 1
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 1
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- BCOBSVIZMQXKFY-KKUMJFAQSA-N Tyr-Ser-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BCOBSVIZMQXKFY-KKUMJFAQSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- KRXFXDCNKLANCP-CXTHYWKRSA-N Tyr-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 KRXFXDCNKLANCP-CXTHYWKRSA-N 0.000 description 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- HLBHFAWNMAQGNO-AVGNSLFASA-N Val-His-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N HLBHFAWNMAQGNO-AVGNSLFASA-N 0.000 description 1
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 1
- SJLVYVZBFDTRCG-DCAQKATOSA-N Val-Lys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N SJLVYVZBFDTRCG-DCAQKATOSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- 241000775914 Valdivia <angiosperm> Species 0.000 description 1
- 208000027418 Wounds and injury Diseases 0.000 description 1
- 241000607447 Yersinia enterocolitica Species 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000007818 agglutination assay Methods 0.000 description 1
- 239000000556 agonist Substances 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 229940024606 amino acid Drugs 0.000 description 1
- 238000009635 antibiotic susceptibility testing Methods 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 210000003578 bacterial chromosome Anatomy 0.000 description 1
- WQZGKKKJIJFFOK-FPRJBGLDSA-N beta-D-galactose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-FPRJBGLDSA-N 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 239000012677 causal agent Substances 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- BKHZIBWEHPHYAI-UHFFFAOYSA-N chloroform;3-methylbutan-1-ol Chemical compound ClC(Cl)Cl.CC(C)CCO BKHZIBWEHPHYAI-UHFFFAOYSA-N 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000010835 comparative analysis Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 108010014562 cytotoxic necrotizing factor type 1 Proteins 0.000 description 1
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000000378 dietary effect Effects 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- ZPWVASYFFYYZEW-UHFFFAOYSA-L dipotassium hydrogen phosphate Chemical compound [K+].[K+].OP([O-])([O-])=O ZPWVASYFFYYZEW-UHFFFAOYSA-L 0.000 description 1
- 229910000396 dipotassium phosphate Inorganic materials 0.000 description 1
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 1
- 229910000397 disodium phosphate Inorganic materials 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000002651 drug therapy Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 101150091425 fimB gene Proteins 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 239000012737 fresh medium Substances 0.000 description 1
- 210000001035 gastrointestinal tract Anatomy 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010448 genetic screening Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 230000004110 gluconeogenesis Effects 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 230000009643 growth defect Effects 0.000 description 1
- 230000035931 haemagglutination Effects 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- XLYOFNOQVPJJNP-ZSJDYOACSA-N heavy water Substances [2H]O[2H] XLYOFNOQVPJJNP-ZSJDYOACSA-N 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 235000003642 hunger Nutrition 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 208000014674 injury Diseases 0.000 description 1
- 238000007852 inverse PCR Methods 0.000 description 1
- 229910052742 iron Inorganic materials 0.000 description 1
- 230000002147 killing effect Effects 0.000 description 1
- 101150066555 lacZ gene Proteins 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 230000004777 loss-of-function mutation Effects 0.000 description 1
- WRUGWIBCXHJTDG-UHFFFAOYSA-L magnesium sulfate heptahydrate Chemical compound O.O.O.O.O.O.O.[Mg+2].[O-]S([O-])(=O)=O WRUGWIBCXHJTDG-UHFFFAOYSA-L 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 210000005171 mammalian brain Anatomy 0.000 description 1
- 238000001840 matrix-assisted laser desorption--ionisation time-of-flight mass spectrometry Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- HOVAGTYPODGVJG-UHFFFAOYSA-N methyl beta-galactoside Natural products COC1OC(CO)C(O)C(O)C1O HOVAGTYPODGVJG-UHFFFAOYSA-N 0.000 description 1
- 108010079904 microcin Proteins 0.000 description 1
- 108010001459 microcin H47 Proteins 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 239000002858 neurotransmitter agent Substances 0.000 description 1
- 238000002966 oligonucleotide array Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 230000007918 pathogenicity Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 210000002640 perineum Anatomy 0.000 description 1
- 210000003200 peritoneal cavity Anatomy 0.000 description 1
- 230000002688 persistence Effects 0.000 description 1
- 229960003531 phenolsulfonphthalein Drugs 0.000 description 1
- 239000002953 phosphate buffered saline Substances 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- OTYBMLCTZGSZBG-UHFFFAOYSA-L potassium sulfate Chemical compound [K+].[K+].[O-]S([O-])(=O)=O OTYBMLCTZGSZBG-UHFFFAOYSA-L 0.000 description 1
- 229910052939 potassium sulfate Inorganic materials 0.000 description 1
- 229940124606 potential therapeutic agent Drugs 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 230000006340 racemization Effects 0.000 description 1
- 101150079601 recA gene Proteins 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000035440 response to pH Effects 0.000 description 1
- OARRHUQTFTUEOS-UHFFFAOYSA-N safranin Chemical compound [Cl-].C=12C=C(N)C(C)=CC2=NC2=CC(C)=C(N)C=C2[N+]=1C1=CC=CC=C1 OARRHUQTFTUEOS-UHFFFAOYSA-N 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 230000037351 starvation Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 125000003831 tetrazolyl group Chemical group 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- KYMBYSLLVAOCFI-UHFFFAOYSA-N thiamine Chemical compound CC1=C(CCO)SCN1CC1=CN=C(C)N=C1N KYMBYSLLVAOCFI-UHFFFAOYSA-N 0.000 description 1
- 229960003495 thiamine Drugs 0.000 description 1
- 235000019157 thiamine Nutrition 0.000 description 1
- 239000011721 thiamine Substances 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 108091006106 transcriptional activators Proteins 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- 125000002264 triphosphate group Chemical class [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 210000003708 urethra Anatomy 0.000 description 1
- 230000002485 urinary effect Effects 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
- 125000002987 valine group Chemical group [H]N([H])C([H])(C(*)=O)C([H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 238000003260 vortexing Methods 0.000 description 1
- 230000009614 wildtype growth Effects 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1072—Differential gene expression library synthesis, e.g. subtracted libraries, differential screening
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/527—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving lyase
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/569—Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
- G01N33/56911—Bacteria
- G01N33/56916—Enterobacteria, e.g. shigella, salmonella, klebsiella, serratia
Definitions
- Urinary tract infections are responsible for an estimated 10 million outpatient visits and more than one million hospitalizations each year in the United States. Annual health care costs attributable to UTIs exceed one billion dollars. Approximately 50% of women will have at least one UTI during their lives, and many of these women experience recurrent UTI.
- Uropathogenic Eschericia coli by far the most common uropathogen, is the causal agent in greater than 85% of uncomplicated UTIs.
- the organisms are also known to cause hospital acquired infections associated with injury, surgical procedures, or catheterization.
- UPEC present a complex ecological case among the pathogenic E. coli .
- UPEC can colonize the human intestinal tract without causing disease, yet they can sequentially colonize the perineum, urethra, bladder and kidney.
- Known E. coli virulence factors that contribute to UTIs are fimbriae (e.g.
- Urinary tract infections in humans are generally treated with antibiotic therapy. Treatment of UTIs may be complicated by strains that are resistant to antibiotics, particularly to those antibiotics in common use. The identification of new UPEC-specified pathogenicity factors will facilitate the development of new methods of treating or preventing UTIs.
- the present invention provides a method of detecting uropathogenic E. coli nucleotide sequences differentially expressed in the presence or absence of D-serine comprising providing a library comprising a plurality of transposon mutants of a uropathogenic E.
- the mutants comprising a transcriptional fusion comprising a transcriptional regulation sequence operably connected to a sequence encoding a detectable protein; growing the mutants in the presence or absence of D-serine; identifying mutants having increased or decreased expression of the transcriptional fusion in the presence or absence of D-serine by comparing the relative levels of the detectable protein of the mutants grown in the presence or absence of D-serine; identifying the insertion site of the transcriptional fusion in the identified transposon mutants; and correlating the insertion site with an E. coli nucleotide sequence.
- the present invention provides a method of identifying proteins differentially expressed in a wild-type uropathogenic E. coli strain and a uropathogenic E. coli dsdCXA locus mutant, the mutant having reduced expression of one or more proteins selected from the group consisting of DsdA, DsdC, and DsdX, the method comprising comparing proteins isolated from the wild-type uropathogenic E. coli strain and proteins isolated from the uropathogenic E. coli dsdCXA locus mutant; and identifying proteins from the dsdCXA mutant having increased or decreased level of expression relative to expression of the corresponding proteins in the wild-type uropathogenic E. coli strain.
- the invention includes a method of identifying a uropathogenic E. coli polynucleotide sequence that binds to DsdC protein comprising contacting an E. coli polynucleotide sequence with DsdC protein and detecting binding of the polynucleotide sequence to the protein.
- the present invention provides a method of detecting genes from a uropathogenic E. coli strain that are differentially expressed in the presence or absence of D-serine comprising hybridizing a first set of labeled oligonucleotide probes with an array of oligomers, the oligomers comprising gene sequences from the uropathogenic E. coli strain, wherein the first set of labeled oligonucleotide probes is made by reverse transcription of RNA isolated from uropathogenic E.
- the present invention provides a method of detecting proteins differentially expressed in a uropathogenic E. coli strain in response to D-serine, comprising providing a first culture of the uropathogenic E. coli strain grown in the presence of D-serine and a second culture of the uropathogenic E. coli strain grown in the absence of D-serine; comparing proteins isolated from the two cultures, and identifying proteins from E. coli grown in the presence of D-serine that are increased or decreased relative to the corresponding proteins in the uropathogenic E. coli strain grown in the absence of D-serine.
- the present invention also provides a method of characterizing bacterial strains isolated from clinical samples comprising testing the strain for the ability to grow in the presence of D-serine.
- the present invention provides urine dipstick comprising a solid support, a polypeptide comprising D-serine deaminase and an indicator responsive to ammonia, wherein the polypeptide and indicator are coimmobilized on the support.
- FIG. 1 compares the 53 minute regions of three E. coli strains: laboratory K-12 strain MG1655, enterohemorhagic E. coli (EHEC) EDL933, and uropathogenic E. coli CFT073.
- EHEC enterohemorhagic E. coli
- FIG. 2 compares the growth of wild type E. coli CFT073 or a dsdA mutant of E. coli CFT073 grown in minimal medium or urine as a function of time as measured by the concentration of colony forming units ( 2 A) or the OD 600 ( 2 D).
- FIG. 3 compares the concentration of colony forming units of wild-type E. coli CFT073 or a dsdA mutant of E. coli CFT073 recovered from the bladder or kidneys of mice coinfected with mutant and wild-type E. coli CFT073.
- FIG. 4 compares the concentration of colony forming units of wild type E. coli CFT073 or a dsdA mutant of E. coli CFT073 recovered from the bladder or kidneys of mice coinfected with mutant and wild-type E. coli CFT073 strains carrying pWAM2682, which expresses DsdA in two replicate experiments ( 4 A and 4 B).
- FIG. 5 presents a model D-serine modulation of E. coli uropathogenesis.
- the sequence of the uropathogenic E. coli strain CFT073 genome was compared with the sequences of the genomes of E. coli laboratory K-12 strain MG1655 and enterohemorrhagic E. coli (EHEC) 0157 strain EDL933.
- EHEC enterohemorrhagic E. coli
- a sequence found in uropathogenic E. coli strain CFT073 but not in E. coli K-12 or EHEC 0157 was identified in the 53 minute region of the K-12 genetic map.
- the sequence includes the D-serine tolerance (dsdCXA) locus, which is involved in overcoming D-serine toxicity, and is presented in SEQ ID NO: 13.
- the dsd locus includes coding sequences for a positive transcriptional activator (DsdC), a putative permease (DsdX), and a D-serine deaminase (DsdA), which catalyzes the hydrolysis of D-serine to give pyruvate and ammonia.
- the 53 minute region includes two putative recombinases unique to UPEC, which are designated ipuA and ipuB.
- the DNA sequences of dsdC, dsdX dsdA, ipuA and ipuB are provided in SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, and SEQ ID NO:9, respectively, and the deduced amino acid sequences of their putative translation products are shown in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, and SEQ ID NO:10, respectively.
- the proteins putatively encoded by ipuA and ipuB are homologs of the recombinase fimB, which suggests that the proteins may function to regulate expression of the dsd operon in a manner similar to the regulation of type 1 fimbriae by FimB and FimE.
- the polypeptide putatively encoded by dsdC is a member of the LysR family of transcriptional regulators.
- the LysR-type regulators are very common in bacteria and control the expression of genes associated with a wide range of cellular processes, including many diverse virulence factors. Members of the LysR family of transcriptional regulators usually require the presence of a specific coinducer to activate transcription.
- the polypeptides are about 270-330 amino acid residues in length and exhibit sequence similarity over about 270 amino acid residues. The strongest conservation occurs in the N-terminal 66 amino acid residues, which has a helix-turn-helix motif.
- the LysR-type regulators bind to DNA as dimers or tetramers and activate gene transcription, possibly by introducing a bend in the DNA helix.
- Uropathogenic E. coli are able to grow in media or urine in the presence of D-serine, whereas E. coli strains that lack the dsdCXA genes are unable to grow or grow more slowly than UPEC strains.
- three collections of E. coli were examined for the ability to utilize media having D-serine as the sole source of carbon and nitrogen. The results showed that 49 of 60 tested UPEC strains were able to use D-serine as the sole source of carbon and nitrogen. In contrast, just four of 74 tested hemorrhagic colitis and infantile diarrheal E. coli strains were able to use D-serine as the sole source of carbon and nitrogen.
- a “wild-type UPEC strain” is a strain of uropathogenic E. coli having the ability to use D-serine as the sole source of carbon and nitrogen.
- a mutant in wild-type UPEC strain CFT073 with a deletion in the dsdA gene was constructed as described in the Examples.
- the dsdA mutant was evaluated for its ability to establish infection in a murine model of ascending urinary tract infection.
- the mutant strain was present in the bladders and kidneys 48 hours postinfection at levels of about 300 times that of the wild-type strain. This result runs counter to our expectation that a knockout of this putative virulence gene would be less effective than wild-type CFT073 in establishing an infection.
- Complementation of the dsdA mutant with DsdA supplied in trans resulted in recovery similar to wild type.
- D-serine may act as an effector of the LysR-type transcriptional regulator, DsdC, to cause an increase in transcription.
- DsdA would lower D-serine concentrations, which, if D-serine acts as an effector with DsdC, would reduce DsdC-regulated transcription.
- concentrations of D-serine are relatively high, DsdC may remain active and cause the transcription of a myriad of genes for other virulence factors. It follows that people susceptible to recurrent UTIs caused by UPEC may produce urine with higher levels of D-serine than those who do not experience recurrent UTIs.
- transposon mutagenesis is used to insert a gene encoding a detectable protein into the bacterial chromosome under the control of a transcriptional regulation sequence.
- detectable proteins include, but are not limited to, luciferase and the lacZ gene product.
- Mutants expressing the detectable protein are identified and grown in the presence in or absence of D-serine. The relative expression of detectable protein in mutants grown in the presence and absence of D-serine is assessed, and those mutants exhibiting differential expression of the detectable protein in the presence and absence of D-serine are selected for further analysis.
- Another approach to identifying genes whose expression is affected by DsdC or D-serine metabolism is to compare the proteins expressed by a wild-type UPEC strain grown in the presence or absence of D-serine, or expressed by a wild-type UPEC strain and a mutant of the strain comprising a mutation in dsdC, dsdA, or dsdX grown in the presence or absence of D-serine. Proteins are isolated from the bacterial cultures, separated by two-dimensional gel electrophoresis, and differential expression identified by comparing gels obtained from different bacterial strains or by the same bacterial strain grown in the presence or absence of D-serine. Proteins of interest may be further characterized to permit identification.
- RNA is isolated from a wild-type UPEC strain grown in the presence or absence of D-serine and subjected to reverse transcription to create labeled oligonucleotide probes, which in turn are hybridized to an oligomer array of E. coli gene sequences to identify sequences differentially expressed in the presence or absence of D-serine.
- the ability of certain UPEC strains to grow on D-serine will provide a simple means of differentiating E. coli isolates from urine or blood.
- the bacterial isolate may be tested for the ability to grow on or in medium containing D-serine as the sole carbon and nitrogen source.
- the bacterial isolate may be plated on a suitable medium comprising D-serine in a concentration effective to support growth of a UPEC strain capable of using D-serine as the sole source of carbon and nitrogen, suitably in the range of from about 100 ⁇ g/ml to about 500 ⁇ g/ml.
- an aliquot of the isolate may be transferred to a liquid medium comprising D-serine in an effective concentration, suitably in the range of from about 100 ⁇ g/ml to about 500 ⁇ g/ml. Growth in liquid medium may be assessed by observing turbidity.
- the liquid medium may comprise an indicator such as tetrazolium, which undergoes a color change in response to bacterial growth.
- cells may be plated and then contacted with D-serine in a concentration effective to inhibit the growth of most non-UPEC isolates such as J198 (a normal fecal isolate), but which permits the growth of a wild-type UPEC strain such as E. coli strain CTF073.
- the D-serine may be delivered on a solid support, such as a filter disc of the type commonly used in antibiotic sensitivity testing, impregnated with D-serine.
- Whether a correlation between UTI susceptibility and urinary D-serine levels exists may be determined by assaying D-serine in the urine of human populations having differential susceptibility to UTIs. If such a correlation exists, assaying D-serine may be useful in identifying individuals having this particular susceptibility. Unfortunately, D-serine assays are presently performed by HPLC and are prohibitively expensive for routine use.
- D-serine deaminase may be used to manufacture urine dipsticks and biosensors for quantifying D-serine.
- the D-serine deaminase may be coimmobilized on a suitable support with a suitable indicator that changes color in the presence of the ammonia generated by the enzymatic hydrolyis of D-serine.
- the D-serine deaminase When the immobilized D-serine deaminase is exposed to a sample containing D-serine, the D-serine will be hydrolyzed to form pyruvate and ammonia, which will cause the indicator to change color, thus permitting the detection of the D-serine in the sample.
- the D-serine deaminase used in urine dipsticks or biosensors may be obtained from any source.
- D-serine deaminase may be obtained by subcloning a D-serine deaminase gene, such as the CFT073 dsdA gene (SEQ ID NO:6), into a suitable expression vector, introducing the vector into a suitable host cell, allowing expression of the protein, and recovering from the host cell D-serine deaminase substantially free of other proteins.
- Suitable indicators include any compound that detectably changes in response to the hydrolysis of D-serine.
- a suitable indicator may include an indicator that changes color in response to pH changes, which would occur with an increase in ammonia concentration.
- An example of such an indicator is phenol red.
- Urosepsis E. coli strain CFT073 was originally isolated from the blood and urine of a woman admitted to the University of Maryland Medical System (Mobley et al., Infect. Immun., 1990, 58: 1281-1289). E. coli strain collections used in epidemiology experiments were a collection of urosepsis and urinary tract isolates (Arthur et al., Infect. Immun., 1989a, 57: 314-321.; Arthur et al., Infect. Immun., 1989b, 57: 303-313.; Arthur et al., Infect.
- bacteria were plated on a modified Minimal A Medium agar (15 g K 2 HPO 4 , 4.5 g KH 2 PO 4 , 1 g K 2 SO 4 , 0.5 g Na 3 C 6 H 5 O 7 .2H 2 O, 15 g agar and 1 ml of 1 M solution MgSO4.7H 2 O per liter) supplemented with 500 ug/ml of D-serine (Sigma) (Miller, Experiments in molecular genetics, 1972).
- a modified Minimal A Medium agar 15 g K 2 HPO 4 , 4.5 g KH 2 PO 4 , 1 g K 2 SO 4 , 0.5 g Na 3 C 6 H 5 O 7 .2H 2 O, 15 g agar and 1 ml of 1 M solution MgSO4.7H 2 O per liter
- bacteria were grown in MOPS minimal medium [3-(N-morpholino)propanesulfonic acid] supplemented with 10 ⁇ M thiamine and 0.4% glycerol (Neidhardt et al., J. Bacteriol., 1974, 119: 736-747).
- MOPS minimal medium [3-(N-morpholino)propanesulfonic acid] supplemented with 10 ⁇ M thiamine and 0.4% glycerol (Neidhardt et al., J. Bacteriol., 1974, 119: 736-747).
- the media used included L broth (5 g of sodium chloride, 5 g of yeast extract and 10 g of tryptone per liter [Fisher Scientific]) and L agar (L broth containing 1.5% agar [Difco]).
- Human urine was provided by two different donors and filter sterilized prior to use in experiments.
- Mouse urine was collected from a group of eight mice and UV sterilized prior to use.
- Liquid cultures were grown at 37° degrees with aeration except for cultures used in mouse UTI experiments.
- each strain was inoculated into L-broth and grown without aeration for two days. Bacteria were then passaged into fresh medium two additional times for a total of 6 days growth.
- dsdA-specific primers for Polymerase Chain Reaction were designed using the CFT073 genome sequence (R.A. Welch, unpublished).
- the dsdA primers used were 5′-GGATGGCGATGCTGCGTTG-3′ (SEQ ID NO:11) and 5′-CACAGGGGAAGGTGAGATATGC-3′ (IDT) (SEQ ID NO:12).
- the template was wild-type CFT073 genomic DNA prepared using the Wizard Genomic DNA Isolation Kit (Promega).
- the PCR amplification used the Expand High Fidelity PCR System (Roche Diagnostics) according to the manufacturer's protocol.
- the resulting PCR product was gel purified with the QIAQuick Gel Purification Kit (Qiagen) and ligated into pGEMt (Promega).
- the resulting dsdA recombinant plasmid was examined by restriction endonuclease analysis.
- a dsda deletion was constructed by digesting the dsdA plasmid with PmeI and EcoRI, which resulted in a 445 bp deletion.
- the digested plasmid was treated with T4 DNA polymerase in order to blunt the DNA ends. After gel purification, the plasmid was religated.
- Apal-SacI fragment containing dsdA was subcloned into the suicide vector, pEP185.2 (kind gift of Virginia Miller, Washington University).
- This construct was sequenced (ACGT, Incorporated) using T3 and T7 primers provided by ACGT. Conjugation was carried out as described by Kinder et al. (Kinder et al., Gene, 1993, 136: 271-275) with some modifications.
- the suicide plasmid construct was transformed into a S17 ⁇ pir background and then conjugated into a nalidixic acid resistant mutant of CFT073.
- chromosomal integrates of the recombinant plasmid were selected and D-cycloserine enrichment was used to select for a second crossover event that excised the suicide plasmid backbone from the CFT073 chromosome.
- Chromosomal DNA was isolated from the putative dsdA mutants and examined via PCR analysis using dsdA-specific primers to confirm the deletion. Phenotypic analysis of the putative mutants was carried out by inoculating colonies onto MOPS glycerol minimal and MOPS minimal D-serine agar media.
- the dsdA gene including its Shine and Delgarno sequence, was PCR amplified with primers (5′GCGCTGCAGCGTTATTAACGGCCTTTTGCCA GATATTGATTC) (SEQ ID NO:17) and (5′CGCGGATCCCGTACTAT GGAAAACGCTAAAA TGAATTCGC) (SEQ ID NO:18).
- the primers were designed with PstI and BamHI restriction sites at their respective 5′ ends.
- the PCR product was digested with PstI and BamHI enzymes, gel purified and ligated with pACYC177 previously digested with the same two restriction enzymes.
- the plasmid designated pWAM2682, was used in subsequent complementation analysis.
- mice Female CBA/JCrHsd mice (five to eight weeks of age, Harlan) were inoculated via trans-urethral catheterization with 50 ⁇ l of the prepared bacteria (approximately 1 ⁇ 10 9 organisms). After 48 h, the mice were sacrificed and their bladders and kidneys harvested. The organs were homogenized and dilutions plated on L agar without antibiotics. After overnight incubation, colonies were counted and then replica plated onto L agar containing nalidixic acid in order to determine the number of surviving dsdA mutants.
- nalidixic acid resistant CFT073 background strain used in the construction of the dsda mutant competes equally with the CFT073 nalidixic acid sensitive parent strain in the UTI model (Torres et al., Infect. Immun., 2001, 69: 6179-6185).
- both the wild-type and dsdA mutant strains carried the complementing dsdA plasmid, pWAM2682. Both strains were grown as above with the addition of kanamycin in order to assure maintenance of the plasmids during the six day culture. Results were analyzed for statistical significance via paired Student's t test.
- Wild-type and dsdA mutant strains grown overnight in human urine were inoculated into fresh urine and examined microscopically after 3 and 24 h of growth.
- the cell suspensions were air-dried on glass slides and stained with safranin.
- the cells were visualized using a Zeiss Axioplan IIi microscope and the images were collected digitally using OpenLabs 3.02 software (Improvision, Inc.).
- a CFT073 lacZY deletion mutant was isolated using the ⁇ red recombinase mutagenesis system of Datsenko and Wanner (Datsenko et al., Proc. Natl. Acad. Sci. USA, 2000, 97(12): 6640-6645), which is incorporated by reference herein. Briefly, PCR products were generated using primers with 36- to 50-nucleotide extensions that are homologous to the region adjacent to the lacZYA genes and template plasmid carrying an antibiotic resistance gene flanked by FLP recognition target sites. The PCR product was introduced into CFT073 carrying a Red expression plasmid via electroporation. After selection, the resistance gene was eliminated introduction of a helper plasmid which acts directly on the FLP recognition target sites flanking the resistance gene. The temperature sensitive Red and FLP helper plasmids are cured by growth at 37 degrees C.
- MiniTn10lux transposon insertion mutants of CTF073 were constructed using MiniTn10lux.
- E. coli strain KV330 (carrying the suicide plasmid pKV45 which harbors the mini-TN10 transposon, transposase and resolvase) was the donor strain for conjugal transfer into recipient host strain CFT073.
- Transconjugants were plated on LB agar supplemented with chloramphenicol (to select for the transposon) and nalidixic acid (to select for the host strain, CFT073). Individual isolates were assayed for luciferase expression in the absence of presence of D-serine.
- MiniTn5lacZY transposon insertion mutants of a CFT073 lacZY deletion mutant were constructed as follows. E.coli S 17-1 ⁇ pir containing the pREV10 plasmid (this plasmid cannot be maintained in CFT073, but contains a MiniTn5Km2 transposon with an added promoterless lacZY gene) was the donor strain for conjugal transfer into recipient strain, CFT0731acZYA. Transconjugants were plated on LB agar supplemented with kanamycin (to select for the transposon) and nalidixic acid (to select for the recipient strain, CFT0731acZYA).
- a library of these insertion mutants can then be screened for beta-galactosidase enzyme activity on either MOPS agar plates containing X-Gal (5-bromo-4-chloro-3-indoyl beta-D-galactoside) or standard MacConkey agar. On each of these medias, production of beta-galactosidase enzyme by bacterial colonies can be assayed by a color change.
- CFT-0731acZYA has no endogenous beta-galactosidase enzyme activity, so any detected activity is due to expression of genes near where the transposon has been inserted into the genome.
- Each insertion mutant can be grown on the two indicator medias in the presence or absence of 500 micrograms per milliliter D-serine.
- a CFT073 dsdA Mutant has a Prolonged Lag Phase When Grown in Urine
- a dsdA deletion mutant lacking the 445-bp corresponding to bases 18-486 of SEQ ID NO:5 was constructed as described above.
- the mutant designated WAM2615, was found to lack the ability to grow on D-serine minimal medium.
- WAM2615 was able to grow on D-serine minimal medium.
- Growth of wild-type CFT073 and the dsdA mutant in glycerol minimal medium, human and mouse urine were compared. Neither wild-type CFT073 nor the dsdA mutant grew appreciably in mouse urine.
- the dsdA mutant grew at approximately the same rate as wild-type CFT073 in glycerol minimal medium, in which each had a doubling time of approximately 1.2 hours (FIG. 2A). However, when grown in human urine, the mutant exhibited a prolonged lag phase of approximately five hours, with an initial 10-fold decrease in colony-forming units (CFUs). At 10-12 h, the mutant then grew with wild type growth kinetics and after 11 h achieved CFUs equal to the wild-type (data not shown). The decrease in CFUs and prolonged lag phase was reproduced with the repeated subculture into fresh urine, which suggests that the resumption of growth was not due to acquired mutations (data not shown).
- type 1 fimbriae expression was assayed. In these experiments the wild-type and dsdA mutant strains were grown as described in Experimental procedures for the UTI studies.
- a pair of CFT073 mutants that are locked ON or locked OFF for expression of type 1 fimbriae were used as controls.
- the bacteria were diluted two-fold in microtiter wells and guinea pig erythrocytes were added to each well.
- the CFT073 type 1 locked ON strain had agglutination titers approximately two- to four-fold higher than wild-type CFT073.
- the CFT073 type 1 locked OFF mutant failed to agglutinate the erythrocytes.
- the agglutination titer for the dsdA mutant was two-fold less than the wild-type (data not shown).
- Lipopolysaccharide was isolated from the wild-type and the dsdA mutant. No differences were observed in the relative amounts or sizes of their lipopolysaccharides. In addition, no differences were observed in the expression of hemolysin (data not shown).
- a CFT073 library of random promoter probe miniTn10lux transposon insertions was constructed as described above.
- the transposon carries a promoterless luciferase gene that allows detection of loci potentially regulated by D-serine.
- mutants demonstrating changes in light production were subsequently grown in glycerol medium liquid cultures and the levels of luminescence in the presence or absence of D-serine were quantitated.
- Four mutants with 3-fold or greater changes in luminescence were identified, and the sequence of the miniTn10lux insertion identified by subcloning and DNA sequence analysis. Using this approach for four mutants having increased expression of luciferase in the presence of D-serine, transposon insertions were found in the direction of transcription in coding sequence of genes.
- yjiY a MG1655 gene with weak sequence similarity with a carbon starvation protein
- a MG1655 gene manY involved in transport of mannose
- ppsA phosphoenol-pyruvate synthase
- an enzyme involved in gluconeogenesis an enzyme involved in gluconeogenesis
- an orf of unknown function Interestingly, the unknown orf appears to be inserted within known genes involved in synthesis, secretion and immunity to the microcin H47, an antibiotic peptide produced by the H47 E. coli strain.
- the microcin 47 genes are present within a unique segment of CFT073 chromosome that is a new PAI present at the aspV tRNA site.
- a CFT073 lacZY deletion mutant was used as the recipient for the miniTn5lacZY transposon to generate transposon mutants. To date, 2,000 colonies have been screened for changes in LacZ phenotypes. Twenty mutants having either increased or decreased LacZ expression upon exposure to non-inhibitory levels of D-serine (50 ⁇ g/ml) in complex media after overnight growth have been identified. The analysis of these mutants is underway.
- the mutants will be subjected to Southern blotting analysis to assess the degree to which the insertions occur at different genomic locations.
- the insertion site for the D-serine regulated locus will be identified by either inverse PCR methods or direct selection of recombinant clones with the miniTn5lacZY kanamycin resistance gene and subsequent DNA sequence analysis of the transposon-chromosomal junction site using miniTn5lacZY termini-specific primers.
- a mutant having D-serine-independent expression of the dsdCXA genes may be constructed from a mutant in which either the dsdC or dsdCXA promoter is deleted and replaced with a different promoter.
- PCR-generated recombinant fragments encoding a tightly-controlled promoter such as the arabinose-inducible BAD promoter will replace the native promoter for independent control of transcription of either dsdC or dsdXA.
- Using this promoter will require controls that eliminate or minimize arabinose-specific effects outside of D-serine-affected sites.
- the native promoter-deletion mutant parent strain will be used under arabinose-inducing conditions as a control.
- suitable replacement promoters may include Class III-regulated CAP-cAMP promoter (MalT) or CAP-cAMP independent, inducible promoters.
- the cellular D-serine levels can be controlled in such a mutant during in vitro growth.
- dsdX would be reinserted, but under control of an independent constitutive promoter.
- a dsdC deletion mutant of either of the previous dsdX constructs will be complemented by a dsdC recombinant plasmid with dsdC under control of the araBAD (P bad ) promoter from vectors pBAD18 or pBAD33 (Guzman, et al., J. Bacteriol., 1995, 177(14): 4121-4130).
- CFT073 LacZ transcriptional fusion mutants altered in expression in the presence of arabinose and D-serine will be quantified by ⁇ -galactosidase assays. Mutants with 3-fold or greater changes in ⁇ -galactosidase activity in the presence of D-serine and arabinose will be chosen for further study.
- a secondary screen will identify the fusions that are independent of dsdC and controlled by D-serine alone. The dsdC-recombinant plasmid from the mutants will be cured by repeated rounds of nonselective growth as done by Nelson et al.
- Protein expression of wild type CFT073 and CFT073 dsdA mutant grown in human urine and glycerol minimal medium plus or minus D-serine will be compared. Total and envelope-enriched cellular proteins will be separated by two-dimensional electrophoresis (Kendrick Labs, Madison, Wis.). Samples will be prepared in triplicate and subjected to independent electrophoretic runs. Individual polypeptide spots will be isolated. Polypeptide species with reproducible changes in expression levels in response to loss of DsdA will be identified by MALDI-TOF mass spectrometric identification of peptides by molecular weight (Kendrick Labs, Madison, Wis.).
- Microarray techniques will be used to assess the patterns of genome-wide gene expression controlled by D-serine (+/ ⁇ ) DsdC.
- the initial comparative analysis will use RNA purified from CFT073 cells grown in glycerol minimal medium (+/ ⁇ ) D-serine.
- Initially studies will use the E. coli K-12 spotted orf arrays, which are available from the University of Wisconsin Gene Expression Center (GEC) (http://www.gcow.wisc.edu/gec/index.html) (Wei et al., J. Bacteriol., 2001, 183(2): 545-556).
- GEC University of Wisconsin Gene Expression Center
- CFT073 cells grown in minimal glycerol medium (+/ ⁇ ) D-serine will show expression differences for at least dsdXA, ppsa and manY, where those genes are represented on the E. coli K-12 arrays. These genes will serve as positive controls.
- CFT073-specific oligomer arrays will be made at the UW Gene Expression Center by the NimbleGen technology, a maskless array synthesizer that produces 125,000 custom oligonucleotide arrays with 20 base oligomers present at 16 ⁇ m ⁇ 16 ⁇ m spots.
- Microhybridization will be performed according to the protocol provided by GEC (http://www.gcow.wisc.edu/Gec/protocols/). Labeled probes will be generated starting with total RNA extracted from bacteria by the method described by the Gene Expression Center (http://www.gcow.wisc.edu/Gec/protocols/). Cells harvested from liquid cultures are treated with one part boiling lysis solution (2% SDS, 16 mM EDTA, 200 mM NaCl) to two parts culture solution and boiled for 5 minutes with vortexing. This aqueous solution will then be extracted three times with hot acid-phenol-chloroform and once with chloroform-isoamyl alcohol.
- RNA Total RNA will be precipitated with isopropanol overnight at ⁇ 20° C. and washed once with cold 70% ethanol. RNA samples will then be resuspended in DEPC-treated water and treated with RNase free RQ1 DNase (Promega). Each reaction will be then re-purified using an RNeasy kit (QIAgen) and further precipitated with lithium chloride (Ambion) according to each manufacturer's protocol. Samples will be resuspended in DEPC-treated water and the concentration and purity of each determined by absorbance at 260/280 nm. Integrity of the rRNA bands will be determined by running each sample on a formaldehyde-agarose gel.
- RNA samples of total RNA will be labeled with either Cy5- or Cy3-labeled dUTP (Amersham-Pharmacia) according to the protocol supplied by GEC. Briefly, 20 ⁇ g of each sample will be mixed with 10 ⁇ g of random hexamer primers (Amersham-Pharmacia), heated to 70° C. for 5 minutes, then placed on ice. The 1 ⁇ labeling mixture (containing first strand buffer [GibcoBRL], DTT, deoxynucleoside triphosphates [low dTTP, Promega], RNAsin [Promega] and DEPC water) will be added to each sample and incubated at room temperature for 10 minutes.
- first strand buffer [GibcoBRL]
- DTT deoxynucleoside triphosphates
- RNAsin RNAsin
- Cy5-dUTP (or Cy3-dUTP)-labeled cDNA will be generated by reverse-transcription with Superscript II (GibcoBRL) at 42° C. for 2 hours in the dark. After labeling, remaining RNA will be hydrolyzed by the addition of NaOH and incubation for 15 minutes at 65° C. followed by the addition of HCl and Tris (pH 7.4) to neutralize the reactions. Control and experimental probes will be purified, combined, and concentrated using Microcon MY-30 filter units (Amicon).
- the labeled probes will be mixed with one Ag salmon sperm DNA, 4 ⁇ g of yeast tRNA and 10 ⁇ l of PerfectHyb Plus hybridization buffer (Sigma) to a final volume of 20 ⁇ l. This mixture will be heated for 5 minutes at 100° C. and hybridized to the microarray overnight ( ⁇ 16 hours at 60° C.). Microarrays will then be washed 2 minutes in 0.2 ⁇ SSC, 0.1% SDS, followed by two washes in 0.2 ⁇ SSC, and a final wash in 0.05 ⁇ SSC for 15 seconds.
- Hybridization slides will be dried by centrifugation and immediately scanned using a Packard BioChip SA5000 scanning system.
- This system includes a scanning laser confocal fluorescence microscope and the ScanArray software that collects the 16-bit TIFF images. These images will be quantified using the QuantArray software, which automates spot finding and allows the quantitation of spot intensity.
- Induction ratios for each gene will be calculated by dividing background-adjusted intensities of the experimental sample by the background-adjusted intensities of the control sample. The intensities used for these calculations will be expressed as a percentage of the total intensity of all the spots on each array, which corrects for the specific activity of probes used between arrays. Those genes having induction ratios of greater than 2 standard deviations from the mean induction ratio will be selected for further analysis and confirmation by Northern analysis.
- DFI Differential Fluorescence Induction
- Valdivia and Falkow DFI technique permitted identification of 12 CFT073 genes that are up-expressed in the peritoneal cavity of mice but down-expressed during growth in L-broth.
- the DFI method may also be applied with a new library constructed in a CFT073 dsdA mutant background. This new library would then be put through selections of either growth in human urine or infected mouse bladders or kidneys in the UTI model.
- the D-serine-positive genes that are selected and eventually identified by DNA sequence analysis will be subjected to Northern blotting analysis to confirm the induction of steady-state levels of gene-specific mRNA. Based on the criteria described above, candidate genes will be chosen for mutant construction and phenotypic analysis.
- DsdC will be obtained from cellular extracts from strains in which DsdC is constitutively expressed or over expressed in vectors such as pT7 in a BL21 (DE3) pLysS background (Novagen).
- Candidate promoter fragments will be synthesized by PCR amplification, labeled internally with [ ⁇ - 32 ] dNTP or terminally by [ ⁇ - 32 ] dNTP, and purified.
- the labeled promoter fragments will be incubated in binding reactions including variables as described in Ausubel et al (Ausubel et al., Current Protocols in Molecular Biology, 1987, New York: John Wiley and Sons). Sample mixtures will be separated by nondenaturing polyacrylamide gel electrophoresis and the radiolabeled bands identified by autoradiography. If warranted based on the perceived importance of the identified targets to uropathogenesis, we will purify reagent amounts of DsdC for more detailed studies. DsdC will be acquired by standard over-expression vector technology or gene fusion affinity chromatography methods.
- the purified DsdC (+/ ⁇ D-serine) will be used in DNA mobility shift (gel-retardation) assays with overlapping PCR fragments and DNase I footprinting methods (Ausubel et al., 1987).
- Binding of DsdC protein to nucleotide sequences could also be detecting by labeling the DsdC protein with a detectable label, using the labeled DsdC to probe of an oligomer array of nucleotide sequences from a UPEC strain, detecting binding of DsdC to oligomers, and identifying the oligomer.
- the binding assays may be performed in the presence or absence of D-serine.
- loss-of-function, in-frame deletion mutants will be constructed utilizing the PCR fragment ⁇ -red recombinase method (Datsenko et al., 2000) and tested for the ability to compete with CFT073 wild type in the UTI model as described above.
- CFT073 mutants that are locked “ON” or “OFF” will be included as controls for type 1 pili expression and mannose-sensitive guinea pig RBC-agglutination activity. These strains were the kind gift of Dr. Harry Mobley.
- the CFT073 type 1 pilus “OFF” mutant should serve as the negative control for the invasion studies. We expect that these experiments will help to assess if the 300-fold greater competitive ability of the CFT073 dsdA mutant is correlated with either increased efficiency in attachment, intracellular invasion or intracellular persistence. The latter phenotype will be judged to occur if intracellular organisms do not decrease over time or there appears to be decrease in the killing of the cells in the infected epithelial cell layers. The uroepithelial invasion assay will be used to assess the phenotypes of the additional mutants isolated as described above.
- E. coli f-met-tRNA recognizes the triplet “GTG as an initition codon.
- the first amino acid residue encoded by SEQ ID NO1 should be a methionine residue instead of a valine residue.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Biomedical Technology (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Zoology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Immunology (AREA)
- Biophysics (AREA)
- Urology & Nephrology (AREA)
- Analytical Chemistry (AREA)
- Hematology (AREA)
- Plant Pathology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Cell Biology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Bioinformatics & Computational Biology (AREA)
- Pathology (AREA)
- General Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Food Science & Technology (AREA)
- Virology (AREA)
- Tropical Medicine & Parasitology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Disclosed are methods of detecting uropathogenic E. coli genes that are differentially expressed in response to D-serine. Also disclosed are methods of characterizing bacterial isolates from clinical samples based on the ability to metabolize D-serine.
Description
- This application claims priority to U.S. Provisional Application No. 60/281,859, filed Apr. 5, 2001, which is incorporated herein in its entirety.
- [0002] This invention was made with United States government support awarded by the following agencies: NIH AI39000. The United States has certain rights in this invention.
- Urinary tract infections (UTIs) are responsible for an estimated 10 million outpatient visits and more than one million hospitalizations each year in the United States. Annual health care costs attributable to UTIs exceed one billion dollars. Approximately 50% of women will have at least one UTI during their lives, and many of these women experience recurrent UTI.
- Uropathogenic Eschericia coli (UPEC), by far the most common uropathogen, is the causal agent in greater than 85% of uncomplicated UTIs. The organisms are also known to cause hospital acquired infections associated with injury, surgical procedures, or catheterization. UPEC present a complex ecological case among the pathogenic E. coli. UPEC can colonize the human intestinal tract without causing disease, yet they can sequentially colonize the perineum, urethra, bladder and kidney. Known E. coli virulence factors that contribute to UTIs are fimbriae (
e.g. type 1 fimbriae and Pap fimbriae), iron acquisition systems, hemolysin, cytotoxic necrotizing factor-1, and the Sat autotransporter. Environmental cues used by UPEC for expression of genes needed for colonization of the bladder and kidney have not been described in the literature. - Urinary tract infections in humans are generally treated with antibiotic therapy. Treatment of UTIs may be complicated by strains that are resistant to antibiotics, particularly to those antibiotics in common use. The identification of new UPEC-specified pathogenicity factors will facilitate the development of new methods of treating or preventing UTIs.
- In one aspect, the present invention provides a method of detecting uropathogenic E. coli nucleotide sequences differentially expressed in the presence or absence of D-serine comprising providing a library comprising a plurality of transposon mutants of a uropathogenic E. coli strain, the mutants comprising a transcriptional fusion comprising a transcriptional regulation sequence operably connected to a sequence encoding a detectable protein; growing the mutants in the presence or absence of D-serine; identifying mutants having increased or decreased expression of the transcriptional fusion in the presence or absence of D-serine by comparing the relative levels of the detectable protein of the mutants grown in the presence or absence of D-serine; identifying the insertion site of the transcriptional fusion in the identified transposon mutants; and correlating the insertion site with an E. coli nucleotide sequence.
- In another aspect, the present invention provides a method of identifying proteins differentially expressed in a wild-type uropathogenic E. coli strain and a uropathogenic E. coli dsdCXA locus mutant, the mutant having reduced expression of one or more proteins selected from the group consisting of DsdA, DsdC, and DsdX, the method comprising comparing proteins isolated from the wild-type uropathogenic E. coli strain and proteins isolated from the uropathogenic E. coli dsdCXA locus mutant; and identifying proteins from the dsdCXA mutant having increased or decreased level of expression relative to expression of the corresponding proteins in the wild-type uropathogenic E. coli strain.
- In other aspects, the invention includes a method of identifying a uropathogenic E. coli polynucleotide sequence that binds to DsdC protein comprising contacting an E. coli polynucleotide sequence with DsdC protein and detecting binding of the polynucleotide sequence to the protein.
- In another aspect, the present invention provides a method of detecting genes from a uropathogenic E. coli strain that are differentially expressed in the presence or absence of D-serine comprising hybridizing a first set of labeled oligonucleotide probes with an array of oligomers, the oligomers comprising gene sequences from the uropathogenic E. coli strain, wherein the first set of labeled oligonucleotide probes is made by reverse transcription of RNA isolated from uropathogenic E. coli grown in the presence of D-serine; hybridizing a second set of labeled oligonucleotide probes with an array of oligomers identical to the aforementioned array, wherein the second set of labeled oligonucleotide probes is made by reverse transcription of RNA isolated from uropathogenic E. coli grown in the absence of D-serine; comparing hybridization of labeled oligonucleotide probes to identify oligomers having differential hybridization to the first and second sets of oligonucleotide probes; and identifying genes comprising the sequences of the identified oligomers.
- In yet another aspect, the present invention provides a method of detecting proteins differentially expressed in a uropathogenic E. coli strain in response to D-serine, comprising providing a first culture of the uropathogenic E. coli strain grown in the presence of D-serine and a second culture of the uropathogenic E. coli strain grown in the absence of D-serine; comparing proteins isolated from the two cultures, and identifying proteins from E. coli grown in the presence of D-serine that are increased or decreased relative to the corresponding proteins in the uropathogenic E. coli strain grown in the absence of D-serine.
- The present invention also provides a method of characterizing bacterial strains isolated from clinical samples comprising testing the strain for the ability to grow in the presence of D-serine.
- In another aspect, the present invention provides urine dipstick comprising a solid support, a polypeptide comprising D-serine deaminase and an indicator responsive to ammonia, wherein the polypeptide and indicator are coimmobilized on the support.
- FIG. 1 compares the 53 minute regions of three E. coli strains: laboratory K-12 strain MG1655, enterohemorhagic E. coli (EHEC) EDL933, and uropathogenic E. coli CFT073.
- FIG. 2 compares the growth of wild type E. coli CFT073 or a dsdA mutant of E. coli CFT073 grown in minimal medium or urine as a function of time as measured by the concentration of colony forming units (2A) or the OD600 (2D).
- FIG. 3 compares the concentration of colony forming units of wild-type E. coli CFT073 or a dsdA mutant of E. coli CFT073 recovered from the bladder or kidneys of mice coinfected with mutant and wild-type E. coli CFT073.
- FIG. 4 compares the concentration of colony forming units of wild type E. coli CFT073 or a dsdA mutant of E. coli CFT073 recovered from the bladder or kidneys of mice coinfected with mutant and wild-type E. coli CFT073 strains carrying pWAM2682, which expresses DsdA in two replicate experiments (4A and 4B).
- FIG. 5 presents a model D-serine modulation of E. coli uropathogenesis.
- The sequence of the uropathogenic E. coli strain CFT073 genome was compared with the sequences of the genomes of E. coli laboratory K-12 strain MG1655 and enterohemorrhagic E. coli (EHEC) 0157 strain EDL933. A sequence found in uropathogenic E. coli strain CFT073 but not in E. coli K-12 or EHEC 0157 was identified in the 53 minute region of the K-12 genetic map. The sequence includes the D-serine tolerance (dsdCXA) locus, which is involved in overcoming D-serine toxicity, and is presented in SEQ ID NO: 13.
- The dsd locus includes coding sequences for a positive transcriptional activator (DsdC), a putative permease (DsdX), and a D-serine deaminase (DsdA), which catalyzes the hydrolysis of D-serine to give pyruvate and ammonia. The 53 minute region includes two putative recombinases unique to UPEC, which are designated ipuA and ipuB. The DNA sequences of dsdC, dsdX dsdA, ipuA and ipuB are provided in SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, and SEQ ID NO:9, respectively, and the deduced amino acid sequences of their putative translation products are shown in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, and SEQ ID NO:10, respectively. With reference to the SEQ ID NO:13, bases 3507-4844 correspond to SEQ ID NO:3; bases 4862-6190 correspond to SEQ ID NO:5; bases 1378-1947 correspond to SEQ ID NO:9; bases 3295-2342 correspond to the reverse complement of SEQ ID NO:1; and bases 623-7 correspond to the reverse complement of SEQ ID NO:7.
- Based on their deduced amino acid sequences, the proteins putatively encoded by ipuA and ipuB are homologs of the recombinase fimB, which suggests that the proteins may function to regulate expression of the dsd operon in a manner similar to the regulation of
type 1 fimbriae by FimB and FimE. - The polypeptide putatively encoded by dsdC is a member of the LysR family of transcriptional regulators. The LysR-type regulators are very common in bacteria and control the expression of genes associated with a wide range of cellular processes, including many diverse virulence factors. Members of the LysR family of transcriptional regulators usually require the presence of a specific coinducer to activate transcription. The polypeptides are about 270-330 amino acid residues in length and exhibit sequence similarity over about 270 amino acid residues. The strongest conservation occurs in the N-terminal 66 amino acid residues, which has a helix-turn-helix motif. The LysR-type regulators bind to DNA as dimers or tetramers and activate gene transcription, possibly by introducing a bend in the DNA helix.
- Until recently, it was considered axiomatic that naturally occurring D-amino acids occurred only in bacteria, which contain D-amino acids in their cell walls. Bruckner et al. reported the presence of D-amino acids, including D-serine, in the urine from seven human subjects (Bruckner, et al., Amino Acids 1994. 6:205-211). Of the D-amino acids in urine, D-serine was present in the highest concentration (30 to 380 μMol/l). The percentage of D-serine relative to total (D-plus L-) serine ranged from 19% to 56%. Bruckner et al. speculated that D-amino acids in urine are derived from dietary sources (e.g., Lactobacillus in food).
- It was recently reported that L-serine is converted to D-serine in mammalian brain by direct, enzyme-catalyzed racemization (Wolosker et al., PNAS 96:721-755, 1999). The highest concentrations of D-serine are found in regions of the brain having N-methyl-D-aspartate (NMDA) receptors for the neurotransmitter glutamate. D-serine functions as a potent agonist at the NMDA-glycine site and with glutamate can activate the NMDA receptor (Mothet et al. PNAS 97:4926-4931, 2000).
- Uropathogenic E. coli are able to grow in media or urine in the presence of D-serine, whereas E. coli strains that lack the dsdCXA genes are unable to grow or grow more slowly than UPEC strains. As shown in the Examples below, three collections of E. coli were examined for the ability to utilize media having D-serine as the sole source of carbon and nitrogen. The results showed that 49 of 60 tested UPEC strains were able to use D-serine as the sole source of carbon and nitrogen. In contrast, just four of 74 tested hemorrhagic colitis and infantile diarrheal E. coli strains were able to use D-serine as the sole source of carbon and nitrogen.
- As used herein, a “wild-type UPEC strain” is a strain of uropathogenic E. coli having the ability to use D-serine as the sole source of carbon and nitrogen.
- To evaluate the role of the dsd operon in UPEC pathogenicity, a mutant in wild-type UPEC strain CFT073 with a deletion in the dsdA gene was constructed as described in the Examples. The dsdA mutant was evaluated for its ability to establish infection in a murine model of ascending urinary tract infection. Surprisingly, the mutant strain was present in the bladders and kidneys 48 hours postinfection at levels of about 300 times that of the wild-type strain. This result runs counter to our expectation that a knockout of this putative virulence gene would be less effective than wild-type CFT073 in establishing an infection. Complementation of the dsdA mutant with DsdA supplied in trans resulted in recovery similar to wild type.
- The superior ability of the dsdA mutant over wild-type CFT073 to establish an infection in the murine coinfection experiments caused us to hypothesize that UPEC use D-serine as a signal to modulate growth and/or expression of virulence determinants. D-serine may act as an effector of the LysR-type transcriptional regulator, DsdC, to cause an increase in transcription. Production of DsdA would lower D-serine concentrations, which, if D-serine acts as an effector with DsdC, would reduce DsdC-regulated transcription. However, when concentrations of D-serine are relatively high, DsdC may remain active and cause the transcription of a myriad of genes for other virulence factors. It follows that people susceptible to recurrent UTIs caused by UPEC may produce urine with higher levels of D-serine than those who do not experience recurrent UTIs.
- The identification of genes whose expression is regulated by D-serine, DsdC, or DsdC and D-serine will facilitate the design of drugs and methods of treating UTIs. Several approaches may be used to identify genes or proteins as targets for drug therapy.
- One approach to identifying D-serine-regulated genes to create a library of transposon mutants by transposon-mediated mutagenesis, as described below in the Examples. Briefly, transposon mutagenesis is used to insert a gene encoding a detectable protein into the bacterial chromosome under the control of a transcriptional regulation sequence. Examples of suitable detectable proteins include, but are not limited to, luciferase and the lacZ gene product. Mutants expressing the detectable protein are identified and grown in the presence in or absence of D-serine. The relative expression of detectable protein in mutants grown in the presence and absence of D-serine is assessed, and those mutants exhibiting differential expression of the detectable protein in the presence and absence of D-serine are selected for further analysis.
- Another approach to identifying genes whose expression is affected by DsdC or D-serine metabolism is to compare the proteins expressed by a wild-type UPEC strain grown in the presence or absence of D-serine, or expressed by a wild-type UPEC strain and a mutant of the strain comprising a mutation in dsdC, dsdA, or dsdX grown in the presence or absence of D-serine. Proteins are isolated from the bacterial cultures, separated by two-dimensional gel electrophoresis, and differential expression identified by comparing gels obtained from different bacterial strains or by the same bacterial strain grown in the presence or absence of D-serine. Proteins of interest may be further characterized to permit identification.
- In another approach, RNA is isolated from a wild-type UPEC strain grown in the presence or absence of D-serine and subjected to reverse transcription to create labeled oligonucleotide probes, which in turn are hybridized to an oligomer array of E. coli gene sequences to identify sequences differentially expressed in the presence or absence of D-serine.
- Using the methods described below in the Examples, several sequences differentially expressed in the presence of D-serine have been identified. Further identification and characterization of differentially expressed sequences is underway. Loss of function mutants for proteins identified as being of potential interest will be developed and evaluated for virulence using any suitable method. One method of evaluating virulence is an intracellular invasion assay, described in the Examples. Proteins that are correlated with virulence may be used to develop a vaccine against UPEC-mediated UTIs, to develop antibodies for use in treating UTIs by passive immunization, or to screen potential therapeutic agents. Conveniently, once a protein of interest is identified, the protein may be obtained by cloning the gene encoding the protein into an expression vector and obtaining expression in a suitable host cell.
- The ability of certain UPEC strains to grow on D-serine will provide a simple means of differentiating E. coli isolates from urine or blood. The bacterial isolate may be tested for the ability to grow on or in medium containing D-serine as the sole carbon and nitrogen source. The bacterial isolate may be plated on a suitable medium comprising D-serine in a concentration effective to support growth of a UPEC strain capable of using D-serine as the sole source of carbon and nitrogen, suitably in the range of from about 100 μg/ml to about 500 μg/ml. Alternatively, an aliquot of the isolate may be transferred to a liquid medium comprising D-serine in an effective concentration, suitably in the range of from about 100 μg/ml to about 500 μg/ml. Growth in liquid medium may be assessed by observing turbidity. Optionally, the liquid medium may comprise an indicator such as tetrazolium, which undergoes a color change in response to bacterial growth. Alternatively, cells may be plated and then contacted with D-serine in a concentration effective to inhibit the growth of most non-UPEC isolates such as J198 (a normal fecal isolate), but which permits the growth of a wild-type UPEC strain such as E. coli strain CTF073. Conveniently, the D-serine may be delivered on a solid support, such as a filter disc of the type commonly used in antibiotic sensitivity testing, impregnated with D-serine.
- Whether a correlation between UTI susceptibility and urinary D-serine levels exists may be determined by assaying D-serine in the urine of human populations having differential susceptibility to UTIs. If such a correlation exists, assaying D-serine may be useful in identifying individuals having this particular susceptibility. Unfortunately, D-serine assays are presently performed by HPLC and are prohibitively expensive for routine use.
- The recognition that D-serine may play a role in establishing UTIs suggests the importance of having a simple assay for assaying D-serine levels. D-serine deaminase may be used to manufacture urine dipsticks and biosensors for quantifying D-serine. The D-serine deaminase may be coimmobilized on a suitable support with a suitable indicator that changes color in the presence of the ammonia generated by the enzymatic hydrolyis of D-serine. When the immobilized D-serine deaminase is exposed to a sample containing D-serine, the D-serine will be hydrolyzed to form pyruvate and ammonia, which will cause the indicator to change color, thus permitting the detection of the D-serine in the sample.
- The D-serine deaminase used in urine dipsticks or biosensors may be obtained from any source. For example, D-serine deaminase may be obtained by subcloning a D-serine deaminase gene, such as the CFT073 dsdA gene (SEQ ID NO:6), into a suitable expression vector, introducing the vector into a suitable host cell, allowing expression of the protein, and recovering from the host cell D-serine deaminase substantially free of other proteins.
- Suitable indicators include any compound that detectably changes in response to the hydrolysis of D-serine. For example, a suitable indicator may include an indicator that changes color in response to pH changes, which would occur with an increase in ammonia concentration. An example of such an indicator is phenol red.
- The following non-limiting examples are intended to be purely illustrative.
- Urosepsis E. coli strain CFT073 was originally isolated from the blood and urine of a woman admitted to the University of Maryland Medical System (Mobley et al., Infect. Immun., 1990, 58: 1281-1289). E. coli strain collections used in epidemiology experiments were a collection of urosepsis and urinary tract isolates (Arthur et al., Infect. Immun., 1989a, 57: 314-321.; Arthur et al., Infect. Immun., 1989b, 57: 303-313.; Arthur et al., Infect. Immun., 1990, 58: 471-479) and the DEC collection of hemorrhagic colitis and infantile diarrheal strains (Whittam et al., Infection and Immunity, 1993, 61: 1619-1629). Other strains used are described in Table 1: Bacterial strains and plasmids
TABLE 1 Strains, plasmids Description Source or reference Bacterial strains S17-λpir recA thi pro hsdR−M+ (Simon et al., 1983) (RP4-2 Tc::Mu—Km::Tn), λpir CFT073 urosepsis isolate (Mobley et al., 1990) CFT073 NalR urosepsis isolate; (Mobley et al., 1990) spontaneous NalR WAM2615 CFT073Δ −445 bp dsdA, NalR This study WAM2692 WAM2615 with pWAM2682 This study WAM2693 CFT073 with pWAM2682 This study Plasmids pEP185.2 suicide vector, CmR (Kinder et al., 1993) pWAM2682 pACYC177, dsdA+, KnR This study - To assess whether bacteria are able to utilize D-serine as the sole carbon and nitrogen source, bacteria were plated on a modified Minimal A Medium agar (15 g K 2HPO4, 4.5 g KH2PO4, 1 g K2SO4, 0.5 g Na3C6H5O7.2H2O, 15 g agar and 1 ml of 1 M solution MgSO4.7H2O per liter) supplemented with 500 ug/ml of D-serine (Sigma) (Miller, Experiments in molecular genetics, 1972). For growth studies, bacteria were grown in MOPS minimal medium [3-(N-morpholino)propanesulfonic acid] supplemented with 10 μM thiamine and 0.4% glycerol (Neidhardt et al., J. Bacteriol., 1974, 119: 736-747). For other experiments, the media used included L broth (5 g of sodium chloride, 5 g of yeast extract and 10 g of tryptone per liter [Fisher Scientific]) and L agar (L broth containing 1.5% agar [Difco]). Human urine was provided by two different donors and filter sterilized prior to use in experiments. Mouse urine was collected from a group of eight mice and UV sterilized prior to use. Antibiotics were added as needed at the following concentrations: nalidixic acid and kanamycin at 50 μg/ml and chloraramphenicol at 20 μg/ml (Sigma).
- Liquid cultures were grown at 37° degrees with aeration except for cultures used in mouse UTI experiments. In the mouse UTI experiments, each strain was inoculated into L-broth and grown without aeration for two days. Bacteria were then passaged into fresh medium two additional times for a total of 6 days growth.
- To determine growth curves, single colonies were inoculated into 5 ml of the appropriate medium and grown overnight at 37° C. with aeration. Following the overnight incubation, cells were collected by centrifugation, washed in saline and resuspended in 0.5 ml of saline (8.5 g NaCl per liter). The bacteria were then added to pre-warmed tubes of the desired medium. The OD 600 was adjusted to 0.03 with the appropriate medium. The cultures were grown at 37° C. with aeration and OD600 readings were taken at the designated time points and dilutions were plated to determine CFUs.
- Restriction and DNA modification enzymes were purchased from either New England Biolabs or Promega Corporation. dsdA-specific primers for Polymerase Chain Reaction (PCR) were designed using the CFT073 genome sequence (R.A. Welch, unpublished). The dsdA primers used were 5′-GGATGGCGATGCTGCGTTG-3′ (SEQ ID NO:11) and 5′-CACAGGGGAAGGTGAGATATGC-3′ (IDT) (SEQ ID NO:12). The template was wild-type CFT073 genomic DNA prepared using the Wizard Genomic DNA Isolation Kit (Promega). The PCR amplification used the Expand High Fidelity PCR System (Roche Diagnostics) according to the manufacturer's protocol. The resulting PCR product was gel purified with the QIAQuick Gel Purification Kit (Qiagen) and ligated into pGEMt (Promega). The resulting dsdA recombinant plasmid was examined by restriction endonuclease analysis. A dsda deletion was constructed by digesting the dsdA plasmid with PmeI and EcoRI, which resulted in a 445 bp deletion. The digested plasmid was treated with T4 DNA polymerase in order to blunt the DNA ends. After gel purification, the plasmid was religated. An Apal-SacI fragment containing dsdA was subcloned into the suicide vector, pEP185.2 (kind gift of Virginia Miller, Washington University). This construct was sequenced (ACGT, Incorporated) using T3 and T7 primers provided by ACGT. Conjugation was carried out as described by Kinder et al. (Kinder et al., Gene, 1993, 136: 271-275) with some modifications. The suicide plasmid construct was transformed into a S17λpir background and then conjugated into a nalidixic acid resistant mutant of CFT073. Single-crossover chromosomal integrates of the recombinant plasmid were selected and D-cycloserine enrichment was used to select for a second crossover event that excised the suicide plasmid backbone from the CFT073 chromosome. Chromosomal DNA was isolated from the putative dsdA mutants and examined via PCR analysis using dsdA-specific primers to confirm the deletion. Phenotypic analysis of the putative mutants was carried out by inoculating colonies onto MOPS glycerol minimal and MOPS minimal D-serine agar media.
- The dsdA gene, including its Shine and Delgarno sequence, was PCR amplified with primers (5′GCGCTGCAGCGTTATTAACGGCCTTTTGCCA GATATTGATTC) (SEQ ID NO:17) and (5′CGCGGATCCCGTACTAT GGAAAACGCTAAAA TGAATTCGC) (SEQ ID NO:18). The primers were designed with PstI and BamHI restriction sites at their respective 5′ ends. The PCR product was digested with PstI and BamHI enzymes, gel purified and ligated with pACYC177 previously digested with the same two restriction enzymes. The plasmid, designated pWAM2682, was used in subsequent complementation analysis.
- After 6 days of static growth, the density of each bacterial strain was adjusted to an OD 600 of 0.6. Equal numbers of wild-type CFT073 cells and dsdA mutant (nalidixic acid resistant background) were mixed and pelleted by centrifugation for 15 min at 7000 rpm. The pellet was resuspended in 500 μl of phosphate buffered saline (7.5 g NaCl, 0.2 g KCl, 0.2 g KH2PO4, 1.15 g Na2HPO4 per liter). Female CBA/JCrHsd mice (five to eight weeks of age, Harlan) were inoculated via trans-urethral catheterization with 50 μl of the prepared bacteria (approximately 1×109 organisms). After 48 h, the mice were sacrificed and their bladders and kidneys harvested. The organs were homogenized and dilutions plated on L agar without antibiotics. After overnight incubation, colonies were counted and then replica plated onto L agar containing nalidixic acid in order to determine the number of surviving dsdA mutants. The nalidixic acid resistant CFT073 background strain used in the construction of the dsda mutant competes equally with the CFT073 nalidixic acid sensitive parent strain in the UTI model (Torres et al., Infect. Immun., 2001, 69: 6179-6185).
- For the in vivo complementation studies, both the wild-type and dsdA mutant strains carried the complementing dsdA plasmid, pWAM2682. Both strains were grown as above with the addition of kanamycin in order to assure maintenance of the plasmids during the six day culture. Results were analyzed for statistical significance via paired Student's t test.
- The wild-type and dsdA mutant strains were grown as described for the murine infection studies. The passaged cultures were diluted to an OD 600 of 0.6 and CFUs determined by plate count. The assay was carried out in a 96 well U-bottomed microtiter plate. Serial two-fold dilutions of each culture (50 μl) were mixed with 50 μl of a 25% suspension of guinea pig erythrocytes in saline. The mixtures were incubated at room temperature and agglutination titers were determined after 1.5 h. In these experiments, CFT073 mutants locked ON and locked OFF for
type 1 fimbriae expression (N. W. Gunther, unpublished) were used as hemagglutination controls (kind gift of N. W. Gunther and H. L. Mobley). - Wild-type and dsdA mutant strains grown overnight in human urine were inoculated into fresh urine and examined microscopically after 3 and 24 h of growth. The cell suspensions were air-dried on glass slides and stained with safranin. The cells were visualized using a Zeiss Axioplan IIi microscope and the images were collected digitally using OpenLabs 3.02 software (Improvision, Inc.).
- A CFT073 lacZY deletion mutant was isolated using the λ red recombinase mutagenesis system of Datsenko and Wanner (Datsenko et al., Proc. Natl. Acad. Sci. USA, 2000, 97(12): 6640-6645), which is incorporated by reference herein. Briefly, PCR products were generated using primers with 36- to 50-nucleotide extensions that are homologous to the region adjacent to the lacZYA genes and template plasmid carrying an antibiotic resistance gene flanked by FLP recognition target sites. The PCR product was introduced into CFT073 carrying a Red expression plasmid via electroporation. After selection, the resistance gene was eliminated introduction of a helper plasmid which acts directly on the FLP recognition target sites flanking the resistance gene. The temperature sensitive Red and FLP helper plasmids are cured by growth at 37 degrees C.
- MiniTn10lux transposon insertion mutants of CTF073 were constructed using MiniTn10lux. E. coli strain KV330 (carrying the suicide plasmid pKV45 which harbors the mini-TN10 transposon, transposase and resolvase) was the donor strain for conjugal transfer into recipient host strain CFT073. Transconjugants were plated on LB agar supplemented with chloramphenicol (to select for the transposon) and nalidixic acid (to select for the host strain, CFT073). Individual isolates were assayed for luciferase expression in the absence of presence of D-serine.
- MiniTn5lacZY transposon insertion mutants of a CFT073 lacZY deletion mutant were constructed as follows. E.coli S 17-1λpir containing the pREV10 plasmid (this plasmid cannot be maintained in CFT073, but contains a MiniTn5Km2 transposon with an added promoterless lacZY gene) was the donor strain for conjugal transfer into recipient strain, CFT0731acZYA. Transconjugants were plated on LB agar supplemented with kanamycin (to select for the transposon) and nalidixic acid (to select for the recipient strain, CFT0731acZYA). A library of these insertion mutants can then be screened for beta-galactosidase enzyme activity on either MOPS agar plates containing X-Gal (5-bromo-4-chloro-3-indoyl beta-D-galactoside) or standard MacConkey agar. On each of these medias, production of beta-galactosidase enzyme by bacterial colonies can be assayed by a color change. CFT-0731acZYA has no endogenous beta-galactosidase enzyme activity, so any detected activity is due to expression of genes near where the transposon has been inserted into the genome. Each insertion mutant can be grown on the two indicator medias in the presence or absence of 500 micrograms per milliliter D-serine. (Nelson K. M. et al., Infect. Immun., 2001, 69(10):6201-6208.)
- Among the urosepsis and urinary tract isolates tested, 49 of 60 were able to grow on D-serrine minimal medium. In contrast, only four of the 74 strains from the DEC collection of hemorrhagic colitis and infantile diarrheal strains were able to grow on D-serine minimal medium.
- To evaluate the role of D-serine catabolism in uropathogenesis, a dsdA deletion mutant lacking the 445-bp corresponding to bases 18-486 of SEQ ID NO:5 was constructed as described above. The mutant, designated WAM2615, was found to lack the ability to grow on D-serine minimal medium. When DsdA was provided in trans, WAM2615 was able to grow on D-serine minimal medium. Growth of wild-type CFT073 and the dsdA mutant in glycerol minimal medium, human and mouse urine were compared. Neither wild-type CFT073 nor the dsdA mutant grew appreciably in mouse urine. The dsdA mutant grew at approximately the same rate as wild-type CFT073 in glycerol minimal medium, in which each had a doubling time of approximately 1.2 hours (FIG. 2A). However, when grown in human urine, the mutant exhibited a prolonged lag phase of approximately five hours, with an initial 10-fold decrease in colony-forming units (CFUs). At 10-12 h, the mutant then grew with wild type growth kinetics and after 11 h achieved CFUs equal to the wild-type (data not shown). The decrease in CFUs and prolonged lag phase was reproduced with the repeated subculture into fresh urine, which suggests that the resumption of growth was not due to acquired mutations (data not shown). Furthermore, when culture growth was monitored by OD 600, the initial decrease in CFUs exhibited by the dsdA mutant in urine was not accompanied by a concomitant decrease in optical density (FIG. 2B). The prolonged lag phase and loss of CFUs were abrogated when DsdA was provided in trans (data not shown). These results support the hypothesis that the ability to catabolize D-serine provides a growth advantage to UPEC in urine.
- The growth defect of the mutant in urine prompted a microscopic examination of its cellular morphology. Photomicrographs of the wild-type and mutant cells corresponding to the 3 h time point of the urine growth curve showed that the dsda mutant lost the typical rod-shape seen with the wild-type and the pleiomorphic mutant cells were rounded and swollen. After 24 h in urine, the mutant cells became more rod-like and smaller in size. The cell shape defect observed for the mutant in urine was complemented when DsdA was provided in trans. When the mutant and wild-type were grown in L broth or glycerol minimal medium they appeared as rods that were similar in size (data not shown).
- The ability of dsdA mutant to establish infection was evaluated in a murine model of ascending UTI, as described above. Two days postinfection, the mutant was recovered at higher frequencies than the wild-type from both bladders and kidneys (FIG. 3A and 3B). When two experiments were averaged, the recovered CFUs from the bladders and kidneys were approximately 270- and 300-times higher, respectively, for the mutant than the wild-type. The results were similar in an additional experiment where the tissues were harvested at eight days post-infection (data not shown). Overall, these results did not meet the expectation that the dsdA mutant would be less competitive than the wild-type in the UTI model. To examine the possibility that the strains differed in competitive growth ability, they were grown together in L broth, glucose minimal medium and glycerol minimal medium. There were no significant differences in growth or recovery rates of either the wild-type or the mutant in these media (data not shown).
- In order to rule out the possibility that unknown mutations in the dsdA mutant background were responsible for the competitive advantage, an intact dsd4 gene was provided in trans. The complemented strain regained the ability to grow in D-serine minimal medium. For UTI experiments, the wild-type strain was transformed with the complementing plasmid in order to compensate for dsdA copy effects. When these strains were tested in the murine model, no statistically significant difference was observed in their recovery after a two day infection (FIG. 4 A and B).
- Several groups have shown that expression of
type 1 fimbriae is critical for establishing a UTI in different model systems (Connell et al., PNAS, 1996, 93: 9827-9832; Mulvey et al., Science, 1998, 282: 1494-1497; Sokurenko et al., PNAS, 1998, 95: 8922-8926). To determine if alterations intype 1 fimbrial expression were responsible for the dramatic effects of the dsdA mutation in vivo,type 1 fimbriae expression was assayed. In these experiments the wild-type and dsdA mutant strains were grown as described in Experimental procedures for the UTI studies. A pair of CFT073 mutants that are locked ON or locked OFF for expression oftype 1 fimbriae (gift of Dr. Harry Mobley) were used as controls. The bacteria were diluted two-fold in microtiter wells and guinea pig erythrocytes were added to each well. TheCFT073 type 1 locked ON strain had agglutination titers approximately two- to four-fold higher than wild-type CFT073. As expected, theCFT073 type 1 locked OFF mutant failed to agglutinate the erythrocytes. The agglutination titer for the dsdA mutant was two-fold less than the wild-type (data not shown). - Lipopolysaccharide was isolated from the wild-type and the dsdA mutant. No differences were observed in the relative amounts or sizes of their lipopolysaccharides. In addition, no differences were observed in the expression of hemolysin (data not shown).
- In order to identify genes with altered expression in response to the presence or absence of D-serine, a CFT073 library of random promoter probe miniTn10lux transposon insertions was constructed as described above. The transposon carries a promoterless luciferase gene that allows detection of loci potentially regulated by D-serine. Two thousand mutants grown on glycerol minimal medium in the presence or absence of D-serine, and observed for changes in light production using an X-ray film overlay.
- The mutants demonstrating changes in light production were subsequently grown in glycerol medium liquid cultures and the levels of luminescence in the presence or absence of D-serine were quantitated. Four mutants with 3-fold or greater changes in luminescence were identified, and the sequence of the miniTn10lux insertion identified by subcloning and DNA sequence analysis. Using this approach for four mutants having increased expression of luciferase in the presence of D-serine, transposon insertions were found in the direction of transcription in coding sequence of genes. These genes were: yjiY, a MG1655 gene with weak sequence similarity with a carbon starvation protein; a MG1655 gene, manY involved in transport of mannose, ppsA (phosphoenol-pyruvate synthase), an enzyme involved in gluconeogenesis, and an orf of unknown function. Interestingly, the unknown orf appears to be inserted within known genes involved in synthesis, secretion and immunity to the microcin H47, an antibiotic peptide produced by the H47 E. coli strain. Based on the comparison of the three E. coli genomes, the microcin 47 genes are present within a unique segment of CFT073 chromosome that is a new PAI present at the aspV tRNA site.
- A CFT073 lacZY deletion mutant was used as the recipient for the miniTn5lacZY transposon to generate transposon mutants. To date, 2,000 colonies have been screened for changes in LacZ phenotypes. Twenty mutants having either increased or decreased LacZ expression upon exposure to non-inhibitory levels of D-serine (50 μg/ml) in complex media after overnight growth have been identified. The analysis of these mutants is underway.
- The mutants will be subjected to Southern blotting analysis to assess the degree to which the insertions occur at different genomic locations. The insertion site for the D-serine regulated locus will be identified by either inverse PCR methods or direct selection of recombinant clones with the miniTn5lacZY kanamycin resistance gene and subsequent DNA sequence analysis of the transposon-chromosomal junction site using miniTn5lacZY termini-specific primers.
- Mutants in which expression of dsdC and dsdXA genes is independent of D-serine will be constructed by dsdC and dsdXA constitutive promoter mutations or by the recombinant construction of dsdC and dsdXA operons under the control of D-serine-independent, conditional promoters. D-serine deaminase constitutive mutants will be isolated as described by McFall et al. using chemical mutageneis and enrichment techniques in which D-serine is a growth limiting substrate (Bloom et al., J. Bacteriol., 1975, 121: 1078-1084; Bloom et al., J. Bacteriol., 1975, 121(3): 1092-1101; McFall, Molecular Biology, 1964, 9: 754-762; McFall, J. Bacteriol., 1967, 94(6): 1982-1988; McFall, J. Bacteriol., 1975, 121(3): 1074-1077; McFall, J. Bacteriol., 1973, 113(2): 781-785; McFall et al., Mol. Gen. Genet., 1970, 106(4): 371-377) which are incorporated by reference herein.
- Alternatively, a mutant having D-serine-independent expression of the dsdCXA genes may be constructed from a mutant in which either the dsdC or dsdCXA promoter is deleted and replaced with a different promoter. Using the λ-red recombinase method, PCR-generated recombinant fragments encoding a tightly-controlled promoter such as the arabinose-inducible BAD promoter will replace the native promoter for independent control of transcription of either dsdC or dsdXA. Using this promoter will require controls that eliminate or minimize arabinose-specific effects outside of D-serine-affected sites. The native promoter-deletion mutant parent strain will be used under arabinose-inducing conditions as a control. Other suitable replacement promoters may include Class III-regulated CAP-cAMP promoter (MalT) or CAP-cAMP independent, inducible promoters. With the above constructs the expression of dsdC or dsdXA can be put under DsdC-D-serine-independent control.
- Using either the constitutive promoter mutants or BAD-dependent constructs, loss of function mutations for each dsdCXA gene alone and selected combinations will be constructed. This will permit examination of a variety of phenotypes independent of D-serine-induced expression. For example, dsdA and dsdA- dsdC double mutants in these backgrounds would enable study of dsdA phenotypes that are either dsdC-dependent or -independent. In addition, in a dsdC mutant where dsdXA is under D-serine independent expression, possible D-serine transport via DsdX can be studied without the changes in DsdX expression.
- Genetic screening to identify dsdC-dependent and -independent/ D-serine regulated genes will involve assessing LacZ phenotypic changes in a miniTn5 lacZY CFT073 mutant library in which dsdC is under control of a conditional promoter. This strategy was used to identify RscR-regulated genes in Y. enterocolitica (Nelson, et al., 2001). However, in our system, the D-serine transporter gene, dsdX must be under transcriptional control that is independent of dsdC and D-serine. A dsdA deletion will be introduced into a dsdXA constitutive promoter mutant isolated as described above. The cellular D-serine levels can be controlled in such a mutant during in vitro growth. Alternatively, starting with a CFT073 dsdXA deletion mutant background, dsdX would be reinserted, but under control of an independent constitutive promoter. For the conditional expression of dsdC, a dsdC deletion mutant of either of the previous dsdX constructs will be complemented by a dsdC recombinant plasmid with dsdC under control of the araBAD (P bad) promoter from vectors pBAD18 or pBAD33 (Guzman, et al., J. Bacteriol., 1995, 177(14): 4121-4130). CFT073 LacZ transcriptional fusion mutants altered in expression in the presence of arabinose and D-serine will be quantified by β-galactosidase assays. Mutants with 3-fold or greater changes in β-galactosidase activity in the presence of D-serine and arabinose will be chosen for further study. A secondary screen will identify the fusions that are independent of dsdC and controlled by D-serine alone. The dsdC-recombinant plasmid from the mutants will be cured by repeated rounds of nonselective growth as done by Nelson et al. (Nelson et al., 2001) The cured mutants will be assessed for retention of the D-serine-specific phenotype. Those mutants that are regulated by arabinose alone will be eliminated from further study. Transposon insertion sites of mutants specific for dsdC or D-serine will be identified as described above.
- Protein expression of wild type CFT073 and CFT073 dsdA mutant grown in human urine and glycerol minimal medium plus or minus D-serine will be compared. Total and envelope-enriched cellular proteins will be separated by two-dimensional electrophoresis (Kendrick Labs, Madison, Wis.). Samples will be prepared in triplicate and subjected to independent electrophoretic runs. Individual polypeptide spots will be isolated. Polypeptide species with reproducible changes in expression levels in response to loss of DsdA will be identified by MALDI-TOF mass spectrometric identification of peptides by molecular weight (Kendrick Labs, Madison, Wis.).
- Microarray techniques will be used to assess the patterns of genome-wide gene expression controlled by D-serine (+/−) DsdC. The initial comparative analysis will use RNA purified from CFT073 cells grown in glycerol minimal medium (+/−) D-serine. Initially studies will use the E. coli K-12 spotted orf arrays, which are available from the University of Wisconsin Gene Expression Center (GEC) (http://www.gcow.wisc.edu/gec/index.html) (Wei et al., J. Bacteriol., 2001, 183(2): 545-556). Based on preliminary data, it is expected that CFT073 cells grown in minimal glycerol medium (+/−) D-serine will show expression differences for at least dsdXA, ppsa and manY, where those genes are represented on the E. coli K-12 arrays. These genes will serve as positive controls. CFT073-specific oligomer arrays will be made at the UW Gene Expression Center by the NimbleGen technology, a maskless array synthesizer that produces 125,000 custom oligonucleotide arrays with 20 base oligomers present at 16 μm×16 μm spots.
- Microhybridization will be performed according to the protocol provided by GEC (http://www.gcow.wisc.edu/Gec/protocols/). Labeled probes will be generated starting with total RNA extracted from bacteria by the method described by the Gene Expression Center (http://www.gcow.wisc.edu/Gec/protocols/). Cells harvested from liquid cultures are treated with one part boiling lysis solution (2% SDS, 16 mM EDTA, 200 mM NaCl) to two parts culture solution and boiled for 5 minutes with vortexing. This aqueous solution will then be extracted three times with hot acid-phenol-chloroform and once with chloroform-isoamyl alcohol. Total RNA will be precipitated with isopropanol overnight at −20° C. and washed once with cold 70% ethanol. RNA samples will then be resuspended in DEPC-treated water and treated with RNase free RQ1 DNase (Promega). Each reaction will be then re-purified using an RNeasy kit (QIAgen) and further precipitated with lithium chloride (Ambion) according to each manufacturer's protocol. Samples will be resuspended in DEPC-treated water and the concentration and purity of each determined by absorbance at 260/280 nm. Integrity of the rRNA bands will be determined by running each sample on a formaldehyde-agarose gel.
- Samples of total RNA will be labeled with either Cy5- or Cy3-labeled dUTP (Amersham-Pharmacia) according to the protocol supplied by GEC. Briefly, 20 μg of each sample will be mixed with 10 μg of random hexamer primers (Amersham-Pharmacia), heated to 70° C. for 5 minutes, then placed on ice. The 1× labeling mixture (containing first strand buffer [GibcoBRL], DTT, deoxynucleoside triphosphates [low dTTP, Promega], RNAsin [Promega] and DEPC water) will be added to each sample and incubated at room temperature for 10 minutes. Cy5-dUTP (or Cy3-dUTP)-labeled cDNA will be generated by reverse-transcription with Superscript II (GibcoBRL) at 42° C. for 2 hours in the dark. After labeling, remaining RNA will be hydrolyzed by the addition of NaOH and incubation for 15 minutes at 65° C. followed by the addition of HCl and Tris (pH 7.4) to neutralize the reactions. Control and experimental probes will be purified, combined, and concentrated using Microcon MY-30 filter units (Amicon).
- The labeled probes will be mixed with one Ag salmon sperm DNA, 4 μg of yeast tRNA and 10 μl of PerfectHyb Plus hybridization buffer (Sigma) to a final volume of 20 μl. This mixture will be heated for 5 minutes at 100° C. and hybridized to the microarray overnight (˜16 hours at 60° C.). Microarrays will then be washed 2 minutes in 0.2×SSC, 0.1% SDS, followed by two washes in 0.2×SSC, and a final wash in 0.05×SSC for 15 seconds.
- Hybridization slides will be dried by centrifugation and immediately scanned using a Packard BioChip SA5000 scanning system. This system includes a scanning laser confocal fluorescence microscope and the ScanArray software that collects the 16-bit TIFF images. These images will be quantified using the QuantArray software, which automates spot finding and allows the quantitation of spot intensity.
- Induction ratios for each gene will be calculated by dividing background-adjusted intensities of the experimental sample by the background-adjusted intensities of the control sample. The intensities used for these calculations will be expressed as a percentage of the total intensity of all the spots on each array, which corrects for the specific activity of probes used between arrays. Those genes having induction ratios of greater than 2 standard deviations from the mean induction ratio will be selected for further analysis and confirmation by Northern analysis.
- Currently, most array data are analyzed by observing the level of induction and choosing an arbitrary cut-off point for significance. The GEC is investigating more rigorous and computationally intense methods of array data analysis. DNA microarray experiments are expensive, so replication of experiments needs to be minimized. To address this, a Bayesian statistical approach that includes prior information about the arrays (from previous experiments) in weighting the expression ratios can be used to improve the confidence of measurements (Long et al., J. Biol. Chem., 2001, 276(23): 19937-19944). Software that applies this method, Cyber-T, is available freely from the genomics web site at the University of California at Irvine (http://www.Renomics.uci.edu/software.html) (Long et al., 2001).
- Northern blotting or primer-extension analysis will be used to confirm the change in transcription levels. Candidate genes that are altered in their expression upon exposure to D-serine will be assessed for likelihood of their impact on uropathogenesis. Solid candidate genes will be mutagenized and mutants assessed for their competitive ability against the CFT073 wild type strain in the mouse UTI model.
- In our laboratory, the Valdivia and Falkow DFI technique (Valdivia, et al., Science, 1997, 277: 2007-2011) permitted identification of 12 CFT073 genes that are up-expressed in the peritoneal cavity of mice but down-expressed during growth in L-broth. We constructed two separate CFT073 libraries containing 1-3 kb partial Sau3A digest fragments inserted into the promoterless GFP vector, BVC. We will reuse the libraries to identify promoters that are negatively or positively affected by growth in the presence of D-serine. This approach will identify genes that undergo relatively large changes in their expression under the alternative conditions (presence or absence of D-serine). The DFI method may also be applied with a new library constructed in a CFT073 dsdA mutant background. This new library would then be put through selections of either growth in human urine or infected mouse bladders or kidneys in the UTI model. The D-serine-positive genes that are selected and eventually identified by DNA sequence analysis will be subjected to Northern blotting analysis to confirm the induction of steady-state levels of gene-specific mRNA. Based on the criteria described above, candidate genes will be chosen for mutant construction and phenotypic analysis.
- Direct binding of DsdC to candidate genes will be evaluated in the presence and absence D-serine. DsdC will be obtained from cellular extracts from strains in which DsdC is constitutively expressed or over expressed in vectors such as pT7 in a BL21 (DE3) pLysS background (Novagen). Candidate promoter fragments will be synthesized by PCR amplification, labeled internally with [α- 32] dNTP or terminally by [γ-32] dNTP, and purified. The labeled promoter fragments will be incubated in binding reactions including variables as described in Ausubel et al (Ausubel et al., Current Protocols in Molecular Biology, 1987, New York: John Wiley and Sons). Sample mixtures will be separated by nondenaturing polyacrylamide gel electrophoresis and the radiolabeled bands identified by autoradiography. If warranted based on the perceived importance of the identified targets to uropathogenesis, we will purify reagent amounts of DsdC for more detailed studies. DsdC will be acquired by standard over-expression vector technology or gene fusion affinity chromatography methods. To more precisely map the DsdC binding sites, the purified DsdC (+/−D-serine) will be used in DNA mobility shift (gel-retardation) assays with overlapping PCR fragments and DNase I footprinting methods (Ausubel et al., 1987).
- Binding of DsdC protein to nucleotide sequences could also be detecting by labeling the DsdC protein with a detectable label, using the labeled DsdC to probe of an oligomer array of nucleotide sequences from a UPEC strain, detecting binding of DsdC to oligomers, and identifying the oligomer. The binding assays may be performed in the presence or absence of D-serine.
- In vitro transcription reactions will be performed as described previously (Aiyar et al., PNAS, 1998, 95(25): 14652-14657; Weyand et al., Mol. Microbiol., 2001, 39(6): 1504-1522), in the presence and absence of D-serine. These experiments will permit identification of newly synthesized labeled transcripts by gel electrophoresis and autoradiography. McFall's group demonstrated that D-serine deaminase activity could be detected in DsdC-dependent in vitro transcription/translation reactions using recombinant plasmids (Heincz et al, J. Bacteriol., 1984, 160(1): 42-49). This result suggests that in vitro transcriptional analysis with DsdC-dependent promoters is feasible.
- To directly assess the relevance of the D-serine-regulated genes that are identified, loss-of-function, in-frame deletion mutants will be constructed utilizing the PCR fragment λ-red recombinase method (Datsenko et al., 2000) and tested for the ability to compete with CFT073 wild type in the UTI model as described above.
- Intracellular invasion assays of 5637 human bladder epithelial cells as described by Mulvey et al. (Mulvey et al., Science, 1998, 282(5393): 1494-1497; Mulvey et al., Infect. Immun., 2001, 69(7): 4572-4579), which are incorporated by reference herein, will be performed to compare the abilities of the CFT073 and CFT073 dsdA strains to invade and replicate. We have acquired from Dr. Scott Hultgren, E. coli strain NU14 will be used as a positive control in the invasion studies. In addition, two CFT073 mutants that are locked “ON” or “OFF” will be included as controls for
type 1 pili expression and mannose-sensitive guinea pig RBC-agglutination activity. These strains were the kind gift of Dr. Harry Mobley. TheCFT073 type 1 pilus “OFF” mutant should serve as the negative control for the invasion studies. We expect that these experiments will help to assess if the 300-fold greater competitive ability of the CFT073 dsdA mutant is correlated with either increased efficiency in attachment, intracellular invasion or intracellular persistence. The latter phenotype will be judged to occur if intracellular organisms do not decrease over time or there appears to be decrease in the killing of the cells in the infected epithelial cell layers. The uroepithelial invasion assay will be used to assess the phenotypes of the additional mutants isolated as described above. - The present invention is not limited to the exemplified embodiments, but is intended to encompass all such modifications and variations as come within the scope of the following claims.
- Aiyar, S. E., et al., PNAS, 1998, 95(25): 14652-14657.
- Arthur, M., et al., Infect. Immun., 1989a, 57: 314-321.
- Arthur, M., et al., Infect. Immun., 1989b, 57: 303-313.
- Arthur, M., et al., Infect. Immun., 1990, 58: 471-479.
- Ausubel et al., Current Protocols in Molecular Biology, 1987, New York: John Wiley and Sons.
- Bloom, F. R., et al., J. Bacteriol., 1975, 121:1078-1084.
- Bloom, F. R., et al., J. Bacteriol., 1975, 121(3): 1092-1101.
- Bruckner, J., et al., Amino Acids, 1994, 6: 205-211.
- Connell, J., et al., Proc. Natl. Acad. Sci. USA, 1996, 93: 9827-9832.
- Datsenko, K., et al., Proc. Natl. Acad. Sci. USA, 2000, 97(12): 6640-6645.
- Guzman, L., et al., J. Bacteriol., 1995, 177(14): 4121-4130.
- Heincz, M. C., et al., J. Bacteriol. 1984, 160: 42-49.
- Kinder, S. A., et al., Gene, 1993. 136: 271-275.
- Long, A. D., et al., J. Biol. Chem., 2001, 276(23): 19937-19944.
- McFall, E., Molecular Biology, 1964, 9: 754-762.
- McFall, E., J. Bacteriol. 1967, 94: 1982-1988.
- McFall, E., J. Bacteriol., 1975, 121(3): 1074-1077.
- McFall, E., J. Bacteriol., 1973, 113(2): 781-785.
- McFall, E., et al., Mol. Gen. Genet., 1970, 106(4): 371-377.
- Miller, J. H., Experiments in molecular genetics, 1972.
- Mobley, H. L., et al., Infect. Immun., 1990, 58: 1281-1289.
- Mothet, J. P., et al., Proc. Natl. Acad. Sci. USA, 2000, 97: 4926-4931.
- Mulvey, M. A., et al., Science, 1998, 282: 1494-1497.
- Mulvey, M. A. et al., Infect Immun., 2001, 69(7): 4572-4579
- Neidhardt, F. C., et al., J. Bacteriol., 1974, 119: 736-747.
- Nelson, K. M., et al., Infect. Immun., 2001, 69: 6201-6208.
- Simon, R., et al., Biotechnology, 1983, 1: 784-791.
- Sokurenko, E. V., et al., Proc. Natl. Acad. Sci. USA, 1998, 95: 8922-8926.
- Torres, A. G., et al., Infect. Immun., 2001, 69: 6179-6185.
- Valdivia, R. H., et al., Science, 1997, 277: 2007-2011.
- Wei, Y., et al., J. Bacteriol., 2001, 183(2): 545-556.
- Weyand, N. J., et al., Mol. Microbiol., 2001, 39(6): 1504-1522.
- Whittam, T. S., et al., Infect. Immun., 1993, 61: 1619-1629.
- Wolosker, H., et al., Proc. Natl. Acad. Sci. USA, 1999b, 96: 721-725.
-
1 18 1 954 DNA E. coli CFT073 CDS (1)..(954) E. coli f-met-tRNA recognizes the triplet “GTG as an initition codon. The first amino acid residue encoded by SEQ ID NO1 should be a methionine residue instead of a valine residue. 1 gtg att atg gaa ccc ctt cgt gaa ata aga aat cga ctg ctt aac ggc 48 Val Ile Met Glu Pro Leu Arg Glu Ile Arg Asn Arg Leu Leu Asn Gly 1 5 10 15 tgg caa cta tca aaa ctg cat act ttt gaa gtg gct gcc agg cat cag 96 Trp Gln Leu Ser Lys Leu His Thr Phe Glu Val Ala Ala Arg His Gln 20 25 30 tcc ttc gct ctc gcg gca gag gaa ttg tcg ctg agc ccc agt gcg gta 144 Ser Phe Ala Leu Ala Ala Glu Glu Leu Ser Leu Ser Pro Ser Ala Val 35 40 45 agt cac cgt atc aat cag ctg gaa gaa gaa tta ggt att cag ttg ttt 192 Ser His Arg Ile Asn Gln Leu Glu Glu Glu Leu Gly Ile Gln Leu Phe 50 55 60 gtt cgt tcc cat cgc aaa gtg gaa tta acg cac gag ggg aaa cgt gtt 240 Val Arg Ser His Arg Lys Val Glu Leu Thr His Glu Gly Lys Arg Val 65 70 75 80 tat tgg gcg cta aaa tcg tcg ctg gat acc ctg aat cag gaa att ctg 288 Tyr Trp Ala Leu Lys Ser Ser Leu Asp Thr Leu Asn Gln Glu Ile Leu 85 90 95 gat atc aaa aat cag gag tta tcg gga acg tta acg ctg tat tcc cgg 336 Asp Ile Lys Asn Gln Glu Leu Ser Gly Thr Leu Thr Leu Tyr Ser Arg 100 105 110 ccc tct atc gcc caa tgc tgg ttg gtg ccc gca tta ggt gac ttt aca 384 Pro Ser Ile Ala Gln Cys Trp Leu Val Pro Ala Leu Gly Asp Phe Thr 115 120 125 cgc cga tat ccg tct att tcg ctc acc gtg ctc act ggt aat gac aat 432 Arg Arg Tyr Pro Ser Ile Ser Leu Thr Val Leu Thr Gly Asn Asp Asn 130 135 140 gtc aat ttg caa cgt gcc gga atc gat ttg gcg atc tac ttt gat gat 480 Val Asn Leu Gln Arg Ala Gly Ile Asp Leu Ala Ile Tyr Phe Asp Asp 145 150 155 160 gcg ccg tca gcg caa ttg gct cat cac ttt ctg atg gat gaa gaa atc 528 Ala Pro Ser Ala Gln Leu Ala His His Phe Leu Met Asp Glu Glu Ile 165 170 175 ttg cca gtt tgc agc ccg gag tac gct caa aga cat gat tta acc aac 576 Leu Pro Val Cys Ser Pro Glu Tyr Ala Gln Arg His Asp Leu Thr Asn 180 185 190 acg gta att aac ctg cgt cac tgt acg ttg ctc cat gac aga cag gca 624 Thr Val Ile Asn Leu Arg His Cys Thr Leu Leu His Asp Arg Gln Ala 195 200 205 tgg agc aac gac tcc ggt acg gat gaa tgg cat agt tgg gcg caa cat 672 Trp Ser Asn Asp Ser Gly Thr Asp Glu Trp His Ser Trp Ala Gln His 210 215 220 tat gcg gtt aat ttg cca aca tct tct gga att ggc ttt gat cgc tct 720 Tyr Ala Val Asn Leu Pro Thr Ser Ser Gly Ile Gly Phe Asp Arg Ser 225 230 235 240 gat tta gcc gtt atc gcc gcg atg aat cat att ggg gtg gcg atg gga 768 Asp Leu Ala Val Ile Ala Ala Met Asn His Ile Gly Val Ala Met Gly 245 250 255 agg aaa cgt atg gta caa aaa agg ctt gcc agt ggt gag ctc gtc gcg 816 Arg Lys Arg Met Val Gln Lys Arg Leu Ala Ser Gly Glu Leu Val Ala 260 265 270 ccg ttt ggc gat atg acg gtg aaa tgc cat cag cat tat tac atc acc 864 Pro Phe Gly Asp Met Thr Val Lys Cys His Gln His Tyr Tyr Ile Thr 275 280 285 aca tta ccg ggc agg cag tgg cca aaa att gag gca ttt att act tgg 912 Thr Leu Pro Gly Arg Gln Trp Pro Lys Ile Glu Ala Phe Ile Thr Trp 290 295 300 tta aga gaa cag gta agt caa tat gaa tgt tat acc tta taa 954 Leu Arg Glu Gln Val Ser Gln Tyr Glu Cys Tyr Thr Leu 305 310 315 2 317 PRT E. coli CFT073 2 Val Ile Met Glu Pro Leu Arg Glu Ile Arg Asn Arg Leu Leu Asn Gly 1 5 10 15 Trp Gln Leu Ser Lys Leu His Thr Phe Glu Val Ala Ala Arg His Gln 20 25 30 Ser Phe Ala Leu Ala Ala Glu Glu Leu Ser Leu Ser Pro Ser Ala Val 35 40 45 Ser His Arg Ile Asn Gln Leu Glu Glu Glu Leu Gly Ile Gln Leu Phe 50 55 60 Val Arg Ser His Arg Lys Val Glu Leu Thr His Glu Gly Lys Arg Val 65 70 75 80 Tyr Trp Ala Leu Lys Ser Ser Leu Asp Thr Leu Asn Gln Glu Ile Leu 85 90 95 Asp Ile Lys Asn Gln Glu Leu Ser Gly Thr Leu Thr Leu Tyr Ser Arg 100 105 110 Pro Ser Ile Ala Gln Cys Trp Leu Val Pro Ala Leu Gly Asp Phe Thr 115 120 125 Arg Arg Tyr Pro Ser Ile Ser Leu Thr Val Leu Thr Gly Asn Asp Asn 130 135 140 Val Asn Leu Gln Arg Ala Gly Ile Asp Leu Ala Ile Tyr Phe Asp Asp 145 150 155 160 Ala Pro Ser Ala Gln Leu Ala His His Phe Leu Met Asp Glu Glu Ile 165 170 175 Leu Pro Val Cys Ser Pro Glu Tyr Ala Gln Arg His Asp Leu Thr Asn 180 185 190 Thr Val Ile Asn Leu Arg His Cys Thr Leu Leu His Asp Arg Gln Ala 195 200 205 Trp Ser Asn Asp Ser Gly Thr Asp Glu Trp His Ser Trp Ala Gln His 210 215 220 Tyr Ala Val Asn Leu Pro Thr Ser Ser Gly Ile Gly Phe Asp Arg Ser 225 230 235 240 Asp Leu Ala Val Ile Ala Ala Met Asn His Ile Gly Val Ala Met Gly 245 250 255 Arg Lys Arg Met Val Gln Lys Arg Leu Ala Ser Gly Glu Leu Val Ala 260 265 270 Pro Phe Gly Asp Met Thr Val Lys Cys His Gln His Tyr Tyr Ile Thr 275 280 285 Thr Leu Pro Gly Arg Gln Trp Pro Lys Ile Glu Ala Phe Ile Thr Trp 290 295 300 Leu Arg Glu Gln Val Ser Gln Tyr Glu Cys Tyr Thr Leu 305 310 315 3 1338 DNA E. coli CFT073 CDS (1)..(1338) 3 atg cac tct caa atc tgg gtt gtg agc acg ctg ctt atc agc atc gtg 48 Met His Ser Gln Ile Trp Val Val Ser Thr Leu Leu Ile Ser Ile Val 1 5 10 15 tta att gta ctg acc atc gtg aag ttc aaa ttc cac ccg ttt ctg gcg 96 Leu Ile Val Leu Thr Ile Val Lys Phe Lys Phe His Pro Phe Leu Ala 20 25 30 ctg ttg ctg gcc agc ttc ttc gtg gga acg atg atg ggc atg ggg cca 144 Leu Leu Leu Ala Ser Phe Phe Val Gly Thr Met Met Gly Met Gly Pro 35 40 45 ctg gat atg gta aat gct att gaa agt gga att ggc gga acg ctg ggg 192 Leu Asp Met Val Asn Ala Ile Glu Ser Gly Ile Gly Gly Thr Leu Gly 50 55 60 ttc ctc gct gcg gtt atc ggc ctt ggc acg ata ctg gga aaa atg atg 240 Phe Leu Ala Ala Val Ile Gly Leu Gly Thr Ile Leu Gly Lys Met Met 65 70 75 80 gaa gta tcc ggg gcc gca gaa aga att ggt ctg aca ctt caa cgc tgc 288 Glu Val Ser Gly Ala Ala Glu Arg Ile Gly Leu Thr Leu Gln Arg Cys 85 90 95 cgg tgg ctt tca gct gat gtc att atg gtg ctg gtt ggc ctg att tgc 336 Arg Trp Leu Ser Ala Asp Val Ile Met Val Leu Val Gly Leu Ile Cys 100 105 110 ggc atc acg ctg ttt gtt gaa gtg ggc gtc gtg cta ttg att cct ctg 384 Gly Ile Thr Leu Phe Val Glu Val Gly Val Val Leu Leu Ile Pro Leu 115 120 125 gct ttt tca att gcc aaa aaa acc aat acc tca ttg tta aag ctg gcc 432 Ala Phe Ser Ile Ala Lys Lys Thr Asn Thr Ser Leu Leu Lys Leu Ala 130 135 140 att ccg ctg tgt acc gca ttg atg gca gtg cac tgc gtg gtt ccc cca 480 Ile Pro Leu Cys Thr Ala Leu Met Ala Val His Cys Val Val Pro Pro 145 150 155 160 cat ccg gct gct tta tat gtt gcc aat aag ctg ggc gca gat atc ggt 528 His Pro Ala Ala Leu Tyr Val Ala Asn Lys Leu Gly Ala Asp Ile Gly 165 170 175 tcg gtg atc gtc tac ggt ttg ctg gtt ggg ctg atg gca tca ctg atc 576 Ser Val Ile Val Tyr Gly Leu Leu Val Gly Leu Met Ala Ser Leu Ile 180 185 190 ggt ggc cca ctt ttc ctt aaa ttt ctg ggt caa cga ctg ccc ttt aaa 624 Gly Gly Pro Leu Phe Leu Lys Phe Leu Gly Gln Arg Leu Pro Phe Lys 195 200 205 cct gta ccc aca gag ttt gcc gat ctc aaa gtt cgc gat gaa aaa aca 672 Pro Val Pro Thr Glu Phe Ala Asp Leu Lys Val Arg Asp Glu Lys Thr 210 215 220 cta ccg tca tta ggc gca acg tta ttc acc gta ctg cta ccc att gcg 720 Leu Pro Ser Leu Gly Ala Thr Leu Phe Thr Val Leu Leu Pro Ile Ala 225 230 235 240 ctg atg ttg gtt aaa acg att gcc gaa ttg aat atg gca cgt gag agt 768 Leu Met Leu Val Lys Thr Ile Ala Glu Leu Asn Met Ala Arg Glu Ser 245 250 255 ggt ttg tat acc ttg ctt gag ttt att ggc aac cct atc act gcc acg 816 Gly Leu Tyr Thr Leu Leu Glu Phe Ile Gly Asn Pro Ile Thr Ala Thr 260 265 270 ttt atc gcc gtg ttt gtc gcc tat tat gtg ttg ggt ata cgt cag cat 864 Phe Ile Ala Val Phe Val Ala Tyr Tyr Val Leu Gly Ile Arg Gln His 275 280 285 atg agc atg ggg acg atg ctc aca cat acg gaa aat ggt ttc ggt tct 912 Met Ser Met Gly Thr Met Leu Thr His Thr Glu Asn Gly Phe Gly Ser 290 295 300 att gct aat att ttg ctg att atc ggg gcc gga ggc gca ttc aac gcc 960 Ile Ala Asn Ile Leu Leu Ile Ile Gly Ala Gly Gly Ala Phe Asn Ala 305 310 315 320 att tta aaa agc agc agt ctc gct gat acg ctg gca gtt att ctc tcc 1008 Ile Leu Lys Ser Ser Ser Leu Ala Asp Thr Leu Ala Val Ile Leu Ser 325 330 335 aat atg cat atg cac ccg att ctt ctg gcc tgg ttg gtg gca ctt att 1056 Asn Met His Met His Pro Ile Leu Leu Ala Trp Leu Val Ala Leu Ile 340 345 350 ctg cat gct gca gtg ggc tcc gct acg gtg gca atg atg ggg gca acg 1104 Leu His Ala Ala Val Gly Ser Ala Thr Val Ala Met Met Gly Ala Thr 355 360 365 gca att gtt gca ccc atg ctg ccg ctg tat ccc gac atc agc ccg gaa 1152 Ala Ile Val Ala Pro Met Leu Pro Leu Tyr Pro Asp Ile Ser Pro Glu 370 375 380 att att gcg att gct atc ggt tcc ggt gcg att ggc tgc acg atc gtt 1200 Ile Ile Ala Ile Ala Ile Gly Ser Gly Ala Ile Gly Cys Thr Ile Val 385 390 395 400 acg gac tca ctt ttc tgg ctg gtg aag caa tat tgc ggc gct acg ctc 1248 Thr Asp Ser Leu Phe Trp Leu Val Lys Gln Tyr Cys Gly Ala Thr Leu 405 410 415 aat gaa aca ttt aaa tac tat acg aca gcg aca ttt atc gct tca gtc 1296 Asn Glu Thr Phe Lys Tyr Tyr Thr Thr Ala Thr Phe Ile Ala Ser Val 420 425 430 atc gct ctg gcg ggc aca ttc ctg ctg tca ttt atc atc taa 1338 Ile Ala Leu Ala Gly Thr Phe Leu Leu Ser Phe Ile Ile 435 440 445 4 445 PRT E. coli CFT073 4 Met His Ser Gln Ile Trp Val Val Ser Thr Leu Leu Ile Ser Ile Val 1 5 10 15 Leu Ile Val Leu Thr Ile Val Lys Phe Lys Phe His Pro Phe Leu Ala 20 25 30 Leu Leu Leu Ala Ser Phe Phe Val Gly Thr Met Met Gly Met Gly Pro 35 40 45 Leu Asp Met Val Asn Ala Ile Glu Ser Gly Ile Gly Gly Thr Leu Gly 50 55 60 Phe Leu Ala Ala Val Ile Gly Leu Gly Thr Ile Leu Gly Lys Met Met 65 70 75 80 Glu Val Ser Gly Ala Ala Glu Arg Ile Gly Leu Thr Leu Gln Arg Cys 85 90 95 Arg Trp Leu Ser Ala Asp Val Ile Met Val Leu Val Gly Leu Ile Cys 100 105 110 Gly Ile Thr Leu Phe Val Glu Val Gly Val Val Leu Leu Ile Pro Leu 115 120 125 Ala Phe Ser Ile Ala Lys Lys Thr Asn Thr Ser Leu Leu Lys Leu Ala 130 135 140 Ile Pro Leu Cys Thr Ala Leu Met Ala Val His Cys Val Val Pro Pro 145 150 155 160 His Pro Ala Ala Leu Tyr Val Ala Asn Lys Leu Gly Ala Asp Ile Gly 165 170 175 Ser Val Ile Val Tyr Gly Leu Leu Val Gly Leu Met Ala Ser Leu Ile 180 185 190 Gly Gly Pro Leu Phe Leu Lys Phe Leu Gly Gln Arg Leu Pro Phe Lys 195 200 205 Pro Val Pro Thr Glu Phe Ala Asp Leu Lys Val Arg Asp Glu Lys Thr 210 215 220 Leu Pro Ser Leu Gly Ala Thr Leu Phe Thr Val Leu Leu Pro Ile Ala 225 230 235 240 Leu Met Leu Val Lys Thr Ile Ala Glu Leu Asn Met Ala Arg Glu Ser 245 250 255 Gly Leu Tyr Thr Leu Leu Glu Phe Ile Gly Asn Pro Ile Thr Ala Thr 260 265 270 Phe Ile Ala Val Phe Val Ala Tyr Tyr Val Leu Gly Ile Arg Gln His 275 280 285 Met Ser Met Gly Thr Met Leu Thr His Thr Glu Asn Gly Phe Gly Ser 290 295 300 Ile Ala Asn Ile Leu Leu Ile Ile Gly Ala Gly Gly Ala Phe Asn Ala 305 310 315 320 Ile Leu Lys Ser Ser Ser Leu Ala Asp Thr Leu Ala Val Ile Leu Ser 325 330 335 Asn Met His Met His Pro Ile Leu Leu Ala Trp Leu Val Ala Leu Ile 340 345 350 Leu His Ala Ala Val Gly Ser Ala Thr Val Ala Met Met Gly Ala Thr 355 360 365 Ala Ile Val Ala Pro Met Leu Pro Leu Tyr Pro Asp Ile Ser Pro Glu 370 375 380 Ile Ile Ala Ile Ala Ile Gly Ser Gly Ala Ile Gly Cys Thr Ile Val 385 390 395 400 Thr Asp Ser Leu Phe Trp Leu Val Lys Gln Tyr Cys Gly Ala Thr Leu 405 410 415 Asn Glu Thr Phe Lys Tyr Tyr Thr Thr Ala Thr Phe Ile Ala Ser Val 420 425 430 Ile Ala Leu Ala Gly Thr Phe Leu Leu Ser Phe Ile Ile 435 440 445 5 1329 DNA E. coli CFT073 CDS (1)..(1329) 5 atg gaa aac gct aaa atg aat tcg ctc atc gcc cag tat ccg ttg gta 48 Met Glu Asn Ala Lys Met Asn Ser Leu Ile Ala Gln Tyr Pro Leu Val 1 5 10 15 gag gat ctg gtt gct ctt aaa gaa acc acc tgg ttt aat cct ggc acg 96 Glu Asp Leu Val Ala Leu Lys Glu Thr Thr Trp Phe Asn Pro Gly Thr 20 25 30 acc tca ttg gca gaa ggt tta cct tat gtt ggc ctg acc gaa cag gat 144 Thr Ser Leu Ala Glu Gly Leu Pro Tyr Val Gly Leu Thr Glu Gln Asp 35 40 45 gtt cag gac gcc cat gcg cgc ctg tct cgt ttt gcg ccg tat ctg gca 192 Val Gln Asp Ala His Ala Arg Leu Ser Arg Phe Ala Pro Tyr Leu Ala 50 55 60 aaa gca ttt cct gaa acg gct gcc gcg ggg ggg att att gaa tca gaa 240 Lys Ala Phe Pro Glu Thr Ala Ala Ala Gly Gly Ile Ile Glu Ser Glu 65 70 75 80 ctg gtt gct att cct gct atg caa aaa cgg ctg gaa aag gaa tat caa 288 Leu Val Ala Ile Pro Ala Met Gln Lys Arg Leu Glu Lys Glu Tyr Gln 85 90 95 caa ccg atc gcc ggg caa ttg cta cta aaa aaa gac agc cat ttg ccg 336 Gln Pro Ile Ala Gly Gln Leu Leu Leu Lys Lys Asp Ser His Leu Pro 100 105 110 att tcc ggc tcc ata aaa gca cgt ggc ggg att tat gaa gtc ctg gcc 384 Ile Ser Gly Ser Ile Lys Ala Arg Gly Gly Ile Tyr Glu Val Leu Ala 115 120 125 cat gct gaa aaa ctg gct ctg gaa gcg ggt ttg ctg aca ctt gag gat 432 His Ala Glu Lys Leu Ala Leu Glu Ala Gly Leu Leu Thr Leu Glu Asp 130 135 140 gat tac agc aaa ctg ctt tcc ccg gag ttt aaa cag ttc ttt agc caa 480 Asp Tyr Ser Lys Leu Leu Ser Pro Glu Phe Lys Gln Phe Phe Ser Gln 145 150 155 160 tac agt att gct gtg gga tca acc gga aat ctg ggg tta tca att ggt 528 Tyr Ser Ile Ala Val Gly Ser Thr Gly Asn Leu Gly Leu Ser Ile Gly 165 170 175 att atg agc gcc cgc atc ggc ttt aaa gta acg gtg cat atg tct gct 576 Ile Met Ser Ala Arg Ile Gly Phe Lys Val Thr Val His Met Ser Ala 180 185 190 gat gcc cgc gcc tgg aaa aaa gcg aaa ctg cgc agt cat ggc gtt act 624 Asp Ala Arg Ala Trp Lys Lys Ala Lys Leu Arg Ser His Gly Val Thr 195 200 205 gtc gtg gaa tac gag caa gat tat ggt gtt gcc gtc gag gaa gga cgt 672 Val Val Glu Tyr Glu Gln Asp Tyr Gly Val Ala Val Glu Glu Gly Arg 210 215 220 aaa gcc gcg cag tct gac ccg aac tgt ttc ttt att gat gac gaa aat 720 Lys Ala Ala Gln Ser Asp Pro Asn Cys Phe Phe Ile Asp Asp Glu Asn 225 230 235 240 tcc cgc acg ttg ttc ctt gga tat tcc gtc gca ggc cag cgt ctt aag 768 Ser Arg Thr Leu Phe Leu Gly Tyr Ser Val Ala Gly Gln Arg Leu Lys 245 250 255 gcg caa ttt gct cag caa ggt cgc atc gtc gat gct gat aac cct ctg 816 Ala Gln Phe Ala Gln Gln Gly Arg Ile Val Asp Ala Asp Asn Pro Leu 260 265 270 ttt gtc tat ctg ccg tgt ggt gtt ggc ggt ggt cct ggt ggc gtc gca 864 Phe Val Tyr Leu Pro Cys Gly Val Gly Gly Gly Pro Gly Gly Val Ala 275 280 285 ttc gga ctt aaa ctg gcg ttt ggc gat cat gtt cac tgc ttt ttt gcc 912 Phe Gly Leu Lys Leu Ala Phe Gly Asp His Val His Cys Phe Phe Ala 290 295 300 gaa cca aca cac tcc ccg tgt atg tta tta ggc gtc cat acc gga tta 960 Glu Pro Thr His Ser Pro Cys Met Leu Leu Gly Val His Thr Gly Leu 305 310 315 320 cac gat cag att tct gtt cag gat att ggc atc gac aac ctt acc gct 1008 His Asp Gln Ile Ser Val Gln Asp Ile Gly Ile Asp Asn Leu Thr Ala 325 330 335 gcc gat ggc ctt gca gtt ggt cgc gca tcg ggc ttt gtc ggg cgg gca 1056 Ala Asp Gly Leu Ala Val Gly Arg Ala Ser Gly Phe Val Gly Arg Ala 340 345 350 atg gag cgt ctg ctg gat ggt ttc tat acc ctt agc gat caa acc atg 1104 Met Glu Arg Leu Leu Asp Gly Phe Tyr Thr Leu Ser Asp Gln Thr Met 355 360 365 tat gac atg ctt ggc tgg ctg gcg cag gaa gaa ggt att cgt ctt gaa 1152 Tyr Asp Met Leu Gly Trp Leu Ala Gln Glu Glu Gly Ile Arg Leu Glu 370 375 380 cca tcg gca ctg gcg ggt atg gcc ggg tct cag cgc gtg tgc gca tca 1200 Pro Ser Ala Leu Ala Gly Met Ala Gly Ser Gln Arg Val Cys Ala Ser 385 390 395 400 gta agt tac caa cag atg cac ggt ttc agc gcc gaa caa ctg cgt aat 1248 Val Ser Tyr Gln Gln Met His Gly Phe Ser Ala Glu Gln Leu Arg Asn 405 410 415 gcc act cat ctg gtg tgg gcg acg gga ggt gga atg gtg ccg gaa gaa 1296 Ala Thr His Leu Val Trp Ala Thr Gly Gly Gly Met Val Pro Glu Glu 420 425 430 gag atg aat caa tat ctg gca aaa ggc cgt taa 1329 Glu Met Asn Gln Tyr Leu Ala Lys Gly Arg 435 440 6 442 PRT E. coli CFT073 6 Met Glu Asn Ala Lys Met Asn Ser Leu Ile Ala Gln Tyr Pro Leu Val 1 5 10 15 Glu Asp Leu Val Ala Leu Lys Glu Thr Thr Trp Phe Asn Pro Gly Thr 20 25 30 Thr Ser Leu Ala Glu Gly Leu Pro Tyr Val Gly Leu Thr Glu Gln Asp 35 40 45 Val Gln Asp Ala His Ala Arg Leu Ser Arg Phe Ala Pro Tyr Leu Ala 50 55 60 Lys Ala Phe Pro Glu Thr Ala Ala Ala Gly Gly Ile Ile Glu Ser Glu 65 70 75 80 Leu Val Ala Ile Pro Ala Met Gln Lys Arg Leu Glu Lys Glu Tyr Gln 85 90 95 Gln Pro Ile Ala Gly Gln Leu Leu Leu Lys Lys Asp Ser His Leu Pro 100 105 110 Ile Ser Gly Ser Ile Lys Ala Arg Gly Gly Ile Tyr Glu Val Leu Ala 115 120 125 His Ala Glu Lys Leu Ala Leu Glu Ala Gly Leu Leu Thr Leu Glu Asp 130 135 140 Asp Tyr Ser Lys Leu Leu Ser Pro Glu Phe Lys Gln Phe Phe Ser Gln 145 150 155 160 Tyr Ser Ile Ala Val Gly Ser Thr Gly Asn Leu Gly Leu Ser Ile Gly 165 170 175 Ile Met Ser Ala Arg Ile Gly Phe Lys Val Thr Val His Met Ser Ala 180 185 190 Asp Ala Arg Ala Trp Lys Lys Ala Lys Leu Arg Ser His Gly Val Thr 195 200 205 Val Val Glu Tyr Glu Gln Asp Tyr Gly Val Ala Val Glu Glu Gly Arg 210 215 220 Lys Ala Ala Gln Ser Asp Pro Asn Cys Phe Phe Ile Asp Asp Glu Asn 225 230 235 240 Ser Arg Thr Leu Phe Leu Gly Tyr Ser Val Ala Gly Gln Arg Leu Lys 245 250 255 Ala Gln Phe Ala Gln Gln Gly Arg Ile Val Asp Ala Asp Asn Pro Leu 260 265 270 Phe Val Tyr Leu Pro Cys Gly Val Gly Gly Gly Pro Gly Gly Val Ala 275 280 285 Phe Gly Leu Lys Leu Ala Phe Gly Asp His Val His Cys Phe Phe Ala 290 295 300 Glu Pro Thr His Ser Pro Cys Met Leu Leu Gly Val His Thr Gly Leu 305 310 315 320 His Asp Gln Ile Ser Val Gln Asp Ile Gly Ile Asp Asn Leu Thr Ala 325 330 335 Ala Asp Gly Leu Ala Val Gly Arg Ala Ser Gly Phe Val Gly Arg Ala 340 345 350 Met Glu Arg Leu Leu Asp Gly Phe Tyr Thr Leu Ser Asp Gln Thr Met 355 360 365 Tyr Asp Met Leu Gly Trp Leu Ala Gln Glu Glu Gly Ile Arg Leu Glu 370 375 380 Pro Ser Ala Leu Ala Gly Met Ala Gly Ser Gln Arg Val Cys Ala Ser 385 390 395 400 Val Ser Tyr Gln Gln Met His Gly Phe Ser Ala Glu Gln Leu Arg Asn 405 410 415 Ala Thr His Leu Val Trp Ala Thr Gly Gly Gly Met Val Pro Glu Glu 420 425 430 Glu Met Asn Gln Tyr Leu Ala Lys Gly Arg 435 440 7 630 DNA E. coli CFT073 CDS (1)..(630) 7 atg caa aac aga aaa ttt tta acg cat cat gaa att aat tta cta ctt 48 Met Gln Asn Arg Lys Phe Leu Thr His His Glu Ile Asn Leu Leu Leu 1 5 10 15 caa tct gta aaa caa aag agt tgc tct tca cgc gat gtt tgc atg att 96 Gln Ser Val Lys Gln Lys Ser Cys Ser Ser Arg Asp Val Cys Met Ile 20 25 30 tta ttg gct tat ttt cat ggt ctg cgt gtt agt gag ctc tta tct ctg 144 Leu Leu Ala Tyr Phe His Gly Leu Arg Val Ser Glu Leu Leu Ser Leu 35 40 45 cag ttg tcg gat tta gaa cta act aca gaa aaa ata tat att caa cgg 192 Gln Leu Ser Asp Leu Glu Leu Thr Thr Glu Lys Ile Tyr Ile Gln Arg 50 55 60 ata aaa aat gga ttc agt aca gtc cat cca ctt caa aag gaa gag gtg 240 Ile Lys Asn Gly Phe Ser Thr Val His Pro Leu Gln Lys Glu Glu Val 65 70 75 80 att gca ata aca aac tgg ctg aat gaa agg aat tca tta aat gtt aaa 288 Ile Ala Ile Thr Asn Trp Leu Asn Glu Arg Asn Ser Leu Asn Val Lys 85 90 95 cat ttc aat gat aac ccg tgg tta ttt gtt tcc cga aca gga aaa cca 336 His Phe Asn Asp Asn Pro Trp Leu Phe Val Ser Arg Thr Gly Lys Pro 100 105 110 tta tcg aga caa cga ttt tat aac ata gtt tct gct gcg ggt aaa aat 384 Leu Ser Arg Gln Arg Phe Tyr Asn Ile Val Ser Ala Ala Gly Lys Asn 115 120 125 gca ggt tta aat att aaa gtt cat cca cat atg tta cgc cat gca tgt 432 Ala Gly Leu Asn Ile Lys Val His Pro His Met Leu Arg His Ala Cys 130 135 140 ggt tat tca cta gca gat aat ggt gtg gat acc cgt ctt att cag gac 480 Gly Tyr Ser Leu Ala Asp Asn Gly Val Asp Thr Arg Leu Ile Gln Asp 145 150 155 160 tat ctc ggg cat cgt aat atc agg cat acg gta att tat act gct tct 528 Tyr Leu Gly His Arg Asn Ile Arg His Thr Val Ile Tyr Thr Ala Ser 165 170 175 aat tca atg aga ttt gaa aaa atg tgg ggg ata ggt gac gca aaa aag 576 Asn Ser Met Arg Phe Glu Lys Met Trp Gly Ile Gly Asp Ala Lys Lys 180 185 190 caa cat ttt gac cca aaa tgt aaa ccc aat ctt tgt tta gaa att ctc 624 Gln His Phe Asp Pro Lys Cys Lys Pro Asn Leu Cys Leu Glu Ile Leu 195 200 205 gta tag 630 Val 210 8 209 PRT E. coli CFT073 8 Met Gln Asn Arg Lys Phe Leu Thr His His Glu Ile Asn Leu Leu Leu 1 5 10 15 Gln Ser Val Lys Gln Lys Ser Cys Ser Ser Arg Asp Val Cys Met Ile 20 25 30 Leu Leu Ala Tyr Phe His Gly Leu Arg Val Ser Glu Leu Leu Ser Leu 35 40 45 Gln Leu Ser Asp Leu Glu Leu Thr Thr Glu Lys Ile Tyr Ile Gln Arg 50 55 60 Ile Lys Asn Gly Phe Ser Thr Val His Pro Leu Gln Lys Glu Glu Val 65 70 75 80 Ile Ala Ile Thr Asn Trp Leu Asn Glu Arg Asn Ser Leu Asn Val Lys 85 90 95 His Phe Asn Asp Asn Pro Trp Leu Phe Val Ser Arg Thr Gly Lys Pro 100 105 110 Leu Ser Arg Gln Arg Phe Tyr Asn Ile Val Ser Ala Ala Gly Lys Asn 115 120 125 Ala Gly Leu Asn Ile Lys Val His Pro His Met Leu Arg His Ala Cys 130 135 140 Gly Tyr Ser Leu Ala Asp Asn Gly Val Asp Thr Arg Leu Ile Gln Asp 145 150 155 160 Tyr Leu Gly His Arg Asn Ile Arg His Thr Val Ile Tyr Thr Ala Ser 165 170 175 Asn Ser Met Arg Phe Glu Lys Met Trp Gly Ile Gly Asp Ala Lys Lys 180 185 190 Gln His Phe Asp Pro Lys Cys Lys Pro Asn Leu Cys Leu Glu Ile Leu 195 200 205 Val 9 570 DNA E. coli CFT073 CDS (1)..(570) 9 atg cgc aaa ttt att act cat agt gaa tgg ttg tta ttt ttt gaa gcc 48 Met Arg Lys Phe Ile Thr His Ser Glu Trp Leu Leu Phe Phe Glu Ala 1 5 10 15 atc aac ggt tca aaa aat gaa att aga gat aaa gcc atg tta caa atg 96 Ile Asn Gly Ser Lys Asn Glu Ile Arg Asp Lys Ala Met Leu Gln Met 20 25 30 gca tat gta cat ggt ctt agg gta agc gag ctt att gca cta aag att 144 Ala Tyr Val His Gly Leu Arg Val Ser Glu Leu Ile Ala Leu Lys Ile 35 40 45 agc gac att gac ttt tca gag tca gca ata tat atc aag cgt tta aaa 192 Ser Asp Ile Asp Phe Ser Glu Ser Ala Ile Tyr Ile Lys Arg Leu Lys 50 55 60 aat gga tta tcc aca gta cat ccc ctg caa aaa gaa act gtt ctg tta 240 Asn Gly Leu Ser Thr Val His Pro Leu Gln Lys Glu Thr Val Leu Leu 65 70 75 80 tta aaa aaa tgg cta gca cta cgt gat aac ata gta aaa aaa cct ttt 288 Leu Lys Lys Trp Leu Ala Leu Arg Asp Asn Ile Val Lys Lys Pro Phe 85 90 95 gaa gac tct ttg ttt ctt tct tgt caa gga aat aaa atc tcg aga caa 336 Glu Asp Ser Leu Phe Leu Ser Cys Gln Gly Asn Lys Ile Ser Arg Gln 100 105 110 tat gtg tat aag atg tgt aag aaa tat agt cac aat atg aat att aat 384 Tyr Val Tyr Lys Met Cys Lys Lys Tyr Ser His Asn Met Asn Ile Asn 115 120 125 att cat cct cac atg ctt agg cat ggt tgt ggg tat gct tta gct aat 432 Ile His Pro His Met Leu Arg His Gly Cys Gly Tyr Ala Leu Ala Asn 130 135 140 caa ggg tta gac acc aga cta ata caa gat tat tta ggg cac aga aat 480 Gln Gly Leu Asp Thr Arg Leu Ile Gln Asp Tyr Leu Gly His Arg Asn 145 150 155 160 ata cat cat acg gta tta tat aca gcc agt aat gca gca aga ttt aaa 528 Ile His His Thr Val Leu Tyr Thr Ala Ser Asn Ala Ala Arg Phe Lys 165 170 175 cga gta tgg gag ggt gat gtg tta gat ata aag aag ata taa 570 Arg Val Trp Glu Gly Asp Val Leu Asp Ile Lys Lys Ile 180 185 190 10 189 PRT E. coli CFT073 10 Met Arg Lys Phe Ile Thr His Ser Glu Trp Leu Leu Phe Phe Glu Ala 1 5 10 15 Ile Asn Gly Ser Lys Asn Glu Ile Arg Asp Lys Ala Met Leu Gln Met 20 25 30 Ala Tyr Val His Gly Leu Arg Val Ser Glu Leu Ile Ala Leu Lys Ile 35 40 45 Ser Asp Ile Asp Phe Ser Glu Ser Ala Ile Tyr Ile Lys Arg Leu Lys 50 55 60 Asn Gly Leu Ser Thr Val His Pro Leu Gln Lys Glu Thr Val Leu Leu 65 70 75 80 Leu Lys Lys Trp Leu Ala Leu Arg Asp Asn Ile Val Lys Lys Pro Phe 85 90 95 Glu Asp Ser Leu Phe Leu Ser Cys Gln Gly Asn Lys Ile Ser Arg Gln 100 105 110 Tyr Val Tyr Lys Met Cys Lys Lys Tyr Ser His Asn Met Asn Ile Asn 115 120 125 Ile His Pro His Met Leu Arg His Gly Cys Gly Tyr Ala Leu Ala Asn 130 135 140 Gln Gly Leu Asp Thr Arg Leu Ile Gln Asp Tyr Leu Gly His Arg Asn 145 150 155 160 Ile His His Thr Val Leu Tyr Thr Ala Ser Asn Ala Ala Arg Phe Lys 165 170 175 Arg Val Trp Glu Gly Asp Val Leu Asp Ile Lys Lys Ile 180 185 11 20 DNA Artificial Sequence Description of Artificial Sequence oligonucleotide primer 11 ggagtggcga tgctgcgttg 20 12 22 DNA Artificial Sequence Description of Artificial Sequence oligonucleotide primer 12 cacaggggaa ggtgagatat gc 22 13 6190 DNA E. coli CFT073 CDS (3507)..(4844) permease 13 ctatacgaga atttctaaac aaagattggg tttacatttt gggtcaaaat gttgcttttt 60 tgcgtcacct atcccccaca ttttttcaaa tctcattgaa ttagaagcag tataaattac 120 cgtatgcctg atattacgat gcccgagata gtcctgaata agacgggtat ccacaccatt 180 atctgctagt gaataaccac atgcatggcg taacatatgt ggatgaactt taatatttaa 240 acctgcattt ttacccgcag cagaaactat gttataaaat cgttgtctcg ataatggttt 300 tcctgttcgg gaaacaaata accacgggtt atcattgaaa tgtttaacat ttaatgaatt 360 cctttcattc agccagtttg ttattgcaat cacctcttcc ttttgaagtg gatggactgt 420 actgaatcca ttttttatcc gttgaatata tattttttct gtagttagtt ctaaatccga 480 caactgcaga gataagagct cactaacacg cagaccatga aaataagcca ataaaatcat 540 gcaaacatcg cgtgaagagc aactcttttg ttttacagat tgaagtagta aattaatttc 600 atgatgcgtt aaaaattttc tgttttgcat aaatcacctc tatttatatg aaatagccat 660 gtcaactatg atgaaagaat tacatttagt caataatata attcatatag ttgctgtatg 720 attatagatt aatttattaa aattcaccta atctaattaa tcgcttgtaa acgcatataa 780 acaaccttcg ttatgtaact atcgttcaaa actctcttgt aaatataaag agccctataa 840 gatgattata aaaaagtcct acaacccgaa atagcattcc ctcataaaat taagctcaac 900 atcggcatat ttaaaaagta tccaatgcaa ttaatatttt atctaaaatt aacattgtat 960 cttgtatata ttacatccat gaatgtatgg atttttttat ttaaaaatta atgcactata 1020 acaaaaaaca tcaccaagtg ttaaaactaa cacaacaatt aaccaaaata atatggctgt 1080 ttatttttca tccgtaaaca aaaaggtaca aatattttca ccacaaaaaa accaccagca 1140 tggcattgaa ttaccattca caatacacac ctcctttcgc ttatttatgc tcattggttg 1200 actaataatg atctataccc cctgggcaac tatctctctg gcggggtatt gatgaatatc 1260 catttagtgg cataatttaa accccagctc attatttgct tataaaaaca agcaataata 1320 ccaggctctg gtatttatag ttcatttttt aacttaatta atataaccga taatatt 1377 atg cgc aaa ttt att act cat agt gaa tgg ttg tta ttt ttt gaa gcc 1425 Met Arg Lys Phe Ile Thr His Ser Glu Trp Leu Leu Phe Phe Glu Ala 1 5 10 15 atc aac ggt tca aaa aat gaa att aga gat aaa gcc atg tta caa atg 1473 Ile Asn Gly Ser Lys Asn Glu Ile Arg Asp Lys Ala Met Leu Gln Met 20 25 30 gca tat gta cat ggt ctt agg gta agc gag ctt att gca cta aag att 1521 Ala Tyr Val His Gly Leu Arg Val Ser Glu Leu Ile Ala Leu Lys Ile 35 40 45 agc gac att gac ttt tca gag tca gca ata tat atc aag cgt tta aaa 1569 Ser Asp Ile Asp Phe Ser Glu Ser Ala Ile Tyr Ile Lys Arg Leu Lys 50 55 60 aat gga tta tcc aca gta cat ccc ctg caa aaa gaa act gtt ctg tta 1617 Asn Gly Leu Ser Thr Val His Pro Leu Gln Lys Glu Thr Val Leu Leu 65 70 75 80 tta aaa aaa tgg cta gca cta cgt gat aac ata gta aaa aaa cct ttt 1665 Leu Lys Lys Trp Leu Ala Leu Arg Asp Asn Ile Val Lys Lys Pro Phe 85 90 95 gaa gac tct ttg ttt ctt tct tgt caa gga aat aaa atc tcg aga caa 1713 Glu Asp Ser Leu Phe Leu Ser Cys Gln Gly Asn Lys Ile Ser Arg Gln 100 105 110 tat gtg tat aag atg tgt aag aaa tat agt cac aat atg aat att aat 1761 Tyr Val Tyr Lys Met Cys Lys Lys Tyr Ser His Asn Met Asn Ile Asn 115 120 125 att cat cct cac atg ctt agg cat ggt tgt ggg tat gct tta gct aat 1809 Ile His Pro His Met Leu Arg His Gly Cys Gly Tyr Ala Leu Ala Asn 130 135 140 caa ggg tta gac acc aga cta ata caa gat tat tta ggg cac aga aat 1857 Gln Gly Leu Asp Thr Arg Leu Ile Gln Asp Tyr Leu Gly His Arg Asn 145 150 155 160 ata cat cat acg gta tta tat aca gcc agt aat gca gca aga ttt aaa 1905 Ile His His Thr Val Leu Tyr Thr Ala Ser Asn Ala Ala Arg Phe Lys 165 170 175 cga gta tgg gag ggt gat gtg tta gat ata aag aag ata taa 1947 Arg Val Trp Glu Gly Asp Val Leu Asp Ile Lys Lys Ile 180 185 190 cccctgttct aaaggcctca attcattgag gcttttagcc cctcattttt atgacataca 2007 actgataatc ataatggatc tcaatattaa tcaaaatgag aaataaatca cctttttatt 2067 aaaaaattaa tttttttaaa gtcacaaacc aacacatata atatttaata aaaatgccac 2127 cagcaaatat attgccactt atcttttctc tggtaatcta gtttactata aattatcgct 2187 ctccaatatt cgacgccaag aagaagtggt tattttattt atccttttcg taactattga 2247 caaaacaatt cccatataaa aattgcaatt tataaagcca acatacactt acagtgcata 2307 ataaggttat ttaggaacca gatgatttaa tgtattataa ggtataacat tcatattgac 2367 ttacctgttc tcttaaccaa gtaataaatg cctcaatttt tggccactgc ctgcccggta 2427 atgtggtgat gtaataatgc tgatggcatt tcaccgtcat atcgccaaac ggcgcgacga 2487 gctcaccact ggcaagcctt ttttgtacca tacgtttcct tcccatcgcc accccaatat 2547 gattcatcgc ggcgataacg gctaaatcag agcgatcaaa gccaattcca gaagatgttg 2607 gcaaattaac cgcataatgt tgcgcccaac tatgccattc atccgtaccg gagtcgttgc 2667 tccatgcctg tctgtcatgg agcaacgtac agtgacgcag gttaattacc gtgttggtta 2727 aatcatgtct ttgagcgtac tccgggctgc aaactggcaa gatttcttca tccatcagaa 2787 agtgatgagc caattgcgct gacggcgcat catcaaagta gatcgccaaa tcgattccgg 2847 cacgttgcaa attgacattg tcattaccag tgagcacggt gagcgaaata gacggatatc 2907 ggcgtgtaaa gtcacctaat gcgggcacca accagcattg ggcgatagag ggccgggaat 2967 acagcgttaa cgttcccgat aactcctgat ttttgatatc cagaatttcc tgattcaggg 3027 tatccagcga cgattttagc gcccaataaa cacgtttccc ctcgtgcgtt aattccactt 3087 tgcgatggga acgaacaaac aactgaatac ctaattcttc ttccagctga ttgatacggt 3147 gacttaccgc actggggctc agcgacaatt cctctgccgc gagagcgaag gactgatgcc 3207 tggcagccac ttcaaaagta tgcagttttg atagttgcca gccgttaagc agtcgatttc 3267 ttatttcacg aaggggttcc ataatcacct catttttctc ttaagtgtaa aaaaatagcg 3327 gcaaaatttc agctgtaagg tgagctaaag tgaaccatat ctcaattcac cttcattttt 3387 agatgtaaat cactcccttg atgcaattta gctcatgtga aaggcaaatt ttatcgtttg 3447 tcagcctgcg ttgttttttt gtccaatatc atcaggttaa tcacagggga aggtgagat 3506 atg cac tct caa atc tgg gtt gtg agc acg ctg ctt atc agc atc gtg 3554 Met His Ser Gln Ile Trp Val Val Ser Thr Leu Leu Ile Ser Ile Val 195 200 205 tta att gta ctg acc atc gtg aag ttc aaa ttc cac ccg ttt ctg gcg 3602 Leu Ile Val Leu Thr Ile Val Lys Phe Lys Phe His Pro Phe Leu Ala 210 215 220 ctg ttg ctg gcc agc ttc ttc gtg gga acg atg atg ggc atg ggg cca 3650 Leu Leu Leu Ala Ser Phe Phe Val Gly Thr Met Met Gly Met Gly Pro 225 230 235 ctg gat atg gta aat gct att gaa agt gga att ggc gga acg ctg ggg 3698 Leu Asp Met Val Asn Ala Ile Glu Ser Gly Ile Gly Gly Thr Leu Gly 240 245 250 ttc ctc gct gcg gtt atc ggc ctt ggc acg ata ctg gga aaa atg atg 3746 Phe Leu Ala Ala Val Ile Gly Leu Gly Thr Ile Leu Gly Lys Met Met 255 260 265 270 gaa gta tcc ggg gcc gca gaa aga att ggt ctg aca ctt caa cgc tgc 3794 Glu Val Ser Gly Ala Ala Glu Arg Ile Gly Leu Thr Leu Gln Arg Cys 275 280 285 cgg tgg ctt tca gct gat gtc att atg gtg ctg gtt ggc ctg att tgc 3842 Arg Trp Leu Ser Ala Asp Val Ile Met Val Leu Val Gly Leu Ile Cys 290 295 300 ggc atc acg ctg ttt gtt gaa gtg ggc gtc gtg cta ttg att cct ctg 3890 Gly Ile Thr Leu Phe Val Glu Val Gly Val Val Leu Leu Ile Pro Leu 305 310 315 gct ttt tca att gcc aaa aaa acc aat acc tca ttg tta aag ctg gcc 3938 Ala Phe Ser Ile Ala Lys Lys Thr Asn Thr Ser Leu Leu Lys Leu Ala 320 325 330 att ccg ctg tgt acc gca ttg atg gca gtg cac tgc gtg gtt ccc cca 3986 Ile Pro Leu Cys Thr Ala Leu Met Ala Val His Cys Val Val Pro Pro 335 340 345 350 cat ccg gct gct tta tat gtt gcc aat aag ctg ggc gca gat atc ggt 4034 His Pro Ala Ala Leu Tyr Val Ala Asn Lys Leu Gly Ala Asp Ile Gly 355 360 365 tcg gtg atc gtc tac ggt ttg ctg gtt ggg ctg atg gca tca ctg atc 4082 Ser Val Ile Val Tyr Gly Leu Leu Val Gly Leu Met Ala Ser Leu Ile 370 375 380 ggt ggc cca ctt ttc ctt aaa ttt ctg ggt caa cga ctg ccc ttt aaa 4130 Gly Gly Pro Leu Phe Leu Lys Phe Leu Gly Gln Arg Leu Pro Phe Lys 385 390 395 cct gta ccc aca gag ttt gcc gat ctc aaa gtt cgc gat gaa aaa aca 4178 Pro Val Pro Thr Glu Phe Ala Asp Leu Lys Val Arg Asp Glu Lys Thr 400 405 410 cta ccg tca tta ggc gca acg tta ttc acc gta ctg cta ccc att gcg 4226 Leu Pro Ser Leu Gly Ala Thr Leu Phe Thr Val Leu Leu Pro Ile Ala 415 420 425 430 ctg atg ttg gtt aaa acg att gcc gaa ttg aat atg gca cgt gag agt 4274 Leu Met Leu Val Lys Thr Ile Ala Glu Leu Asn Met Ala Arg Glu Ser 435 440 445 ggt ttg tat acc ttg ctt gag ttt att ggc aac cct atc act gcc acg 4322 Gly Leu Tyr Thr Leu Leu Glu Phe Ile Gly Asn Pro Ile Thr Ala Thr 450 455 460 ttt atc gcc gtg ttt gtc gcc tat tat gtg ttg ggt ata cgt cag cat 4370 Phe Ile Ala Val Phe Val Ala Tyr Tyr Val Leu Gly Ile Arg Gln His 465 470 475 atg agc atg ggg acg atg ctc aca cat acg gaa aat ggt ttc ggt tct 4418 Met Ser Met Gly Thr Met Leu Thr His Thr Glu Asn Gly Phe Gly Ser 480 485 490 att gct aat att ttg ctg att atc ggg gcc gga ggc gca ttc aac gcc 4466 Ile Ala Asn Ile Leu Leu Ile Ile Gly Ala Gly Gly Ala Phe Asn Ala 495 500 505 510 att tta aaa agc agc agt ctc gct gat acg ctg gca gtt att ctc tcc 4514 Ile Leu Lys Ser Ser Ser Leu Ala Asp Thr Leu Ala Val Ile Leu Ser 515 520 525 aat atg cat atg cac ccg att ctt ctg gcc tgg ttg gtg gca ctt att 4562 Asn Met His Met His Pro Ile Leu Leu Ala Trp Leu Val Ala Leu Ile 530 535 540 ctg cat gct gca gtg ggc tcc gct acg gtg gca atg atg ggg gca acg 4610 Leu His Ala Ala Val Gly Ser Ala Thr Val Ala Met Met Gly Ala Thr 545 550 555 gca att gtt gca ccc atg ctg ccg ctg tat ccc gac atc agc ccg gaa 4658 Ala Ile Val Ala Pro Met Leu Pro Leu Tyr Pro Asp Ile Ser Pro Glu 560 565 570 att att gcg att gct atc ggt tcc ggt gcg att ggc tgc acg atc gtt 4706 Ile Ile Ala Ile Ala Ile Gly Ser Gly Ala Ile Gly Cys Thr Ile Val 575 580 585 590 acg gac tca ctt ttc tgg ctg gtg aag caa tat tgc ggc gct acg ctc 4754 Thr Asp Ser Leu Phe Trp Leu Val Lys Gln Tyr Cys Gly Ala Thr Leu 595 600 605 aat gaa aca ttt aaa tac tat acg aca gcg aca ttt atc gct tca gtc 4802 Asn Glu Thr Phe Lys Tyr Tyr Thr Thr Ala Thr Phe Ile Ala Ser Val 610 615 620 atc gct ctg gcg ggc aca ttc ctg ctg tca ttt atc atc taa 4844 Ile Ala Leu Ala Gly Thr Phe Leu Leu Ser Phe Ile Ile 625 630 635 gcgcaaagag acgtact atg gaa aac gct aaa atg aat tcg ctc atc gcc 4894 Met Glu Asn Ala Lys Met Asn Ser Leu Ile Ala 640 645 cag tat ccg ttg gta gag gat ctg gtt gct ctt aaa gaa acc acc tgg 4942 Gln Tyr Pro Leu Val Glu Asp Leu Val Ala Leu Lys Glu Thr Thr Trp 650 655 660 ttt aat cct ggc acg acc tca ttg gca gaa ggt tta cct tat gtt ggc 4990 Phe Asn Pro Gly Thr Thr Ser Leu Ala Glu Gly Leu Pro Tyr Val Gly 665 670 675 ctg acc gaa cag gat gtt cag gac gcc cat gcg cgc ctg tct cgt ttt 5038 Leu Thr Glu Gln Asp Val Gln Asp Ala His Ala Arg Leu Ser Arg Phe 680 685 690 695 gcg ccg tat ctg gca aaa gca ttt cct gaa acg gct gcc gcg ggg ggg 5086 Ala Pro Tyr Leu Ala Lys Ala Phe Pro Glu Thr Ala Ala Ala Gly Gly 700 705 710 att att gaa tca gaa ctg gtt gct att cct gct atg caa aaa cgg ctg 5134 Ile Ile Glu Ser Glu Leu Val Ala Ile Pro Ala Met Gln Lys Arg Leu 715 720 725 gaa aag gaa tat caa caa ccg atc gcc ggg caa ttg cta cta aaa aaa 5182 Glu Lys Glu Tyr Gln Gln Pro Ile Ala Gly Gln Leu Leu Leu Lys Lys 730 735 740 gac agc cat ttg ccg att tcc ggc tcc ata aaa gca cgt ggc ggg att 5230 Asp Ser His Leu Pro Ile Ser Gly Ser Ile Lys Ala Arg Gly Gly Ile 745 750 755 tat gaa gtc ctg gcc cat gct gaa aaa ctg gct ctg gaa gcg ggt ttg 5278 Tyr Glu Val Leu Ala His Ala Glu Lys Leu Ala Leu Glu Ala Gly Leu 760 765 770 775 ctg aca ctt gag gat gat tac agc aaa ctg ctt tcc ccg gag ttt aaa 5326 Leu Thr Leu Glu Asp Asp Tyr Ser Lys Leu Leu Ser Pro Glu Phe Lys 780 785 790 cag ttc ttt agc caa tac agt att gct gtg gga tca acc gga aat ctg 5374 Gln Phe Phe Ser Gln Tyr Ser Ile Ala Val Gly Ser Thr Gly Asn Leu 795 800 805 ggg tta tca att ggt att atg agc gcc cgc atc ggc ttt aaa gta acg 5422 Gly Leu Ser Ile Gly Ile Met Ser Ala Arg Ile Gly Phe Lys Val Thr 810 815 820 gtg cat atg tct gct gat gcc cgc gcc tgg aaa aaa gcg aaa ctg cgc 5470 Val His Met Ser Ala Asp Ala Arg Ala Trp Lys Lys Ala Lys Leu Arg 825 830 835 agt cat ggc gtt act gtc gtg gaa tac gag caa gat tat ggt gtt gcc 5518 Ser His Gly Val Thr Val Val Glu Tyr Glu Gln Asp Tyr Gly Val Ala 840 845 850 855 gtc gag gaa gga cgt aaa gcc gcg cag tct gac ccg aac tgt ttc ttt 5566 Val Glu Glu Gly Arg Lys Ala Ala Gln Ser Asp Pro Asn Cys Phe Phe 860 865 870 att gat gac gaa aat tcc cgc acg ttg ttc ctt gga tat tcc gtc gca 5614 Ile Asp Asp Glu Asn Ser Arg Thr Leu Phe Leu Gly Tyr Ser Val Ala 875 880 885 ggc cag cgt ctt aag gcg caa ttt gct cag caa ggt cgc atc gtc gat 5662 Gly Gln Arg Leu Lys Ala Gln Phe Ala Gln Gln Gly Arg Ile Val Asp 890 895 900 gct gat aac cct ctg ttt gtc tat ctg ccg tgt ggt gtt ggc ggt ggt 5710 Ala Asp Asn Pro Leu Phe Val Tyr Leu Pro Cys Gly Val Gly Gly Gly 905 910 915 cct ggt ggc gtc gca ttc gga ctt aaa ctg gcg ttt ggc gat cat gtt 5758 Pro Gly Gly Val Ala Phe Gly Leu Lys Leu Ala Phe Gly Asp His Val 920 925 930 935 cac tgc ttt ttt gcc gaa cca aca cac tcc ccg tgt atg tta tta ggc 5806 His Cys Phe Phe Ala Glu Pro Thr His Ser Pro Cys Met Leu Leu Gly 940 945 950 gtc cat acc gga tta cac gat cag att tct gtt cag gat att ggc atc 5854 Val His Thr Gly Leu His Asp Gln Ile Ser Val Gln Asp Ile Gly Ile 955 960 965 gac aac ctt acc gct gcc gat ggc ctt gca gtt ggt cgc gca tcg ggc 5902 Asp Asn Leu Thr Ala Ala Asp Gly Leu Ala Val Gly Arg Ala Ser Gly 970 975 980 ttt gtc ggg cgg gca atg gag cgt ctg ctg gat ggt ttc tat acc ctt 5950 Phe Val Gly Arg Ala Met Glu Arg Leu Leu Asp Gly Phe Tyr Thr Leu 985 990 995 agc gat caa acc atg tat gac atg ctt ggc tgg ctg gcg cag gaa gaa 5998 Ser Asp Gln Thr Met Tyr Asp Met Leu Gly Trp Leu Ala Gln Glu Glu 1000 1005 1010 1015 ggt att cgt ctt gaa cca tcg gca ctg gcg ggt atg gcc ggg tct cag 6046 Gly Ile Arg Leu Glu Pro Ser Ala Leu Ala Gly Met Ala Gly Ser Gln 1020 1025 1030 cgc gtg tgc gca tca gta agt tac caa cag atg cac ggt ttc agc gcc 6094 Arg Val Cys Ala Ser Val Ser Tyr Gln Gln Met His Gly Phe Ser Ala 1035 1040 1045 gaa caa ctg cgt aat gcc act cat ctg gtg tgg gcg acg gga ggt gga 6142 Glu Gln Leu Arg Asn Ala Thr His Leu Val Trp Ala Thr Gly Gly Gly 1050 1055 1060 atg gtg ccg gaa gaa gag atg aat caa tat ctg gca aaa ggc cgt taa 6190 Met Val Pro Glu Glu Glu Met Asn Gln Tyr Leu Ala Lys Gly Arg 1065 1070 1075 14 189 PRT E. coli CFT073 14 Met Arg Lys Phe Ile Thr His Ser Glu Trp Leu Leu Phe Phe Glu Ala 1 5 10 15 Ile Asn Gly Ser Lys Asn Glu Ile Arg Asp Lys Ala Met Leu Gln Met 20 25 30 Ala Tyr Val His Gly Leu Arg Val Ser Glu Leu Ile Ala Leu Lys Ile 35 40 45 Ser Asp Ile Asp Phe Ser Glu Ser Ala Ile Tyr Ile Lys Arg Leu Lys 50 55 60 Asn Gly Leu Ser Thr Val His Pro Leu Gln Lys Glu Thr Val Leu Leu 65 70 75 80 Leu Lys Lys Trp Leu Ala Leu Arg Asp Asn Ile Val Lys Lys Pro Phe 85 90 95 Glu Asp Ser Leu Phe Leu Ser Cys Gln Gly Asn Lys Ile Ser Arg Gln 100 105 110 Tyr Val Tyr Lys Met Cys Lys Lys Tyr Ser His Asn Met Asn Ile Asn 115 120 125 Ile His Pro His Met Leu Arg His Gly Cys Gly Tyr Ala Leu Ala Asn 130 135 140 Gln Gly Leu Asp Thr Arg Leu Ile Gln Asp Tyr Leu Gly His Arg Asn 145 150 155 160 Ile His His Thr Val Leu Tyr Thr Ala Ser Asn Ala Ala Arg Phe Lys 165 170 175 Arg Val Trp Glu Gly Asp Val Leu Asp Ile Lys Lys Ile 180 185 15 445 PRT E. coli CFT073 15 Met His Ser Gln Ile Trp Val Val Ser Thr Leu Leu Ile Ser Ile Val 1 5 10 15 Leu Ile Val Leu Thr Ile Val Lys Phe Lys Phe His Pro Phe Leu Ala 20 25 30 Leu Leu Leu Ala Ser Phe Phe Val Gly Thr Met Met Gly Met Gly Pro 35 40 45 Leu Asp Met Val Asn Ala Ile Glu Ser Gly Ile Gly Gly Thr Leu Gly 50 55 60 Phe Leu Ala Ala Val Ile Gly Leu Gly Thr Ile Leu Gly Lys Met Met 65 70 75 80 Glu Val Ser Gly Ala Ala Glu Arg Ile Gly Leu Thr Leu Gln Arg Cys 85 90 95 Arg Trp Leu Ser Ala Asp Val Ile Met Val Leu Val Gly Leu Ile Cys 100 105 110 Gly Ile Thr Leu Phe Val Glu Val Gly Val Val Leu Leu Ile Pro Leu 115 120 125 Ala Phe Ser Ile Ala Lys Lys Thr Asn Thr Ser Leu Leu Lys Leu Ala 130 135 140 Ile Pro Leu Cys Thr Ala Leu Met Ala Val His Cys Val Val Pro Pro 145 150 155 160 His Pro Ala Ala Leu Tyr Val Ala Asn Lys Leu Gly Ala Asp Ile Gly 165 170 175 Ser Val Ile Val Tyr Gly Leu Leu Val Gly Leu Met Ala Ser Leu Ile 180 185 190 Gly Gly Pro Leu Phe Leu Lys Phe Leu Gly Gln Arg Leu Pro Phe Lys 195 200 205 Pro Val Pro Thr Glu Phe Ala Asp Leu Lys Val Arg Asp Glu Lys Thr 210 215 220 Leu Pro Ser Leu Gly Ala Thr Leu Phe Thr Val Leu Leu Pro Ile Ala 225 230 235 240 Leu Met Leu Val Lys Thr Ile Ala Glu Leu Asn Met Ala Arg Glu Ser 245 250 255 Gly Leu Tyr Thr Leu Leu Glu Phe Ile Gly Asn Pro Ile Thr Ala Thr 260 265 270 Phe Ile Ala Val Phe Val Ala Tyr Tyr Val Leu Gly Ile Arg Gln His 275 280 285 Met Ser Met Gly Thr Met Leu Thr His Thr Glu Asn Gly Phe Gly Ser 290 295 300 Ile Ala Asn Ile Leu Leu Ile Ile Gly Ala Gly Gly Ala Phe Asn Ala 305 310 315 320 Ile Leu Lys Ser Ser Ser Leu Ala Asp Thr Leu Ala Val Ile Leu Ser 325 330 335 Asn Met His Met His Pro Ile Leu Leu Ala Trp Leu Val Ala Leu Ile 340 345 350 Leu His Ala Ala Val Gly Ser Ala Thr Val Ala Met Met Gly Ala Thr 355 360 365 Ala Ile Val Ala Pro Met Leu Pro Leu Tyr Pro Asp Ile Ser Pro Glu 370 375 380 Ile Ile Ala Ile Ala Ile Gly Ser Gly Ala Ile Gly Cys Thr Ile Val 385 390 395 400 Thr Asp Ser Leu Phe Trp Leu Val Lys Gln Tyr Cys Gly Ala Thr Leu 405 410 415 Asn Glu Thr Phe Lys Tyr Tyr Thr Thr Ala Thr Phe Ile Ala Ser Val 420 425 430 Ile Ala Leu Ala Gly Thr Phe Leu Leu Ser Phe Ile Ile 435 440 445 16 442 PRT E. coli CFT073 16 Met Glu Asn Ala Lys Met Asn Ser Leu Ile Ala Gln Tyr Pro Leu Val 1 5 10 15 Glu Asp Leu Val Ala Leu Lys Glu Thr Thr Trp Phe Asn Pro Gly Thr 20 25 30 Thr Ser Leu Ala Glu Gly Leu Pro Tyr Val Gly Leu Thr Glu Gln Asp 35 40 45 Val Gln Asp Ala His Ala Arg Leu Ser Arg Phe Ala Pro Tyr Leu Ala 50 55 60 Lys Ala Phe Pro Glu Thr Ala Ala Ala Gly Gly Ile Ile Glu Ser Glu 65 70 75 80 Leu Val Ala Ile Pro Ala Met Gln Lys Arg Leu Glu Lys Glu Tyr Gln 85 90 95 Gln Pro Ile Ala Gly Gln Leu Leu Leu Lys Lys Asp Ser His Leu Pro 100 105 110 Ile Ser Gly Ser Ile Lys Ala Arg Gly Gly Ile Tyr Glu Val Leu Ala 115 120 125 His Ala Glu Lys Leu Ala Leu Glu Ala Gly Leu Leu Thr Leu Glu Asp 130 135 140 Asp Tyr Ser Lys Leu Leu Ser Pro Glu Phe Lys Gln Phe Phe Ser Gln 145 150 155 160 Tyr Ser Ile Ala Val Gly Ser Thr Gly Asn Leu Gly Leu Ser Ile Gly 165 170 175 Ile Met Ser Ala Arg Ile Gly Phe Lys Val Thr Val His Met Ser Ala 180 185 190 Asp Ala Arg Ala Trp Lys Lys Ala Lys Leu Arg Ser His Gly Val Thr 195 200 205 Val Val Glu Tyr Glu Gln Asp Tyr Gly Val Ala Val Glu Glu Gly Arg 210 215 220 Lys Ala Ala Gln Ser Asp Pro Asn Cys Phe Phe Ile Asp Asp Glu Asn 225 230 235 240 Ser Arg Thr Leu Phe Leu Gly Tyr Ser Val Ala Gly Gln Arg Leu Lys 245 250 255 Ala Gln Phe Ala Gln Gln Gly Arg Ile Val Asp Ala Asp Asn Pro Leu 260 265 270 Phe Val Tyr Leu Pro Cys Gly Val Gly Gly Gly Pro Gly Gly Val Ala 275 280 285 Phe Gly Leu Lys Leu Ala Phe Gly Asp His Val His Cys Phe Phe Ala 290 295 300 Glu Pro Thr His Ser Pro Cys Met Leu Leu Gly Val His Thr Gly Leu 305 310 315 320 His Asp Gln Ile Ser Val Gln Asp Ile Gly Ile Asp Asn Leu Thr Ala 325 330 335 Ala Asp Gly Leu Ala Val Gly Arg Ala Ser Gly Phe Val Gly Arg Ala 340 345 350 Met Glu Arg Leu Leu Asp Gly Phe Tyr Thr Leu Ser Asp Gln Thr Met 355 360 365 Tyr Asp Met Leu Gly Trp Leu Ala Gln Glu Glu Gly Ile Arg Leu Glu 370 375 380 Pro Ser Ala Leu Ala Gly Met Ala Gly Ser Gln Arg Val Cys Ala Ser 385 390 395 400 Val Ser Tyr Gln Gln Met His Gly Phe Ser Ala Glu Gln Leu Arg Asn 405 410 415 Ala Thr His Leu Val Trp Ala Thr Gly Gly Gly Met Val Pro Glu Glu 420 425 430 Glu Met Asn Gln Tyr Leu Ala Lys Gly Arg 435 440 17 42 DNA Artificial Sequence Description of Artificial Sequencesynthetic oligonucleotide 17 gcgctgcagc gttattaacg gccttttgcc agatattgat tc 42 18 40 DNA Artificial Sequence Description of Artificial Sequencesynthetic oligonucleotide 18 cgcggatccc gtactatgga aaacgctaaa atgaattcgc 40
Claims (18)
1. A method of detecting uropathogenic E. coli nucleotide sequences differentially expressed in the presence or absence of D-serine comprising:
(a) providing a library comprising a plurality of transposon mutants of a uropathogenic E. coli strain, the mutants comprising a transcriptional fusion comprising a transcriptional regulation sequence operably connected to a sequence encoding a detectable protein;
(b) growing the mutants of (a) in the presence of D-serine;
(c) growing the mutants of (a) in the absence of D-serine;
(d) identifying mutants having increased or decreased expression of the transcriptional fusion in the presence or absence of D-serine by comparing the relative levels of the detectable protein of the mutants of (b) and (c);
(e) identifying the insertion site of the transcriptional fusion in the transposon mutants identified in (d); and
(f) correlating the insertion site with an E. coli nucleotide sequence.
2. The method of claim 1 , wherein the uropathogenic E. coli strain is CFf073.
3. A method of identifying proteins differentially expressed in a wild-type uropathogenic E. coli strain and a uropathogenic E. coli dsdCXA locus mutant, the mutant having reduced expression of one or more proteins selected from the group consisting of DsdA, DsdC, and DsdX, the method comprising
(a) comparing proteins isolated from the wild-type uropathogenic E. coli strain and proteins isolated from the uropathogenic E. coli dsdCXA locus mutant; and
(b) identifying proteins from the dsdCXA mutant having increased or decreased level of expression relative to expression of the corresponding proteins in the wild-type uropathogenic E. coli strain.
4. The method of claim 3 , wherein the mutant is a dsdA mutant.
5. The method of claim 3 , wherein the isolated proteins of (a) are separated by two-dimensional gel electrophoresis.
6. The method of claim 3 , wherein prior to (a), the wild-type strain and the dsdA mutant are grown in the presence of D-serine.
7. A method of detecting genes from a uropathogenic E. coli strain that are differentially expressed in the presence or absence of D-serine comprising
(a) hybridizing a first set of labeled oligonucleotide probes with an array of oligomers, the oligomers comprising gene sequences from the uropathogenic E. coli, wherein the first set of labeled oligonucleotide probes is made by reverse transcription of RNA isolated from uropathogenic E. coli grown in the presence of D-serine;
(b) hybridizing a second set of labeled oligonucleotide probes with an array of oligomers identical to the array of (a), wherein the second set of labeled oligonucleotide probes is made by reverse transcription of RNA isolated from uropathogenic E. coli grown in the absence of D-serine;
(c) comparing hybridization of labeled oligonucleotide probes of (a) and (b) to identify oligomers having differential hybridization to the first and second sets of oligonucleotide probes;
(d) identifying genes comprising the sequences of the oligomers of (c).
8. A method of detecting proteins differentially expressed in a uropathogenic E. coli strain in response to D-serine, comprising:
(a) providing a first culture of the uropathogenic E. coli strain grown in the presence of D-serine;
(b) providing a second culture of the uropathogenic E. coli strain grown in the absence of D-serine;
(c) comparing proteins isolated from (a) and (b); and
(d) identifying proteins from E. coli grown in the presence of D-serine that are increased or decreased relative to the corresponding proteins in the uropathogenic E. coli strain grown in the absence of D-serine.
9. A method of identifying a uropathogenic E. coli polynucleotide sequence that binds to DsdC protein comprising
contacting an E. coli polynucleotide sequence with DsdC protein; and
detecting binding of the polynucleotide sequence to the protein.
10. A method of characterizing an E. coli strain isolated from a clinical sample comprising testing the strain for the ability to grow in the presence of D-serine.
11. The method of claim 10 , wherein D-serine is the sole source of carbon and nitrogen.
12. The method of claim 10 , wherein D-serine is present in a concentration effective to inhibit the growth of a normal fecal isolate of E. coli.
13. The method of claim 10 , wherein D-serine is present in a concentration of at least 100 ug/ml.
14. The method of claim 10 , wherein D-serine is present at a concentration of from about 100 ug/ml to about 500 ug/ml.
15. A method of detecting D-serine in a sample comprising the steps of:
(a) contacting the sample with a polypeptide comprising D-serine deaminase under suitable reaction conditions and for a period of time sufficient to allow the deamination of at least a portion of D-serine molecules in the sample; and
(b) detecting the deamination of D-serine.
16. The method of claim 15 , wherein the polypeptide of step (a) is immobilized on a solid support.
17. The method of claim 16 , wherein the polypeptide is coimmobilized with an indicator responsive to ammonia.
18. A urine dipstick comprising a solid support, a polypeptide comprising D-serine deaminase and an indicator responsive to ammonia, wherein the polypeptide and indicator are coimmobilized on the support.
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/117,417 US20020187544A1 (en) | 2001-04-05 | 2002-04-05 | Uropathogenic E. coli D-serine detoxification operon |
| US11/289,989 US7422871B2 (en) | 2001-04-05 | 2005-11-30 | Uropathogenic E. coli D-serine detoxification operon |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US28185901P | 2001-04-05 | 2001-04-05 | |
| US10/117,417 US20020187544A1 (en) | 2001-04-05 | 2002-04-05 | Uropathogenic E. coli D-serine detoxification operon |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US11/289,989 Division US7422871B2 (en) | 2001-04-05 | 2005-11-30 | Uropathogenic E. coli D-serine detoxification operon |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20020187544A1 true US20020187544A1 (en) | 2002-12-12 |
Family
ID=26815266
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US10/117,417 Abandoned US20020187544A1 (en) | 2001-04-05 | 2002-04-05 | Uropathogenic E. coli D-serine detoxification operon |
| US11/289,989 Expired - Fee Related US7422871B2 (en) | 2001-04-05 | 2005-11-30 | Uropathogenic E. coli D-serine detoxification operon |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US11/289,989 Expired - Fee Related US7422871B2 (en) | 2001-04-05 | 2005-11-30 | Uropathogenic E. coli D-serine detoxification operon |
Country Status (1)
| Country | Link |
|---|---|
| US (2) | US20020187544A1 (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20040209370A1 (en) * | 2002-12-19 | 2004-10-21 | Wonchul Suh | Method for chromosomal engineering |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US3873269A (en) * | 1972-10-11 | 1975-03-25 | Merck Patent Gmbh | Indicator for the determination of urea |
| US4803170A (en) * | 1985-05-09 | 1989-02-07 | Ultra Diagnostics Corporation | Competitive immunoassay method, device and test kit |
| US6617488B1 (en) * | 1997-10-14 | 2003-09-09 | Indicator Technologies, Inc. | Method and apparatus for indicating the conditions in an absorbent article |
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1997020552A1 (en) | 1995-12-07 | 1997-06-12 | Albert Einstein College Of Medicine Of Yeshiva University | Treatment of negative and cognitive symptoms of schizophrenia with glycine and its precursors |
-
2002
- 2002-04-05 US US10/117,417 patent/US20020187544A1/en not_active Abandoned
-
2005
- 2005-11-30 US US11/289,989 patent/US7422871B2/en not_active Expired - Fee Related
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US3873269A (en) * | 1972-10-11 | 1975-03-25 | Merck Patent Gmbh | Indicator for the determination of urea |
| US4803170A (en) * | 1985-05-09 | 1989-02-07 | Ultra Diagnostics Corporation | Competitive immunoassay method, device and test kit |
| US6617488B1 (en) * | 1997-10-14 | 2003-09-09 | Indicator Technologies, Inc. | Method and apparatus for indicating the conditions in an absorbent article |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20040209370A1 (en) * | 2002-12-19 | 2004-10-21 | Wonchul Suh | Method for chromosomal engineering |
Also Published As
| Publication number | Publication date |
|---|---|
| US7422871B2 (en) | 2008-09-09 |
| US20060099629A1 (en) | 2006-05-11 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Ferrières et al. | The RcsC sensor kinase is required for normal biofilm formation in Escherichia coli K‐12 and controls the expression of a regulon in response to growth on a solid surface | |
| Bisicchia et al. | The essential YycFG two‐component system controls cell wall metabolism in Bacillus subtilis | |
| Boddicker et al. | Differential binding to and biofilm formation on, HEp‐2 cells by Salmonella enterica Serovar Typhimurium is dependent upon allelic variation in the fimH gene of the fim gene cluster | |
| Elliott et al. | The locus of enterocyte effacement (LEE)-encoded regulator controls expression of both LEE-and non-LEE-encoded virulence factors in enteropathogenic and enterohemorrhagic Escherichia coli | |
| Arakawa et al. | Genomic organization of the Klebsiella pneumoniae cps region responsible for serotype K2 capsular polysaccharide synthesis in the virulent strain Chedid | |
| Mellies et al. | The Per regulon of enteropathogenic Escherichia coli: identification of a regulatory cascade and a novel transcriptional activator, the locus of enterocyte effacement (LEE)‐encoded regulator (Ler) | |
| Das et al. | Stringent response in Vibrio cholerae: genetic analysis of spoT gene function and identification of a novel (p) ppGpp synthetase gene | |
| De Boever et al. | Enterococcus faecalis conjugative plasmid pAM373: complete nucleotide sequence and genetic analyses of sex pheromone response | |
| Dougherty et al. | Identification of Haemophilus influenzae Rd transformation genes using cassette mutagenesis | |
| KR20020089462A (en) | Cellular Arrays for the Identification of Altered Gene Expression | |
| JP2003518386A (en) | Genes identified as required for the growth of Escherichia coli | |
| US20030013104A1 (en) | Multiple antibiotic resistance operon assays | |
| Schembri et al. | DNA microarray analysis of fim mutations in Escherichia coli | |
| EP0830457A1 (en) | Methods for evaluation of antimicrobial targets | |
| JP2002535007A (en) | Genes identified as required for the growth of Escherichia coli | |
| Unsworth et al. | Identification and analysis of bacterial virulence genes in vivo | |
| Maurer et al. | Functional interchangeability of DNA replication genes in Salmonella typhimurium and Escherichia coli demonstrated by a general complementation procedure | |
| Edwards et al. | Genomic analysis and growth-phase-dependent regulation of the SEF14 fimbriae of Salmonella enterica serovar Enteritidis | |
| Maxson et al. | Multiple promoters control expression of the Yersinia enterocolitica phage-shock-protein A (pspA) operon | |
| JPH05504672A (en) | Polynucleotide probes, methods and kits for the identification and detection of Gram-negative bacteria | |
| US7422871B2 (en) | Uropathogenic E. coli D-serine detoxification operon | |
| US20030027175A1 (en) | Dynamic whole genome screening methodology and systems | |
| US7309484B2 (en) | Compositions and methods for screening antibacterial compounds | |
| JP2001505417A (en) | Methods for identifying genes essential for organism growth | |
| EP1079861A1 (en) | Regulation of biofilm formation |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: WISCONSIN ALUMNI RESEARCH FOUNDATION, WISCONSIN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WELCH, RODNEY A.;ROESCH, PAULA L.;REEL/FRAME:012749/0823;SIGNING DATES FROM 20020419 TO 20020503 |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |
|
| AS | Assignment |
Owner name: NATIONAL INSTITUTES OF HEALTH (NIH), U.S. DEPT. OF Free format text: EXECUTIVE ORDER 9424, CONFIRMATORY LICENSE;ASSIGNOR:UNIVERSITY OF WISCONSIN-MADISON;REEL/FRAME:021910/0585 Effective date: 20020531 |