US20170204381A1 - Pmst1 mutants for chemoenzymatic synthesis of sialyl lewis x compounds - Google Patents
Pmst1 mutants for chemoenzymatic synthesis of sialyl lewis x compounds Download PDFInfo
- Publication number
- US20170204381A1 US20170204381A1 US14/991,774 US201614991774A US2017204381A1 US 20170204381 A1 US20170204381 A1 US 20170204381A1 US 201614991774 A US201614991774 A US 201614991774A US 2017204381 A1 US2017204381 A1 US 2017204381A1
- Authority
- US
- United States
- Prior art keywords
- seq
- glycosyltransferase
- amino acid
- isolated
- pmst1
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 230000015572 biosynthetic process Effects 0.000 title description 22
- 238000003786 synthesis reaction Methods 0.000 title description 15
- NIGUVXFURDGQKZ-UQTBNESHSA-N alpha-Neup5Ac-(2->3)-beta-D-Galp-(1->4)-[alpha-L-Fucp-(1->3)]-beta-D-GlcpNAc Chemical class O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@H]2[C@@H]([C@@H](O[C@]3(O[C@H]([C@H](NC(C)=O)[C@@H](O)C3)[C@H](O)[C@H](O)CO)C(O)=O)[C@@H](O)[C@@H](CO)O2)O)[C@@H](CO)O[C@@H](O)[C@@H]1NC(C)=O NIGUVXFURDGQKZ-UQTBNESHSA-N 0.000 title 1
- 230000000694 effects Effects 0.000 claims abstract description 88
- 229920001542 oligosaccharide Polymers 0.000 claims abstract description 53
- 230000003247 decreasing effect Effects 0.000 claims abstract description 35
- 108010006232 Neuraminidase Proteins 0.000 claims abstract description 30
- 102000005348 Neuraminidase Human genes 0.000 claims abstract description 30
- 108700023372 Glycosyltransferases Proteins 0.000 claims description 158
- 150000001413 amino acids Chemical class 0.000 claims description 127
- 102000051366 Glycosyltransferases Human genes 0.000 claims description 121
- 239000000758 substrate Substances 0.000 claims description 63
- 238000000034 method Methods 0.000 claims description 55
- 150000002482 oligosaccharides Chemical class 0.000 claims description 52
- 235000000346 sugar Nutrition 0.000 claims description 49
- 238000006460 hydrolysis reaction Methods 0.000 claims description 41
- 230000007062 hydrolysis Effects 0.000 claims description 40
- TXCIAUNLDRJGJZ-BILDWYJOSA-N CMP-N-acetyl-beta-neuraminic acid Chemical compound O1[C@@H]([C@H](O)[C@H](O)CO)[C@H](NC(=O)C)[C@@H](O)C[C@]1(C(O)=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(N=C(N)C=C2)=O)O1 TXCIAUNLDRJGJZ-BILDWYJOSA-N 0.000 claims description 39
- 150000007523 nucleic acids Chemical class 0.000 claims description 38
- 108010057005 beta-galactoside alpha-2,3-sialyltransferase Proteins 0.000 claims description 37
- 102000039446 nucleic acids Human genes 0.000 claims description 35
- 108020004707 nucleic acids Proteins 0.000 claims description 35
- TXCIAUNLDRJGJZ-UHFFFAOYSA-N CMP-N-acetyl neuraminic acid Natural products O1C(C(O)C(O)CO)C(NC(=O)C)C(O)CC1(C(O)=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(N=C(N)C=C2)=O)O1 TXCIAUNLDRJGJZ-UHFFFAOYSA-N 0.000 claims description 33
- 102000003838 Sialyltransferases Human genes 0.000 claims description 31
- 108090000141 Sialyltransferases Proteins 0.000 claims description 31
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 31
- SQVRNKJHWKZAKO-UHFFFAOYSA-N beta-N-Acetyl-D-neuraminic acid Natural products CC(=O)NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO SQVRNKJHWKZAKO-UHFFFAOYSA-N 0.000 claims description 30
- 239000002773 nucleotide Substances 0.000 claims description 28
- 125000003729 nucleotide group Chemical group 0.000 claims description 26
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 26
- SQVRNKJHWKZAKO-PFQGKNLYSA-N N-acetyl-beta-neuraminic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)O[C@H]1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-PFQGKNLYSA-N 0.000 claims description 23
- 229920001184 polypeptide Polymers 0.000 claims description 22
- 229910052799 carbon Inorganic materials 0.000 claims description 19
- IERHLVCPSMICTF-UHFFFAOYSA-N cytidine monophosphate Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(COP(O)(O)=O)O1 IERHLVCPSMICTF-UHFFFAOYSA-N 0.000 claims description 17
- 239000011541 reaction mixture Substances 0.000 claims description 17
- 229910052731 fluorine Inorganic materials 0.000 claims description 16
- 229910052721 tungsten Inorganic materials 0.000 claims description 16
- 229910052727 yttrium Inorganic materials 0.000 claims description 15
- 238000005580 one pot reaction Methods 0.000 claims description 9
- 241000894006 Bacteria Species 0.000 claims description 8
- 229910052739 hydrogen Inorganic materials 0.000 claims description 8
- 229910052757 nitrogen Inorganic materials 0.000 claims description 8
- 238000012546 transfer Methods 0.000 claims description 8
- 229910052698 phosphorus Inorganic materials 0.000 claims description 7
- 229910052717 sulfur Inorganic materials 0.000 claims description 7
- 102000048245 N-acetylneuraminate lyases Human genes 0.000 claims description 6
- 108700023220 N-acetylneuraminate lyases Proteins 0.000 claims description 6
- 108010081778 N-acylneuraminate cytidylyltransferase Proteins 0.000 claims description 5
- 241000238631 Hexapoda Species 0.000 claims description 4
- LCTONWCANYUPML-UHFFFAOYSA-N Pyruvic acid Chemical compound CC(=O)C(O)=O LCTONWCANYUPML-UHFFFAOYSA-N 0.000 claims description 4
- OVRNDRQMDRJTHS-OZRXBMAMSA-N N-acetyl-beta-D-mannosamine Chemical compound CC(=O)N[C@@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-OZRXBMAMSA-N 0.000 claims description 3
- PCDQPRRSZKQHHS-CCXZUQQUSA-N Cytarabine Triphosphate Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 PCDQPRRSZKQHHS-CCXZUQQUSA-N 0.000 claims description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 2
- 229940107700 pyruvic acid Drugs 0.000 claims description 2
- IERHLVCPSMICTF-ZAKLUEHWSA-N cytidine-5'-monophosphate Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](COP(O)(O)=O)O1 IERHLVCPSMICTF-ZAKLUEHWSA-N 0.000 claims 1
- 238000002360 preparation method Methods 0.000 abstract description 5
- -1 sialyl-Lewisx oligosaccharides Chemical class 0.000 abstract description 4
- 235000001014 amino acid Nutrition 0.000 description 107
- 229940024606 amino acid Drugs 0.000 description 95
- 239000000370 acceptor Substances 0.000 description 54
- 210000004027 cell Anatomy 0.000 description 44
- 102000045442 glycosyltransferase activity proteins Human genes 0.000 description 37
- 108700014210 glycosyltransferase activity proteins Proteins 0.000 description 37
- 230000035772 mutation Effects 0.000 description 37
- 102000053602 DNA Human genes 0.000 description 32
- 108020004414 DNA Proteins 0.000 description 31
- 108090000623 proteins and genes Proteins 0.000 description 27
- 229940088598 enzyme Drugs 0.000 description 26
- 238000006243 chemical reaction Methods 0.000 description 24
- 230000014509 gene expression Effects 0.000 description 22
- 102000004190 Enzymes Human genes 0.000 description 21
- 108090000790 Enzymes Proteins 0.000 description 21
- 102000004169 proteins and genes Human genes 0.000 description 21
- 239000013598 vector Substances 0.000 description 20
- 241000588724 Escherichia coli Species 0.000 description 19
- IERHLVCPSMICTF-XVFCMESISA-N cytidine 5'-monophosphate Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(O)=O)O1 IERHLVCPSMICTF-XVFCMESISA-N 0.000 description 19
- 235000018102 proteins Nutrition 0.000 description 18
- 125000003275 alpha amino acid group Chemical group 0.000 description 17
- 125000005629 sialic acid group Chemical group 0.000 description 17
- 239000011734 sodium Substances 0.000 description 17
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 14
- 238000003556 assay Methods 0.000 description 13
- FAPWRFPIFSIZLT-UHFFFAOYSA-M sodium chloride Inorganic materials [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 13
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 13
- 239000013612 plasmid Substances 0.000 description 12
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 11
- 239000000872 buffer Substances 0.000 description 11
- 238000002741 site-directed mutagenesis Methods 0.000 description 11
- 238000001644 13C nuclear magnetic resonance spectroscopy Methods 0.000 description 10
- 238000005160 1H NMR spectroscopy Methods 0.000 description 10
- 238000000132 electrospray ionisation Methods 0.000 description 10
- 238000000746 purification Methods 0.000 description 10
- SQVRNKJHWKZAKO-OQPLDHBCSA-N sialic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)OC1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-OQPLDHBCSA-N 0.000 description 10
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 9
- 238000004128 high performance liquid chromatography Methods 0.000 description 9
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 9
- 150000002772 monosaccharides Chemical class 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- 238000006467 substitution reaction Methods 0.000 description 9
- 150000001875 compounds Chemical class 0.000 description 8
- 229920000642 polymer Polymers 0.000 description 8
- 239000013615 primer Substances 0.000 description 8
- 108091028043 Nucleic acid sequence Proteins 0.000 description 7
- 108091034117 Oligonucleotide Proteins 0.000 description 7
- 239000013078 crystal Substances 0.000 description 7
- 239000013604 expression vector Substances 0.000 description 7
- 238000002703 mutagenesis Methods 0.000 description 7
- 231100000350 mutagenesis Toxicity 0.000 description 7
- 239000011780 sodium chloride Substances 0.000 description 7
- HVCOBJNICQPDBP-UHFFFAOYSA-N 3-[3-[3,5-dihydroxy-6-methyl-4-(3,4,5-trihydroxy-6-methyloxan-2-yl)oxyoxan-2-yl]oxydecanoyloxy]decanoic acid;hydrate Chemical compound O.OC1C(OC(CC(=O)OC(CCCCCCC)CC(O)=O)CCCCCCC)OC(C)C(O)C1OC1C(O)C(O)C(O)C(C)O1 HVCOBJNICQPDBP-UHFFFAOYSA-N 0.000 description 6
- HSHNITRMYYLLCV-UHFFFAOYSA-N 4-methylumbelliferone Chemical compound C1=C(O)C=CC2=C1OC(=O)C=C2C HSHNITRMYYLLCV-UHFFFAOYSA-N 0.000 description 6
- 229930186217 Glycolipid Natural products 0.000 description 6
- 102000003886 Glycoproteins Human genes 0.000 description 6
- 108090000288 Glycoproteins Proteins 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- 108010021466 Mutant Proteins Proteins 0.000 description 6
- 102000008300 Mutant Proteins Human genes 0.000 description 6
- 241000606856 Pasteurella multocida Species 0.000 description 6
- 125000000539 amino acid group Chemical group 0.000 description 6
- 238000003367 kinetic assay Methods 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 229940051027 pasteurella multocida Drugs 0.000 description 6
- 230000001105 regulatory effect Effects 0.000 description 6
- 239000000523 sample Substances 0.000 description 6
- 230000009450 sialylation Effects 0.000 description 6
- 150000008163 sugars Chemical class 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- DQJCDTNMLBYVAY-ZXXIYAEKSA-N (2S,5R,10R,13R)-16-{[(2R,3S,4R,5R)-3-{[(2S,3R,4R,5S,6R)-3-acetamido-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy}-5-(ethylamino)-6-hydroxy-2-(hydroxymethyl)oxan-4-yl]oxy}-5-(4-aminobutyl)-10-carbamoyl-2,13-dimethyl-4,7,12,15-tetraoxo-3,6,11,14-tetraazaheptadecan-1-oic acid Chemical compound NCCCC[C@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CC[C@H](C(N)=O)NC(=O)[C@@H](C)NC(=O)C(C)O[C@@H]1[C@@H](NCC)C(O)O[C@H](CO)[C@H]1O[C@H]1[C@H](NC(C)=O)[C@@H](O)[C@H](O)[C@@H](CO)O1 DQJCDTNMLBYVAY-ZXXIYAEKSA-N 0.000 description 5
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- 102000002068 Glycopeptides Human genes 0.000 description 5
- 108010015899 Glycopeptides Proteins 0.000 description 5
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 5
- 238000007792 addition Methods 0.000 description 5
- 238000005251 capillar electrophoresis Methods 0.000 description 5
- 230000001186 cumulative effect Effects 0.000 description 5
- 230000005284 excitation Effects 0.000 description 5
- 239000008101 lactose Substances 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 238000003752 polymerase chain reaction Methods 0.000 description 5
- 230000002103 transcriptional effect Effects 0.000 description 5
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 4
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 4
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- RWZYAGGXGHYGMB-UHFFFAOYSA-N anthranilic acid Chemical compound NC1=CC=CC=C1C(O)=O RWZYAGGXGHYGMB-UHFFFAOYSA-N 0.000 description 4
- 125000004429 atom Chemical group 0.000 description 4
- 150000001720 carbohydrates Chemical class 0.000 description 4
- 235000014633 carbohydrates Nutrition 0.000 description 4
- 239000013592 cell lysate Substances 0.000 description 4
- 239000012634 fragment Substances 0.000 description 4
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 239000000178 monomer Substances 0.000 description 4
- 102000040430 polynucleotide Human genes 0.000 description 4
- 108091033319 polynucleotide Proteins 0.000 description 4
- 239000002157 polynucleotide Substances 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 230000007306 turnover Effects 0.000 description 4
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 3
- PSGQCCSGKGJLRL-UHFFFAOYSA-N 4-methyl-2h-chromen-2-one Chemical group C1=CC=CC2=C1OC(=O)C=C2C PSGQCCSGKGJLRL-UHFFFAOYSA-N 0.000 description 3
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 3
- 102000001390 Fructose-Bisphosphate Aldolase Human genes 0.000 description 3
- 108010068561 Fructose-Bisphosphate Aldolase Proteins 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 3
- 239000007987 MES buffer Substances 0.000 description 3
- OVRNDRQMDRJTHS-ZTVVOAFPSA-N N-acetyl-D-mannosamine Chemical compound CC(=O)N[C@@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-ZTVVOAFPSA-N 0.000 description 3
- 238000005481 NMR spectroscopy Methods 0.000 description 3
- 241000588650 Neisseria meningitidis Species 0.000 description 3
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 3
- 229920004890 Triton X-100 Polymers 0.000 description 3
- 239000013504 Triton X-100 Substances 0.000 description 3
- 0 [1*]C1C(O)CC(O)(C(=O)[O-])OC1[C@H]([4*])[C@H]([3*])C[2*] Chemical compound [1*]C1C(O)CC(O)(C(=O)[O-])OC1[C@H]([4*])[C@H]([3*])C[2*] 0.000 description 3
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000003197 catalytic effect Effects 0.000 description 3
- 238000006555 catalytic reaction Methods 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 230000033581 fucosylation Effects 0.000 description 3
- 230000013595 glycosylation Effects 0.000 description 3
- 238000006206 glycosylation reaction Methods 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 238000011065 in-situ storage Methods 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 125000001483 monosaccharide substituent group Chemical group 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 238000004809 thin layer chromatography Methods 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 238000011179 visual inspection Methods 0.000 description 3
- FDKWRPBBCBCIGA-REOHCLBHSA-N (2r)-2-azaniumyl-3-$l^{1}-selanylpropanoate Chemical compound [Se]C[C@H](N)C(O)=O FDKWRPBBCBCIGA-REOHCLBHSA-N 0.000 description 2
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 2
- 229920000936 Agarose Polymers 0.000 description 2
- 108030003364 Alpha-N-acetylgalactosaminide alpha-2,6-sialyltransferases Proteins 0.000 description 2
- 102100029233 Alpha-N-acetylneuraminide alpha-2,8-sialyltransferase Human genes 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 102100029962 CMP-N-acetylneuraminate-beta-1,4-galactoside alpha-2,3-sialyltransferase Human genes 0.000 description 2
- 102100031974 CMP-N-acetylneuraminate-beta-galactosamide-alpha-2,3-sialyltransferase 4 Human genes 0.000 description 2
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- FDKWRPBBCBCIGA-UWTATZPHSA-N D-Selenocysteine Natural products [Se]C[C@@H](N)C(O)=O FDKWRPBBCBCIGA-UWTATZPHSA-N 0.000 description 2
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 2
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 241001646716 Escherichia coli K-12 Species 0.000 description 2
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 description 2
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 2
- 239000007995 HEPES buffer Substances 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- SHZGCJCMOBCMKK-DHVFOXMCSA-N L-fucopyranose Chemical compound C[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O SHZGCJCMOBCMKK-DHVFOXMCSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- ZFOMKMMPBOQKMC-KXUCPTDWSA-N L-pyrrolysine Chemical compound C[C@@H]1CC=N[C@H]1C(=O)NCCCC[C@H]([NH3+])C([O-])=O ZFOMKMMPBOQKMC-KXUCPTDWSA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- 102100030928 Lactosylceramide alpha-2,3-sialyltransferase Human genes 0.000 description 2
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- 125000003047 N-acetyl group Chemical group 0.000 description 2
- 108010015197 N-acetyllactosaminide alpha-2,3-sialyltransferase Proteins 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical compound C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 125000000852 azido group Chemical group *N=[N+]=[N-] 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- AFYNADDZULBEJA-UHFFFAOYSA-N bicinchoninic acid Chemical compound C1=CC=CC2=NC(C=3C=C(C4=CC=CC=C4N=3)C(=O)O)=CC(C(O)=O)=C21 AFYNADDZULBEJA-UHFFFAOYSA-N 0.000 description 2
- 239000012148 binding buffer Substances 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 239000001110 calcium chloride Substances 0.000 description 2
- 229910001628 calcium chloride Inorganic materials 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 238000013480 data collection Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 150000002016 disaccharides Chemical class 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 235000019439 ethyl acetate Nutrition 0.000 description 2
- 238000004992 fast atom bombardment mass spectroscopy Methods 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 150000008195 galaktosides Chemical class 0.000 description 2
- 150000004676 glycans Chemical class 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 108010031142 heparosan synthase Proteins 0.000 description 2
- 238000005570 heteronuclear single quantum coherence Methods 0.000 description 2
- 239000001257 hydrogen Substances 0.000 description 2
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 238000011068 loading method Methods 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 229910001629 magnesium chloride Inorganic materials 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 238000000655 nuclear magnetic resonance spectrum Methods 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- ZRSNZINYAWTAHE-UHFFFAOYSA-N p-methoxybenzaldehyde Chemical compound COC1=CC=C(C=O)C=C1 ZRSNZINYAWTAHE-UHFFFAOYSA-N 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 229920001282 polysaccharide Polymers 0.000 description 2
- 239000005017 polysaccharide Substances 0.000 description 2
- ZCCUUQDIBDJBTK-UHFFFAOYSA-N psoralen Chemical compound C1=C2OC(=O)C=CC2=CC2=C1OC=C2 ZCCUUQDIBDJBTK-UHFFFAOYSA-N 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- ZKZBPNGNEQAJSX-UHFFFAOYSA-N selenocysteine Natural products [SeH]CC(N)C(O)=O ZKZBPNGNEQAJSX-UHFFFAOYSA-N 0.000 description 2
- 235000016491 selenocysteine Nutrition 0.000 description 2
- 229940055619 selenocysteine Drugs 0.000 description 2
- 125000005630 sialyl group Chemical group 0.000 description 2
- 239000000741 silica gel Substances 0.000 description 2
- 229910002027 silica gel Inorganic materials 0.000 description 2
- DAEPDZWVDSPTHF-UHFFFAOYSA-M sodium pyruvate Chemical compound [Na+].CC(=O)C([O-])=O DAEPDZWVDSPTHF-UHFFFAOYSA-M 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 150000004043 trisaccharides Chemical class 0.000 description 2
- LAQPKDLYOBZWBT-NYLDSJSYSA-N (2s,4s,5r,6r)-5-acetamido-2-{[(2s,3r,4s,5s,6r)-2-{[(2r,3r,4r,5r)-5-acetamido-1,2-dihydroxy-6-oxo-4-{[(2s,3s,4r,5s,6s)-3,4,5-trihydroxy-6-methyloxan-2-yl]oxy}hexan-3-yl]oxy}-3,5-dihydroxy-6-(hydroxymethyl)oxan-4-yl]oxy}-4-hydroxy-6-[(1r,2r)-1,2,3-trihydrox Chemical compound O[C@H]1[C@H](O)[C@H](O)[C@H](C)O[C@H]1O[C@H]([C@@H](NC(C)=O)C=O)[C@@H]([C@H](O)CO)O[C@H]1[C@H](O)[C@@H](O[C@]2(O[C@H]([C@H](NC(C)=O)[C@@H](O)C2)[C@H](O)[C@H](O)CO)C(O)=O)[C@@H](O)[C@@H](CO)O1 LAQPKDLYOBZWBT-NYLDSJSYSA-N 0.000 description 1
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 1
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- FMYBFLOWKQRBST-UHFFFAOYSA-N 2-[bis(carboxymethyl)amino]acetic acid;nickel Chemical compound [Ni].OC(=O)CN(CC(O)=O)CC(O)=O FMYBFLOWKQRBST-UHFFFAOYSA-N 0.000 description 1
- INEWUCPYEUEQTN-UHFFFAOYSA-N 3-(cyclohexylamino)-2-hydroxy-1-propanesulfonic acid Chemical compound OS(=O)(=O)CC(O)CNC1CCCCC1 INEWUCPYEUEQTN-UHFFFAOYSA-N 0.000 description 1
- VXGRJERITKFWPL-UHFFFAOYSA-N 4',5'-Dihydropsoralen Natural products C1=C2OC(=O)C=CC2=CC2=C1OCC2 VXGRJERITKFWPL-UHFFFAOYSA-N 0.000 description 1
- ZZOKVYOCRSMTSS-UHFFFAOYSA-N 9h-fluoren-9-ylmethyl carbamate Chemical compound C1=CC=C2C(COC(=O)N)C3=CC=CC=C3C2=C1 ZZOKVYOCRSMTSS-UHFFFAOYSA-N 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 102100031970 Alpha-N-acetylgalactosaminide alpha-2,6-sialyltransferase 2 Human genes 0.000 description 1
- 108010059390 Alpha-N-acetylneuraminate alpha-2,8-sialyltransferase Proteins 0.000 description 1
- 101710115567 Alpha-N-acetylneuraminide alpha-2,8-sialyltransferase Proteins 0.000 description 1
- 101001027098 Arabidopsis thaliana Fucose-1-phosphate guanylyltransferase Proteins 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 241000606124 Bacteroides fragilis Species 0.000 description 1
- 108030003360 Beta-galactoside alpha-(2,6)-sialyltransferases Proteins 0.000 description 1
- 102100029945 Beta-galactoside alpha-2,6-sialyltransferase 1 Human genes 0.000 description 1
- BTBUEUYNUDRHOZ-UHFFFAOYSA-N Borate Chemical compound [O-]B([O-])[O-] BTBUEUYNUDRHOZ-UHFFFAOYSA-N 0.000 description 1
- QCMYYKRYFNMIEC-UHFFFAOYSA-N COP(O)=O Chemical class COP(O)=O QCMYYKRYFNMIEC-UHFFFAOYSA-N 0.000 description 1
- 102000005572 Cathepsin A Human genes 0.000 description 1
- 108010059081 Cathepsin A Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 241000588921 Enterobacteriaceae Species 0.000 description 1
- 241001522878 Escherichia coli B Species 0.000 description 1
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108010045674 Fucose-1-phosphate guanylyltransferase Proteins 0.000 description 1
- 102000006471 Fucosyltransferases Human genes 0.000 description 1
- 108010019236 Fucosyltransferases Proteins 0.000 description 1
- 102000030902 Galactosyltransferase Human genes 0.000 description 1
- 108060003306 Galactosyltransferase Proteins 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 102000000340 Glucosyltransferases Human genes 0.000 description 1
- 108010055629 Glucosyltransferases Proteins 0.000 description 1
- 102000016354 Glucuronosyltransferase Human genes 0.000 description 1
- 108010092364 Glucuronosyltransferase Proteins 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 229910004373 HOAc Inorganic materials 0.000 description 1
- 241000590002 Helicobacter pylori Species 0.000 description 1
- 108091027305 Heteroduplex Proteins 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 239000007836 KH2PO4 Substances 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 108030003357 Lactosylceramide alpha-2,3-sialyltransferases Proteins 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 102000006722 Mannosyltransferases Human genes 0.000 description 1
- 108010087568 Mannosyltransferases Proteins 0.000 description 1
- 206010027476 Metastases Diseases 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 241000700562 Myxoma virus Species 0.000 description 1
- 102000007524 N-Acetylgalactosaminyltransferases Human genes 0.000 description 1
- 108010046220 N-Acetylgalactosaminyltransferases Proteins 0.000 description 1
- 102000002493 N-Acetylglucosaminyltransferases Human genes 0.000 description 1
- 108010093077 N-Acetylglucosaminyltransferases Proteins 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- NYWZBRWKDRMPAS-GRRZBWEESA-N N-acetyl-9-O-acetylneuraminic acid Chemical group CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)O[C@H]1[C@H](O)[C@H](O)COC(C)=O NYWZBRWKDRMPAS-GRRZBWEESA-N 0.000 description 1
- 150000008270 N-acetylgalactosaminides Chemical class 0.000 description 1
- SUHQNCLNRUAGOO-UHFFFAOYSA-N N-glycoloyl-neuraminic acid Natural products OCC(O)C(O)C(O)C(NC(=O)CO)C(O)CC(=O)C(O)=O SUHQNCLNRUAGOO-UHFFFAOYSA-N 0.000 description 1
- FDJKUWYYUZCUJX-UHFFFAOYSA-N N-glycolyl-beta-neuraminic acid Natural products OCC(O)C(O)C1OC(O)(C(O)=O)CC(O)C1NC(=O)CO FDJKUWYYUZCUJX-UHFFFAOYSA-N 0.000 description 1
- FDJKUWYYUZCUJX-KVNVFURPSA-N N-glycolylneuraminic acid Chemical compound OC[C@H](O)[C@H](O)[C@@H]1O[C@](O)(C(O)=O)C[C@H](O)[C@H]1NC(=O)CO FDJKUWYYUZCUJX-KVNVFURPSA-N 0.000 description 1
- 238000012565 NMR experiment Methods 0.000 description 1
- 108091061960 Naked DNA Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- LCTONWCANYUPML-UHFFFAOYSA-M Pyruvate Chemical compound CC(=O)C([O-])=O LCTONWCANYUPML-UHFFFAOYSA-M 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 241000607720 Serratia Species 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- HSCJRCZFDFQWRP-ABVWGUQPSA-N UDP-alpha-D-galactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-ABVWGUQPSA-N 0.000 description 1
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 102000010199 Xylosyltransferases Human genes 0.000 description 1
- 108050001741 Xylosyltransferases Proteins 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- HMNZFMSWFCAGGW-XPWSMXQVSA-N [3-[hydroxy(2-hydroxyethoxy)phosphoryl]oxy-2-[(e)-octadec-9-enoyl]oxypropyl] (e)-octadec-9-enoate Chemical compound CCCCCCCC\C=C\CCCCCCCC(=O)OCC(COP(O)(=O)OCCO)OC(=O)CCCCCCC\C=C\CCCCCCCC HMNZFMSWFCAGGW-XPWSMXQVSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 239000002168 alkylating agent Substances 0.000 description 1
- 150000001371 alpha-amino acids Chemical class 0.000 description 1
- 235000008206 alpha-amino acids Nutrition 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 108010064886 beta-D-galactoside alpha 2-6-sialyltransferase Proteins 0.000 description 1
- 125000001488 beta-D-galactosyl group Chemical group C1([C@H](O)[C@@H](O)[C@@H](O)[C@H](O1)CO)* 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 229940022399 cancer vaccine Drugs 0.000 description 1
- 238000009566 cancer vaccine Methods 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 150000004657 carbamic acid derivatives Chemical class 0.000 description 1
- 125000004432 carbon atom Chemical group C* 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000002288 cocrystallisation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000002425 crystallisation Methods 0.000 description 1
- 230000008025 crystallization Effects 0.000 description 1
- 238000002447 crystallographic data Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- IJKVHSBPTUYDLN-UHFFFAOYSA-N dihydroxy(oxo)silane Chemical compound O[Si](O)=O IJKVHSBPTUYDLN-UHFFFAOYSA-N 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 238000009510 drug design Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000001952 enzyme assay Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000003818 flash chromatography Methods 0.000 description 1
- VUWZPRWSIVNGKG-UHFFFAOYSA-N fluoromethane Chemical compound F[CH2] VUWZPRWSIVNGKG-UHFFFAOYSA-N 0.000 description 1
- 125000002519 galactosyl group Chemical group C1([C@H](O)[C@@H](O)[C@@H](O)[C@H](O1)CO)* 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000001641 gel filtration chromatography Methods 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 229930182470 glycoside Natural products 0.000 description 1
- 150000002338 glycosides Chemical class 0.000 description 1
- 108010076477 haematoside synthetase Proteins 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 229940037467 helicobacter pylori Drugs 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- CBOIHMRHGLHBPB-UHFFFAOYSA-N hydroxymethyl Chemical compound O[CH2] CBOIHMRHGLHBPB-UHFFFAOYSA-N 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- BQINXKOTJQCISL-GRCPKETISA-N keto-neuraminic acid Chemical class OC(=O)C(=O)C[C@H](O)[C@@H](N)[C@@H](O)[C@H](O)[C@H](O)CO BQINXKOTJQCISL-GRCPKETISA-N 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 238000001819 mass spectrum Methods 0.000 description 1
- 230000009401 metastasis Effects 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- ZAFUNKXZZPSTLA-UHFFFAOYSA-N n-[5,6-dihydroxy-1-oxo-4-[3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-3-(3,4,5-trihydroxy-6-methyloxan-2-yl)oxyhexan-2-yl]acetamide Chemical compound OC1C(O)C(O)C(C)OC1OC(C(NC(C)=O)C=O)C(C(O)CO)OC1C(O)C(O)C(O)C(CO)O1 ZAFUNKXZZPSTLA-UHFFFAOYSA-N 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000002731 protein assay Methods 0.000 description 1
- 239000012474 protein marker Substances 0.000 description 1
- 230000009145 protein modification Effects 0.000 description 1
- 238000001472 pulsed field gradient Methods 0.000 description 1
- 239000012264 purified product Substances 0.000 description 1
- 229940076788 pyruvate Drugs 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 229920002477 rna polymer Polymers 0.000 description 1
- 102220039305 rs587780536 Human genes 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000009738 saturating Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000002791 soaking Methods 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 229940054269 sodium pyruvate Drugs 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000005469 synchrotron radiation Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000004885 tandem mass spectrometry Methods 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 150000004044 tetrasaccharides Chemical class 0.000 description 1
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical class CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1081—Glycosyltransferases (2.4) transferring other glycosyl groups (2.4.99)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1077—Pentosyltransferases (2.4.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/04—Polysaccharides, i.e. compounds containing more than five saccharide radicals attached to each other by glycosidic bonds
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/18—Preparation of compounds containing saccharide radicals produced by the action of a glycosyl transferase, e.g. alpha-, beta- or gamma-cyclodextrins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y204/00—Glycosyltransferases (2.4)
- C12Y204/99—Glycosyltransferases (2.4) transferring other glycosyl groups (2.4.99)
Definitions
- glycosyltransferase-catalyzed reactions have gained increasing attention and application for the synthesis of complex carbohydrates and glycoconjugates.
- Most mammalian glycosyltransferases suffer from no or low expression in E. coli systems and more restricted substrate specificity.
- bacterial glycosyltransferases are generally easier to access using E. coli expression systems and have more promiscuous substrate flexibility. Nevertheless, despite the discovery of many bacterial glycosyltransferases which have promiscuities for both donor and acceptor substrates, the application of glycosyltransferases in the synthesis of carbohydrate-containing structures is limited by the availability and the substrate specificity of wild-type enzymes.
- sialyltransferases the key enzymes that catalyze the transfer of a sialic acid residue from cytidine 5′-monophosphate-sialic acid (CMP-sialic acid) to an acceptor, have been commonly used for the synthesis of sialic acid-containing structures.
- Sialyl Lewis x [SLe x , Sia ⁇ 2-3Gal ⁇ 1-4(Fuc ⁇ 1-3)GlcNAc ⁇ OR] is an important carbohydrate epitope involved in inflammation as well as adhesion and metastasis of cancer cells. It is a well-known tumor-associated carbohydrate antigen and has been used as a candidate for cancer vaccine.
- SLe x The biosynthesis of SLe x involves the formation of Sia ⁇ 2-3Gal ⁇ 1-4GlcNAc ⁇ OR catalyzed by an ⁇ 2-3-sialyltransferase followed by an ⁇ 1-3-fucosyltransferase-catalyzed fucosylation.
- This biosynthetic sequence usually cannot be altered as common ⁇ 2-3-sialyltransferases do not use fucose-containing Lewis x [Le x , Gal ⁇ 1-4(Fuc ⁇ 1-3)GlcNAc ⁇ OR] as a substrate.
- sialic acids constitute a family of great structural diversity. So far, more than 50 structurally distinct sialic acid forms have been identified in nature.
- an efficient enzymatic approach is to use Le x [Gal ⁇ 1-4(Fuc ⁇ 1-3)GlcNAc ⁇ OR] as a fucose-containing acceptor to add different sialic acid forms by a suitable ⁇ 2-3-sialyltransferase.
- This process of introducing different forms of sialic acid onto the common fucosylated acceptor Le x in the last step has significant advantages compared to the normal SLe x biosynthetic pathway in which fucosylation is the last glycosylation process. It not only simplifies the synthetic scheme as a less number of reactions are needed, but also makes the purification process much easier as negatively charged SLe x product is separated from neutral Le x oligosaccharide instead of separating both negatively charged oligosaccharides SLe x and non-fucosylated sialosides if fucosylation occurs in the last step.
- ⁇ 2-3-sialyltransferase from Pasteurella multocida (PmST1) has a good expression level in E. coli (100 mg L ⁇ 1 culture) ( J. Am. Chem. Soc. 2005, 127, 17618-17619.). It can use Le x as an acceptor for the synthesis of SLe x but the yields are poor ( ⁇ 20%) in spite of different conditions tested. What is needed, therefore, are ⁇ 2-3-sialyltransferases having good ⁇ 2-3-sialyltransferase activity with good expression levels, and lowered ⁇ 2-3-sialidase or donor substrate hydrolysis activity. Surprisingly, the present invention meets this and other needs.
- the present invention provides an isolated glycosyltransferase, wherein the amino acid of the glycosyltransferase corresponding to position 120 of SEQ ID NO:1 is any amino acid other than M, the amino acid the glycosyltransferase corresponding to position 247 of SEQ ID NO:1 is any amino acid other than E, or the amino acid the glycosyltransferase corresponding to position 289 of SEQ ID NO:1 is any amino acid other than R.
- the glycosyltransferase of the present invention has decreased ⁇ 2-3 sialidase or donor substrate hydrolysis activity compared to a control glycosyltransferase, wherein the amino acid of the control glycosyltransferase corresponding to position 120 of SEQ ID NO:1 is M, the amino acid of the control glycosyltransferase corresponding to position 247 of SEQ ID NO:1 is E, and the amino acid of the control glycosyltransferase corresponding to position 289 of SEQ ID NO:1 is R.
- the glycosyltransferase of the present invention can be a member of the glycosyltransferase family 80 (GT80).
- the present invention provides a recombinant nucleic acid encoding an isolated glycosyltransferase of the present invention.
- the present invention provides a cell including a recombinant nucleic acid of the present invention.
- the present invention provide a method of preparing an oligosaccharide, the method including forming a reaction mixture including an acceptor sugar, a donor substrate of a sugar moiety and a nucleotide, and the glycosyltransferase of the present invention, under conditions sufficient to transfer the sugar moiety from the donor substrate to the acceptor sugar, thereby forming the oligosaccharide.
- FIG. 1A-1B show the ternary crystal structure of PmST1 (PDB ID: 21HZ) with bound CMP-3F(axial)-Neu5Ac and lactose ( Figure A) and the structure of the modeled PmST1 double mutant E271F/R313Y of PmST1 wild type sequence SEQ ID NO: 13 ( Figure B). The mutation sites are underlined. The mutant structure was obtained from automated homology modeling using Swiss-Model.
- FIG. 2 shows acceptor substrate specificity data for the ⁇ 2-3-sialyltransferase activity of wild-type PmST1 (white columns) and its double mutant E271F/R313Y of PmST1 wild type sequence SEQ ID No: 13 (black columns).
- FIG. 3 shows thermal stability data for the ⁇ 2-3-sialyltransferase activity of wild-type PmST1 (white columns) and its double mutant E271F/R313Y of PmST1 wild type sequence SEQ ID No: 13 (black columns).
- FIG. 4 shows HPLC-based time course studies of PmST1-catalyzed ⁇ 2-3-sialylation of Lewis x trisaccharide (1 mM) with periodical addition of sialyltransferase donor CMP-Neu5Ac (indicated by arrows). Numbers in parentheses represent the % consumption of CMP-Neu5Ac by capillary electrophoresis (CE) assays.
- CE capillary electrophoresis
- FIG. 5 illustrates that water (in the donor hydrolysis reaction) competes with Lewis' (in PmST1-catalyzed ⁇ 2-3-sialylation reaction) for the consumption of CMP-Neu5Ac.
- FIG. 6 shows the SDS-PAGE analysis of the M144D mutant of PmST1 wild type sequence SEQ ID No: 13.
- Lane 1 Protein marker
- Lane 2 Whole cells before induction
- Lane 3 Whole cells after induction
- Lane 4 Cell lysate
- Lane 5 Purified fraction.
- FIG. 7A-7C show the structural comparison between wild-type (WT) PmST1 and M144D mutant of PmST1 wild type sequence SEQ ID No: 13 with bound CMP.
- FIG. 7A shows the overall structure alignment of WT PmST1 and the PmST1 M144D mutant, both with CMP bound.
- FIG. 7B shows the stereo view of the superposition near the active site for WT PmST1 and the M144D mutant with bound CMP-3F(a)-Neu5Ac (a donor substrate analog) and lactose acceptor.
- FIG. 7C shows the active site of the ternary crystal structure of PmST1 (PDB 1D: 21HZ) with bound CMP-3F(axial)-Neu5Ac and lactose.
- FIG. 8 shows 15 N- 1 H HSQC NMR spectra of 15 N-labeled PmST1 (WT versus M144D mutant of PmST1 wild type sequence SEQ ID NO: 13; as well as apo versus CMP-bound).
- FIG. 9 shows the one-pot three-enzyme synthesis of sialyl Le x ⁇ ProN 3 (SLe x ⁇ ProN 3 ) containing different forms of sialic acids from Le x ⁇ ProN 3 .
- Aldolase refers to Pasteurella multocida sialic acid aldolase
- NmCSS refers to Neisseria meningitidis CMP-sialic acid synthetase.
- FIG. 10 shows amino acid sequences alignment of GT80 sialyltransferases.
- Pp_Pst3-1 GenBank accession number BAF63530
- Psp_Pst3-2 GenBank accession number BAF92025
- Vsp_2,3ST GeneBank accession number BAF91160
- P1ST6_JT-1 GenBank accession number BAF91416
- P1ST6_JT-2 P1ST6_JT-2
- Pd2,6ST GeneBank accession number BAA25316
- Psp_pst6-1 GeneBank accession number BAF92026
- Pm0188Ph GeneBank accession number DQ087233
- Hd0053P GenBank accession number AAP95068
- the present invention provides alpha2-3 sialyltransferase mutants of PmST1 having reduced alpha2-3 sialidase or donor substrate hydrolysis, useful for the preparation of oligosaccharides, and can be tolerant of fucosylated oligosaccharides.
- the mutations described herein can be incorporated into a variety of sialyltransferases to produce mutants having reduced sialidase or donor substrate activity.
- glycosyltransferase refers to a polypeptide that catalyzes the formation of a glycoside or an oligosaccharide from a donor substrate and an acceptor or acceptor sugar.
- a glycosyltransferase catalyzes the transfer of the monosaccharide moiety of the donor substrate to a hydroxyl group of the acceptor.
- the covalent linkage between the monosaccharide and the acceptor sugar can be a 1-4 linkage, a 1-3 linkage, a 1-6-linkage, a 1-2 linkage, a 2-3-linkage, a 2-6-linkage, a 2-8-linkage, or a 2-9-linkage.
- the linkage may be in the ⁇ - or ⁇ -configuration with respect to the anomeric carbon of the monosaccharide.
- Other types of linkages may be formed by the glycosyltransferases in the methods of the invention.
- Glycosyltransferases include, but are not limited to, sialyltransferases, heparosan synthases (HSs), glucosaminyltransferases, N-acetylglucosaminyltransferases, glucosyltransferases, glucuronyltransferases, N-acetylgalactosaminyltransferases, galactosyltransferases, galacturonyltransferases, fucosyltransferases, mannosyltransferases, xylosyltransferases.
- Sialyltransferases are enzymes that catalyze the transfer of sialic acid, or analogs thereof, to a monosaccharide or an oligosaccharide.
- Alpha2-3-sialidase refers to an enzyme that catalyzes the hydrolysis of alpha2-3-glycosidic linkages of terminal sialic acids on oligosaccharides.
- Donor substrate hydrolysis refers to hydrolysis of the nucleotide-sugar bond of the donor substrate.
- amino acid refers to any monomer unit that can be incorporated into a peptide, polypeptide, or protein.
- amino acid includes the following twenty natural or genetically encoded alpha-amino acids: alanine (Ala or A), arginine (Arg or R), asparagine (Asn or N), aspartic acid (Asp or D), cysteine (Cys or C), glutamine (Gln or Q), glutamic acid (Glu or E), glycine (Gly or G), histidine (His or H), isoleucine (Ile or I), leucine (Leu or L), lysine (Lys or K), methionine (Met or M), phenylalanine (Phe or F), proline (Pro or P), serine (Ser or S), threonine (Thr or T), tryptophan (Trp or W), tyrosine (Tyr or Y), and va
- amino acid also includes unnatural amino acids, modified amino acids (e.g., having modified side chains and/or backbones), and amino acid analogs.
- Polypeptide “peptide,” and “protein” are used interchangeably herein to refer to a polymer of amino acid residues. All three terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers and non-naturally occurring amino acid polymers. As used herein, the terms encompass amino acid chains of any length, including full-length proteins, wherein the amino acid residues are linked by covalent peptide bonds.
- mutant in the context of glycosyltransferases of the present invention, means a polypeptide, typically recombinant, that comprises one or more amino acid substitutions relative to a corresponding, naturally-occurring or unmodified glycosyltransferase, such as an alpha2-3 sialyltransferase.
- corresponding to another sequence (e.g., regions, fragments, nucleotide or amino acid positions, or the like) is based on the convention of numbering according to nucleotide or amino acid position number and then aligning the sequences in a manner that maximizes the percentage of sequence identity.
- glycosyltransferase polypeptide sequence differs from SEQ ID NO:1, 13, 15, 17, 19, 21, 23, 25, 27 or 29 (e.g., by changes in amino acids or addition or deletion of amino acids), it may be that a particular mutation associated with improved activity as discussed herein will not be in the same position number as it is in SEQ ID NO:1, 13, 15, 17, 19, 21, 23, 25, 27 or 29.
- a nucleic acid is “operably linked” when it is placed into a functional relationship with another nucleic acid sequence.
- a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation.
- percent sequence identity is determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the sequence in the comparison window can comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity.
- nucleic acids or polypeptide sequences refer to two or more sequences or subsequences that are the same. Sequences are “substantially identical” to each other if they have a specified percentage of nucleotides or amino acid residues that are the same (e.g., at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% identity over a specified region), when compared and aligned for maximum correspondence over a comparison window, or designated region as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. These definitions also refer to the complement of a test sequence. Optionally, the identity exists over a region that is at least about 50 nucleotides in length, or more typically over a region that is 100 to
- similarity in the context of two or more polypeptide sequences, refer to two or more sequences or subsequences that have a specified percentage of amino acid residues that are either the same or similar as defined by a conservative amino acid substitutions (e.g., 60% similarity, optionally 65%, 70%, 75%, 80%, 85%, 90%, or 95% similar over a specified region), when compared and aligned for maximum correspondence over a comparison window, or designated region as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection.
- a conservative amino acid substitutions e.g., 60% similarity, optionally 65%, 70%, 75%, 80%, 85%, 90%, or 95% similar over a specified region
- Sequences are “substantially similar” to each other if they are at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, or at least 55% similar to each other.
- this similarly exists over a region that is at least about 50 amino acids in length, or more typically over a region that is at least about 100 to 500 or 1000 or more amino acids in length.
- sequence comparison typically one sequence acts as a reference sequence, to which test sequences are compared.
- test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters are commonly used, or alternative parameters can be designated.
- sequence comparison algorithm then calculates the percent sequence identities or similarities for the test sequences relative to the reference sequence, based on the program parameters.
- a “comparison window,” as used herein, includes reference to a segment of any one of the number of contiguous positions selected from the group consisting of from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned.
- Methods of alignment of sequences for comparison are well known in the art. Optimal alignment of sequences for comparison can be conducted, for example, by the local homology algorithm of Smith and Waterman ( Adv. Appl. Math. 2:482, 1970), by the homology alignment algorithm of Needleman and Wunsch ( J. Mol. Biol. 48:443, 1970), by the search for similarity method of Pearson and Lipman ( Proc.
- HSPs high scoring sequence pairs
- T is referred to as the neighborhood word score threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always ⁇ 0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score.
- the BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment.
- W wordlength
- E expectation
- the BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin and Altschul, Proc. Natl. Acad. Sci. USA 90:5873-87, 1993).
- One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance.
- P(N) the smallest sum probability
- a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, typically less than about 0.01, and more typically less than about 0.001.
- Recombinant refers to an amino acid sequence or a nucleotide sequence that has been intentionally modified by recombinant methods.
- recombinant nucleic acid herein is meant a nucleic acid, originally formed in vitro, in general, by the manipulation of a nucleic acid by endonucleases, in a form not normally found in nature.
- an isolated, mutant glycosyltransferase nucleic acid, in a linear form, or an expression vector formed in vitro by ligating DNA molecules that are not normally joined are both considered recombinant for the purposes of this invention.
- a “recombinant protein” is a protein made using recombinant techniques, i.e., through the expression of a recombinant nucleic acid as depicted above.
- vector refers to a piece of DNA, typically double-stranded, which may have inserted into it a piece of foreign DNA.
- the vector may be, for example, of plasmid origin.
- Vectors contain “replicon” polynucleotide sequences that facilitate the autonomous replication of the vector in a host cell.
- Foreign DNA is defined as heterologous DNA, which is DNA not naturally found in the host cell, which, for example, replicates the vector molecule, encodes a selectable or screenable marker, or encodes a transgene.
- the vector is used to transport the foreign or heterologous DNA into a suitable host cell.
- the vector can replicate independently of or coincidental with the host chromosomal DNA, and several copies of the vector and its inserted DNA can be generated.
- the vector can also contain the necessary elements that permit transcription of the inserted DNA into an mRNA molecule or otherwise cause replication of the inserted DNA into multiple copies of RNA.
- Some expression vectors additionally contain sequence elements adjacent to the inserted DNA that increase the half-life of the expressed mRNA and/or allow translation of the mRNA into a protein molecule. Many molecules of mRNA and polypeptide encoded by the inserted DNA can thus be rapidly synthesized.
- nucleotide in addition to referring to the naturally occurring ribonucleotide or deoxyribonucleotide monomers, shall herein be understood to refer to related structural variants thereof, including derivatives and analogs, that are functionally equivalent with respect to the particular context in which the nucleotide is being used (e.g., hybridization to a complementary base), unless the context clearly indicates otherwise.
- nucleic acid refers to a polymer that can be corresponded to a ribose nucleic acid (RNA) or deoxyribose nucleic acid (DNA) polymer, or an analog thereof.
- RNA ribose nucleic acid
- DNA deoxyribose nucleic acid
- polymers of nucleotides such as RNA and DNA, as well as synthetic forms, modified (e.g., chemically or biochemically modified) forms thereof, and mixed polymers (e.g., including both RNA and DNA subunits).
- Exemplary modifications include methylation, substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications such as uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoamidates, carbamates, and the like), pendent moieties (e.g., polypeptides), intercalators (e.g., acridine, psoralen, and the like), chelators, alkylators, and modified linkages (e.g., alpha anomeric nucleic acids and the like). Also included are synthetic molecules that mimic polynucleotides in their ability to bind to a designated sequence via hydrogen bonding and other chemical interactions.
- internucleotide modifications such as uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoamidates, carbamates, and the like), pendent moieties (e.g., polypeptides), intercalators (e.g.,
- nucleotide monomers are linked via phosphodiester bonds, although synthetic forms of nucleic acids can comprise other linkages (e.g., peptide nucleic acids as described in Nielsen et al. (Science 254:1497-1500, 1991).
- a nucleic acid can be or can include, e.g., a chromosome or chromosomal segment, a vector (e.g., an expression vector), an expression cassette, a naked DNA or RNA polymer, the product of a polymerase chain reaction (PCR), an oligonucleotide, a probe, and a primer.
- PCR polymerase chain reaction
- a nucleic acid can be, e.g., single-stranded, double-stranded, or triple-stranded and is not limited to any particular length. Unless otherwise indicated, a particular nucleic acid sequence comprises or encodes complementary sequences, in addition to any sequence explicitly indicated.
- oligosaccharide refers to a compound containing at least two sugars covalently linked together. Oligosaccharides include disaccharides, trisaccharides, tetrasachharides, pentasaccharides, hexasaccharides, heptasaccharides, octasaccharides, and the like. Covalent linkages generally consist of glycosidic linkages (i.e., C—O—C bonds) formed from the hydroxyl groups of adjacent sugars.
- Linkages can occur between the 1-carbon and the 4-carbon of adjacent sugars (i.e., a 1-4 linkage), the 1-carbon and the 3-carbon of adjacent sugars (i.e., a 1-3 linkage), the 1-carbon and the 6-carbon of adjacent sugars (i.e., a 1-6 linkage), or the 1-carbon and the 2-carbon of adjacent sugars (i.e., a 1-2 linkage).
- a sugar can be linked within an oligosaccharide such that the anomeric carbon is in the ⁇ - or ⁇ -configuration.
- the oligosaccharides prepared according to the methods of the invention can also include linkages between carbon atoms other than the 1-, 2-, 3-, 4-, and 6-carbons.
- acceptor sugar refers a sugar that accepts the sugar being added.
- the acceptor sugar can be an oligosaccharide, such as a fucosylated oligosaccharide, that accepts a sialic acid or analog thereof.
- Donor substrate refers to a compound having a nucleotide and the sugar that is added to the acceptor, where the sugar and nucleotide are covalently bound together.
- the sugar can be sialic acid or analogs thereof.
- the nucleotide can be any suitable nucleotide such as cytidine monophosphate (CMP).
- sialic acid aldolase refers to an aldolase that prepares sialic acid using pyruvate and N-acetyl mannose (ManNAc).
- the present invention includes a variety of sialyltransferases with reduced sialidase and/or donor substrate hydrolysis activity.
- Sialyltransferases are one class of glycosyltransferases, enzymes that catalyze the transfer of a sugar from a nucleotide-sugar complex (donor substrate) to an acceptor, a mono, di or oligosaccharide.
- Sialyltransferases catalyze the transfer of N-acetylneuraminic acid, and analogs thereof, from a sialic acid-nucleotide complex, the donor substrate, to the terminal sugar of the acceptor which can be a monosaccharide, an oligosaccharide, a glycolipid, a glycopeptide, or a glycoprotein.
- sialyltransferases include, but are not limited to, sialyltransferases in family EC 2.4.99, such as beta-galactosamide alpha-2,6-sialyltransferase (EC 2.4.99.1), alpha-N-acetylgalactosaminide alpha-2,6-sialyltransferase (EC 2.4.99.3), beta-galactoside alpha-2,3-sialyltransferase (EC 2.4.99.4), N-acetyllactosaminide alpha-2,3-sialyltransferase (EC 2.4.99.6), alpha-N-acetyl-neuraminide alpha-2,8-sialyltransferase (EC 2.4.99.8); lactosylceramide alpha-2,3-sialyltransferase (EC 2.4.99.9).
- the sialyltransferases of the present invention also include those of the CAZy GT80 family, or EC 2.4.99.4, drawn to alpha2-3 and alpha2-6 sialyltransferases, as well as sialyltransferases in the GT29, GT30, GT38, GT42, GT52, and GT73 families.
- Representative GT80 sialyltransferases include, but are not limited to, PmST1, Psp26ST, Vsp23ST, Pd26ST, P1ST6 JT-1, P1ST6 JT-2, Pp Pst3-1, Pp Pst3-2, Np23ST and Hd0053. (See Glycobiology 201, 21(6), 716; J. Mol. Biol. 2003, 328, 307; Annu. Rev. Biochem. 2008, 77, 521; Appl. Microbiol. Biotechnol. 2012, 94, 887 for review of sialyltransferases.)
- glycosyltransferases of the present invention include those having decreased ⁇ 2-3 sialidase or donor substrate hydrolysis activity compared to a control glycosyltransferase.
- ⁇ 2-3 sialidase activity refers to the back reaction starting from the product oligosaccharide, cleaving the glycosidic bond between the sugar from the donor substrate and the acceptor, resulting in the donor substrate and the acceptor.
- the glycosyltransferase can be an ⁇ 2-3-sialyltransferase.
- the ⁇ 2-3-sialyltransferases of the present invention can include sialyltransferases of Pasteurella multocida .
- the glycosyltransferases of the present invention can have a motif in the sialyltransferase domain including at least one of sialyltransferase motif A (YDDGS, corresponding to positions 139-143 of PmST1 wild type, SEQ ID NO:13) and sialyltransferase motif B (KGH, corresponding to positions 309-311 of PmST1 wild type, SEQ ID NO:13).
- the glycosyltransferases of the present invention can include a polypeptide having any suitable percent identity to the control sequence.
- the glycosyltransferases of the present invention can include a polypeptide having a percent sequence identity to the control glycosyltransferase sequence of at least 20, 30, 40, 50, 55, 60, 65, 70, 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or at least 99%.
- percent sequence identity can be at least 80%.
- percent sequence identity can be at least 90%.
- percent sequence identity can be at least 95%.
- the glycosyltransferase includes a polypeptide sequence having at least 80% sequence identity to SEQ ID NO:1.
- the isolated glycosyltransferase includes a polypeptide sequence of SEQ ID NO:3 (M120D), SEQ ID NO:5 (M120H), SEQ ID NO:7 (E247F), SEQ ID NO: 9 (R289Y) or SEQ ID NO: 11 (E247F/R289Y).
- glycosyltransferases can vary, so the precise amino acid positions corresponding to each mutation can vary depending on the particular control glycosyltransferase used.
- Amino acid and nucleic acid sequence alignment programs are readily available (see, e.g., those referred to supra) and, given the particular motifs identified herein, serve to assist in the identification of the exact amino acids (and corresponding codons) for modification in accordance with the present invention.
- the positions of several mutations are shown in the table below for the PmST1 wild type sequence (SEQ ID NO:13) and the ⁇ 24PmST1 (SEQ ID NO:1) sequence.
- PmST1 wild type ⁇ 24PmST1 Mutation (SEQ ID NO: 13) (SEQ ID No: 1) 1 M144D M120D 2 M144H M120H 3 E271F E247F 4 R313Y R289Y 5 E271F/R313Y E247F/R289Y
- amino acid position 144 in the PmST1 wild type sequence corresponds to position 120 of the ⁇ 24PmST1 sequence (SEQ ID NO: 1).
- the control glycosyltransferases of the present invention includes any suitable glycosyltransferase or sialyltransferase.
- the glycosyltransferases of the present invention includes mutants corresponding to any position of PmST1 wild type sequence (SEQ ID NO:13) and ⁇ 24PmST1 (SEQ ID NO:1) (see Biochemistry 2006, 45(7), 2139, and 2007, 46(21), 6288).
- the glycosyltransferases of the present invention include, but are not limited to, mutants at at least one of positions 120, 247 and 289 of ⁇ 24PmST1 (SEQ ID NO:1).
- glycosyltransferases include mutants at at least one of positions 144, 271 and 313 of PmST1 wild type sequence (SEQ ID NO:13).
- the mutants can include any suitable amino acid other than the native amino acid.
- the amino acid can be V, I, L, M, F, W, P, S, T, A, G, C, Y, N, Q, D, E, K, R, or H.
- the control glycosyltransferase can be the PmST1 wild type sequence (SEQ ID NO:13) or the ⁇ 24PmST1 (SEQ ID NO:1).
- the control glycosyltransferase can be ⁇ 24PmST1 (SEQ ID NO:1).
- the present invention provides an isolated glycosyltransferase, wherein the amino acid of the glycosyltransferase corresponding to position 120 of SEQ ID NO:1 is any amino acid other than M, the amino acid the glycosyltransferase corresponding to position 247 of SEQ ID NO:1 is any amino acid other than E, or the amino acid the glycosyltransferase corresponding to position 289 of SEQ ID NO:1 is any amino acid other than R.
- the glycosyltransferase of the present invention has decreased ⁇ 2-3 sialidase or donor substrate hydrolysis activity compared to a control glycosyltransferase, wherein the amino acid of the control glycosyltransferase corresponding to position 120 of SEQ ID NO:1 is M, the amino acid of the control glycosyltransferase corresponding to position 247 of SEQ ID NO:1 is E, and the amino acid of the control glycosyltransferase corresponding to position 289 of SEQ ID NO:1 is R.
- the glycosyltransferase of the present invention can be a member of the glycosyltransferase family 80 (GT80).
- the isolated glycosyltransferase has decreased ⁇ 2-3 sialidase activity, and includes at least one of the amino acid corresponding to position 247 of SEQ ID NO:1 is any amino acid other than E, and the amino acid corresponding to position 289 of SEQ ID NO:1 is any amino acid other than R.
- Decreased ⁇ 2-3 sialidase activity can be measured by the ratio of ⁇ 2-3 sialidase activity for the control glycosyltransferase to the ⁇ 2-3 sialidase activity of the isolated glycosyltransferase.
- the ratio can be at least 2:1, 3:1, 4:1, 5:1, 10:1, 20:1, 30:1, 40:1, 50:1, 100:1, 200:1, 300:1, 400:1, 500:1 or at least 1000:1. In some embodiments, the ratio is at least 5:1. In some embodiments, the ratio is at least 10:1. In some embodiments, the ratio is at least 100:1. In some embodiments, the ratio is at least 1000:1.
- the isolated glycosyltransferase having decreased ⁇ 2-3 sialidase activity includes the amino acid corresponding to position 247 of SEQ ID NO:1 is any amino acid other than E, and the amino acid corresponding to position 289 of SEQ ID NO:1 is any amino acid other than R.
- the isolated glycosyltransferase having decreased ⁇ 2-3 sialidase activity includes the amino acid corresponding to position 117 of SEQ ID NO:1 is D or E. In some embodiments, the isolated glycosyltransferase having decreased ⁇ 2-3 sialidase activity includes the amino acid corresponding to position 117 of SEQ ID NO:1 is A, G, V, L or I. In some embodiments, the isolated glycosyltransferase having decreased ⁇ 2-3 sialidase activity includes the amino acid corresponding to position 287 of SEQ ID NO:1 is H, K, R, W or F.
- glycosyltransferases of the present invention have decreased donor substrate hydrolysis activity.
- Decreased donor substrate hydrolysis activity can be measured by the ratio of donor substrate hydrolysis activity for the control glycosyltransferase to the donor substrate hydrolysis activity of the isolated glycosyltransferase. The ratio can be at least 2:1, 3:1, 4:1, 5:1, 6:1, 7:1, 8:1, 9:1 or at least 10:1.
- the isolated glycosyltransferase has decreased donor substrate hydrolysis activity, wherein the amino acid corresponding to position 120 of SEQ ID NO:1 is any amino acid other than M.
- the ratio of donor substrate hydrolysis activity for the control ⁇ 2-3 sialidase to the donor substrate hydrolysis activity of the isolated glycosyltransferase is at least 2:1.
- the amino acid corresponding to position 120 of SEQ ID NO:1 can be any amino acid of V, I, L, F, W, P, S, T, A, G, C, Y, N, Q, D, E, K, R, or H. In some embodiments, the amino acid corresponding to position 120 of SEQ ID NO:1 can be any amino acid of D, E, H, K or R. In some embodiments, the amino acid corresponding to position 120 of SEQ ID NO:1 can be any amino acid of D or H. In some embodiments, the amino acid corresponding to position 120 of SEQ ID NO:1 can be amino acid D. In some embodiments, the amino acid corresponding to position 120 of SEQ ID NO:1 can be amino acid H.
- the amino acid corresponding to position 247 of SEQ ID NO:1 can be any amino acid of V, I, L, M, F, W, P, S, T, A, G, C, Y, N, Q, D, K, R, or H. In some embodiments, the amino acid corresponding to position 247 of SEQ ID NO:1 can be any amino acid of F, Y or W. In some embodiments, the amino acid corresponding to position 247 of SEQ ID NO:1 can be amino acid F.
- the amino acid corresponding to position 289 of SEQ ID NO:1 can be any amino acid of V, I, L, M, F, W, P, S, T, A, G, C, Y, N, Q, D, E, K, and H. In some embodiments, the amino acid corresponding to position 289 of SEQ ID NO:1 can be any amino acid of Y, F or W. In some embodiments, the amino acid corresponding to position 289 of SEQ ID NO:1 can be amino acid Y.
- the glycosyltransferases of the present invention can have one or more mutations.
- the glycosyltransferase includes the amino acid corresponding to position 247 of SEQ ID NO:1 can be any amino acid of F, Y or W, and the amino acid corresponding to position 289 of SEQ ID NO:1 can be any amino acid of Y, F or W.
- the amino acid corresponding to position 247 of SEQ ID NO:1 can be amino acid F
- the amino acid corresponding to position 289 of SEQ ID NO:1 can be amino acid Y.
- the isolated glycosyltransferase can be the amino acid corresponding to position 120 of SEQ ID NO:1 is D, E, H, K or R, the amino acid corresponding to position 247 of SEQ ID NO:1 is F, Y or W, or the amino acid corresponding to position 289 of SEQ ID NO:1 is Y, F or W.
- the isolated glycosyltransferase can be the amino acid corresponding to position 120 of SEQ ID NO:1 is D or H, the amino acid corresponding to position 247 of SEQ ID NO:1 is F, or the amino acid corresponding to position 289 of SEQ ID NO:1 is Y.
- the glycosyltransferases of the present invention can be constructed by mutating the DNA sequences that encode the corresponding unmodified glycosyltransferase (e.g., a wild-type glycosyltransferase or a corresponding variant from which the glycosyltransferase of the invention is derived), such as by using techniques commonly referred to as site-directed mutagenesis.
- Nucleic acid molecules encoding the unmodified form of the glycosyltransferase can be mutated by a variety of techniques well-known to one of ordinary skill in the art. (See, e.g., PCR Strategies (M. A. Innis, D. H. Gelfand, and J. J.
- the two primer system utilized in the Transformer Site-Directed Mutagenesis kit from Clontech, may be employed for introducing site-directed mutants into a polynucleotide encoding an unmodified form of the glycosyltransferase.
- two primers are simultaneously annealed to the plasmid; one of these primers contains the desired site-directed mutation, the other contains a mutation at another point in the plasmid resulting in elimination of a restriction site.
- Second strand synthesis is then carried out, tightly linking these two mutations, and the resulting plasmids are transformed into a mutS strain of E. coli .
- Plasmid DNA is isolated from the transformed bacteria, restricted with the relevant restriction enzyme (thereby linearizing the unmutated plasmids), and then retransformed into E. coli .
- This system allows for generation of mutations directly in an expression plasmid, without the necessity of subcloning or generation of single-stranded phagemids.
- the tight linkage of the two mutations and the subsequent linearization of unmutated plasmids result in high mutation efficiency and allow minimal screening.
- this method requires the use of only one new primer type per mutation site.
- a set of “designed degenerate” oligonucleotide primers can be synthesized in order to introduce all of the desired mutations at a given site simultaneously.
- Transformants can be screened by sequencing the plasmid DNA through the mutagenized region to identify and sort mutant clones. Each mutant DNA can then be restricted and analyzed by electrophoresis, such as for example, on a Mutation Detection Enhancement gel (Mallinckrodt Baker, Inc., Phillipsburg, N.J.) to confirm that no other alterations in the sequence have occurred (by band shift comparison to the unmutagenized control).
- the entire DNA region can be sequenced to confirm that no additional mutational events have occurred outside of the targeted region.
- Verified mutant duplexes in pET (or other) overexpression vectors can be employed to transform E. coli such as, e.g., strain E. coli BL21 (DE3) pLysS, for high level production of the mutant protein, and purification by standard protocols.
- the method of FAB-MS mapping for example, can be employed to rapidly check the fidelity of mutant expression. This technique provides for sequencing segments throughout the whole protein and provides the necessary confidence in the sequence assignment. In a mapping experiment of this type, protein is digested with a protease (the choice will depend on the specific region to be modified since this segment is of prime interest and the remaining map should be identical to the map of unmutated protein).
- the set of cleavage fragments is fractionated by, for example, microbore HPLC (reversed phase or ion exchange, again depending on the specific region to be modified) to provide several peptides in each fraction, and the molecular weights of the peptides are determined by standard methods, such as FAB-MS.
- the determined mass of each fragment are then compared to the molecular weights of peptides expected from the digestion of the predicted sequence, and the correctness of the sequence quickly ascertained. Since this mutagenesis approach to protein modification is directed, sequencing of the altered peptide should not be necessary if the MS data agrees with prediction.
- CAD-tandem MS/MS can be employed to sequence the peptides of the mixture in question, or the target peptide can be purified for subtractive Edman degradation or carboxypeptidase Y digestion depending on the location of the modification.
- Mutant glycosyltransferases with at least one amino acid substituted can be generated in various ways. In the case of amino acids located close together in the polypeptide chain, they may be mutated simultaneously using one oligonucleotide that codes for all of the desired amino acid substitutions. If however, the amino acids are located some distance from each other (separated by more than ten amino acids, for example) it is more difficult to generate a single oligonucleotide that encodes all of the desired changes. Instead, one of two alternative methods may be employed. In the first method, a separate oligonucleotide is generated for each amino acid to be substituted.
- the oligonucleotides are then annealed to the single-stranded template DNA simultaneously, and the second strand of DNA that is synthesized from the template will encode all of the desired amino acid substitutions.
- An alternative method involves two or more rounds of mutagenesis to produce the desired mutant. The first round is as described for the single mutants: DNA encoding the unmodified glycosyltransferase is used for the template, an oligonucleotide encoding the first desired amino acid substitution(s) is annealed to this template, and the heteroduplex DNA molecule is then generated. The second round of mutagenesis utilizes the mutated DNA produced in the first round of mutagenesis as the template. Thus, this template already contains one or more mutations.
- the oligonucleotide encoding the additional desired amino acid substitution(s) is then annealed to this template, and the resulting strand of DNA now encodes mutations from both the first and second rounds of mutagenesis.
- This resultant DNA can be used as a template in a third round of mutagenesis, and so on.
- the multi-site mutagenesis method of Seyfang & Jin Anal. Biochem. 324:285-291. 2004 may be utilized.
- nucleic acids optionally isolated, encoding any of the glycosyltransferases of the present invention (e.g., glycosyltransferases comprising any of SEQ ID NOs:4, 6, 8, 10 and 12).
- a nucleic acid of the present invention encoding a glycosyltransferase of the invention
- vectors can be made. Any vector containing replicon and control sequences that are derived from a species compatible with the host cell can be used in the practice of the invention.
- expression vectors include transcriptional and translational regulatory nucleic acid regions operably linked to the nucleic acid encoding the mutant glycosyltransferase.
- control sequences refers to DNA sequences necessary for the expression of an operably linked coding sequence in a particular host organism.
- the control sequences that are suitable for prokaryotes include a promoter, optionally an operator sequence, and a ribosome binding site.
- the vector may contain a Positive Retroregulatory Element (PRE) to enhance the half-life of the transcribed mRNA (see Gelfand et al. U.S. Pat. No. 4,666,848).
- PRE Positive Retroregulatory Element
- the transcriptional and translational regulatory nucleic acid regions will generally be appropriate to the host cell used to express the glycosyltransferase. Numerous types of appropriate expression vectors, and suitable regulatory sequences are known in the art for a variety of host cells.
- the transcriptional and translational regulatory sequences may include, e.g., promoter sequences, ribosomal binding sites, transcriptional start and stop sequences, translational start and stop sequences, and enhancer or activator sequences.
- the regulatory sequences include a promoter and transcriptional start and stop sequences.
- Vectors also typically include a polylinker region containing several restriction sites for insertion of foreign DNA.
- “fusion flags” are used to facilitate purification and, if desired, subsequent removal of tag/flag sequence, e.g., “His-Tag”. However, these are generally unnecessary when purifying an thermoactive and/or thermostable protein from a mesophilic host (e.g., E.
- coli where a “heat-step” may be employed.
- suitable vectors containing DNA encoding replication sequences, regulatory sequences, phenotypic selection genes, and the mutant glycosyltransferase of interest are prepared using standard recombinant DNA procedures. Isolated plasmids, viral vectors, and DNA fragments are cleaved, tailored, and ligated together in a specific order to generate the desired vectors, as is well-known in the art (see, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, New York, N.Y., 2nd ed. 1989)).
- the present invention provides a recombinant nucleic acid encoding an isolated glycosyltransferase of the present invention.
- the expression vector contains a selectable marker gene to allow the selection of transformed host cells.
- Selection genes are well known in the art and will vary with the host cell used. Suitable selection genes can include, for example, genes coding for ampicillin and/or tetracycline resistance, which enables cells transformed with these vectors to grow in the presence of these antibiotics.
- a nucleic acid encoding a glycosyltransferase of the invention is introduced into a cell, either alone or in combination with a vector.
- introduction into or grammatical equivalents herein is meant that the nucleic acids enter the cells in a manner suitable for subsequent integration, amplification, and/or expression of the nucleic acid.
- the method of introduction is largely dictated by the targeted cell type. Exemplary methods include CaPO 4 precipitation, liposome fusion, LIPOFECTIN®, electroporation, viral infection, and the like.
- prokaryotes are used as host cells for the initial cloning steps of the present invention.
- Other host cells include, but are not limited to, eukaryotic (e.g., mammalian, plant and insect cells), or prokaryotic (bacterial) cells.
- Exemplary host cells include, but are not limited to, Escherichia coli, Saccharomyces cerevisiae, Pichia pastoris , Sf9 insect cells, and CHO cells. They are particularly useful for rapid production of large amounts of DNA, for production of single-stranded DNA templates used for site-directed mutagenesis, for screening many mutants simultaneously, and for DNA sequencing of the mutants generated.
- Suitable prokaryotic host cells include E. coli K12 strain 94 (ATCC No.
- E. coli strain W3110 ATCC No. 27,325
- E. coli K12 strain DG116 ATCC No. 53,606
- E. coli X1776 ATCC No. 31,537
- E. coli B E. coli B; however many other strains of E. coli , such as HB101, JM101, NM522, NM538, NM539, and many other species and genera of prokaryotes including bacilli such as Bacillus subtilis , other enterobacteriaceae such as Salmonella typhimurium or Serratia marcesans , and various Pseudomonas species can all be used as hosts.
- Prokaryotic host cells or other host cells with rigid cell walls are typically transformed using the calcium chloride method as described in section 1.82 of Sambrook et al., supra.
- electroporation can be used for transformation of these cells.
- Prokaryote transformation techniques are set forth in, for example Dower, in Genetic Engineering, Principles and Methods 12:275-296 (Plenum Publishing Corp., 1990); Hanahan et al., Meth. Enzymol., 204:63, 1991.
- Plasmids typically used for transformation of E. coli include pBR322, pUCI8, pUCI9, pUCI18, pUC119, and Bluescript M13, all of which are described in sections 1.12-1.20 of Sambrook et al., supra. However, many other suitable vectors are available as well.
- the glycosyltransferases of the present invention are produced by culturing a host cell transformed with an expression vector containing a nucleic acid encoding the glycosyltransferase, under the appropriate conditions to induce or cause expression of the glycosyltransferase.
- Methods of culturing transformed host cells under conditions suitable for protein expression are well-known in the art (see, e.g., Sambrook et al., supra).
- Suitable host cells for production of the glycosyltransferases from lambda pL promoter-containing plasmid vectors include E. coli strain DG116 (ATCC No. 53606) (see U.S. Pat. No. 5,079,352 and Lawyer, F. C.
- the present invention provides a cell including a recombinant nucleic acid of the present invention.
- the cell can be prokaryotes, eukaryotes, mammalian, plant, bacteria or insect cells.
- the glycosyltransferases of the present invention can be used to prepare oligosaccharides, specifically to add N-acetylneuraminic acid (Neu5Ac), other sialic acids, and analogs thereof, to a monosaccharide, an oligosaccharide, a glycolipid, a glycopeptide, or a glycoprotein.
- the glycosyltransferase PmST1 catalyzes the addition of CMP-Neu5Ac to a fucosylated oligosaccharide by transferring the Neu5Ac to the oligosaccharide.
- the present invention provides a method of preparing an oligosaccharide, the method including forming a reaction mixture including an acceptor sugar, a donor substrate containing a sugar moiety and a nucleotide, and the glycosyltransferase of the present invention, under conditions sufficient to transfer the sugar moiety from the donor substrate to the acceptor sugar, thereby forming the oligosaccharide.
- the acceptor sugar can be any suitable oligosaccharide, glycolipid, glycopeptide, or glycoprotein.
- the acceptor sugar is an oligosaccharide
- any suitable oligosaccharide can be used.
- the acceptor sugar can be Gal ⁇ 1-4GlcNAc ⁇ OR, wherein R can H, a sugar or an oligosaccharide.
- the acceptor sugar can be fucosylated, such as Gal ⁇ 1-4(Fuc ⁇ 1-3)GlcNAc ⁇ OR (Lewis x ⁇ OR or Le x ⁇ OR) wherein R can H, a sugar or an oligosaccharide.
- the donor substrate includes a nucleotide and sugar.
- Any nucleotide can be used, include, but are not limited to, adenine, guanine, cytosine, uracil and thymine nucleotides with one, two or three phosphate groups.
- the nucleotide can be cytidine monophosphate (CMP).
- CMP cytidine monophosphate
- the sugar can be any suitable sugar.
- the glycosyltransferase is a sialyltransferase
- the sugar can be N-acetylneuraminic acid or Neu5Ac, other sialic acids and analogs thereof.
- Sialic acid is a general term for N- and O-substituted derivatives of neuraminic acid, and includes, but is not limited to, N-acetyl (Neu5Ac) or N-glycolyl (Neu5Gc) substitutions, as well as O-substitutions including acetyl, lactyl, methyl, sulfate and phosphate, among others.
- the sialic acid can be a compound of the formula:
- R 1 can be H, OH, N 3 , NHC(O)Me, NHC(O)CH 2 OH, NHC(O)CH 2 N 3 , NHC(O)OCH 2 C ⁇ CH 2 , NHC(O)CH 2 F, NHC(O)CH 2 NHCbz, NHC(O)CH 2 OC(O)Me, or NHC(O)CH 2 OBn; and R 2 , R 3 , and R 4 can be independently selected from H, OH, N 3 , OMe, F, OSO 3 ⁇ , OPO 3 H ⁇ , or OC(O)Me.
- the donor substrate can be CMP-Neu5Ac. Other donor substrates are useful in the methods of the present invention.
- the sialic acid can be a compound of the formula:
- the glycosyltransferase can include a polypeptide sequence such as SEQ ID NO:3. (M120D), SEQ ID NO:5 (M120H), SEQ ID NO:7 (E247F), SEQ ID NO:9 (R289Y) or SEQ ID NO:11 (E247F/R289Y).
- the glycosyltransferase can include a polypeptide sequence such as SEQ ID NO:3 (M120D) or SEQ ID NO:5 (M120H).
- the glycosyltransferase can include a polypeptide sequence such as SEQ ID NO:7 (E247F), SEQ ID NO:9 (R289Y) or SEQ ID NO:11 (E247F/R289Y).
- the glycosyltransferases can be, for example, purified, secreted by a cell present in the reaction mixture, or can catalyze the reaction within a cell expressing the glycosyltransferase.
- reaction mixtures comprising the glycosyltransferases as described herein.
- the reaction mixtures can further comprise reagents for use in glycosylation techniques.
- the reaction mixtures comprise a buffer, salts (e.g., Mn 2+ , Mg 2+ ), and labels (e.g., fluorophores).
- the donor substrate can be prepared prior to preparation of the oligosaccharide, or prepared in situ immediately prior to preparation of the oligosaccharide.
- the method of the present invention also includes forming a reaction mixture including a CMP-sialic acid synthetase, cytidine triphosphate, and N-acetylneuraminic acid (Neu5Ac) or a Neu5Ac analog, under conditions suitable to form the CMP-Neu5Ac or CMP-Neu5Ac analog.
- the step of forming the donor substrate and the step of forming the oligosaccharide are performed in one pot.
- the sugar is prepared separately prior to use in the methods of the present invention.
- the sugar can be prepared in situ immediately prior to use in the methods of the present invention.
- the method also includes forming a reaction mixture including a sialic acid aldolase, pyruvic acid or derivatives thereof, and N-acetylmannosamine or derivatives thereof, under conditions suitable to form the Neu5Ac or Neu5Ac analog.
- the step of forming the sugar, the step of forming the donor substrate and the step of forming the oligosaccharide are performed in one pot.
- the oligosaccharide prepared by the method of the present invention can be any suitable oligosaccharide, glycolipid or glycoprotein.
- the oligosaccharide can be an ⁇ 2-3-linked sialyloligosaccharide.
- the oligosaccharide can be a fucosylated oligosaccharide.
- the oligosaccharide can be Neu5Ac ⁇ 2-3Gal ⁇ 1-4(Fuc ⁇ 1-3)GlcNAc ⁇ OR (Sia-Lewis x ⁇ OR or SLe x ⁇ OR) wherein R can be H, a monosaccharide, or an oligosaccharide.
- the oligosaccharide can be Neu5Ac ⁇ 2-3Gal ⁇ 1-4GlcNAc ⁇ OR, wherein R can H, a monosaccharide, or an oligosaccharide.
- Escherichia coli BL21 (DE3) was from Invitrogen (Carlsbad, Calif., USA). Ni 2+ -NTA agarose (nickel-nitrilotriacetic acid agarose) and QIAprep spin miniprep kit were from Qiagen (Valencia, Calif., USA). Bicinchoninic acid (BCA) protein assay kit was from Pierce Biotechnology, Inc. (Rockford, Ill.). QuikChange Multi Site-Directed Mutagenesis Kit was from Agilent Technologies company/Stratagene (Santa Clara, Calif.).
- Site-directed mutagenesis was carried out using the QuikChange multi-site-directed mutagenesis kit from Stratagene according to the manufacturer's protocol.
- the primers used were 5′ACCGGCACGACAACTTGG TTT GGAAATACCGATGTGCG3′ for E271F and 5′ ATCTACTTTAAAGGGCATCCT TAT GGTGGTGAAATTAATGACTAC3′ for R313Y.
- the sites of mutations are underlined.
- the plasmids containing the mutant genes were transformed into E. coli BL21 (DE3).
- the E. coli cells were cultured in LB-rich media (10 g L ⁇ 1 tryptone, 5 g L yeast extract, and 10 g L ⁇ 1 NaCl) supplemented with ampicillin (100 ⁇ g mL ⁇ 1 ).
- IPTG isopropyl-1-thio- ⁇ -D-galactopyranoside
- the incubation of the induced culture was performed at 37° C. for 3 h with vigorous shaking at 250 rpm in a C25KC incubator shaker (New Brunswick Scientific, Edison, N.J.).
- His 6 -tagged mutant proteins were purified from the cell lysate.
- the cell pellet harvested by centrifugation at 4000 rpm for 2 h was resuspended in 20 mL (for cells obtained from one liter culture) of lysis buffer (pH 8.0, 100 mM Tris-HCl containing 0.1% Triton X-100).
- lysis buffer pH 8.0, 100 mM Tris-HCl containing 0.1% Triton X-100.
- lysozyme 50 ⁇ g mL ⁇ 1
- DNaseI 3 ⁇ g mL ⁇ 1
- the cell lysate was obtained as the supernatant after centrifugation at 11,000 rpm for 20 min. Purification of His 6 -tagged proteins from the lysate was achieved using an ⁇ KTA FPLC system (GE Healthcare) equipped with a HisTrapTM FF 5 mL column. The column was pre-equilibrated with 8 column volumes of the binding buffer (5 mM imidazole, 0.5 M NaCl, 50 mM Tris-HCl pH 7.5) prior to lysate loading. After the sample loading, the column was washed with 8 column volumes of the binding and washing buffer (40 mM imidazole, 0.5 M NaCl, 50 mM Tris-HCl pH 7.5).
- Protein elution was carried out with 8 column volumes of the elute buffer (200 mM imidazole, 0.5 M NaCl, 50 mM Tris-HCl pH 7.5). The fractions containing the purified enzyme were collected and stored at 4° C.
- the kinetic assays for the sialidase activity were performed in duplicate in a total volume of 10 ⁇ L in MES buffer (100 mM, pH 5.5) containing different concentrations of Neu5Ac ⁇ 2-3Lac ⁇ MU (0.4, 1.0, 2.0, 4.0, 10.0, 20.0, 40.0, and 60.0 mM) and the mutant proteins (2.5 mg mL ⁇ 1 of D141A, 1.6 mg mL ⁇ 1 of E271F, 1 mg mL ⁇ 1 of R313Y, and 3.2 mg mL ⁇ 1 of E271F/R313Y). All reactions were allowed to proceed at 37° C.
- the kinetic assays were performed in duplicate in reaction mixtures of 10 ⁇ L containing Tris-HCl buffer (100 mM, pH 8.5), a fixed concentration of CMP-Neu5Ac (1 mM), different concentrations of Lac ⁇ MU (0.2, 0.5, 1.0, 2.0, 5.0, and 9.0 mM) and the mutant proteins (2 ⁇ g mL ⁇ 1 of E271F, 2 ⁇ g mL ⁇ 1 of R313Y, and 1.6 ⁇ g mL ⁇ 1 of E271F/R313Y). All reactions were allowed to proceed at 37° C.
- the kinetic assays were performed in duplicate in reaction mixtures of 10 ⁇ L containing Tris-HCl buffer (100 mM, pH 8.5), a fixed concentration of Lac ⁇ MU (1 mM), different concentrations of CMP-Neu5Ac (0.1, 0.2, 0.5, 1.0, 2.0, 5.0, 10.0 and 20.0 mM) and the mutant proteins (2 ⁇ g mL ⁇ 1 of E271F, 2 ⁇ g mL ⁇ 1 of R313Y, and 1.6 ⁇ g mL ⁇ 1 of E271F/R313Y).
- Acceptor substrate specificity assays by HPLC were performed in duplicate in 20 mL of Tris-HCl buffer (100 mM, pH 8.5) containing CMP-Neu5Ac (1 mM), a fluorescent acceptor (1 mM), MgCl2 (20 mM), and an enzyme (2 ⁇ g mL-1, wild-type PmST1 or E271R/R313Y mutant). Reactions were allowed to proceed for 5 min at 37° C. The 4-methylumbelliferone (MU)-labeled fluorescent acceptors and the products formed were detected with excitation at 325 nm and emission at 372 nm.
- MU 4-methylumbelliferone
- the 9-fluorenylmethylcarbamate (Fmoc)-labeled fluorescent acceptors and the products formed were detected with excitation at 262 nm and emission at 313 nm.
- the 2-aminobenzoic acid (2AA)-labeled fluorescent acceptors and the products formed were detected with excitation at 315 nm and emission at 400 nm.
- the designed PmST1 mutants E271F, R313Y, and E271F/R313Y were expressed in E. coli using the same expression condition as the wild-type PmST1 (100 mg L ⁇ 1 culture) and achieved a compatible level of expression (90 mg L ⁇ 1 culture). Similar to the wild-type PmST1, one-step Ni 2+ -column purification was sufficient to provide pure protein (>99%) of the mutants.
- Example 2 A Sialyltransferase Mutant with Decreased Donor Hydrolysis and Reduced Sialidase Activities for Directly Sialylating Lewis x
- Site-directed mutagenesis, expression and purification of PmST1 mutants Site-directed mutagenesis was performed using the QuikChange multi-site-directed mutagenesis kit from Stratagene according to the manufacturer's protocol.
- the primers used were 5′ AATCTTTATGACGATGGCTCA GAT GAATATGTTGATTTAGAAAAAG 3′ for M144D; 5′ AATCTTTATGACGATGGCTCA CAT GAATATGTTGATTTAGAAAAAG 3′ for M144H; 5′ ATCACGCTGTATTTAGATCCT GAT TCCTTACCGGCATTAAATCAG 3′ for A35D; and 5′ ATCACGCTGTATTTAGATCCT CAT TCCTTACCGGCATTAAATCAG 3′ for A35H.
- the expression and purification of the mutants were performed as previously described for the WT PmST1.
- the reactions were stopped by adding 104 of pre-chilled ethanol.
- the mixtures were incubated on ice for 30 min and centrifuged at 13,000 rpm for 5 min.
- the supernatants were diluted with borate buffer (25 mM, pH 9.5) and aliquotes of 54 each were injected to a Beckman Coulter P/ACETM MDQ Capillary Electrophoresis system equipped with a capillary (60 cm ⁇ 75 ⁇ m i.d.) and monitored at 254 nm.
- the apparent kinetic parameters were obtained by fitting the experimental data (the average values of duplicate assay results) into the Michaelis-Menten equation using Grafit 5.0.
- Le x ⁇ MU As the acceptor substrate, the reactions were carried out in duplicate at 37° C. for 9 min (M144D) or 10 min (M144H) in a reaction mixture (10 ⁇ L) containing CAPSO (100 mM, pH 9.5), an enzyme (M144D, 39 ⁇ g mL ⁇ 1 or M144H, 5 ⁇ g mL ⁇ 1 ), and various concentrations of Le x ⁇ MU (1.0, 5.0, 10.0, 15.0, 25.0, and 35.0 mM) with a fixed concentration (1 mM) of CMP-Neu5Ac or various concentrations (0.2, 0.5, 1.0, 2.0, 5.0, 10.0, 20.0, and 40.0 mM) of CMP-Neu5Ac with a fixed concentration (1 mM) of Le x ⁇ MU.
- CAPSO 100 mM, pH 9.5
- an enzyme M144D, 39 ⁇ g mL ⁇ 1 or M144H, 5 ⁇ g mL ⁇ 1
- Reactions were stopped by adding 10 ⁇ L of pre-chilled ethanol. The mixtures were incubated on ice for 30 min and centrifuged at 13,000 rpm for 5 min. The supernatants were diluted with 25% acetonitrile and kept on ice until aliquots of 8 ⁇ L were injected and analyzed by the Shimadzu LC-6AD system equipped with a membrane on-line degasser, a temperature control unit, and a fluorescence detector (Shimadzu RF-10AXL). A reverse-phase Premier C18 column (250 ⁇ 4.6 mm i.d., 5 ⁇ m particle size, Shimadzu) protected with a C18 guard column cartridge was used. The mobile phase was 25% acetonitrile.
- the fluorophore (MU)-labeled compounds were detected by excitation at 325 nm and emission at 372 nm.
- the apparent kinetic parameters were obtained by fitting the experimental data (the average values of duplicate assay results) into the Michaelis-Menten equation using Grafit 5.0.
- the reactions were performed in duplicate in a total volume of 10 ⁇ L at 37° C. for 60 min (M144D) or 15 min (M144H) in MES buffer (100 mM, pH 5.5) containing Neu5Ac ⁇ 2-3Lac ⁇ MU (0.4, 1, 2, 4, 10, 20, 40 and 60 mM) and an enzyme (M144H, 1.36 mg mL ⁇ 1 or M144D, 1.05 mg mL ⁇ 1 ).
- Sample treatment after the reaction and analysis were carried out by HPLC similar to that described above for the ⁇ 2-3-sialyltransferase assays.
- the reactions were carried out in duplicate in a total volume of 10 ⁇ L at 37° C. for 20 hr in MES buffer (100 mM, pH 5.5) containing Neu5Ac ⁇ 2-3Le x ⁇ MU (1 mM) and an enzyme (4 mg mL ⁇ 1 ). Aliquots of 1 ⁇ L were withdrawn at 1 hr, 6 hr and 20 hr, and analyzed by HPLC as described above for the ⁇ 2-3-sialyltransferase assays.
- PmST1 M144D mutant in complex with CMP-3F(a)-Neu5Ac was deposited with a PDB ID code 3S44.
- PmST1 M144D mutant in Tris-HCl buffer (20 mM, pH 7.5) was concentrated to 13 mg mL ⁇ 1 , and CMP-3F(axial)Neu5Ac was added to a final concentration of 2 mM.
- Binary CMP-3F(axial)Neu5Ac crystals were grown by hanging drop with 3 ⁇ L of the sample mixed with an equal volume of reservoir buffer [24% poly(ethylene glycol) 3350, 100 mM HEPES (pH 7.5), 50 mM NaCl, and 0.4% Triton X-100].
- Enzymes were expressed in E. coli BL21 (DE3) using M9 media containing 15 NH 4 Cl (1.0 g L ⁇ 1 ), Na 2 HPO 4 .7H 2 O (12.66 g L ⁇ 1 ), KH 2 PO 4 (3.0 g L ⁇ 1 ), NaCl (0.5 g L ⁇ 1 ), MgSO 4 (0.2 g CaCl 2 (50 ⁇ M), and glucose (0.3%). Expressions were induced by adding 0.5 mM of isopropyl ⁇ -D-1-thiogalactopyranoside (IPTG) and incubating at 37° C. for 4 hr. The purifications were performed as previously described for the WT PmST1.
- IPTG isopropyl ⁇ -D-1-thiogalactopyranoside
- the purified enzymes were dialyzed with a phosphate buffer (10 mM, pH 7.0).
- NMR samples of 15 N-labeled WT and M144D PmST1 ( ⁇ 0.7 mM) were prepared in 90%/10% of H 2 O/D 2 O containing 10 mM of phosphate (pH 7.0) in the presence or the absence of saturating CMP.
- 15 N- 1 H HSQC NMR experiments were performed at 37° C. on Bruker Avance III 800 spectrometer with an Ultrashield Bruker magnet equipped with a four-channel interface, triple-resonance probe, and cryo-probe with Z-axis pulsed field gradients.
- the number of complex points and acquisition times were: 256, 180 ms ( 15 N (F 1 )); and, 512, 64 ms ( 1 H (F 2 )).
- the NMR spectra were processed and analyzed using the software, NMRPipe.
- the reactions were carried out by incubating the reaction mixture in an incubator shaker at 37° C. for 4-6 h.
- Donor hydrolysis by PmST1 causes low yield sialylation of Le x .
- time course studies were carried out using a fluorescently labeled Le x acceptor (4-methylumbelliferyl 13-Le x or Le x ⁇ MU) in a high performance liquid chromatography (HPLC) assay.
- Le x acceptor 4-methylumbelliferyl 13-Le x or Le x ⁇ MU
- HPLC high performance liquid chromatography
- CMP-Neu5Ac donor substrate hydrolysis activity of PmST1, where water molecules compete with the poor Le x acceptor for the consumption of sugar nucleotide (CMP-Neu5Ac) donor of the sialyltransferase (Error!
- D141A mutation decreased the efficiency of CMP-Neu5Ac hydrolysis activity of PmST1 by 1,000-fold mainly due to the decrease in the turnover number.
- H311A mutation also decreased the CMP-Neu5Ac hydrolysis activity by 16-fold, mainly contributed by a decreased turnover number without affecting the binding affinity significantly.
- M144D mutations decreased the efficiency of donor hydrolysis.
- M144D mutation decreased the efficiency of donor hydrolysis by 20-fold due to a 4.9-fold increase of the K m value and a 4.2-fold decrease of the k cat value.
- M144H mutation caused a less significant 3.3-fold decrease in the efficiency of donor hydrolysis due to a significant 8.7-fold increase in the K m value which is offset by a 2.6-fold increase in the k cat value.
- M144H mutation only decreased the ⁇ 2-3-sialyltransferase activity weakly (1.3-fold) when Lac ⁇ MU was used as an acceptor and increased the efficiency of ⁇ 2-3-sialyltransferase activity by 2.6-fold when Le x 3MU was used as an acceptor.
- PmST1 M144D Mutant has a Decreased ⁇ 2-3-Sialidase Activity.
- M144D and M144H mutations also decreased the ⁇ 2-3-sialidase activity of PmST1 by 5588- and 594-fold respectively when Neu5Ac ⁇ 2-3Lac ⁇ MU was used as the sialidase substrate (Table 6). While the PmST1 M144D mutant showed no sialidase activity when Neu5Ac ⁇ 2-3Le x ⁇ MU was used as the substrate, PmST1 M144H has increased sialidase activity compared to the WT PmST1 using the SLe x substrate.
- the PmST1 M144H mutant cleaved 10.0%, 24.5%, and 34.0% of Neu5Ac from Neu5Ac ⁇ 2-3Le x ⁇ MU in 1 h, 6 h, and 20 h, respectively.
- WT PmST1 removed 2.0%, 7.0%, and 7.5% of Neu5Ac from Neu5Ac ⁇ 2-3Le x ⁇ MU under the same reaction conditions.
- the decreased ⁇ 2-3-sialidase activity by M144D mutation allows the potential application of the PmST1 M144D mutant in sialylation of glycoconjugates containing terminal galactoside or Le x where the decreased ⁇ 2-3-sialidase activity has the most advantages as these reactions are challenging for prompt monitoring.
- PmST1 M144D mutant has a similar expression level as the WT PmST1.
- the PmST1 M144D mutation did not change the enzyme expression level in E. coli .
- About 98 mg of C-His 6 -tagged PmST1 M144D protein can be routinely purified from one liter of E. coli cell culture using Ni 2+ -affinity column (Error! Reference source not found.). This expression level is very similar to that (100 mg) of the WT PmST1 and allows the application of the mutant in preparative and large-scale synthesis of SLe x antigens.
- the structure of the PmST1 M144D mutant with CMP-3F(axiai)-Neu5Ac was determined to 1.45 ⁇ resolution with R factor and R free values of 18.7% and 21.5% respectively Table 3).
- Error! Reference source not found. shows the structural comparison between WT PmST1 and M144D mutant with bound CMP donor. Error! Reference source not found.
- A shows the overall structure of WT PmST1 with CMP bound (white tubes), aligned with the C-terminal domain of the M144D mutant (grey tubes) also with CMP bound (space filled atoms). Error! Reference source not found.
- B shows the stereo view of the superposition near the active site.
- WT PmST1 is shown as white tubes with bound CMP-3F(a)-Neu5Ac (sticks with white carbon bonds) and lactose acceptor (sticks with dark grey carbon bonds).
- the M144D mutant in shown as grey tubes with CMP bound (sticks with light grey carbon bonds). Error! Reference source not found.
- C shows the active site of the ternary crystal structure of PmST1 (PDB 1D: 21HZ) with bound CMP-3F(axial)-Neu5Ac and lactose. The mutation site M144 is underlined.
- the structure resides in the open conformation similar to the wild-type structure with no substrate (rmsd of 0.50 ⁇ for 385 equivalent ⁇ -carbons).
- the M144D structure contains well-ordered electron density in the active site that clearly defines the CMP nucleotide.
- the sialic acid moiety is disordered, likely due to dynamics and/or multiple conformations in the open state of the enzyme.
- the CMP moiety does not bind as deeply into the pocket of the active site as the WT PmST1.
- the base and ribose are situated about 1.5 and 2.0 ⁇ respectively, farther out of the active site compared to the WT PmST1.
- Glu338 forms bidentate hydrogen bond interactions with both the 2′ and 3′ OH of the CMP ribose.
- an ordered water molecule mediates the interaction between the ribose and Glu338.
- the more shallow binding of the donor nucleotide in the M144D structure does not pull down the ⁇ -strand and the ensuing loop that contains Trp270.
- donor-nucleotide binding pulls down a ⁇ -strand causing Trp270 to pop out of the C-terminal domain, where it helps define the acceptor binding site in the sialyltransferase reaction.
- M144D mutant is more efficient than M144H mutant in sialylating Le x .
- the M144D mutation decreased the undesired CMP-Neu5Ac hydrolysis activity significantly (20-fold) without appreciably changing the efficiency of the ⁇ 2-3-sialyltransferase activity when Le x was used as an acceptor.
- M144D showed an overall improved activity in sialylation of Le x for the formation of sialyl Le x (SLe x ) structures.
- M144H mutant which has a 3.3-fold decreased CMP-Neu5Ac hydrolysis activity and 2.6-fold increased ⁇ 2-3-sialyltransferase activity using Le x as an acceptor was less effective for directly sialylating Le x .
- Synthesis of SLe x containing diverse sialic acid forms using PmST1 M144D mutant was demonstrated using an efficient one-pot three-enzyme chemoenzymatic synthetic system (Error! Reference source not found.).
- the system contained PmST1 M144D mutant, an Neisseria meningitidis CMP-sialic acid synthetase (NmCSS), and a Pasteurella multocida sialic acid aldolase.
- ManNAc N-Acetylmannosamine
- Mannose mannose
- their derivatives were used for in situ synthesis of CMP-sialic acids and derivatives.
- Le x trisaccharide used as the sialyltransferase acceptor was synthesized using a one-pot two-enzyme system containing a bifunctional L-fucokinase/GDP-fucose pyrophosphorylase (FKP) cloned from Bacteroides fragilis and a recombinant Helicobacter pylori ⁇ 1-3-fucosyltransferase as shown previously. As shown in Error!
- SLe x tetrasaccharides containing natural sialic acid forms including N-acetylneuraminc acid (Neu5Ac), N-glycolylneuraminic acid (Neu5Gc), 2-keto-3-deoxy-D-glycero-D-galacto-nonulosonic acid (Kdn), as well as 9-O-acetylated Neu5Ac and Neu5Gc were obtained in excellent (85-93%) to good yields (62-64%).
- SLe x containing the 9-O-acetyl sialic acid forms were due to the de-O-acetylation process leading to the formation of non-O-acetylated SLe x oligosaccharides.
- SLe x containing non-natural sialic acid forms including those with an N-azidoacetyl group or an azido group at C-5 or a C-9 azido group were also successfully obtained in excellent yields (84-91%).
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Saccharide Compounds (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The present invention provides mutants of PmST1 for the preparation of sialyl-Lewisx oligosaccharides, and other sialosides with decreased sialidase activity.
Description
- This application is a continuation application of the U.S. application Ser. No. 14/237,334 filed Jun. 18, 2014, now U.S. Pat. No. 9,255,257, the National Stage Entry of PCT Application No. PCT/US2012/049748, filed Aug. 6, 2012, which claims priority to U.S. Provisional Application No. 61/515,702, filed Aug. 5, 2011 which is incorporated in its entirety herein for all purposes.
- This invention was made with Government support under Grant Nos. R01HD065122 awarded by National Institutes of Health. The Government has certain rights in this invention.
- Glycosyltransferase-catalyzed reactions have gained increasing attention and application for the synthesis of complex carbohydrates and glycoconjugates. Most mammalian glycosyltransferases suffer from no or low expression in E. coli systems and more restricted substrate specificity. In comparison, bacterial glycosyltransferases are generally easier to access using E. coli expression systems and have more promiscuous substrate flexibility. Nevertheless, despite the discovery of many bacterial glycosyltransferases which have promiscuities for both donor and acceptor substrates, the application of glycosyltransferases in the synthesis of carbohydrate-containing structures is limited by the availability and the substrate specificity of wild-type enzymes.
- For example, sialyltransferases, the key enzymes that catalyze the transfer of a sialic acid residue from
cytidine 5′-monophosphate-sialic acid (CMP-sialic acid) to an acceptor, have been commonly used for the synthesis of sialic acid-containing structures. Sialyl Lewisx [SLex, Siaα2-3Galβ1-4(Fucα1-3)GlcNAcαOR] is an important carbohydrate epitope involved in inflammation as well as adhesion and metastasis of cancer cells. It is a well-known tumor-associated carbohydrate antigen and has been used as a candidate for cancer vaccine. The biosynthesis of SLex involves the formation of Siaα2-3Galβ1-4GlcNAcαOR catalyzed by an α2-3-sialyltransferase followed by an α1-3-fucosyltransferase-catalyzed fucosylation. This biosynthetic sequence usually cannot be altered as common α2-3-sialyltransferases do not use fucose-containing Lewisx [Lex, Galβ1-4(Fucα1-3)GlcNAcαOR] as a substrate. - As common terminal monosaccharides, sialic acids constitute a family of great structural diversity. So far, more than 50 structurally distinct sialic acid forms have been identified in nature. To obtain SLex with different sialic acid forms to elucidate the biological significance of naturally occurring sialic acid modifications, an efficient enzymatic approach is to use Lex [Galβ1-4(Fucα1-3)GlcNAcαOR] as a fucose-containing acceptor to add different sialic acid forms by a suitable α2-3-sialyltransferase. This process of introducing different forms of sialic acid onto the common fucosylated acceptor Lex in the last step has significant advantages compared to the normal SLex biosynthetic pathway in which fucosylation is the last glycosylation process. It not only simplifies the synthetic scheme as a less number of reactions are needed, but also makes the purification process much easier as negatively charged SLex product is separated from neutral Lex oligosaccharide instead of separating both negatively charged oligosaccharides SLex and non-fucosylated sialosides if fucosylation occurs in the last step.
- We and others have demonstrated that a myxoma virus α2-3-sialyltransferase can use Lex as an acceptor substrate for synthesizing SLex. Nevertheless, the low expression level of the enzyme in E. coli (<0.1 mg L−1 culture) limits its application in preparative and large-scale synthesis of SLex.
- We have previously shown that a multifunctional α2-3-sialyltransferase from Pasteurella multocida (PmST1) has a good expression level in E. coli (100 mg L−1 culture) (J. Am. Chem. Soc. 2005, 127, 17618-17619.). It can use Lex as an acceptor for the synthesis of SLex but the yields are poor (<20%) in spite of different conditions tested. What is needed, therefore, are α2-3-sialyltransferases having good α2-3-sialyltransferase activity with good expression levels, and lowered α2-3-sialidase or donor substrate hydrolysis activity. Surprisingly, the present invention meets this and other needs.
- In some embodiments, the present invention provides an isolated glycosyltransferase, wherein the amino acid of the glycosyltransferase corresponding to
position 120 of SEQ ID NO:1 is any amino acid other than M, the amino acid the glycosyltransferase corresponding to position 247 of SEQ ID NO:1 is any amino acid other than E, or the amino acid the glycosyltransferase corresponding to position 289 of SEQ ID NO:1 is any amino acid other than R. The glycosyltransferase of the present invention has decreased α2-3 sialidase or donor substrate hydrolysis activity compared to a control glycosyltransferase, wherein the amino acid of the control glycosyltransferase corresponding toposition 120 of SEQ ID NO:1 is M, the amino acid of the control glycosyltransferase corresponding to position 247 of SEQ ID NO:1 is E, and the amino acid of the control glycosyltransferase corresponding to position 289 of SEQ ID NO:1 is R. Finally, the glycosyltransferase of the present invention can be a member of the glycosyltransferase family 80 (GT80). - In some embodiments, the present invention provides a recombinant nucleic acid encoding an isolated glycosyltransferase of the present invention.
- In some embodiments, the present invention provides a cell including a recombinant nucleic acid of the present invention.
- In some embodiments, the present invention provide a method of preparing an oligosaccharide, the method including forming a reaction mixture including an acceptor sugar, a donor substrate of a sugar moiety and a nucleotide, and the glycosyltransferase of the present invention, under conditions sufficient to transfer the sugar moiety from the donor substrate to the acceptor sugar, thereby forming the oligosaccharide.
-
FIG. 1A-1B show the ternary crystal structure of PmST1 (PDB ID: 21HZ) with bound CMP-3F(axial)-Neu5Ac and lactose (Figure A) and the structure of the modeled PmST1 double mutant E271F/R313Y of PmST1 wild type sequence SEQ ID NO: 13 (Figure B). The mutation sites are underlined. The mutant structure was obtained from automated homology modeling using Swiss-Model. -
FIG. 2 shows acceptor substrate specificity data for the α2-3-sialyltransferase activity of wild-type PmST1 (white columns) and its double mutant E271F/R313Y of PmST1 wild type sequence SEQ ID No: 13 (black columns). -
FIG. 3 shows thermal stability data for the α2-3-sialyltransferase activity of wild-type PmST1 (white columns) and its double mutant E271F/R313Y of PmST1 wild type sequence SEQ ID No: 13 (black columns). -
FIG. 4 shows HPLC-based time course studies of PmST1-catalyzed α2-3-sialylation of Lewisx trisaccharide (1 mM) with periodical addition of sialyltransferase donor CMP-Neu5Ac (indicated by arrows). Numbers in parentheses represent the % consumption of CMP-Neu5Ac by capillary electrophoresis (CE) assays. -
FIG. 5 illustrates that water (in the donor hydrolysis reaction) competes with Lewis' (in PmST1-catalyzed α2-3-sialylation reaction) for the consumption of CMP-Neu5Ac. -
FIG. 6 shows the SDS-PAGE analysis of the M144D mutant of PmST1 wild type sequence SEQ ID No: 13. Lane 1: Protein marker; Lane 2: Whole cells before induction; Lane 3: Whole cells after induction; Lane 4: Cell lysate; Lane 5: Purified fraction. -
FIG. 7A-7C —show the structural comparison between wild-type (WT) PmST1 and M144D mutant of PmST1 wild type sequence SEQ ID No: 13 with bound CMP.FIG. 7A shows the overall structure alignment of WT PmST1 and the PmST1 M144D mutant, both with CMP bound.FIG. 7B shows the stereo view of the superposition near the active site for WT PmST1 and the M144D mutant with bound CMP-3F(a)-Neu5Ac (a donor substrate analog) and lactose acceptor.FIG. 7C shows the active site of the ternary crystal structure of PmST1 (PDB 1D: 21HZ) with bound CMP-3F(axial)-Neu5Ac and lactose. -
FIG. 8 shows 15N-1H HSQC NMR spectra of 15N-labeled PmST1 (WT versus M144D mutant of PmST1 wild type sequence SEQ ID NO: 13; as well as apo versus CMP-bound). -
FIG. 9 shows the one-pot three-enzyme synthesis of sialyl LexβProN3 (SLexβProN3) containing different forms of sialic acids from LexβProN3. Aldolase refers to Pasteurella multocida sialic acid aldolase, and NmCSS refers to Neisseria meningitidis CMP-sialic acid synthetase. -
FIG. 10 shows amino acid sequences alignment of GT80 sialyltransferases. Pp_Pst3-1 (GenBank accession number BAF63530), Psp_Pst3-2 (GenBank accession number BAF92025), Vsp_2,3ST (GenBank accession number BAF91160), P1ST6_JT-1 (GenBank accession number BAF91416), P1ST6_JT-2, (GenBank accession number BAI49484), Pd2,6ST (GenBank accession number BAA25316), Psp_pst6-1 (GenBank accession number BAF92026), Pm0188Ph (GenBank accession number DQ087233), and Hd0053P (GenBank accession number AAP95068). - The present invention provides alpha2-3 sialyltransferase mutants of PmST1 having reduced alpha2-3 sialidase or donor substrate hydrolysis, useful for the preparation of oligosaccharides, and can be tolerant of fucosylated oligosaccharides. The mutations described herein can be incorporated into a variety of sialyltransferases to produce mutants having reduced sialidase or donor substrate activity.
- As used herein, the term “glycosyltransferase” refers to a polypeptide that catalyzes the formation of a glycoside or an oligosaccharide from a donor substrate and an acceptor or acceptor sugar. In general, a glycosyltransferase catalyzes the transfer of the monosaccharide moiety of the donor substrate to a hydroxyl group of the acceptor. The covalent linkage between the monosaccharide and the acceptor sugar can be a 1-4 linkage, a 1-3 linkage, a 1-6-linkage, a 1-2 linkage, a 2-3-linkage, a 2-6-linkage, a 2-8-linkage, or a 2-9-linkage. The linkage may be in the α- or β-configuration with respect to the anomeric carbon of the monosaccharide. Other types of linkages may be formed by the glycosyltransferases in the methods of the invention. Glycosyltransferases include, but are not limited to, sialyltransferases, heparosan synthases (HSs), glucosaminyltransferases, N-acetylglucosaminyltransferases, glucosyltransferases, glucuronyltransferases, N-acetylgalactosaminyltransferases, galactosyltransferases, galacturonyltransferases, fucosyltransferases, mannosyltransferases, xylosyltransferases. Sialyltransferases are enzymes that catalyze the transfer of sialic acid, or analogs thereof, to a monosaccharide or an oligosaccharide. In some embodiments, the glycosyltransferases useful in the present invention include those in Glycosyltransferase family 80 (GT80 using CAZy nomenclature), and includes beta-galactoside alpha-2,3-sialyltransferases that catalyze the following conversion: CMP-sialic acid+β-D-galactosyl-R=CMP+α-sialic acid-(2→3)-β-D-galactosyl-R, where the acceptor is GalβOR, where R is H, a monosaccharide, an oligosaccharide, a polysaccharide, a glycopeptide, a glycoprotein, or a glycolipid. GT80 family sialyltransferases also include galactoside or N-acetylgalactosaminide alpha-2,6-sialyltransferases that catalyze the following conversion: CMP-sialic acid+galactosyl/GalNAc-R=CMP+α-sialic acid-(2→3)-β-D-galactosyl/GalNAc-R, where the acceptor is GalOR or GalNAcOR, where R is H, serine or threonine on a peptide or protein, a monosaccharide, an oligosaccharide, a polysaccharide, a glycopeptide, a glycoprotein, or a glycolipid.
- “Alpha2-3-sialidase” refers to an enzyme that catalyzes the hydrolysis of alpha2-3-glycosidic linkages of terminal sialic acids on oligosaccharides.
- “Donor substrate hydrolysis” refers to hydrolysis of the nucleotide-sugar bond of the donor substrate.
- An “amino acid” refers to any monomer unit that can be incorporated into a peptide, polypeptide, or protein. As used herein, the term “amino acid” includes the following twenty natural or genetically encoded alpha-amino acids: alanine (Ala or A), arginine (Arg or R), asparagine (Asn or N), aspartic acid (Asp or D), cysteine (Cys or C), glutamine (Gln or Q), glutamic acid (Glu or E), glycine (Gly or G), histidine (His or H), isoleucine (Ile or I), leucine (Leu or L), lysine (Lys or K), methionine (Met or M), phenylalanine (Phe or F), proline (Pro or P), serine (Ser or S), threonine (Thr or T), tryptophan (Trp or W), tyrosine (Tyr or Y), and valine (Val or V). In cases where “X” residues are undefined, these should be defined as “any amino acid.” The structures of these twenty natural amino acids are shown in, e.g., Stryer et al., Biochemistry, 5th ed., Freeman and Company (2002), which is incorporated by reference. Additional amino acids, such as selenocysteine and pyrrolysine, can also be genetically coded for (Stadtman (1996) “Selenocysteine,” Annu Rev Biochem. 65:83-100 and Ibba et al. (2002) “Genetic code: introducing pyrrolysine,” Curr Biol. 12(13):R464-R466, which are both incorporated by reference). The term “amino acid” also includes unnatural amino acids, modified amino acids (e.g., having modified side chains and/or backbones), and amino acid analogs.
- “Polypeptide,” “peptide,” and “protein” are used interchangeably herein to refer to a polymer of amino acid residues. All three terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers and non-naturally occurring amino acid polymers. As used herein, the terms encompass amino acid chains of any length, including full-length proteins, wherein the amino acid residues are linked by covalent peptide bonds.
- The term “mutant,” in the context of glycosyltransferases of the present invention, means a polypeptide, typically recombinant, that comprises one or more amino acid substitutions relative to a corresponding, naturally-occurring or unmodified glycosyltransferase, such as an alpha2-3 sialyltransferase.
- In the context of glycosyltransferases, “corresponding to” another sequence (e.g., regions, fragments, nucleotide or amino acid positions, or the like) is based on the convention of numbering according to nucleotide or amino acid position number and then aligning the sequences in a manner that maximizes the percentage of sequence identity. Because not all positions within a given “corresponding region” need be identical, non-matching positions within a corresponding region may be regarded as “corresponding positions.” Accordingly, as used herein, referral to an “amino acid of the glycosyltransferase corresponding to position [X]” of a specified glycosyltransferase refers to equivalent positions, based on alignment, in other glycosyltransferases and structural homologues and families. In some embodiments of the present invention, “correspondence” of amino acid positions are determined with respect to a region of the glycosyltransferase comprising one or more motifs of SEQ ID NO:1, 13, 15, 17, 19, 21, 23, 25, 27 or 29. When a glycosyltransferase polypeptide sequence differs from SEQ ID NO:1, 13, 15, 17, 19, 21, 23, 25, 27 or 29 (e.g., by changes in amino acids or addition or deletion of amino acids), it may be that a particular mutation associated with improved activity as discussed herein will not be in the same position number as it is in SEQ ID NO:1, 13, 15, 17, 19, 21, 23, 25, 27 or 29.
- A nucleic acid is “operably linked” when it is placed into a functional relationship with another nucleic acid sequence. For example, a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation.
- As used herein, “percent sequence identity” is determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the sequence in the comparison window can comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity.
- The terms “identical” or “identity,” in the context of two or more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that are the same. Sequences are “substantially identical” to each other if they have a specified percentage of nucleotides or amino acid residues that are the same (e.g., at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% identity over a specified region), when compared and aligned for maximum correspondence over a comparison window, or designated region as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. These definitions also refer to the complement of a test sequence. Optionally, the identity exists over a region that is at least about 50 nucleotides in length, or more typically over a region that is 100 to 500 or 1000 or more nucleotides in length.
- The terms “similarity” or “percent similarity,” in the context of two or more polypeptide sequences, refer to two or more sequences or subsequences that have a specified percentage of amino acid residues that are either the same or similar as defined by a conservative amino acid substitutions (e.g., 60% similarity, optionally 65%, 70%, 75%, 80%, 85%, 90%, or 95% similar over a specified region), when compared and aligned for maximum correspondence over a comparison window, or designated region as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. Sequences are “substantially similar” to each other if they are at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, or at least 55% similar to each other. Optionally, this similarly exists over a region that is at least about 50 amino acids in length, or more typically over a region that is at least about 100 to 500 or 1000 or more amino acids in length.
- For sequence comparison, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters are commonly used, or alternative parameters can be designated. The sequence comparison algorithm then calculates the percent sequence identities or similarities for the test sequences relative to the reference sequence, based on the program parameters.
- A “comparison window,” as used herein, includes reference to a segment of any one of the number of contiguous positions selected from the group consisting of from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned. Methods of alignment of sequences for comparison are well known in the art. Optimal alignment of sequences for comparison can be conducted, for example, by the local homology algorithm of Smith and Waterman (Adv. Appl. Math. 2:482, 1970), by the homology alignment algorithm of Needleman and Wunsch (J. Mol. Biol. 48:443, 1970), by the search for similarity method of Pearson and Lipman (Proc. Natl. Acad. Sci. USA 85:2444, 1988), by computerized implementations of these algorithms (e.g., GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by manual alignment and visual inspection (see, e.g., Ausubel et al., Current Protocols in Molecular Biology (1995 supplement)).
- Algorithms suitable for determining percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al. (Nuc. Acids Res. 25:3389-402, 1977), and Altschul et al. (J. Mol. Biol. 215:403-10, 1990), respectively. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always <0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) or 10, M=5, N=−4 and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89:10915, 1989) alignments (B) of 50, expectation (E) of 10, M=5, N=−4, and a comparison of both strands.
- The BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin and Altschul, Proc. Natl. Acad. Sci. USA 90:5873-87, 1993). One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, typically less than about 0.01, and more typically less than about 0.001.
- “Recombinant,” as used herein, refers to an amino acid sequence or a nucleotide sequence that has been intentionally modified by recombinant methods. By the term “recombinant nucleic acid” herein is meant a nucleic acid, originally formed in vitro, in general, by the manipulation of a nucleic acid by endonucleases, in a form not normally found in nature. Thus an isolated, mutant glycosyltransferase nucleic acid, in a linear form, or an expression vector formed in vitro by ligating DNA molecules that are not normally joined, are both considered recombinant for the purposes of this invention. It is understood that once a recombinant nucleic acid is made and reintroduced into a host cell, it will replicate non-recombinantly, i.e., using the in vivo cellular machinery of the host cell rather than in vitro manipulations; however, such nucleic acids, once produced recombinantly, although subsequently replicated non-recombinantly, are still considered recombinant for the purposes of the invention. A “recombinant protein” is a protein made using recombinant techniques, i.e., through the expression of a recombinant nucleic acid as depicted above.
- The term “vector” refers to a piece of DNA, typically double-stranded, which may have inserted into it a piece of foreign DNA. The vector may be, for example, of plasmid origin. Vectors contain “replicon” polynucleotide sequences that facilitate the autonomous replication of the vector in a host cell. Foreign DNA is defined as heterologous DNA, which is DNA not naturally found in the host cell, which, for example, replicates the vector molecule, encodes a selectable or screenable marker, or encodes a transgene. The vector is used to transport the foreign or heterologous DNA into a suitable host cell. Once in the host cell, the vector can replicate independently of or coincidental with the host chromosomal DNA, and several copies of the vector and its inserted DNA can be generated. In addition, the vector can also contain the necessary elements that permit transcription of the inserted DNA into an mRNA molecule or otherwise cause replication of the inserted DNA into multiple copies of RNA. Some expression vectors additionally contain sequence elements adjacent to the inserted DNA that increase the half-life of the expressed mRNA and/or allow translation of the mRNA into a protein molecule. Many molecules of mRNA and polypeptide encoded by the inserted DNA can thus be rapidly synthesized.
- The term “nucleotide,” in addition to referring to the naturally occurring ribonucleotide or deoxyribonucleotide monomers, shall herein be understood to refer to related structural variants thereof, including derivatives and analogs, that are functionally equivalent with respect to the particular context in which the nucleotide is being used (e.g., hybridization to a complementary base), unless the context clearly indicates otherwise.
- The term “nucleic acid” or “polynucleotide” refers to a polymer that can be corresponded to a ribose nucleic acid (RNA) or deoxyribose nucleic acid (DNA) polymer, or an analog thereof. This includes polymers of nucleotides such as RNA and DNA, as well as synthetic forms, modified (e.g., chemically or biochemically modified) forms thereof, and mixed polymers (e.g., including both RNA and DNA subunits). Exemplary modifications include methylation, substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications such as uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoamidates, carbamates, and the like), pendent moieties (e.g., polypeptides), intercalators (e.g., acridine, psoralen, and the like), chelators, alkylators, and modified linkages (e.g., alpha anomeric nucleic acids and the like). Also included are synthetic molecules that mimic polynucleotides in their ability to bind to a designated sequence via hydrogen bonding and other chemical interactions. Typically, the nucleotide monomers are linked via phosphodiester bonds, although synthetic forms of nucleic acids can comprise other linkages (e.g., peptide nucleic acids as described in Nielsen et al. (Science 254:1497-1500, 1991). A nucleic acid can be or can include, e.g., a chromosome or chromosomal segment, a vector (e.g., an expression vector), an expression cassette, a naked DNA or RNA polymer, the product of a polymerase chain reaction (PCR), an oligonucleotide, a probe, and a primer. A nucleic acid can be, e.g., single-stranded, double-stranded, or triple-stranded and is not limited to any particular length. Unless otherwise indicated, a particular nucleic acid sequence comprises or encodes complementary sequences, in addition to any sequence explicitly indicated.
- As used herein, the term “oligosaccharide” refers to a compound containing at least two sugars covalently linked together. Oligosaccharides include disaccharides, trisaccharides, tetrasachharides, pentasaccharides, hexasaccharides, heptasaccharides, octasaccharides, and the like. Covalent linkages generally consist of glycosidic linkages (i.e., C—O—C bonds) formed from the hydroxyl groups of adjacent sugars. Linkages can occur between the 1-carbon and the 4-carbon of adjacent sugars (i.e., a 1-4 linkage), the 1-carbon and the 3-carbon of adjacent sugars (i.e., a 1-3 linkage), the 1-carbon and the 6-carbon of adjacent sugars (i.e., a 1-6 linkage), or the 1-carbon and the 2-carbon of adjacent sugars (i.e., a 1-2 linkage). A sugar can be linked within an oligosaccharide such that the anomeric carbon is in the α- or β-configuration. The oligosaccharides prepared according to the methods of the invention can also include linkages between carbon atoms other than the 1-, 2-, 3-, 4-, and 6-carbons.
- “Acceptor sugar” refers a sugar that accepts the sugar being added. For example, the acceptor sugar can be an oligosaccharide, such as a fucosylated oligosaccharide, that accepts a sialic acid or analog thereof.
- “Donor substrate” refers to a compound having a nucleotide and the sugar that is added to the acceptor, where the sugar and nucleotide are covalently bound together. The sugar can be sialic acid or analogs thereof. The nucleotide can be any suitable nucleotide such as cytidine monophosphate (CMP).
- “Sialic acid aldolase” refers to an aldolase that prepares sialic acid using pyruvate and N-acetyl mannose (ManNAc).
- The present invention includes a variety of sialyltransferases with reduced sialidase and/or donor substrate hydrolysis activity. Sialyltransferases are one class of glycosyltransferases, enzymes that catalyze the transfer of a sugar from a nucleotide-sugar complex (donor substrate) to an acceptor, a mono, di or oligosaccharide. Sialyltransferases catalyze the transfer of N-acetylneuraminic acid, and analogs thereof, from a sialic acid-nucleotide complex, the donor substrate, to the terminal sugar of the acceptor which can be a monosaccharide, an oligosaccharide, a glycolipid, a glycopeptide, or a glycoprotein. Representative sialyltransferases include, but are not limited to, sialyltransferases in family EC 2.4.99, such as beta-galactosamide alpha-2,6-sialyltransferase (EC 2.4.99.1), alpha-N-acetylgalactosaminide alpha-2,6-sialyltransferase (EC 2.4.99.3), beta-galactoside alpha-2,3-sialyltransferase (EC 2.4.99.4), N-acetyllactosaminide alpha-2,3-sialyltransferase (EC 2.4.99.6), alpha-N-acetyl-neuraminide alpha-2,8-sialyltransferase (EC 2.4.99.8); lactosylceramide alpha-2,3-sialyltransferase (EC 2.4.99.9). The sialyltransferases of the present invention also include those of the CAZy GT80 family, or EC 2.4.99.4, drawn to alpha2-3 and alpha2-6 sialyltransferases, as well as sialyltransferases in the GT29, GT30, GT38, GT42, GT52, and GT73 families. Representative GT80 sialyltransferases include, but are not limited to, PmST1, Psp26ST, Vsp23ST, Pd26ST, P1ST6 JT-1, P1ST6 JT-2, Pp Pst3-1, Pp Pst3-2, Np23ST and Hd0053. (See Glycobiology 201, 21(6), 716; J. Mol. Biol. 2003, 328, 307; Annu. Rev. Biochem. 2008, 77, 521; Appl. Microbiol. Biotechnol. 2012, 94, 887 for review of sialyltransferases.)
- The glycosyltransferases of the present invention include those having decreased α2-3 sialidase or donor substrate hydrolysis activity compared to a control glycosyltransferase. α2-3 sialidase activity refers to the back reaction starting from the product oligosaccharide, cleaving the glycosidic bond between the sugar from the donor substrate and the acceptor, resulting in the donor substrate and the acceptor.
- In some embodiments, the glycosyltransferase can be an α2-3-sialyltransferase. The α2-3-sialyltransferases of the present invention can include sialyltransferases of Pasteurella multocida. In some embodiments, the glycosyltransferases of the present invention can have a motif in the sialyltransferase domain including at least one of sialyltransferase motif A (YDDGS, corresponding to positions 139-143 of PmST1 wild type, SEQ ID NO:13) and sialyltransferase motif B (KGH, corresponding to positions 309-311 of PmST1 wild type, SEQ ID NO:13).
- The glycosyltransferases of the present invention can include a polypeptide having any suitable percent identity to the control sequence. For example, the glycosyltransferases of the present invention can include a polypeptide having a percent sequence identity to the control glycosyltransferase sequence of at least 20, 30, 40, 50, 55, 60, 65, 70, 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98 or at least 99%. In some embodiments, percent sequence identity can be at least 80%. In some embodiments, percent sequence identity can be at least 90%. In some embodiments, percent sequence identity can be at least 95%. In some embodiments, the glycosyltransferase includes a polypeptide sequence having at least 80% sequence identity to SEQ ID NO:1.
- In some embodiments, the isolated glycosyltransferase includes a polypeptide sequence of SEQ ID NO:3 (M120D), SEQ ID NO:5 (M120H), SEQ ID NO:7 (E247F), SEQ ID NO: 9 (R289Y) or SEQ ID NO: 11 (E247F/R289Y).
- The precise length of glycosyltransferases can vary, so the precise amino acid positions corresponding to each mutation can vary depending on the particular control glycosyltransferase used. Amino acid and nucleic acid sequence alignment programs are readily available (see, e.g., those referred to supra) and, given the particular motifs identified herein, serve to assist in the identification of the exact amino acids (and corresponding codons) for modification in accordance with the present invention. The positions of several mutations are shown in the table below for the PmST1 wild type sequence (SEQ ID NO:13) and the Δ24PmST1 (SEQ ID NO:1) sequence.
-
PmST1 wild type Δ24PmST1 Mutation (SEQ ID NO: 13) (SEQ ID No: 1) 1 M144D M120D 2 M144H M120H 3 E271F E247F 4 R313Y R289Y 5 E271F/R313Y E247F/R289Y - The above table illustrates “correspondence” of an amino acid position to a different sequence. For example, amino acid position 144 in the PmST1 wild type sequence (SEQ ID NO:13) corresponds to position 120 of the Δ24PmST1 sequence (SEQ ID NO: 1).
- The control glycosyltransferases of the present invention includes any suitable glycosyltransferase or sialyltransferase. The glycosyltransferases of the present invention includes mutants corresponding to any position of PmST1 wild type sequence (SEQ ID NO:13) and Δ24PmST1 (SEQ ID NO:1) (see Biochemistry 2006, 45(7), 2139, and 2007, 46(21), 6288). For example, the glycosyltransferases of the present invention include, but are not limited to, mutants at at least one of
positions 120, 247 and 289 of Δ24PmST1 (SEQ ID NO:1). Other glycosyltransferases include mutants at at least one ofpositions 144, 271 and 313 of PmST1 wild type sequence (SEQ ID NO:13). The mutants can include any suitable amino acid other than the native amino acid. For example, the amino acid can be V, I, L, M, F, W, P, S, T, A, G, C, Y, N, Q, D, E, K, R, or H. In some embodiments, the control glycosyltransferase can be the PmST1 wild type sequence (SEQ ID NO:13) or the Δ24PmST1 (SEQ ID NO:1). In some embodiments, the control glycosyltransferase can be Δ24PmST1 (SEQ ID NO:1). - In some embodiments, the present invention provides an isolated glycosyltransferase, wherein the amino acid of the glycosyltransferase corresponding to position 120 of SEQ ID NO:1 is any amino acid other than M, the amino acid the glycosyltransferase corresponding to position 247 of SEQ ID NO:1 is any amino acid other than E, or the amino acid the glycosyltransferase corresponding to position 289 of SEQ ID NO:1 is any amino acid other than R. The glycosyltransferase of the present invention has decreased α2-3 sialidase or donor substrate hydrolysis activity compared to a control glycosyltransferase, wherein the amino acid of the control glycosyltransferase corresponding to position 120 of SEQ ID NO:1 is M, the amino acid of the control glycosyltransferase corresponding to position 247 of SEQ ID NO:1 is E, and the amino acid of the control glycosyltransferase corresponding to position 289 of SEQ ID NO:1 is R. Finally, the glycosyltransferase of the present invention can be a member of the glycosyltransferase family 80 (GT80).
- In some embodiments, the isolated glycosyltransferase has decreased α2-3 sialidase activity, and includes at least one of the amino acid corresponding to position 247 of SEQ ID NO:1 is any amino acid other than E, and the amino acid corresponding to position 289 of SEQ ID NO:1 is any amino acid other than R. Decreased α2-3 sialidase activity can be measured by the ratio of α2-3 sialidase activity for the control glycosyltransferase to the α2-3 sialidase activity of the isolated glycosyltransferase. The ratio can be at least 2:1, 3:1, 4:1, 5:1, 10:1, 20:1, 30:1, 40:1, 50:1, 100:1, 200:1, 300:1, 400:1, 500:1 or at least 1000:1. In some embodiments, the ratio is at least 5:1. In some embodiments, the ratio is at least 10:1. In some embodiments, the ratio is at least 100:1. In some embodiments, the ratio is at least 1000:1.
- In some embodiments, the isolated glycosyltransferase having decreased α2-3 sialidase activity includes the amino acid corresponding to position 247 of SEQ ID NO:1 is any amino acid other than E, and the amino acid corresponding to position 289 of SEQ ID NO:1 is any amino acid other than R.
- In some embodiments, the isolated glycosyltransferase having decreased α2-3 sialidase activity includes the amino acid corresponding to position 117 of SEQ ID NO:1 is D or E. In some embodiments, the isolated glycosyltransferase having decreased α2-3 sialidase activity includes the amino acid corresponding to position 117 of SEQ ID NO:1 is A, G, V, L or I. In some embodiments, the isolated glycosyltransferase having decreased α2-3 sialidase activity includes the amino acid corresponding to position 287 of SEQ ID NO:1 is H, K, R, W or F.
- Other glycosyltransferases of the present invention have decreased donor substrate hydrolysis activity. Decreased donor substrate hydrolysis activity can be measured by the ratio of donor substrate hydrolysis activity for the control glycosyltransferase to the donor substrate hydrolysis activity of the isolated glycosyltransferase. The ratio can be at least 2:1, 3:1, 4:1, 5:1, 6:1, 7:1, 8:1, 9:1 or at least 10:1. In some embodiments, the isolated glycosyltransferase has decreased donor substrate hydrolysis activity, wherein the amino acid corresponding to position 120 of SEQ ID NO:1 is any amino acid other than M. In some embodiments, the ratio of donor substrate hydrolysis activity for the control α2-3 sialidase to the donor substrate hydrolysis activity of the isolated glycosyltransferase is at least 2:1.
- In some embodiments, the amino acid corresponding to position 120 of SEQ ID NO:1 can be any amino acid of V, I, L, F, W, P, S, T, A, G, C, Y, N, Q, D, E, K, R, or H. In some embodiments, the amino acid corresponding to position 120 of SEQ ID NO:1 can be any amino acid of D, E, H, K or R. In some embodiments, the amino acid corresponding to position 120 of SEQ ID NO:1 can be any amino acid of D or H. In some embodiments, the amino acid corresponding to position 120 of SEQ ID NO:1 can be amino acid D. In some embodiments, the amino acid corresponding to position 120 of SEQ ID NO:1 can be amino acid H.
- In some embodiments, the amino acid corresponding to position 247 of SEQ ID NO:1 can be any amino acid of V, I, L, M, F, W, P, S, T, A, G, C, Y, N, Q, D, K, R, or H. In some embodiments, the amino acid corresponding to position 247 of SEQ ID NO:1 can be any amino acid of F, Y or W. In some embodiments, the amino acid corresponding to position 247 of SEQ ID NO:1 can be amino acid F.
- In some embodiments, the amino acid corresponding to position 289 of SEQ ID NO:1 can be any amino acid of V, I, L, M, F, W, P, S, T, A, G, C, Y, N, Q, D, E, K, and H. In some embodiments, the amino acid corresponding to position 289 of SEQ ID NO:1 can be any amino acid of Y, F or W. In some embodiments, the amino acid corresponding to position 289 of SEQ ID NO:1 can be amino acid Y.
- The glycosyltransferases of the present invention can have one or more mutations. In some embodiments, the glycosyltransferase includes the amino acid corresponding to position 247 of SEQ ID NO:1 can be any amino acid of F, Y or W, and the amino acid corresponding to position 289 of SEQ ID NO:1 can be any amino acid of Y, F or W. In some embodiments, the amino acid corresponding to position 247 of SEQ ID NO:1 can be amino acid F, and the amino acid corresponding to position 289 of SEQ ID NO:1 can be amino acid Y.
- In some embodiments, the isolated glycosyltransferase can be the amino acid corresponding to position 120 of SEQ ID NO:1 is D, E, H, K or R, the amino acid corresponding to position 247 of SEQ ID NO:1 is F, Y or W, or the amino acid corresponding to position 289 of SEQ ID NO:1 is Y, F or W. In some embodiments, the isolated glycosyltransferase can be the amino acid corresponding to position 120 of SEQ ID NO:1 is D or H, the amino acid corresponding to position 247 of SEQ ID NO:1 is F, or the amino acid corresponding to position 289 of SEQ ID NO:1 is Y.
- The glycosyltransferases of the present invention can be constructed by mutating the DNA sequences that encode the corresponding unmodified glycosyltransferase (e.g., a wild-type glycosyltransferase or a corresponding variant from which the glycosyltransferase of the invention is derived), such as by using techniques commonly referred to as site-directed mutagenesis. Nucleic acid molecules encoding the unmodified form of the glycosyltransferase can be mutated by a variety of techniques well-known to one of ordinary skill in the art. (See, e.g., PCR Strategies (M. A. Innis, D. H. Gelfand, and J. J. Sninsky eds., 1995, Academic Press, San Diego, Calif.) at Chapter 14; PCR Protocols: A Guide to Methods and Applications (M. A. Innis, D. H. Gelfand, J. J. Sninsky, and T. J. White eds., Academic Press, N Y, 1990).
- By way of non-limiting example, the two primer system, utilized in the Transformer Site-Directed Mutagenesis kit from Clontech, may be employed for introducing site-directed mutants into a polynucleotide encoding an unmodified form of the glycosyltransferase. Following denaturation of the target plasmid in this system, two primers are simultaneously annealed to the plasmid; one of these primers contains the desired site-directed mutation, the other contains a mutation at another point in the plasmid resulting in elimination of a restriction site. Second strand synthesis is then carried out, tightly linking these two mutations, and the resulting plasmids are transformed into a mutS strain of E. coli. Plasmid DNA is isolated from the transformed bacteria, restricted with the relevant restriction enzyme (thereby linearizing the unmutated plasmids), and then retransformed into E. coli. This system allows for generation of mutations directly in an expression plasmid, without the necessity of subcloning or generation of single-stranded phagemids. The tight linkage of the two mutations and the subsequent linearization of unmutated plasmids result in high mutation efficiency and allow minimal screening. Following synthesis of the initial restriction site primer, this method requires the use of only one new primer type per mutation site. Rather than prepare each positional mutant separately, a set of “designed degenerate” oligonucleotide primers can be synthesized in order to introduce all of the desired mutations at a given site simultaneously. Transformants can be screened by sequencing the plasmid DNA through the mutagenized region to identify and sort mutant clones. Each mutant DNA can then be restricted and analyzed by electrophoresis, such as for example, on a Mutation Detection Enhancement gel (Mallinckrodt Baker, Inc., Phillipsburg, N.J.) to confirm that no other alterations in the sequence have occurred (by band shift comparison to the unmutagenized control). Alternatively, the entire DNA region can be sequenced to confirm that no additional mutational events have occurred outside of the targeted region.
- Verified mutant duplexes in pET (or other) overexpression vectors can be employed to transform E. coli such as, e.g., strain E. coli BL21 (DE3) pLysS, for high level production of the mutant protein, and purification by standard protocols. The method of FAB-MS mapping, for example, can be employed to rapidly check the fidelity of mutant expression. This technique provides for sequencing segments throughout the whole protein and provides the necessary confidence in the sequence assignment. In a mapping experiment of this type, protein is digested with a protease (the choice will depend on the specific region to be modified since this segment is of prime interest and the remaining map should be identical to the map of unmutated protein). The set of cleavage fragments is fractionated by, for example, microbore HPLC (reversed phase or ion exchange, again depending on the specific region to be modified) to provide several peptides in each fraction, and the molecular weights of the peptides are determined by standard methods, such as FAB-MS. The determined mass of each fragment are then compared to the molecular weights of peptides expected from the digestion of the predicted sequence, and the correctness of the sequence quickly ascertained. Since this mutagenesis approach to protein modification is directed, sequencing of the altered peptide should not be necessary if the MS data agrees with prediction. If necessary to verify a changed residue, CAD-tandem MS/MS can be employed to sequence the peptides of the mixture in question, or the target peptide can be purified for subtractive Edman degradation or carboxypeptidase Y digestion depending on the location of the modification.
- Mutant glycosyltransferases with at least one amino acid substituted can be generated in various ways. In the case of amino acids located close together in the polypeptide chain, they may be mutated simultaneously using one oligonucleotide that codes for all of the desired amino acid substitutions. If however, the amino acids are located some distance from each other (separated by more than ten amino acids, for example) it is more difficult to generate a single oligonucleotide that encodes all of the desired changes. Instead, one of two alternative methods may be employed. In the first method, a separate oligonucleotide is generated for each amino acid to be substituted. The oligonucleotides are then annealed to the single-stranded template DNA simultaneously, and the second strand of DNA that is synthesized from the template will encode all of the desired amino acid substitutions. An alternative method involves two or more rounds of mutagenesis to produce the desired mutant. The first round is as described for the single mutants: DNA encoding the unmodified glycosyltransferase is used for the template, an oligonucleotide encoding the first desired amino acid substitution(s) is annealed to this template, and the heteroduplex DNA molecule is then generated. The second round of mutagenesis utilizes the mutated DNA produced in the first round of mutagenesis as the template. Thus, this template already contains one or more mutations. The oligonucleotide encoding the additional desired amino acid substitution(s) is then annealed to this template, and the resulting strand of DNA now encodes mutations from both the first and second rounds of mutagenesis. This resultant DNA can be used as a template in a third round of mutagenesis, and so on. Alternatively, the multi-site mutagenesis method of Seyfang & Jin (Anal. Biochem. 324:285-291. 2004) may be utilized.
- Accordingly, also provided are recombinant nucleic acids, optionally isolated, encoding any of the glycosyltransferases of the present invention (e.g., glycosyltransferases comprising any of SEQ ID NOs:4, 6, 8, 10 and 12). Using a nucleic acid of the present invention, encoding a glycosyltransferase of the invention, a variety of vectors can be made. Any vector containing replicon and control sequences that are derived from a species compatible with the host cell can be used in the practice of the invention. Generally, expression vectors include transcriptional and translational regulatory nucleic acid regions operably linked to the nucleic acid encoding the mutant glycosyltransferase. The term “control sequences” refers to DNA sequences necessary for the expression of an operably linked coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, for example, include a promoter, optionally an operator sequence, and a ribosome binding site. In addition, the vector may contain a Positive Retroregulatory Element (PRE) to enhance the half-life of the transcribed mRNA (see Gelfand et al. U.S. Pat. No. 4,666,848). The transcriptional and translational regulatory nucleic acid regions will generally be appropriate to the host cell used to express the glycosyltransferase. Numerous types of appropriate expression vectors, and suitable regulatory sequences are known in the art for a variety of host cells. In general, the transcriptional and translational regulatory sequences may include, e.g., promoter sequences, ribosomal binding sites, transcriptional start and stop sequences, translational start and stop sequences, and enhancer or activator sequences. In typical embodiments, the regulatory sequences include a promoter and transcriptional start and stop sequences. Vectors also typically include a polylinker region containing several restriction sites for insertion of foreign DNA. In certain embodiments, “fusion flags” are used to facilitate purification and, if desired, subsequent removal of tag/flag sequence, e.g., “His-Tag”. However, these are generally unnecessary when purifying an thermoactive and/or thermostable protein from a mesophilic host (e.g., E. coli) where a “heat-step” may be employed. The construction of suitable vectors containing DNA encoding replication sequences, regulatory sequences, phenotypic selection genes, and the mutant glycosyltransferase of interest are prepared using standard recombinant DNA procedures. Isolated plasmids, viral vectors, and DNA fragments are cleaved, tailored, and ligated together in a specific order to generate the desired vectors, as is well-known in the art (see, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, New York, N.Y., 2nd ed. 1989)). In some embodiments, the present invention provides a recombinant nucleic acid encoding an isolated glycosyltransferase of the present invention.
- In certain embodiments, the expression vector contains a selectable marker gene to allow the selection of transformed host cells. Selection genes are well known in the art and will vary with the host cell used. Suitable selection genes can include, for example, genes coding for ampicillin and/or tetracycline resistance, which enables cells transformed with these vectors to grow in the presence of these antibiotics.
- In one aspect of the present invention, a nucleic acid encoding a glycosyltransferase of the invention is introduced into a cell, either alone or in combination with a vector. By “introduced into” or grammatical equivalents herein is meant that the nucleic acids enter the cells in a manner suitable for subsequent integration, amplification, and/or expression of the nucleic acid. The method of introduction is largely dictated by the targeted cell type. Exemplary methods include CaPO4 precipitation, liposome fusion, LIPOFECTIN®, electroporation, viral infection, and the like.
- In some embodiments, prokaryotes are used as host cells for the initial cloning steps of the present invention. Other host cells include, but are not limited to, eukaryotic (e.g., mammalian, plant and insect cells), or prokaryotic (bacterial) cells. Exemplary host cells include, but are not limited to, Escherichia coli, Saccharomyces cerevisiae, Pichia pastoris, Sf9 insect cells, and CHO cells. They are particularly useful for rapid production of large amounts of DNA, for production of single-stranded DNA templates used for site-directed mutagenesis, for screening many mutants simultaneously, and for DNA sequencing of the mutants generated. Suitable prokaryotic host cells include E. coli K12 strain 94 (ATCC No. 31,446), E. coli strain W3110 (ATCC No. 27,325), E. coli K12 strain DG116 (ATCC No. 53,606), E. coli X1776 (ATCC No. 31,537), and E. coli B; however many other strains of E. coli, such as HB101, JM101, NM522, NM538, NM539, and many other species and genera of prokaryotes including bacilli such as Bacillus subtilis, other enterobacteriaceae such as Salmonella typhimurium or Serratia marcesans, and various Pseudomonas species can all be used as hosts. Prokaryotic host cells or other host cells with rigid cell walls are typically transformed using the calcium chloride method as described in section 1.82 of Sambrook et al., supra. Alternatively, electroporation can be used for transformation of these cells. Prokaryote transformation techniques are set forth in, for example Dower, in Genetic Engineering, Principles and Methods 12:275-296 (Plenum Publishing Corp., 1990); Hanahan et al., Meth. Enzymol., 204:63, 1991. Plasmids typically used for transformation of E. coli include pBR322, pUCI8, pUCI9, pUCI18, pUC119, and Bluescript M13, all of which are described in sections 1.12-1.20 of Sambrook et al., supra. However, many other suitable vectors are available as well.
- In some embodiments, the glycosyltransferases of the present invention are produced by culturing a host cell transformed with an expression vector containing a nucleic acid encoding the glycosyltransferase, under the appropriate conditions to induce or cause expression of the glycosyltransferase. Methods of culturing transformed host cells under conditions suitable for protein expression are well-known in the art (see, e.g., Sambrook et al., supra). Suitable host cells for production of the glycosyltransferases from lambda pL promoter-containing plasmid vectors include E. coli strain DG116 (ATCC No. 53606) (see U.S. Pat. No. 5,079,352 and Lawyer, F. C. et al., PCR Methods and Applications 2:275-87, 1993, which are both incorporated herein by reference). Following expression, the glycosyltransferase can be harvested and isolated. Methods for purifying the thermostable glycosyltransferase are described in, for example, Lawyer et al., supra. In some embodiments, the present invention provides a cell including a recombinant nucleic acid of the present invention. In some embodiments, the cell can be prokaryotes, eukaryotes, mammalian, plant, bacteria or insect cells.
- The glycosyltransferases of the present invention can be used to prepare oligosaccharides, specifically to add N-acetylneuraminic acid (Neu5Ac), other sialic acids, and analogs thereof, to a monosaccharide, an oligosaccharide, a glycolipid, a glycopeptide, or a glycoprotein. As shown in
FIG. 5 , the glycosyltransferase PmST1, catalyzes the addition of CMP-Neu5Ac to a fucosylated oligosaccharide by transferring the Neu5Ac to the oligosaccharide. - In some embodiments, the present invention provides a method of preparing an oligosaccharide, the method including forming a reaction mixture including an acceptor sugar, a donor substrate containing a sugar moiety and a nucleotide, and the glycosyltransferase of the present invention, under conditions sufficient to transfer the sugar moiety from the donor substrate to the acceptor sugar, thereby forming the oligosaccharide.
- The acceptor sugar can be any suitable oligosaccharide, glycolipid, glycopeptide, or glycoprotein. When the acceptor sugar is an oligosaccharide, any suitable oligosaccharide can be used. For example, the acceptor sugar can be Galβ1-4GlcNAcαOR, wherein R can H, a sugar or an oligosaccharide. Alternatively, the acceptor sugar can be fucosylated, such as Galβ1-4(Fucα1-3)GlcNAcαOR (LewisxβOR or LexβOR) wherein R can H, a sugar or an oligosaccharide.
- The donor substrate includes a nucleotide and sugar. Any nucleotide can be used, include, but are not limited to, adenine, guanine, cytosine, uracil and thymine nucleotides with one, two or three phosphate groups. In some embodiments, the nucleotide can be cytidine monophosphate (CMP). The sugar can be any suitable sugar. When the glycosyltransferase is a sialyltransferase, the sugar can be N-acetylneuraminic acid or Neu5Ac, other sialic acids and analogs thereof. Sialic acid is a general term for N- and O-substituted derivatives of neuraminic acid, and includes, but is not limited to, N-acetyl (Neu5Ac) or N-glycolyl (Neu5Gc) substitutions, as well as O-substitutions including acetyl, lactyl, methyl, sulfate and phosphate, among others. In some embodiments, the sialic acid can be a compound of the formula:
- wherein R1 can be H, OH, N3, NHC(O)Me, NHC(O)CH2OH, NHC(O)CH2N3, NHC(O)OCH2C═CH2, NHC(O)CH2F, NHC(O)CH2NHCbz, NHC(O)CH2OC(O)Me, or NHC(O)CH2OBn; and R2, R3, and R4 can be independently selected from H, OH, N3, OMe, F, OSO3 −, OPO3H−, or OC(O)Me. In some embodiments, the donor substrate can be CMP-Neu5Ac. Other donor substrates are useful in the methods of the present invention. In other embodiments, the sialic acid can be a compound of the formula:
- Any glycosyltransferase of the present invention can be used in the methods of the present invention. In some embodiments, the glycosyltransferase can include a polypeptide sequence such as SEQ ID NO:3. (M120D), SEQ ID NO:5 (M120H), SEQ ID NO:7 (E247F), SEQ ID NO:9 (R289Y) or SEQ ID NO:11 (E247F/R289Y). In some embodiments, the glycosyltransferase can include a polypeptide sequence such as SEQ ID NO:3 (M120D) or SEQ ID NO:5 (M120H). In some embodiments, the glycosyltransferase can include a polypeptide sequence such as SEQ ID NO:7 (E247F), SEQ ID NO:9 (R289Y) or SEQ ID NO:11 (E247F/R289Y). The glycosyltransferases can be, for example, purified, secreted by a cell present in the reaction mixture, or can catalyze the reaction within a cell expressing the glycosyltransferase.
- In another aspect of the present invention, reaction mixtures are provided comprising the glycosyltransferases as described herein. The reaction mixtures can further comprise reagents for use in glycosylation techniques. For example, in certain embodiments, the reaction mixtures comprise a buffer, salts (e.g., Mn2+, Mg2+), and labels (e.g., fluorophores).
- The donor substrate can be prepared prior to preparation of the oligosaccharide, or prepared in situ immediately prior to preparation of the oligosaccharide. In some embodiments, the method of the present invention also includes forming a reaction mixture including a CMP-sialic acid synthetase, cytidine triphosphate, and N-acetylneuraminic acid (Neu5Ac) or a Neu5Ac analog, under conditions suitable to form the CMP-Neu5Ac or CMP-Neu5Ac analog. In some embodiments, the step of forming the donor substrate and the step of forming the oligosaccharide are performed in one pot.
- In some embodiments, the sugar is prepared separately prior to use in the methods of the present invention. Alternatively, the sugar can be prepared in situ immediately prior to use in the methods of the present invention. In some embodiments, the method also includes forming a reaction mixture including a sialic acid aldolase, pyruvic acid or derivatives thereof, and N-acetylmannosamine or derivatives thereof, under conditions suitable to form the Neu5Ac or Neu5Ac analog. In some embodiments, the step of forming the sugar, the step of forming the donor substrate and the step of forming the oligosaccharide are performed in one pot.
- The oligosaccharide prepared by the method of the present invention can be any suitable oligosaccharide, glycolipid or glycoprotein. For example, the oligosaccharide can be an α2-3-linked sialyloligosaccharide. In some embodiments, the oligosaccharide can be a fucosylated oligosaccharide. In some embodiments, the oligosaccharide can be Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcαOR (Sia-LewisxβOR or SLexβOR) wherein R can be H, a monosaccharide, or an oligosaccharide. In some embodiments, the oligosaccharide can be Neu5Acα2-3Galβ1-4GlcNAcαOR, wherein R can H, a monosaccharide, or an oligosaccharide.
- Materials.
- Escherichia coli BL21 (DE3) was from Invitrogen (Carlsbad, Calif., USA). Ni2+-NTA agarose (nickel-nitrilotriacetic acid agarose) and QIAprep spin miniprep kit were from Qiagen (Valencia, Calif., USA). Bicinchoninic acid (BCA) protein assay kit was from Pierce Biotechnology, Inc. (Rockford, Ill.). QuikChange Multi Site-Directed Mutagenesis Kit was from Agilent Technologies company/Stratagene (Santa Clara, Calif.).
- Site-Directed Mutagenesis.
- Site-directed mutagenesis was carried out using the QuikChange multi-site-directed mutagenesis kit from Stratagene according to the manufacturer's protocol. The primers used were 5′ACCGGCACGACAACTTGGTTTGGAAATACCGATGTGCG3′ for E271F and 5′ ATCTACTTTAAAGGGCATCCTTATGGTGGTGAAATTAATGACTAC3′ for R313Y. The sites of mutations are underlined.
- Protein Expression and Purification.
- The plasmids containing the mutant genes were transformed into E. coli BL21 (DE3). The E. coli cells were cultured in LB-rich media (10 g L−1 tryptone, 5 g L yeast extract, and 10 g L−1NaCl) supplemented with ampicillin (100 μg mL−1). Overexpression of the mutant proteins was achieved by adding 0.1 mM of isopropyl-1-thio-β-D-galactopyranoside (IPTG) to the E. coli culture when its OD600=reached 0.8. The incubation of the induced culture was performed at 37° C. for 3 h with vigorous shaking at 250 rpm in a C25KC incubator shaker (New Brunswick Scientific, Edison, N.J.).
- His6-tagged mutant proteins were purified from the cell lysate. To obtain the cell lysate, the cell pellet harvested by centrifugation at 4000 rpm for 2 h was resuspended in 20 mL (for cells obtained from one liter culture) of lysis buffer (pH 8.0, 100 mM Tris-HCl containing 0.1% Triton X-100). To lyse the cells, lysozyme (50 μg mL−1) and DNaseI (3 μg mL−1) were then added to the resuspended cells followed by shaking at 37° C. for 60 min. The cell lysate was obtained as the supernatant after centrifugation at 11,000 rpm for 20 min. Purification of His6-tagged proteins from the lysate was achieved using an ÄKTA FPLC system (GE Healthcare) equipped with a
HisTrap™ FF 5 mL column. The column was pre-equilibrated with 8 column volumes of the binding buffer (5 mM imidazole, 0.5 M NaCl, 50 mM Tris-HCl pH 7.5) prior to lysate loading. After the sample loading, the column was washed with 8 column volumes of the binding and washing buffer (40 mM imidazole, 0.5 M NaCl, 50 mM Tris-HCl pH 7.5). Protein elution was carried out with 8 column volumes of the elute buffer (200 mM imidazole, 0.5 M NaCl, 50 mM Tris-HCl pH 7.5). The fractions containing the purified enzyme were collected and stored at 4° C. - Kinetic assays. The kinetic assays for the sialidase activity were performed in duplicate in a total volume of 10 μL in MES buffer (100 mM, pH 5.5) containing different concentrations of Neu5Acα2-3LacβMU (0.4, 1.0, 2.0, 4.0, 10.0, 20.0, 40.0, and 60.0 mM) and the mutant proteins (2.5 mg mL−1 of D141A, 1.6 mg mL−1 of E271F, 1 mg mL−1 of R313Y, and 3.2 mg mL−1 of E271F/R313Y). All reactions were allowed to proceed at 37° C. for 60 min (D141A), 1 min (E271F), 25 min (R313Y), and 20 min (E271F/R313Y). The apparent kinetic parameters were obtained by fitting the experimental data (the average values of duplicate assay results) into the Michaelis-Menten equation using Grafit 5.0.
- To obtain the apparent kinetic parameters of LacβMU as the acceptor for the α2-3-sialyltransferase activity, the kinetic assays were performed in duplicate in reaction mixtures of 10 μL containing Tris-HCl buffer (100 mM, pH 8.5), a fixed concentration of CMP-Neu5Ac (1 mM), different concentrations of LacβMU (0.2, 0.5, 1.0, 2.0, 5.0, and 9.0 mM) and the mutant proteins (2 μg mL−1 of E271F, 2 μg mL−1 of R313Y, and 1.6 μg mL−1 of E271F/R313Y). All reactions were allowed to proceed at 37° C. for 5 min (E271F), 7 min (R313Y), and 10 min (E271F/R313Y). The apparent kinetic parameters were obtained by fitting the experimental data (the average values of duplicate assay results) into the Michaelis-Menten equation using Grafit 5.0.
- To obtain the apparent kinetic parameters of CMP-Neu5Ac as the donor for the α2-3-sialyltransferase activity, the kinetic assays were performed in duplicate in reaction mixtures of 10 μL containing Tris-HCl buffer (100 mM, pH 8.5), a fixed concentration of LacβMU (1 mM), different concentrations of CMP-Neu5Ac (0.1, 0.2, 0.5, 1.0, 2.0, 5.0, 10.0 and 20.0 mM) and the mutant proteins (2 μg mL−1 of E271F, 2 μg mL−1 of R313Y, and 1.6 μg mL−1 of E271F/R313Y). All reactions were allowed to proceed at 37° C. for 2 min (E271F), 7 min (R313Y), and 5 min (E271F/R313Y). The apparent kinetic parameters were obtained by fitting the experimental data (the average values of duplicate assay results) into the Michaelis-Menten equation using Grafit 5.0.
- All the sialidase and α2-3-sialyltransferase assays were performed in an HPLC system. Reactions were stopped by adding 10 μL of ethanol. After necessary dilutions were performed to adjust the concentrations of the fluorescent-labeled compounds, the samples were then kept on ice until aliquots of 8 μL were injected and analyzed by a Shimadzu LC-6AD system equipped with a membrane on-line degasser, a temperature control unit, and a fluorescence detector (Shimadzu RF-10AXL). A reverse-phase Premier C18 column (250×4.6 mm i.d., 5 μm particle size, Shimadzu) protected with a C18 guard column cartridge was used. The mobile phase was 25% acetonitrile. The fluorescent compounds LacβMU and Neu5Acα2-3LacβMU were detected by excitation at 325 nm and emission at 372 nm.
- Acceptor substrate specificity assays by HPLC. Assays were performed in duplicate in 20 mL of Tris-HCl buffer (100 mM, pH 8.5) containing CMP-Neu5Ac (1 mM), a fluorescent acceptor (1 mM), MgCl2 (20 mM), and an enzyme (2 □g mL-1, wild-type PmST1 or E271R/R313Y mutant). Reactions were allowed to proceed for 5 min at 37° C. The 4-methylumbelliferone (MU)-labeled fluorescent acceptors and the products formed were detected with excitation at 325 nm and emission at 372 nm. The 9-fluorenylmethylcarbamate (Fmoc)-labeled fluorescent acceptors and the products formed were detected with excitation at 262 nm and emission at 313 nm. The 2-aminobenzoic acid (2AA)-labeled fluorescent acceptors and the products formed were detected with excitation at 315 nm and emission at 400 nm.
- Stability Studies by HPLC.
- Thermal stability studies were carried out by incubating wild-type PmST1 or E271F/R313Y mutant solution (20 μg mL−1) at 37° C. Samples were withdrawn at various time intervals for enzyme activity assays.
- Kinetics of the α2-3-Sialidase Activity.
- To test the involvement of D141 and H311 in the α2-3-sialidase activity of PmST1, the α2-3-sialidase activity of two previously obtained PmST1 mutants, D141A and H311A, were evaluated using a fluorescent α2-3-sialoside, Neu5Acα2-3LacβMU, as the substrate. The α2-3-sialidase activity of H311A mutant was too low to obtain the kinetic data. For the α2-3-sialidase activity of D141A mutant, the Km value (15±1 mM) was about the same as the wild-type PmST1 (24 mM), but its catalytic efficiency was about 7,300-fold lower than that of the wild-type PmST1 mainly due to a much slower turnover number of the D141A mutant (Table 1). These data indicated that both D141 and H311 are important for the α2-3-sialidase activity of PmST1.
-
TABLE 1 Apparent kinetic data for the α2-3-sialidase activity of wild-type PmST1 (WT) and PmST1 mutants. kcat/Km Km (mM) kcat (s−1) (s−1 mM−1) WT4 24 2.3 × 102 9.5 D141A 15 ± 1 (1.9 ± 0.1) × 10−2 1.3 × 10−3 E271F 5.7 ± 0.9 0.92 ± 0.04 0.16 R313Y 51 ± 5 0.18 ± 0.01 3.6 × 10−3 E271F/R313Y (5.4 ± 0.6) × 102 0.83 ± 0.08 1.5 × 10−3 - Kinetics of the α2-3-Sialidase Activity of the Mutants E271F, R313Y, and E271F/R313Y.
- The designed PmST1 mutants E271F, R313Y, and E271F/R313Y were expressed in E. coli using the same expression condition as the wild-type PmST1 (100 mg L−1 culture) and achieved a compatible level of expression (90 mg L−1 culture). Similar to the wild-type PmST1, one-step Ni2+-column purification was sufficient to provide pure protein (>99%) of the mutants.
- The kinetic assays for the α2-3-sialidase activity of the mutants E271F, R313Y, and E271F/R313Y using a fluorescent 4-methylumbelliferyl sialoside, Neu5Acα2-3LacβMU, as the substrate (Table 1) indicated that E271F mutation decreased the α2-3-sialidase activity of PmST1 about 59-fold which was mainly caused by a 250-fold decrease in the turnover number despite of a 4.2-fold decrease in the Km value. As expected, the R313Y mutation at a site close to the critical H311 residue for the α2-3-sialidase activity of PmST1 caused a 2,639-fold decrease in the catalytic efficiency (kcat/Km=0.0036 s−1 mM−1) compared to the wild-type PmST1 (kcat/Km=9.5 s−1 mM−1) mainly due to a (1,278-fold) decreased kcat value and a 2-fold increased Km value. The E271F/R313Y double mutant had the lowest α2-3-sialidase activity (kcat/Km=0.0015 s−1 mM−1) which was a 6,333-fold decrease compared to the wild-type PmST1 due to a 22.5-fold increase in the Km value and a 277-fold decrease in the kcat value.
- Kinetics of the α2-3-Sialyltransferase Activity of the Mutants E271F, R313Y, and E271F/R313Y.
- Kinetic assays (Table 2) for the α2-3-sialyltransferase activity of mutants E271F, R313Y, and E271F/R313Y using LacI3MU as the fluorescent acceptor and CMP-Neu5Ac as the donor indicated that either E271F or R313Y mutation did not cause significant changes on either the Km or the kcat value, leading to quite consistent catalytic efficiencies (kcat/Km=28-39 s−1 mM−1) compared to the wild-type PmST1 (kcat/Km=34 s−1 mM−1).
-
TABLE 2 Apparent kinetic data for the α2-3-sialyltransferase activity of wild-type PmST1 (WT) and PmST1 mutants. CMP-Neu5Ac LacβMU Enzymes Km (mM) kcat (s−1) kcat/Km (s−1 mM−1) Km (mM) kcat (s−1) kcat/Km (s−1 mM−1) WT4 0.44 32 73 1.4 47 34 E271F 0.18 ± 0.01 26 ± 1 1.4 × 102 0.71 ± 0.12 28 ± 1 39 R313Y 0.62 ± 0.04 19 ± 1 30 0.67 ± 0.05 19 ± 1 28 E271F/R313Y 0.34 ± 0.02 23 ± 1 69 0.54 ± 0.04 17 ± 1 32 - Acceptor Substrate Specificities of Wild-Type PmST1 and E271F/R313Y Mutant.
- Fluorescent glycans with different glycosidic linkages and various monosaccharide units, including Galβ1-4Glcβ, Galβ1-4GlcNAcβ, Galβ1-3GalNAcα, and Galβ1-3GlcNAcβ structures, were used to investigate the acceptor substrate specificities of the wild-type PmST1 and E271F/R313Y mutant. As shown in Error! Reference source not found, the E271F/R313Y mutant exhibited similar or slightly higher activity than the wild-type PmST1 towards different acceptors. Therefore, the acceptor promiscuity of PmST1 was not changed significantly by E271F and R313Y mutations.
- Thermal Stabilities of Wild-Type PmST1 and E271F/R313Y Mutant.
- Incubating the wild-type PmST1 and E271F/R313Y mutant at 37° C. for up to 2 hours did not decrease their activities significantly (Error! Reference source not found.). Therefore, both enzymes are considered quite stable and mutation does not affect the thermal stability of the PmST1.
- Site-directed mutagenesis, expression and purification of PmST1 mutants. Site-directed mutagenesis was performed using the QuikChange multi-site-directed mutagenesis kit from Stratagene according to the manufacturer's protocol. The primers used were 5′
AATCTTTATGACGATGGCTCAGATGAATATGTTGATTTAGAAAAAG 3′ for M144D; 5′AATCTTTATGACGATGGCTCACATGAATATGTTGATTTAGAAAAAG 3′ for M144H; 5′ATCACGCTGTATTTAGATCCTGATTCCTTACCGGCATTAAATCAG 3′ for A35D; and 5′ATCACGCTGTATTTAGATCCTCATTCCTTACCGGCATTAAATCAG 3′ for A35H. The expression and purification of the mutants were performed as previously described for the WT PmST1. - Kinetics of the donor hydrolysis activity of PmST1 and mutants by capillary electrophoresis analysis. The reactions were carried out in duplicate in a total volume of 10 μL at 37° C. for 15 min (WT), 40 min (D141A), 20 min (H311A), or 15 min (M144D and M144H) in Tris-HCl buffer (200 mM, pH 8.5) containing CMP-Neu5Ac (1, 2, 5, 10, 20 and 40 mM) and an enzyme (WT, 4 μg mL−1; D141A, 1500 μg mL−1; H311A, 40 μg mL−1; M144D, 39 μg mL−1; M144H, 5 μg mL−1). The reactions were stopped by adding 104 of pre-chilled ethanol. The mixtures were incubated on ice for 30 min and centrifuged at 13,000 rpm for 5 min. The supernatants were diluted with borate buffer (25 mM, pH 9.5) and aliquotes of 54 each were injected to a Beckman Coulter P/ACE™ MDQ Capillary Electrophoresis system equipped with a capillary (60 cm×75 μm i.d.) and monitored at 254 nm. The apparent kinetic parameters were obtained by fitting the experimental data (the average values of duplicate assay results) into the Michaelis-Menten equation using Grafit 5.0.
- Kinetics of the α2-3-sialyltransferase activity of PmST1 mutants by HPLC analysis. With LacβMU as the acceptor substrate, the reactions were performed in duplicate at 37° C. for 10 min (M144D) or 4 min (M144H) in a reaction mixture (10 μL) containing Tris-HCl (100 mM, pH 8.5), an enzyme (5 μg mL−1), and different concentrations (0.2, 0.5, 1.0, 2.0, and 5.0 mM) of LacβMU with a fixed concentration (1 mM) of CMP-Neu5Ac or different concentrations (0.2, 0.5, 1.0, 2.0, 5.0, and 10.0 mM) of CMP-Neu5Ac with a fixed concentration (1 mM) of LacβMU. With LexβMU as the acceptor substrate, the reactions were carried out in duplicate at 37° C. for 9 min (M144D) or 10 min (M144H) in a reaction mixture (10 μL) containing CAPSO (100 mM, pH 9.5), an enzyme (M144D, 39 μg mL−1 or M144H, 5 μg mL−1), and various concentrations of LexβMU (1.0, 5.0, 10.0, 15.0, 25.0, and 35.0 mM) with a fixed concentration (1 mM) of CMP-Neu5Ac or various concentrations (0.2, 0.5, 1.0, 2.0, 5.0, 10.0, 20.0, and 40.0 mM) of CMP-Neu5Ac with a fixed concentration (1 mM) of LexβMU. Reactions were stopped by adding 10 μL of pre-chilled ethanol. The mixtures were incubated on ice for 30 min and centrifuged at 13,000 rpm for 5 min. The supernatants were diluted with 25% acetonitrile and kept on ice until aliquots of 8 μL were injected and analyzed by the Shimadzu LC-6AD system equipped with a membrane on-line degasser, a temperature control unit, and a fluorescence detector (Shimadzu RF-10AXL). A reverse-phase Premier C18 column (250×4.6 mm i.d., 5 μm particle size, Shimadzu) protected with a C18 guard column cartridge was used. The mobile phase was 25% acetonitrile. The fluorophore (MU)-labeled compounds were detected by excitation at 325 nm and emission at 372 nm. The apparent kinetic parameters were obtained by fitting the experimental data (the average values of duplicate assay results) into the Michaelis-Menten equation using Grafit 5.0.
- Kinetics of the α2-3-Sialidase Activity of PmST1 Mutants.
- The reactions were performed in duplicate in a total volume of 10 μL at 37° C. for 60 min (M144D) or 15 min (M144H) in MES buffer (100 mM, pH 5.5) containing Neu5Acα2-3LacβMU (0.4, 1, 2, 4, 10, 20, 40 and 60 mM) and an enzyme (M144H, 1.36 mg mL−1 or M144D, 1.05 mg mL−1). Sample treatment after the reaction and analysis were carried out by HPLC similar to that described above for the α2-3-sialyltransferase assays.
- The α2-3-Sialidase Activity Assays of PmST1 and Mutants.
- The reactions were carried out in duplicate in a total volume of 10 μL at 37° C. for 20 hr in MES buffer (100 mM, pH 5.5) containing Neu5Acα2-3LexβMU (1 mM) and an enzyme (4 mg mL−1). Aliquots of 1 μL were withdrawn at 1 hr, 6 hr and 20 hr, and analyzed by HPLC as described above for the α2-3-sialyltransferase assays.
- Accession Codes.
- The structure of PmST1 M144D mutant in complex with CMP-3F(a)-Neu5Ac was deposited with a PDB ID code 3S44.
- Materials and Compound Characterization.
- Chemicals were purchased and used without further purification. 1H NMR (600 MHz) and 13C NMR (150 MHz) spectra were recorded on a Varian VNMRS 600 MHz spectrometer or 1H NMR (800 MHz) and 13C NMR (200 MHz) on a Bruker 800 MHz spectrometer. High resolution electrospray ionization (ESI) mass spectra were obtained at the Mass Spectrometry Facility in the University of California, Davis.
Silica gel 60 Å was used for flash column chromatography. Thin-layer chromatography (TLC) was performed on silica gel plates using anisaldehyde sugar stain or 5% sulfuric acid in ethanol stain for detection. Gel filtration chromatography was performed with a column (100 cm×2.5 cm) packed with BioGel P-2 Fine resins. Pasteurella multocida sialic acid aldolase, 1 N. meningitidis CMP-sialic acid synthetase (NmCSS), 2 and wild-type PmST1 were expressed in E. coli and purified as described previously. - Crystallization and Structure Determination.
- PmST1 M144D mutant in Tris-HCl buffer (20 mM, pH 7.5) was concentrated to 13 mg mL−1, and CMP-3F(axial)Neu5Ac was added to a final concentration of 2 mM. Binary CMP-3F(axial)Neu5Ac crystals were grown by hanging drop with 3 μL of the sample mixed with an equal volume of reservoir buffer [24% poly(ethylene glycol) 3350, 100 mM HEPES (pH 7.5), 50 mM NaCl, and 0.4% Triton X-100]. Then, the binary crystals were soaking with 10 mM of CMP-3F(axial)-Neu5Ac and 10 mM of LexβProN3 in buffer containing 26% poly (ethylene glycol) 3350, HEPES (100 mM, pH 7.5), NaCl (100 mM), and 0.4% Triton X-100 for overnight. All crystals were transferred to Paratone-N and frozen in a steam of nitrogen to 100 K for data collection. Diffraction data were collected at the Stanford Synchrotron Radiation Lightsource to 1.45 Å resolution. Data were processed with XDS and scaled with XSCALE (Table 3). The structure was solved by Molecular Replacement using the program PHASER. Only the ligand-free open conformation structure (PDB ID: 2EXO) was successful in structure determination. The model was displayed and adjusted with COOT and refined with REFMAC. Final data processing and refinement statistics are shown in Table 3.
-
TABLE 3 X-Ray data collection and refinement statistics for PmST1 M144D.d unit cell dimensions a,b,c (Å), β 52.44, 61.57, 62.58, β = 114.15° space group P21 no. of monomers per asymmetric unit 1 resolution range (Å) 25.0-1.45 (1.49-1.45) Rsym [a] (%) 3.8 (47.4) <I>/σ<I> 19.06 (2.58) no. of reflections 229,446 (16,432) no. of unique reflections 63,327 (4,962) redundancy 3.6 (3.3) completeness (%) 98.1 (98.2) Rfactor [b] (%) 18.7 Rfree [c] (%) 21.5 no. of protein atoms 3,197 no. of CMP atoms 21 no. of water atoms 431 mean B-factor (Å2) Protein, all atoms 14.8 Protein, main chain 13.4 Protein, side chain 16.2 CMP 20.1 water 25.8 rmsd from ideality bond distance (Å) 0.0128 bond angle (deg) 1.429 [a]Rmerge = [ΣhΣi|Ih − Ihi|/ΣhΣiIhi] where Ih is the mean of Ihi observations of reflection h. Numbers in parenthesis represent highest resolution shell. [b]R-Factor and [c]Rfree = Σ||Fobs| − |Fcalc||/Σ|Fobs| × 100 for 95% of recorded data (R-Factor) or 5% data (Rfree) dProtein Data Bank Accession codes: The structure of PmST1 M144D mutant in complex with CMP-3F(a)-Neu5Ac was deposited with a PDB ID code 3S44. - NMR Analysis of WT PmST1 and M144D Mutant.
- Enzymes were expressed in E. coli BL21 (DE3) using M9 media containing 15NH4Cl (1.0 g L−1), Na2HPO4.7H2O (12.66 g L−1), KH2PO4 (3.0 g L−1), NaCl (0.5 g L−1), MgSO4 (0.2 g CaCl2 (50 μM), and glucose (0.3%). Expressions were induced by adding 0.5 mM of isopropyl β-D-1-thiogalactopyranoside (IPTG) and incubating at 37° C. for 4 hr. The purifications were performed as previously described for the WT PmST1. The purified enzymes were dialyzed with a phosphate buffer (10 mM, pH 7.0). NMR samples of 15N-labeled WT and M144D PmST1 (˜0.7 mM) were prepared in 90%/10% of H2O/D2O containing 10 mM of phosphate (pH 7.0) in the presence or the absence of saturating CMP. 15N-1H HSQC NMR experiments were performed at 37° C. on Bruker Avance III 800 spectrometer with an Ultrashield Bruker magnet equipped with a four-channel interface, triple-resonance probe, and cryo-probe with Z-axis pulsed field gradients. The number of complex points and acquisition times were: 256, 180 ms (15N (F1)); and, 512, 64 ms (1H (F2)). The NMR spectra were processed and analyzed using the software, NMRPipe.
- One-pot three-enzyme synthesis of SLexβProN3 with different sialic acid forms. LexβProN3 (20-25 mg),9 a sialic acid precursor (mannose, ManNAc, ManNGc or their derivatives, 1.5 equiv.), sodium pyruvate (5 equiv.), and CTP (1.5 equiv.) were dissolved in Tris-HCl buffer (10 mL, 100 mM, pH 7.5-8.5) containing MgCl2 (20 mM) and appropriate amounts of Pm aldolase (0.5 mg), NmCSS (0.3-0.5 mg), and PmST1 mutant M144D (0.5-0.9 mg). The reactions were carried out by incubating the reaction mixture in an incubator shaker at 37° C. for 4-6 h. The product formation was monitored by TLC developed with EtOAc:MeOH:H2O:HOAc=4:2:1:0.2 (by volume) and stained withp-anisaldehyde sugar stain. When an optimal yield was achieved, the reaction was stopped by adding the same volume (10 mL) of cold EtOH and incubation at 4° C. for 30 min. The mixture was then centrifuged and the precipitates were removed. The supernatant was concentrated, passed through a BioGel P-2 gel filtration column, and eluted with water to obtain partially purified product. A silica gel column was then used to obtain pure sialylated products with EtOAc:MeOH:H2O=6:2:1 (by volume).
- NMR chemical shifts and HRMS data of SLexβProN3 containing different sialic acid forms synthesized by the one-pot three-enzyme system.
- Neu5Acα2-3LexβProN3 (1a).
- 33 mg, yield 93%. 1H NMR (600 MHz, D2O): δ 5.09 (d, 1H, J=4.2 Hz), 4.50 (d, 1H, J=7.8 Hz), 4.07 (dd, 1H, J=10.4 and 3.2 Hz), 4.01-3.82 (m, 11H), 3.74 (d, 1H, J=4.2 Hz), 3.66-3.59 (m, 9H), 3.56-3.50 (m, 4H), 3.36-3.30 (m, 2H), 2.72 (dd, 1H, J=12.6 and 4.8 Hz), 2.01 (s, 3H), 2.00 (s, 3H), 1.87 (m, 2H), 1.75 (t, 1H, J=12.3 Hz), 1.12 (d, 3H, J=6.6). 13C NMR (150 MHz, D2O): δ 175.20, 174.41, 174.05, 101.79, 101.15, 99.82, 98.76, 75.81, 75.42, 75.07, 74.98, 73.51, 73.07, 72.07, 72.03, 69.42, 69.35, 68.47, 68.28, 67.87, 67.47, 67.36, 66.84, 62.76, 61.64, 59.81, 55.98, 51.86, 47.93, 39.95, 28.27, 22.39, 22.20, 15.43. HRMS (ESI) m/z calcd for C34H57N5O23Na (M+Na) 926.3319, found 926.3342.
- Neu5Gcα2-3LexβProN3 (1b).
- 28 mg, yield 87%. 1H NMR (600 MHz, D2O): δ 5.13 (d, 1H, J=4.2 Hz), 4.56-4.54 (m, 2H), 4.15 (s, 2H), 4.07 (dd, 1H, J=10.4 and 3.2 Hz), 4.01-3.82 (m, 12H), 3.78-3.60 (m, 8H), 3.56-3.54 (m, 3H), 3.52 (dd, 1H, J=10.4 and 7.8 Hz), 3.36-3.30 (m, 2H), 2.78 (dd, 1H, J=12.6 and 4.8 Hz), 2.06 (s, 3H), 1.87 (m, 2H), 1.75 (t, 1H, J=12.3 Hz), 1.19 (d, 3H, J=6.6). 13C NMR (150 MHz, D2O): δ 175.73, 174.17, 173.84, 101.55, 100.91, 99.62, 98.52, 75.59, 75.20, 74.83, 74.75, 73.30, 72.57, 71.84, 69.20, 69.12, 69.10, 67.98, 67.65, 67.23, 67.13, 66.57, 66.61, 62.50, 60.91, 59.79, 59.60, 55.75, 51.33, 47.71, 39.78, 28.04, 22.16, 15.20. HRMS (ESI) m/z calcd for C34H57N5O24Na (M+Na) 942.3291, found 942.3292.
- Kdnα2-3LexβProN3 (1c).
- 27 mg, yield 85%. 1H NMR (600 MHz, D2O): δ 5.04 (d, 1H, J=4.2 Hz), 4.47-4.45 (m, 2H), 3.85 (dd, 1H, J=9.6 and 2.4 Hz), 3.81-3.62 (m, 10H), 3.56 (d, 1H, J=4.0 Hz), 3.49-3.29 (m, 12H), 3.18-3.14 (m, 3H), 2.65 (dd, 1H, J=12.6 and 4.8 Hz), 1.98 (s, 3H), 1.87 (m, 2H), 1.69 (t, 1H, J=12.3 Hz), 1.10 (d, 3H, J=6.6). 13C NMR (150 MHz, D2O): δ 174.36, 174.17, 101.72, 101.09, 99.76, 98.74, 75.70, 75.35, 75.02, 74.93, 74.04, 73.43, 72.26, 71.99, 70.32, 69.84, 69.34, 69.26, 67.79, 67.36, 67.28, 66.79, 62.73, 61.60, 59.73, 55.91, 47.84, 39.51, 28.21, 22.31, 15.37. HRMS (ESI) m/z calcd for C32H54N4O23Na (M+Na) 885.3077, found 885.3103.
- Neu5AcN3α2-3LexβProN3 (1d).
- 33 mg, yield 89%. 1H NMR (800 MHz, D2O): δ 5.06 (d, 1H, J=4.0 Hz), 4.47-4.56 (m, 2H), 4.01 (s, 2H), 3.96-3.76 (m, 11H), 3.72-3.59 (m, 10H), 3.54-3.45 (m, 3H), 3.47 (dd, 1H, J=10.4 and 7.8 Hz), 3.34-3.27 (m, 2H), 2.71 (dd, 1H, J=12.6 and 4.8 Hz), 1.98 (s, 3H), 1.81 (m, 2H), 1.74 (t, 1H, J=12.3 Hz), 1.10 (d, 3H, J=6.6). 13C NMR (200 MHz, D2O): δ 174.43, 174.09, 171.36, 101.75, 101.17, 99.82, 98.81, 75.79, 75.40, 75.08, 74.99, 73.47, 72.74, 72.10, 72.06, 69.43, 69.33, 68.35, 68.19, 67.86, 67.44, 67.35, 66.86, 62.72, 61.67, 59.79, 55.98, m 52.06, 51.92, 47.91, 39.97, 28.28, 22.38, 15.44. HRMS (ESI) m/z calcd for C34H56N8O23Na (M+Na) 967.3356, found 967.3396.
- KdnN3α2-3LexβProN3 (1e).
- 27 mg, yield 84%. 1H NMR (600 MHz, D2O): δ 5.11 (d, 1H, J=4.0 Hz), 4.54 (d, 1H, J=8.0 Hz), 4.52 (d, 1H, J=8.0 Hz), 4.07 (dd, 1H, J=9.6 and 3.2 Hz), 4.03-3.83 (m, 11H), 3.78 (d, 1H, J=3.2 Hz), 3.72-3.66 (m, 6H), 3.60-3.49 (m, 5H), 3.38-3.35 (m, 3H), 2.75 (dd, 1H, J=12.6 and 4.8 Hz), 2.04 (s, 3H), 1.83 (m, 2H), 1.78 (t, 1H, J=12.3 Hz), 1.17 (d, 3H, J=6.6). 13C NMR (150 MHz, D2O): δ 174.15, 173.64, 101.49, 100.88, 99.60, 98.52, 75.53, 75.16, 74.78, 74.73, 73.25, 72.77, 71.93, 71.80, 69.38, 69.16, 69.08, 68.30, 67.61, 67.10, 66.59, 62.50, 62.46, 61.38, 59.57, 55.73, 47.67, 39.50, 28.02, 22.13, 15.17. HRMS (ESI) m/z calcd for C32H53N7O22Na (M+Na) 910.3141, found 910.3137.
- 9-N3-Neu5Acα2-3LexβProN3 (1f).
- 28 mg, yield 91%. 1H NMR (800 MHz, D2O): δ 5.11 (d, 1H, J=4.0 Hz), 4.56 (d, 1H, J=8.0 Hz), 4.51 (d, 1H, J=8.0 Hz), 4.03-4.02 (m, 2H), 3.98-3.85 (m, 9H), 3.79 (d, 1H, J=3.2 Hz), 3.71-3.69 (m, 8H), 3.61-3.49 (m, 6H), 3.40-3.36 (m, 3H), 2.77 (dd, 1H, J=12.6 and 4.8 Hz), 2.05 (s, 3H), 1.83 (m, 2H), 1.79 (t, 1H, J=12.3 Hz), 1.17 (d, 3H, J=6.6). 13C NMR (200 MHz, D2O): δ 174.90, 174.16, 173.74, 101.51, 100.88, 99.58, 98.50, 75.63, 75.21, 74.81, 74.73, 73.24, 72.64, 72.42, 71.81, 70.39, 69.19, 69.09, 68.70, 68.20, 67.61, 67.15, 67.11, 66.59, 61.38, 59.60, 55.72, 53.01, 51.59, 47.68, 39.77, 28.02, 22.14, 15.18. HRMS (ESI) m/z calcd for C34H56N8O22Na (M+Na) 951.3407, found 910.3407.
- 9-O—Ac-Neu5Acα2-3LexβProN3 (1g).
- 20 mg, yield 62%. 1H NMR (600 MHz, D2O): δ 5.12 (d, 1H, J=4.0 Hz), 4.55-4.53 (m, 2H), 4.44 (dd, 1H, J=11.4 and 1.8 Hz), 4.20 (dd, 1H, J=11.4 and 6.6 Hz), 4.14-3.86 (m, 11H), 3.79 (d, 1H, J=3.0 Hz), 3.74-3.65 (m, 8H), 3.58-3.56 (m, 2H), 3.54 (dd, 1H, J=9.6 and 7.8 Hz), 3.41-3.36 (m, 2H), 2.78 (dd, 1H, J=12.6 and 4.8 Hz), 2.16 (s, 3H), 2.05 (s, 6H), 1.85 (m, 2H), 1.81 (t, 1H, J=12.6 Hz), 1.18 (d, 3H, J=6.6). 13C NMR (150 MHz, D2O): δ 174.18, 174.41, 174.00, 173.96, 101.80, 101.14, 99.80, 98.78, 75.88, 75.53, 75.09, 74.80, 73.54, 72.89, 72.08, 69.75, 69.57, 69.45, 69.37, 68.46, 68.37, 67.89, 67.37, 66.86, 65.98, 61.65, 59.84, 55.99, 51.85, 47.60, 40.05, 28.29, 22.41, 22.23, 20.44, 15.45. HRMS (ESI) m/z calcd for C36H59N5O24Na (M+Na) 968.3448, found 968.3427.
- 9-O—Ac-Neu5Gcα2-3LexβProN3 (1h).
- 21 mg, yield 64%. 1H NMR (600 MHz, D2O): δ 5.12 (d, 1H, J=4.0 Hz), 4.55 (dd, 1H, J=7.8 and 4.2 Hz), 4.45 (m, 1H), 4.22-4.19 (m, 1H), 4.14 (s, 2H), 4.13-3.79 (m, 13H), 3.73-3.66 (m, 8H), 3.61-3.59 (m, 2H), 3.55 (t, 1H, J=7.8 Hz), 3.41-3.37 (m, 2H), 2.80 (dd, 1H, J=12.6 and 4.8 Hz), 2.16 (s, 3H), 2.06 (s, 6H), 1.88-1.85 (m, 2H), 1.81 (t, 1H, J=12.6 Hz), 1.19 (d, 3H, J=6.6). 13C NMR (150 MHz, D2O): δ 175.93, 174.54, 174.41, 174.01, 101.80, 101.15, 99.81, 99.77, 75.86, 75.53, 75.09, 75.00, 73.54, 72.62, 72.09, 69.75, 69.63, 69.45, 69.37, 68.30, 68.23, 67.90, 67.36, 66.86, 65.93, 61.65, 61.17, 59.83, 55.99, 51.54, 47.95, 40.10, 28.29, 22.41, 20.44, 15.45. HRMS (ESI) m/z calcd for C36H59N5O24Na (M+Na) 984.3397, found 984.3397.
- Donor hydrolysis by PmST1 causes low yield sialylation of Lex. In order to understand why PmST1-catalyzed sialylation of Lex resulted in low yields, time course studies were carried out using a fluorescently labeled Lex acceptor (4-methylumbelliferyl 13-Lex or LexβMU) in a high performance liquid chromatography (HPLC) assay. As shown in Error! Reference source not found, PmST1-catalyzed sialylation of LexβMU (1 mM) using one equivalent of donor CMP-Neu5Ac reached a low yield (1.1-1.3%) plateau quickly within 2 min. Every additional dose of donor substrate CMP-Neu5Ac (shown by arrows in Error! Reference source not found.) increased the product formation which always reached a plateau quickly. Monitoring the CMP-Neu5Ac consumption (% consumption numbers are shown in parentheses in Error! Reference source not found.) in the reaction mixture by capillary electrophoresis studies confirmed a quick consumption of CMP-Neu5Ac. These indicated that donor (CMP-Neu5Ac) hydrolysis activity of PmST1, where water molecules compete with the poor Lex acceptor for the consumption of sugar nucleotide (CMP-Neu5Ac) donor of the sialyltransferase (Error! Reference source not found.), contributed significantly to the low yield of PmST1-catalyzed sialylation. In fact, donor hydrolysis has been observed in other glycosyltransferase-catalyzed reactions that lead to lower synthetic yields. The donor hydrolysis were observed frequently in co-crystallization of glycosyltransferases with a corresponding sugar nucleotide donor where its sugar component was usually cleaved off and only the hydrolyzed nucleotide was observed in the substrate binding pocket of the enzyme. Therefore, inert donor derivatives of glycosyltransferases have been commonly applied in the x-ray crystal structure studies of glycosyltransferases. Two recent papers discussed the donor hydrolysis activities of human blood group A and B glycosyltransferases (GTA and GTB) which are Mn2+-dependent and the UDP-Gal hydrolysis activity of GTB is increased in the presence of an acceptor substrate analog. Nevertheless, the effect of donor hydrolysis of glycosyltransferases on glycosylation processes has not been investigated in detail. In addition, no strategy has been reported for improving the yields of glycosyltransferase-catalyzed reactions by decreasing donor hydrolysis activity.
- Asp141 and His311 influence PmST1 donor hydrolysis activity. As shown in Table 4, D141A mutation decreased the efficiency of CMP-Neu5Ac hydrolysis activity of PmST1 by 1,000-fold mainly due to the decrease in the turnover number. H311A mutation also decreased the CMP-Neu5Ac hydrolysis activity by 16-fold, mainly contributed by a decreased turnover number without affecting the binding affinity significantly.
-
TABLE 4 Apparent kinetics of the CMP-Neu5Ac hydrolysis activity of WT PmST1 and mutants. Km (mM) kcat (s−1) kcat/Km (s−1 mM−1) WT 1.5 ± 0.2 27 ± 1 18 aD141A 1.4 ± 0.2 (2.5 ± 0.1) × 10−2 1.8 × 10−2 aH311A 1.8 ± 0.2 2.1 ± 0.1 1.1 M144D 7.3 ± 0.5 6.5 ± 0.1 0.89 M144H 13 ± 3 71 ± 6 5.5 aPmST1 D141A and H311A mutants were generated previously. See, Ni, et al. (2006) Biochemistry 45, 2139-2148. - PmST1 Mutants with Decreased CMP-Neu5Ac Hydrolysis Activity.
- As shown in Table 4, both M144D and M144H mutations decreased the efficiency of donor hydrolysis. M144D mutation decreased the efficiency of donor hydrolysis by 20-fold due to a 4.9-fold increase of the Km value and a 4.2-fold decrease of the kcat value. M144H mutation caused a less significant 3.3-fold decrease in the efficiency of donor hydrolysis due to a significant 8.7-fold increase in the Km value which is offset by a 2.6-fold increase in the kcat value.
- α2-3-Sialyltransferase Activities of PmST1 Mutants.
- As shown in Table 5, when a good sialyltransferase acceptor 4-methylumbelliferyl β-lactoside (LacβMU) was used, the M144D mutation decreased the α2-3-sialyltransferase activity by 18-fold due to a 9-fold increase of Km value and a 2-fold decrease of kcat value. When a poor sialyltransferase acceptor LexβMU was used, the M144D mutation did not change the efficiency of the α2-3-sialyltransferase activity of PmST1 significantly. In comparison, M144H mutation only decreased the α2-3-sialyltransferase activity weakly (1.3-fold) when LacβMU was used as an acceptor and increased the efficiency of α2-3-sialyltransferase activity by 2.6-fold when Lex3MU was used as an acceptor.
-
TABLE 5 Apparent kinetics of the α2-3-sialyltransferase activity of WT PmST1 and mutants. Km (mM) kcat (s−1) kcat/Km (s−1 mM−1) WT M144D M144H WT M144D M144H WT M144D M144H LacβMU a1.4 12 ± 1 0.79 ± 0.04 a47 22 ± 1 21 ± 1 a34 1.9 27 bCMP-Neu5Ac a0.44 0.30 ± 0.05 0.81 ± 0.06 a32 1.9 ± 0.1 21 ± 1 a73 6.1 27 LexβMU 17 ± 2 13 ± 2 8.1 ± 0.9 6.7 ± 0.3 4.0 ± 0.2 8.4 ± 0.3 0.38 0.32 1.0 cCMP-Neu5Ac 0.39 ± 0.03 2.1 ± 0.1 0.4 ± 0.05 0.55 ± 0.01 0.59 ± 0.01 0.93 ± 0.02 1.4 0.28 2.2 aData are from Yu, H., et al. (2005) J. Am. Chem. Soc. 127, 17618-17619. bWith LacβMU, cWith LexβMU. - PmST1 M144D Mutant has a Decreased α2-3-Sialidase Activity.
- M144D and M144H mutations also decreased the α2-3-sialidase activity of PmST1 by 5588- and 594-fold respectively when Neu5Acα2-3LacβMU was used as the sialidase substrate (Table 6). While the PmST1 M144D mutant showed no sialidase activity when Neu5Acα2-3LexβMU was used as the substrate, PmST1 M144H has increased sialidase activity compared to the WT PmST1 using the SLex substrate. For example, the PmST1 M144H mutant cleaved 10.0%, 24.5%, and 34.0% of Neu5Ac from Neu5Acα2-3LexβMU in 1 h, 6 h, and 20 h, respectively. In comparison, WT PmST1 removed 2.0%, 7.0%, and 7.5% of Neu5Ac from Neu5Acα2-3LexβMU under the same reaction conditions. The decreased α2-3-sialidase activity by M144D mutation allows the potential application of the PmST1 M144D mutant in sialylation of glycoconjugates containing terminal galactoside or Lex where the decreased α2-3-sialidase activity has the most advantages as these reactions are challenging for prompt monitoring.
-
TABLE 6 Apparent kinetics of the α2-3-sialidase activity of WT PmST1, M144D, and M144H mutants using Neu5Acα2-3LacβMU as the sialidase substrate. Km (mM) kcat (s−1) kcat/Km (s−1 mM−1) aWT a24 a2.3 × 102 a9.5 M144D 20 ± 2 (3.5 ± 0.1) × 10−2 1.7 × 10−3 M144H 1.7 ± 0.3 (2.7 ± 0.2) × 10−2 1.6 × 10−2 aData are from Yu, H., et al. (2005) J. Am. Chem. Soc. 127, 17618-17619. - PmST1 M144D mutant has a similar expression level as the WT PmST1. The PmST1 M144D mutation did not change the enzyme expression level in E. coli. About 98 mg of C-His6-tagged PmST1 M144D protein can be routinely purified from one liter of E. coli cell culture using Ni2+-affinity column (Error! Reference source not found.). This expression level is very similar to that (100 mg) of the WT PmST1 and allows the application of the mutant in preparative and large-scale synthesis of SLex antigens.
- X-Ray Crystal Structure of PmST1 M144D Mutant.
- The structure of the PmST1 M144D mutant with CMP-3F(axiai)-Neu5Ac was determined to 1.45 Å resolution with Rfactor and Rfree values of 18.7% and 21.5% respectively Table 3). Error! Reference source not found. shows the structural comparison between WT PmST1 and M144D mutant with bound CMP donor. Error! Reference source not found. A shows the overall structure of WT PmST1 with CMP bound (white tubes), aligned with the C-terminal domain of the M144D mutant (grey tubes) also with CMP bound (space filled atoms). Error! Reference source not found. B shows the stereo view of the superposition near the active site. WT PmST1 is shown as white tubes with bound CMP-3F(a)-Neu5Ac (sticks with white carbon bonds) and lactose acceptor (sticks with dark grey carbon bonds). The M144D mutant in shown as grey tubes with CMP bound (sticks with light grey carbon bonds). Error! Reference source not found. C shows the active site of the ternary crystal structure of PmST1 (PDB 1D: 21HZ) with bound CMP-3F(axial)-Neu5Ac and lactose. The mutation site M144 is underlined.
- The structure resides in the open conformation similar to the wild-type structure with no substrate (rmsd of 0.50 Å for 385 equivalent α-carbons). However, the M144D structure contains well-ordered electron density in the active site that clearly defines the CMP nucleotide. The sialic acid moiety is disordered, likely due to dynamics and/or multiple conformations in the open state of the enzyme. In the M144D structure, the CMP moiety does not bind as deeply into the pocket of the active site as the WT PmST1. The base and ribose are situated about 1.5 and 2.0 Å respectively, farther out of the active site compared to the WT PmST1. In the wild-type structure, Glu338 forms bidentate hydrogen bond interactions with both the 2′ and 3′ OH of the CMP ribose. In the M144D structure, an ordered water molecule mediates the interaction between the ribose and Glu338. The more shallow binding of the donor nucleotide in the M144D structure does not pull down the β-strand and the ensuing loop that contains Trp270. In comparison, in the wild-type enzyme, donor-nucleotide binding pulls down a β-strand causing Trp270 to pop out of the C-terminal domain, where it helps define the acceptor binding site in the sialyltransferase reaction.
- PmST1 M144D mutant is more efficient than M144H mutant in sialylating Lex. Overall, the M144D mutation decreased the undesired CMP-Neu5Ac hydrolysis activity significantly (20-fold) without appreciably changing the efficiency of the α2-3-sialyltransferase activity when Lex was used as an acceptor. As a result, M144D showed an overall improved activity in sialylation of Lex for the formation of sialyl Lex (SLex) structures. In comparison, M144H mutant which has a 3.3-fold decreased CMP-Neu5Ac hydrolysis activity and 2.6-fold increased α2-3-sialyltransferase activity using Lex as an acceptor was less effective for directly sialylating Lex.
- Synthesis of SLex containing diverse sialic acid forms using PmST1 M144D mutant. The application of the PmST1 M144D mutant obtained by protein structure-based rational design in the synthesis of SLex containing diverse naturally occurring and non-natural sialic acid forms was demonstrated using an efficient one-pot three-enzyme chemoenzymatic synthetic system (Error! Reference source not found.). The system contained PmST1 M144D mutant, an Neisseria meningitidis CMP-sialic acid synthetase (NmCSS), and a Pasteurella multocida sialic acid aldolase. N-Acetylmannosamine (ManNAc), mannose, and their derivatives were used for in situ synthesis of CMP-sialic acids and derivatives. Lex trisaccharide used as the sialyltransferase acceptor was synthesized using a one-pot two-enzyme system containing a bifunctional L-fucokinase/GDP-fucose pyrophosphorylase (FKP) cloned from Bacteroides fragilis and a recombinant Helicobacter pylori α1-3-fucosyltransferase as shown previously. As shown in Error! Reference source not found, SLex tetrasaccharides containing natural sialic acid forms including N-acetylneuraminc acid (Neu5Ac), N-glycolylneuraminic acid (Neu5Gc), 2-keto-3-deoxy-D-glycero-D-galacto-nonulosonic acid (Kdn), as well as 9-O-acetylated Neu5Ac and Neu5Gc were obtained in excellent (85-93%) to good yields (62-64%). The relatively lower yields for the synthesis of SLex containing the 9-O-acetyl sialic acid forms were due to the de-O-acetylation process leading to the formation of non-O-acetylated SLex oligosaccharides. In addition, SLex containing non-natural sialic acid forms including those with an N-azidoacetyl group or an azido group at C-5 or a C-9 azido group were also successfully obtained in excellent yields (84-91%).
- Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, one of skill in the art will appreciate that certain changes and modifications may be practiced within the scope of the appended claims. In addition, each reference provided herein is incorporated by reference in its entirety to the same extent as if each reference was individually incorporated by reference. Where a conflict exists between the instant application and a reference provided herein, the instant application shall dominate.
Claims (34)
1. An isolated glycosyltransferase, wherein
the amino acid corresponding to position 120 of SEQ ID NO:1 is any amino acid other than M,
the amino acid corresponding to position 247 of SEQ ID NO:1 is any amino acid other than E, or
the amino acid corresponding to position 289 of SEQ ID NO:1 is any amino acid other than R,
wherein the glycosyltransferase has decreased α2-3 sialidase or donor substrate hydrolysis activity compared to a control glycosyltransferase, wherein the amino acid of the glycosyltransferase corresponding to position 120 of SEQ ID NO:1 is M, the amino acid corresponding to position 247 of SEQ ID NO:1 is E, and the amino acid corresponding to position 289 of SEQ ID NO:1 is R,
and wherein the glycosyltransferase is a member of the glycosyltransferase family 80 (GT80).
2. The isolated glycosyltransferase of claim 1 , wherein the isolated glycosyltransferase has decreased α2-3 sialidase activity, and
the amino acid of the glycosyltransferase corresponding to position 247 of SEQ ID NO:1 is any amino acid other than E, or
the amino acid of the glycosyltransferase corresponding to position 289 of SEQ ID NO:1 is any amino acid other than R.
3. The isolated glycosyltransferase of any of claims 1 -2 , wherein the ratio of α2-3 sialidase activity for the control glycosyltransferase to the α2-3 sialidase activity of the isolated glycosyltransferase is at least 5:1.
4. The isolated glycosyltransferase of claim 3 , wherein the ratio is at least 10:1.
5. The isolated glycosyltransferase of claim 3 , wherein the ratio is at least 100:1.
6. The isolated glycosyltransferase of claim 3 , wherein the ratio is at least 1000:1.
7. The isolated glycosyltransferase of any of claims 1 -6 , wherein the isolated glycosyltransferase comprises:
the amino acid corresponding to position 247 of SEQ ID NO:1 is any amino acid other than E, and
the amino acid corresponding to position 289 of SEQ ID NO:1 is any amino acid other than R.
8. The isolated glycosyltransferase of any of claims 1 -7 , wherein the isolated glycosyltransferase has decreased donor substrate hydrolysis activity, and wherein the amino acid corresponding to position 120 of SEQ ID NO:1 is any amino acid other than M.
9. The isolated glycosyltransferase of claim 8 , wherein the ratio of donor substrate hydrolysis activity for the control α2-3 sialidase to the donor substrate hydrolysis activity of the isolated glycosyltransferase is at least 2:1.
10. The isolated glycosyltransferase of any of claims 1 -9 , wherein the amino acid corresponding to position 120 of SEQ ID NO:1 is any amino acid selected from the group consisting of V, I, L, F, W, P, S, T, A, G, C, Y, N, Q, D, E, K, R, and H.
11. The isolated glycosyltransferase of any of claims 1 -10 , wherein the amino acid corresponding to position 247 of SEQ ID NO:1 is any amino acid selected from the group consisting of V, I, L, M, F, W, P, S, T, A, G, C, Y, N, Q, D, K, R, and H.
12. The isolated glycosyltransferase of any of claims 1 -11 , wherein the amino acid corresponding to position 289 of SEQ ID NO:1 is any amino acid selected from the group consisting of V, I, L, M, F, W, P, S, T, A, G, C, Y, N, Q, D, E, K, and H.
13. The isolated glycosyltransferase of any of claims 1 -12 , wherein
the amino acid corresponding to position 120 of SEQ ID NO:1 is D, E, H, K or R,
the amino acid corresponding to position 247 of SEQ ID NO:1 is F, Y or W, or
the amino acid corresponding to position 289 of SEQ ID NO:1 is Y, F or W.
14. The isolated glycosyltransferase of any of claims 1 -13 , wherein
the amino acid corresponding to position 120 of SEQ ID NO:1 is D or H,
the amino acid corresponding to position 247 of SEQ ID NO:1 is F, or
the amino acid corresponding to position 289 of SEQ ID NO:1 is Y.
15. The isolated glycosyltransferase of any of claims 1 -14 , wherein the glycosyltransferase is an α2-3 sialyltransferase.
16. The isolated glycosyltransferase of claim 15 , comprising a motif in the sialyltransferase domain comprising at least one member selected from the group consisting of sialyltransferase motif A (YDDGS) and sialyltransferase motif B (KGH).
17. The isolated glycosyltransferase of any of claims 1 -16 , wherein the control glycosyltransferase is SEQ ID NO:1.
18. The isolated glycosyltransferase of claim 17 , wherein the glycosyltransferase comprises a polypeptide sequence having at least 80% sequence identity to SEQ ID NO:1.
19. The isolated glycosyltransferase of claim 1 , wherein the isolated glycosyltransferase comprises a polypeptide sequence selected from the group consisting of SEQ ID NO: 3 (M120D), SEQ ID NO: 5 (M120H), SEQ ID NO: 7 (E247F), SEQ ID NO: 9 (R289Y) and SEQ ID NO: 11 (E247F/R289Y).
20. A recombinant nucleic acid encoding an isolated glycosyltransferase of any of claims 1 -19 .
21. A cell comprising an recombinant nucleic acid of claim 20 .
22. The cell of claim 21 , wherein the cell is selected from the group consisting of bacteria, yeast, insect, mammalian and plant cells.
23. A method of preparing an oligosaccharide, the method comprising:
a) forming a reaction mixture comprising an acceptor sugar, a donor substrate comprising a sugar moiety and a nucleotide, and the glycosyltransferase of any of claims 1 -19 , under conditions sufficient to transfer the sugar moiety from the donor substrate to the acceptor sugar, thereby forming the oligosaccharide.
24. The method of claim 23 , wherein the glycosyltransferase comprises a polypeptide sequence selected from the group consisting of SEQ ID NO: 3 (M120D), SEQ ID NO: 5 (M120H), SEQ ID NO: 7 (E247F), SEQ ID NO: 9 (R289Y) and SEQ ID NO: 11 (E247F/R289Y).
25. The method of claim 23 , wherein the glycosyltransferase comprises a polypeptide sequence selected from the group consisting of SEQ ID NO: 3 (M120D) and SEQ ID NO: 5 (M120H).
26. The method of claim 23 , wherein the isolated glycosyltransferase comprises a polypeptide sequence selected from the group consisting of SEQ ID NO: 7 (E247F), SEQ ID NO: 9 (R289Y) and SEQ ID NO: 11 (E247F/R289Y).
27. The method of claim 23 , wherein the donor substrate comprises a cytidine 5′-monophosphate(CMP)-sialic acid.
28. The method of claim 27 , wherein the CMP-sialic acid comprises cytidine 5′-monophosphate N-acetylneuraminic acid (CMP-Neu5Ac) or a CMP-Neu5Ac analog.
29. The method of claim 28 , further comprising:
b) forming a reaction mixture comprising a CMP-sialic acid synthetase, cytidine triphosphate, and N-acetylneuraminic acid (Neu5Ac) or a Neu5Ac analog, under conditions suitable to form the CMP-Neu5Ac or CMP-Neu5Ac analog.
30. The method of claim 29 , wherein steps a) and b) are performed in one pot.
31. The method of claim 29 , further comprising:
c) forming a reaction mixture comprising a sialic acid aldolase, pyruvic acid or derivatives thereof, and N-acetylmannosamine or derivatives thereof, under conditions suitable to form the Neu5Ac or Neu5Ac analog.
32. The method of claim 31 , wherein steps a), b), and c) are performed in one pot.
33. The method of any of claims 23 -32 , wherein the oligosaccharide is an ci2-3-linked sialyloligosaccharide.
34. The method of claim 23 , wherein the oligosaccharide is a fucosylated oligosaccharide.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US14/991,774 US20170204381A1 (en) | 2011-08-05 | 2016-01-08 | Pmst1 mutants for chemoenzymatic synthesis of sialyl lewis x compounds |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201161515702P | 2011-08-05 | 2011-08-05 | |
| PCT/US2012/049748 WO2013022836A2 (en) | 2011-08-05 | 2012-08-06 | Pmst1 mutants for chemoenzymatic synthesis of sialyl lewis x compounds |
| US201414237334A | 2014-06-18 | 2014-06-18 | |
| US14/991,774 US20170204381A1 (en) | 2011-08-05 | 2016-01-08 | Pmst1 mutants for chemoenzymatic synthesis of sialyl lewis x compounds |
Related Parent Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2012/049748 Continuation WO2013022836A2 (en) | 2011-08-05 | 2012-08-06 | Pmst1 mutants for chemoenzymatic synthesis of sialyl lewis x compounds |
| US14/237,334 Continuation US9255257B2 (en) | 2011-08-05 | 2012-08-06 | PmST1 mutants for chemoenzymatic synthesis of sialyl lewis X compounds |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20170204381A1 true US20170204381A1 (en) | 2017-07-20 |
Family
ID=47669184
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US14/237,334 Active 2033-01-23 US9255257B2 (en) | 2011-08-05 | 2012-08-06 | PmST1 mutants for chemoenzymatic synthesis of sialyl lewis X compounds |
| US14/991,774 Abandoned US20170204381A1 (en) | 2011-08-05 | 2016-01-08 | Pmst1 mutants for chemoenzymatic synthesis of sialyl lewis x compounds |
Family Applications Before (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US14/237,334 Active 2033-01-23 US9255257B2 (en) | 2011-08-05 | 2012-08-06 | PmST1 mutants for chemoenzymatic synthesis of sialyl lewis X compounds |
Country Status (2)
| Country | Link |
|---|---|
| US (2) | US9255257B2 (en) |
| WO (1) | WO2013022836A2 (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2019126749A1 (en) * | 2017-12-21 | 2019-06-27 | The Regents Of The University Of California | Sialyltransferase variants having neosialidase activity |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2018201058A2 (en) * | 2017-04-27 | 2018-11-01 | The Regents Of The University Of California | Sialidase inhibitors and preparation thereof |
| WO2019020707A1 (en) * | 2017-07-26 | 2019-01-31 | Jennewein Biotechnologie Gmbh | Sialyltransferases and their use in producing sialylated oligosaccharides |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5374541A (en) | 1993-05-04 | 1994-12-20 | The Scripps Research Institute | Combined use of β-galactosidase and sialyltransferase coupled with in situ regeneration of CMP-sialic acid for one pot synthesis of oligosaccharides |
| WO2003027297A1 (en) | 2001-09-26 | 2003-04-03 | Kyowa Hakko Kogyo Co., Ltd. | PROCESS FOR PRODUCING α2,3/α2,8-SIALYLTRANSFERASE AND SIALIC ACID-CONTAINING COMPLEX SUGAR |
| NZ567841A (en) | 2003-03-06 | 2010-01-29 | Seneb Biosciences Inc | Methods and compositions for the enzymatic synthesis of gangliosides |
| EP1789558A4 (en) | 2004-09-17 | 2008-10-01 | Ca Nat Research Council | SIALYLTRANSFERASES WITH PRESERVED SEQUENCE MOTIVES |
| WO2006112025A1 (en) * | 2005-04-15 | 2006-10-26 | Japan Tobacco Inc. | NOVEL β-GALACTOSIDE-α-2,3-SIALYLTRANSFERASE, GENE ENCODING THE SAME AND METHOD FOR PRODUCING THE SAME |
-
2012
- 2012-08-06 US US14/237,334 patent/US9255257B2/en active Active
- 2012-08-06 WO PCT/US2012/049748 patent/WO2013022836A2/en not_active Ceased
-
2016
- 2016-01-08 US US14/991,774 patent/US20170204381A1/en not_active Abandoned
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2019126749A1 (en) * | 2017-12-21 | 2019-06-27 | The Regents Of The University Of California | Sialyltransferase variants having neosialidase activity |
| US11739305B2 (en) | 2017-12-21 | 2023-08-29 | The Regents Of The University Of California | Sialyltransferase variants having neosialidase activity |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2013022836A3 (en) | 2013-05-10 |
| US20140302565A1 (en) | 2014-10-09 |
| WO2013022836A2 (en) | 2013-02-14 |
| US9255257B2 (en) | 2016-02-09 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP6129412B2 (en) | Method for producing sialic acid derivative | |
| AU2011356210B2 (en) | Novel fucosyltransferases and their applications | |
| US20020132320A1 (en) | Glycoconjugate synthesis using a pathway-engineered organism | |
| JP2023506287A (en) | Alpha-1,2-fucosyltransferase enzyme that converts lactose | |
| KR20220128581A (en) | Enzymatic hexosaminidization of lactose | |
| Ding et al. | A Photobacterium sp. α2–6-sialyltransferase (Psp2, 6ST) mutant with an increased expression level and improved activities in sialylating Tn antigens | |
| JP2011167200A (en) | H.pylori fucosyltransferase | |
| CN116790649A (en) | A method for enzymatic synthesis of UDP-glucuronic acid and UDP-N-acetylglucosamine | |
| EP2905341B1 (en) | Methods for producing GDP-fucose using nucleotide sugar transporters and recombinant microorganism host cells used therefor | |
| JP5189585B2 (en) | Novel β-galactoside-α2,6-sialyltransferase, gene encoding the same, and method for improving enzyme activity | |
| US20170204381A1 (en) | Pmst1 mutants for chemoenzymatic synthesis of sialyl lewis x compounds | |
| Yamamoto et al. | Bacterial sialyltransferases | |
| US9783838B2 (en) | PmST3 enzyme for chemoenzymatic synthesis of alpha-2-3-sialosides | |
| WO2019035482A1 (en) | Protein exhibiting epimerization activity | |
| US9938510B2 (en) | Photobacterium sp. alpha-2-6-sialyltransferase variants | |
| TW201732039A (en) | New polyphosphate-dependent glucokinase and method for preparing glucose 6-phosphate using the same | |
| WO2006043305A1 (en) | Method of enhancing enzymatic activity of glycosyltransferase | |
| EP2599867A1 (en) | Novel enzyme protein, process for production of the enzyme protein, and gene encoding the enzyme protein | |
| US9102967B2 (en) | PmST2 enzyme for chemoenzymatic synthesis of α-2-3-sialylglycolipids | |
| WO2007105321A1 (en) | NOVEL β-GALACTOSIDE α2,6-SIALYLTRANSFERASE, GENE CODING FOR THE TRANSFERASE AND PROCESS FOR PRODUCING THE SAME | |
| WO2024097788A1 (en) | Glycosyltransferase engineering for chemoenzymatic total synthesis of gangliosides | |
| KR100984567B1 (en) | Novel nucleoside diphosphate-glucose synthase and a method for producing nucleoside diphosphate-glucose using the same | |
| Betenbaugh et al. | N-Acetylneuraminic acid synthase (NANS) | |
| WO2021031170A1 (en) | Polyphosphate kinase rmppk, and coding gene and application thereof | |
| Xiao | Enzymatic Synthesis of Common Sugar Nucleotide and Therapeutic Oligosaccharides |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |