CN115466756A - 转氨酶、固定化转氨酶及用于制备西他列汀的用途 - Google Patents
转氨酶、固定化转氨酶及用于制备西他列汀的用途 Download PDFInfo
- Publication number
- CN115466756A CN115466756A CN202110655349.9A CN202110655349A CN115466756A CN 115466756 A CN115466756 A CN 115466756A CN 202110655349 A CN202110655349 A CN 202110655349A CN 115466756 A CN115466756 A CN 115466756A
- Authority
- CN
- China
- Prior art keywords
- seq
- leu
- val
- gly
- ile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000340 Transaminases Proteins 0.000 title claims abstract description 143
- 102000003929 Transaminases Human genes 0.000 title claims abstract description 139
- MFFMDFFZMYYVKS-SECBINFHSA-N sitagliptin Chemical compound C([C@H](CC(=O)N1CC=2N(C(=NN=2)C(F)(F)F)CC1)N)C1=CC(F)=C(F)C=C1F MFFMDFFZMYYVKS-SECBINFHSA-N 0.000 title claims abstract description 43
- 229960004034 sitagliptin Drugs 0.000 title claims abstract description 41
- 238000002360 preparation method Methods 0.000 title claims abstract description 24
- 238000006243 chemical reaction Methods 0.000 claims abstract description 59
- 239000000758 substrate Substances 0.000 claims abstract description 36
- 229920005989 resin Polymers 0.000 claims abstract description 30
- 239000011347 resin Substances 0.000 claims abstract description 30
- FZRKAZHKEDOPNN-UHFFFAOYSA-N Nitric oxide anion Chemical compound O=[N-] FZRKAZHKEDOPNN-UHFFFAOYSA-N 0.000 claims abstract description 26
- 229910052739 hydrogen Inorganic materials 0.000 claims abstract description 8
- 238000004519 manufacturing process Methods 0.000 claims abstract description 5
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 11
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 claims description 45
- NGVDGCNFYWLIFO-UHFFFAOYSA-N pyridoxal 5'-phosphate Chemical compound CC1=NC=C(COP(O)(O)=O)C(C=O)=C1O NGVDGCNFYWLIFO-UHFFFAOYSA-N 0.000 claims description 38
- 239000002773 nucleotide Substances 0.000 claims description 32
- 125000003729 nucleotide group Chemical group 0.000 claims description 32
- 239000000243 solution Substances 0.000 claims description 22
- 235000007682 pyridoxal 5'-phosphate Nutrition 0.000 claims description 19
- 239000011589 pyridoxal 5'-phosphate Substances 0.000 claims description 19
- 229960001327 pyridoxal phosphate Drugs 0.000 claims description 19
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 19
- 102000004190 Enzymes Human genes 0.000 claims description 13
- 108090000790 Enzymes Proteins 0.000 claims description 13
- JJWLVOIRVHMVIS-UHFFFAOYSA-N isopropylamine Chemical group CC(C)N JJWLVOIRVHMVIS-UHFFFAOYSA-N 0.000 claims description 13
- 238000000034 method Methods 0.000 claims description 13
- 239000007810 chemical reaction solvent Substances 0.000 claims description 12
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 claims description 12
- 239000007864 aqueous solution Substances 0.000 claims description 10
- 229960004115 sitagliptin phosphate Drugs 0.000 claims description 10
- RTZRUVMEWWPNRR-UHFFFAOYSA-N tert-butyl n-(3-iodo-1h-pyrrolo[2,3-b]pyridin-5-yl)carbamate Chemical compound CC(C)(C)OC(=O)NC1=CN=C2NC=C(I)C2=C1 RTZRUVMEWWPNRR-UHFFFAOYSA-N 0.000 claims description 10
- 238000003756 stirring Methods 0.000 claims description 9
- QAEDTLFWHIEVPK-UHFFFAOYSA-N 1-[3-(trifluoromethyl)-6,8-dihydro-5h-[1,2,4]triazolo[4,3-a]pyrazin-7-yl]-4-(2,4,5-trifluorophenyl)butane-1,3-dione Chemical group C1=C(F)C(F)=CC(F)=C1CC(=O)CC(=O)N1CC2=NN=C(C(F)(F)F)N2CC1 QAEDTLFWHIEVPK-UHFFFAOYSA-N 0.000 claims description 8
- 230000001580 bacterial effect Effects 0.000 claims description 6
- 239000013604 expression vector Substances 0.000 claims description 6
- 238000001914 filtration Methods 0.000 claims description 6
- 108091033319 polynucleotide Proteins 0.000 claims description 6
- 239000002157 polynucleotide Substances 0.000 claims description 6
- 102000040430 polynucleotide Human genes 0.000 claims description 6
- 238000003259 recombinant expression Methods 0.000 claims description 6
- 239000003822 epoxy resin Substances 0.000 claims description 5
- 229920000647 polyepoxide Polymers 0.000 claims description 5
- 239000010802 sludge Substances 0.000 claims description 5
- 241000672609 Escherichia coli BL21 Species 0.000 claims description 4
- 238000005406 washing Methods 0.000 claims description 4
- 241000588724 Escherichia coli Species 0.000 claims description 2
- 239000013612 plasmid Substances 0.000 claims description 2
- 230000035484 reaction time Effects 0.000 claims description 2
- 230000035772 mutation Effects 0.000 abstract description 5
- 230000009286 beneficial effect Effects 0.000 abstract description 3
- 238000009776 industrial production Methods 0.000 abstract description 3
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 50
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 50
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 50
- 108010047857 aspartylglycine Proteins 0.000 description 50
- 108010068265 aspartyltyrosine Proteins 0.000 description 50
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 46
- 150000001413 amino acids Chemical group 0.000 description 40
- 108010093096 Immobilized Enzymes Proteins 0.000 description 29
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 25
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 25
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 25
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 25
- WZGZDOXCDLLTHE-SYWGBEHUSA-N Ala-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 WZGZDOXCDLLTHE-SYWGBEHUSA-N 0.000 description 25
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 25
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 25
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 25
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 25
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 25
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 25
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 25
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 25
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 25
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 25
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 25
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 25
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 25
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 25
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 25
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 25
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 25
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 25
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 25
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 25
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 25
- CMPHFUWXKBPNRS-WDSOQIARSA-N His-Val-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CNC=N1 CMPHFUWXKBPNRS-WDSOQIARSA-N 0.000 description 25
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 25
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 25
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 25
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 25
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 25
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 25
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 25
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 25
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 25
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 25
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 25
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 25
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 25
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 25
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 25
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 25
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 25
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 25
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 25
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 25
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 25
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 25
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 25
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 25
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 25
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 25
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 25
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 25
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 25
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 25
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 25
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 25
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 25
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 25
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 25
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 25
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 25
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 25
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 25
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 25
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 25
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 25
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 25
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 25
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 25
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 25
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 25
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 25
- XTOCLOATLKOZAU-JBACZVJFSA-N Tyr-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N XTOCLOATLKOZAU-JBACZVJFSA-N 0.000 description 25
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 25
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 25
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 25
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 25
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 25
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 25
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 25
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 25
- 108010092114 histidylphenylalanine Proteins 0.000 description 25
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 25
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 25
- 108010003700 lysyl aspartic acid Proteins 0.000 description 25
- 108010051242 phenylalanylserine Proteins 0.000 description 25
- 108010004914 prolylarginine Proteins 0.000 description 25
- 108010029020 prolylglycine Proteins 0.000 description 25
- 108010026333 seryl-proline Proteins 0.000 description 25
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 25
- 108010073969 valyllysine Proteins 0.000 description 25
- 108010009962 valyltyrosine Proteins 0.000 description 25
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 22
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 22
- 108010044940 alanylglutamine Proteins 0.000 description 22
- 108010054812 diprotin A Proteins 0.000 description 22
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 21
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 21
- FWOVTJKVUCGVND-UFYCRDLUSA-N Tyr-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FWOVTJKVUCGVND-UFYCRDLUSA-N 0.000 description 21
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 17
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 17
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 17
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 17
- 108010005233 alanylglutamic acid Proteins 0.000 description 17
- 108010057821 leucylproline Proteins 0.000 description 16
- 230000014759 maintenance of location Effects 0.000 description 10
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 8
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 8
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 8
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 8
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 8
- 230000000694 effects Effects 0.000 description 8
- 108090000623 proteins and genes Proteins 0.000 description 8
- 238000004128 high performance liquid chromatography Methods 0.000 description 7
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 6
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 6
- 238000000926 separation method Methods 0.000 description 6
- 238000004587 chromatography analysis Methods 0.000 description 5
- JMMWKPVZQRWMSS-UHFFFAOYSA-N isopropyl acetate Chemical class CC(C)OC(C)=O JMMWKPVZQRWMSS-UHFFFAOYSA-N 0.000 description 5
- 239000012071 phase Substances 0.000 description 5
- 238000005070 sampling Methods 0.000 description 5
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 4
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 4
- YNAVUWVOSKDBBP-UHFFFAOYSA-N Morpholine Natural products C1COCCN1 YNAVUWVOSKDBBP-UHFFFAOYSA-N 0.000 description 4
- PDKILSUYSUGCAO-JBACZVJFSA-N Tyr-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PDKILSUYSUGCAO-JBACZVJFSA-N 0.000 description 4
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 4
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 4
- MENOBBYDZHOWLE-UHFFFAOYSA-N morpholine-2,3-dione Chemical compound O=C1NCCOC1=O MENOBBYDZHOWLE-UHFFFAOYSA-N 0.000 description 4
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 4
- 239000002904 solvent Substances 0.000 description 4
- 238000000967 suction filtration Methods 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 3
- 229920002449 FKM Polymers 0.000 description 3
- 102000004877 Insulin Human genes 0.000 description 3
- 108090001061 Insulin Proteins 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 230000003197 catalytic effect Effects 0.000 description 3
- 238000006555 catalytic reaction Methods 0.000 description 3
- 206010012601 diabetes mellitus Diseases 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 229940125396 insulin Drugs 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 102000004169 proteins and genes Human genes 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 239000002994 raw material Substances 0.000 description 3
- 230000001603 reducing effect Effects 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- TUAXCHGULMWHIO-SECBINFHSA-N (3r)-3-[(2-methylpropan-2-yl)oxycarbonylamino]-4-(2,4,5-trifluorophenyl)butanoic acid Chemical compound CC(C)(C)OC(=O)N[C@@H](CC(O)=O)CC1=CC(F)=C(F)C=C1F TUAXCHGULMWHIO-SECBINFHSA-N 0.000 description 2
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 2
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 2
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- 238000005576 amination reaction Methods 0.000 description 2
- 235000011114 ammonium hydroxide Nutrition 0.000 description 2
- 238000009903 catalytic hydrogenation reaction Methods 0.000 description 2
- 239000012295 chemical reaction liquid Substances 0.000 description 2
- 239000012141 concentrate Substances 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 238000002242 deionisation method Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 239000000706 filtrate Substances 0.000 description 2
- 235000019253 formic acid Nutrition 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 229940011051 isopropyl acetate Drugs 0.000 description 2
- GWYFCOCPABKNJV-UHFFFAOYSA-N isovaleric acid Chemical compound CC(C)CC(O)=O GWYFCOCPABKNJV-UHFFFAOYSA-N 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 208000030159 metabolic disease Diseases 0.000 description 2
- 239000003960 organic solvent Substances 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 125000002023 trifluoromethyl group Chemical group FC(F)(F)* 0.000 description 2
- SPOMEWBVWWDQBC-UHFFFAOYSA-K tripotassium;dihydrogen phosphate;hydrogen phosphate Chemical compound [K+].[K+].[K+].OP(O)([O-])=O.OP([O-])([O-])=O SPOMEWBVWWDQBC-UHFFFAOYSA-K 0.000 description 2
- 208000001072 type 2 diabetes mellitus Diseases 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- DKRDJODCERTCFW-UHFFFAOYSA-N 1,2-dimorpholin-4-ylethane-1,2-dione Chemical class C1COCCN1C(=O)C(=O)N1CCOCC1 DKRDJODCERTCFW-UHFFFAOYSA-N 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical class NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- 206010067484 Adverse reaction Diseases 0.000 description 1
- 241000186063 Arthrobacter Species 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- 238000009010 Bradford assay Methods 0.000 description 1
- 208000024172 Cardiovascular disease Diseases 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- 239000012880 LB liquid culture medium Substances 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- 241001052560 Thallis Species 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 1
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- PBCJIPOGFJYBJE-UHFFFAOYSA-N acetonitrile;hydrate Chemical compound O.CC#N PBCJIPOGFJYBJE-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000006838 adverse reaction Effects 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- VZTDIZULWFCMLS-UHFFFAOYSA-N ammonium formate Chemical compound [NH4+].[O-]C=O VZTDIZULWFCMLS-UHFFFAOYSA-N 0.000 description 1
- 239000008346 aqueous phase Substances 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 201000011529 cardiovascular cancer Diseases 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000004296 chiral HPLC Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010411 cooking Methods 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- HPNMFZURTQLUMO-UHFFFAOYSA-N diethylamine Chemical compound CCNCC HPNMFZURTQLUMO-UHFFFAOYSA-N 0.000 description 1
- 208000016097 disease of metabolism Diseases 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- WJPYOCIWVYDFDT-UHFFFAOYSA-N ethyl 3-oxo-4-(2,4,5-trifluorophenyl)butanoate Chemical compound CCOC(=O)CC(=O)CC1=CC(F)=C(F)C=C1F WJPYOCIWVYDFDT-UHFFFAOYSA-N 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000003925 fat Substances 0.000 description 1
- 239000012065 filter cake Substances 0.000 description 1
- 238000012215 gene cloning Methods 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- JEGUKCSWCFPDGT-UHFFFAOYSA-N h2o hydrate Chemical compound O.O JEGUKCSWCFPDGT-UHFFFAOYSA-N 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 201000001421 hyperglycemia Diseases 0.000 description 1
- 230000003100 immobilizing effect Effects 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 230000003914 insulin secretion Effects 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 229940035736 metformin and pioglitazone Drugs 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000003541 multi-stage reaction Methods 0.000 description 1
- 239000012074 organic phase Substances 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 239000008055 phosphate buffer solution Substances 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000008213 purified water Substances 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/18—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing at least two hetero rings condensed among themselves or condensed with a common carbocyclic ring system, e.g. rifamycin
- C12P17/182—Heterocyclic compounds containing nitrogen atoms as the only ring heteroatoms in the condensed system
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N11/00—Carrier-bound or immobilised enzymes; Carrier-bound or immobilised microbial cells; Preparation thereof
- C12N11/02—Enzymes or microbial cells immobilised on or in an organic carrier
- C12N11/08—Enzymes or microbial cells immobilised on or in an organic carrier the carrier being a synthetic polymer
- C12N11/089—Enzymes or microbial cells immobilised on or in an organic carrier the carrier being a synthetic polymer obtained otherwise than by reactions only involving carbon-to-carbon unsaturated bonds
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1096—Transferases (2.) transferring nitrogenous groups (2.6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/14—Nitrogen or oxygen as hetero atom and at least one other diverse hetero ring atom in the same ring
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/18—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing at least two hetero rings condensed among themselves or condensed with a common carbocyclic ring system, e.g. rifamycin
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12R—INDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
- C12R2001/00—Microorganisms ; Processes using microorganisms
- C12R2001/01—Bacteria or Actinomycetales ; using bacteria or Actinomycetales
- C12R2001/185—Escherichia
- C12R2001/19—Escherichia coli
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
本发明提供了一种固定化转氨酶在制备西他列汀和/或(R)‑3‑氨基‑1‑吗啉‑4‑(2,4,5‑三氟苯基)‑1‑丁酮中的用途;所述固定化转氨酶包括树脂和转氨酶,所述转氨酶的氨基酸序列如SEQ ID NO:5、SEQ ID NO:45、SEQ ID NO:47、SEQ ID NO:49或其变体所示;所述变体为在如SEQ ID NO:5所示氨基酸序列的第241位的R突变为A、H、N、S、L、T、C或M。本发明还提供了一种固定化转氨酶、一种转氨酶及其制备方法和用途。将本发明的转氨酶制备成固定化转氨酶后,将其用于催化酮酰胺底物以生产西他列汀或其中间体时转化率高且立体选择性好、稳定性好、可重复使用率提高、操作更简单,进而降低了生产成本,有利于工业化生产。
Description
技术领域
本发明属于生物技术领域,具体涉及一种转氨酶、固定化转氨酶,以及该固定化转氨酶在制备西他列汀或其中间体中的应用,本发明还涉及一种西他列汀或其中间体的制备方法。
背景技术
糖尿病是由于胰岛素分泌改变,导致胰岛素缺乏及作用减弱,或者胰岛素活性降低,或者在两者共同影响下,出现的代谢性疾病,以高血糖为特征,并同时伴有蛋白质、糖及脂肪的代谢紊乱。糖尿病及其并发症对人类健康的危害程度居心血管疾病、肿瘤之后的第三位,成为危害人类健康的重要疾病。在糖尿病的四种类型中,II型糖尿病占90%以上,多见于30岁以上的中老年人,病因主要是由于机体对胰岛素不敏感。
西他列汀磷酸盐(Sitagliptin phosphate)是2006年FDA批准上市的第一个二肽基酶-IV(DPP-4)抑制剂,用于治疗II型糖尿病。其单用或与二甲双胍、吡格列酮合用都有明显的降血糖作用,且服用安全,耐受性好,不良反应少。
美国专利US8293507公开了Codexis公司用对节杆菌来源的转氨酶进行改造得到的转氨酶催化4-氧代-4-[3-(三氟甲基)-5,6-二氢-[1,2,4]三唑并[4,3-a]吡嗪-7-(8H)-基]-1-(2,4,5-三氟苯基)丁-2-酮得到西他列汀((2R)-4-氧-4-[3-(三氟甲基)-5,6-二氢-[1,2,4]三唑并[4,3-a]吡嗪-7-(8H)-基]-1-(2,4,5-三氟苯基)丁-2-胺),西他列汀进一步磷酸化得到西他列汀磷酸盐。经改造的转氨酶能够在2g/L酮酰胺底物,0.5M异丙胺,22℃,5%DMSO,100μM吡哆醛磷酸(PLP)和20mg/mL转氨酶多肽的条件下,将底物4-氧代-4-[3-(三氟甲基)-5,6-二氢-[1,2,4]三唑并[4,3-a]吡嗪-7-(8H)-基]-1-(2,4,5-三氟苯基)丁-2-酮(在本申请中简称为西他双酮)转化成西他列汀到HPLC-UV在210nm可检测的水平。其中效果最好的突变体(SEQ ID NO:110)的催化转化率可达90-95%。
US9617573继续对US8293507中SEQ ID NO:110进行改造,所获突变体能够在50g/L酮酰胺底物,1.5M异丙胺(isopropylamine),55℃,50%DMSO(v/v),1mM吡哆醛磷酸的条件下,以至少1.2倍于SEQ ID NO:110的活性将4-氧代-4-[3-(三氟甲基)-5,6-二氢-[1,2,4]三唑并[4,3-a]吡嗪-7-(8H)-基]-1-(2,4,5-三氟苯基)丁-2-酮转化成西他列汀。其中效果最好的突变体(SEQ ID NO:130)的催化转化率可达90-95%。
但是这些转氨酶突变体在100%有机溶剂中都是不稳定的,因此将液体酶进行固定,得到固定化酶,以提高转氨酶在有机溶剂中的稳定性。如US9587229公开了将US8293507中的SEQ ID NO:110固定在SEPABEADS EXE120树脂上,结果显示SEQ ID NO:110的SEPABEADS EXE120固定化酶在水饱和的IPAc(乙酸异丙酯)溶剂中,能催化100g/L的4-氧代-4-[3-(三氟甲基)-5,6-二氢-[1,2,4]三唑并[4,3-a]吡嗪-7-(8H)-基]-1-(2,4,5-三氟苯基)丁-2-酮的底物至西他列汀,ee值99.9%以上。但是US9587229中并没有研究固定化酶的可重复使用率(套用批次数),且反应溶剂为水饱和的IPAc(乙酸异丙酯)溶剂,采用此溶剂用于生产时,需要事先将IPAc进行水饱和,操作更繁琐,如果用于多批次套用反应,其反应体系中含水量也不好控制,影响固定化酶的水活度,使固定化酶容易失活变性。
WO2019011236A1报道了1-吗啉-4-(2,4,5-三氟苯基)-1,3-丁二酮(本发明中又简称为吗啉双酮)可以经转氨酶催化制得(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮,再经多步反应制得西他列汀,但是反应中采用液体酶,含一定量的DMSO的水溶剂体系,底物浓度不高,液体酶不能重复利用,应用于生产时成本较高、效率较低。
发明内容
本发明所要解决的技术问题是为了克服现有技术中将转氨酶制备成固定化酶时用于反应体系时稳定性仍然较差进而用于催化酮酰胺底物以生产西他列汀或其中间体时转化率不高、不能重复利用等缺陷,提供了一种转氨酶、固定化转氨酶及其用于制备西他列汀或其中间体的用途。将本发明的转氨酶制备成固定化转氨酶后,将其用于催化酮酰胺底物以生产西他列汀或其中间体时转化率高且立体选择性好、稳定性好、可重复使用率提高、操作更简单,进而降低了生产成本,有利于工业化生产。
本发明人对现有技术中的转氨酶进行了大量研究,发现进行某些具体位点发生突变时,所得的转氨酶进行固定化后,将其用于催化酮酰胺底物时稳定性更高且转化率更高,将其用于西他列汀或其中间体的生产时成本更低。
为了解决上述技术问题,本发明第一方面提供了一种固定化转氨酶在制备西他列汀和/或(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮中的用途;
其中,所述固定化转氨酶包括树脂和转氨酶,所述转氨酶的氨基酸序列如SEQ IDNO:5、SEQ ID NO:45、SEQ ID NO:47、SEQ ID NO:49或其变体所示;所述变体为在如SEQ IDNO:5所示氨基酸序列的第241位的R突变为A、H、N、S、L、T、C或M。
较佳地,所述变体的氨基酸序列如SEQ ID NO:11、SEQ ID NO:15、SEQ ID NO:21、SEQ ID NO:25、SEQ ID NO:37、SEQ ID NO:39、SEQ ID NO:41或SEQ ID NO:43所示。更佳地,所述转氨酶的核苷酸序列优选如SEQ ID NO:6、SEQ ID NO:12、SEQ ID NO:16、SEQ ID NO:22、SEQ ID NO:26、SEQ ID NO:38、SEQ ID NO:40、SEQ ID NO:42、SEQ ID NO:44、SEQ IDNO:46、SEQ ID NO:48或SEQ ID NO:50所示;
较佳地,所述的转氨酶与所述树脂通过共价键合作用连接。
较佳地,所述固定化转氨酶中所述树脂上挂载的转氨酶浓度为150~300mg/g(即,每克所述树脂上挂载的转氨酶的质量为150~300mg),例如250mg/g。
较佳地,所述用途包括在氨基供体存在时,在反应溶剂中用所述的固定化转氨酶催化酮酰胺底物得所述西他列汀和/或(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮的步骤。
更佳地,所述的反应溶剂为异丙醇水溶液,优选所述异丙醇水溶液中水的体积含量(水占整个溶液的体积比)为2%~20%,更优选所述制备的反应体系中还包括转氨酶的辅助因子例如吡哆醛磷酸,其占整个反应体系的浓度优选为0.5~5mg/mL。
更佳地,所述酮酰胺底物为4-氧代-4-[3-(三氟甲基)-5,6-二氢-[1,2,4]三唑并[4,3-a]吡嗪-7-(8H)-基]-1-(2,4,5-三氟苯基)丁-2-酮和/或1-吗啉-4-(2,4,5-三氟苯基)-1,3-丁二酮。
本发明中,所述的4-氧代-4-[3-(三氟甲基)-5,6-二氢-[1,2,4]三唑并[4,3-a]吡嗪-7-(8H)-基]-1-(2,4,5-三氟苯基)丁-2-酮(本发明中又简称为西他双酮),具体结构式如下:
本发明中,所述的1-吗啉-4-(2,4,5-三氟苯基)-1,3-丁二酮(本发明中又简称为吗啉双酮),具体结构式如下:
更佳地,所述氨基供体为异丙胺。
更佳地,所述氨基供体与所述酮酰胺底物的摩尔比为1:1~5:1。
更佳地,所述酮酰胺底物占整个反应体系的浓度为20~200g/L。
更佳地,所述固定化转氨酶与所述酮酰胺底物的质量比为1:1~6:1,例如3:1,2.5:1或2:1。
更佳地,所述反应的温度为30-60℃,优选45℃。
为了解决上述技术问题,本发明第二方面提供了一种固定化转氨酶,其包括树脂和转氨酶,所述转氨酶的氨基酸序列如SEQ ID NO:5、SEQ ID NO:45、SEQ ID NO:47、SEQ IDNO:49或其变体所示;所述变体为在如SEQ ID NO:5所示氨基酸序列的第241位的R突变为A、H、N、S、L、T、C或M。
较佳地,所述变体的氨基酸序列如SEQ ID NO:11、SEQ ID NO:15、SEQ ID NO:21、SEQ ID NO:25、SEQ ID NO:37、SEQ ID NO:39、SEQ ID NO:41或SEQ ID NO:43所示。更佳地,所述转氨酶的核苷酸序列优选如SEQ ID NO:6、SEQ ID NO:12、SEQ ID NO:16、SEQ ID NO:22、SEQ ID NO:26、SEQ ID NO:38、SEQ ID NO:40、SEQ ID NO:42、SEQ ID NO:44、SEQ IDNO:46、SEQ ID NO:48或SEQ ID NO:50所示。
较佳地,所述的转氨酶与所述树脂通过共价键合作用连接。
较佳地,所述固定化转氨酶中所述树脂上挂载的转氨酶浓度为150~300mg/g(即,每克所述树脂上挂载的转氨酶的质量为150~300mg),例如250mg/g。为了解决上述技术问题,本发明第三方面提供了一种制备固定化转氨酶的方法,其包括:
1)使转氨酶的溶液和树脂接触以形成固定化转氨酶,所述转氨酶的氨基酸序列如SEQ ID NO:5、SEQ ID NO:45、SEQ ID NO:47、SEQ ID NO:49或其变体所示;所述变体为在如SEQ ID NO:5所示氨基酸序列的第241位的R突变为A、H、N、S、L、T、C或M;
2)过滤和冲洗所述固定化转氨酶。
较佳地,所述变体的氨基酸序列如SEQ ID NO:11、SEQ ID NO:15、SEQ ID NO:21、SEQ ID NO:25、SEQ ID NO:37、SEQ ID NO:39、SEQ ID NO:41或SEQ ID NO:43所示;所述转氨酶的核苷酸序列优选如SEQ ID NO:6、SEQ ID NO:12、SEQ ID NO:16、SEQ ID NO:22、SEQID NO:26、SEQ ID NO:38、SEQ ID NO:40、SEQ ID NO:42、SEQ ID NO:44、SEQ ID NO:46、SEQID NO:48或SEQ ID NO:50所示。
较佳地,所述的转氨酶与所述树脂通过共价键合作用连接。
较佳地,所述转氨酶以转氨酶菌泥的形式存在,所述转氨酶菌泥与所述树脂的质量比为3:1~1:2,例如5:4。
较佳地,所述反应的温度为10~40℃,优选20~25℃。
较佳地,所述反应的时间为10~30h,优选20~25小时。
较佳地,所述反应在搅拌下进行,所述搅拌的速率优选为50~300rpm,例如150或200rpm。
在某一较佳实施例中,所述固定化转氨酶的制备方法如下:取5g转氨酶(例如以菌泥的形式存在),加磷酸氢二钾-磷酸二氢钾缓冲液(例如体积为50mL,浓度为100mM)混匀高压均质破碎细胞、离心收集上清液酶液,加入10g K2HPO4·3H2O、1g KH2PO4和0.01g PLP,搅拌溶解,加入4gHFA树脂(采购于苏州汇通色谱分离纯化有限公司)进行固定,20~25℃、200rpm摇床固定20~25h,抽滤,去离子洗涤,抽滤得到固定化酶。
为了解决上述技术问题,本发明第四方面提供了一种制备西他列汀和/或(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮的方法,其包括在氨基供体存在时,在反应溶剂中用如本发明第二方面所述的固定化转氨酶催化酮酰胺底物得西他列汀和/或(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮的步骤。
较佳地,所述酮酰胺底物为4-氧代-4-[3-(三氟甲基)-5,6-二氢-[1,2,4]三唑并[4,3-a]吡嗪-7-(8H)-基]-1-(2,4,5-三氟苯基)丁-2-酮和/或1-吗啉-4-(2,4,5-三氟苯基)-1,3-丁二酮。
较佳地,所述反应溶剂为异丙醇水溶液。
较佳地,所述氨基供体为异丙胺。
较佳地,所述氨基供体与所述酮酰胺底物的摩尔比为1:1~5:1。
较佳地,当所述反应溶剂为异丙醇水溶液时,水的体积含量(水占整个溶液的体积比)为2%~20%。
较佳地,所述酮酰胺底物占整个反应体系的浓度为20g/L~200g/L。
较佳地,所述固定化转氨酶与底物的质量比为1:1~6:1,例如3:1,2.5:1或2:1。
较佳地,所述制备的反应体系中还包括转氨酶的辅助因子例如吡哆醛磷酸,其占整个反应体系的浓度优选为0.5~5mg/mL。
较佳地,所述反应的温度为30-60℃,优选45℃。
为了解决上述技术问题,本发明第五方面提供了一种转氨酶,所述转氨酶的氨基酸序列如SEQ ID NO:5、SEQ ID NO:45、SEQ ID NO:47、SEQ ID NO:49或其变体所示;所述变体为在如SEQ ID NO:5所示的氨基酸序列的第241位由R突变为A、H、N、S、L、T、C或M。
较佳地,所述变体的氨基酸序列如SEQ ID NO:11、SEQ ID NO:15、SEQ ID NO:21、SEQ ID NO:25、SEQ ID NO:37、SEQ ID NO:39、SEQ ID NO:41或SEQ ID NO:43所示;
更佳地,所述转氨酶的核苷酸序列如SEQ ID NO:6、SEQ ID NO:12、SEQ ID NO:16、SEQ ID NO:22、SEQ ID NO:26、SEQ ID NO:38、SEQ ID NO:40、SEQ ID NO:42、SEQ ID NO:44、SEQ ID NO:46、SEQ ID NO:48或SEQ ID NO:50所示。
为了解决上述技术问题,本发明第六方面提供了一种多核苷酸,其编码如本发明第五方面所述的转氨酶。
为了解决上述技术问题,本发明第七方面提供了一种重组表达载体,其包括如本发明第六方面所述的多核苷酸。
较佳地,所述重组表达载体的骨架为质粒pET21a。
为了解决上述技术问题,本发明第八方面提供了一种转化体,其为在宿主中导入如本发明第六方面所述的多核苷酸或者如本发明第七方面所述的重组表达载体。
较佳地,所述宿主为大肠杆菌;优选大肠杆菌BL21。
为了解决上述技术问题,本发明第九方面提供了一种如本发明第五方面所述的转氨酶在制备西他列汀和/或(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮中的用途。
较佳地,所述用途包括在氨基供体存在时,在反应溶剂中用转氨酶催化酮酰胺底物得所述西他列汀和/或(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮的步骤。
更佳地,所述的反应溶剂为异丙醇水溶液,优选所述异丙醇水溶液中水的体积含量(水占整个溶液的体积比)为2%~20%,更优选所述制备的反应体系中还包括转氨酶的辅助因子例如吡哆醛磷酸,其占整个反应体系的浓度优选为0.5~5mg/mL。
更佳地,所述酮酰胺底物为4-氧代-4-[3-(三氟甲基)-5,6-二氢-[1,2,4]三唑并[4,3-a]吡嗪-7-(8H)-基]-1-(2,4,5-三氟苯基)丁-2-酮和/或1-吗啉-4-(2,4,5-三氟苯基)-1,3-丁二酮。
更佳地,所述氨基供体为异丙胺。
更佳地,所述氨基供体与所述酮酰胺底物的摩尔比为1:1~5:1。
更佳地,所述酮酰胺底物占整个反应体系的浓度为20~200g/L。
更佳地,所述转氨酶与所述酮酰胺底物的质量比为1:1~6:1,例如3:1,2.5:1或2:1。
更佳地,所述转氨酶以固定化的转氨酶的形式存在。
更佳地,所述制备的反应体系中还包括转氨酶的辅助因子例如吡哆醛磷酸,其占整个反应体系的浓度优选为0.5~5mg/mL。
更佳地,所述反应的温度为30-60℃,优选45℃。
本发明中,所述异丙醇水溶液中异丙醇的量能让底物全部溶解即可。异丙醇水溶液中水的体积含量(水占整个溶液的体积比)可以为2%~20%,水太少会让固定化酶失活,水太多则底物溶解不彻底。
本发明某一方面还提供了一种西他列汀磷酸盐的制备方法,所述制备方法包括下述步骤:
(1)按照如本发明第四方面所述的制备方法,制得西他列汀和/或(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮;
(2)将步骤(1)制得的(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮进行反应,得到西他列汀磷酸盐。
较佳地,所述西他列汀磷酸盐为西他列汀磷酸盐一水合物。
本发明某一方面还提供了一种酶制剂,其包括如本发明第二方面所述的固定化转氨酶或如本发明第五方面所述的转氨酶;
较佳地,所述酶制剂还包括转氨酶的辅助因子例如吡哆醛磷酸。
本发明中,所述的酶制剂通常可以是从培养物中获得的含转氨酶的转化体宿主细胞或其培养液得到的酶制剂,或者用其加工后得到的制品;其中,所述制品是指由转化体宿主细胞得到的提取物,通过对提取物中的转氨酶进行分离或纯化得到的分离产品,或通过固定化转化体细胞及其提取物或提取物的分离产品而得到的固定化制品。
在符合本领域常识的基础上,上述各优选条件,可任意组合,即得本发明各较佳实例。
本发明所用试剂和原料均市售可得。
本发明的积极进步效果在于:将本发明的转氨酶制备成固定化转氨酶后,将其用于催化酮酰胺底物以生产西他列汀中间体时转化率高且立体选择性好、稳定性好、可重复使用率提高、操作更简单,进而降低了生产成本,有利于工业化生产。
具体实施方式
下面通过实施例的方式进一步说明本发明,但并不因此将本发明限制在所述的实施例范围之中。下列实施例中未注明具体条件的实验方法,按照常规方法和条件,或按照商品说明书选择。
本发明中的实验方法如无特别说明均为常规方法,基因克隆操作具体可参加J.萨姆布鲁克等编的《分子克隆实验指南》。
本发明中的氨基酸简写符号如无特殊说明均为本领域常规,具体简写符号对应的氨基酸如表1所示。
表1
所述氨基酸对应的密码子也为本领域常规,具体氨基酸与密码子的对应关系如表2所示。
表2
Pet21a购买自Novagen公司;NdeI酶、HindIII酶购买自Thermo Fisher公司,BL21感受态细胞购买自北京鼎国昌盛生物技术有限责任公司。
转化率HPLC方法:色谱柱:C18柱4.6×250mm,5μm;检测器:UV268nm;柱温:40℃;流速:0.8mL/min;进样量:20μL;流动相A:水∶乙腈∶甲酸∶氨水=950∶50∶0.5∶0.5,pH应在3.60~3.80之间,如果pH达不到,10%的氨水或者10%的甲酸调整pH到3.70;也可使用甲酸铵调节pH到3.70;流动相B:水∶乙腈=20∶80;梯度洗脱:100%A(0.01min),60%A+40%B(20min),60%A+40%B(40min),100%A(50min),100%A(60min)。
吗啉双酮对照品保留时间:27.820min;(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮对照品保留时间:14.856min;
西他双酮对照品保留时间:34.715min;
西他列汀对照品(采购于北京盈祥科技有限公司)保留时间:17.705min。
其中,吗啉双酮底物原料和对照品(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮由本公司自行合成,合成方法参考WO2019011236A1;3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮消旋体由实验室自行合成,由吗啉双酮进行氨基化和催化加氢制得。对照品(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮构型的确定:(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮可进一步反应制得(3R)-N-叔丁氧羰基-3-氨基-4-(2,4,5-三氟苯基)-丁酸(参考WO2019011236A1),以采购于安徽海康药业有限责任公司(3R)-N-叔丁氧羰基-3-氨基-4-(2,4,5-三氟苯基)-丁酸标准品做对照,可以确定为R构型。
构型确定的HPLC方法:
色谱柱:Daicel Chiralpak AD-H(4.6mm*250mm,5μm);流动相:正己烷:异丙醇=90:10;检测波长:210nm;流速:1.0ml/min;进样体积:10μl;柱温:25℃;运行时间:40min。
西他双酮底物原料由本实验室自行合成,合成方法参考CN100430397C,西他列汀消旋体由本实验室自行合成,由西他双酮进行氨基化和催化加氢制得。
手性HPLC方法检测产物ee值方法如下:
色谱柱:Daicel ChiralpakAD-H柱4.6mm×250mm,5μm;流动相:正己烷:异丙醇:二乙胺=40:60:0.1;检测器:UV268nm;柱温:25℃;流速:0.8mL/min;进样量:10μL。
3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮消旋体对照品保留时间:10.290min和28.087min;
(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮对照品保留时间:28.093min。
西他列汀消旋体对照品保留时间:14.172min和17.702min;
西他列汀对照品保留时间:17.665min。
实施例1一种转氨酶突变体酶液的制备
根据已报道的Enz.1(SEQ ID NO:1,即US8293507中的SEQ ID NO:110)、Enz.2(SEQID NO:3,即US9617573中的SEQ ID NO:130)的编码基因,全基因进行合成,酶切位点NdeI、HindIII,载体pET21a。基因合成公司为苏州金唯智生物科技有限公司(苏州工业园区星湖街218号生物纳米科技园C3楼)。合成的转氨酶基因转化至宿主大肠杆菌BL21感受态细胞,得到含有转氨酶Enz.1、Enz.2基因的工程菌株。
同样地,将表3中工程化得到的转氨酶Enz.3~Enz.25的基因,酶切位点NdeI、HindIII,载体pET21a。合成的转氨酶基因转化至宿主大肠杆菌BL21感受态细胞,得到表3含有转氨酶基因的工程菌Enz.3~Enz.25。
表3
将含有转氨酶基因的工程菌在经平皿划线活化后,挑单菌落接种至含50μg/ml卡那霉素的5ml LB液体培养基中,37℃震荡培养12h。按2%(v/v)接种量转接至150ml同样含50μg/ml卡那霉素的新鲜LB液体培养基中,37℃震荡至OD600达到0.8左右时,加入IPTG至其终浓度为0.5mM,18℃诱导培养16h。培养结束后,将培养液10,000rpm离心10min,弃上清液,收集菌体,置于-80℃超低温冰箱中保存,待用。
实施例2固定化转氨酶突变体的制备
取5g转氨酶突变体菌泥,加50mL 100mM磷酸氢二钾-磷酸二氢钾缓冲液混匀高压均质破碎细胞、离心收集上清液酶液,加入10g K2HPO4·3H2O、1g KH2PO4和0.01g PLP,搅拌溶解,加入4gHFA树脂(采购于苏州汇通色谱分离纯化有限公司)进行固定,20~25℃、200rpm摇床固定20~25h,抽滤,去离子洗涤,抽滤得到固定化酶。
实施例3各转氨酶突变体的固定化酶用于催化吗啉双酮
三角瓶中,加入10mL异丙醇,0.5g吗啉双酮,0.8mL 16mg/mL的PLP水溶液,0.25mL异丙胺和1g实施例2中制得的固定化酶,45℃、200rpm摇床反应,反应4小时和24小时取样检测转化率。检测结果如下表4所示。
其中,HPLC检测转化率的方法见上述的转化率HPLC方法部分。经过检测发现,各底物和产物的保留时间与各自的对照品一致。
表4
结果显示Enz.3、Enz.6、Enz.8、Enz.11、Enz.13、Enz.19、Enz.20、Enz.21、Enz.22、Enz.23、Enz.24、Enz.25的转化率都优于Enz.2和Enz.1,其中Enz.6、Enz.11、Enz.24的24小时的转化率均可达到80%以上。
三角瓶中,加入25mL异丙醇,2.5g吗啉双酮,2mL 16mg/mL的PLP水溶液,1.24mL异丙胺和7.5g实施例2中制得的固定化酶,45℃、200rpm摇床反应,反应24小时取样检测转化率。过滤反应液得固定化酶,继续加入25mL异丙醇,2.5g吗啉双酮,2mL16mg/mL的PLP水溶液,1.24mL异丙胺,45℃、200rpm摇床反应,反应24小时取样检测转化率。按照上述方法固定化酶重复套用,结果显示固定化酶套用10批反应,转化率仍然保持在90%以上,ee值>99.9%,固定化酶比较稳定。而Enz.1和Enz.2在同样条件下套用3次的转化率都在50%以下,不适合用来催化吗啉双酮。具体套用结果见下表5。
表5
实施例5各转氨酶突变体的固定化酶用于催化西他双酮
三角瓶中,加入10mL异丙醇,0.5g西他双酮,0.8mL 16mg/mL的PLP水溶液,0.23mL异丙胺和1g实施例2中制得的固定化酶,45℃、200rpm摇床反应,反应4小时和24小时取样HPLC检测转化率,检测结果如下表6所示。
其中,HPLC检测转化率的方法见上述的转化率HPLC方法部分。经检测,各底物和产物的保留时间与各自的对照品一致。
表6
结果显示,Enz.3、Enz.6、Enz.11、Enz.13、Enz.19、Enz.20、Enz.21、Enz.22的转化率都优于Enz.1和Enz.2,其中Enz.11的效果最优。
将这些较好的固定化酶进行蛋白浓度测定,检测方法:采用Bradford法检测固定化前后酶液在595nm处吸光值变化,进而计算出蛋白浓度,测定结果如下表7所示:
表7
三角瓶中,加入25mL异丙醇,2.5g西他双酮,2mL 16mg/mL的PLP水溶液,1.135mL异丙胺和6.25g实施例2中制得的固定化酶,45℃、200rpm摇床反应,反应24小时取样检测转化率。过滤反应液得固定化酶,继续加入25mL异丙醇,2.5g西他双酮,2mL 16mg/mL的PLP水溶液,1.135mL异丙胺,45℃、200rpm摇床反应,反应24小时取样检测转化率。按照上述方法固定化酶重复套用,结果显示固定化酶Enz.11-HFA套用10批反应,转化率仍然保持在95%以上,ee值>99.9%,固定化酶比较稳定。具体套用结果见下表8。
表8
*:与Enz.1的转化率比较;#:与Enz.2的转化率比较
以上结果说明,本发明的转氨酶突变在固定化酶催化西他双酮的实验中,显示了比现有技术Enz.1和Enz.2用于固定化酶催化时转化率更优异的效果,且该差异有统计学意义(p值均小于0.001,其中p值的计算是用T-Test双尾实验)。
实施例7西他列汀磷酸盐一水合物的制备
待实施例6反应结束后,过滤,得到固定化酶和滤液,滤液60℃下浓缩拉干得到浓缩物。加入100mL二氯甲烷溶解,加入100mL纯化水,搅拌,用30%浓盐酸调pH至2-3之间,静置分层。水相加入100mL二氯甲烷,搅拌,加入30%氢氧化钠溶液调pH至11,静置分层。水相再次加入100mL二氯甲烷,搅拌萃取,静置分层。合并两次碱萃的有机相。60℃浓缩,浓缩物加入120mL异丙醇,搅拌溶解,加入10.6g的85%磷酸,升温至75℃,搅拌溶清,缓慢降温使西他列汀磷酸盐结晶析出,5℃保温2h,过滤,滤饼放于60℃烘干干燥,得到西他列汀磷酸盐一水合物。
SEQUENCE LISTING
<110> 弈柯莱生物科技(上海)股份有限公司
<120> 转氨酶、固定化转氨酶及用于制备西他列汀的用途
<130> P20014045C
<160> 50
<170> PatentIn version 3.5
<210> 1
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.1氨基酸
<400> 1
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Glu Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Ser Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Met Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Ser Ala Cys Pro Tyr Gln Trp Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Val Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Cys Asp Asn Leu Leu Ala Glu Gly Pro Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Ala Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 2
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.1核苷酸
<400> 2
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcgaaggtg ctttcgttcc gccgtctgaa gctcgtatct ctatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctatggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgtct gcttgcccgt accagtggat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg gttgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg actgcgacaa cctgctggct 660
gaaggtccgg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
gctgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 3
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.2氨基酸
<400> 3
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Met Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Val Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Pro Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Ala Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 4
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.2核苷酸
<400> 4
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctatggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg gttgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtccgg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
gctgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 5
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.3氨基酸
<400> 5
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Arg Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 6
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.3核苷酸
<400> 6
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
cgtgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 7
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.4氨基酸
<400> 7
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Glu Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 8
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.4核苷酸
<400> 8
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
gaagctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 9
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.5氨基酸
<400> 9
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Trp Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 10
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.5核苷酸
<400> 10
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
tgggctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 11
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.6氨基酸
<400> 11
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Ala Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 12
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.6核苷酸
<400> 12
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
gctgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 13
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.7氨基酸
<400> 13
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Lys Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 14
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.7核苷酸
<400> 14
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
aaagctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 15
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.8氨基酸
<400> 15
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
His Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 16
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.8核苷酸
<400> 16
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
cacgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 17
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.9氨基酸
<400> 17
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Tyr Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 18
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.9核苷酸
<400> 18
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
tacgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 19
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.10氨基酸
<400> 19
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Gln Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 20
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.10核苷酸
<400> 20
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
caggctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 21
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.11氨基酸
<400> 21
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Asn Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 22
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.11核苷酸
<400> 22
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
aacgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 23
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.12氨基酸
<400> 23
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Pro Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 24
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.12核苷酸
<400> 24
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
ccggctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 25
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.13氨基酸
<400> 25
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Ser Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 26
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.13核苷酸
<400> 26
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
tctgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 27
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.14氨基酸
<400> 27
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Asp Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 28
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.14核苷酸
<400> 28
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
gacgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 29
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.15氨基酸
<400> 29
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Gly Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 30
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.15核苷酸
<400> 30
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
ggtgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 31
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.16氨基酸
<400> 31
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Ile Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 32
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.16核苷酸
<400> 32
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
atcgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 33
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.17氨基酸
<400> 33
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Phe Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 34
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.17核苷酸
<400> 34
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
ttcgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 35
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.18氨基酸
<400> 35
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Val Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 36
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.18核苷酸
<400> 36
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
gttgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 37
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.19氨基酸
<400> 37
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Leu Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 38
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.19核苷酸
<400> 38
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
ctggctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 39
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.20氨基酸
<400> 39
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Thr Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 40
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.20核苷酸
<400> 40
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
accgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 41
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.21氨基酸
<400> 41
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Cys Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 42
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.21核苷酸
<400> 42
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
tgcgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 43
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.22氨基酸
<400> 43
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Gly Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Phe Ala Ser Pro Tyr Leu Gln Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Gly Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Met Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 44
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.22核苷酸
<400> 44
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcggtggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgttc gcttctccgt acctgcagat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg acggtgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
atggctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 45
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.23氨基酸
<400> 45
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Glu Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Ser Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Phe Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Ser Ala Cys Pro Tyr Gln Trp Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Val Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Cys Asp Asn Leu Leu Ala Glu Gly Cys Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Ala Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 46
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.23核苷酸
<400> 46
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcgaaggtg ctttcgttcc gccgtctgaa gctcgtatct ctatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctttcgtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgtct gcttgcccgt accagtggat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg gttgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg actgcgacaa cctgctggct 660
gaaggttgcg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
gctgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 47
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.24氨基酸
<400> 47
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Glu Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Ser Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Ser Ala Cys Pro Tyr Gln Trp Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Val Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Cys Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Asn Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 48
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.24核苷酸
<400> 48
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcgaaggtg ctttcgttcc gccgtctgaa gctcgtatct ctatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgtct gcttgcccgt accagtggat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg gttgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg actgcgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
aacgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
<210> 49
<211> 330
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.25氨基酸
<400> 49
Met Ala Phe Ser Ala Asp Thr Pro Glu Ile Val Tyr Thr His Asp Thr
1 5 10 15
Gly Leu Asp Tyr Ile Thr Tyr Ser Asp Tyr Glu Leu Asp Pro Ala Asn
20 25 30
Pro Leu Ala Gly Gly Ala Ala Trp Ile Glu Gly Ala Phe Val Pro Pro
35 40 45
Ser Glu Ala Arg Ile Pro Ile Phe Asp Gln Gly Phe Tyr Thr Ser Asp
50 55 60
Ala Thr Tyr Thr Thr Phe His Val Trp Asn Gly Asn Ala Phe Arg Leu
65 70 75 80
Gly Asp His Ile Glu Arg Leu Phe Ser Asn Ala Glu Ser Ile Arg Leu
85 90 95
Ile Pro Pro Leu Thr Gln Asp Glu Val Lys Glu Ile Ala Leu Glu Leu
100 105 110
Val Ala Lys Thr Glu Leu Arg Glu Ala Gln Val Thr Val Thr Ile Thr
115 120 125
Arg Gly Tyr Ser Ser Thr Pro Phe Glu Arg Asp Ile Thr Lys His Arg
130 135 140
Pro Gln Val Tyr Met Ser Ala Cys Pro Tyr Gln Trp Ile Val Pro Phe
145 150 155 160
Asp Arg Ile Arg Asp Gly Val His Leu Met Ile Ala Gln Ser Val Arg
165 170 175
Arg Thr Pro Arg Ser Ser Ile Asp Pro Gln Val Lys Asn Phe Gln Trp
180 185 190
Gly Asp Leu Ile Arg Ala Ile Gln Glu Thr His Asp Arg Gly Phe Glu
195 200 205
Leu Pro Leu Leu Leu Asp Cys Asp Asn Leu Leu Ala Glu Gly Thr Gly
210 215 220
Phe Asn Val Val Val Ile Lys Asp Gly Val Val Arg Ser Pro Gly Arg
225 230 235 240
Arg Ala Leu Pro Gly Ile Thr Arg Lys Thr Val Leu Glu Ile Ala Glu
245 250 255
Ser Leu Gly His Glu Ala Ile Leu Ala Asp Ile Thr Pro Ala Glu Leu
260 265 270
Tyr Asp Ala Asp Glu Val Leu Gly Cys Ser Thr Gly Gly Gly Val Trp
275 280 285
Pro Phe Val Ser Val Asp Gly Asn Ser Ile Ser Asp Gly Val Pro Gly
290 295 300
Pro Val Thr Gln Ser Ile Ile Arg Arg Tyr Trp Glu Leu Asn Val Glu
305 310 315 320
Pro Ser Ser Leu Leu Thr Pro Val Gln Tyr
325 330
<210> 50
<211> 990
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.25核苷酸
<400> 50
atggctttct ctgctgacac cccggaaatc gtttacaccc acgacaccgg tctggactac 60
atcacctact ctgactacga actggacccg gctaacccgc tggctggtgg tgctgcttgg 120
atcgaaggtg ctttcgttcc gccgtctgaa gctcgtatcc cgatcttcga ccagggtttc 180
tacacctctg acgctaccta caccaccttc cacgtttgga acggtaacgc tttccgtctg 240
ggtgaccaca tcgaacgtct gttctctaac gctgaatcta tccgtctgat cccgccgctg 300
acccaggacg aagttaaaga aatcgctctg gaactggttg ctaaaaccga actgcgtgaa 360
gctcaggtta ccgttaccat cacccgtggt tactcttcta ccccgttcga acgtgacatc 420
accaaacacc gtccgcaggt ttacatgtct gcttgcccgt accagtggat cgttccgttc 480
gaccgtatcc gtgacggtgt tcacctgatg atcgctcagt ctgttcgtcg taccccgcgt 540
tcttctatcg acccgcaggt taaaaacttc cagtggggtg acctgatccg tgctatccag 600
gaaacccacg accgtggttt cgaactgccg ctgctgctgg actgcgacaa cctgctggct 660
gaaggtaccg gtttcaacgt tgttgttatc aaagacggtg ttgttcgttc tccgggtcgt 720
cgtgctctgc cgggtatcac ccgtaaaacc gttctggaaa tcgctgaatc tctgggtcac 780
gaagctatcc tggctgacat caccccggct gaactgtacg acgctgacga agttctgggt 840
tgctctaccg gtggtggtgt ttggccgttc gtttctgttg acggtaactc tatctctgac 900
ggtgttccgg gtccggttac ccagtctatc atccgtcgtt actgggaact gaacgttgaa 960
ccgtcttctc tgctgacccc ggttcagtac 990
Claims (11)
1.一种固定化转氨酶在制备西他列汀和/或(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮中的用途;
其中,所述固定化转氨酶包括树脂和转氨酶,所述转氨酶的氨基酸序列如SEQ ID NO:5、SEQ ID NO:45、SEQ ID NO:47、SEQ ID NO:49或其变体所示;所述变体为在如SEQ ID NO:5所示氨基酸序列的第241位的R突变为A、H、N、S、L、T、C或M;
较佳地:
所述变体的氨基酸序列如SEQ ID NO:11、SEQ ID NO:15、SEQ ID NO:21、SEQ ID NO:25、SEQ ID NO:37、SEQ ID NO:39、SEQ ID NO:41或SEQ ID NO:43所示;所述转氨酶的核苷酸序列优选如SEQ ID NO:6、SEQ ID NO:12、SEQ ID NO:16、SEQ ID NO:22、SEQ ID NO:26、SEQ ID NO:38、SEQ ID NO:40、SEQ ID NO:42、SEQ ID NO:44、SEQ ID NO:46、SEQ ID NO:48或SEQ ID NO:50所示;和/或,所述的转氨酶与所述树脂通过共价键合作用连接;和/或,所述的树脂为环氧树脂,优选为和/或,所述的制备中使用的反应溶剂为异丙醇水溶液,优选所述异丙醇水溶液中水的体积含量为2%~20%;和/或,所述制备的反应体系还包括转氨酶的辅助因子例如吡哆醛磷酸,其浓度优选为0.5~5mg/mL;和/或,所述固定化转氨酶中所述树脂上挂载的转氨酶浓度为150~300mg/g,例如250mg/g。
2.一种固定化转氨酶,其特征在于,其包括树脂和转氨酶,所述转氨酶的氨基酸序列如SEQ ID NO:5、SEQ ID NO:45、SEQ ID NO:47、SEQ ID NO:49或其变体所示;所述变体为在如SEQ ID NO:5所示氨基酸序列的第241位的R突变为A、H、N、S、L、T、C或M;
较佳地:
所述变体的氨基酸序列如SEQ ID NO:11、SEQ ID NO:15、SEQ ID NO:21、SEQ ID NO:25、SEQ ID NO:37、SEQ ID NO:39、SEQ ID NO:41或SEQ ID NO:43所示;所述转氨酶的核苷酸序列优选如SEQ ID NO:6、SEQ ID NO:12、SEQ ID NO:16、SEQ ID NO:22、SEQ ID NO:26、SEQ ID NO:38、SEQ ID NO:40、SEQ ID NO:42、SEQ ID NO:44、SEQ ID NO:46、SEQ ID NO:48或SEQ ID NO:50所示;和/或,所述的转氨酶与所述树脂通过共价键合作用连接;和/或,所述的树脂为环氧树脂,优选为和/或,所述固定化转氨酶中所述树脂上挂载的转氨酶浓度为150~300mg/g,例如250mg/g。
3.一种制备如权利要求2所述的固定化转氨酶的方法,其包括:
1)使所述转氨酶的溶液和所述树脂接触反应以形成固定化转氨酶;
2)过滤和冲洗所述固定化转氨酶;
较佳地:
所述转氨酶以转氨酶菌泥的形式存在,所述转氨酶菌泥与所述树脂的质量比为3:1~1:2,例如5:4;和/或,所述反应的温度为10~40℃,优选20~25℃;和/或,所述反应的时间为10~30h,优选20~25h;和/或,所述反应在搅拌下进行,所述搅拌的速率优选为50~300rpm,例如150或200rpm。
4.一种制备西他列汀和/或(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮的方法,其包括在氨基供体存在时,在反应溶剂中用固定化转氨酶催化酮酰胺底物得西他列汀和/或(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮的步骤;
其中,所述固定化转氨酶如权利要求2所述;
较佳地,所述酮酰胺底物为4-氧代-4-[3-(三氟甲基)-5,6-二氢-[1,2,4]三唑并[4,3-a]吡嗪-7-(8H)-基]-1-(2,4,5-三氟苯基)丁-2-酮和/或1-吗啉-4-(2,4,5-三氟苯基)-1,3-丁二酮;
和/或,所述反应溶剂为异丙醇水溶液;
和/或,所述氨基供体为异丙胺;
和/或,所述氨基供体与所述酮酰胺底物的摩尔比为1:1~5:1;
和/或,当所述反应溶剂为异丙醇水溶液时,水的体积含量为2%~20%;
和/或,所述酮酰胺底物的浓度为20~200g/L;
和/或,所述固定化转氨酶与所述酮酰胺底物的质量比为1:1~6:1,例如3:1,2.5:1或2:1;
和/或,所述制备的反应体系中还包括转氨酶的辅助因子例如吡哆醛磷酸,其浓度优选为0.5~5mg/mL;
和/或,所述反应的温度为30-60℃,优选45℃。
5.一种转氨酶,其特征在于,所述转氨酶的氨基酸序列如SEQ ID NO:5、SEQ ID NO:45、SEQ ID NO:47、SEQ ID NO:49或其变体所示;所述变体为在如SEQ ID NO:5所示的氨基酸序列的第241位由R突变为A、H、N、S、L、T、C或M;
较佳地,所述变体的氨基酸序列如SEQ ID NO:11、SEQ ID NO:15、SEQ ID NO:21、SEQID NO:25、SEQ ID NO:37、SEQ ID NO:39、SEQ ID NO:41或SEQ ID NO:43所示;
更佳地,所述转氨酶的核苷酸序列如SEQ ID NO:6、SEQ ID NO:12、SEQ ID NO:16、SEQID NO:22、SEQ ID NO:26、SEQ ID NO:38、SEQ ID NO:40、SEQ ID NO:42、SEQ ID NO:44、SEQID NO:46、SEQ ID NO:48或SEQ ID NO:50所示。
6.一种多核苷酸,其编码权利要求5所述的转氨酶。
7.一种重组表达载体,其包括如权利要求6所述的多核苷酸;
较佳地,所述重组表达载体的骨架为质粒pET21a。
8.一种转化体,其为在宿主中导入如权利要求6所述的多核苷酸或者如权利要求7所述的重组表达载体;
较佳地,所述宿主为大肠杆菌;优选大肠杆菌BL21。
9.一种如权利要求5所述的转氨酶在制备西他列汀和/或(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮中的用途。
10.一种西他列汀磷酸盐的制备方法,其特征在于,所述制备方法包括下述步骤:
(1)按照权利要求4所述的方法,制得西他列汀和/或(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮;
(2)将步骤(1)制得的西他列汀和/或(R)-3-氨基-1-吗啉-4-(2,4,5-三氟苯基)-1-丁酮进行反应,得到西他列汀磷酸盐;
较佳地,所述西他列汀磷酸盐为西他列汀磷酸盐一水合物。
11.一种酶制剂,其包括如权利要求2所述的固定化转氨酶或如权利要求5所述的转氨酶;
较佳地,所述酶制剂还包括转氨酶的辅助因子例如吡哆醛磷酸。
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202110655349.9A CN115466756A (zh) | 2021-06-11 | 2021-06-11 | 转氨酶、固定化转氨酶及用于制备西他列汀的用途 |
| PCT/CN2022/092053 WO2022257686A1 (zh) | 2021-06-11 | 2022-05-10 | 转氨酶、固定化转氨酶及用于制备西他列汀的用途 |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202110655349.9A CN115466756A (zh) | 2021-06-11 | 2021-06-11 | 转氨酶、固定化转氨酶及用于制备西他列汀的用途 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN115466756A true CN115466756A (zh) | 2022-12-13 |
Family
ID=84364776
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202110655349.9A Pending CN115466756A (zh) | 2021-06-11 | 2021-06-11 | 转氨酶、固定化转氨酶及用于制备西他列汀的用途 |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN115466756A (zh) |
| WO (1) | WO2022257686A1 (zh) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN117467733A (zh) * | 2023-12-27 | 2024-01-30 | 北京元延医药科技股份有限公司 | 高手性纯度西格列汀及使用固定化转氨酶制备的方法 |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116024083B (zh) * | 2023-02-22 | 2023-06-13 | 凯莱英生命科学技术(天津)有限公司 | 连续流反应制备手性胺类化合物的装置和方法 |
Family Cites Families (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CA2838744C (en) * | 2011-06-24 | 2020-10-13 | Merck Sharp & Dohme Corp. | Immobilized transaminases and process for making and using immobilized transaminase |
| SG11201506748WA (en) * | 2013-02-28 | 2015-09-29 | Codexis Inc | Engineered transaminase polypeptides for industrial biocatalysis |
| CN105219745B (zh) * | 2014-06-12 | 2018-12-04 | 上海弈柯莱生物医药科技有限公司 | 一种固定化转氨酶及其在合成西他列汀中间体中的应用 |
| ES2733477T3 (es) * | 2017-07-04 | 2019-11-29 | Fis Fabbrica Italiana Sintetici Spa | Procedimiento eficaz para la preparación de sitagliptina a través de una preparación muy eficaz del intermedio ácido 2,4,5-trifluorofenilacético |
| WO2019207443A1 (en) * | 2018-04-24 | 2019-10-31 | Unichem Laboratories Ltd | An enzymatic process for the preparation of (r)-sitagliptin |
| CN108586346B (zh) * | 2018-05-10 | 2019-10-01 | 北京富盛嘉华医药科技有限公司 | 一种生物催化合成西他列汀及其中间体的方法 |
| EP3604507A1 (en) * | 2018-07-30 | 2020-02-05 | University College Cork-National University of Ireland, Cork | An omega-transaminase enzyme |
| US12421531B2 (en) * | 2019-10-25 | 2025-09-23 | Asymchem Laboratories (Fuxin) Co., Ltd. | Transaminase mutant and use thereof |
| CN113061594B (zh) * | 2019-12-31 | 2023-11-24 | 弈柯莱生物科技(集团)股份有限公司 | 转氨酶突变体、固定化转氨酶及用于制备西他列汀的用途 |
-
2021
- 2021-06-11 CN CN202110655349.9A patent/CN115466756A/zh active Pending
-
2022
- 2022-05-10 WO PCT/CN2022/092053 patent/WO2022257686A1/zh not_active Ceased
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN117467733A (zh) * | 2023-12-27 | 2024-01-30 | 北京元延医药科技股份有限公司 | 高手性纯度西格列汀及使用固定化转氨酶制备的方法 |
| CN117467733B (zh) * | 2023-12-27 | 2024-03-12 | 北京元延医药科技股份有限公司 | 高手性纯度西格列汀及使用固定化转氨酶制备的方法 |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2022257686A1 (zh) | 2022-12-15 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN113061594B (zh) | 转氨酶突变体、固定化转氨酶及用于制备西他列汀的用途 | |
| CN111041019B (zh) | 一种天冬氨酸酶突变体及其应用 | |
| CN111411094B (zh) | 一种(R)-ω-转氨酶突变体及其应用 | |
| CN112048485B (zh) | 一种制备西他列汀的工程化转氨酶多肽 | |
| CN110777123B (zh) | 突变的l-氨基酸连接酶以及酶催化法制备l-谷氨酸-l-色氨酸二肽的工艺 | |
| CN115466756A (zh) | 转氨酶、固定化转氨酶及用于制备西他列汀的用途 | |
| CN113136377B (zh) | 一种聚糖酶及其在川芎嗪生物合成中的应用 | |
| CN109777813A (zh) | 一种转氨酶-plp共固定化酶及其制备与应用 | |
| CN111534494B (zh) | (R)-ω-转氨酶突变体及其在制备西他列汀中间体中的应用 | |
| CN113322291A (zh) | 一种手性氨基醇类化合物的合成方法 | |
| KR100702728B1 (ko) | 환상 뎁시펩티드 합성효소 및 그의 유전자 및 환상뎁시펩티드의 대량생산계 | |
| CN115975969B (zh) | 转氨酶及用于制备西他列汀或其中间体的用途 | |
| CN110951706B (zh) | 重组R-ω-转氨酶、突变体及在不对称合成西他列汀中的应用 | |
| CN114277008A (zh) | 一种(r)-选择性转氨酶及其应用 | |
| CN117467733B (zh) | 高手性纯度西格列汀及使用固定化转氨酶制备的方法 | |
| CN118530958A (zh) | 工业催化用转氨酶突变体及在合成西他列汀中间体中的应用 | |
| CN115850522A (zh) | 一种蓝铜肽的生物合成方法 | |
| CN111979206B (zh) | 固定化融合酶及用其制备谷胱甘肽的方法 | |
| KR101379585B1 (ko) | 무염 반응을 위한 신규한 글루타메이트 디카르복실라아제 변이체 | |
| CN113980948B (zh) | 一种高活性酪氨酸酚裂解酶突变体 | |
| CN110904066B (zh) | 重组r型转氨酶、突变体及其应用 | |
| CN114410703A (zh) | 一种微生物发酵生产色氨酸的方法及应用 | |
| CN118063351B (zh) | 使用生物酶绿色合成手性β氨基酸和使用的固定化转氨酶 | |
| CN118598873B (zh) | 一种手性的氮杂-喹啉类化合物的制备方法 | |
| KR102238654B1 (ko) | 오명사마이신 a의 생산능이 증대된 스트렙토마이세스 속 변이 균주 및 이를 이용한 오명사마이신 a의 생산 방법 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| CB02 | Change of applicant information | ||
| CB02 | Change of applicant information |
Address after: Room 3114, Building B, 555 Dongchuan Road, Minhang District, Shanghai, 200241 Applicant after: Yikelai Biotechnology (Group) Co.,Ltd. Address before: Room 3114, Building B, 555 Dongchuan Road, Minhang District, Shanghai, 200241 Applicant before: Ecolab Biotechnology (Shanghai) Co.,Ltd. |
|
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination |