CN111518818A - A hydroxylase gene involved in myricetin biosynthesis and its application - Google Patents
A hydroxylase gene involved in myricetin biosynthesis and its application
- Publication number
- CN111518818A (application CN202010223426.9A)
- Authority
- CN
- China
- Prior art keywords
- myricetin
- leu
- mrf3
- ala
- hydroxylase
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- IKMDFBPHZNJCSN-UHFFFAOYSA-N Myricetin Chemical compound C=1C(O)=CC(O)=C(C(C=2O)=O)C=1OC=2C1=CC(O)=C(O)C(O)=C1 IKMDFBPHZNJCSN-UHFFFAOYSA-N 0.000 title claims abstract description 35
- 235000007743 myricetin Nutrition 0.000 title claims abstract description 35
- PCOBUQBNVYZTBU-UHFFFAOYSA-N myricetin Natural products OC1=C(O)C(O)=CC(C=2OC3=CC(O)=C(O)C(O)=C3C(=O)C=2)=C1 PCOBUQBNVYZTBU-UHFFFAOYSA-N 0.000 title claims abstract description 35
- 229940116852 myricetin Drugs 0.000 title claims abstract description 35
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 21
- 108010074633 Mixed Function Oxygenases Proteins 0.000 title claims abstract description 18
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 24
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 11
- 102000008109 Mixed Function Oxygenases Human genes 0.000 claims abstract description 7
- 239000002773 nucleotide Substances 0.000 claims abstract description 6
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 6
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 claims abstract description 4
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims abstract description 4
- 150000003278 haem Chemical class 0.000 claims abstract description 4
- 150000001413 amino acids Chemical group 0.000 claims description 8
- 238000010353 genetic engineering Methods 0.000 claims description 4
- 235000018102 proteins Nutrition 0.000 claims 3
- IYRMWMYZSQPJKC-UHFFFAOYSA-N kaempferol Chemical compound C1=CC(O)=CC=C1C1=C(O)C(=O)C2=C(O)C=C(O)C=C2O1 IYRMWMYZSQPJKC-UHFFFAOYSA-N 0.000 abstract description 22
- MWDZOUNAPSSOEL-UHFFFAOYSA-N kaempferol Natural products OC1=C(C(=O)c2cc(O)cc(O)c2O1)c3ccc(O)cc3 MWDZOUNAPSSOEL-UHFFFAOYSA-N 0.000 abstract description 21
- REFJWTPEDVJJIY-UHFFFAOYSA-N Quercetin Chemical compound C=1C(O)=CC(O)=C(C(C=2O)=O)C=1OC=2C1=CC=C(O)C(O)=C1 REFJWTPEDVJJIY-UHFFFAOYSA-N 0.000 abstract description 20
- UBSCDKPKWHYZNX-UHFFFAOYSA-N Demethoxycapillarisin Natural products C1=CC(O)=CC=C1OC1=CC(=O)C2=C(O)C=C(O)C=C2O1 UBSCDKPKWHYZNX-UHFFFAOYSA-N 0.000 abstract description 11
- 235000008777 kaempferol Nutrition 0.000 abstract description 11
- UXOUKMQIEVGVLY-UHFFFAOYSA-N morin Natural products OC1=CC(O)=CC(C2=C(C(=O)C3=C(O)C=C(O)C=C3O2)O)=C1 UXOUKMQIEVGVLY-UHFFFAOYSA-N 0.000 abstract description 11
- ZVOLCUVKHLEPEV-UHFFFAOYSA-N Quercetagetin Natural products C1=C(O)C(O)=CC=C1C1=C(O)C(=O)C2=C(O)C(O)=C(O)C=C2O1 ZVOLCUVKHLEPEV-UHFFFAOYSA-N 0.000 abstract description 10
- HWTZYBCRDDUBJY-UHFFFAOYSA-N Rhynchosin Natural products C1=C(O)C(O)=CC=C1C1=C(O)C(=O)C2=CC(O)=C(O)C=C2O1 HWTZYBCRDDUBJY-UHFFFAOYSA-N 0.000 abstract description 10
- 240000004808 Saccharomyces cerevisiae Species 0.000 abstract description 10
- 235000005875 quercetin Nutrition 0.000 abstract description 10
- 229960001285 quercetin Drugs 0.000 abstract description 10
- 101150118163 h gene Proteins 0.000 abstract description 6
- 238000000338 in vitro Methods 0.000 abstract description 5
- 238000003786 synthesis reaction Methods 0.000 abstract description 5
- 101000979697 Streptomyces carzinostaticus 2-hydroxy-5-methyl-1-naphthoate 7-hydroxylase Proteins 0.000 abstract description 4
- 238000012795 verification Methods 0.000 abstract description 4
- 238000012408 PCR amplification Methods 0.000 abstract description 2
- 238000002864 sequence alignment Methods 0.000 abstract 1
- 235000009134 Myrica cerifera Nutrition 0.000 description 10
- 244000061457 Solanum nigrum Species 0.000 description 10
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 9
- 239000013598 vector Substances 0.000 description 8
- 108020004414 DNA Proteins 0.000 description 6
- 244000132436 Myrica rubra Species 0.000 description 5
- 235000014631 Myrica rubra Nutrition 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 229930003935 flavonoid Natural products 0.000 description 5
- 150000002215 flavonoids Chemical class 0.000 description 5
- 235000017173 flavonoids Nutrition 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 241000196324 Embryophyta Species 0.000 description 4
- 108010062650 Flavonoid 3',5'-hydroxylase Proteins 0.000 description 4
- 230000003197 catalytic effect Effects 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 235000013399 edible fruits Nutrition 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 description 3
- 102000002004 Cytochrome P-450 Enzyme System Human genes 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 238000000034 method Methods 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- 235000003283 Pachira macrocarpa Nutrition 0.000 description 2
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 2
- 238000002123 RNA extraction Methods 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 240000001085 Trapa natans Species 0.000 description 2
- 235000014364 Trapa natans Nutrition 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 238000010804 cDNA synthesis Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 235000013305 food Nutrition 0.000 description 2
- 235000019253 formic acid Nutrition 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 238000005805 hydroxylation reaction Methods 0.000 description 2
- 101150044508 key gene Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 238000012257 pre-denaturation Methods 0.000 description 2
- 235000009165 saligot Nutrition 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- IAOXXKYIZHCAQJ-ACZMJKKPSA-N (2s)-2-[[2-[[(2s)-2-[[(2s)-2,4-diamino-4-oxobutanoyl]amino]propanoyl]amino]acetyl]amino]propanoic acid Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O IAOXXKYIZHCAQJ-ACZMJKKPSA-N 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- CWRBRVZBMVJENN-UVBJJODRSA-N Ala-Trp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N CWRBRVZBMVJENN-UVBJJODRSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- GOKCTAJWRPSCHP-VHWLVUOQSA-N Asn-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)N)N GOKCTAJWRPSCHP-VHWLVUOQSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- FIRWLDUOFOULCA-XIRDDKMYSA-N Asp-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N FIRWLDUOFOULCA-XIRDDKMYSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- 208000024172 Cardiovascular disease Diseases 0.000 description 1
- LZZYPRNAOMGNLH-UHFFFAOYSA-M Cetrimonium bromide Chemical compound [Br-].CCCCCCCCCCCCCCCC[N+](C)(C)C LZZYPRNAOMGNLH-UHFFFAOYSA-M 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 1
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 1
- 235000008375 Decussocarpus nagi Nutrition 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- ZQNCUVODKOBSSO-XEGUGMAKSA-N Glu-Trp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZQNCUVODKOBSSO-XEGUGMAKSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- PASHZZBXZYEXFE-LSDHHAIUSA-N Gly-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN)C(=O)O PASHZZBXZYEXFE-LSDHHAIUSA-N 0.000 description 1
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- RNAYRCNHRYEBTH-IHRRRGAJSA-N His-Met-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RNAYRCNHRYEBTH-IHRRRGAJSA-N 0.000 description 1
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 1
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 1
- LOXMWQOKYBGCHF-JBDRJPRFSA-N Ile-Cys-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O LOXMWQOKYBGCHF-JBDRJPRFSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 1
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 1
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- FZDOBWIKRQORAC-ULQDDVLXSA-N Met-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N FZDOBWIKRQORAC-ULQDDVLXSA-N 0.000 description 1
- ACFIXJIJDZMPPO-NNYOXOHSSA-N NADPH Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](OP(O)(O)=O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 ACFIXJIJDZMPPO-NNYOXOHSSA-N 0.000 description 1
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 1
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 1
- OHDXOXIZXSFCDN-RCWTZXSCSA-N Thr-Met-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OHDXOXIZXSFCDN-RCWTZXSCSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- WFZYXGSAPWKTHR-XEGUGMAKSA-N Trp-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WFZYXGSAPWKTHR-XEGUGMAKSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- UFCHCOKFAGOQSF-BQFCYCMXSA-N Val-Trp-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N UFCHCOKFAGOQSF-BQFCYCMXSA-N 0.000 description 1
- ODUHAIXFXFACDY-SRVKXCTJSA-N Val-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C ODUHAIXFXFACDY-SRVKXCTJSA-N 0.000 description 1
- 235000009754 Vitis X bourquina Nutrition 0.000 description 1
- 235000012333 Vitis X labruscana Nutrition 0.000 description 1
- 240000006365 Vitis vinifera Species 0.000 description 1
- 235000014787 Vitis vinifera Nutrition 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 235000010208 anthocyanin Nutrition 0.000 description 1
- 229930002877 anthocyanin Natural products 0.000 description 1
- 239000004410 anthocyanin Substances 0.000 description 1
- 150000004636 anthocyanins Chemical class 0.000 description 1
- 230000003110 anti-inflammatory effect Effects 0.000 description 1
- 230000000259 anti-tumor effect Effects 0.000 description 1
- -1 antioxidant Chemical compound 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 230000003078 antioxidant effect Effects 0.000 description 1
- 235000006708 antioxidants Nutrition 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010066988 asparaginyl-alanyl-glycyl-alanine Proteins 0.000 description 1
- 108010027371 asparaginyl-leucyl-prolyl-arginine Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 238000010805 cDNA synthesis kit Methods 0.000 description 1
- 210000004027 cell Anatomy 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 229930182470 glycoside Natural products 0.000 description 1
- 150000002338 glycosides Chemical class 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 235000013402 health food Nutrition 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 1
- 239000012074 organic phase Substances 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 210000003934 vacuole Anatomy 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
- C12N9/0083—Miscellaneous (1.14.99)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/02—Oxygen as only ring hetero atoms
- C12P17/06—Oxygen as only ring hetero atoms containing a six-membered hetero ring, e.g. fluorescein
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Medicinal Chemistry (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
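The invention discloses a hydroxylase gene involved in myricetin biosynthesis and its application. The hydroxylase is MrF3’5’H, and the nucleotide sequence of the hydroxylase gene is shown in SEQ ID NO. 1. The full-length MrF3’5’H sequence was obtained by PCR amplification, and its encoded protein sequence is shown in SEQ ID NO. 2. Sequence alignment shows that the protein contains the proline-rich, SRS, CR, EXXR and heme-binding conserved domains of the CYP450 75A family. The invention verifies the function of the MrF3’5’H gene: in vitro functional verification in yeast shows that MrF3’5’H has P450 hydroxylase activity and can convert both kaempferol and quercetin into myricetin. The invention provides guidance for studying how metabolic flux is partitioned toward myricetin and lays a foundation for engineering myricetin synthesis.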
Description
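Technical Field
The invention belongs to the field of molecular biology and relates to recombinant proteins and genetic engineering, and in particular to the gene of a hydroxylase, MrF3’5’H, involved in myricetin biosynthesis, the protein encoded by the gene, and their application.
Background Art
Cytochrome P450 (CYP450) plays an important role in the biosynthesis of many substances, for example secondary metabolites (such as flavonoids) and hormones. Flavonoid 3’,5’-hydroxylase (F3’5’H) belongs to the CYP450 75A (CYP75A) subfamily and catalyzes hydroxylation at the 3’ and 5’ positions of the flavonoid B ring. To date, functional studies of F3’5’H genes have focused mainly on the synthesis of purple anthocyanins in plants, and studies of other functions of F3’5’H genes remain limited.
Chinese bayberry (Morella rubra) is a fruit characteristic of China with notable pharmacological activity, which is closely tied to its high flavonoid content. Myricetin is a major flavonoid in Chinese bayberry; it was first isolated from bayberry bark, from which it takes its name, and usually occurs in the vacuole as glycoside derivatives. Numerous studies have reported antioxidant, antitumor, anti-inflammatory and cardiovascular-protective activities of myricetin. Plants rich in myricetin can be used to produce health foods and medicines and have broad prospects for development and application.
Identifying MrF3’5’H, a gene involved in myricetin biosynthesis in Chinese bayberry, is important for elucidating the mechanism of myricetin biosynthesis in this species. It lays a foundation for developing engineered microbial strains and can be used to improve the myricetin profile of other plants through genetic engineering, which has significant practical value for raising the myricetin content of foods and enhancing their health benefits.
Summary of the Invention
An object of the invention is to provide a hydroxylase gene involved in myricetin biosynthesis, namely the gene MrF3’5’H from Chinese bayberry and its encoded protein. The gene belongs to the CYP450 family of Chinese bayberry and is a key gene in myricetin biosynthesis. The hydroxylase is MrF3’5’H; the nucleotide sequence of the hydroxylase gene is shown in SEQ ID NO. 1, and the amino acid sequence of the encoded protein is shown in SEQ ID NO. 2. The protein sequence contains the proline-rich, SRS, CR, EXXR and heme-binding conserved domains of the CYP450 75A family.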
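As an illustration of how conserved motifs of this kind can be located in a protein sequence, the following minimal sketch converts two 16-residue fragments copied from SEQ ID NO. 2 (residues 353-368 and 433-448) to one-letter code and scans them with regular expressions. The motif patterns (`F..G.R.C.G` for the heme-binding signature and `E..R` for EXXR) are commonly cited P450 consensus forms and are assumptions here, not definitions taken from the patent; the function names are hypothetical.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Assumed consensus patterns ("." matches any residue); not from the patent.
MOTIFS = {
    "heme_binding": r"F..G.R.C.G",
    "EXXR": r"E..R",
}


def to_one_letter(three_letter_seq: str) -> str:
    """Convert a space-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[res] for res in three_letter_seq.split())


def find_motifs(seq: str) -> dict:
    """Return the 0-based start positions of each motif within seq."""
    return {name: [m.start() for m in re.finditer(pattern, seq)]
            for name, pattern in MOTIFS.items()}


# Fragments copied from SEQ ID NO. 2 (residues 433-448 and 353-368).
heme_fragment = to_one_letter(
    "Asp Phe Glu Leu Ile Pro Phe Gly Ala Gly Arg Arg Ile Cys Ala Gly")
exxr_fragment = to_one_letter(
    "Pro Tyr Leu Gln Ala Ile Cys Lys Glu Thr Met Arg Lys His Pro Ser")

if __name__ == "__main__":
    print(heme_fragment, find_motifs(heme_fragment))
    print(exxr_fragment, find_motifs(exxr_fragment))
```

The `E..R` pattern is very permissive and will match at many places in a full-length protein, so in practice a hit would need to be confirmed against an alignment such as the one in Figure 1.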
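Another object of the invention is to provide the use of the hydroxylase (MrF3’5’H) gene involved in myricetin biosynthesis and its encoded protein in genetic engineering to improve the myricetin content and composition of plants. In vitro functional verification in yeast shows that MrF3’5’H has P450 hydroxylase activity and can convert both kaempferol and quercetin into myricetin, with significantly higher catalytic activity toward kaempferol than toward quercetin. The invention lays a foundation for engineering myricetin synthesis.
The gene provided by the invention has the following features:
(1) Sequence features: the coding sequence (CDS) of the MrF3’5’H gene is shown in SEQ ID NO. 1; it is 1530 nucleotides long and encodes a protein of 509 amino acids. The protein sequence, shown in SEQ ID NO. 2, contains the conserved proline-rich, SRS, CR, EXXR and heme-binding domains and belongs to the CYP450 family.
(2) Functional features: in vitro functional verification in yeast shows that MrF3’5’H has P450 hydroxylase activity and can convert both kaempferol and quercetin into myricetin, with significantly higher catalytic activity toward kaempferol than toward quercetin.
The invention provides MrF3’5’H, a key gene involved in myricetin biosynthesis in Chinese bayberry, having the nucleotide sequence shown in SEQ ID NO. 1 and the amino acid sequence shown in SEQ ID NO. 2. In vitro functional verification in yeast shows that MrF3’5’H has P450 hydroxylase activity and can convert both kaempferol and quercetin into myricetin, with significantly higher catalytic activity toward kaempferol than toward quercetin. This work provides guidance for studying how metabolic flux is partitioned toward myricetin and lays a foundation for developing engineered microbial strains.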
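Brief Description of the Drawings
Figure 1. Alignment of the MrF3’5’H amino acid sequence; SlF3’5’H (ACF32346), PhF3’5’H-Hf1 (CAA80266), PhF3’5’H-Hf2 (CAA80265).
Figure 2. LC-MS profiles of the in vitro enzyme activity assay of recombinant MrF3’5’H with kaempferol and quercetin.
Detailed Description of the Embodiments
The invention is further described below with reference to the drawings and examples.
Note: basic operations involved in the invention, such as primer design for the target gene, full-length cloning, expression vector construction, RNA extraction, cDNA synthesis, sequencing analysis and identification, and separation and purification of PCR products, can be carried out according to techniques known in the art. Unless otherwise specified, the technical means used in the examples are conventional means well known to those skilled in the art.
Example 1: Obtaining and identifying the full-length MrF3’5’H gene
1. Chinese bayberry tissue material
Tissues (fruit, flowers, leaves) of the Chinese bayberry cultivars 'Biqi' and 'Dongkui' were harvested and, on the same day, frozen in liquid nitrogen and stored at -80°C. Three biological replicates were set up for each tissue sample, with 7-8 fruits per replicate, at least 500 g of flowers per replicate, and 10-15 whole leaves per replicate.
2. RNA extraction and cDNA synthesis
Tissue samples were ground to powder in liquid nitrogen, and total RNA was extracted by the standard CTAB method. After the RNA passed quality checks by electrophoresis, DNA was removed following the instructions of the TURBO DNase Kit (Ambion), and 1.0 μg of RNA was reverse-transcribed into cDNA according to the iScript cDNA Synthesis Kit (Bio-Rad) protocol.
3. Obtaining the full-length MrF3’5’H gene
The 'Biqi' RNA-Seq database was searched with the keyword "Flavonoid 3',5'-hydroxylase" for genes related to myricetin synthesis. Using the amino acid sequence of SlF3’5’H, a functionally characterized enzyme from grape, as the reference, homology alignment with CLUSTALX identified one gene that may be involved in myricetin synthesis in bayberry fruit, Unigene5190 (MrF3’5’H), whose sequence is given in SEQ ID NO. 1; online BLAST analysis (https://blast.ncbi.nlm.nih.gov/Blast.cgi) confirmed it to be a full-length sequence. Full-length cloning primers SEQ ID NO. 3 and SEQ ID NO. 4 were designed. The PCR reaction volume was 50 μL, consisting of 0.5 μL Roche high-fidelity polymerase, 5 μL buffer (10×), 4 μL dNTPs (2.5 mM), 2 μL each of forward and reverse primers (10 μM, Hua Gene), 4 μL cDNA and 32.5 μL H2O. The program was: initial denaturation at 95°C for 2 min; 35 cycles of denaturation at 95°C for 30 s, annealing at 58°C for 30 s and extension at 72°C for 90 s; final extension at 72°C for 10 min; hold at 4°C.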
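The reaction recipe above can be sanity-checked with a few lines of arithmetic. The sketch below is only an illustration under the stated volumes and stock concentrations: it confirms that the components add up to the 50 μL reaction volume and derives the final dNTP, primer and buffer concentrations by simple dilution. The helper names are hypothetical.

```python
# Component volumes (uL) of the full-length cloning PCR, as listed in the text.
MIX_UL = {
    "polymerase": 0.5,
    "buffer_10x": 5.0,
    "dNTP_2.5mM": 4.0,
    "primer_fwd_10uM": 2.0,
    "primer_rev_10uM": 2.0,
    "cDNA": 4.0,
    "H2O": 32.5,
}
TOTAL_UL = 50.0


def volumes_match(mix: dict, total_ul: float) -> bool:
    """True if the component volumes sum to the stated reaction volume."""
    return abs(sum(mix.values()) - total_ul) < 1e-9


def final_concentration(stock: float, added_ul: float, total_ul: float) -> float:
    """Dilution formula C_final = C_stock * V_added / V_total (same unit as stock)."""
    return stock * added_ul / total_ul


if __name__ == "__main__":
    print("volumes sum to 50 uL:", volumes_match(MIX_UL, TOTAL_UL))
    print("dNTP (mM):", final_concentration(2.5, MIX_UL["dNTP_2.5mM"], TOTAL_UL))
    print("each primer (uM):", final_concentration(10.0, MIX_UL["primer_fwd_10uM"], TOTAL_UL))
    print("buffer (x):", final_concentration(10.0, MIX_UL["buffer_10x"], TOTAL_UL))
```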
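4. Identification and sequence analysis of the full-length MrF3’5’H gene
The PCR product was ligated into the T-easy vector and transformed into Escherichia coli DH5α. Colonies were verified by colony PCR, and positive colonies were sequenced. Sequencing confirmed a full-length MrF3’5’H sequence of 1530 nucleotides matching the transcriptome database, as shown in SEQ ID NO. 1. It was translated online into the amino acid sequence (http://web.expasy.org/translate/), giving SEQ ID NO. 2. The MrF3’5’H amino acid sequence was aligned with published P450 75A family hydroxylases having 3’,5’-hydroxylation activity; the result is shown in Figure 1.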
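The translation step can also be reproduced offline. The following is a minimal sketch, not the tool used in the patent: it builds the standard genetic code table and translates a coding sequence up to the first stop codon. It is applied here to the first 60 and last 30 nucleotides of SEQ ID NO. 1; a full 1530-nt CDS including its stop codon would give (1530 - 3) / 3 = 509 residues. The function and variable names are hypothetical.

```python
from itertools import product

# Standard genetic code in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace (as used in sequence listings) is ignored.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt and last 30 nt of SEQ ID NO. 1, copied from the sequence listing.
FIRST_60 = "atggccgtag acatgttcct cctcagagaa cttgttgtgg cgattgtcct cttcttcata"
LAST_30 = "cgcctatctt taagcgcata tgcttcttaa"

if __name__ == "__main__":
    print(translate(FIRST_60))
    print(translate(LAST_30))
```

Both fragments agree with the corresponding ends of SEQ ID NO. 2 (Met Ala Val Asp ... and ... Ser Ala Tyr Ala Ser).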
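Example 2: Construction of the pYES2-MrF3’5’H expression vector
Based on the multiple cloning site sequence of the pYES2/NT C vector (Invitrogen) and the full-length MrF3’5’H gene sequence (SEQ ID NO. 1), primers containing BamHI and EcoRI restriction sites were designed: SEQ ID NO. 5 and SEQ ID NO. 6. The primers include the start codon and the stop codon, and amplification yields an MrF3’5’H sequence flanked by BamHI and EcoRI sites. The PCR reaction volume was 50 μL, consisting of 1 μL Phanta high-fidelity polymerase (Vazyme), 25 μL buffer (2×), 1 μL dNTPs (10 mM), 2 μL each of forward and reverse primers (10 μM, Hua Gene), 1 μL cDNA and 18 μL H2O. The program was: initial denaturation at 95°C for 2 min; 35 cycles of denaturation at 95°C for 15 s, annealing at 58°C for 15 s and extension at 72°C for 1 min; final extension at 72°C for 5 min; hold at 4°C. The pYES2 vector was double-digested with BamHI (NEB) and EcoRI (NEB), and the target gene fragment was joined to the pYES2 vector using II ligase (Vazyme). The ligation reaction volume was 10 μL, consisting of 1 μL II ligase (Vazyme), 2 μL buffer (5×), 1 μL recovered PCR product, 3 μL vector and 3 μL H2O. After mixing, the reaction was incubated at 37°C for 0.5 h and then placed on ice for 5 min. The ligation product was transformed into DH5α competent cells (Takara), which were plated on Amp-containing plates and cultured overnight at 37°C. Positive clones were picked and sent to Hua Gene (Shanghai) for sequencing; after analysis of the sequencing results, vectors containing the correct target gene sequence were taken as the successfully constructed pYES2-MrF3’5’H recombinant plasmid.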
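The primer design described above can be checked directly against the sequence listing. The sketch below is illustrative only: it takes SEQ ID NO. 5 and SEQ ID NO. 6 as printed, locates the BamHI (`ggatcc`) and EcoRI (`gaattc`) recognition sites, and confirms that the bases 3' of each site match the start of the CDS and the reverse complement of its end, respectively. The helper names and the short CDS excerpts are chosen for this example.

```python
SITES = {"BamHI": "ggatcc", "EcoRI": "gaattc"}

FWD_PRIMER = "tgacgataaggtacccggatccatggccgtagacat"    # SEQ ID NO. 5
REV_PRIMER = "gtgctggatatctgcagaattcttaagaagcatatgc"   # SEQ ID NO. 6

# Excerpts of SEQ ID NO. 1: the first 20 nt and the last 17 nt of the CDS.
CDS_START = "atggccgtagacatgttcct"
CDS_END = "gcgcatatgcttcttaa"


def reverse_complement(seq: str) -> str:
    """Reverse complement of a lowercase or uppercase DNA string."""
    return seq.lower().translate(str.maketrans("acgt", "tgca"))[::-1]


def gene_specific_part(primer: str, site: str) -> str:
    """Return the bases that follow the restriction site in the primer."""
    pos = primer.find(site)
    if pos < 0:
        raise ValueError(f"site {site} not found in primer")
    return primer[pos + len(site):]


fwd_tail = gene_specific_part(FWD_PRIMER, SITES["BamHI"])
rev_tail = gene_specific_part(REV_PRIMER, SITES["EcoRI"])

if __name__ == "__main__":
    print("forward anneals at CDS start:", CDS_START.startswith(fwd_tail))
    print("reverse anneals at CDS end:",
          CDS_END.endswith(reverse_complement(rev_tail)))
```

The forward tail begins with `atg` and the reverse tail begins with `tta` (the complement of the `taa` stop), which is consistent with the statement that the primers carry the start and stop codons.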
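Example 3: Heterologous expression of MrF3’5’H in Saccharomyces cerevisiae
1. Yeast transformation with the recombinant vector
The successfully constructed pYES2-MrF3’5’H recombinant plasmid or the empty pYES2 vector was transformed into S. cerevisiae strain INVSc1 (Invitrogen) by the LiAc method using a yeast transformation kit (Clontech). Cells were then plated on SD/-Ura plates and cultured at 30°C for 3 days; single colonies were picked and checked by PCR for the recombinant plasmid or empty vector. Single colonies giving the correct PCR band were taken as S. cerevisiae INVSc1 carrying the pYES2-MrF3’5’H recombinant plasmid and stored in 25% glycerol at -80°C until use.
2. Induction of MrF3’5’H expression
A single colony was inoculated into 5 mL of SD/-Ura medium with 20 g/L glucose and cultured at 30°C on a shaker at 250 rpm for 12 h. Yeast cells were collected by centrifugation at 700 × g for 5 min at room temperature and resuspended in SD/-Ura medium with 20 g/L galactose to an OD600 of 0.4. A 2 mL portion of the resuspended culture was transferred to each of two new centrifuge tubes, and 1 mM NADPH (Sigma) and 5 mM substrate (kaempferol or quercetin) were added to each, followed by incubation at 16°C on a shaker at 250 rpm for 12 h. Yeast carrying the empty vector was used as the control throughout.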
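To make the substrate amounts concrete, the sketch below computes molar masses from the molecular formulas of the three flavonols and the mass of substrate needed for a 5 mM final concentration. It assumes a 2 mL reaction volume per tube; the molecular formulas (C15H10O6, C15H10O7, C15H10O8) and atomic masses are standard reference values, not figures from the patent, and the function names are hypothetical.

```python
# Standard atomic masses (g/mol); reference values, not from the patent.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "O": 15.999}

# Molecular formulas; each B-ring hydroxylation adds one oxygen atom.
FORMULAS = {
    "kaempferol": {"C": 15, "H": 10, "O": 6},
    "quercetin": {"C": 15, "H": 10, "O": 7},
    "myricetin": {"C": 15, "H": 10, "O": 8},
}


def molar_mass(formula: dict) -> float:
    """Molar mass in g/mol from an element-count dictionary."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


def substrate_mass_mg(conc_mm: float, volume_ml: float, mass_g_per_mol: float) -> float:
    """Mass (mg) needed for conc_mm (mmol/L) in volume_ml (mL).

    mmol/L * mL = umol; umol * g/mol = ug; divide by 1000 for mg.
    """
    return conc_mm * volume_ml * mass_g_per_mol / 1000.0


if __name__ == "__main__":
    for name in ("kaempferol", "quercetin"):
        mm = molar_mass(FORMULAS[name])
        # Assumption: 2 mL reaction volume per tube, 5 mM final substrate.
        print(f"{name}: {mm:.3f} g/mol, {substrate_mass_mg(5.0, 2.0, mm):.2f} mg per 2 mL")
```

The same table also gives the mass shift expected in LC-MS when kaempferol (two added oxygens) or quercetin (one added oxygen) is converted to myricetin.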
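3. Detection of myricetin
After induction, the reaction was stopped by adding an equal volume (1:1) of ethyl acetate; the mixture was vortexed and centrifuged at 1000 × g for 5 min, and the supernatant was transferred to a new 10 mL centrifuge tube. The extraction was repeated once, the supernatants were combined, the organic phase was evaporated to dryness on a vacuum rotary evaporator, and the residue was dissolved in 150 μL of HPLC-grade methanol for analysis. Liquid chromatography conditions: mobile phase A, water (0.1% formic acid); mobile phase B, acetonitrile (0.1% formic acid); injection volume, 10 μL; flow rate, 0.3 mL/min; column temperature, 25°C; detection wavelength, 370 nm. Elution gradient: 0-7 min, 90%-50% A; 7-10 min, 50% A; 10-15 min, 50%-0% A; 15-15.1 min, 0%-90% A; 15.1-21 min, 90% A.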
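The elution gradient can be written as a small lookup of time points, which makes it easy to read off the mobile-phase composition at any moment of the run. The sketch below assumes linear ramps between the listed breakpoints (the text gives only the endpoints of each segment); the function names are hypothetical.

```python
# (time in min, % mobile phase A) breakpoints taken from the gradient table.
GRADIENT = [
    (0.0, 90.0),
    (7.0, 50.0),
    (10.0, 50.0),
    (15.0, 0.0),
    (15.1, 90.0),
    (21.0, 90.0),
]


def percent_a(t_min: float) -> float:
    """Percent of mobile phase A at time t_min, assuming linear ramps."""
    if not GRADIENT[0][0] <= t_min <= GRADIENT[-1][0]:
        raise ValueError("time outside the 0-21 min run")
    for (t0, a0), (t1, a1) in zip(GRADIENT, GRADIENT[1:]):
        if t0 <= t_min <= t1:
            return a0 + (a1 - a0) * (t_min - t0) / (t1 - t0)
    raise AssertionError("unreachable: breakpoints cover the whole run")


def percent_b(t_min: float) -> float:
    """Percent of mobile phase B (the remainder of the binary mixture)."""
    return 100.0 - percent_a(t_min)


if __name__ == "__main__":
    for t in (0.0, 3.5, 8.0, 12.5, 20.0):
        print(f"{t:5.1f} min: {percent_a(t):5.1f}% A, {percent_b(t):5.1f}% B")
```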
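The results show that MrF3’5’H has the activity of a P450 75A family hydroxylase and converts both kaempferol and quercetin into myricetin, with higher catalytic activity toward kaempferol (Figure 2).
Specific embodiments of the invention have been described above. It should be understood that those of ordinary skill in the art may make improvements or changes based on the above description within the scope of the claims, without affecting the substance of the invention.
Sequence Listing
<110> Zhejiang University
<120> A hydroxylase gene involved in myricetin biosynthesis and its application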
<160> 6
<170> SIPOSequenceListing 1.0
<210> 1
<211> 1530
<212> DNA
<213> Chinese bayberry (Morella rubra)
<400> 1
atggccgtag acatgttcct cctcagagaa cttgttgtgg cgattgtcct cttcttcata 60
acccgctttt ctatccagtt actatttaaa aaaccttctc gaaaacttcc acctggccct 120
aaaggttggc cttttcttgg ggcccttaca attctaggag ctatgcctca tgtaacctta 180
gcccagatgg ccaagaaata tggacccgtc atgtacctga aaatgggcac ttgtaacatg 240
gtcgtggcct ctactccaga tgcagcacga gcgttcttga aaacgctaga cctgaatttc 300
tcgaaccgtc caccgaacgc tggcgcaacc cacttagcct atgatgctca ggacatggtg 360
tttgcggact atggagcaag gtggaagttg cttagaaagt tgagcaacct acacatgctt 420
ggagggaagg ctctcgaaga ctgggctcag gttcgagcat ttgagctagg ccacatgctt 480
agagccatgt gtgagtctag caagagagcc gagcctgtgg tgataccaga gatgttgact 540
tatgccatgg caaacatgat cggacaggtg atactaagtc gccgtgtgtt cgtgactaag 600
ggctcggagt ctaacgagtt taaggacatg gtggtggagc tcatgacatc agctgggtac 660
ttcaacatcg gcgatttcat accatccatc gcgtggatgg acttgcaagg aattgagcgc 720
ggaatgaagc gcctgcacaa acgcttcgac gtgctactga caaagatgat tgaggagcat 780
actgcttctg cccgtgaccg caagggaaag ccagatttct tggatgtcgt catggctaac 840
agagaaaact ccgagggcga gaggcttagt ttgactaaca ttaaggcact cctgttgaac 900
ttatttactg ccggcaccga cacatcatca agcattatag aatgggcact tgcggagatg 960
ttgagcaacc ccagcatcct taggcgggct cacgaggaga tggatcaagt gattggcagg 1020
aacagacgcc tcgaggaggc agacatatca aagctaccat atctccaagc catatgcaaa 1080
gaaaccatgc ggaagcaccc ttccacgcca ctcaacctgc cccgggtttc aaccgaagca 1140
tgcgaagtga atggctacta cattccaaag aacaccaggc ttagcgtgaa catatgggga 1200
atagggagag accctgatgt gtgggaaaac ccgctggatt tcacgccaga aagatttttg 1260
tctgggagaa atgccaagat cgatccaaga gggaatgatt tcgagctgat tccattcggg 1320
gctggaagga ggatttgtgc agggaccagg atgggaatta cgctggtgga gtacattctc 1380
ggcacgttgg tgcactcctt tgactggaaa ttgcccaatg gagttgataa gctagacatg 1440
caggagtcct ttggacttgc gttgcaaaag agtgtgccac ttgcggctct agttacccca 1500
cgcctatctt taagcgcata tgcttcttaa 1530
<210> 2
<211> 509
<212> PRT
<213> Chinese bayberry (Morella rubra)
<400> 2
Met Ala Val Asp Met Phe Leu Leu Arg Glu Leu Val Val Ala Ile Val
1 5 10 15
Leu Phe Phe Ile Thr Arg Phe Ser Ile Gln Leu Leu Phe Lys Lys Pro
20 25 30
Ser Arg Lys Leu Pro Pro Gly Pro Lys Gly Trp Pro Phe Leu Gly Ala
35 40 45
Leu Thr Ile Leu Gly Ala Met Pro His Val Thr Leu Ala Gln Met Ala
50 55 60
Lys Lys Tyr Gly Pro Val Met Tyr Leu Lys Met Gly Thr Cys Asn Met
65 70 75 80
Val Val Ala Ser Thr Pro Asp Ala Ala Arg Ala Phe Leu Lys Thr Leu
85 90 95
Asp Leu Asn Phe Ser Asn Arg Pro Pro Asn Ala Gly Ala Thr His Leu
100 105 110
Ala Tyr Asp Ala Gln Asp Met Val Phe Ala Asp Tyr Gly Ala Arg Trp
115 120 125
Lys Leu Leu Arg Lys Leu Ser Asn Leu His Met Leu Gly Gly Lys Ala
130 135 140
Leu Glu Asp Trp Ala Gln Val Arg Ala Phe Glu Leu Gly His Met Leu
145 150 155 160
Arg Ala Met Cys Glu Ser Ser Lys Arg Ala Glu Pro Val Val Ile Pro
165 170 175
Glu Met Leu Thr Tyr Ala Met Ala Asn Met Ile Gly Gln Val Ile Leu
180 185 190
Ser Arg Arg Val Phe Val Thr Lys Gly Ser Glu Ser Asn Glu Phe Lys
195 200 205
Asp Met Val Val Glu Leu Met Thr Ser Ala Gly Tyr Phe Asn Ile Gly
210 215 220
Asp Phe Ile Pro Ser Ile Ala Trp Met Asp Leu Gln Gly Ile Glu Arg
225 230 235 240
Gly Met Lys Arg Leu His Lys Arg Phe Asp Val Leu Leu Thr Lys Met
245 250 255
Ile Glu Glu His Thr Ala Ser Ala Arg Asp Arg Lys Gly Lys Pro Asp
260 265 270
Phe Leu Asp Val Val Met Ala Asn Arg Glu Asn Ser Glu Gly Glu Arg
275 280 285
Leu Ser Leu Thr Asn Ile Lys Ala Leu Leu Leu Asn Leu Phe Thr Ala
290 295 300
Gly Thr Asp Thr Ser Ser Ser Ile Ile Glu Trp Ala Leu Ala Glu Met
305 310 315 320
Leu Ser Asn Pro Ser Ile Leu Arg Arg Ala His Glu Glu Met Asp Gln
325 330 335
Val Ile Gly Arg Asn Arg Arg Leu Glu Glu Ala Asp Ile Ser Lys Leu
340 345 350
Pro Tyr Leu Gln Ala Ile Cys Lys Glu Thr Met Arg Lys His Pro Ser
355 360 365
Thr Pro Leu Asn Leu Pro Arg Val Ser Thr Glu Ala Cys Glu Val Asn
370 375 380
Gly Tyr Tyr Ile Pro Lys Asn Thr Arg Leu Ser Val Asn Ile Trp Gly
385 390 395 400
Ile Gly Arg Asp Pro Asp Val Trp Glu Asn Pro Leu Asp Phe Thr Pro
405 410 415
Glu Arg Phe Leu Ser Gly Arg Asn Ala Lys Ile Asp Pro Arg Gly Asn
420 425 430
Asp Phe Glu Leu Ile Pro Phe Gly Ala Gly Arg Arg Ile Cys Ala Gly
435 440 445
Thr Arg Met Gly Ile Thr Leu Val Glu Tyr Ile Leu Gly Thr Leu Val
450 455 460
His Ser Phe Asp Trp Lys Leu Pro Asn Gly Val Asp Lys Leu Asp Met
465 470 475 480
Gln Glu Ser Phe Gly Leu Ala Leu Gln Lys Ser Val Pro Leu Ala Ala
485 490 495
Leu Val Thr Pro Arg Leu Ser Leu Ser Ala Tyr Ala Ser
500 505
<210> 3
<211> 21
<212> DNA
<213> Artificial sequence (Unknown)
<400> 3
atggccgtag acatgttcct c 21
<210> 4
<211> 23
<212> DNA
<213> Artificial sequence (Unknown)
<400> 4
ttaagaagca tatgcgctta aag 23
<210> 5
<211> 36
<212> DNA
<213> Artificial sequence (Unknown)
<400> 5
tgacgataag gtacccggat ccatggccgt agacat 36
<210> 6
<211> 37
<212> DNA
<213> Artificial sequence (Unknown)
<400> 6
gtgctggata tctgcagaat tcttaagaag catatgc 37
Claims (3)
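1. A hydroxylase gene involved in myricetin biosynthesis, characterized in that the hydroxylase is MrF3’5’H and the nucleotide sequence of the hydroxylase gene is shown in SEQ ID NO. 1.
2. The hydroxylase gene involved in myricetin biosynthesis according to claim 1, characterized in that the amino acid sequence of the protein encoded by the hydroxylase gene is shown in SEQ ID NO. 2, and the protein sequence contains the proline-rich, SRS, CR, EXXR and heme-binding conserved domains of the CYP450 75A family.
3. Use of the hydroxylase gene involved in myricetin biosynthesis and its encoded protein according to claims 1 and 2 in genetic engineering to improve the myricetin content and composition of plants.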
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010223426.9A CN111518818B (zh) | 2020-03-26 | 2020-03-26 | A hydroxylase gene involved in myricetin biosynthesis and its application |
| JP2021053581A JP7122715B2 (ja) | 2020-03-26 | 2021-03-26 | Hydroxylase gene and use thereof |
| PCT/CN2021/083269 WO2021190632A1 (zh) | 2020-03-26 | 2021-03-26 | A hydroxylase gene and its application |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010223426.9A CN111518818B (zh) | 2020-03-26 | 2020-03-26 | A hydroxylase gene involved in myricetin biosynthesis and its application |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN111518818A (zh) | 2020-08-11 |
| CN111518818B (zh) | 2021-11-30 |
Family
ID=71902207
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202010223426.9A Active CN111518818B (zh) | A hydroxylase gene involved in myricetin biosynthesis and its application | 2020-03-26 | 2020-03-26 |
Country Status (3)
| Country | Link |
|---|---|
| JP (1) | JP7122715B2 (zh) |
| CN (1) | CN111518818B (zh) |
| WO (1) | WO2021190632A1 (zh) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
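| WO2021190632A1 (zh) * | 2020-03-26 | 2021-09-30 | Zhejiang University | A hydroxylase gene and its application |
| CN113862288A (zh) * | 2021-10-25 | 2021-12-31 | Hangzhou Academy of Agricultural Sciences | Tetrastigma hemsleyanum ThF3’5’H gene and its application |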
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
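| CN117327715B (zh) * | 2023-08-25 | 2024-06-14 | Yunnan Agricultural University | A Bacopa monnieri P450 enzyme gene BmCYP068 and its application |
| CN117866980B (zh) * | 2023-12-29 | 2025-10-17 | Nanyang Medical College | Artemisia argyi AaMYB1 gene and its application in increasing plant flavonoid content |
| CN118773150B (zh) * | 2024-06-03 | 2025-06-17 | Shanghai University | A parthenin C2 hydroxylase, its encoding gene, preparation method and application |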
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2006010117A2 (en) * | 2004-07-10 | 2006-01-26 | The Research Foundation Of State University Of New York | Production of flavonoids by recombinant microorganisms |
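| CN101784667A (zh) * | 2007-06-15 | 2010-07-21 | Pioneer Hi-Bred International, Inc. | Secondary wall forming genes from maize and uses thereof |
| CN108048415A (zh) * | 2018-02-01 | 2018-05-18 | Zhejiang University | Application of two Chinese bayberry flavonol synthase MrFLSs proteins and their encoding genes |
| CN109679972A (zh) * | 2019-01-21 | 2019-04-26 | Zhejiang University | A gene catalyzing UDP-rhamnose biosynthesis in Chinese bayberry, its encoded protein and application |
| CN110408649A (zh) * | 2019-07-25 | 2019-11-05 | China Agricultural University | Application of the Nor gene and its encoded protein in regulating flavonoid synthesis in tomato fruit |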
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
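| JP2004173617A (ja) | 2002-11-28 | 2004-06-24 | Hokko Chem Ind Co Ltd | Anthocyanin synthase genes of cyclamen |
| KR102010833B1 (ko) * | 2017-11-08 | 2019-08-16 | Republic of Korea | Novel gene derived from columbine (Aquilegia) that modifies petal, pistil and flower-center color in floricultural plants, and use thereof |
| CN111518818B (zh) * | 2020-03-26 | 2021-11-30 | Zhejiang University | A hydroxylase gene involved in myricetin biosynthesis and its application |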
2020
- 2020-03-26: CN application CN202010223426.9A, patent CN111518818B (zh), status Active
2021
- 2021-03-26: WO application PCT/CN2021/083269, publication WO2021190632A1 (zh), status Ceased
- 2021-03-26: JP application JP2021053581A, patent JP7122715B2 (ja), status Active
Non-Patent Citations (3)
| Title |
|---|
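| GenBank: "Accession No. XM_031397174: PREDICTED: Pistacia vera flavonoid 3',5'-hydroxylase 2-like (LOC116110956), mRNA", GenBank database * |
| GenBank: "Accession No. XP_030959345: flavonoid 3',5'-hydroxylase 2-like [Quercus lobata]", GenBank database * |
| Xu Ming et al.: "Cloning and expression analysis of the flavanone 3-hydroxylase gene AgF3H from Ampelopsis grossedentata", Acta Botanica Boreali-Occidentalia Sinica * |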
Also Published As
| Publication number | Publication date |
|---|---|
| WO2021190632A1 (zh) | 2021-09-30 |
| CN111518818B (zh) | 2021-11-30 |
| JP2021168646A (ja) | 2021-10-28 |
| JP7122715B2 (ja) | 2022-08-22 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
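| CN111518818A (zh) | A hydroxylase gene involved in myricetin biosynthesis and its application | |
| CN114717247B (zh) | Cassava transcription factors MebHLH72 and MebHLH114 and their application in inhibiting linamarin synthesis | |
| CN108048415B (zh) | Application of two Chinese bayberry flavonol synthase MrFLSs proteins and their encoding genes | |
| Lin et al. | Molecular cloning and functional characterization of multiple NADPH-cytochrome P450 reductases from Andrographis paniculata | |
| CN114634939A (zh) | A PgJMT1 gene regulating methyl jasmonate synthesis in ginseng and its application | |
| CN113265408B (zh) | A Panax notoginseng DOF transcription factor gene PnDof1 and its application | |
| CN110283805A (zh) | A Monascus purpureus ester synthase LIP05, its encoding gene and application | |
| CN102399270A (zh) | MYB transcription factor PtrMYB01 from Populus tomentosa, and cloning method and application of its cDNA | |
| CN109503703B (zh) | Drought- and salt-tolerance gene IpNY-B1, its encoded protein and application | |
| CN115074375B (zh) | A Salvia miltiorrhiza 2-oxoglutarate-dependent dioxygenase gene and its application | |
| CN113186209B (zh) | An Atractylodes lancea squalene synthase gene AlSQS2, its encoded product and application | |
| CN118027166B (zh) | Application of the Grona styracifolia (syn. Desmodium styracifolium) transcription factor GsNSP1 and its encoding gene | |
| CN110283806B (zh) | A Monascus purpureus ester synthase LIP05-50, its encoding gene and application | |
| CN113817692B (zh) | Diosgenin synthesis-related protein, its encoding gene and application | |
| CN102925458B (zh) | Ginseng ABC transporter gene PgPDR2, its encoded protein and application | |
| CN117568290A (zh) | A Dp7-DR protein related to dioscin synthesis, and its gene, application and method | |
| CN106754989A (zh) | Microcos paniculata flavanone-2-hydroxylase, its encoding gene and application | |
| CN114181956B (zh) | Metabolites related to wheat stripe rust resistance, genes related to their synthesis, and applications | |
| CN112899249B (zh) | Trillin rhamnosyltransferase, its encoding gene and application | |
| CN108998428B (zh) | Paris polyphylla var. yunnanensis cholesterol C22 hydroxylase, its encoding gene and application | |
| CN108218969A (zh) | Sweet potato anthocyanin transport-related protein IbGSTF4, its encoding gene and application | |
| CN111647589A (zh) | Euphol (euphadienol) synthase, its encoding gene and application | |
| CN111321128A (zh) | An Entada phaseoloides cytochrome P450 gene, method for obtaining it and application | |
| CN118325880B (zh) | Prunella vulgaris cytochrome oxidase CYP protein, its encoding gene and application | |
| CN115197921B (zh) | Schisandra chinensis pinoresinol-lariciresinol reductase, its encoding gene and application |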
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||