US20040077536A1 - Human and rat pgc-3, ppar-gamma coactivations and splice variants thereof - Google Patents
Human and rat pgc-3, ppar-gamma coactivations and splice variants thereof Download PDFInfo
- Publication number
- US20040077536A1 US20040077536A1 US10/380,492 US38049203A US2004077536A1 US 20040077536 A1 US20040077536 A1 US 20040077536A1 US 38049203 A US38049203 A US 38049203A US 2004077536 A1 US2004077536 A1 US 2004077536A1
- Authority
- US
- United States
- Prior art keywords
- seq
- ser
- pro
- leu
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 241000282414 Homo sapiens Species 0.000 title claims abstract description 61
- 230000006690 co-activation Effects 0.000 title 1
- 239000003814 drug Substances 0.000 claims abstract description 14
- 229940124597 therapeutic agent Drugs 0.000 claims abstract description 9
- 239000002299 complementary DNA Substances 0.000 claims description 58
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 53
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 48
- 229920001184 polypeptide Polymers 0.000 claims description 47
- 238000000034 method Methods 0.000 claims description 44
- 102000040430 polynucleotide Human genes 0.000 claims description 33
- 108091033319 polynucleotide Proteins 0.000 claims description 33
- 239000002157 polynucleotide Substances 0.000 claims description 33
- 230000000694 effects Effects 0.000 claims description 31
- 239000012634 fragment Substances 0.000 claims description 22
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 21
- 150000001875 compounds Chemical class 0.000 claims description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 6
- 208000030159 metabolic disease Diseases 0.000 claims description 6
- 230000018406 regulation of metabolic process Effects 0.000 claims description 6
- 239000013604 expression vector Substances 0.000 claims description 5
- 150000007523 nucleic acids Chemical group 0.000 claims description 5
- 241001465754 Metazoa Species 0.000 claims description 4
- 208000016097 disease of metabolism Diseases 0.000 claims description 4
- 230000001404 mediated effect Effects 0.000 claims description 4
- 238000004519 manufacturing process Methods 0.000 claims description 3
- 150000003839 salts Chemical class 0.000 claims description 3
- 239000003085 diluting agent Substances 0.000 claims description 2
- 239000008194 pharmaceutical composition Substances 0.000 claims description 2
- 230000000063 preceeding effect Effects 0.000 claims 5
- 108090000623 proteins and genes Proteins 0.000 abstract description 43
- 108010016731 PPAR gamma Proteins 0.000 abstract description 13
- 208000008589 Obesity Diseases 0.000 abstract description 12
- 235000020824 obesity Nutrition 0.000 abstract description 12
- 208000001072 type 2 diabetes mellitus Diseases 0.000 abstract description 10
- 201000001320 Atherosclerosis Diseases 0.000 abstract description 7
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 7
- 208000032928 Dyslipidaemia Diseases 0.000 abstract description 6
- 208000031773 Insulin resistance syndrome Diseases 0.000 abstract description 6
- 208000017170 Lipid metabolism disease Diseases 0.000 abstract description 6
- 208000035475 disorder Diseases 0.000 abstract description 6
- 210000000577 adipose tissue Anatomy 0.000 abstract description 5
- 210000000593 adipose tissue white Anatomy 0.000 abstract description 4
- 238000011161 development Methods 0.000 abstract description 4
- 230000001105 regulatory effect Effects 0.000 abstract description 4
- 230000002103 transcriptional effect Effects 0.000 abstract description 4
- 102000000536 PPAR gamma Human genes 0.000 abstract 1
- 239000003614 peroxisome proliferator Substances 0.000 abstract 1
- 108020004414 DNA Proteins 0.000 description 42
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 33
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 33
- 210000004027 cell Anatomy 0.000 description 30
- 102000004169 proteins and genes Human genes 0.000 description 25
- 235000018102 proteins Nutrition 0.000 description 24
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 22
- 230000014509 gene expression Effects 0.000 description 20
- 108010026333 seryl-proline Proteins 0.000 description 18
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 17
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 16
- 241000700159 Rattus Species 0.000 description 15
- 108010050848 glycylleucine Proteins 0.000 description 15
- 210000001789 adipocyte Anatomy 0.000 description 14
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 13
- 108010087924 alanylproline Proteins 0.000 description 13
- 235000001014 amino acid Nutrition 0.000 description 13
- 230000002441 reversible effect Effects 0.000 description 12
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 11
- 102100038825 Peroxisome proliferator-activated receptor gamma Human genes 0.000 description 11
- 239000000523 sample Substances 0.000 description 11
- 238000012163 sequencing technique Methods 0.000 description 11
- 229940024606 amino acid Drugs 0.000 description 10
- 150000001413 amino acids Chemical class 0.000 description 10
- 239000003446 ligand Substances 0.000 description 10
- 230000000692 anti-sense effect Effects 0.000 description 9
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 9
- 108010057821 leucylproline Proteins 0.000 description 9
- 210000001519 tissue Anatomy 0.000 description 9
- 239000013598 vector Substances 0.000 description 9
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 8
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 8
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 8
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 7
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 7
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 7
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 7
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 7
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 7
- 108010013835 arginine glutamate Proteins 0.000 description 7
- 108010060035 arginylproline Proteins 0.000 description 7
- 230000004071 biological effect Effects 0.000 description 7
- 210000002216 heart Anatomy 0.000 description 7
- 108010003700 lysyl aspartic acid Proteins 0.000 description 7
- 102000054765 polymorphisms of proteins Human genes 0.000 description 7
- 108010031719 prolyl-serine Proteins 0.000 description 7
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 6
- 241000880493 Leptailurus serval Species 0.000 description 6
- 102000017946 PGC-1 Human genes 0.000 description 6
- 108700038399 PGC-1 Proteins 0.000 description 6
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 238000003556 assay Methods 0.000 description 6
- 238000002955 isolation Methods 0.000 description 6
- 108010034529 leucyl-lysine Proteins 0.000 description 6
- 108020004999 messenger RNA Proteins 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 108010012581 phenylalanylglutamate Proteins 0.000 description 6
- 238000006467 substitution reaction Methods 0.000 description 6
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 5
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 5
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 5
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 5
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 5
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 5
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 5
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 5
- 108091034117 Oligonucleotide Proteins 0.000 description 5
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 5
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 5
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 5
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 5
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 5
- 239000000427 antigen Substances 0.000 description 5
- 108091007433 antigens Proteins 0.000 description 5
- 102000036639 antigens Human genes 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- 230000027455 binding Effects 0.000 description 5
- 210000000481 breast Anatomy 0.000 description 5
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 5
- 108010009298 lysylglutamic acid Proteins 0.000 description 5
- 210000001980 omental adipocyte Anatomy 0.000 description 5
- 108010070643 prolylglutamic acid Proteins 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 4
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 4
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 4
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 4
- WXKWQSDHEXKKNC-ZKWXMUAHSA-N Cys-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WXKWQSDHEXKKNC-ZKWXMUAHSA-N 0.000 description 4
- HKALUUKHYNEDRS-GUBZILKMSA-N Cys-Leu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HKALUUKHYNEDRS-GUBZILKMSA-N 0.000 description 4
- 238000002965 ELISA Methods 0.000 description 4
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 4
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 4
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 4
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 4
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 4
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 4
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 4
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 4
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 4
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 4
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 4
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 4
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 4
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 4
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 4
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 4
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 4
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 4
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 4
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 108010060199 cysteinylproline Proteins 0.000 description 4
- 230000005714 functional activity Effects 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 108010040030 histidinoalanine Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 238000009396 hybridization Methods 0.000 description 4
- 239000002773 nucleotide Substances 0.000 description 4
- 125000003729 nucleotide group Chemical group 0.000 description 4
- 108010051242 phenylalanylserine Proteins 0.000 description 4
- 108010029020 prolylglycine Proteins 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 230000001225 therapeutic effect Effects 0.000 description 4
- 238000002560 therapeutic procedure Methods 0.000 description 4
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 4
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 3
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 3
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 3
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 3
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 3
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 3
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 3
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 3
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 3
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 3
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 3
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 3
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 3
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 3
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 3
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 3
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 description 3
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 3
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 3
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 3
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 3
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 3
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 3
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 3
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 3
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 3
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 3
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 3
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 3
- WYWBYSPRCFADBM-GARJFASQSA-N His-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O WYWBYSPRCFADBM-GARJFASQSA-N 0.000 description 3
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 3
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 3
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 3
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 3
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 3
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 3
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 3
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 3
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 3
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 3
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 3
- JLYUZRKPDKHUTC-WDSOQIARSA-N Leu-Pro-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JLYUZRKPDKHUTC-WDSOQIARSA-N 0.000 description 3
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 3
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 3
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 3
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 3
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 3
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 3
- HQVPQXMCQKXARZ-FXQIFTODSA-N Pro-Cys-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O HQVPQXMCQKXARZ-FXQIFTODSA-N 0.000 description 3
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 3
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 3
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 3
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 3
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 3
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 3
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 3
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 3
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 3
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 3
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 3
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 3
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 3
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 3
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 3
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 3
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 3
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 3
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 3
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 3
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 3
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 3
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 3
- DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 3
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 3
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 3
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 230000037396 body weight Effects 0.000 description 3
- 238000000423 cell based assay Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 238000004590 computer program Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 238000002866 fluorescence resonance energy transfer Methods 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 3
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 3
- 230000004060 metabolic process Effects 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 239000000546 pharmaceutical excipient Substances 0.000 description 3
- 230000002974 pharmacogenomic effect Effects 0.000 description 3
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 3
- 238000001742 protein purification Methods 0.000 description 3
- 238000003127 radioimmunoassay Methods 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 238000002821 scintillation proximity assay Methods 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 2
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 2
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 2
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 2
- CVHJIWVKTFNGHT-ACZMJKKPSA-N Ala-Gln-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N CVHJIWVKTFNGHT-ACZMJKKPSA-N 0.000 description 2
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 2
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- AETQNIIFKCMVHP-UVBJJODRSA-N Ala-Trp-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AETQNIIFKCMVHP-UVBJJODRSA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 2
- YFBGNGASPGRWEM-DCAQKATOSA-N Arg-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFBGNGASPGRWEM-DCAQKATOSA-N 0.000 description 2
- GDVDRMUYICMNFJ-CIUDSAMLSA-N Arg-Cys-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O GDVDRMUYICMNFJ-CIUDSAMLSA-N 0.000 description 2
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 2
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 2
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 2
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 2
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 2
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 2
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 2
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 2
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 2
- FOWOZYAWODIRFZ-JYJNAYRXSA-N Arg-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCN=C(N)N)N FOWOZYAWODIRFZ-JYJNAYRXSA-N 0.000 description 2
- CGXQUULXFWRJOI-SRVKXCTJSA-N Arg-Val-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O CGXQUULXFWRJOI-SRVKXCTJSA-N 0.000 description 2
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 2
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 2
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 2
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 2
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 2
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 2
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 2
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 2
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 102000008169 Co-Repressor Proteins Human genes 0.000 description 2
- 108010060434 Co-Repressor Proteins Proteins 0.000 description 2
- 108020004635 Complementary DNA Proteins 0.000 description 2
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 2
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 2
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 2
- HEPLXMBVMCXTBP-QWRGUYRKSA-N Cys-Phe-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O HEPLXMBVMCXTBP-QWRGUYRKSA-N 0.000 description 2
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 2
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 2
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 2
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 2
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 2
- 229920002307 Dextran Polymers 0.000 description 2
- 108700039887 Essential Genes Proteins 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 2
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 2
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 2
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 2
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 2
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 2
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 2
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 2
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 2
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 2
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 2
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 2
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 2
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 2
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 2
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 2
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 2
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 2
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 2
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 2
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 2
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 2
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 2
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 2
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 2
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 2
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 2
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 2
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 2
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 2
- DEOQGJUXUQGUJN-KKUMJFAQSA-N His-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DEOQGJUXUQGUJN-KKUMJFAQSA-N 0.000 description 2
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 2
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 2
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 2
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 2
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 2
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 2
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 2
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 2
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 2
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 2
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 2
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 2
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 2
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 2
- YLLWCSDBVGZLOW-CIUDSAMLSA-N Met-Gln-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O YLLWCSDBVGZLOW-CIUDSAMLSA-N 0.000 description 2
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 2
- SCKPOOMCTFEVTN-QTKMDUPCSA-N Met-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCSC)N)O SCKPOOMCTFEVTN-QTKMDUPCSA-N 0.000 description 2
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 2
- CNFMPVYIVQUJOO-NHCYSSNCSA-N Met-Val-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O CNFMPVYIVQUJOO-NHCYSSNCSA-N 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 108010067902 Peptide Library Proteins 0.000 description 2
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 2
- HNURHHFOINNTPL-IHPCNDPISA-N Phe-Cys-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N HNURHHFOINNTPL-IHPCNDPISA-N 0.000 description 2
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 2
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 2
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 2
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 2
- LCWXSALTPTZKNM-CIUDSAMLSA-N Pro-Cys-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O LCWXSALTPTZKNM-CIUDSAMLSA-N 0.000 description 2
- OGRYXQOUFHAMPI-DCAQKATOSA-N Pro-Cys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O OGRYXQOUFHAMPI-DCAQKATOSA-N 0.000 description 2
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 2
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 2
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 2
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 2
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 2
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 2
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 2
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 2
- ZZCJYPLMOPTZFC-SRVKXCTJSA-N Pro-Met-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O ZZCJYPLMOPTZFC-SRVKXCTJSA-N 0.000 description 2
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 2
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 2
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 2
- 241000700157 Rattus norvegicus Species 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 2
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 2
- DBIDZNUXSLXVRG-FXQIFTODSA-N Ser-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N DBIDZNUXSLXVRG-FXQIFTODSA-N 0.000 description 2
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 2
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 2
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 2
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 2
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 2
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 2
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 2
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 2
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 2
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 2
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 2
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 2
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical class OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 2
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 2
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 2
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 2
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 2
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 2
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 2
- 102000040945 Transcription factor Human genes 0.000 description 2
- 108091023040 Transcription factor Proteins 0.000 description 2
- XKKBFNPJFZLTMY-CWRNSKLLSA-N Trp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O XKKBFNPJFZLTMY-CWRNSKLLSA-N 0.000 description 2
- BIBZRFIKOLGWFQ-XIRDDKMYSA-N Trp-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O BIBZRFIKOLGWFQ-XIRDDKMYSA-N 0.000 description 2
- QOIKZODVIPOPDD-AVGNSLFASA-N Tyr-Cys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOIKZODVIPOPDD-AVGNSLFASA-N 0.000 description 2
- YLRLHDFMMWDYTK-KKUMJFAQSA-N Tyr-Cys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 YLRLHDFMMWDYTK-KKUMJFAQSA-N 0.000 description 2
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 2
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 2
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 2
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 2
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 239000004480 active ingredient Substances 0.000 description 2
- 230000003044 adaptive effect Effects 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 239000002775 capsule Substances 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 108010004073 cysteinylcysteine Proteins 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 2
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 210000004408 hybridoma Anatomy 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 210000003734 kidney Anatomy 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 210000000496 pancreas Anatomy 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 230000000069 prophylactic effect Effects 0.000 description 2
- 230000009257 reactivity Effects 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 210000002027 skeletal muscle Anatomy 0.000 description 2
- 238000007920 subcutaneous administration Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- WXPZDDCNKXMOMC-AVGNSLFASA-N (2s)-1-[(2s)-2-[[(2s)-1-(2-aminoacetyl)pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carboxylic acid Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@H](C(O)=O)CCC1 WXPZDDCNKXMOMC-AVGNSLFASA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- ALBODLTZUXKBGZ-JUUVMNCLSA-N (2s)-2-amino-3-phenylpropanoic acid;(2s)-2,6-diaminohexanoic acid Chemical compound NCCCC[C@H](N)C(O)=O.OC(=O)[C@@H](N)CC1=CC=CC=C1 ALBODLTZUXKBGZ-JUUVMNCLSA-N 0.000 description 1
- 108010000239 Aequorin Proteins 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- SGFBVLBKDSXGAP-GKCIPKSASA-N Ala-Phe-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N SGFBVLBKDSXGAP-GKCIPKSASA-N 0.000 description 1
- WPWUFUBLGADILS-WDSKDSINSA-N Ala-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WPWUFUBLGADILS-WDSKDSINSA-N 0.000 description 1
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- LTTLSZVJTDSACD-OWLDWWDNSA-N Ala-Thr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LTTLSZVJTDSACD-OWLDWWDNSA-N 0.000 description 1
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- JVMKBJNSRZWDBO-FXQIFTODSA-N Arg-Cys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O JVMKBJNSRZWDBO-FXQIFTODSA-N 0.000 description 1
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- MMGCRPZQZWTZTA-IHRRRGAJSA-N Arg-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N MMGCRPZQZWTZTA-IHRRRGAJSA-N 0.000 description 1
- IRRMIGDCPOPZJW-ULQDDVLXSA-N Arg-His-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IRRMIGDCPOPZJW-ULQDDVLXSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 1
- FOQFHANLUJDQEE-GUBZILKMSA-N Arg-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CS)C(=O)O FOQFHANLUJDQEE-GUBZILKMSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- CIBWFJFMOBIFTE-CIUDSAMLSA-N Asn-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N CIBWFJFMOBIFTE-CIUDSAMLSA-N 0.000 description 1
- RJUHZPRQRQLCFL-IMJSIDKUSA-N Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O RJUHZPRQRQLCFL-IMJSIDKUSA-N 0.000 description 1
- GMCOADLDNLGOFE-ZLUOBGJFSA-N Asn-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N GMCOADLDNLGOFE-ZLUOBGJFSA-N 0.000 description 1
- IIFDPDVJAHQFSR-WHFBIAKZSA-N Asn-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O IIFDPDVJAHQFSR-WHFBIAKZSA-N 0.000 description 1
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 1
- IQTUDDBANZYMAR-WDSKDSINSA-N Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O IQTUDDBANZYMAR-WDSKDSINSA-N 0.000 description 1
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 1
- JHFNSBBHKSZXKB-VKHMYHEASA-N Asp-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(O)=O JHFNSBBHKSZXKB-VKHMYHEASA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- CMCIMCAQIULNDJ-CIUDSAMLSA-N Asp-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N CMCIMCAQIULNDJ-CIUDSAMLSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- ZVYYMCXVPZEAPU-CWRNSKLLSA-N Asp-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZVYYMCXVPZEAPU-CWRNSKLLSA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- UXIPUCUHQBIQOS-SRVKXCTJSA-N Asp-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UXIPUCUHQBIQOS-SRVKXCTJSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000255789 Bombyx mori Species 0.000 description 1
- BHPQYMZQTOCNFJ-UHFFFAOYSA-N Calcium cation Chemical compound [Ca+2] BHPQYMZQTOCNFJ-UHFFFAOYSA-N 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 101100007328 Cocos nucifera COS-1 gene Proteins 0.000 description 1
- 108020004394 Complementary RNA Proteins 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- IVOMOUWHDPKRLL-KQYNXXCUSA-N Cyclic adenosine monophosphate Chemical compound C([C@H]1O2)OP(O)(=O)O[C@H]1[C@@H](O)[C@@H]2N1C(N=CN=C2N)=C2N=C1 IVOMOUWHDPKRLL-KQYNXXCUSA-N 0.000 description 1
- VNLYIYOYUNGURO-ZLUOBGJFSA-N Cys-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N VNLYIYOYUNGURO-ZLUOBGJFSA-N 0.000 description 1
- BIVLWXQGXJLGKG-BIIVOSGPSA-N Cys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)C(=O)O BIVLWXQGXJLGKG-BIIVOSGPSA-N 0.000 description 1
- ZIKWRNJXFIQECJ-CIUDSAMLSA-N Cys-Cys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZIKWRNJXFIQECJ-CIUDSAMLSA-N 0.000 description 1
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 1
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 1
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 1
- WAJDEKCJRKGRPG-CIUDSAMLSA-N Cys-His-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N WAJDEKCJRKGRPG-CIUDSAMLSA-N 0.000 description 1
- VTBGVPWSWJBERH-DCAQKATOSA-N Cys-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CS)N VTBGVPWSWJBERH-DCAQKATOSA-N 0.000 description 1
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 1
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 1
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 1
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 1
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 1
- XAHWYEYOMSGKDA-CWRNSKLLSA-N Cys-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CS)N)C(=O)O XAHWYEYOMSGKDA-CWRNSKLLSA-N 0.000 description 1
- 238000007400 DNA extraction Methods 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 108010013369 Enteropeptidase Proteins 0.000 description 1
- 102100029727 Enteropeptidase Human genes 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 208000036566 Erythroleukaemia Diseases 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 1
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 1
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 1
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 1
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 1
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 1
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- MFHVAWMMKZBSRQ-ACZMJKKPSA-N Gln-Ser-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N MFHVAWMMKZBSRQ-ACZMJKKPSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- YADSXULAFMJZRL-QEJZJMRPSA-N Gln-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N YADSXULAFMJZRL-QEJZJMRPSA-N 0.000 description 1
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 1
- PABVKUJVLNMOJP-WHFBIAKZSA-N Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(O)=O PABVKUJVLNMOJP-WHFBIAKZSA-N 0.000 description 1
- LSTFYPOGBGFIPP-FXQIFTODSA-N Glu-Cys-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O LSTFYPOGBGFIPP-FXQIFTODSA-N 0.000 description 1
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- QYPKJXSMLMREKF-BPUTZDHNSA-N Glu-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N QYPKJXSMLMREKF-BPUTZDHNSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- CBOVGULVQSVMPT-CIUDSAMLSA-N Glu-Pro-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CBOVGULVQSVMPT-CIUDSAMLSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- QSQXZZCGPXQBPP-BQBZGAKWSA-N Gly-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)CN)C(=O)N[C@@H](CS)C(=O)O QSQXZZCGPXQBPP-BQBZGAKWSA-N 0.000 description 1
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- LBHOVGUGOBINDL-KKUMJFAQSA-N His-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O LBHOVGUGOBINDL-KKUMJFAQSA-N 0.000 description 1
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 1
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 1
- AYUOWUNWZGTNKB-ULQDDVLXSA-N His-Phe-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AYUOWUNWZGTNKB-ULQDDVLXSA-N 0.000 description 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 1
- 101000899111 Homo sapiens Hemoglobin subunit beta Proteins 0.000 description 1
- 101000590492 Homo sapiens Nuclear fragile X mental retardation-interacting protein 1 Proteins 0.000 description 1
- LVQDUPQUJZWKSU-PYJNHQTQSA-N Ile-Arg-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LVQDUPQUJZWKSU-PYJNHQTQSA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 1
- 206010065042 Immune reconstitution inflammatory syndrome Diseases 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- YSKSXVKQLLBVEX-SZMVWBNQSA-N Leu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 YSKSXVKQLLBVEX-SZMVWBNQSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 1
- PGLGNCVOWIORQE-SRVKXCTJSA-N Lys-His-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O PGLGNCVOWIORQE-SRVKXCTJSA-N 0.000 description 1
- FMIIKPHLJKUXGE-GUBZILKMSA-N Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN FMIIKPHLJKUXGE-GUBZILKMSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- CRIODIGWCUPXKU-AVGNSLFASA-N Lys-Pro-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O CRIODIGWCUPXKU-AVGNSLFASA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 1
- UOENBSHXYCHSAU-YUMQZZPRSA-N Met-Gln-Gly Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UOENBSHXYCHSAU-YUMQZZPRSA-N 0.000 description 1
- WXUUEPIDLLQBLJ-DCAQKATOSA-N Met-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N WXUUEPIDLLQBLJ-DCAQKATOSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 102000007999 Nuclear Proteins Human genes 0.000 description 1
- 108010089610 Nuclear Proteins Proteins 0.000 description 1
- 102100032428 Nuclear fragile X mental retardation-interacting protein 1 Human genes 0.000 description 1
- 102000007399 Nuclear hormone receptor Human genes 0.000 description 1
- 108020005497 Nuclear hormone receptor Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 206010033307 Overweight Diseases 0.000 description 1
- 238000009004 PCR Kit Methods 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 238000002944 PCR assay Methods 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 102000003728 Peroxisome Proliferator-Activated Receptors Human genes 0.000 description 1
- 108090000029 Peroxisome Proliferator-Activated Receptors Proteins 0.000 description 1
- 102000012132 Peroxisome proliferator-activated receptor gamma Human genes 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- GKZIWHRNKRBEOH-HOTGVXAUSA-N Phe-Phe Chemical compound C([C@H]([NH3+])C(=O)N[C@@H](CC=1C=CC=CC=1)C([O-])=O)C1=CC=CC=C1 GKZIWHRNKRBEOH-HOTGVXAUSA-N 0.000 description 1
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 1
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 1
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- FSXRLASFHBWESK-HOTGVXAUSA-N Phe-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 FSXRLASFHBWESK-HOTGVXAUSA-N 0.000 description 1
- HMNSRTLZAJHSIK-YUMQZZPRSA-N Pro-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 HMNSRTLZAJHSIK-YUMQZZPRSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 1
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- BEPSGCXDIVACBU-IUCAKERBSA-N Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CN=CN1 BEPSGCXDIVACBU-IUCAKERBSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 1
- SWRNSCMUXRLHCR-ULQDDVLXSA-N Pro-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 SWRNSCMUXRLHCR-ULQDDVLXSA-N 0.000 description 1
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 1
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 1
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 1
- MDAWMJUZHBQTBO-XGEHTFHBSA-N Pro-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1)O MDAWMJUZHBQTBO-XGEHTFHBSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- YIPFBJGBRCJJJD-FHWLQOOXSA-N Pro-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 YIPFBJGBRCJJJD-FHWLQOOXSA-N 0.000 description 1
- SNSYSBUTTJBPDG-OKZBNKHCSA-N Pro-Trp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N4CCC[C@@H]4C(=O)O SNSYSBUTTJBPDG-OKZBNKHCSA-N 0.000 description 1
- VPBQDHMASPJHGY-JYJNAYRXSA-N Pro-Trp-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CO)C(=O)O VPBQDHMASPJHGY-JYJNAYRXSA-N 0.000 description 1
- VBZXFFYOBDLLFE-HSHDSVGOSA-N Pro-Trp-Thr Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@H](O)C)C(O)=O)C(=O)[C@@H]1CCCN1 VBZXFFYOBDLLFE-HSHDSVGOSA-N 0.000 description 1
- BVRBCQBUNGAWFP-KKUMJFAQSA-N Pro-Tyr-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O BVRBCQBUNGAWFP-KKUMJFAQSA-N 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 101100409191 Rattus norvegicus Ppargc1a gene Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- UJTZHGHXJKIAOS-WHFBIAKZSA-N Ser-Gln Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O UJTZHGHXJKIAOS-WHFBIAKZSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- UICKAKRRRBTILH-GUBZILKMSA-N Ser-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N UICKAKRRRBTILH-GUBZILKMSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- CLKKNZQUQMZDGD-SRVKXCTJSA-N Ser-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CN=CN1 CLKKNZQUQMZDGD-SRVKXCTJSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 1
- AXVNLRQLPLSIPQ-FXQIFTODSA-N Ser-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N AXVNLRQLPLSIPQ-FXQIFTODSA-N 0.000 description 1
- ZGFRMNZZTOVBOU-CIUDSAMLSA-N Ser-Met-Gln Chemical compound N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)O ZGFRMNZZTOVBOU-CIUDSAMLSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- WOJYIMBIKTWKJO-KKUMJFAQSA-N Ser-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N WOJYIMBIKTWKJO-KKUMJFAQSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- LZLREEUGSYITMX-JQWIXIFHSA-N Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(O)=O)=CNC2=C1 LZLREEUGSYITMX-JQWIXIFHSA-N 0.000 description 1
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- ILVGMCVCQBJPSH-WDSKDSINSA-N Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO ILVGMCVCQBJPSH-WDSKDSINSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 1
- 239000012163 TRI reagent Substances 0.000 description 1
- 229940123464 Thiazolidinedione Drugs 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- NFMPFBCXABPALN-OWLDWWDNSA-N Thr-Ala-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O NFMPFBCXABPALN-OWLDWWDNSA-N 0.000 description 1
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- LOHBIDZYHQQTDM-IXOXFDKPSA-N Thr-Cys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LOHBIDZYHQQTDM-IXOXFDKPSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- KDWZQYUTMJSYRJ-BHYGNILZSA-N Trp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O KDWZQYUTMJSYRJ-BHYGNILZSA-N 0.000 description 1
- UHXOYRWHIQZAKV-SZMVWBNQSA-N Trp-Pro-Arg Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O UHXOYRWHIQZAKV-SZMVWBNQSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- SEFNTZYRPGBDCY-IHRRRGAJSA-N Tyr-Arg-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O SEFNTZYRPGBDCY-IHRRRGAJSA-N 0.000 description 1
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- KGSDLCMCDFETHU-YESZJQIVSA-N Tyr-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O KGSDLCMCDFETHU-YESZJQIVSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 1
- IVOMOUWHDPKRLL-UHFFFAOYSA-N UNPD107823 Natural products O1C2COP(O)(=O)OC2C(O)C1N1C(N=CN=C2N)=C2N=C1 IVOMOUWHDPKRLL-UHFFFAOYSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 1
- YKZVPMUGEJXEOR-JYJNAYRXSA-N Val-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N YKZVPMUGEJXEOR-JYJNAYRXSA-N 0.000 description 1
- OIBDVHSTOUGZTJ-PEBLQZBPSA-N [(2r,3r,4s,5s,6s)-3,4,6-triacetyloxy-5-(trifluoromethylsulfonyloxy)oxan-2-yl]methyl acetate Chemical compound CC(=O)OC[C@H]1O[C@@H](OC(C)=O)[C@@H](OS(=O)(=O)C(F)(F)F)[C@@H](OC(C)=O)[C@@H]1OC(C)=O OIBDVHSTOUGZTJ-PEBLQZBPSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 210000003486 adipose tissue brown Anatomy 0.000 description 1
- 230000011759 adipose tissue development Effects 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 150000003862 amino acid derivatives Chemical class 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- YZXBAPSDXZZRGB-DOFZRALJSA-N arachidonic acid Chemical class CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YZXBAPSDXZZRGB-DOFZRALJSA-N 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000007321 biological mechanism Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 229910001424 calcium ion Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 235000019577 caloric intake Nutrition 0.000 description 1
- -1 cationic lipid Chemical class 0.000 description 1
- 230000019522 cellular metabolic process Effects 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 235000013330 chicken meat Nutrition 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 230000003081 coactivator Effects 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 208000029078 coronary artery disease Diseases 0.000 description 1
- 239000006071 cream Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 229940095074 cyclic amp Drugs 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 208000010643 digestive system disease Diseases 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 229940000406 drug candidate Drugs 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 238000005538 encapsulation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 235000019197 fats Nutrition 0.000 description 1
- 230000003328 fibroblastic effect Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 239000005090 green fluorescent protein Substances 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 229920001519 homopolymer Polymers 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 208000026278 immune system disease Diseases 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 238000000760 immunoelectrophoresis Methods 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical class N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 1
- 230000004068 intracellular signaling Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 101150066555 lacZ gene Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000008263 liquid aerosol Substances 0.000 description 1
- 108010056787 lysyl-arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000003818 metabolic dysfunction Effects 0.000 description 1
- 150000002739 metals Chemical class 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- YACKEPLHDIMKIO-UHFFFAOYSA-N methylphosphonic acid Chemical class CP(O)(O)=O YACKEPLHDIMKIO-UHFFFAOYSA-N 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- ZAHQPTJLOCWVPG-UHFFFAOYSA-N mitoxantrone dihydrochloride Chemical compound Cl.Cl.O=C1C2=C(O)C=CC(O)=C2C(=O)C2=C1C(NCCNCCO)=CC=C2NCCNCCO ZAHQPTJLOCWVPG-UHFFFAOYSA-N 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 239000007923 nasal drop Substances 0.000 description 1
- 229940100662 nasal drops Drugs 0.000 description 1
- 239000007922 nasal spray Substances 0.000 description 1
- 229940097496 nasal spray Drugs 0.000 description 1
- 239000002547 new drug Substances 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 239000002674 ointment Substances 0.000 description 1
- 239000008177 pharmaceutical agent Substances 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 230000037081 physical activity Effects 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 229940124606 potential therapeutic agent Drugs 0.000 description 1
- 210000000229 preadipocyte Anatomy 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000005180 public health Effects 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 238000003156 radioimmunoprecipitation Methods 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000008844 regulatory mechanism Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 230000001235 sensitizing effect Effects 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 210000003207 subcutaneous adipocyte Anatomy 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 229910021653 sulphate ion Inorganic materials 0.000 description 1
- 239000000829 suppository Substances 0.000 description 1
- 230000035924 thermogenesis Effects 0.000 description 1
- 230000000476 thermogenic effect Effects 0.000 description 1
- 150000001467 thiazolidinediones Chemical class 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 1
- 229940038773 trisodium citrate Drugs 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 238000003160 two-hybrid assay Methods 0.000 description 1
- 238000010396 two-hybrid screening Methods 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/5759—Products of obesity genes, e.g. leptin, obese (OB), tub, fat
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/04—Anorexiants; Antiobesity agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/06—Antihyperlipidemics
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/08—Drugs for disorders of the metabolism for glucose homeostasis
- A61P3/10—Drugs for disorders of the metabolism for glucose homeostasis for hyperglycaemia, e.g. antidiabetics
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P9/00—Drugs for disorders of the cardiovascular system
- A61P9/10—Drugs for disorders of the cardiovascular system for treating ischaemic or atherosclerotic diseases, e.g. antianginal drugs, coronary vasodilators, drugs for myocardial infarction, retinopathy, cerebrovascula insufficiency, renal arteriosclerosis
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
Definitions
- rat clone having a high degree of homology to PGC-3 (cf. Example 5).
- Polynucleotide and polypeptide molecules based on the rat PGC-3 sequence may be used by analogy with the human sequences.
- the rat sequence shows a high degree of sequence homology (78% sequence identity and rats are therefore expected to be useful in animal models of metabolism.
- homologues and orthologues of the isolated and purified polynucleotide molecules of the present invention are polynucleotide molecules which display greater than 80% sequence homology, conveniently greater than 85%, for example 90%, to the PGC-3 cDNA sequences set out in SEQ ID NO:1 and SEQ ID NO:3.
- a homologue may be a polynucleotide molecule from the same species i.e. a homologous family member, alternatively, the homologue may be a similar polynucleotide molecule from a different species such as human, useful in developing new therapies for the treatment of IRS and other related disorders such as NIDDM, obesity and atherosclerosis.
- orthologue we mean a functionally equivalent molecule in another species.
- the full sequences of the individual homologues and orthologues may be determined using conventional techniques such as hybridisation, PCR and sequencing techniques, starting with any convenient part of the sequence set out in SEQ ID NO: 1 or SEQ ID NO:3.
- polypeptides of the present invention may be expressed in a variety of hosts such as bacteria, plant cells, insect cells, fungal cells and human and animal cells.
- Eukaryotic recombinant host cells are especially preferred. Examples include yeast, mammalian cells including cell lines of human, bovine, porcine, monkey and rodent origin, and insect cells including Drosophila and silkworm derived cell lines.
- Polypeptides of the present invention may be expressed as fusion proteins, for example with one or more additional polypeptide domains added to facilitate protein purification.
- additional polypeptides include metal chelating peptides such as histidine-tryptophan modules that allow purification on immobilised metals (Porath, J., Protein Exp. Purif. 3:263 (1992)), protein A domains that allow purification on immobilised immunoglobulin, and the domain utilised in the FLAGS extension/affinity purification system (Immunex Corp, Seattle Wash.).
- a purified polypeptide comprising the human PGC-3a amino acid sequence set out in SEQ ID NO.2 or a variant of SEQ ID NO.2 having at least about 90% homology to a member selected from (SEQ ID NO.2 positions 1-600, SEQ ID NO.2 positions 400-1002, SEQ ID NO.2 positions 200-800), or a biologically active fragment thereof.
- a variant is a polynucleotide or polypeptide which differs from a reference polynucleotide or polypeptide, but which retains some of its essential characteristics.
- a variant of a PGC-3 polypeptide may have an amino acid sequence that is different by one or more amino acid substitutions, deletions and/or additions.
- the variant may have conservative changes (amino acid similarity), wherein a substituted amino acid has similar structural or chemical properties, for example, the replacement of leucine with isoleucine.
- a variant may have nonconservative changes, e.g., replacement of a glycine with a tryptophan.
- “Homology” as used in this description is a measure of the similarity or identity of nucleotide sequences or amino acid sequences. In order to characterise the homology, subject sequences are aligned so that the highest order identity match is obtained. Identity can be calculated using published techniques. Computer program methods to determine identity between two sequences, for example, include DNAStar software (DNAStar Inc., Madison, Wis.); the GCG program package (Devereux, J., et al., Nucleic Acids Research 1984, 12(1):387); and BLASTP, BLASTN, FASTA (Atschul, S. F. et al., J Molec Biol 1990, 215:403).
- Antisense molecules may also be synthesised for use in antisense therapy, using techniques known to persons skilled in the art. These antisense molecules may be DNA, stable derivatives of DNA such as phosphorothioates or methylphosphonates, RNA, stable derivatives of RNA such as 2′-O-alkylRNA, or other oligonucleotide mimetics.
- Polyclonal antibodies can be readily generated from a variety of sources, for example, horses, cows, goats, sheep, dogs, chickens, rabbits, mice or rats, using procedures that are well-known in the art.
- antigen is administered to the host animal typically through parenteral injection.
- the immunogenicity of antigen may be enhanced through the use of an adjuvant, for example, Freund's complete or incomplete adjuvant.
- an adjuvant for example, Freund's complete or incomplete adjuvant.
- small samples of serum are collected and tested for reactivity to antigen.
- the monoclonal antibodies of the invention can be produced using alternative techniques, such as those described by Alting-Mees et al., “Monoclonal Antibody Expression Libraries: A Rapid Alternative to Hybridomas”, Strategies in Molecular Biology 3: 1-9 (1990) which is incorporated herein by reference.
- binding partners can be constructed using recombinant DNA techniques to incorporate the variable regions of a gene that encodes a specific binding antibody. Such a technique is described in Larrick et al., Biotechnology, 7: 394 (1989).
- a preferred cellular assay system for use in the method of the invention is a two-hybrid assay system.
- the two-hybrid system utilises the ability of a pair of interacting proteins to bring the activation domain of a transcription factor into close proximity with its DNA-binding domain, restoring the functional activity of the transcription factor and inducing the expression of a reporter gene (S Fields & O Song, Nature, 340, 245-246, 1989).
- Commercially available systems such as the Clontech Matchmakers systems and protocols may be used with the present invention.
- a pharmaceutical composition which comprises a novel PGC-3 modulator, or a pharmaceutically acceptable salt thereof, in association with a pharmaceutically acceptable diluent or carrier.
- SEQ ID NO.3. shows human PGC-3b cDNA
- SEQ ID NO 10 shows rat PGC-3 protein sequence
- PCR primers CME 9748 and CME 9749, listed in Table 2 were synthesised and used to amplify a 347 bp product from human adipose cDNA (Human adipocyte Marathon Ready cDNA, Clontech Cat.# 7447-1, Clontech, Basingstoke, UK).
- RACE Rapid Amplification of cDNA Ends
- PCR screening of the master plate identified several wells positive for PGC-3a cDNA.
- the subplates corresponding to these wells were obtained from Origene and a subsequent round of PCR screening was performed to identify individual clones containing PGC-3a cDNA.
- the complete cDNA sequence for PGC-3b is shown in SEQ ID NO:3.
- the PGC-3b cDNA sequence comprises a coding region of 2991 nucleotides encoding a protein of 996 amino acids.
- the protein sequence for PGC-3b is shown in SEQ ID NO:4.
- FIG. 4 shows the mean relative expression of PGC-3 mRNA in the human tissues listed above. Where multiple samples of the same tissue type were assayed the sample number is given, eg Omental adipocyte S24 refers to the data obtained from the omental adipocyte sample number 24. These results demonstrate that PGC-3 is highly expressed in human breast adipocytes, omental adipocytes and subcutaneous adipocytes. High levels of expression of PGC-3 were also observed in lung and heart samples. Expression of PGC-3 was lower in human kidney, skeletal muscle, pancreas and liver samples.
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Diabetes (AREA)
- Obesity (AREA)
- Veterinary Medicine (AREA)
- General Chemical & Material Sciences (AREA)
- Pharmacology & Pharmacy (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Hematology (AREA)
- Endocrinology (AREA)
- Child & Adolescent Psychology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Molecular Biology (AREA)
- Genetics & Genomics (AREA)
- Toxicology (AREA)
- Zoology (AREA)
- Gastroenterology & Hepatology (AREA)
- Emergency Medicine (AREA)
- Urology & Nephrology (AREA)
- Heart & Thoracic Surgery (AREA)
- Cardiology (AREA)
- Vascular Medicine (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Peptides Or Proteins (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
A gene PGC-3 PPAR gamma coactivator-3, and it's role in regulating the transcriptional activity of peroxisome proliferator activated receptor-Y PPAR-Y in adipose tissue. PGC-3 is highly expressed in human white adipose tissue and has utility in the development of new therapeutic agents for use in the treatement of obesity and other related disorders such as non-insulin dependent diabetes mellitus, insulin resistance syndrome, dyslipidemia, and atherosclerosis.
Description
- This invention relates to the regulation of metabolism and in particular to human genes involved in obesity. The invention further relates to proteins encoded by the genes and to means of regulating their biological activity. In addition the invention relates to the use of the genes and proteins to identify therapeutic agents for controlling obesity and other related disorders such as non-insulin dependent diabetes mellitus (NIDDM), insulin resistance syndrome, dyslipidemia, and atherosclerosis.
- Obesity results from an excessive accumulation of adipose tissue and is a growing public health problem in developed and developing countries. Highly overweight individuals show significant increases in the occurrence of NIDDM, coronary heart disease, some cancers and digestive diseases.
- Current treatment is unsatisfactory and new drugs need to be developed. A major problem is that the mechanisms regulating obesity, and the role of increased adiposity in the development of metabolic dysfunction are unclear. What is apparent is that obesity results from an imbalance between energy intake and expenditure. Energy expenditure can be affected by alterations in basal metabolism, physical activity and adaptive thermogenesis.
- Peroxisome proliferator-activated receptor-γ (PPARγ) is a recently identified member of the peroxisome proliferator-activated receptor family of nuclear hormone receptors (Tontonoz et al., Genes Dev. (1994) 8, 1224-1234). The expression of this protein is induced very early in the adipocyte differentiation process and, when expressed ectopically in fibroblastic cells, induces adipogenesis in response to activators of the receptor. Synthetic and naturally occurring ligands for PPARγ have been identified. Thiazolidinediones CIZDs), a class of insulin sensitising agents which are used for the treatment of NIDDM, have been shown to bind to and activate PPARγ. TZDs promote adipocyte differentiation of murine and human preadipocytes to mature, fat storing adipocytes. TZD activation of PPARγ has also been shown to regulate transcription of many adipocyte genes. In addition to the presence of a ligand, the activity of PPARγ has been shown to be influenced by the presence of coactivators and corepressors. When co-expressed in cells alongside PPARγ these proteins have been shown to greatly increase or repress the transcriptional activity of PPARγ. Differences in expression of these coactivators and corepressors between cell types may explain the observed differences in PPARγ mediated transcriptional activity between cells from different tissues.
- One such coactivator is PGC-1 (Puigserver et al., Cell (1998) 92, 829-839). The expression of this 90 kDa nuclear protein is greatly increased in muscle and brown fat of mice upon their exposure to cold temperatures. Co-expression of PGC-1 with PPARγ has been shown to activate aspects of the adaptive thermogenic program.
- However, PGC-1 is not expressed in white adipose tissue which makes up the majority of adipose tissue found in humans. The identification of a protein which regulates the activity of PPARγ in white adipose tissue is thus of great importance in understanding the development of human obesity.
- In the present invention we disclose the cloning and identification of PGC-3, and its role in regulating the transcriptional activity of PPARγ in adipose tissue. PGC-3 is highly expressed in human white adipose tissue and shares sequence homology with PGC-1 in domains known to be responsible for distinct activity of the protein. Two distinct variants of PGC-3 have been identified, termed PGC-3a and PGC-3b, which arise from alternative splicing of the PGC-3 gene. A further splice variant has been identified, termed PGC-3c. Full length cDNA and protein sequences for each of the splice variants are provided. The invention further discloses that PGC-3 has utility in the development of new therapeutic agents for use in the treatment of obesity and other related disorders such as non-insulin dependent diabetes mellitus, insulin resistance syndrome, dyslipidemia, and atherosclerosis. The invention further provides methods for the identification of such therapeutic agents.
- Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is commonly understood by one of skill in the art to which this invention belongs. All publications and patents referred to herein are incorporated by reference.
- The term “PGC-3” as used herein encompasses both of the splice variants, PGC-3a and PGC-3b, as well as PGC-3c.
- According to one aspect of the present invention we provide an isolated and purified polynucleotide molecule comprising a nucleic acid sequence which encodes a polypeptide having at least about 90% homology to a member selected from any one of
- (a) (SEQ ID NO:2, SEQ ID NO:2 positions 1-600, SEQ ID NO:2 positions 400-1002, and SEQ ID NO:2 positions 200-800)
- or
- (b) (SEQ ID NO:4, SEQ ID NO:4 positions 1-600, SEQ ID NO:4 positions 400-996, and SEQ ID NO:4 positions 200-800)
- or
- (c) (SEQ ID NO:8, SEQ ID NO:8 positions 1-600, SEQ ID NO:4 positions 400-1023, and SEQ ID NO:4 positions 200-800
- Isolated and purified polynucleotides of the present invention include sequences which comprise the human PGC-3a cDNA sequence set out in SEQ ID NO:1 and the human PGC-3b cDNA sequence set out in SEQ ID NO:3 and the human PGC-3c cDNA sequence set out in SED ID NO:7.
- In addition we have also identified and sequenced a rat clone having a high degree of homology to PGC-3 (cf. Example 5). Polynucleotide and polypeptide molecules based on the rat PGC-3 sequence may be used by analogy with the human sequences. The rat sequence shows a high degree of sequence homology (78% sequence identity and rats are therefore expected to be useful in animal models of metabolism.
- Therefore in a further aspect of the invention we provide an isolated and purified polynucleotide molecule comprising a nucleic acid sequence which encodes a polypeptide having at least about 90% homology to any one of SEQ ID NO:9, SEQ ID NO:9 positions 1-600, SEQ ID NO:2 positions 400-990, and SEQ ID NO:9 positions 200-800.
- In a further aspect of the invention we provide fragments of the isolated and purified polynucleotide molecules of the present invention. By fragments we mean contiguous regions of the polynucleotide molecule including complementary DNA and RNA sequences, starting with short sequences useful as probes or primers of say about 8-50 bases, such as 10-30 bases or 15-35 bases, to longer sequences of up to 50, 100, 200, 500 or 1000 bases. Indeed any convenient fragment of the polynucleotide molecule may be a useful fragment for further research, therapeutic or diagnostic purposes. Further convenient fragments include those whose terminii are defined by restriction sites within the molecule of one or more kinds, such as any combination of Rsa1, Alu1 and Hinf1.
- In a further aspect we provide homologues and orthologues of the isolated and purified polynucleotide molecules of the present invention. Preferred homologues and orthologues are polynucleotide molecules which display greater than 80% sequence homology, conveniently greater than 85%, for example 90%, to the PGC-3 cDNA sequences set out in SEQ ID NO:1 and SEQ ID NO:3. A homologue may be a polynucleotide molecule from the same species i.e. a homologous family member, alternatively, the homologue may be a similar polynucleotide molecule from a different species such as human, useful in developing new therapies for the treatment of IRS and other related disorders such as NIDDM, obesity and atherosclerosis. By the term orthologue we mean a functionally equivalent molecule in another species. The full sequences of the individual homologues and orthologues may be determined using conventional techniques such as hybridisation, PCR and sequencing techniques, starting with any convenient part of the sequence set out in SEQ ID NO: 1 or SEQ ID NO:3.
- In a further aspect of the invention we provide isolated and purified polynucleotide molecules capable of specifically hybridising to the polynucleotide molecules of the present invention. By specifically hybridising we mean that the polynucleotide hybridises by base-pair interactions, under stringent conditions, to the polynucleotide molecules of the present invention or to the corresponding complementary sequences. Experimental procedures for hybridisation under stringent conditions are well known to persons skilled in the art. For example, hybridisation filters may be incubated overnight at 42° C. in a solution comprising 50% formamide, 5×SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH7.6) 5×Denhardt's solution, 10% dextran sulphate, and 20 μg/ml denatured salmon sperm DNA; followed by washing the filters in 0.1×SSC at about 65° C. Hybridisation techniques are thoroughly described in Sambrook J., Fritsch E. F. and Maniatis T., Molecular Cloning a Laboratory Manual, Cold Spring Harbor Laboratory Press, 1989.
- In a further aspect we provide an expression vector comprising a polynucleotide molecule of the present invention.
- A variety of mammalian expression vectors may be used to express the recombinant polypeptides of the present invention. Commercially available mammalian expression vectors which are suitable for recombinant expression include, pcDNA3 (Invitrogen), pMC1neo (Stratagene), pXT1 (Stratagene), pSG5 (Stratagene), EBO-pSV2-neo (ATCC 37593), pBPV-1(8-2) (ATCC 37110), pdBPV-MMTneo(342-12) (ATCC 37224), pRSVgpt (ATCC 37199), pRSVneo (ATCC 37198), pSV2-dhfr (ATCC 37146), pUCTag (ATCC 37460), IZD35 (ATCC 37565), pLXIN, pSIR (CLONTECH), and pIRES-EGFP (CLONTECH).
- Baculoviral expression systems may also be used with the present invention to produce high yields of biologically active polypeptides. Preferred vectors include the CLONTECH, BacPak™ Baculovirus expression system and protocols which are commercially available (CLONTECH, Palo Alto, Calif.).
- Further preferred vectors include vectors for use with the mouse erythroleukaemia cell (MEL cell) expression system comprising the human beta globin gene locus control region (Davies et al., J. of Pharmacol. and Toxicol.
Methods 33, 153-158). - Vectors comprising one or more polynucleotide molecules of the present invention may then be purified and introduced into appropriate host cells. Therefore in a further aspect we provide a transformed host cell comprising a polynucleotide molecule of the present invention.
- The polypeptides of the present invention may be expressed in a variety of hosts such as bacteria, plant cells, insect cells, fungal cells and human and animal cells. Eukaryotic recombinant host cells are especially preferred. Examples include yeast, mammalian cells including cell lines of human, bovine, porcine, monkey and rodent origin, and insect cells including Drosophila and silkworm derived cell lines. Cell lines derived from mammalian species which may be used and which are commercially available include, L cells L-M(TK-) (ATCC CCL 1.3), L cells L-M (ATCC CCL 1.2), HEK 293 (ATCC CRL 1573), Raji (ATCC CCL 86), CV-1 (ATCC CCL 70), COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), NIH/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2), C127I (ATCC CRL 1616), BS-C-1 (ATCC CCL 26) and MRC-5 (ATCC CCL 171).
- The expression vector may be introduced into host cells to express a polypeptide of the present invention via any one of a number of techniques including calcium phosphate transformation, DEAE-dextran transformation, cationic lipid mediated lipofection, electroporation or infection
- The transformed host cells are propagated and cloned, for example by limiting dilution, and analysed to determine the expression level of recombinant polypeptide. Identification of transformed host cells which express a polypeptide of the present invention may be achieved by several means including immunological reactivity with antibodies described herein and/or the detection of biological activity.
- Polypeptides of the present invention may be expressed as fusion proteins, for example with one or more additional polypeptide domains added to facilitate protein purification. Examples of such additional polypeptides include metal chelating peptides such as histidine-tryptophan modules that allow purification on immobilised metals (Porath, J., Protein Exp. Purif. 3:263 (1992)), protein A domains that allow purification on immobilised immunoglobulin, and the domain utilised in the FLAGS extension/affinity purification system (Immunex Corp, Seattle Wash.). The inclusion of cleavable linker sequences such as Factor XA or enterokinase (Invitrogen, San Diego Calif.) between the purification domain and the coding region is useful to facilitate purification. A preferred protein purification system is the CLONTECH, TALON™ nondenaturing protein purification kit for purifying 6xHis-tagged proteins under native conditions (CLONTECH, Palo Alto, Calif.).
- Therefore in a further aspect we provide a method for producing a polypeptide of the present invention, which method comprises culturing a transformed host cell comprising a polynucleotide of the present invention under conditions suitable for the expression of said polypeptide.
- In a further aspect of the present invention we provide a purified polypeptide comprising the human PGC-3a amino acid sequence set out in SEQ ID NO.2 or a variant of SEQ ID NO.2 having at least about 90% homology to a member selected from (SEQ ID NO.2 positions 1-600, SEQ ID NO.2 positions 400-1002, SEQ ID NO.2 positions 200-800), or a biologically active fragment thereof.
- In a further aspect of the present invention we provide a purified polypeptide comprising the human PGC-3b amino acid sequence set out in SEQ ID NO.4 or a variant of SEQ ID NO.4 having at least about 90% homology to a member selected from (SEQ ID NO.4 positions 1-600, SEQ ID NO.4 positions 400-996, SEQ ID NO.4 positions 200-800), or a biologically active fragment thereof.
- In a further aspect of the present invention we provide a purified polypeptide comprising the human PGC-3c amino acid sequence set out in SEQ ID NO.8 or a variant of SEQ ID NO.8 having at least about 90% homology to a member selected from (SEQ ID NO.8 positions 1-600, SEQ ID NO.8 positions 400-1023, SEQ ID NO.8 positions 200-800), or a biologically active fragment thereof.
- In a further aspect of the present invention we provide a purified polypeptide comprising the rat PGC-3 amino acid sequence set out in SEQ ID NO.10 or a variant of SEQ ID NO.10 having at least about 90% homology to a member selected from (SEQ ID NO.10 positions 1-600, SEQ ID NO.10 positions 400-990, SEQ ID NO.10 positions 200-800), or a biologically active fragment thereof.
- A variant is a polynucleotide or polypeptide which differs from a reference polynucleotide or polypeptide, but which retains some of its essential characteristics. For example, a variant of a PGC-3 polypeptide may have an amino acid sequence that is different by one or more amino acid substitutions, deletions and/or additions. The variant may have conservative changes (amino acid similarity), wherein a substituted amino acid has similar structural or chemical properties, for example, the replacement of leucine with isoleucine. Alternatively, a variant may have nonconservative changes, e.g., replacement of a glycine with a tryptophan. Guidance in determining which and how many amino acid residues may be substituted, inserted or deleted and the effect this will have on biological activity may be reasonably inferred from the present disclosure by a person skilled in the art and may further be found using computer programs well known in the art, for example, DNAStar software.
- Amino acid substitutions may be made, for instance, on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues. Negatively charged amino acids, for example, include aspartic acid and glutamic acid; positively charged amino acids include lysine and arginine; and amino acids with uncharged polar head groups having similar hydrophilicity values include leucine, isoleucine, valine, glycine, alanine; asparagine, glutamine; serine, threonine, phenylalanine, and tyrosine.
- Suitable substitutions of amino acids include the use of a chemically derivatised residue in place of a non-derivatised residue. D-isomers and other known derivatives may also be substituted for the naturally occurring amino acids. See, e.g., U.S. Pat. No. 5,652,369, Amino Acid Derivatives, issued Jul. 29, 1997. Example substitutions are set forth in Table 1.
- “Homology” as used in this description is a measure of the similarity or identity of nucleotide sequences or amino acid sequences. In order to characterise the homology, subject sequences are aligned so that the highest order identity match is obtained. Identity can be calculated using published techniques. Computer program methods to determine identity between two sequences, for example, include DNAStar software (DNAStar Inc., Madison, Wis.); the GCG program package (Devereux, J., et al., Nucleic Acids Research 1984, 12(1):387); and BLASTP, BLASTN, FASTA (Atschul, S. F. et al., J Molec Biol 1990, 215:403). Homology as defined herein is determined conventionally using the well known computer program, BESTFIT (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis., 53711). When using BESTFIT or another sequence alignment program to determine the similarity of a particular sequence to a reference sequence, the parameters are typically set such that the percentage identity is calculated over the full length of the reference nucleotide sequence or amino acid sequence and that gaps in homology of up to about 10% of the total number of nucleotides or amino acid residues in the reference sequence are allowed.
- In a further aspect we provide polymorphic variants of the polynucleotides and polypeptides of the present invention. Polymorphisms are variations in polynucleotide or polypeptide sequences between one individual and another. DNA polymorphisms may lead to variations in amino acid sequence and consequently to altered protein structure and functional activity. Polymorphisms may also affect mRNA synthesis, maturation, transport and stability. Polymorphisms which do not result in amino acid changes (silent polymorphisms) or which do not alter any known consensus sequences may nevertheless have a biological effect, for example by altering mRNA folding or stability.
- Knowledge of polymorphisms may be used to help identify patients most suited to therapy with particular pharmaceutical agents (this is often termed “pharmacogenetics”). Pharmacogenetics may also be used in pharmaceutical research to assist the drug selection process. Polymorphisms may be used in mapping the human genome and to elucidate the genetic component of diseases. The reader is directed to the following references for background details on pharmacogenetics and other uses of polymorphism detection: Linder et al. (1997), Clinical Chemistry, 43, 254; Marshall (1997), Nature Biotechnology, 15, 1249; International Patent Application WO 97/40462, Spectra Biomedical; and Schafer et al. (1998), Nature Biotechnology, 16, 33.
- The polypeptides of the present invention may be genetically engineered in such a way that their interaction with other intracellular and membrane associated proteins are maintained but their effector function and biological activity are removed. A polypeptide genetically modified in this way is known as a dominant negative mutant. In the construction of a dominant negative mutant at least one amino acid residue position at a site required for activity in the native peptide is changed to produce a peptide which has reduced activity or which is devoid of detectable activity. Overexpression of the dominant negative mutant in an appropriate cell type down-regulates the effect of the endogenous polypeptide, thereby revealing the biological mechanisms involved in the control of metabolism.
- Similarly, the polypeptides of the present invention may be genetically engineered in such a way that their effector function and biological activity are enhanced. The resultant overactive polypeptide is known as a dominant positive mutant. At least one amino acid residue position at a site required for activity in the native peptide is changed to produce a peptide which has enhanced activity. Overexpression of a dominant positive mutant in an appropriate cell type amplifies the response of the endogenous native polypeptide highlighting the regulatory mechanisms controlling cell metabolism.
- Therefore in a further aspect we provide dominant negative and dominant positive mutants of the polypeptides of the present invention.
- Novel sequences disclosed herein, may be used in another embodiment of the invention to regulate expression of PGC-3 genes in cells by the use of antisense constructs. For example an antisense expression construct may be readily constructed using the pREP10 vector (Invitrogen Corporation). Transcripts are expected to modulate translation of the gene in cells transfected with the construct. Antisense transcripts are effective for modulating translation of the native gene transcript, and are capable of altering the effects (e.g., regulation of tissue physiology) herein described. Oligonucleotides which are complementary to and hybridisable with any portion of mRNA disclosed herein are contemplated for therapeutic use. U.S. Pat. No. 5,639,595, “Identification of Novel Drugs and Reagents”, issued Jun. 17, 1997, wherein methods of identifying oligonucleotide sequences that display in vivo activity are thoroughly described, is herein incorporated by reference. Antisense molecules may also be synthesised for use in antisense therapy, using techniques known to persons skilled in the art. These antisense molecules may be DNA, stable derivatives of DNA such as phosphorothioates or methylphosphonates, RNA, stable derivatives of RNA such as 2′-O-alkylRNA, or other oligonucleotide mimetics. U.S. Pat. No. 5,652,355, “Hybrid Oligonucleotide Phosphorothioates”, issued Jul. 29, 1997, and U.S. Pat. No. 5,652,356, “Inverted Chimeric and Hybrid Oligonucleotides”, issued Jul. 29, 1997, which describe the synthesis and effect of physiologically-stable antisense molecules, are incorporated by reference. Antisense molecules may be introduced into cells by microinjection, liposome encapsulation or by expression from vectors harboring the antisense sequence.
- In a further aspect we provide an antibody specific for a polypeptide of the present invention.
- Antibodies can be prepared using any suitable method, for example, purified polypeptide may be utilised to prepare specific antibodies. The term “antibodies” includes polyclonal antibodies, monoclonal antibodies, and the various types of antibody constructs such as for example F(ab′) 2, Fab and single chain Fv. Antibodies are defined to be specifically binding if they bind the antigen with a Ka of greater than or equal to about 107M−1. Affinity of binding can be determined using conventional techniques, for example those described by Scatchard et al., Ann. N.Y. Acad Sci., 51:660 (1949).
- Polyclonal antibodies can be readily generated from a variety of sources, for example, horses, cows, goats, sheep, dogs, chickens, rabbits, mice or rats, using procedures that are well-known in the art. In general, antigen is administered to the host animal typically through parenteral injection. The immunogenicity of antigen may be enhanced through the use of an adjuvant, for example, Freund's complete or incomplete adjuvant. Following booster immunisations, small samples of serum are collected and tested for reactivity to antigen. Examples of various assays useful for such determination include those described in: Antibodies: A Laboratory Manual, Harlow and Lane (eds.), Cold Spring Harbor Laboratory Press, 1988; as well as procedures such as countercurrent immuno-electrophoresis (CIEP), radioimmunoassay, radioimmunoprecipitation, enzyme-linked immuno-sorbent assays (ELISA), dot blot assays, and sandwich assays, see U.S. Pat. Nos. 4,376,110 and 4,486,530.
- Monoclonal antibodies may be readily prepared using well-known procedures, see for example, the procedures described in U.S. Pat. Nos. RE 32,011, 4,902,614, 4,543,439 and 4,411,993; Monoclonal Antibodies, Hybridomas: A New Dimension in Biological Analyses, Plenum Press, Kennett, McKearn, and Bechtol (eds.), (1980).
- The monoclonal antibodies of the invention can be produced using alternative techniques, such as those described by Alting-Mees et al., “Monoclonal Antibody Expression Libraries: A Rapid Alternative to Hybridomas”, Strategies in Molecular Biology 3: 1-9 (1990) which is incorporated herein by reference. Similarly, binding partners can be constructed using recombinant DNA techniques to incorporate the variable regions of a gene that encodes a specific binding antibody. Such a technique is described in Larrick et al., Biotechnology, 7: 394 (1989).
- Once isolated and purified, the antibodies may be used to detect the presence of antigen in a sample using established assay protocols.
- In a further aspect of the invention we provide a method for identifying a therapeutic agent capable of modulating the activity of PGC-3 for use in the regulation of metabolism, which method comprises:
- (i) contacting a candidate compound modulator with a PGC-3 polypeptide comprising any one of
- (a) the amino acid sequence set out in SEQ ID NO.2 or a variant of SEQ ID NO.2 having at least about 90% homology to a member selected from (SEQ ID NO.2 positions 1-600, SEQ ID NO.2 positions 400-1002, SEQ ID NO.2 positions 200-800) or a biologically active fragment thereof;
- or
- (b) the amino acid sequence set out in SEQ ID NO.4 or a variant of SEQ ID NO.4 having at least about 90% homology to a member selected from (SEQ ID NO.4 positions 1-600, SEQ ID NO.4 positions 400-996, SEQ ID NO.4 positions 200-800) or a biologically active fragment thereof;
- or
- (c) the amino acid sequence set out in SEQ ID NO.8 or a variant of SEQ ID NO.8 having at least about 90% homology to a member selected from (SEQ ID NO.8 positions 1-600, SEQ ID NO.8 positions 400-996, SEQ ID NO.8 positions 200-800) or a biologically active fragment thereof;
- and
- (ii) measuring an effect of the candidate compound modulator on the activity of the PGC-3 polypeptide.
- Activity as used herein refers to the ability of the therapeutic agent to mediate cell processes related to insulin resistance syndrome and other related disorders such as non-insulin dependent diabetes mellitus, dyslipidemia, obesity and atherosclerosis.
- Modulation of the activity of PGC-3 comprises either stimulation or inhibition. Thus a therapeutic agent capable of modulating the activity of PGC-3 is an agent that either stimulates or inhibits the activity of PGC-3. The terms “modulator of PGC-3 activity” and “PGC-3 modulator” are also used herein to refer to an agent that either stimulates or inhibits the activity of PGC-3. The therapeutic agents of the invention have utility in the regulation of metabolism; in particular in obesity and the control of insulin resistance syndrome and other related disorders such as non-insulin dependent diabetes mellitus, dyslipidemia, and atherosclerosis.
- In a further aspect of the invention we provide a screen for identifying compounds which modulate the activity of PGC-3, the invention extends to such a screen and to the use of compounds obtainable therefrom to modulate the activity of PGC-3 in vivo.
- Potential therapeutic agents which may be tested in the screen include simple organic molecules, commonly known as “small molecules”, for example those having a molecular weight of less than 2000 Daltons. The screen may also be used to screen compound libraries such as peptide libraries, including synthetic peptide libraries and peptide phage libraries. Other suitable molecules include antibodies, nucleotide sequences and any other molecules which modulate the activity of PGC-3.
- Once an inhibitor or stimulator of PGC-3 activity is identified then medicinal chemistry techniques can be applied to further refine its properties, for example to enhance efficacy and/or reduce side effects.
- It will be appreciated that there are many screening procedures which may be employed to perform the present invention. Examples of suitable screening procedures which may be used to identify a PGC-3 modulator for use in the regulation of metabolism include rapid filtration of equilibrium binding mixtures, enzyme linked immunosorbent assays (ELISA), radioimmunoassays (RIA) and fluorescence resonance energy transfer assays (FRET). For further information on FRET the reader is directed to International Patent Application WO 94/28166 (Zeneca). Methods to identify potential drug candidates have been reviewed by Bevan P et al., 1995, TIBTECH 13 115.
- A preferred method for identifying a compound capable of modulating the activity of PGC-3 is a scintillation proximity assay (SPA). SPA involves the use of fluomicrospheres coated with acceptor molecules, such as receptors, to which a ligand will bind selectively in a reversible manner (N Bosworth & P Towers, Nature, 341, 167-168, 1989). The technique requires the use of a ligand labelled with an isotope that emits low energy radiation which is dissipated easily into an aqueous medium. At any point during an assay, bound labelled ligands will be in close proximity to the fluomicrospheres, allowing the emitted energy to activate the fluor and produce light. In contrast, the vast majority of unbound labelled ligands will be too far from the fluomicrospheres to enable the transfer of energy. Bound ligands produce light but free ligands do not, allowing the extent of ligand binding to be measured without the need to separate bound and free ligand.
- Cellular assay systems may be used to further identify PGC-3 modulators for use in the regulation of metabolism.
- Therefore in a further aspect of the invention the candidate compound modulator is contacted with a host-cell which expresses an PGC-3 polypeptide (as hereindefined).
- A preferred cellular assay system for use in the method of the invention is a two-hybrid assay system. The two-hybrid system utilises the ability of a pair of interacting proteins to bring the activation domain of a transcription factor into close proximity with its DNA-binding domain, restoring the functional activity of the transcription factor and inducing the expression of a reporter gene (S Fields & O Song, Nature, 340, 245-246, 1989). Commercially available systems such as the Clontech Matchmakers systems and protocols may be used with the present invention.
- Other preferred cellular assay systems include measurement of changes in the levels of intracellular signalling molecules such as cyclic-AMP, intracellular calcium ions, or arachidonic acid metabolite release. These may all be measured using standard published procedures and commercially available reagents. In addition the polynucleotides of the present invention may be transfected into appropriate cell lines that have been transfected with a “reporter” gene such as bacterial lacZ, luciferase, aequorin or green fluorescent protein that will “report” these intracellular changes (Egerton et al, J. Mol, Endocrinol, 1995, 14(2), 179-189).
- In a further aspect of the present invention we provide a novel PGC-3 modulator, or a pharmaceutically acceptable salt thereof, for use in a method of treatment of metabolic diseases of the human or animal body by therapy.
- Examples of metabolic diseases which may be treated using a compound of the invention include insulin resistance syndrome, non-insulin dependent diabetes mellitus, dyslipidemia, obesity and atherosclerosis.
- According to a further aspect of the invention, we provide a pharmaceutical composition which comprises a novel PGC-3 modulator, or a pharmaceutically acceptable salt thereof, in association with a pharmaceutically acceptable diluent or carrier.
- The composition may be in the form suitable for oral use, for example a tablet, capsule, aqueous or oily solution, suspension or emulsion; for topical use, for example a cream, ointment, gel or an aqueous or oily solution or suspension; for nasal use, for example a snuff, nasal spray or nasal drops; for rectal use, for example a suppository; for administration by inhalation, for example as a finely divided powder such as a dry powder, a microcrystalline form or a liquid aerosol; for sub-lingual or buccal use, for example a tablet or capsule; or for parenteral use (including intravenous, subcutaneous, intramuscular, intravascular or infusion), for example a sterile aqueous or oily solution or suspension. In general, the above compositions may be prepared in a conventional manner using conventional excipients.
- The invention also provides a method of treating a metabolic disease or medical condition mediated alone or in part by PGC-3, which comprises administering to a warm-blooded animal requiring such treatment an effective amount of an PGC-3 modulator as defined above.
- The invention also provides the use of an PGC-3 modulator in the production of a medicament for use in the treatment of a metabolic disease.
- The amount of active ingredient that is combined with one or more excipients to produce a single dosage form will necessarily vary depending on the subject treated and the particular route of administration. For example, a formulation intended for oral administration to humans will generally contain for example, from 0.5 mg to 2 g of active agent compounded with an appropriate and convenient amount of excipients which may vary from about 5 to about 98 percent by weight of the total composition. Dosage unit forms will generally contain about 1 mg to about 500 mg of an active ingredient.
- The size of the dose for therapeutic or prophylactic purposes of an PGC-3 modulator will naturally vary according to the nature and severity of the immune disease, the age and sex of the patient, and the route of administration, according to well known principles of medicine.
- In using an PGC-3 modulator for therapeutic or prophylactic purposes it will generally be administered so that a daily dose in the range for example 0.5 mg to 75 mg per kg body weight is received, given if required in divided doses. In general lower doses will be administered when a parenteral route is employed. Thus for example, for intravenous administration, a dose in the range for example 0.5 mg to 30 mg per kg body weight will generally be used. Similarly, for administration by inhalation a dose in the range for example 0.5 mg to 25 mg per kg body weight will be used.
- The invention will now be illustrated but not limited by reference to the following Tables, Examples and Figures. Unless indicated otherwise, the techniques used are those detailed in well known molecular biology textbooks such as Sambrook, Fritsch & Maniatis, Molecular Cloning a Laboratory Manual, second edition, 1989, Cold Spring Harbor Laboratory Press.
- SEQ ID NO.1 shows the full length human PGC-3a cDNA
- SEQ ID NO.2.shows human PGC-3a protein sequence
- SEQ ID NO.3.shows human PGC-3b cDNA
- SEQ ID NO.4. shows human PGC-3b protein sequence
- SEQ ID NO.5. shows the sequence of the 3′ RACE product isolated from human adipocyte cDNA.
- SEQ ID NO.6. shows the sequence of the 5′RACE product isolated from human heart cDNA.
- SEQ ID NO.7. shows human PGC-3c cDNA
- SEQ ID NO 8 shows human PGC-3c protein sequence
- SEQ ID NO 9 shows the full length rat PGC-3 cDNA
- SEQ ID NO 10 shows rat PGC-3 protein sequence
- FIG. 1 shows specific PCR products of 561 bp for PGC-3b and 491 bp for PGC-3a, isolated from breast adipose tissue cDNA, in
1 and 2 respectively. The PGC-3b specific PCR product was obtained using PCR primers CME9830 and CME9831 (Table 2). The PGC-3a specific PCR product was obtained using PCR primers CME9830 and CME9850 (Table 2).lanes - FIG. 2 shows a comparison of PGC-3a and PGC-3b with PGC-1, indicating that the molecules share regions of sequence homology in particular locations which are believed to be important for biological activity.
- FIG. 3 shows a comparison of human PGC-3a and rat PGC-3 protein sequences indicating that the molecules share a high degree of sequence homology.
- FIG. 4 shows the mean relative expression of PGC-3 mRNA in a range of human tissues. Quantitative real-time PCR was undertaken in quadruplicate on cDNA samples derived from the human tissues listed using Taqman™ fluorescent PCR technology (PE. Applied Biosystems). The normalised ratio of expression of PGC-3 relative to a housekeeping gene (GAPDH) was calculated for all tissues. SC=subcutaneous. Where multiple samples of the same tissue type were assayed the sample number is given, eg Omental adipocyte S24 refers to the data obtained from the omental
adipocyte sample number 24. AU=arbitrary units. - TABLES
TABLE 1 Examples of conservative amino acid substitutions Original residue Example conservative substitutions Ala (A) Gly; Ser; Val; Leu; Ile; Pro Arg (R) Lys; His; Gln; Asn Asn (N) Gln; His; Lys; Arg Asp (D) Glu Cys (C) Ser Gln (Q) Asn Glu (E) Asp Gly (G) Ala; Pro His (H) Asn; Gln; Arg; Lys Ile (I) Leu; Val; Met; Ala; Phe Leu (L) Ile; Val; Met; Ala; Phe Lys (K) Arg; Gln; His; Asn Met (M) Leu; Tyr; Ile; Phe Phe (F) Met; Leu; Tyr; Val; Ile; Ala Pro (P) Ala; Gly Ser (S) Thr Thr (T) Ser Trp (W) Tyr; Phe Tyr (Y) Trp; Phe; Thr; Ser Val (V) Ile; Leu; Met; Phe; Ala -
TABLE 2 Primer sequences Primer Sequence forward primer CME 9748 5′ GTCACAAAGCGACCCAACTT 3′reverse primer CME 9749 5′ GAGTCATGGTCTCCAAAGGAAC 3′AP1 adaptor primer 5′ CCATCCTAATACGACTCACTATAGGGC 3′CME 9830 forward primer 5′ GCCACTCGAAGGAACTTCAGAT 3′CME 9850 reverse primer B 5′ GGGTTAAGGCTGTTATCAATGC 3′CME 9831 reverse primer A 5′ AGGCCAGAAGAGAAACAGGATG 3′CME 9726 sequencing primer 5′ CTTCTCCTGTTCCTTTGGAGAC 3′CME 9727 sequencing primer 5′ TGGGGTTCACTTGAGGATTG 3′CME 9778 sequencing primer 5′ ATTCAAAATCTCTTCCAGCGAC 3′CME 9776 sequencing primer 5′ GAAGACAGAAGCTGTGATGCTG 3′ -
TABLE 3 Primers used in Example 4 Primer Sequence SP1A 5′-CATCACAGAGCACGTCTTGAG-3 ′ SP2A 5′-CATGTAGCGTATGAGTTGCACCATC-3′ Oligo d(T)- anchor 5′-GACCACGCGTATCGATGTCGACTTTTT primer TTTTTTTTTTTV-3′ V = A, C or G PCR anchor primer 5′-GACCACGCGTATCGATGTCGAC-3 -
TABLE 4 Details of primers used to sequence rat PGC-3 Primer primer sequence (5′→3′) CVGI169 TTGGGTAACGCCAGGGTTTTCCCAGTCAC CVGI170 CCCCAGGCTTTACACTTTATGCTTCCGGC CVGI171 GCCAGTACAGCCCTGATGAT CVGI172 TCCCCAGTGTCTGAAGTGGATG CVGI281 CTCATTCGCTACATGCATACCT CVGI282 CGGCCTTGTGTCAAGGTGGATG CVGI283 CTTCTGGACTGAGTTCTCCATC CVGI390 CAGGAGACTGAATCCAGAGCTG CVGI391 GACAGTAGTCAAGGCCAGCAGC CVGI457 GAGACCATGACTACTGCCAGGT CVGI458 ACCGCTCTGGAGGAGGAAGACT CVGI535 TTAAGCCTTAACCCTTTGAGGA CVGI536 GGCCCAGATACACCGACTATGA - Method
- PCR primers CME 9748 and CME 9749, listed in Table 2 were synthesised and used to amplify a 347 bp product from human adipose cDNA (Human adipocyte Marathon Ready cDNA, Clontech Cat.# 7447-1, Clontech, Basingstoke, UK).
- A technique known as Rapid Amplification of cDNA Ends (RACE) was then used to amplify the 3′ end of the PGC-3 cDNA. RACE is a commonly used molecular biological technique which enables the user to extend and identify sequence along a cDNA template in one direction. This allows the user to obtain a complete cDNA sequence starting from a small piece of cDNA sequence. For a more complete description of the method refer to Chenchick A, Moqadam F and Siebert P. 1996 Laboratory guide to RNA: isolation, analysis and synthesis. Wiley-Liss Inc. p273-321. In this case we used a commercially available RACE PCR kit, the Human adipocyte Marathon Ready™ cDNA (Clontech, Basingstoke, UK). It is a premade human adipocyte “library” of adaptor-ligated double stranded cDNA ready for performing both 5′ and 3′ RACE from the same template. The PGC-3 gene specific primer CME 9748 (Table 2) was used in a Marathon RACE reaction with the AP1 adapter primer (Table 2) supplied by Clontech and the Marathon Ready™ cDNA according to the manufacturer's instructions.
- Results
- A 1.5 kb PCR product was amplified in the reaction and separated from non-specific DNA by agarose gel electrophoresis using a 1.5% agarose gel and visualised by ethidium bromide staining. The 1.5 kb PCR product was isolated from the gel using a DNA extraction kit (Qiaex II™, Qiagen) and the purified PCR product cloned into PCR2.1™ vector using a TOPO TA™ Cloning kit (Invitrogen), according to the manufacturer's instructions. The cloned PCR product was fully sequenced using the vector M13 sequencing primers supplied in the kit (Invitrogen) and the PGC-3 gene specific sequencing primers CME 9726, CME 9727,CME 9778 and CME 9776 (listed in Table 2). The sequence of the 3′RACE product is shown in FIG. 5 (SEQ ID NO: 5). The predicted protein sequence of the 3′ end of SEQ ID NO:7 was found to be 50% identical to the sequence for human PGC-1 over a 135 aa region (see FIG. 7). This small region of SEQ ID NO:7 also shared 53% identity to the rat PGC-1 sequence (EMBL accession number: AB025784).
- Two variants of PGC-3 have been found with cDNA sequence which differ at the 3′ end. We have named these two variants PGC-3a and PGC-3b.
- Method
- Primers CME 9830 and CME 9850 (Table 2) were synthesised based on the PGC-
3a 3′ RACE product sequence. These were used to PCR screen the Origene human heart cDNA library master plate (ORIGENE LHT-1001, Origene, USA) according to the manufacturers instructions. - Results
- PCR screening of the master plate identified several wells positive for PGC-3a cDNA. The subplates corresponding to these wells were obtained from Origene and a subsequent round of PCR screening was performed to identify individual clones containing PGC-3a cDNA.
- Clones containing PGC-3a were identified and sequenced. This resulted in the isolation of the complete cDNA sequence for PGC-3a (SEQ ID NO:1). The PGC-3a cDNA sequence comprises a coding region of 3009 nucleotides, that encode a protein of 1002 amino acids with a calculated molecular mass of 110 kDa and an estimated isoelectric point of 4.933. The protein sequence for PGC-3a is shown in SEQ ID NO:2.
- Methods
- PCR primers were synthesised which would specifically amplify either PGC-3a or PGC-3b (CME 9830, CME 9831, CME 9850, Table 2).
- To investigate whether both PGC-3a and PGC-3b were expressed by human breast adipocytes, PCRs were carried out using the above primers to amplify PGC-3a and PGC-3b from human breast adipocyte cDNA. The PCR conditions used were 94° C. for 1 minute then 30 cycles of 94° C. for 30 sec, then 68° C. for 4 minutes. The DNA polymerase used was Extensor™ from Advanced Biotechnologies. PCR was performed according to standard procedure described in Molecular Cloning,a laboratory manual, Sambrook, Fritsch and Maniatis Second Ed 1989). The forward PCR primer (CME 9830) was used in combination with either reverse PCR primer A (CME 9831) designed specifically to amplify PGC-3b (sequence 2) or reverse primer B (CME 9850) designed specifically to amplify
sequence 3 PGC-3a (3′ RACE product). - Results
- Specific PCR products of 561 bp for PGC-3b (CME9830/CME 9831) and 491 bp for PGC-3a (CME9830/CME9850) were obtained, as shown in FIG. 1 in
1 and 2 respectively.lanes - The complete cDNA sequence for PGC-3b is shown in SEQ ID NO:3. The PGC-3b cDNA sequence comprises a coding region of 2991 nucleotides encoding a protein of 996 amino acids. The protein sequence for PGC-3b is shown in SEQ ID NO:4.
- The technique known as RACE as described in example 1 was used to amplify the 5′ end of the PGC-3c cDNA. This procedure was undertaken using the
Roche 5′/3 ′RACE kit (Cat. No. 1 734 792) and followed the manufacturer's instructions. First strand cDNA was synthesized from total human heart RNA (Stratagene, Cat. No. 73501241 ) using a gene specific primer (SP1A, listed in Table 3), AMV reverse transcriptase (supplied in kit) and deoxynucleotide mix (supplied in kit). The first strand cDNA was purified using High Pure PCR Product Purification Kit (Roche Diagnostics Corporation, Indianapolis, Ind., USA—Cat No. 1 732 668) according to the manufacturer's instructions. A homopolymeric A-tail was then added to the 3′end of the cDNA using terminate transferase using reagents and instructions supplied with the kit. The cDNA was then amplified by PCR using a gene specific primer (SP2A, see table 3) and an oligo dT-anchor primer (see table 3). The obtained cDNA was further amplified by a second PCR using a nested specific primer (SP3A, see Table 3) and a PCR anchor primer (see table 3). Resulting 5′RACE products were cloned into a vector. The cloned PCR products were fully sequenced. The sequence of the 5′RACE product is shown in Figure SEQ ID NO. 6. The full length cDNA sequence of PGC-3c is shown in SEQ ID NO. 7. The predicted protein sequence of PGC-3c is shown in SEQ ID NO. 8. - Homology searching using the human PGC-3 cDNA sequence identified a rat clone in a proprietary database that had a high level of homology to PGC-3. This clone was obtained and sequenced using primers CVGI169, CVGI170, CVGI171, CVGI172, CVGI281, CVGI282, CVGI283, CVGI390, CVGI391, CVGI457, CVGI458, CVGI535 (see table 4 for sequence information). The full length rat PGC-3 cDNA sequence is shown in SEQ ID NO. 9. The predicted protein sequence of rat PGC-3 is shown in Figure SEQ ID NO. 10. FIG. 13 shows a comparison of human PGC-3a and rat PGC-3 protein sequences, indicating that the molecules have a high degree of sequence homology (greater than 78% identity).
- Total RNA was extracted from human adipocytes using TRI reagent (Sigma-Aldrich) following the manufacturer's suggested protocol. Two micrograms of total RNA from each adipocyte samples was used to generate cDNA, using the Promega reverse transcription system (Promega; catalogue number A3500) according to the manufacturer's instructions. The heart, skeletal muscle, kidney, liver, lung, heart and pancreas cDNAs were obtained from Clontech (Clontech, catalogue numbers K1420-1 and K1421-1). Probe and primer sequences were designed for PGC-3 and were: PGC-3 forward primer; 5′-TGCTGGCCCAGATACACTGA-3′. PGC-3 reverse primer; 5′-GGCTGTTATCAATGCAGGCTC-3′. PGC-3 probe; 5-FAM-CGTCAGGGAAAAGCAAGTATGAAGCCAT-TAMRA-3′. Taqman PCR assays for each target gene were performed in quadruplicate in 96 well plates on an ABI Prism 7700 Sequence Detection system (PE Applied Biosystems). For each 25 □1 Taqman reaction 0.01-1 ng cDNA was mixed with final concentrations of 1×Taqman Universal PCR Mastermix (PE Applied Biosystems), 300 nM forward and reverse primers and 200 nM probe. PCR parameters were 50° C. for 2 minutes, 95° C. for 10 minutes, 40 cycles of 95° C. for 15 seconds and 60° C. for 1 min. Results were analysed by the comparative Ct method as previously described in ABI Prism 7700 User Bulletin #2 (PE Applied Biosystems). Briefly, for each PCR, a threshold cycle (Ct) was calculated. This refers to the PCR cycle where amplified DNA is detectable above an arbitary threshold. Ct values are semi-quantitative and are related to the amount of target sequence present within a particular cDNA sample. Quadruplicate Ct samples were averaged and normalised by dividing with Ct values obtained for a housekeeping gene (GAPDH; supplied by PE Applied Biosystems). This gives a mean □Ct value for each cDNA. These values can be directly compared between different cDNA samples to give relative expression values for each target gene. FIG. 4 shows the mean relative expression of PGC-3 mRNA in the human tissues listed above. Where multiple samples of the same tissue type were assayed the sample number is given, eg Omental adipocyte S24 refers to the data obtained from the omental
adipocyte sample number 24. These results demonstrate that PGC-3 is highly expressed in human breast adipocytes, omental adipocytes and subcutaneous adipocytes. High levels of expression of PGC-3 were also observed in lung and heart samples. Expression of PGC-3 was lower in human kidney, skeletal muscle, pancreas and liver samples. -
1 40 1 3203 DNA Homo sapiens 1 ggcacgagga agaattgaac tcatacagct gatgggagtg tacaaaggtg gagggtccgg 60 ggaggagcaa ctctatgctg actttccaga acttgacctc tcccagctgg atgccagcga 120 ctttgactcg gccacctgct ttggggagct gcagtggtgc ccagagaact cagagactga 180 acccaaccag tacagccccg atgactccga gctcttccag attgacagtg agaatgaggc 240 cctcctggca gagctcacca agaccctgga tgacatccct gaagatgacg tgggtctggc 300 tgccttccca gccctggatg gtggagacgc tctatcatgc acctcagctt cgcctgcccc 360 ctcatctgca ccccccagcc ctgccccgga gaagccctcg gccccagccc ctgaggtgga 420 cgagctctca ctgctgcaga agctcctcct ggccacatcc tacccaacat caagctctga 480 cacccagaag gaagggaccg cctggcgcca ggcaggcctc agatctaaaa gtcaacggcc 540 ttgtgttaag gcggacagca cccaagacaa gaaggctccc atgatgcagt ctcagagccg 600 aagttgtaca gaactacata agcacctcac ctcggcacag tgctgcctgc aggatcgggg 660 tctgcagcca ccatgcctcc agagtccccg gctccctgcc aaggaggaca aggagccggg 720 tgaggactgc ccgagccccc agccagctcc agcctctccc caggactccc tagctctggg 780 cagggcagac cccggtgccc cggtttccca ggaagacatg caggcgatgg tgcaactcat 840 acgctacatg cacacctact gcctccccca gaggaagctg cccccacaga cccctgagcc 900 actccccaag gcctgcagca acccctccca gcaggtcaga tcccggccct ggtcccggca 960 ccactccaaa gcctcctggg ctgagttctc cattctgagg gaacttctgg ctcaagacgt 1020 gctctgtgat gtcagcaaac cctaccgtct ggccacgcct gtttatgcct ccctcacacc 1080 tcggtcaagg cccaggcccc ccaaagacag tcaggcctcc cctggtcgcc cgtcctcggt 1140 ggaggaggta aggatcgcag cttcacccaa gagcaccggg cccagaccaa gcctgcgccc 1200 actgcggctg gaggtgaaaa gggaggtccg ccggcctgcc agactgcagc agcaggagga 1260 ggaagacgag gaagaagagg aggaggaaga ggaagaagaa aaagaggagg aggaggagtg 1320 gggcaggaaa aggccaggcc gaggcctgcc atggacgaag ctggggagga agctggagag 1380 ctctgtgtgc cccgtgcggc gttctcggag actgaaccct gagctgggcc cctggctgac 1440 atttgcagat gagccgctgg tcccctcgga gccccaaggt gctctgccct cactgtgcct 1500 ggctcccaag gcctacgacg tagagcggga gctgggcagc cccacggacg aggacagtgg 1560 ccaagaccag cagctcctac ggggacccca gatccctgcc ctggagagcc cctgtgagag 1620 tgggtgtggg gacatggatg aggaccccag ctgcccgcag ctccctccca gagactctcc 1680 caggtgcctc atgctggcct tgtcacaaag cgacccaact tttggcaaga agagctttga 1740 gcagaccttg acagtggagc tctgtggcac agcaggactc accccaccca ccacaccacc 1800 gtacaagccc acagaggagg atcccttcaa accagacatc aagcatagtc taggcaaaga 1860 aatagctctc agcctcccct cccctgaggg cctctcactc aaggccaccc caggggctgc 1920 ccacaagctg ccaaagaagc acccagagcg aagtgagctc ctgtcccacc tgcgacatgc 1980 cacagcccag ccagcctccc aggctggcca gaagcgtccc ttctcctgtt cctttggaga 2040 ccatgactac tgccaggtgc tccgaccaga aggcgtcctg caaaggaagg tgctgaggtc 2100 ctgggagccg tctggggttc accttgagga ctggccccag cagggtgccc cttgggctga 2160 ggcacaggcc cctggcaggg aggaagacag aagctgtgat gctggtgccc cacccaagga 2220 cagcacgctg ctgagagacc atgagatccg tgctagcctc accaaacact ttgggctgct 2280 ggagaccgcc ctggaggagg aagacctggc ctcctgcaag agccctgagt atgacactgt 2340 ctttgaagac agcagcagca gcagcggcga gagcagcttc ctcccagagg aggaagagga 2400 agaaggggag gaggaggagg aggacgatga agaagaggac tcaggggtca gccccacttg 2460 ctctgaccac tgcccctacc agagcccacc aagcaaggcc aaccggcagc tctgttcccg 2520 cagccgctca agctctggct cttcaccctg ccactcctgg tcaccagcca ctcgaaggaa 2580 cttcagatgt gagagcagag ggccgtgttc agacagaacg ccaagcatcc ggcacgccag 2640 gaagcggcgg gaaaaggcca ttggggaagg ccgcgtggtg tacattcaaa atctctccag 2700 cgacatgagc tcccgagagc tgaagaggcg ctttgaagtg tttggtgaga ttgaggagtg 2760 cgaggtgctg acaagaaata ggagaggcga gaagtacggc ttcatcacct accggtgttc 2820 tgagcacgcg gccctctctt tgacaaaggg cgctgccctg aggaagcgca acgagccctc 2880 cttccagctg agctacggag ggctccggca cttctgctgg cccagataca ctgactacga 2940 ttccaattca gaagaggccc ttcctgcgtc agggaaaagc aagtatgaag ccatggattt 3000 tgacagctta ctgaaagagg cccagcagag cctgcattga taacagcctt aaccctcgag 3060 gaatacctca atacctcaga caaggccctt ccaatatgtt tacgttttca aagaaatcaa 3120 gtatatgagg agagcgagcg agcgtgagag aacacccgtg agagagactt gaaactgctg 3180 tcctttaaaa aaaaaaaaaa aaa 3203 2 1002 PRT Homo sapiens 2 Met Gly Val Tyr Lys Gly Gly Gly Ser Gly Glu Glu Gln Leu Tyr Ala 1 5 10 15 Asp Phe Pro Glu Leu Asp Leu Ser Gln Leu Asp Ala Ser Asp Phe Asp 20 25 30 Ser Ala Thr Cys Phe Gly Glu Leu Gln Trp Cys Pro Glu Asn Ser Glu 35 40 45 Thr Glu Pro Asn Gln Tyr Ser Pro Asp Asp Ser Glu Leu Phe Gln Ile 50 55 60 Asp Ser Glu Asn Glu Ala Leu Leu Ala Glu Leu Thr Lys Thr Leu Asp 65 70 75 80 Asp Ile Pro Glu Asp Asp Val Gly Leu Ala Ala Phe Pro Ala Leu Asp 85 90 95 Gly Gly Asp Ala Leu Ser Cys Thr Ser Ala Ser Pro Ala Pro Ser Ser 100 105 110 Ala Pro Pro Ser Pro Ala Pro Glu Lys Pro Ser Ala Pro Ala Pro Glu 115 120 125 Val Asp Glu Leu Ser Leu Leu Gln Lys Leu Leu Leu Ala Thr Ser Tyr 130 135 140 Pro Thr Ser Ser Ser Asp Thr Gln Lys Glu Gly Thr Ala Trp Arg Gln 145 150 155 160 Ala Gly Leu Arg Ser Lys Ser Gln Arg Pro Cys Val Lys Ala Asp Ser 165 170 175 Thr Gln Asp Lys Lys Ala Pro Met Met Gln Ser Gln Ser Arg Ser Cys 180 185 190 Thr Glu Leu His Lys His Leu Thr Ser Ala Gln Cys Cys Leu Gln Asp 195 200 205 Arg Gly Leu Gln Pro Pro Cys Leu Gln Ser Pro Arg Leu Pro Ala Lys 210 215 220 Glu Asp Lys Glu Pro Gly Glu Asp Cys Pro Ser Pro Gln Pro Ala Pro 225 230 235 240 Ala Ser Pro Gln Asp Ser Leu Ala Leu Gly Arg Ala Asp Pro Gly Ala 245 250 255 Pro Val Ser Gln Glu Asp Met Gln Ala Met Val Gln Leu Ile Arg Tyr 260 265 270 Met His Thr Tyr Cys Leu Pro Gln Arg Lys Leu Pro Pro Gln Thr Pro 275 280 285 Glu Pro Leu Pro Lys Ala Cys Ser Asn Pro Ser Gln Gln Val Arg Ser 290 295 300 Arg Pro Trp Ser Arg His His Ser Lys Ala Ser Trp Ala Glu Phe Ser 305 310 315 320 Ile Leu Arg Glu Leu Leu Ala Gln Asp Val Leu Cys Asp Val Ser Lys 325 330 335 Pro Tyr Arg Leu Ala Thr Pro Val Tyr Ala Ser Leu Thr Pro Arg Ser 340 345 350 Arg Pro Arg Pro Pro Lys Asp Ser Gln Ala Ser Pro Gly Arg Pro Ser 355 360 365 Ser Val Glu Glu Val Arg Ile Ala Ala Ser Pro Lys Ser Thr Gly Pro 370 375 380 Arg Pro Ser Leu Arg Pro Leu Arg Leu Glu Val Lys Arg Glu Val Arg 385 390 395 400 Arg Pro Ala Arg Leu Gln Gln Gln Glu Glu Glu Asp Glu Glu Glu Glu 405 410 415 Glu Glu Glu Glu Glu Glu Glu Lys Glu Glu Glu Glu Glu Trp Gly Arg 420 425 430 Lys Arg Pro Gly Arg Gly Leu Pro Trp Thr Lys Leu Gly Arg Lys Leu 435 440 445 Glu Ser Ser Val Cys Pro Val Arg Arg Ser Arg Arg Leu Asn Pro Glu 450 455 460 Leu Gly Pro Trp Leu Thr Phe Ala Asp Glu Pro Leu Val Pro Ser Glu 465 470 475 480 Pro Gln Gly Ala Leu Pro Ser Leu Cys Leu Ala Pro Lys Ala Tyr Asp 485 490 495 Val Glu Arg Glu Leu Gly Ser Pro Thr Asp Glu Asp Ser Gly Gln Asp 500 505 510 Gln Gln Leu Leu Arg Gly Pro Gln Ile Pro Ala Leu Glu Ser Pro Cys 515 520 525 Glu Ser Gly Cys Gly Asp Met Asp Glu Asp Pro Ser Cys Pro Gln Leu 530 535 540 Pro Pro Arg Asp Ser Pro Arg Cys Leu Met Leu Ala Leu Ser Gln Ser 545 550 555 560 Asp Pro Thr Phe Gly Lys Lys Ser Phe Glu Gln Thr Leu Thr Val Glu 565 570 575 Leu Cys Gly Thr Ala Gly Leu Thr Pro Pro Thr Thr Pro Pro Tyr Lys 580 585 590 Pro Thr Glu Glu Asp Pro Phe Lys Pro Asp Ile Lys His Ser Leu Gly 595 600 605 Lys Glu Ile Ala Leu Ser Leu Pro Ser Pro Glu Gly Leu Ser Leu Lys 610 615 620 Ala Thr Pro Gly Ala Ala His Lys Leu Pro Lys Lys His Pro Glu Arg 625 630 635 640 Ser Glu Leu Leu Ser His Leu Arg His Ala Thr Ala Gln Pro Ala Ser 645 650 655 Gln Ala Gly Gln Lys Arg Pro Phe Ser Cys Ser Phe Gly Asp His Asp 660 665 670 Tyr Cys Gln Val Leu Arg Pro Glu Gly Val Leu Gln Arg Lys Val Leu 675 680 685 Arg Ser Trp Glu Pro Ser Gly Val His Leu Glu Asp Trp Pro Gln Gln 690 695 700 Gly Ala Pro Trp Ala Glu Ala Gln Ala Pro Gly Arg Glu Glu Asp Arg 705 710 715 720 Ser Cys Asp Ala Gly Ala Pro Pro Lys Asp Ser Thr Leu Leu Arg Asp 725 730 735 His Glu Ile Arg Ala Ser Leu Thr Lys His Phe Gly Leu Leu Glu Thr 740 745 750 Ala Leu Glu Glu Glu Asp Leu Ala Ser Cys Lys Ser Pro Glu Tyr Asp 755 760 765 Thr Val Phe Glu Asp Ser Ser Ser Ser Ser Gly Glu Ser Ser Phe Leu 770 775 780 Pro Glu Glu Glu Glu Glu Glu Gly Glu Glu Glu Glu Glu Asp Asp Glu 785 790 795 800 Glu Glu Asp Ser Gly Val Ser Pro Thr Cys Ser Asp His Cys Pro Tyr 805 810 815 Gln Ser Pro Pro Ser Lys Ala Asn Arg Gln Leu Cys Ser Arg Ser Arg 820 825 830 Ser Ser Ser Gly Ser Ser Pro Cys His Ser Trp Ser Pro Ala Thr Arg 835 840 845 Arg Asn Phe Arg Cys Glu Ser Arg Gly Pro Cys Ser Asp Arg Thr Pro 850 855 860 Ser Ile Arg His Ala Arg Lys Arg Arg Glu Lys Ala Ile Gly Glu Gly 865 870 875 880 Arg Val Val Tyr Ile Gln Asn Leu Ser Ser Asp Met Ser Ser Arg Glu 885 890 895 Leu Lys Arg Arg Phe Glu Val Phe Gly Glu Ile Glu Glu Cys Glu Val 900 905 910 Leu Thr Arg Asn Arg Arg Gly Glu Lys Tyr Gly Phe Ile Thr Tyr Arg 915 920 925 Cys Ser Glu His Ala Ala Leu Ser Leu Thr Lys Gly Ala Ala Leu Arg 930 935 940 Lys Arg Asn Glu Pro Ser Phe Gln Leu Ser Tyr Gly Gly Leu Arg His 945 950 955 960 Phe Cys Trp Pro Arg Tyr Thr Asp Tyr Asp Ser Asn Ser Glu Glu Ala 965 970 975 Leu Pro Ala Ser Gly Lys Ser Lys Tyr Glu Ala Met Asp Phe Asp Ser 980 985 990 Leu Leu Lys Glu Ala Gln Gln Ser Leu His 995 1000 3 3679 DNA Homo sapiens 3 ggcacgagga agaattgaac tcatacagct gatgggagtg tacaaaggtg gagggtccgg 60 ggaggagcaa ctctatgctg actttccaga acttgacctc tcccagctgg atgccagcga 120 ctttgactcg gccacctgct ttggggagct gcagtggtgc ccagagaact cagagactga 180 acccaaccag tacagccccg atgactccga gctcttccag attgacagtg agaatgaggc 240 cctcctggca gagctcacca agaccctgga tgacatccct gaagatgacg tgggtctggc 300 tgccttccca gccctggatg gtggagacgc tctatcatgc acctcagctt cgcctgcccc 360 ctcatctgca ccccccagcc ctgccccgga gaagccctcg gccccagccc ctgaggtgga 420 cgagctctca ctgctgcaga agctcctcct ggccacatcc tacccaacat caagctctga 480 cacccagaag gaagggaccg cctggcgcca ggcaggcctc agatctaaaa gtcaacggcc 540 ttgtgttaag gcggacagca cccaagacaa gaaggctccc atgatgcagt ctcagagccg 600 aagttgtaca gaactacata agcacctcac ctcggcacag tgctgcctgc aggatcgggg 660 tctgcagcca ccatgcctcc agagtccccg gctccctgcc aaggaggaca aggagccggg 720 tgaggactgc ccgagccccc agccagctcc agcctctccc caggactccc tagctctggg 780 cagggcagac cccggtgccc cggtttccca ggaagacatg caggcgatgg tgcaactcat 840 acgctacatg cacacctact gcctccccca gaggaagctg cccccacaga cccctgagcc 900 actccccaag gcctgcagca acccctccca gcaggtcaga tcccggccct ggtcccggca 960 ccactccaaa gcctcctggg ctgagttctc cattctgagg gaacttctgg ctcaagacgt 1020 gctctgtgat gtcagcaaac cctaccgtct ggccacgcct gtttatgcct ccctcacacc 1080 tcggtcaagg cccaggcccc ccaaagacag tcaggcctcc cctggtcgcc cgtcctcggt 1140 ggaggaggta aggatcgcag cttcacccaa gagcaccggg cccagaccaa gcctgcgccc 1200 actgcggctg gaggtgaaaa gggaggtccg ccggcctgcc agactgcagc agcaggagga 1260 ggaagacgag gaagaagagg aggaggaaga ggaagaagaa aaagaggagg aggaggagtg 1320 gggcaggaaa aggccaggcc gaggcctgcc atggacgaag ctggggagga agctggagag 1380 ctctgtgtgc cccgtgcggc gttctcggag actgaaccct gagctgggcc cctggctgac 1440 atttgcagat gagccgctgg tcccctcgga gccccaaggt gctctgccct cactgtgcct 1500 ggctcccaag gcctacgacg tagagcggga gctgggcagc cccacggacg aggacagtgg 1560 ccaagaccag cagctcctac ggggacccca gatccctgcc ctggagagcc cctgtgagag 1620 tgggtgtggg gacatggatg aggaccccag ctgcccgcag ctccctccca gagactctcc 1680 caggtgcctc atgctggcct tgtcacaaag cgacccaact tttggcaaga agagctttga 1740 gcagaccttg acagtggagc tctgtggcac agcaggactc accccaccca ccacaccacc 1800 gtacaagccc acagaggagg atcccttcaa accagacatc aagcatagtc taggcaaaga 1860 aatagctctc agcctcccct cccctgaggg cctctcactc aaggccaccc caggggctgc 1920 ccacaagctg ccaaagaagc acccagagcg aagtgagctc ctgtcccacc tgcgacatgc 1980 cacagcccag ccagcctccc aggctggcca gaagcgtccc ttctcctgtt cctttggaga 2040 ccatgactac tgccaggtgc tccgaccaga aggcgtcctg caaaggaagg tgctgaggtc 2100 ctgggagccg tctggggttc accttgagga ctggccccag cagggtgccc cttgggctga 2160 ggcacaggcc cctggcaggg aggaagacag aagctgtgat gctggcgccc cacccaagga 2220 cagcacgctg ctgagagacc atgagatccg tgccagcctc accaaacact ttgggctgct 2280 ggagaccgcc ctggaggagg aagacctggc ctcctgcaag agccctgagt atgacactgt 2340 ctttgaagac agcagcagca gcagcggcga gagcagcttc ctcccagagg aggaagagga 2400 agaaggggag gaggaggagg aggacgatga agaagaggac tcaggggtca gccccacttg 2460 ctctgaccac tgcccctacc agagcccacc aagcaaggcc aaccggcagc tctgttcccg 2520 cagccgctca agctctggct cttcaccctg ccactcctgg tcaccagcca ctcgaaggaa 2580 cttcagatgt gagagcagag ggccgtgttc agacagaacg ccaagcatcc ggcacgccag 2640 gaagcggcgg gaaaaggcca ttggggaagg ccgcgtggtg tacattcaaa atctctccag 2700 cgacatgagc tcccgagagc tgaagaggcg ctttgaagtg tttggtgaga ttgaggagtg 2760 cgaggtgctg acaagaaata ggagaggcga gaagtacggc ttcatcacct accggtgttc 2820 tgagcacgcg gccctctctt tgacaaaggg cgctgccctg aggaagcgca acgagccctc 2880 cttccagctg agctacggag ggctccggca cttctgctgg cccagataca ctgactacgg 2940 taagcccctg aaacccagcc acagtctagt aagactcaaa gcttgggaag cagtgccttc 3000 cttgaacaaa acccagagct aaagcgcctt gtggacatag cttccatccc cacaccccag 3060 tgtgctgctt ggtataactt tgcagccact ttgcctgaag actaccatcc tgtttctctt 3120 ctggcctctg gtccacctta tcctgtcctg tgactgctac caaagagaat ccagcctccc 3180 acggcctcta ggaagattca gtcatgtgca cagccagctg gcagaaccgt ggctacggtc 3240 tccttgactt cacagggcca gctgctaccc tgtccccttc aggggcattc cgtggtgacc 3300 ccagacaagg cagcagccac ctggggacaa gatgatgaag aaggacaaag aagtacaatg 3360 tacgaaagaa ttacttggcc aggctcagtg gctcatgcct gtaatcccat caccttggga 3420 ggctgaggca agaggatcac ttgagcccag gagttcgaga ccagcttggg caacatagtg 3480 aaatcctgtc tctacaaaaa atataaaaat tagccaggca tggtggcttg cgcctatagt 3540 cccagctact caggaggcag aggtgggagg atcacctgaa cccaagaggt tggagctgca 3600 gtgagccatg atggcactac tgcattccag cctgggcaac agagcaagac cctgtctcaa 3660 aaggaaaaaa aaaaaaaaa 3679 4 996 PRT Homo sapiens 4 Met Gly Val Tyr Lys Gly Gly Gly Ser Gly Glu Glu Gln Leu Tyr Ala 1 5 10 15 Asp Phe Pro Glu Leu Asp Leu Ser Gln Leu Asp Ala Ser Asp Phe Asp 20 25 30 Ser Ala Thr Cys Phe Gly Glu Leu Gln Trp Cys Pro Glu Asn Ser Glu 35 40 45 Thr Glu Pro Asn Gln Tyr Ser Pro Asp Asp Ser Glu Leu Phe Gln Ile 50 55 60 Asp Ser Glu Asn Glu Ala Leu Leu Ala Glu Leu Thr Lys Thr Leu Asp 65 70 75 80 Asp Ile Pro Glu Asp Asp Val Gly Leu Ala Ala Phe Pro Ala Leu Asp 85 90 95 Gly Gly Asp Ala Leu Ser Cys Thr Ser Ala Ser Pro Ala Pro Ser Ser 100 105 110 Ala Pro Pro Ser Pro Ala Pro Glu Lys Pro Ser Ala Pro Ala Pro Glu 115 120 125 Val Asp Glu Leu Ser Leu Leu Gln Lys Leu Leu Leu Ala Thr Ser Tyr 130 135 140 Pro Thr Ser Ser Ser Asp Thr Gln Lys Glu Gly Thr Ala Trp Arg Gln 145 150 155 160 Ala Gly Leu Arg Ser Lys Ser Gln Arg Pro Cys Val Lys Ala Asp Ser 165 170 175 Thr Gln Asp Lys Lys Ala Pro Met Met Gln Ser Gln Ser Arg Ser Cys 180 185 190 Thr Glu Leu His Lys His Leu Thr Ser Ala Gln Cys Cys Leu Gln Asp 195 200 205 Arg Gly Leu Gln Pro Pro Cys Leu Gln Ser Pro Arg Leu Pro Ala Lys 210 215 220 Glu Asp Lys Glu Pro Gly Glu Asp Cys Pro Ser Pro Gln Pro Ala Pro 225 230 235 240 Ala Ser Pro Gln Asp Ser Leu Ala Leu Gly Arg Ala Asp Pro Gly Ala 245 250 255 Pro Val Ser Gln Glu Asp Met Gln Ala Met Val Gln Leu Ile Arg Tyr 260 265 270 Met His Thr Tyr Cys Leu Pro Gln Arg Lys Leu Pro Pro Gln Thr Pro 275 280 285 Glu Pro Leu Pro Lys Ala Cys Ser Asn Pro Ser Gln Gln Val Arg Ser 290 295 300 Arg Pro Trp Ser Arg His His Ser Lys Ala Ser Trp Ala Glu Phe Ser 305 310 315 320 Ile Leu Arg Glu Leu Leu Ala Gln Asp Val Leu Cys Asp Val Ser Lys 325 330 335 Pro Tyr Arg Leu Ala Thr Pro Val Tyr Ala Ser Leu Thr Pro Arg Ser 340 345 350 Arg Pro Arg Pro Pro Lys Asp Ser Gln Ala Ser Pro Gly Arg Pro Ser 355 360 365 Ser Val Glu Glu Val Arg Ile Ala Ala Ser Pro Lys Ser Thr Gly Pro 370 375 380 Arg Pro Ser Leu Arg Pro Leu Arg Leu Glu Val Lys Arg Glu Val Arg 385 390 395 400 Arg Pro Ala Arg Leu Gln Gln Gln Glu Glu Glu Asp Glu Glu Glu Glu 405 410 415 Glu Glu Glu Glu Glu Glu Glu Lys Glu Glu Glu Glu Glu Trp Gly Arg 420 425 430 Lys Arg Pro Gly Arg Gly Leu Pro Trp Thr Lys Leu Gly Arg Lys Leu 435 440 445 Glu Ser Ser Val Cys Pro Val Arg Arg Ser Arg Arg Leu Asn Pro Glu 450 455 460 Leu Gly Pro Trp Leu Thr Phe Ala Asp Glu Pro Leu Val Pro Ser Glu 465 470 475 480 Pro Gln Gly Ala Leu Pro Ser Leu Cys Leu Ala Pro Lys Ala Tyr Asp 485 490 495 Val Glu Arg Glu Leu Gly Ser Pro Thr Asp Glu Asp Ser Gly Gln Asp 500 505 510 Gln Gln Leu Leu Arg Gly Pro Gln Ile Pro Ala Leu Glu Ser Pro Cys 515 520 525 Glu Ser Gly Cys Gly Asp Met Asp Glu Asp Pro Ser Cys Pro Gln Leu 530 535 540 Pro Pro Arg Asp Ser Pro Arg Cys Leu Met Leu Ala Leu Ser Gln Ser 545 550 555 560 Asp Pro Thr Phe Gly Lys Lys Ser Phe Glu Gln Thr Leu Thr Val Glu 565 570 575 Leu Cys Gly Thr Ala Gly Leu Thr Pro Pro Thr Thr Pro Pro Tyr Lys 580 585 590 Pro Thr Glu Glu Asp Pro Phe Lys Pro Asp Ile Lys His Ser Leu Gly 595 600 605 Lys Glu Ile Ala Leu Ser Leu Pro Ser Pro Glu Gly Leu Ser Leu Lys 610 615 620 Ala Thr Pro Gly Ala Ala His Lys Leu Pro Lys Lys His Pro Glu Arg 625 630 635 640 Ser Glu Leu Leu Ser His Leu Arg His Ala Thr Ala Gln Pro Ala Ser 645 650 655 Gln Ala Gly Gln Lys Arg Pro Phe Ser Cys Ser Phe Gly Asp His Asp 660 665 670 Tyr Cys Gln Val Leu Arg Pro Glu Gly Val Leu Gln Arg Lys Val Leu 675 680 685 Arg Ser Trp Glu Pro Ser Gly Val His Leu Glu Asp Trp Pro Gln Gln 690 695 700 Gly Ala Pro Trp Ala Glu Ala Gln Ala Pro Gly Arg Glu Glu Asp Arg 705 710 715 720 Ser Cys Asp Ala Gly Ala Pro Pro Lys Asp Ser Thr Leu Leu Arg Asp 725 730 735 His Glu Ile Arg Ala Ser Leu Thr Lys His Phe Gly Leu Leu Glu Thr 740 745 750 Ala Leu Glu Glu Glu Asp Leu Ala Ser Cys Lys Ser Pro Glu Tyr Asp 755 760 765 Thr Val Phe Glu Asp Ser Ser Ser Ser Ser Gly Glu Ser Ser Phe Leu 770 775 780 Pro Glu Glu Glu Glu Glu Glu Gly Glu Glu Glu Glu Glu Asp Asp Glu 785 790 795 800 Glu Glu Asp Ser Gly Val Ser Pro Thr Cys Ser Asp His Cys Pro Tyr 805 810 815 Gln Ser Pro Pro Ser Lys Ala Asn Arg Gln Leu Cys Ser Arg Ser Arg 820 825 830 Ser Ser Ser Gly Ser Ser Pro Cys His Ser Trp Ser Pro Ala Thr Arg 835 840 845 Arg Asn Phe Arg Cys Glu Ser Arg Gly Pro Cys Ser Asp Arg Thr Pro 850 855 860 Ser Ile Arg His Ala Arg Lys Arg Arg Glu Lys Ala Ile Gly Glu Gly 865 870 875 880 Arg Val Val Tyr Ile Gln Asn Leu Ser Ser Asp Met Ser Ser Arg Glu 885 890 895 Leu Lys Arg Arg Phe Glu Val Phe Gly Glu Ile Glu Glu Cys Glu Val 900 905 910 Leu Thr Arg Asn Arg Arg Gly Glu Lys Tyr Gly Phe Ile Thr Tyr Arg 915 920 925 Cys Ser Glu His Ala Ala Leu Ser Leu Thr Lys Gly Ala Ala Leu Arg 930 935 940 Lys Arg Asn Glu Pro Ser Phe Gln Leu Ser Tyr Gly Gly Leu Arg His 945 950 955 960 Phe Cys Trp Pro Arg Tyr Thr Asp Tyr Gly Lys Pro Leu Lys Pro Ser 965 970 975 His Ser Leu Val Arg Leu Lys Ala Trp Glu Ala Val Pro Ser Leu Asn 980 985 990 Lys Thr Gln Ser 995 5 1496 DNA Homo sapiens 5 gtcacaaagc gacccaactt ttggcaagaa gagctttgag cagaccttga cagtggagct 60 ctgtggcaca gcaggactca ccccacccac cacaccaccg tacaagccca cagaggagga 120 tcccttcaaa ccagacatca agcatagtct aggcaaagaa atagctctca gcctcccctc 180 ccctgagggc ctctcactca aggccacccc aggggctgcc cacaagctgc caaagaagca 240 cccagagcga agtgagctcc tgtcccacct gcgacatgcc acagcccagc cagcctccca 300 ggctggccag aagcgtccct tctcctgttc ctttggagac catgactact gccaggtgct 360 ccgaccagaa ggcgtcctgc aaaggaaggt gctgaggtcc tgggagccgt ctggggttca 420 ccttgaggac tggccccagc agggtgcccc ttgggctgag gcacaggccc ctggcaggga 480 ggaagacaga agctgtgatg ctggcgcccc acccaaggac agcacgctgc tgagagacca 540 tgagatccgt gccagcctca ccaaacactt tgggctgctg gagaccgccc tggaggagga 600 agacctggcc tcctgcaaga gccctgagta tgacactgtc tttgaagaca gcagcagcag 660 cagcggcgag agcagcttcc tcccagagga ggaagaggaa gaaggggagg aggaggagga 720 ggacgatgaa gaagaggact caggggtcag ccccacttgc tctgaccact gcccctacca 780 gagcccacca agcaaggcca accggcagct ctgttcccgc agccgctcaa gctctggctc 840 ttcaccctgc cactcctggt caccagccac tcgaaggaac ttcagcagat gtgagagcag 900 agggccgtgt tcagacagaa cgccaagcat ccggcacgcc aggaagcggc gggaaaaggc 960 cattggggaa ggccgcgtgg tgtacattca aaatctctcc agcgacatga gctcccgaga 1020 gctgaagagg cgctttgaag tgtttggtga gattgaggag tgcgaggtgc tgacaagaaa 1080 taggagaggc gagaagtacg gcttcatcac ctaccggtgt tctgagcacg cggccctctc 1140 tttgacaaag ggcgctgccc tgaggaagcg caacgagccc tccttccagc tgagctacgg 1200 agggctccgg cacttctgct ggcccagata cactgactac gattccaatt cagaagaggc 1260 ccttcctgcg tcagggaaaa gcaagtatga agccatggat tttgacagct tactgaaaga 1320 ggcccagcag agcctgcatt gataacagcc ttaaccctcg aggaatacct caatacctca 1380 gacaaggccc ttccaatatg tttacgtttt caaagaaatc aagtatatga ggagagcgag 1440 cgagcgtgag agaacacccg tgagagagac ttgaaactgc tgtcctaaaa aaaaaa 1496 6 172 DNA Homo sapiens 6 actccgccgc acgctgcagc cgcggctgga agatggcggg gaacgactgc ggcgcgctgc 60 tggacgaaga gctctcctcc ttcttcctca actatctcgc tgacacgcag ggtggagggt 120 ccggggagga gcaactctat gctgactttc cagaacttga cctctcccag ct 172 7 3267 DNA Homo sapiens 7 actccgccgc acgctgcagc cgcggctgga agatggcggg gaacgactgc ggcgcgctgc 60 tggacgaaga gctctcctcc ttcttcctca actatctcgc tgacacgcag ggtggagggt 120 ccggggagga gcaactctat gctgactttc cagaacttga cctctcccag ctggatgcca 180 gcgactttga ctcggccacc tgctttgggg agctgcagtg gtgcccagag aactcagaga 240 ctgaacccaa ccagtacagc cccgatgact ccgagctctt ccagattgac agtgagaatg 300 aggccctcct ggcagagctc accaagaccc tggatgacat ccctgaagat gacgtgggtc 360 tggctgcctt cccagccctg gatggtggag acgctctatc atgcacctca gcttcgcctg 420 ccccctcatc tgcacccccc agccctgccc cggagaagcc ctcggcccca gcccctgagg 480 tggacgagct ctcactgctg cagaagctcc tcctggccac atcctaccca acatcaagct 540 ctgacaccca gaaggaaggg accgcctggc gccaggcagg cctcagatct aaaagtcaac 600 ggccttgtgt taaggcggac agcacccaag acaagaaggc tcccatgatg cagtctcaga 660 gccgaagttg tacagaacta cataagcacc tcacctcggc acagtgctgc ctgcaggatc 720 ggggtctgca gccaccatgc ctccagagtc cccggctccc tgccaaggag gacaaggagc 780 cgggtgagga ctgcccgagc ccccagccag ctccagcctc tccccaggac tccctagctc 840 tgggcagggc agaccccggt gccccggttt cccaggaaga catgcaggcg atggtgcaac 900 tcatacgcta catgcacacc tactgcctcc cccagaggaa gctgccccca cagacccctg 960 agccactccc caaggcctgc agcaacccct cccagcaggt cagatcccgg ccctggtccc 1020 ggcaccactc caaagcctcc tgggctgagt tctccattct gagggaactt ctggctcaag 1080 acgtgctctg tgatgtcagc aaaccctacc gtctggccac gcctgtttat gcctccctca 1140 cacctcggtc aaggcccagg ccccccaaag acagtcaggc ctcccctggt cgcccgtcct 1200 cggtggagga ggtaaggatc gcagcttcac ccaagagcac cgggcccaga ccaagcctgc 1260 gcccactgcg gctggaggtg aaaagggagg tccgccggcc tgccagactg cagcagcagg 1320 aggaggaaga cgaggaagaa gaggaggagg aagaggaaga agaaaaagag gaggaggagg 1380 agtggggcag gaaaaggcca ggccgaggcc tgccatggac gaagctgggg aggaagctgg 1440 agagctctgt gtgccccgtg cggcgttctc ggagactgaa ccctgagctg ggcccctggc 1500 tgacatttgc agatgagccg ctggtcccct cggagcccca aggtgctctg ccctcactgt 1560 gcctggctcc caaggcctac gacgtagagc gggagctggg cagccccacg gacgaggaca 1620 gtggccaaga ccagcagctc ctacggggac cccagatccc tgccctggag agcccctgtg 1680 agagtgggtg tggggacatg gatgaggacc ccagctgccc gcagctccct cccagagact 1740 ctcccaggtg cctcatgctg gccttgtcac aaagcgaccc aacttttggc aagaagagct 1800 ttgagcagac cttgacagtg gagctctgtg gcacagcagg actcacccca cccaccacac 1860 caccgtacaa gcccacagag gaggatccct tcaaaccaga catcaagcat agtctaggca 1920 aagaaatagc tctcagcctc ccctcccctg agggcctctc actcaaggcc accccagggg 1980 ctgcccacaa gctgccaaag aagcacccag agcgaagtga gctcctgtcc cacctgcgac 2040 atgccacagc ccagccagcc tcccaggctg gccagaagcg tcccttctcc tgttcctttg 2100 gagaccatga ctactgccag gtgctccgac cagaaggcgt cctgcaaagg aaggtgctga 2160 ggtcctggga gccgtctggg gttcaccttg aggactggcc ccagcagggt gccccttggg 2220 ctgaggcaca ggcccctggc agggaggaag acagaagctg tgatgctggt gccccaccca 2280 aggacagcac gctgctgaga gaccatgaga tccgtgctag cctcaccaaa cactttgggc 2340 tgctggagac cgccctggag gaggaagacc tggcctcctg caagagccct gagtatgaca 2400 ctgtctttga agacagcagc agcagcagcg gcgagagcag cttcctccca gaggaggaag 2460 aggaagaagg ggaggaggag gaggaggacg atgaagaaga ggactcaggg gtcagcccca 2520 cttgctctga ccactgcccc taccagagcc caccaagcaa ggccaaccgg cagctctgtt 2580 cccgcagccg ctcaagctct ggctcttcac cctgccactc ctggtcacca gccactcgaa 2640 ggaacttcag atgtgagagc agagggccgt gttcagacag aacgccaagc atccggcacg 2700 ccaggaagcg gcgggaaaag gccattgggg aaggccgcgt ggtgtacatt caaaatctct 2760 ccagcgacat gagctcccga gagctgaaga ggcgctttga agtgtttggt gagattgagg 2820 agtgcgaggt gctgacaaga aataggagag gcgagaagta cggcttcatc acctaccggt 2880 gttctgagca cgcggccctc tctttgacaa agggcgctgc cctgaggaag cgcaacgagc 2940 cctccttcca gctgagctac ggagggctcc ggcacttctg ctggcccaga tacactgact 3000 acgattccaa ttcagaagag gcccttcctg cgtcagggaa aagcaagtat gaagccatgg 3060 attttgacag cttactgaaa gaggcccagc agagcctgca ttgataacag ccttaaccct 3120 cgaggaatac ctcaatacct cagacaaggc ccttccaata tgtttacgtt ttcaaagaaa 3180 tcaagtatat gaggagagcg agcgagcgtg agagaacacc cgtgagagag acttgaaact 3240 gctgtccttt aaaaaaaaaa aaaaaaa 3267 8 1023 PRT Homo sapiens 8 Met Ala Gly Asn Asp Cys Gly Ala Leu Leu Asp Glu Glu Leu Ser Ser 1 5 10 15 Phe Phe Leu Asn Tyr Leu Ala Asp Thr Gln Gly Gly Gly Ser Gly Glu 20 25 30 Glu Gln Leu Tyr Ala Asp Phe Pro Glu Leu Asp Leu Ser Gln Leu Asp 35 40 45 Ala Ser Asp Phe Asp Ser Ala Thr Cys Phe Gly Glu Leu Gln Trp Cys 50 55 60 Pro Glu Asn Ser Glu Thr Glu Pro Asn Gln Tyr Ser Pro Asp Asp Ser 65 70 75 80 Glu Leu Phe Gln Ile Asp Ser Glu Asn Glu Ala Leu Leu Ala Glu Leu 85 90 95 Thr Lys Thr Leu Asp Asp Ile Pro Glu Asp Asp Val Gly Leu Ala Ala 100 105 110 Phe Pro Ala Leu Asp Gly Gly Asp Ala Leu Ser Cys Thr Ser Ala Ser 115 120 125 Pro Ala Pro Ser Ser Ala Pro Pro Ser Pro Ala Pro Glu Lys Pro Ser 130 135 140 Ala Pro Ala Pro Glu Val Asp Glu Leu Ser Leu Leu Gln Lys Leu Leu 145 150 155 160 Leu Ala Thr Ser Tyr Pro Thr Ser Ser Ser Asp Thr Gln Lys Glu Gly 165 170 175 Thr Ala Trp Arg Gln Ala Gly Leu Arg Ser Lys Ser Gln Arg Pro Cys 180 185 190 Val Lys Ala Asp Ser Thr Gln Asp Lys Lys Ala Pro Met Met Gln Ser 195 200 205 Gln Ser Arg Ser Cys Thr Glu Leu His Lys His Leu Thr Ser Ala Gln 210 215 220 Cys Cys Leu Gln Asp Arg Gly Leu Gln Pro Pro Cys Leu Gln Ser Pro 225 230 235 240 Arg Leu Pro Ala Lys Glu Asp Lys Glu Pro Gly Glu Asp Cys Pro Ser 245 250 255 Pro Gln Pro Ala Pro Ala Ser Pro Gln Asp Ser Leu Ala Leu Gly Arg 260 265 270 Ala Asp Pro Gly Ala Pro Val Ser Gln Glu Asp Met Gln Ala Met Val 275 280 285 Gln Leu Ile Arg Tyr Met His Thr Tyr Cys Leu Pro Gln Arg Lys Leu 290 295 300 Pro Pro Gln Thr Pro Glu Pro Leu Pro Lys Ala Cys Ser Asn Pro Ser 305 310 315 320 Gln Gln Val Arg Ser Arg Pro Trp Ser Arg His His Ser Lys Ala Ser 325 330 335 Trp Ala Glu Phe Ser Ile Leu Arg Glu Leu Leu Ala Gln Asp Val Leu 340 345 350 Cys Asp Val Ser Lys Pro Tyr Arg Leu Ala Thr Pro Val Tyr Ala Ser 355 360 365 Leu Thr Pro Arg Ser Arg Pro Arg Pro Pro Lys Asp Ser Gln Ala Ser 370 375 380 Pro Gly Arg Pro Ser Ser Val Glu Glu Val Arg Ile Ala Ala Ser Pro 385 390 395 400 Lys Ser Thr Gly Pro Arg Pro Ser Leu Arg Pro Leu Arg Leu Glu Val 405 410 415 Lys Arg Glu Val Arg Arg Pro Ala Arg Leu Gln Gln Gln Glu Glu Glu 420 425 430 Asp Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Lys Glu Glu Glu 435 440 445 Glu Glu Trp Gly Arg Lys Arg Pro Gly Arg Gly Leu Pro Trp Thr Lys 450 455 460 Leu Gly Arg Lys Leu Glu Ser Ser Val Cys Pro Val Arg Arg Ser Arg 465 470 475 480 Arg Leu Asn Pro Glu Leu Gly Pro Trp Leu Thr Phe Ala Asp Glu Pro 485 490 495 Leu Val Pro Ser Glu Pro Gln Gly Ala Leu Pro Ser Leu Cys Leu Ala 500 505 510 Pro Lys Ala Tyr Asp Val Glu Arg Glu Leu Gly Ser Pro Thr Asp Glu 515 520 525 Asp Ser Gly Gln Asp Gln Gln Leu Leu Arg Gly Pro Gln Ile Pro Ala 530 535 540 Leu Glu Ser Pro Cys Glu Ser Gly Cys Gly Asp Met Asp Glu Asp Pro 545 550 555 560 Ser Cys Pro Gln Leu Pro Pro Arg Asp Ser Pro Arg Cys Leu Met Leu 565 570 575 Ala Leu Ser Gln Ser Asp Pro Thr Phe Gly Lys Lys Ser Phe Glu Gln 580 585 590 Thr Leu Thr Val Glu Leu Cys Gly Thr Ala Gly Leu Thr Pro Pro Thr 595 600 605 Thr Pro Pro Tyr Lys Pro Thr Glu Glu Asp Pro Phe Lys Pro Asp Ile 610 615 620 Lys His Ser Leu Gly Lys Glu Ile Ala Leu Ser Leu Pro Ser Pro Glu 625 630 635 640 Gly Leu Ser Leu Lys Ala Thr Pro Gly Ala Ala His Lys Leu Pro Lys 645 650 655 Lys His Pro Glu Arg Ser Glu Leu Leu Ser His Leu Arg His Ala Thr 660 665 670 Ala Gln Pro Ala Ser Gln Ala Gly Gln Lys Arg Pro Phe Ser Cys Ser 675 680 685 Phe Gly Asp His Asp Tyr Cys Gln Val Leu Arg Pro Glu Gly Val Leu 690 695 700 Gln Arg Lys Val Leu Arg Ser Trp Glu Pro Ser Gly Val His Leu Glu 705 710 715 720 Asp Trp Pro Gln Gln Gly Ala Pro Trp Ala Glu Ala Gln Ala Pro Gly 725 730 735 Arg Glu Glu Asp Arg Ser Cys Asp Ala Gly Ala Pro Pro Lys Asp Ser 740 745 750 Thr Leu Leu Arg Asp His Glu Ile Arg Ala Ser Leu Thr Lys His Phe 755 760 765 Gly Leu Leu Glu Thr Ala Leu Glu Glu Glu Asp Leu Ala Ser Cys Lys 770 775 780 Ser Pro Glu Tyr Asp Thr Val Phe Glu Asp Ser Ser Ser Ser Ser Gly 785 790 795 800 Glu Ser Ser Phe Leu Pro Glu Glu Glu Glu Glu Glu Gly Glu Glu Glu 805 810 815 Glu Glu Asp Asp Glu Glu Glu Asp Ser Gly Val Ser Pro Thr Cys Ser 820 825 830 Asp His Cys Pro Tyr Gln Ser Pro Pro Ser Lys Ala Asn Arg Gln Leu 835 840 845 Cys Ser Arg Ser Arg Ser Ser Ser Gly Ser Ser Pro Cys His Ser Trp 850 855 860 Ser Pro Ala Thr Arg Arg Asn Phe Arg Cys Glu Ser Arg Gly Pro Cys 865 870 875 880 Ser Asp Arg Thr Pro Ser Ile Arg His Ala Arg Lys Arg Arg Glu Lys 885 890 895 Ala Ile Gly Glu Gly Arg Val Val Tyr Ile Gln Asn Leu Ser Ser Asp 900 905 910 Met Ser Ser Arg Glu Leu Lys Arg Arg Phe Glu Val Phe Gly Glu Ile 915 920 925 Glu Glu Cys Glu Val Leu Thr Arg Asn Arg Arg Gly Glu Lys Tyr Gly 930 935 940 Phe Ile Thr Tyr Arg Cys Ser Glu His Ala Ala Leu Ser Leu Thr Lys 945 950 955 960 Gly Ala Ala Leu Arg Lys Arg Asn Glu Pro Ser Phe Gln Leu Ser Tyr 965 970 975 Gly Gly Leu Arg His Phe Cys Trp Pro Arg Tyr Thr Asp Tyr Asp Ser 980 985 990 Asn Ser Glu Glu Ala Leu Pro Ala Ser Gly Lys Ser Lys Tyr Glu Ala 995 1000 1005 Met Asp Phe Asp Ser Leu Leu Lys Glu Ala Gln Gln Ser Leu His 1010 1015 1020 9 3163 DNA Rattus norvegicus 9 atgcaggggg aagggaaggg tggggagtct ggagaggaac agttatgtgc tgacttgcca 60 gagctcgacc tctcccagct ggatgccagt gacttcgact cagccacgtg ctttggggag 120 ctgcagtggt gcccggagac ctcagagaca gagcccagcc agtacagccc tgatgattcc 180 gagttcttcc agattgacag tgagaatgaa gctctcttgg ctgcgcttac caagaccctg 240 gatgacatcc ccgaagacga tgtggggctg gctgccttcc caggactgga tgaaggcgac 300 acaccctcct gcaccccagc ttcacctgct cctttatctg tgccccccag ccccgccttg 360 gagaggcttc tgtccccagt gtctgaagtg gatgagcttt cactgctgca gaagctcctc 420 ctggccacat cctccccaac agcaagctct gatgctctga aggacggggc cacctggtcg 480 cagaccagcc tcagttccag aagtcagcgg ccttgtgtca aggtggatgg cacccaggac 540 aagaagaccc ccatgctacg gtctcagagc cggccttgta cagaactgca taagcacctc 600 acttcggtgc tgccctgccc caggggaaaa gcctgttccc cacctcccca cccaagtcct 660 cagctcctct ccaaagagga tgaggaggtg ggagaggatt gcccaagccc ctggccagct 720 ccagcgtctc cccaagactc actaggacag gacacggcca accccaacag tgcccaagtt 780 cccaaggacg acgtgagggc catggtacag ctcattcgct acatgcatac ctactgcctg 840 cctcagagga agctgcccca acgggcctca gagccaatcc cccagtcctg cagcagcccc 900 ttgaggaagg tcccaccccg atcccggcaa acccccaaag ccttctggac tgagttctcc 960 atcctaaggg aacttctggc ccaagatatc ctctgtgatg ttagcaagcc ctaccgcctg 1020 gccacacctg tctatgcttc tctcacaccc cagtccagaa ccaggccccc caaagacagt 1080 caggcctccc ctgcccactc tgccatggca gaagaggtga gaatcactgc ttcccccaag 1140 agcactggac ctagacccag cctccgtcct ctgaggctag aggtgaaacg ggatgtcaac 1200 aagcctgcaa ggcaaaagcg ggaggaagat gaggaggagg aagaggaaga agaagaggaa 1260 gaagaaaaag aggatgaaga agaggagtgg ggcaggaaga gaccaggtcg tggcctgcca 1320 tggaccaaac tagggaggaa gatggacagc tctgtgtgcc ctgtgcggcg ctccaggaga 1380 ctgaatccag agctgggccc ttggctgaca ttcactgatg agcccctagg tgctctaccc 1440 tcgatgtgcc tggctacaga gacccacgac ctggaagaag agctgggcgg cctcacagac 1500 agtagtcaag gccagcagct ccccctggga tcccagatcc ccaccctgga aagcccctgt 1560 gaaagtgggt gtggggacac agatgaagat ccaagctgcc cgcggccccc ttccagagac 1620 tcccccaggt gcctcatgct ggccttgtca caaagtgacc ctcttggcaa gaagagcttt 1680 gaggagtcct tgacagtgga gctttgtggc acagcaggac tcactccacc caccacacct 1740 ccatataagc ccatggagga ggaccccttc aagcaggaca ccaagcacag cccaggccaa 1800 gacacagctc ccagcctccc ttcccctgag actcttcagc tcacagccac cccaggggct 1860 tcccacaagc tgcccaagag gcacccggag cgaagtgagc tcctgtctca tctgcaacat 1920 gccacaaccc agccagtctc acaggctggc cagaagcgtc ccttctcctg ctcctttgga 1980 gaccatgact actgccaggt gatcaggcca gaggctgccc tgcagaggaa ggtgctgcgg 2040 tcctgggagc caatcaaggt ccaccttgaa gacttggccc accagggtgc aaccctgcca 2100 gtggaaacaa agacccctag gagggaggca gaccagaact gtgaccccac ccccaaggac 2160 agcatgcagc taagagacca tgagatccgt gccagcctca caaagcactt tgggctgctg 2220 gaaaccgctc tggaggagga agacttggct tcatgtaaaa gcccggagta tgacaccgta 2280 tttgaggaca gcagcagcag cagtggcgag agcagcttcc tgctagagga ggaggaagag 2340 gagggagggg aagaggacga tgaaggagag gactcagggg tcagccctcc ctgctccgac 2400 cactgcccct accagagccc acccagtaag gccagtcggc agctctgttc ccgaagccgc 2460 tccagttctg gctcctcatc ctgtagctcc tggtcaccag ctacccggaa gaacttcaga 2520 cttgagagca gagggccctg ttcagatgga accccaagcg cccggcatgc caagaagcgg 2580 cgggaaaagg ccatcggtga aggtcgtgtg gtatacatcc gaaatctctc cggtgacatg 2640 agctctcgag aactaaagaa gcgcttcgag gtgtttggtg agatagtcga gtgccaggtg 2700 ctgaggagaa gtaagagagg ccagaagcac ggttttatta ccttccggtg ttcggagcat 2760 gccgccctgt ccgtgaggaa cggcgctacc ctgagaaaac gcaatgagcc ctccttccac 2820 ctgagctatg gagggctccg gcacttccgc tggcccagat acaccgacta tgatcccacg 2880 tctgaagagt cccttccctc gtctgggaaa agcaagtacg aagccatgga ttttgacagc 2940 ttactgaaag aggcccagca gagcctgcat taatatcagc cttaaccttc gaggaatacc 3000 tcaatacctc agacaaggcc cttccaatat gtttacgttt tcaaagaaat gagtatatga 3060 ggaggagagc aagccaatga gcgagcgagc gagcgagcgt gagagaacac acaggagaga 3120 gagacttgaa tctgctgtcg tttcctttaa aaaaaaaaaa aaa 3163 10 990 PRT Rattus norvegicus 10 Met Gln Gly Glu Gly Lys Gly Gly Glu Ser Gly Glu Glu Gln Leu Cys 1 5 10 15 Ala Asp Leu Pro Glu Leu Asp Leu Ser Gln Leu Asp Ala Ser Asp Phe 20 25 30 Asp Ser Ala Thr Cys Phe Gly Glu Leu Gln Trp Cys Pro Glu Thr Ser 35 40 45 Glu Thr Glu Pro Ser Gln Tyr Ser Pro Asp Asp Ser Glu Phe Phe Gln 50 55 60 Ile Asp Ser Glu Asn Glu Ala Leu Leu Ala Ala Leu Thr Lys Thr Leu 65 70 75 80 Asp Asp Ile Pro Glu Asp Asp Val Gly Leu Ala Ala Phe Pro Gly Leu 85 90 95 Asp Glu Gly Asp Thr Pro Ser Cys Thr Pro Ala Ser Pro Ala Pro Leu 100 105 110 Ser Val Pro Pro Ser Pro Ala Leu Glu Arg Leu Leu Ser Pro Val Ser 115 120 125 Glu Val Asp Glu Leu Ser Leu Leu Gln Lys Leu Leu Leu Ala Thr Ser 130 135 140 Ser Pro Thr Ala Ser Ser Asp Ala Leu Lys Asp Gly Ala Thr Trp Ser 145 150 155 160 Gln Thr Ser Leu Ser Ser Arg Ser Gln Arg Pro Cys Val Lys Val Asp 165 170 175 Gly Thr Gln Asp Lys Lys Thr Pro Met Leu Arg Ser Gln Ser Arg Pro 180 185 190 Cys Thr Glu Leu His Lys His Leu Thr Ser Val Leu Pro Cys Pro Arg 195 200 205 Gly Lys Ala Cys Ser Pro Pro Pro His Pro Ser Pro Gln Leu Leu Ser 210 215 220 Lys Glu Asp Glu Glu Val Gly Glu Asp Cys Pro Ser Pro Trp Pro Ala 225 230 235 240 Pro Ala Ser Pro Gln Asp Ser Leu Gly Gln Asp Thr Ala Asn Pro Asn 245 250 255 Ser Ala Gln Val Pro Lys Asp Asp Val Arg Ala Met Val Gln Leu Ile 260 265 270 Arg Tyr Met His Thr Tyr Cys Leu Pro Gln Arg Lys Leu Pro Gln Arg 275 280 285 Ala Ser Glu Pro Ile Pro Gln Ser Cys Ser Ser Pro Leu Arg Lys Val 290 295 300 Pro Pro Arg Ser Arg Gln Thr Pro Lys Ala Phe Trp Thr Glu Phe Ser 305 310 315 320 Ile Leu Arg Glu Leu Leu Ala Gln Asp Ile Leu Cys Asp Val Ser Lys 325 330 335 Pro Tyr Arg Leu Ala Thr Pro Val Tyr Ala Ser Leu Thr Pro Gln Ser 340 345 350 Arg Thr Arg Pro Pro Lys Asp Ser Gln Ala Ser Pro Ala His Ser Ala 355 360 365 Met Ala Glu Glu Val Arg Ile Thr Ala Ser Pro Lys Ser Thr Gly Pro 370 375 380 Arg Pro Ser Leu Arg Pro Leu Arg Leu Glu Val Lys Arg Asp Val Asn 385 390 395 400 Lys Pro Ala Arg Gln Lys Arg Glu Glu Asp Glu Glu Glu Glu Glu Glu 405 410 415 Glu Glu Glu Glu Glu Glu Lys Glu Asp Glu Glu Glu Glu Trp Gly Arg 420 425 430 Lys Arg Pro Gly Arg Gly Leu Pro Trp Thr Lys Leu Gly Arg Lys Met 435 440 445 Asp Ser Ser Val Cys Pro Val Arg Arg Ser Arg Arg Leu Asn Pro Glu 450 455 460 Leu Gly Pro Trp Leu Thr Phe Thr Asp Glu Pro Leu Gly Ala Leu Pro 465 470 475 480 Ser Met Cys Leu Ala Thr Glu Thr His Asp Leu Glu Glu Glu Leu Gly 485 490 495 Gly Leu Thr Asp Ser Ser Gln Gly Gln Gln Leu Pro Leu Gly Ser Gln 500 505 510 Ile Pro Thr Leu Glu Ser Pro Cys Glu Ser Gly Cys Gly Asp Thr Asp 515 520 525 Glu Asp Pro Ser Cys Pro Arg Pro Pro Ser Arg Asp Ser Pro Arg Cys 530 535 540 Leu Met Leu Ala Leu Ser Gln Ser Asp Pro Leu Gly Lys Lys Ser Phe 545 550 555 560 Glu Glu Ser Leu Thr Val Glu Leu Cys Gly Thr Ala Gly Leu Thr Pro 565 570 575 Pro Thr Thr Pro Pro Tyr Lys Pro Met Glu Glu Asp Pro Phe Lys Gln 580 585 590 Asp Thr Lys His Ser Pro Gly Gln Asp Thr Ala Pro Ser Leu Pro Ser 595 600 605 Pro Glu Thr Leu Gln Leu Thr Ala Thr Pro Gly Ala Ser His Lys Leu 610 615 620 Pro Lys Arg His Pro Glu Arg Ser Glu Leu Leu Ser His Leu Gln His 625 630 635 640 Ala Thr Thr Gln Pro Val Ser Gln Ala Gly Gln Lys Arg Pro Phe Ser 645 650 655 Cys Ser Phe Gly Asp His Asp Tyr Cys Gln Val Ile Arg Pro Glu Ala 660 665 670 Ala Leu Gln Arg Lys Val Leu Arg Ser Trp Glu Pro Ile Lys Val His 675 680 685 Leu Glu Asp Leu Ala His Gln Gly Ala Thr Leu Pro Val Glu Thr Lys 690 695 700 Thr Pro Arg Arg Glu Ala Asp Gln Asn Cys Asp Pro Thr Pro Lys Asp 705 710 715 720 Ser Met Gln Leu Arg Asp His Glu Ile Arg Ala Ser Leu Thr Lys His 725 730 735 Phe Gly Leu Leu Glu Thr Ala Leu Glu Glu Glu Asp Leu Ala Ser Cys 740 745 750 Lys Ser Pro Glu Tyr Asp Thr Val Phe Glu Asp Ser Ser Ser Ser Ser 755 760 765 Gly Glu Ser Ser Phe Leu Leu Glu Glu Glu Glu Glu Glu Gly Gly Glu 770 775 780 Glu Asp Asp Glu Gly Glu Asp Ser Gly Val Ser Pro Pro Cys Ser Asp 785 790 795 800 His Cys Pro Tyr Gln Ser Pro Pro Ser Lys Ala Ser Arg Gln Leu Cys 805 810 815 Ser Arg Ser Arg Ser Ser Ser Gly Ser Ser Ser Cys Ser Ser Trp Ser 820 825 830 Pro Ala Thr Arg Lys Asn Phe Arg Leu Glu Ser Arg Gly Pro Cys Ser 835 840 845 Asp Gly Thr Pro Ser Ala Arg His Ala Lys Lys Arg Arg Glu Lys Ala 850 855 860 Ile Gly Glu Gly Arg Val Val Tyr Ile Arg Asn Leu Ser Gly Asp Met 865 870 875 880 Ser Ser Arg Glu Leu Lys Lys Arg Phe Glu Val Phe Gly Glu Ile Val 885 890 895 Glu Cys Gln Val Leu Arg Arg Ser Lys Arg Gly Gln Lys His Gly Phe 900 905 910 Ile Thr Phe Arg Cys Ser Glu His Ala Ala Leu Ser Val Arg Asn Gly 915 920 925 Ala Thr Leu Arg Lys Arg Asn Glu Pro Ser Phe His Leu Ser Tyr Gly 930 935 940 Gly Leu Arg His Phe Arg Trp Pro Arg Tyr Thr Asp Tyr Asp Pro Thr 945 950 955 960 Ser Glu Glu Ser Leu Pro Ser Ser Gly Lys Ser Lys Tyr Glu Ala Met 965 970 975 Asp Phe Asp Ser Leu Leu Lys Glu Ala Gln Gln Ser Leu His 980 985 990 11 20 DNA Artificial Sequence forward primer CME 9748 11 gtcacaaagc gacccaactt 20 12 22 DNA Artificial Sequence reverse primer CME 9749 12 gagtcatggt ctccaaagga ac 22 13 27 DNA Artificial Sequence AP1 adaptor primer 13 ccatcctaat acgactcact atagggc 27 14 22 DNA Artificial Sequence CME 9830 forward primer 14 gccactcgaa ggaacttcag at 22 15 22 DNA Artificial Sequence CME 9850 reverse primer B 15 gggttaaggc tgttatcaat gc 22 16 22 DNA Artificial Sequence CME 9831 reverse primer A 16 aggccagaag agaaacagga tg 22 17 22 DNA Artificial Sequence CME 9726 sequencing primer 17 cttctcctgt tcctttggag ac 22 18 20 DNA Artificial Sequence CME 9727 sequencing primer 18 tggggttcac ttgaggattg 20 19 22 DNA Artificial Sequence CME 9778 sequencing primer 19 attcaaaatc tcttccagcg ac 22 20 22 DNA Artificial Sequence CME 9776 sequencing primer 20 gaagacagaa gctgtgatgc tg 22 21 21 DNA Artificial SEquence SP1A primer 21 catcacagag cacgtcttga g 21 22 25 DNA Artificial Sequence SP2A primer 22 catgtagcgt atgagttgca ccatc 25 23 39 DNA Artificial Sequence Oligo d(T)-anchor primer 23 gaccacgcgt atcgatgtcg actttttttt ttttttttv 39 24 22 DNA Artificial Sequence PCR anchor primer 24 gaccacgcgt atcgatgtcg ac 22 25 29 DNA Artificial Sequence CVGI169 primer 25 ttgggtaacg ccagggtttt cccagtcac 29 26 29 DNA Artificial Sequence CVGI170 primer 26 ccccaggctt tacactttat gcttccggc 29 27 20 DNA Artificial Sequence CVGI171 primer 27 gccagtacag ccctgatgat 20 28 22 DNA Artificial Sequence CVGI172 primer 28 tccccagtgt ctgaagtgga tg 22 29 22 DNA Artificial Sequence CVGI281 primer 29 ctcattcgct acatgcatac ct 22 30 22 DNA Artificial Sequence CVGI282 primer 30 cggccttgtg tcaaggtgga tg 22 31 22 DNA Artificial Sequence CVGI283 primer 31 cttctggact gagttctcca tc 22 32 22 DNA Artificial Sequence CVGI390 primer 32 caggagactg aatccagagc tg 22 33 22 DNA Artificial Sequence CVGI391 primer 33 gacagtagtc aaggccagca gc 22 34 22 DNA Artificial Sequence CVGI457 primer 34 gagaccatga ctactgccag gt 22 35 22 DNA Artificial Sequence CVGI458 primer 35 accgctctgg aggaggaaga ct 22 36 22 DNA Artificial Sequence CVGI535 primer 36 ttaagcctta accctttgag ga 22 37 22 DNA Artificial Sequence CVGI536 37 ggcccagata caccgactat ga 22 38 20 DNA Artificial Sequence PGC-3 forward primer 38 tgctggccca gatacactga 20 39 21 DNA Artificial Sequence PGC-3 reverse primer 39 ggctgttatc aatgcaggct c 21 40 28 DNA Artificial Sequence PGC-3 probe 40 cgtcagggaa aagcaagtat gaagccat 28
Claims (23)
1. An isolated and purified polynucleotide comprising a nucleic acid sequence which encodes a polypeptide having at least about 90% homology to a member selected from any one of
(a) (SEQ ID NO:2, SEQ ID NO:2 positions 1-600, SEQ ID NO:2 positions 400-1002, and SEQ ID NO:2 positions 200-800)
or
(b) (SEQ ID NO:4, SEQ ID NO:4 positions 1-600, SEQ ID NO:4 positions 400-996, and SEQ ID NO:4 positions 200-800)
or
(c) (SEQ ID NO:8, SEQ ID NO:8 positions 1-600, SEQ ID NO:4 positions 400-1023, and SEQ ID NO:4 positions 200-800
2. A polynucleotide which comprises the human PGC-3a cDNA sequence set out in SEQ ID NO:1 or a fragment thereof consisting of at least 8 bases.
3. A polynuclotide which comprises the human PGC-3b cDNA sequence set out in SEQ ID:NO:3 or a fragment thereof consisting of at least 8 bases.
4. A polynucleotide which comprises the human PGC-3c cDNA sequence set out in SED ID NO:7 or a fragment thereof consisting of at least 8 bases.
5. A homologue or orthologue of a polynucleotide according to any preceeding claim and having greater than 80% sequence homology to the to the PGC-3a cDNA sequence as set out in SEQ ID NO:1.
6. A homologue or orthologue of a polynucleotide according to any preceeding claim and having greater than 80% sequence homology to the to the PGC-3b cDNA sequence as set out in SEQ ID NO:3.
7. A homologue or orthologue of a polynucleotide according to any preceeding claim and having greater than 80% sequence homology to the to the PGC-3c cDNA sequence as set out in SEQ ID NO:7.
8. An isolated and purified polynucleotide molecule comprising a nucleic acid sequence which encodes a polypeptide having at least about 90% homology to any one of SEQ ID NO:10 positions 1-600, SEQ ID NO:10 positions 400-990, and SEQ ID NO:10 positions 200-800.
9. An expression vector comprising a polynucleotide according to any preceeding claim.
10. A transformed host cell comprising a polynucleotide according to any preceeding claim.
11. A purified polypeptide comprising the human PGC-3a amino acid sequence set out in SEQ ID NO.2 or a variant of SEQ ID NO.2 having at least about 90% homology to a member selected from (SEQ ID NO.2 positions 1-600, SEQ ID NO.2 positions 400-1002, SEQ ID NO.2 positions 200-800), or a biologically active fragment thereof.
12. A purified polypeptide comprising the human PGC-3b amino acid sequence set out in SEQ ID NO.4 or a variant of SEQ ID NO.4 having at least about 90% homology to a member selected from (SEQ ID NO.4 positions 1-600, SEQ ID NO.4 positions 400-996, SEQ ID NO.4 positions 200-800), or a biologically active fragment thereof.
13. A purified polypeptide comprising the human PGC-3c amino acid sequence set out in SEQ ID NO.8 or a variant of SEQ ID NO.8 having at least about 90% homology to a member selected from (SEQ ID NO.8 positions 1-600, SEQ ID NO.8 positions 400-1023, SEQ ID NO.8 positions 200-800), or a biologically active fragment thereof.
14. A purified polypeptide comprising the rat PGC-3 amino acid sequence set out in SEQ ID NO.10 or a variant of SEQ ID NO.10 having at least about 90% homology to a member selected from (SEQ ID NO.10 positions 1-600, SEQ ID NO.8 positions 400-990, SEQ ID NO.10 positions 200-800), or a biologically active fragment thereof.
15. A dominant negative mutant of a polypeptide according to any one of claims 11-14
16. A dominant positive mutants of a polypeptide according to any one of claims 11-14.
17. Antibodies specific for a polypeptide according to any one of claims 11-15.
18. A method for identifying a therapeutic agent capable of modulating the activity of PGC-3 for use in the regulation of metabolism, which method comprises:
(i) contacting a candidate compound modulator with a PGC-3 polypeptide comprising any one of
(a) the amino acid sequence set out in SEQ ID NO.2 or a variant of SEQ ID NO.2 having at least about 90% homology to a member selected from (SEQ ID NO.2 positions 1-600, SEQ ID NO.2 positions 400-1002, SEQ ID NO.2 positions 200-800) or a biologically active fragment thereof;
or
(b) the amino acid sequence set out in SEQ ID NO.4 or a variant of SEQ ID NO.4 having at least about 90% homology to a member selected from (SEQ ID NO.4 positions 1-600, SEQ ID NO.4 positions 400-996, SEQ ID NO.4 positions 200-800) or a biologically active fragment thereof;
or
(c) the amino acid sequence set out in SEQ ID NO.8 or a variant of SEQ ID NO.8 having at least about 90% homology to a member selected from (SEQ ID NO.8 positions 1-600, SEQ ID NO.8 positions 400-996, SEQ ID NO.8 positions 200-800) or a biologically active fragment thereof;
and
(ii) measuring an effect of the candidate compound modulator on the activity of the PGC-3 polypeptide.
19. A method as claimed in claim 18 and wherein the candidate compound modulator is contacted with a host-cell which expresses an PGC-3 polypeptide.
20. A PGC-3 modulator identified according to a method as claimed in claim 18 or claim 19 .
21. A pharmaceutical composition which comprises a PGC-3 modulator as claimed in claim 20 , or a pharmaceutically acceptable salt thereof, in association with a pharmaceutically acceptable diluent or carrier.
22. A method of treating a metabolic disease or medical condition mediated alone or in part by PGC-3, which comprises administering to a warm-blooded animal requiring such treatment an effective amount of an PGC-3 modulator as claimed in claim 20 or claim 21 .
23. The use of an PGC-3 modulator as claimed in claim 20 in the production of a medicament for use in the treatment of a metabolic disease.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US11/978,832 US20090142351A1 (en) | 2000-09-15 | 2007-10-29 | Human and rat PGC-3, PPAR-gamma coactivations and splice variants thereof |
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| GBGB0022670.4A GB0022670D0 (en) | 2000-09-15 | 2000-09-15 | Molecules |
| GB0022670.4 | 2000-09-15 | ||
| PCT/GB2001/004074 WO2002022818A1 (en) | 2000-09-15 | 2001-09-12 | Human and rat pgc-3, ppar-gamma coactivations and splice variants thereof |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US11/978,832 Division US20090142351A1 (en) | 2000-09-15 | 2007-10-29 | Human and rat PGC-3, PPAR-gamma coactivations and splice variants thereof |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20040077536A1 true US20040077536A1 (en) | 2004-04-22 |
| US7306922B2 US7306922B2 (en) | 2007-12-11 |
Family
ID=9899529
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US10/380,492 Expired - Fee Related US7306922B2 (en) | 2000-09-15 | 2001-09-12 | Human and rat PGC-3, PPAR-gamma coactivations and splice variants thereof |
| US11/978,832 Abandoned US20090142351A1 (en) | 2000-09-15 | 2007-10-29 | Human and rat PGC-3, PPAR-gamma coactivations and splice variants thereof |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US11/978,832 Abandoned US20090142351A1 (en) | 2000-09-15 | 2007-10-29 | Human and rat PGC-3, PPAR-gamma coactivations and splice variants thereof |
Country Status (6)
| Country | Link |
|---|---|
| US (2) | US7306922B2 (en) |
| EP (1) | EP1320601A1 (en) |
| JP (1) | JP2004508826A (en) |
| AU (1) | AU2001287855A1 (en) |
| GB (1) | GB0022670D0 (en) |
| WO (1) | WO2002022818A1 (en) |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2002024728A2 (en) * | 2000-09-22 | 2002-03-28 | Lion Bioscience Ag | Mammalian nuclear receptor cofactor cf6 and methods of use |
| JP2005514921A (en) | 2001-11-09 | 2005-05-26 | ダナ−ファーバー キャンサー インスティチュート インク | PGC-1β, novel PGC-1 homologue and use thereof |
| JPWO2006098314A1 (en) * | 2005-03-15 | 2008-08-21 | アステラス製薬株式会社 | Novel sugar uptake activator and screening method thereof |
| US8151082B2 (en) * | 2007-12-06 | 2012-04-03 | Fusion-Io, Inc. | Apparatus, system, and method for converting a storage request into an append data storage command |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20020048763A1 (en) * | 2000-02-04 | 2002-04-25 | Penn Sharron Gaynor | Human genome-derived single exon nucleic acid probes useful for gene expression analysis |
| US20040044181A1 (en) * | 2001-08-31 | 2004-03-04 | Tang Y. Tom | Novel nucleic acids and polypeptides |
Family Cites Families (32)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CA2240108A1 (en) | 1995-12-14 | 1997-06-19 | Peter Lin | Antagonists of gonadotropin releasing hormone |
| US6200957B1 (en) | 1995-12-14 | 2001-03-13 | Merck & Co., Inc. | Antagonists of gonadotropin releasing hormone |
| AU704937B2 (en) | 1995-12-14 | 1999-05-06 | Merck & Co., Inc. | Antagonists of gonadotropin releasing hormone |
| AU709090B2 (en) | 1995-12-14 | 1999-08-19 | Merck & Co., Inc. | Antagonists of gonadotropin releasing hormone |
| EA000828B1 (en) | 1995-12-14 | 2000-04-24 | Мерк Энд Ко., Инк. | Antagonists of gonadotropin releasing hormone |
| US6166192A (en) * | 1997-05-30 | 2000-12-26 | Dana-Farber Cancer Institute | PGC-1, a novel brown fat PPARγ coactivator |
| US6426411B1 (en) * | 1997-05-30 | 2002-07-30 | Dana-Farber Cancer Institute | PGC-1, a novel brown fat pparγ coactivator |
| CA2291647A1 (en) | 1997-06-05 | 1998-12-10 | Merck & Co., Inc. | Antagonists of gonadotropin releasing hormone |
| CA2292605A1 (en) | 1997-06-05 | 1998-12-10 | Merck & Co., Inc. | Antagonists of gonadotropin releasing hormone |
| JP2002503252A (en) | 1997-06-05 | 2002-01-29 | メルク エンド カンパニー インコーポレーテッド | Gonadotropin-releasing hormone antagonist |
| CA2292880A1 (en) | 1997-06-05 | 1998-12-10 | Merck & Co., Inc. | Antagonists of gonadotropin releasing hormone |
| EP0986385A4 (en) | 1997-06-05 | 2001-05-16 | Merck & Co Inc | GONADOLIBERINE ANTAGONISTS |
| US6156772A (en) | 1997-06-05 | 2000-12-05 | Merck & Co., Inc. | Antagonists of gonadotropin releasing hormone |
| AU1124399A (en) | 1997-10-28 | 1999-05-17 | Merck & Co., Inc. | Antagonists of gonadotropin releasing hormone |
| WO1999021557A1 (en) | 1997-10-28 | 1999-05-06 | Merck & Co., Inc. | Antagonists of gonadotropin releasing hormone |
| JP2002503661A (en) | 1998-02-11 | 2002-02-05 | メルク エンド カムパニー インコーポレーテッド | Gonadotropin-releasing hormone antagonist |
| JP2002503660A (en) | 1998-02-11 | 2002-02-05 | メルク エンド カムパニー インコーポレーテッド | Gonadotropin-releasing hormone antagonist |
| AU3118399A (en) | 1998-04-02 | 1999-10-25 | Merck & Co., Inc. | Antagonists of gonadotropin releasing hormone |
| WO1999051595A1 (en) | 1998-04-02 | 1999-10-14 | Merck & Co., Inc. | Antagonists of gonadotropin releasing hormone |
| JP2002510631A (en) | 1998-04-02 | 2002-04-09 | メルク エンド カムパニー インコーポレーテッド | Gonadotropin-releasing hormone antagonist |
| JP2002510630A (en) | 1998-04-02 | 2002-04-09 | メルク エンド カムパニー インコーポレーテッド | Gonadotropin-releasing hormone antagonist |
| AU3117899A (en) | 1998-04-02 | 1999-10-25 | Merck & Co., Inc. | Antagonists of gonadotropin releasing hormone |
| CA2326140A1 (en) | 1998-04-02 | 1999-10-14 | Merck & Co., Inc. | Antagonists of gonadotropin releasing hormone |
| WO2000053185A1 (en) | 1999-03-10 | 2000-09-14 | Merck & Co., Inc. | 6-azaindole compounds as antagonists of gonadotropin releasing hormone |
| EP1161431A4 (en) | 1999-03-10 | 2002-04-24 | Merck & Co Inc | 6-AZAINDOLE COMPOUNDS FOR USE AS GONADOTROPIN RELEASING HORMONE ANTAGONISTS |
| JP2002538206A (en) | 1999-03-10 | 2002-11-12 | メルク エンド カムパニー インコーポレーテッド | 6-azaindole compounds as gonadotropin-releasing hormone antagonists |
| JP2002538204A (en) | 1999-03-10 | 2002-11-12 | メルク エンド カムパニー インコーポレーテッド | 6-azaindole compounds as gonadotropin-releasing hormone antagonists |
| WO2000053180A1 (en) | 1999-03-10 | 2000-09-14 | Merck & Co., Inc. | 6-azaindole compounds as antagonists of gonadotropin releasing hormone |
| JP2003526618A (en) | 1999-03-10 | 2003-09-09 | メルク エンド カムパニー インコーポレーテッド | 6-azaindole compounds as gonadotropin-releasing hormone antagonists |
| GB2373500B (en) * | 2000-02-04 | 2004-12-15 | Aeomica Inc | Methods and apparatus for predicting, confirming, and displaying functional information derived from genomic sequence |
| AU2001292561A1 (en) | 2000-09-01 | 2002-03-13 | Hyseq, Inc. | Nucleic acids and polypeptides |
| SE0100566D0 (en) | 2001-02-20 | 2001-02-20 | Astrazeneca Ab | Compounds |
-
2000
- 2000-09-15 GB GBGB0022670.4A patent/GB0022670D0/en not_active Ceased
-
2001
- 2001-09-12 EP EP01967480A patent/EP1320601A1/en not_active Withdrawn
- 2001-09-12 JP JP2002527260A patent/JP2004508826A/en not_active Withdrawn
- 2001-09-12 US US10/380,492 patent/US7306922B2/en not_active Expired - Fee Related
- 2001-09-12 AU AU2001287855A patent/AU2001287855A1/en not_active Abandoned
- 2001-09-12 WO PCT/GB2001/004074 patent/WO2002022818A1/en not_active Ceased
-
2007
- 2007-10-29 US US11/978,832 patent/US20090142351A1/en not_active Abandoned
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20020048763A1 (en) * | 2000-02-04 | 2002-04-25 | Penn Sharron Gaynor | Human genome-derived single exon nucleic acid probes useful for gene expression analysis |
| US20040044181A1 (en) * | 2001-08-31 | 2004-03-04 | Tang Y. Tom | Novel nucleic acids and polypeptides |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2002022818A1 (en) | 2002-03-21 |
| AU2001287855A1 (en) | 2002-03-26 |
| EP1320601A1 (en) | 2003-06-25 |
| US7306922B2 (en) | 2007-12-11 |
| US20090142351A1 (en) | 2009-06-04 |
| JP2004508826A (en) | 2004-03-25 |
| GB0022670D0 (en) | 2000-11-01 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US6239267B1 (en) | VANILREP1 polynucleotides and VANILREP1 polypeptides | |
| EP1811035A1 (en) | Aspartic proteinase 2 (ASP2) | |
| SE523013C2 (en) | Nucleic Acid (IIP-10) Encoding an IGF-1 Receptor Binding Polypeptide and Using the Nucleic Acid or Polypeptide in a Method for Detecting the Proliferative Potential of a Cancer Cell or to Identify Substances That Modulate the Interaction between IIP-10 and IGF-1R | |
| JP2003527067A (en) | ACRP30R1L, a homologue of ACRP30 (30 KD adipocyte complement-related protein) | |
| JP2002512781A (en) | G protein-coupled 7TM receptor (AXOR-1) | |
| JP2002511233A (en) | TREK1-like two-hole potassium channel | |
| US20090142351A1 (en) | Human and rat PGC-3, PPAR-gamma coactivations and splice variants thereof | |
| US6368823B1 (en) | Kv potassium channel polypeptides and polynucleotides | |
| EP0889127A1 (en) | Serine/threonine protein kinase (H-SGK2) | |
| EP0908515A2 (en) | Pancreatic polypeptide | |
| JPH10201482A (en) | Calcitonin gene-related peptide receptor component factor (houdc44) | |
| US6359116B1 (en) | Compounds | |
| JPH1156375A (en) | New splicing mutant of g protein conjugated receptor derived by epstein-barr virus | |
| US20010010929A1 (en) | Polynucleotides and polypeptides belonging to the uncoupling proteins family | |
| JPH10337189A (en) | New compound | |
| JP2002300892A (en) | Nerve cell-adhered splicing mutant | |
| JPH11127868A (en) | 7 tm receptor hlwar 77 | |
| US20010049121A1 (en) | Cytokine family member, 2-19 | |
| US6100062A (en) | Expression system for HSCLOCK | |
| EP0913472A2 (en) | Human LIG-1 Homolog (HLIG-1) | |
| EP0902837A1 (en) | Tailless nuclear hormone receptor (tlx receptor) | |
| US6316219B1 (en) | Compounds | |
| US20040029141A1 (en) | Human and mouse e2-protein nucleic acids coding therefor and uses thereof | |
| US20040096842A1 (en) | Molecules involved in the regulation of insulin resistance syndrome (irs) | |
| EP0877030A2 (en) | EPO primary response gene 1,(EPRG1) |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: ASTRAZENECA AB, SWEDEN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:HART, KEVIN ANTHONY;MONTAGUE, CARL THOMAS;VIDAL-PUIG, ANTONIO;REEL/FRAME:014364/0677;SIGNING DATES FROM 20030122 TO 20030225 |
|
| REMI | Maintenance fee reminder mailed | ||
| LAPS | Lapse for failure to pay maintenance fees | ||
| STCH | Information on status: patent discontinuation |
Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
|
| FP | Lapsed due to failure to pay maintenance fee |
Effective date: 20111211 |