MXPA98008627A - Transgenic plants expressing geminivi genes - Google Patents
Transgenic plants expressing geminivi genesInfo
- Publication number
- MXPA98008627A MXPA98008627A MXPA/A/1998/008627A MX9808627A MXPA98008627A MX PA98008627 A MXPA98008627 A MX PA98008627A MX 9808627 A MX9808627 A MX 9808627A MX PA98008627 A MXPA98008627 A MX PA98008627A
- Authority
- MX
- Mexico
- Prior art keywords
- leu
- ser
- pro
- lys
- val
- Prior art date
Links
- 230000009261 transgenic effect Effects 0.000 title claims abstract description 56
- 108090000623 proteins and genes Proteins 0.000 title claims description 46
- 208000015181 infectious disease Diseases 0.000 claims abstract description 37
- 230000010076 replication Effects 0.000 claims abstract description 36
- 241000702463 Geminiviridae Species 0.000 claims description 82
- 230000035772 mutation Effects 0.000 claims description 44
- 102000004169 proteins and genes Human genes 0.000 claims description 31
- 241000702308 Tomato yellow leaf curl virus Species 0.000 claims description 15
- 238000000034 method Methods 0.000 claims description 15
- 241000702286 Bean golden mosaic virus Species 0.000 claims description 7
- 230000027455 binding Effects 0.000 claims description 6
- 208000036142 Viral infection Diseases 0.000 claims description 2
- 230000009385 viral infection Effects 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 2
- 239000013611 chromosomal DNA Substances 0.000 claims 2
- 241000702305 Tomato mottle virus Species 0.000 claims 1
- 230000002452 interceptive effect Effects 0.000 claims 1
- 238000004519 manufacturing process Methods 0.000 abstract description 11
- 239000012634 fragment Substances 0.000 description 146
- 108020004414 DNA Proteins 0.000 description 82
- 239000013612 plasmid Substances 0.000 description 82
- 241000196324 Embryophyta Species 0.000 description 75
- 108020004707 nucleic acids Proteins 0.000 description 69
- 102000039446 nucleic acids Human genes 0.000 description 69
- 150000007523 nucleic acids Chemical class 0.000 description 69
- 240000003768 Solanum lycopersicum Species 0.000 description 40
- 230000014509 gene expression Effects 0.000 description 40
- 230000000692 anti-sense effect Effects 0.000 description 38
- 150000001413 amino acids Chemical group 0.000 description 36
- 235000001014 amino acid Nutrition 0.000 description 34
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 32
- 239000013598 vector Substances 0.000 description 32
- 235000018102 proteins Nutrition 0.000 description 30
- 102100035623 ATP-citrate synthase Human genes 0.000 description 28
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 26
- 241000589158 Agrobacterium Species 0.000 description 25
- 108091034117 Oligonucleotide Proteins 0.000 description 25
- 210000004027 cell Anatomy 0.000 description 25
- 230000002458 infectious effect Effects 0.000 description 24
- 108010034529 leucyl-lysine Proteins 0.000 description 24
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 22
- 241000700605 Viruses Species 0.000 description 22
- 230000015572 biosynthetic process Effects 0.000 description 22
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 22
- 238000003786 synthesis reaction Methods 0.000 description 21
- LWTDZKXXJRRKDG-KXBFYZLASA-N (-)-phaseollin Chemical compound C1OC2=CC(O)=CC=C2[C@H]2[C@@H]1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-KXBFYZLASA-N 0.000 description 20
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 20
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 19
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 19
- 239000013604 expression vector Substances 0.000 description 18
- 238000011081 inoculation Methods 0.000 description 18
- 108010034507 methionyltryptophan Proteins 0.000 description 18
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 17
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 17
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 17
- 108010060035 arginylproline Proteins 0.000 description 17
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 17
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 16
- 108010077245 asparaginyl-proline Proteins 0.000 description 16
- 108010047857 aspartylglycine Proteins 0.000 description 16
- 238000010276 construction Methods 0.000 description 16
- 108010009298 lysylglutamic acid Proteins 0.000 description 16
- UDPGUMQDCGORJQ-UHFFFAOYSA-N (2-chloroethyl)phosphonic acid Chemical compound OP(O)(=O)CCCl UDPGUMQDCGORJQ-UHFFFAOYSA-N 0.000 description 15
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 15
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 15
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 15
- 238000004690 coupled electron pair approximation Methods 0.000 description 15
- 238000013518 transcription Methods 0.000 description 15
- 230000035897 transcription Effects 0.000 description 15
- PDLQNLSEJXOQNQ-IHPCNDPISA-N His-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CN=CN1 PDLQNLSEJXOQNQ-IHPCNDPISA-N 0.000 description 14
- QFSYGUMEANRNJE-DCAQKATOSA-N Lys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N QFSYGUMEANRNJE-DCAQKATOSA-N 0.000 description 14
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 14
- 108010073969 valyllysine Proteins 0.000 description 14
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 13
- VUUFXXGKMPLKNH-BZSNNMDCSA-N His-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N VUUFXXGKMPLKNH-BZSNNMDCSA-N 0.000 description 13
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 13
- WAAZECNCPVGPIV-RHYQMDGZSA-N Lys-Thr-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O WAAZECNCPVGPIV-RHYQMDGZSA-N 0.000 description 13
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 13
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 13
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 13
- 108010020688 glycylhistidine Proteins 0.000 description 13
- 239000003550 marker Substances 0.000 description 13
- 230000003612 virological effect Effects 0.000 description 13
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 12
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 12
- DWBZEJHQQIURML-IMJSIDKUSA-N Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O DWBZEJHQQIURML-IMJSIDKUSA-N 0.000 description 12
- JEFZIKRIDLHOIF-BYPYZUCNSA-N Gln-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(O)=O JEFZIKRIDLHOIF-BYPYZUCNSA-N 0.000 description 12
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 12
- MYTOTTSMVMWVJN-STQMWFEESA-N Lys-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MYTOTTSMVMWVJN-STQMWFEESA-N 0.000 description 12
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 12
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 12
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 12
- 230000029087 digestion Effects 0.000 description 12
- 108010078144 glutaminyl-glycine Proteins 0.000 description 12
- 108010054155 lysyllysine Proteins 0.000 description 12
- 208000024891 symptom Diseases 0.000 description 12
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 11
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 11
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 11
- 108010092526 GKPV peptide Proteins 0.000 description 11
- DKEZVKFLETVJFY-CIUDSAMLSA-N Leu-Cys-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DKEZVKFLETVJFY-CIUDSAMLSA-N 0.000 description 11
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 11
- YSZNURNVYFUEHC-BQBZGAKWSA-N Lys-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YSZNURNVYFUEHC-BQBZGAKWSA-N 0.000 description 11
- 244000046052 Phaseolus vulgaris Species 0.000 description 11
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 11
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 11
- FDINZVJXLPILKV-DCAQKATOSA-N Pro-His-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O FDINZVJXLPILKV-DCAQKATOSA-N 0.000 description 11
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 11
- IJYZHIOOBGIINM-WDSKDSINSA-N Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N IJYZHIOOBGIINM-WDSKDSINSA-N 0.000 description 10
- FRYULLIZUDQONW-IMJSIDKUSA-N Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FRYULLIZUDQONW-IMJSIDKUSA-N 0.000 description 10
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 10
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 description 10
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 10
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 10
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 10
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 10
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 10
- 101710163504 Phaseolin Proteins 0.000 description 10
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 10
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 10
- AFWBWPCXSWUCLB-WDSKDSINSA-N Pro-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 AFWBWPCXSWUCLB-WDSKDSINSA-N 0.000 description 10
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 10
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 10
- 108700019146 Transgenes Proteins 0.000 description 10
- 108010092854 aspartyllysine Proteins 0.000 description 10
- LWTDZKXXJRRKDG-UHFFFAOYSA-N phaseollin Natural products C1OC2=CC(O)=CC=C2C2C1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-UHFFFAOYSA-N 0.000 description 10
- 108010031719 prolyl-serine Proteins 0.000 description 10
- 210000001519 tissue Anatomy 0.000 description 10
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 10
- LQJAALCCPOTJGB-YUMQZZPRSA-N Arg-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O LQJAALCCPOTJGB-YUMQZZPRSA-N 0.000 description 9
- ULZOQOKFYMXHPZ-AQZXSJQPSA-N Asn-Trp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ULZOQOKFYMXHPZ-AQZXSJQPSA-N 0.000 description 9
- CZECQDPEMSVPDH-MNXVOIDGSA-N Asp-Leu-Val-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CZECQDPEMSVPDH-MNXVOIDGSA-N 0.000 description 9
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 9
- MLTRLIITQPXHBJ-BQBZGAKWSA-N Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O MLTRLIITQPXHBJ-BQBZGAKWSA-N 0.000 description 9
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 9
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 9
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 9
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 9
- XFFIGWGYMUFCCQ-ULQDDVLXSA-N Pro-His-Tyr Chemical compound C1=CC(O)=CC=C1C[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)[C@H]1[NH2+]CCC1)CC1=CN=CN1 XFFIGWGYMUFCCQ-ULQDDVLXSA-N 0.000 description 9
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 9
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 9
- GVUVRRPYYDHHGK-VQVTYTSYSA-N Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 GVUVRRPYYDHHGK-VQVTYTSYSA-N 0.000 description 9
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 9
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 9
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 9
- 108020005202 Viral DNA Proteins 0.000 description 9
- 230000000694 effects Effects 0.000 description 9
- 108010085325 histidylproline Proteins 0.000 description 9
- 108010061238 threonyl-glycine Proteins 0.000 description 9
- 238000012546 transfer Methods 0.000 description 9
- 230000009466 transformation Effects 0.000 description 9
- 108700004896 tripeptide FEG Proteins 0.000 description 9
- YWENWUYXQUWRHQ-LPEHRKFASA-N Arg-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O YWENWUYXQUWRHQ-LPEHRKFASA-N 0.000 description 8
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 8
- KWBQPGIYEZKDEG-FSPLSTOPSA-N Asn-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O KWBQPGIYEZKDEG-FSPLSTOPSA-N 0.000 description 8
- RYEWQKQXRJCHIO-SRVKXCTJSA-N Asp-Asn-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RYEWQKQXRJCHIO-SRVKXCTJSA-N 0.000 description 8
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 8
- 108020004705 Codon Proteins 0.000 description 8
- RGTVXXNMOGHRAY-WDSKDSINSA-N Cys-Arg Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RGTVXXNMOGHRAY-WDSKDSINSA-N 0.000 description 8
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 8
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 8
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 8
- VOKCBYNCZVSILJ-KKUMJFAQSA-N His-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O VOKCBYNCZVSILJ-KKUMJFAQSA-N 0.000 description 8
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 8
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 8
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 8
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 8
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 8
- PHKBGZKVOJCIMZ-SRVKXCTJSA-N Met-Pro-Arg Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PHKBGZKVOJCIMZ-SRVKXCTJSA-N 0.000 description 8
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 8
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 8
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 8
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 8
- AWYXDHQQFPZJNE-QEJZJMRPSA-N Trp-Gln-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N AWYXDHQQFPZJNE-QEJZJMRPSA-N 0.000 description 8
- PALLCTDPFINNMM-JQHSSLGASA-N Trp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N PALLCTDPFINNMM-JQHSSLGASA-N 0.000 description 8
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 8
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 8
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 230000000295 complement effect Effects 0.000 description 8
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 8
- 108010010147 glycylglutamine Proteins 0.000 description 8
- 108010025306 histidylleucine Proteins 0.000 description 8
- 231100000518 lethal Toxicity 0.000 description 8
- 230000001665 lethal effect Effects 0.000 description 8
- 108010077112 prolyl-proline Proteins 0.000 description 8
- 238000013519 translation Methods 0.000 description 8
- 108010051110 tyrosyl-lysine Proteins 0.000 description 8
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 7
- XNSKSTRGQIPTSE-ACZMJKKPSA-N Arg-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XNSKSTRGQIPTSE-ACZMJKKPSA-N 0.000 description 7
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 7
- JIVJQYNNAYFXDG-LKXGYXEUSA-N Cys-Thr-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JIVJQYNNAYFXDG-LKXGYXEUSA-N 0.000 description 7
- GFLNKSQHOBOMNM-AVGNSLFASA-N Gln-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GFLNKSQHOBOMNM-AVGNSLFASA-N 0.000 description 7
- VHLZDSUANXBJHW-QWRGUYRKSA-N Gln-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHLZDSUANXBJHW-QWRGUYRKSA-N 0.000 description 7
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 7
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 7
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 7
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 7
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 7
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 7
- 241000207746 Nicotiana benthamiana Species 0.000 description 7
- 238000000636 Northern blotting Methods 0.000 description 7
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 7
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 7
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 7
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 7
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 7
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 7
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 7
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 7
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 7
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 7
- RKISDJMICOREEL-QRTARXTBSA-N Trp-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RKISDJMICOREEL-QRTARXTBSA-N 0.000 description 7
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 description 7
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 7
- 108010092114 histidylphenylalanine Proteins 0.000 description 7
- 239000002609 medium Substances 0.000 description 7
- 231100000350 mutagenesis Toxicity 0.000 description 7
- 238000002703 mutagenesis Methods 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 6
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 6
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 6
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 6
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 6
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 6
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 6
- YZQCXOFQZKCETR-UWVGGRQHSA-N Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YZQCXOFQZKCETR-UWVGGRQHSA-N 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 6
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 6
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 6
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 6
- PAZQYODKOZHXGA-SRVKXCTJSA-N Glu-Pro-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O PAZQYODKOZHXGA-SRVKXCTJSA-N 0.000 description 6
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 6
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 6
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 6
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 6
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 6
- 108010047562 NGR peptide Proteins 0.000 description 6
- 108700026244 Open Reading Frames Proteins 0.000 description 6
- 238000012408 PCR amplification Methods 0.000 description 6
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 6
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 6
- WECYCNFPGZLOOU-FXQIFTODSA-N Pro-Asn-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O WECYCNFPGZLOOU-FXQIFTODSA-N 0.000 description 6
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 6
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 6
- RVQDZELMXZRSSI-IUCAKERBSA-N Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 RVQDZELMXZRSSI-IUCAKERBSA-N 0.000 description 6
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 6
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 6
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 6
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 6
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 6
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 6
- 229930006000 Sucrose Natural products 0.000 description 6
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 6
- GIBPOCDKBPNRJB-HSHDSVGOSA-N Thr-Met-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O GIBPOCDKBPNRJB-HSHDSVGOSA-N 0.000 description 6
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 6
- 239000000539 dimer Substances 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 6
- 238000009396 hybridization Methods 0.000 description 6
- 239000012528 membrane Substances 0.000 description 6
- 239000002773 nucleotide Substances 0.000 description 6
- 125000003729 nucleotide group Chemical group 0.000 description 6
- 108010090894 prolylleucine Proteins 0.000 description 6
- 239000005720 sucrose Substances 0.000 description 6
- 238000011282 treatment Methods 0.000 description 6
- 229920001817 Agar Polymers 0.000 description 5
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 5
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 5
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 5
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 5
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 5
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 5
- ZFADFBPRMSBPOT-KKUMJFAQSA-N Gln-Arg-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZFADFBPRMSBPOT-KKUMJFAQSA-N 0.000 description 5
- LLVXTGUTDYMJLY-GUBZILKMSA-N Gln-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LLVXTGUTDYMJLY-GUBZILKMSA-N 0.000 description 5
- CLSDNFWKGFJIBZ-YUMQZZPRSA-N Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(N)=O CLSDNFWKGFJIBZ-YUMQZZPRSA-N 0.000 description 5
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 5
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 5
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 5
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 5
- 241000258937 Hemiptera Species 0.000 description 5
- WSDOHRLQDGAOGU-BQBZGAKWSA-N His-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 WSDOHRLQDGAOGU-BQBZGAKWSA-N 0.000 description 5
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 5
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 5
- ZFDKSLBEWYCOCS-BZSNNMDCSA-N His-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CC=CC=C1 ZFDKSLBEWYCOCS-BZSNNMDCSA-N 0.000 description 5
- 206010020649 Hyperkeratosis Diseases 0.000 description 5
- 108010065920 Insulin Lispro Proteins 0.000 description 5
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 5
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 5
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 5
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 5
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 5
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 5
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 5
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 5
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 5
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 5
- AIXUQKMMBQJZCU-IUCAKERBSA-N Lys-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O AIXUQKMMBQJZCU-IUCAKERBSA-N 0.000 description 5
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 5
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 5
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 5
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 5
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 5
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 5
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 5
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 5
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 5
- HYLXOQURIOCKIH-VQVTYTSYSA-N Thr-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N HYLXOQURIOCKIH-VQVTYTSYSA-N 0.000 description 5
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 5
- TYYLDKGBCJGJGW-WMZOPIPTSA-N Trp-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 TYYLDKGBCJGJGW-WMZOPIPTSA-N 0.000 description 5
- AVIQBBOOTZENLH-KKUMJFAQSA-N Tyr-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AVIQBBOOTZENLH-KKUMJFAQSA-N 0.000 description 5
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 5
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 5
- 239000008272 agar Substances 0.000 description 5
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 5
- 108010050848 glycylleucine Proteins 0.000 description 5
- 108010003700 lysyl aspartic acid Proteins 0.000 description 5
- 230000001404 mediated effect Effects 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 108010058731 nopaline synthase Proteins 0.000 description 5
- 108010018625 phenylalanylarginine Proteins 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- 108010044292 tryptophyltyrosine Proteins 0.000 description 5
- 239000011782 vitamin Substances 0.000 description 5
- 235000013343 vitamin Nutrition 0.000 description 5
- 229940088594 vitamin Drugs 0.000 description 5
- 229930003231 vitamin Natural products 0.000 description 5
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 4
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 4
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 4
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 4
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 4
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 4
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 4
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 4
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 4
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 4
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 4
- 101100315624 Caenorhabditis elegans tyr-1 gene Proteins 0.000 description 4
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 4
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 4
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 4
- MFLMFRZBAJSGHK-ACZMJKKPSA-N Gln-Cys-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N MFLMFRZBAJSGHK-ACZMJKKPSA-N 0.000 description 4
- COYGBRTZEVWZBW-XKBZYTNZSA-N Gln-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O COYGBRTZEVWZBW-XKBZYTNZSA-N 0.000 description 4
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 4
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 4
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 4
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 4
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 4
- LSPKYLAFTPBWIL-BYPYZUCNSA-N Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(O)=O LSPKYLAFTPBWIL-BYPYZUCNSA-N 0.000 description 4
- YBTCBQBIJKGSJP-BQBZGAKWSA-N Glu-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O YBTCBQBIJKGSJP-BQBZGAKWSA-N 0.000 description 4
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 4
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 4
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 4
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 4
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 4
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 4
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 4
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 4
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 4
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 4
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 4
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 4
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 4
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 4
- JPNRPAJITHRXRH-BQBZGAKWSA-N Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O JPNRPAJITHRXRH-BQBZGAKWSA-N 0.000 description 4
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 4
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 4
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 4
- 101150085302 MAS2 gene Proteins 0.000 description 4
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 4
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 4
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 4
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 4
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 4
- XNJVJEHDZPDPQL-BZSNNMDCSA-N Pro-Trp-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H]1CCCN1)C(O)=O XNJVJEHDZPDPQL-BZSNNMDCSA-N 0.000 description 4
- HOJUNFDJDAPVBI-BZSNNMDCSA-N Pro-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 HOJUNFDJDAPVBI-BZSNNMDCSA-N 0.000 description 4
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 4
- 101100401318 Schizosaccharomyces pombe (strain 972 / ATCC 24843) meu1 gene Proteins 0.000 description 4
- SSJMZMUVNKEENT-IMJSIDKUSA-N Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CO SSJMZMUVNKEENT-IMJSIDKUSA-N 0.000 description 4
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 4
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 4
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 4
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 4
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 4
- 101100020617 Solanum lycopersicum LAT52 gene Proteins 0.000 description 4
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 4
- SVGAWGVHFIYAEE-JSGCOSHPSA-N Trp-Gly-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 SVGAWGVHFIYAEE-JSGCOSHPSA-N 0.000 description 4
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 4
- 102100031590 Uncharacterized protein encoded by LINC01587 Human genes 0.000 description 4
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 4
- WPSXZFTVLIAPCN-WDSKDSINSA-N Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(O)=O WPSXZFTVLIAPCN-WDSKDSINSA-N 0.000 description 4
- UPJONISHZRADBH-XPUUQOCRSA-N Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UPJONISHZRADBH-XPUUQOCRSA-N 0.000 description 4
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 4
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 4
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 4
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 4
- OJOBTAOGJIWAGB-UHFFFAOYSA-N acetosyringone Chemical compound COC1=CC(C(C)=O)=CC(OC)=C1O OJOBTAOGJIWAGB-UHFFFAOYSA-N 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 239000000499 gel Substances 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 108010077515 glycylproline Proteins 0.000 description 4
- 231100000219 mutagenic Toxicity 0.000 description 4
- 230000003505 mutagenic effect Effects 0.000 description 4
- 239000000523 sample Substances 0.000 description 4
- 230000001052 transient effect Effects 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- AUXMWYRZQPIXCC-KNIFDHDWSA-N (2s)-2-amino-4-methylpentanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O AUXMWYRZQPIXCC-KNIFDHDWSA-N 0.000 description 3
- LJIRBXZDQGQUOO-KVTDHHQDSA-N 6-amino-3-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-1,4-dihydro-1,3,5-triazin-2-one Chemical compound C1NC(N)=NC(=O)N1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 LJIRBXZDQGQUOO-KVTDHHQDSA-N 0.000 description 3
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 3
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 3
- ASQYTJJWAMDISW-BPUTZDHNSA-N Arg-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N ASQYTJJWAMDISW-BPUTZDHNSA-N 0.000 description 3
- BNODVYXZAAXSHW-IUCAKERBSA-N Arg-His Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 BNODVYXZAAXSHW-IUCAKERBSA-N 0.000 description 3
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 3
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 3
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 3
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 3
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 3
- OMSMPWHEGLNQOD-UWVGGRQHSA-N Asn-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OMSMPWHEGLNQOD-UWVGGRQHSA-N 0.000 description 3
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 3
- JHFNSBBHKSZXKB-VKHMYHEASA-N Asp-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(O)=O JHFNSBBHKSZXKB-VKHMYHEASA-N 0.000 description 3
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 3
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 3
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 3
- JUUMIGUJJRFQQR-KKUMJFAQSA-N Cys-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)O JUUMIGUJJRFQQR-KKUMJFAQSA-N 0.000 description 3
- 108010090461 DFG peptide Proteins 0.000 description 3
- 102000012410 DNA Ligases Human genes 0.000 description 3
- 108010061982 DNA Ligases Proteins 0.000 description 3
- 238000002965 ELISA Methods 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- YNNXQZDEOCYJJL-CIUDSAMLSA-N Gln-Arg-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N YNNXQZDEOCYJJL-CIUDSAMLSA-N 0.000 description 3
- KYFSMWLWHYZRNW-ACZMJKKPSA-N Gln-Asp-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KYFSMWLWHYZRNW-ACZMJKKPSA-N 0.000 description 3
- VNCLJDOTEPPBBD-GUBZILKMSA-N Gln-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VNCLJDOTEPPBBD-GUBZILKMSA-N 0.000 description 3
- WVUZERSNWGUKJY-BPUTZDHNSA-N Gln-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N WVUZERSNWGUKJY-BPUTZDHNSA-N 0.000 description 3
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 3
- WPJDPEOQUIXXOY-AVGNSLFASA-N Gln-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WPJDPEOQUIXXOY-AVGNSLFASA-N 0.000 description 3
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 3
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 3
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 3
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 3
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 3
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 3
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 3
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 3
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 3
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 3
- CTJHHEQNUNIYNN-SRVKXCTJSA-N His-His-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O CTJHHEQNUNIYNN-SRVKXCTJSA-N 0.000 description 3
- WRPDZHJNLYNFFT-GEVIPFJHSA-N His-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O WRPDZHJNLYNFFT-GEVIPFJHSA-N 0.000 description 3
- AHEBIAHEZWQVHB-QTKMDUPCSA-N His-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O AHEBIAHEZWQVHB-QTKMDUPCSA-N 0.000 description 3
- 101000578774 Homo sapiens MAP kinase-activated protein kinase 5 Proteins 0.000 description 3
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 3
- HIZYETOZLYFUFF-BQBZGAKWSA-N Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(O)=O HIZYETOZLYFUFF-BQBZGAKWSA-N 0.000 description 3
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 3
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 3
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 3
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 3
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 3
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 3
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 3
- 102100028396 MAP kinase-activated protein kinase 5 Human genes 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- 108010079364 N-glycylalanine Proteins 0.000 description 3
- 241000208125 Nicotiana Species 0.000 description 3
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 3
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 3
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 3
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 3
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 3
- DXWNFNOPBYAFRM-IHRRRGAJSA-N Phe-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N DXWNFNOPBYAFRM-IHRRRGAJSA-N 0.000 description 3
- JQOHKCDMINQZRV-WDSKDSINSA-N Pro-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 JQOHKCDMINQZRV-WDSKDSINSA-N 0.000 description 3
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 3
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 3
- NFDYGNFETJVMSE-BQBZGAKWSA-N Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CO NFDYGNFETJVMSE-BQBZGAKWSA-N 0.000 description 3
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 3
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 3
- WBAXJMCUFIXCNI-WDSKDSINSA-N Ser-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WBAXJMCUFIXCNI-WDSKDSINSA-N 0.000 description 3
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 3
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 3
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 3
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 3
- 238000002105 Southern blotting Methods 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 3
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 3
- UYKREHOKELZSPB-JTQLQIEISA-N Trp-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(O)=O)=CNC2=C1 UYKREHOKELZSPB-JTQLQIEISA-N 0.000 description 3
- SGQSAIFDESQBRA-IHPCNDPISA-N Trp-Tyr-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SGQSAIFDESQBRA-IHPCNDPISA-N 0.000 description 3
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 3
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 3
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 3
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 3
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 3
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 3
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 3
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 3
- GVRKWABULJAONN-VQVTYTSYSA-N Val-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVRKWABULJAONN-VQVTYTSYSA-N 0.000 description 3
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 3
- STTYIMSDIYISRG-UHFFFAOYSA-N Valyl-Serine Chemical compound CC(C)C(N)C(=O)NC(CO)C(O)=O STTYIMSDIYISRG-UHFFFAOYSA-N 0.000 description 3
- 239000011543 agarose gel Substances 0.000 description 3
- 238000000246 agarose gel electrophoresis Methods 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 3
- 229960003669 carbenicillin Drugs 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 230000002596 correlated effect Effects 0.000 description 3
- 108010016616 cysteinylglycine Proteins 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 238000001962 electrophoresis Methods 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- OOYGSFOGFJDDHP-KMCOLRRFSA-N kanamycin A sulfate Chemical compound OS(O)(=O)=O.O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N OOYGSFOGFJDDHP-KMCOLRRFSA-N 0.000 description 3
- 229960002064 kanamycin sulfate Drugs 0.000 description 3
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 3
- 108010073101 phenylalanylleucine Proteins 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000012250 transgenic expression Methods 0.000 description 3
- 108010038745 tryptophylglycine Proteins 0.000 description 3
- 108010009962 valyltyrosine Proteins 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 2
- FAJIYNONGXEXAI-CQDKDKBSSA-N Ala-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 FAJIYNONGXEXAI-CQDKDKBSSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 2
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 2
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 2
- XUUXCWCKKCZEAW-YFKPBYRVSA-N Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N XUUXCWCKKCZEAW-YFKPBYRVSA-N 0.000 description 2
- HZYFHQOWCFUSOV-IMJSIDKUSA-N Asn-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O HZYFHQOWCFUSOV-IMJSIDKUSA-N 0.000 description 2
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 2
- ODBSSLHUFPJRED-CIUDSAMLSA-N Asn-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ODBSSLHUFPJRED-CIUDSAMLSA-N 0.000 description 2
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 2
- VBKIFHUVGLOJKT-FKZODXBYSA-N Asn-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)N)O VBKIFHUVGLOJKT-FKZODXBYSA-N 0.000 description 2
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 2
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 2
- FKBFDTRILNZGAI-IMJSIDKUSA-N Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(O)=O FKBFDTRILNZGAI-IMJSIDKUSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- OAMLVOVXNKILLQ-BQBZGAKWSA-N Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O OAMLVOVXNKILLQ-BQBZGAKWSA-N 0.000 description 2
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 2
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 2
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 2
- NTQDELBZOMWXRS-IWGUZYHVSA-N Asp-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O NTQDELBZOMWXRS-IWGUZYHVSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 2
- 241000252254 Catostomidae Species 0.000 description 2
- 241000701515 Commelina yellow mottle virus Species 0.000 description 2
- CLDCTNHPILWQCW-CIUDSAMLSA-N Cys-Arg-Glu Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N CLDCTNHPILWQCW-CIUDSAMLSA-N 0.000 description 2
- IMXSCCDUAFEIOE-UHFFFAOYSA-N D-Octopin Natural products OC(=O)C(C)NC(C(O)=O)CCCN=C(N)N IMXSCCDUAFEIOE-UHFFFAOYSA-N 0.000 description 2
- IMXSCCDUAFEIOE-RITPCOANSA-N D-octopine Chemical compound [O-]C(=O)[C@@H](C)[NH2+][C@H](C([O-])=O)CCCNC(N)=[NH2+] IMXSCCDUAFEIOE-RITPCOANSA-N 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 108020003215 DNA Probes Proteins 0.000 description 2
- 239000003298 DNA probe Substances 0.000 description 2
- 241000450599 DNA viruses Species 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 238000012286 ELISA Assay Methods 0.000 description 2
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 2
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 2
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 2
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 2
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 2
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 2
- LLEUXCDZPQOJMY-AAEUAGOBSA-N Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 LLEUXCDZPQOJMY-AAEUAGOBSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 2
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 2
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 2
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 2
- 101000831616 Homo sapiens Protachykinin-1 Proteins 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- XWOBNBRUDDUEEY-UWVGGRQHSA-N Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XWOBNBRUDDUEEY-UWVGGRQHSA-N 0.000 description 2
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 2
- CTBMEDOQJFGNMI-IHPCNDPISA-N Lys-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CCCCN)N CTBMEDOQJFGNMI-IHPCNDPISA-N 0.000 description 2
- XATKLFSXFINPSB-JYJNAYRXSA-N Lys-Tyr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O XATKLFSXFINPSB-JYJNAYRXSA-N 0.000 description 2
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 2
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 2
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 2
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 2
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 2
- UEKYKRQIAQHOOZ-KBPBESRZSA-N Pro-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)[O-])C(=O)[C@@H]1CCC[NH2+]1 UEKYKRQIAQHOOZ-KBPBESRZSA-N 0.000 description 2
- 102100024304 Protachykinin-1 Human genes 0.000 description 2
- 108010079005 RDV peptide Proteins 0.000 description 2
- 240000005578 Rivina humilis Species 0.000 description 2
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 2
- LAFKUZYWNCHOHT-WHFBIAKZSA-N Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O LAFKUZYWNCHOHT-WHFBIAKZSA-N 0.000 description 2
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 108020005038 Terminator Codon Proteins 0.000 description 2
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 2
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 2
- 241000543828 Tomato yellow leaf curl Sardinia virus Species 0.000 description 2
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 2
- BEWOXKJJMBKRQL-AAEUAGOBSA-N Trp-Gly-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N BEWOXKJJMBKRQL-AAEUAGOBSA-N 0.000 description 2
- YBRHKUNWEYBZGT-WLTAIBSBSA-N Trp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 YBRHKUNWEYBZGT-WLTAIBSBSA-N 0.000 description 2
- STKZKWFOKOCSLW-UMPQAUOISA-N Trp-Thr-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 STKZKWFOKOCSLW-UMPQAUOISA-N 0.000 description 2
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 2
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 2
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 2
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- 108700005077 Viral Genes Proteins 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 235000004279 alanine Nutrition 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 230000000840 anti-viral effect Effects 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010089975 arginyl-glycyl-aspartyl-serine Proteins 0.000 description 2
- 238000000376 autoradiography Methods 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 238000009395 breeding Methods 0.000 description 2
- 230000001488 breeding effect Effects 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 230000000875 corresponding effect Effects 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- 235000005489 dwarf bean Nutrition 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 210000001938 protoplast Anatomy 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 102200111112 rs397514590 Human genes 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 229910000162 sodium phosphate Inorganic materials 0.000 description 2
- 238000009331 sowing Methods 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 230000014621 translational initiation Effects 0.000 description 2
- 230000029812 viral genome replication Effects 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- ZNAIHAPCDVUWRX-DUCUPYJCSA-N (4s,4as,5as,6s,12ar)-7-chloro-4-(dimethylamino)-1,6,10,11,12a-pentahydroxy-6-methyl-3,12-dioxo-4,4a,5,5a-tetrahydrotetracene-2-carboxamide;4-amino-n-(4,6-dimethylpyrimidin-2-yl)benzenesulfonamide;(2s,5r,6r)-3,3-dimethyl-7-oxo-6-[(2-phenylacetyl)amino]-4-t Chemical compound CC1=CC(C)=NC(NS(=O)(=O)C=2C=CC(N)=CC=2)=N1.N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1.C1=CC(Cl)=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(=O)C(C(N)=O)=C(O)[C@@]4(O)C(=O)C3=C(O)C2=C1O ZNAIHAPCDVUWRX-DUCUPYJCSA-N 0.000 description 1
- 239000005631 2,4-Dichlorophenoxyacetic acid Substances 0.000 description 1
- 101150071927 AANAT gene Proteins 0.000 description 1
- 101150018711 AASS gene Proteins 0.000 description 1
- 241000254032 Acrididae Species 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- WPWUFUBLGADILS-WDSKDSINSA-N Ala-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WPWUFUBLGADILS-WDSKDSINSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- SIFXMYAHXJGAFC-WDSKDSINSA-N Arg-Asp Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O SIFXMYAHXJGAFC-WDSKDSINSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- YQNBILXAUIAUCF-CIUDSAMLSA-N Asn-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N YQNBILXAUIAUCF-CIUDSAMLSA-N 0.000 description 1
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 1
- QCWJKJLNCFEVPQ-WHFBIAKZSA-N Asn-Gln Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O QCWJKJLNCFEVPQ-WHFBIAKZSA-N 0.000 description 1
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 1
- GADKFYNESXNRLC-WDSKDSINSA-N Asn-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GADKFYNESXNRLC-WDSKDSINSA-N 0.000 description 1
- GFGUPLIETCNQGF-DCAQKATOSA-N Asn-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O GFGUPLIETCNQGF-DCAQKATOSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 1
- IPPFAOCLQSGHJV-WFBYXXMGSA-N Asn-Trp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O IPPFAOCLQSGHJV-WFBYXXMGSA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- YRZIYQGXTSBRLT-AVGNSLFASA-N Asp-Phe-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YRZIYQGXTSBRLT-AVGNSLFASA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- ZARXTZFGQZBYFO-JQWIXIFHSA-N Asp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(O)=O)=CNC2=C1 ZARXTZFGQZBYFO-JQWIXIFHSA-N 0.000 description 1
- LLRJPYJQNBMOOO-QEJZJMRPSA-N Asp-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N LLRJPYJQNBMOOO-QEJZJMRPSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000702285 Bean dwarf mosaic virus Species 0.000 description 1
- 241000254127 Bemisia tabaci Species 0.000 description 1
- 101100322249 Caenorhabditis elegans lev-1 gene Proteins 0.000 description 1
- 101710132601 Capsid protein Proteins 0.000 description 1
- 101710094648 Coat protein Proteins 0.000 description 1
- 241000233838 Commelina Species 0.000 description 1
- YHDXIZKDOIWPBW-WHFBIAKZSA-N Cys-Gln Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O YHDXIZKDOIWPBW-WHFBIAKZSA-N 0.000 description 1
- ZVNFONSZVUBRAV-CIUDSAMLSA-N Cys-Gln-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)CN=C(N)N ZVNFONSZVUBRAV-CIUDSAMLSA-N 0.000 description 1
- RWAZRMXTVSIVJR-YUMQZZPRSA-N Cys-Gly-His Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CNC=N1)C(O)=O RWAZRMXTVSIVJR-YUMQZZPRSA-N 0.000 description 1
- WYVKPHCYMTWUCW-YUPRTTJUSA-N Cys-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)N)O WYVKPHCYMTWUCW-YUPRTTJUSA-N 0.000 description 1
- LMKYZBGVKHTLTN-NKWVEPMBSA-N D-nopaline Chemical compound NC(=N)NCCC[C@@H](C(O)=O)N[C@@H](C(O)=O)CCC(O)=O LMKYZBGVKHTLTN-NKWVEPMBSA-N 0.000 description 1
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 1
- 108091028709 DNA adenine Proteins 0.000 description 1
- 230000030914 DNA methylation on adenine Effects 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- LOJYQMFIIJVETK-WDSKDSINSA-N Gln-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LOJYQMFIIJVETK-WDSKDSINSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- UKKNTTCNGZLJEX-WHFBIAKZSA-N Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UKKNTTCNGZLJEX-WHFBIAKZSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- YBAFDPFAUTYYRW-YUMQZZPRSA-N Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O YBAFDPFAUTYYRW-YUMQZZPRSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- CHDWDBPJOZVZSE-KKUMJFAQSA-N Glu-Phe-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CHDWDBPJOZVZSE-KKUMJFAQSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 description 1
- VPZXBVLAVMBEQI-VKHMYHEASA-N Glycyl-alanine Chemical compound OC(=O)[C@H](C)NC(=O)CN VPZXBVLAVMBEQI-VKHMYHEASA-N 0.000 description 1
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- TVRMJKNELJKNRS-GUBZILKMSA-N His-Glu-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N TVRMJKNELJKNRS-GUBZILKMSA-N 0.000 description 1
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 1
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 1
- 101000688216 Homo sapiens Intestinal-type alkaline phosphatase Proteins 0.000 description 1
- 108091029795 Intergenic region Proteins 0.000 description 1
- 102100024319 Intestinal-type alkaline phosphatase Human genes 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- FAIXYKHYOGVFKA-UHFFFAOYSA-N Kinetin Natural products N=1C=NC=2N=CNC=2C=1N(C)C1=CC=CO1 FAIXYKHYOGVFKA-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 description 1
- 240000007472 Leucaena leucocephala Species 0.000 description 1
- 235000010643 Leucaena leucocephala Nutrition 0.000 description 1
- 241000209510 Liliopsida Species 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- CIOWSLJGLSUOME-BQBZGAKWSA-N Lys-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O CIOWSLJGLSUOME-BQBZGAKWSA-N 0.000 description 1
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- QCZYYEFXOBKCNQ-STQMWFEESA-N Lys-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCZYYEFXOBKCNQ-STQMWFEESA-N 0.000 description 1
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- TUZSWDCTCGTVDJ-PJODQICGSA-N Met-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 TUZSWDCTCGTVDJ-PJODQICGSA-N 0.000 description 1
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 1
- 241001579016 Nanoa Species 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 101000854546 Petunia hybrida Protein WUSCHEL Proteins 0.000 description 1
- 241000219833 Phaseolus Species 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- BXNGIHFNNNSEOS-UWVGGRQHSA-N Phe-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 BXNGIHFNNNSEOS-UWVGGRQHSA-N 0.000 description 1
- KLAONOISLHWJEE-QWRGUYRKSA-N Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KLAONOISLHWJEE-QWRGUYRKSA-N 0.000 description 1
- JXWLMUIXUXLIJR-QWRGUYRKSA-N Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JXWLMUIXUXLIJR-QWRGUYRKSA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- RVRRHFPCEOVRKQ-KKUMJFAQSA-N Phe-His-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVRRHFPCEOVRKQ-KKUMJFAQSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- PYOHODCEOHCZBM-RYUDHWBXSA-N Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 PYOHODCEOHCZBM-RYUDHWBXSA-N 0.000 description 1
- UXQFHEKRGHYJRA-STQMWFEESA-N Phe-Met-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O UXQFHEKRGHYJRA-STQMWFEESA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 1
- 108700001094 Plant Genes Proteins 0.000 description 1
- 101100271190 Plasmodium falciparum (isolate 3D7) ATAT gene Proteins 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- GLEOIKLQBZNKJZ-WDSKDSINSA-N Pro-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 GLEOIKLQBZNKJZ-WDSKDSINSA-N 0.000 description 1
- BEPSGCXDIVACBU-IUCAKERBSA-N Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CN=CN1 BEPSGCXDIVACBU-IUCAKERBSA-N 0.000 description 1
- IWIANZLCJVYEFX-RYUDHWBXSA-N Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 IWIANZLCJVYEFX-RYUDHWBXSA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- AWJGUZSYVIVZGP-YUMQZZPRSA-N Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 AWJGUZSYVIVZGP-YUMQZZPRSA-N 0.000 description 1
- 101710083689 Probable capsid protein Proteins 0.000 description 1
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 1
- 101710150114 Protein rep Proteins 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 101710152114 Replication protein Proteins 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- UJTZHGHXJKIAOS-WHFBIAKZSA-N Ser-Gln Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O UJTZHGHXJKIAOS-WHFBIAKZSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- SBMNPABNWKXNBJ-BQBZGAKWSA-N Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CO SBMNPABNWKXNBJ-BQBZGAKWSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- UQTNIFUCMBFWEJ-IWGUZYHVSA-N Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O UQTNIFUCMBFWEJ-IWGUZYHVSA-N 0.000 description 1
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 1
- BWUHENPAEMNGQJ-ZDLURKLDSA-N Thr-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O BWUHENPAEMNGQJ-ZDLURKLDSA-N 0.000 description 1
- BIYXEUAFGLTAEM-WUJLRWPWSA-N Thr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(O)=O BIYXEUAFGLTAEM-WUJLRWPWSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- QOLYAJSZHIJCTO-VQVTYTSYSA-N Thr-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O QOLYAJSZHIJCTO-VQVTYTSYSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- 241000018137 Trialeurodes vaporariorum Species 0.000 description 1
- OHGNSVACHBZKSS-KWQFWETISA-N Trp-Ala Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)N[C@@H](C)C([O-])=O)=CNC2=C1 OHGNSVACHBZKSS-KWQFWETISA-N 0.000 description 1
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 1
- HJXOFWKCWLHYIJ-SZMVWBNQSA-N Trp-Lys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HJXOFWKCWLHYIJ-SZMVWBNQSA-N 0.000 description 1
- LWFWZRANSFAJDR-JSGCOSHPSA-N Trp-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 LWFWZRANSFAJDR-JSGCOSHPSA-N 0.000 description 1
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 1
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 1
- CWQZAUYFWRLITN-AVGNSLFASA-N Tyr-Gln-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O CWQZAUYFWRLITN-AVGNSLFASA-N 0.000 description 1
- HPYDSVWYXXKHRD-VIFPVBQESA-N Tyr-Gly Chemical compound [O-]C(=O)CNC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 HPYDSVWYXXKHRD-VIFPVBQESA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- XIFAHCUNWWKUDE-DCAQKATOSA-N Val-Cys-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XIFAHCUNWWKUDE-DCAQKATOSA-N 0.000 description 1
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 1
- JKHXYJKMNSSFFL-IUCAKERBSA-N Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN JKHXYJKMNSSFFL-IUCAKERBSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- GIAZPLMMQOERPN-YUMQZZPRSA-N Val-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GIAZPLMMQOERPN-YUMQZZPRSA-N 0.000 description 1
- 230000001133 acceleration Effects 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 239000000061 acid fraction Substances 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 238000000211 autoradiogram Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 239000007640 basal medium Substances 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 101150055766 cat gene Proteins 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000003501 co-culture Methods 0.000 description 1
- 238000012136 culture method Methods 0.000 description 1
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- STVZJERGLQHEKB-UHFFFAOYSA-N ethylene glycol dimethacrylate Chemical compound CC(=C)C(=O)OCCOC(=O)C(C)=C STVZJERGLQHEKB-UHFFFAOYSA-N 0.000 description 1
- 241001233957 eudicotyledons Species 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000011536 extraction buffer Substances 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 230000035784 germination Effects 0.000 description 1
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 108010002685 hygromycin-B kinase Proteins 0.000 description 1
- 239000007943 implant Substances 0.000 description 1
- 238000002513 implantation Methods 0.000 description 1
- 238000000099 in vitro assay Methods 0.000 description 1
- 208000000509 infertility Diseases 0.000 description 1
- 230000036512 infertility Effects 0.000 description 1
- 231100000535 infertility Toxicity 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- QANMHLXAZMSUEX-UHFFFAOYSA-N kinetin Chemical compound N=1C=NC=2N=CNC=2C=1NCC1=CC=CO1 QANMHLXAZMSUEX-UHFFFAOYSA-N 0.000 description 1
- 229960001669 kinetin Drugs 0.000 description 1
- PWPJGUXAGUPAHP-UHFFFAOYSA-N lufenuron Chemical compound C1=C(Cl)C(OC(F)(F)C(C(F)(F)F)F)=CC(Cl)=C1NC(=O)NC(=O)C1=C(F)C=CC=C1F PWPJGUXAGUPAHP-UHFFFAOYSA-N 0.000 description 1
- 210000002751 lymph Anatomy 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 108010083942 mannopine synthase Proteins 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019799 monosodium phosphate Nutrition 0.000 description 1
- 239000004570 mortar (masonry) Substances 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 231100001160 nonlethal Toxicity 0.000 description 1
- 230000031787 nutrient reservoir activity Effects 0.000 description 1
- -1 one of which Proteins 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 230000008659 phytopathology Effects 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 230000003134 recirculating effect Effects 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000001172 regenerating effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000005204 segregation Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- AJPJDKMHJJGVTQ-UHFFFAOYSA-M sodium dihydrogen phosphate Chemical compound [Na+].OP(O)([O-])=O AJPJDKMHJJGVTQ-UHFFFAOYSA-M 0.000 description 1
- SUKJFIGYRHOWBL-UHFFFAOYSA-N sodium hypochlorite Chemical compound [Na+].Cl[O-] SUKJFIGYRHOWBL-UHFFFAOYSA-N 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 238000004114 suspension culture Methods 0.000 description 1
- UZKQTCBAMSWPJD-UQCOIBPSSA-N trans-Zeatin Natural products OCC(/C)=C\CNC1=NC=NC2=C1N=CN2 UZKQTCBAMSWPJD-UQCOIBPSSA-N 0.000 description 1
- UZKQTCBAMSWPJD-FARCUNLSSA-N trans-zeatin Chemical compound OCC(/C)=C/CNC1=NC=NC2=C1N=CN2 UZKQTCBAMSWPJD-FARCUNLSSA-N 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 238000004804 winding Methods 0.000 description 1
- 229940023877 zeatin Drugs 0.000 description 1
Abstract
The invention includes the production of transgenic plants containing DNA encoding wild-type AC1 / C1 and mutant sequences that negatively interfere in trans with geminiviral replication during infection. The transgenic plants produced by the invention with resistant to vir infection
Description
TRANSGENIC PLANTS EXPRESSING GEMINIVIRUS GENES
DESCRIPTION OF THE INVENTION A variety of geminivirus genes and mutant derivatives were generated and transferred to plant cells. Transgenic plants containing these genes were produced. Transgenic plants that contain trans-dominant mutations developed resistance to geminivirus infection.
BACKGROUND OF THE INVENTION Geminiviruses present the most serious disease problem in many vegetable crops in tropical and subtropical regions. For example, the major epidemics of geminivirus infections of beans and tomatoes have occurred in Florida, the Caribbean Basin, Mexico, and Central America. In the past, traditional production methods could not produce crops with significant levels of resistance to geminiviruses. An approach according to the present invention consists in the production of transgenic plants resistant to the virus. The group of geminiviruses are DNA viruses, single-stranded, which infect plants both onocotyledonous (monocot) and dicotyledonous (dicot).
P1681 / 98MX A common feature among all geminiviruses is the genomic replication mode, which comprises a coiling circle mechanism. The tomato spot virus (ToMoV) is an example of a geminivirus. It has a two-component (two-part) genome, a capacity to infect dicotyledonous plants and is transmitted by the whitefly. The DNA of its two genomic components, DNA-A and DNA-B, has been previously cloned and sequenced. The clones isolated from ToMoV DNA-A and DNA-B are themselves infectious when they are inoculated mechanically into tomato and N. benthamiana, or when they are distributed to any host by agro-inoculation. A DNA sequence, geminiviral, invariant, required for replication is present in an intergenic common region (CR), in each genomic component. The ToMoV DNA-A genomic component has four ORFs, one of which, ACl, must be expressed for efficient replication of both component A and component B. The ACL ORF encodes a protein that has several functional activities: a specific DNA binding site for the CRs of DNA-A; a DNA cutting activity; and an NTP binding activity. The DNA binding region mediates a function of the initiator protein during the replication of the winding circle. P1681 / 98MX The AC3-protein is a second function encoded by ToMoV comprised of DNA replication and the production of circular, single-stranded DNA. The virus of the normal alechugamiento of the yellow leaf of the tomato (TYLCV) is another example of a geminivirus. The TYLCV has a monopartite genome organization, infects monocotyledonous plants, and is transmitted by the grasshopper. The Cl protein of TYLCV is required for replication, it is encoded by Cl ORF. Being DNA viruses, geminiviruses offer advantages for antiviral strategies. Several geminiviruses have been cloned and sequenced. Transgenic plants having mutant viral genes can be produced, for example, by introducing expression cartridges comprising virus genes mutated directly into the plants with a particle gun, or into the plant suspension cells or protoplasts by electroporation, or by transfection with agrobacterium.
SUMMARY OF THE INVENTION The invention comprises the production of transgenic plants comprising DNA encoding wild and mutant ACl / Cl sequences that negatively interfere in trans with geminiviral replication during infection. The resulting transgenic plants
P1681 / 98MX are resistant to viral infection.
BRIEF DESCRIPTION OF THE FIGURE Figure 1 shows the results of a transient assay for trans-domination by BGMV-GA in NT-1 cells.
DETAILED DESCRIPTION OF THE INVENTION A. Production of infectious clones Geminivirus infectious genes are produced by methods known to those skilled in the art. Geminivirus DNA is extracted from the known tissue. Young tissue is collected from infected plants, frozen in liquid nitrogen and ground in a mortar, in the presence of extraction buffer (10 mM Tris-Cl, pH 7.5, 10 M EDTA, and 1% SDS, weight / volume ratio 1: 4) and centrifuge for approximately 105 g-minutes. The supernatant is adjusted to approximately 1 M NaCl and stored at about 42 C for approximately 12 hours, then centrifuged for approximately 107 g-minutes. After phenol extraction, the solution is adjusted to OA M sodium acetate, and the DNA is precipitated in alcohol. Viral nucleic acids are isolated by agarose gel electrophoresis. These viral nucleic acid fractions are
P1681 / 98MX digest with restriction enzymes and are isolated by electrophoresis with agarose gel. The DNA is cloned into a suitable cloning vector, for example pBluescript KS +, and its identity is confirmed by sequencing. Full-length clones of the geminivirus genome are constructed, for example, by a PCR-based cloning strategy. Values are synthesized that will amplify the full ORF plus about 10 nucleotides on each side of the ORF. The primers must include unmatched bases to create restriction sites before and after the Cl or ACL ORF, which will allow convenient cloning, without altering the initiation and germination codons of Cl or ACl ORF. Primer 1 is complementary to the viral homosentide strand of the geminiviral genome and binds with it. The 5 'end of the primer is located 40-50 base pairs 3' from the translation start, and the 3 'end is located 10-20 base pairs 3' from the translation start site. The start of translation is defined by CAT in the homosense, viral strand; the ACl or Cl ORF are located in the complementary strand of the viral genome and the coordinates of the sequences are given for the homosense, viral polarity strands. Primer 2 is complementary to, and fixed with, the strand (complementary homosense polarity) of the
P1681 / 98MX geminiviral genome. The 5 'end of the primer is located 40-50 base pairs 3' of the ACl or Cl termination codon, and the 3 'end of the primer is located 10-20 base pairs 3' of the translation stop as determined by the strand of polarity of homosentido, complementary. The primers are used in a PCR reaction to amplify the Cl or ACl ORF of the cloned viral DNA or geminivirus DNA, purified. The amplified DNA is digested with appropriate restriction enzymes to shorten the engineered sites at the ends of the PCR fragment and the resulting fragment is cloned into a suitable vector. Clones containing Cl or ACl are identified and sequenced to confirm the presence and integrity of Cl or ACL cloned ORF. The sequence of the ACl / Cl ORF is used to design the primers for the amplification of the PCR fragment of ACl / Cl ORF, for example, these primers are designed so that this ORF is cloned in the BamHI and HindIII sites of pBluescript KS +. The Ba HI site is located at the 5 'end of the complementary homosentide primer, which amplifies the amino-terminal end of the ORF. A HindIII site is located at the 5 'end of the viral homosentide primer that is fixed to the carboxy terminus of the ORF. Preferably, infectious clones are selected. The infectivity of the clones is determined
P1681 / 98MX for the construction of viral genes of Agrobacterium that have a greater length than complete it, with at least two common regions of DNA-A and DNA-B. The effectiveness is determined by inoculation with microparticles. The seeds are germinated on moist filter paper to produce 1-3 cm long radicles from a host, and this tissue is bombarded by particles coated with DNA with a particle gun. The inoculated plants are placed in a growth chamber at approximately 262C with photoperiods of approximately 14 hours. Infectivity is confirmed by PCR with specific primers for geminivirus or Southern blot analysis. For example, 1.3 kb PCR products are expected when primers PAL1V1978 and PAV1c715 are used. The cloned viral DNA is digested with restriction enzymes and analyzed on agarose gels to produce a unique fragment of 2.5 to 2.7 kb. The DNA bands are removed from the gel and cloned into an appropriate vector. For monopartite geminivirus, the graft preferably includes the complete genome. For bipartite geminiviruses, complete inserts of both genomic components are preferred. The individual insert of the monopartite geminiviruses or both cloned components of the bipartite geminiviruses are introduced into a host plant and tested for infertility by
P1681 / 98MX biolistic distribution or agroinoculation. The Cl (monopartite virus) or ACL (bipartite virus) ORF, cloned, are isolated by selecting the following characteristics: A. The ACl or Cl ORF codes for a protein product of approximately 42 kd. B. The nucleotide sequence of Cl or ACl ORF is at least 60% homologous to the ACL ORF of a previously identified geminivirus (eg, BDMV, ToMoV, or TYLCV). The deduced amino acid sequence of the ORF will contain several characteristic sequences that are similar in sequence and relative position within the ORF ie, portions within the Cl or ACl sequences.
B. Introduction of Mutations Mutations are introduced by mutagenesis directed to Cl sequence or ACL ORF cloned by methods known in the art, for example, using the method of Kunkel et al. (Recombinant DNA Methodology, 1989, pp. 587-601) (in the present "Kunkel mutagenesis"). In particular, mutations are introduced into the portions of the amino acid sequence in Cl or ACL ORF that are highly conserved among all geminiviruses. Four portions in the domain are preferred
P1681 / 98 X cut of protein DNA. These include (uppercase letters denote high concentration of amino acids, and lowercase letters denote some conservation, and "x" denotes a variable expression of the portion: (1) FLTYpxC (2) HlHvliQ (3) vKxYxdKd; and (4) FHPNIQxak Additionally, two portions are preferred in the NTP binding domain of the protein, these include: (5) Egx2RTGKt; and (6) NviDDi The individual codons that specify most of the highly conserved amino acids within these portions are mutated. , one or more of the following mutations were introduced to Cl or ACl ORF: (1) vKxYxdKd a (a) vKxFxdKd; (b) vKxAxdKd; (c) vKxYxdRd; (2) EGx2RTGKt (a) EGx2RTGHt; (b) EGx. RTGAt; (c) RGx2RTGKht; (3) NviDDi a (a) NviRDi; (b) NviKDi; or (c) NviDYi,
P1681 / 98MX (here, the mutations l (a), 1 (b), l (c), 2 (a), 2 (b), 2 (c), and 3 (a), 3 (b) , 3 (c), respectively). The acidic or basic amino acids are changed to the opposite charge, to alanine (alanine scan) or to other neutral amino acids. Mutant combinations are also made. An individual Cl or ACl ORF containing codon changes corresponding to vKxFxdKd and EGx2RTGHt (double mutations l (a) and 2 (a), above) are constructed and tested. Other mutants in the portions within ACl / Cl are possible and are used. The presence of the codon change is confirmed by DNA sequencing. The Agrobacterium-mediated transfer of the mutated ACL / Cl ORF, expressible by the plant, is done using methods known to those skilled in the art. If an infectious clone of geminivirus is available, the effects of mutations on replication can be tested. The mutation is introduced into the Cl or ACl ORF of an infectious clone. The mutant DNA is transferred to the cells of the plant. The replication of wild virus for infection is tested as a positive control. Mutations that create transdominant molecules generally cancel replication when engineered into infectious clones. A number of mutations change codons by
P1681 / 98MX amino acids conserved within these portions will be lethal and potentially transdominant. Other mutations in Cl or ACl that cancel the replication should also be considered potentially transdominant. Any non-functional Cl or ACl molecule has the potential to be transdominant. Cl or ACl ORF are installed in a suitable plant transformation vector in the homosense orientation and low in control of a constitutive, strong promoter sequence, and the terminator suitable for high level expression in target plant species . This step is performed for each of the created ACL / Cl ORF mutants.
C. Assays A transient assay is useful for selecting candidate constructs for transdominant interference activity. This is done by first co-inoculating protoplasts or a suspension culture of plant cells with the infectious geminivirus clone and a plasmid containing the Cl mutant or ACL ORF under the control of a plant promoter., constitutive, strong. The control treatments are inoculated with an infectious clone. The total DNA is harvested from the inoculated cells, and it is assessed for viral replication. The
P1681 / 98MX transdominant Cl or ACl mutants are identified as those that suppress genoviral replication relative to control treatments after co-inoculation. In vitro assays for trans-domination correlate lethal mutations and transdominant activity in transient assays. This is exemplified in a BGMV-GA model system. These results are easily applicable to produce a Cl or ACl transdominant ORF from other geminiviruses. The transgenic ToMoV-resistant plants were created by transforming them with an ACL ORF derived from ToMoV and engineered to contain similar mutations. The expression cartridges constructed above are installed in binary plasmids and transformed into Agrobacterium strains for the protocols and transformation of plants. The plants are transformed by methods adjusted to the specific variety or specific line. The transgenic status of the R0 plants and the latest generation plants and their segregation progeny is verified by routine methods, including: ELISA assays for the detection of the NPTII protein; DNA assays such as PCR amplification with the AC / C1 primers of the plants and Southern blot hybridization for the detection of transgenes using AC / C1 as
P1681 / 98MX viral probes; and Southern blot hybridization to detect ACl or Cl transgenes. The demonstration that R. plants transformed with geminivirus gene constructs express the NPTII protein is done by ELISA. The protein in the leaf tissue samples taken from the trees of transgenic plants R. is extracted and made for the NPTII protein by ELISA. The transgenic expression of geminivirus is also measured by Northern blot analysis. Transgenic expression in a number of R0 and R plants was by Northern blot hybridization. The total RNA extracted from the leaves of the transgenic plants is separated by agarose gel electrophoresis. After electrophoresis, the RNA is transferred by pressure on the membrane. The membranes are hybridized with radiolabeled probes, washed and autoradiographed.
D. Identification of gemini-resistant transgenic plants Transgenic plants resistant to geminivirus are identified by stimulating transgenic plants and progeny. The self-pollinated R regenerating primary R plants were agro-inoculized approximately 3 weeks after sowing. Alternative methods include biolistic inoculation,
P1681 / 98MX wise transmission of infected tissue (if the infected product can be transmitted mechanically), transmission by insect, or graft. For bipartite geminiviruses, the agroinoculation preferably comprises the distribution of DNA-A and DNA-B components of greater length than the complete one (ie, at least 2 common regions), in the saplings using strains of Agrobacterium, for example, which contain a binary vector that has in its T-DNA a tandem, partial or complete duplication of the infectious geminivirus DNA. Geminivirus resistant plants are incorporated into traditional breeding programs to develop elite breeding lines that include the transgene that transfers resistance. These changes produce Cl or ACl molecules when they are made alone or in combination with a mutant. Plants that show the highest steady-state levels of the transgenic RNA are stimulated by Agrobacterium-mediated inoculation. Resistance is determined by lack or delay of expression of symptoms and low levels of viral DNA in plants as determined by hybridization tests by crush transfer with viral probes
(Gilbertson et al., 1991. Plant Disease 75: 335-342). Resistance is also determined by inoculation by virulently Bemisia tabaci as described. HE
P1681 / 98MX expects that plants with low mRNA levels for the mutated ACl / Cl have symptoms and those with high levels have no symptoms. Since the ACl / Cl proteins have domains required for the DNA cut and NTP binding that are conserved among all geminiviruses, an antiviral strategy comprising the mutated ACl / Cl protein is applicable to the geminivirus schemes of plants in general . Other viruses include:
Viruses Genes / Regions SEQ ID No
TGV-GA1 DNA-A of ACl ORF (Seq 57 complementary) TGV-GAl Common and intergenic (viral) region 58 TLCV-IND Full length sequence 59 (Piece of stem block) "Chinese" ACl Partial, common region, AVI 60 PHV ACl, Common Region, Intergenic, AVI 61 PHV BV1 ORF 62
PHV BCI, Hypervor., Common, and Interg. 63 Reg
The following examples are offered by way of illustration and not by way of limitation P1681 / 98MX EXAMPLE 1. Gene Expression Vectors All methods of isolating plasmid DNA and culture of E. coli were carried out according to normal methods. The restriction digests, which are filled with the protrusions of the '5' end, the treatments of alkaline phosphatase, intestinal calf, DNA and ligation of the DNA fragments and electrophoretic gel separation of the DNA fragments and their isolation from The gels were made according to the recommendations and methods of the manufacturers. The large plasmid DNA of Agrobacterium was isolated. The transformation culture of Agrobacterium was analyzed according to the general methods known to those skilled in the art. Tables 1A and IB list the transcribed, geminiviral, expression vectors, binary plasmids, and Agrobacterium strains described in the examples.
P1681 / 98MX Table 1A. Constructions used to create transgenic plants 1/98 X Table IB. Constructions for inoculation for Agrobacterium
Example 1.1 Synthesis of the expression vector pRTIOle. The pRTlOle expression vector listed in Table 1A was developed by removing a HindIII-EcoRV fragment of 325 bp from pUC8-CaMCAT (Pharmacia) and inserting it in pRTlOl digested with HincII-HindIII (Dr. Topfer, Max Plank-Institut fur Zuchtungsforschung, 5000 Koln 30, Germany), thereby adding a segment of the 35S promoter containing the inserter in the 5 'direction (Kay et al., Science, 1987, 236: 1299-1302) to the 5' end of the sequence promoter 35S of PRT101.
Example 1.2 Expression Vector pDH51 The expression vector pDH51 of Table 1A (T. Hohn, Friedrich Miescher Institute, PO Box 2543, CH-4002, Basel, Switzerland) is comprised of a 35S-terminator 35S promoter expression cartridge of CaMV.
P1681 / 98MX Example 1.3 Synthesis of the expression vector P? 1C035 The expression vector p? LC035 of Table 1A was derived from pCOlbam (Dr. Neil Olszewski, University of Minnesota-T in Citiies, College of Biological Sciences). The 1.0 kb EcoRI-SalI fragment of pCOlbam, which contains the promoter that controls the expression of the yellow spot virus of the commelin (CoYMV) transcript (Medberry et al., Plant Cell, 1992, 4: 185-92), was inserted into pSLlldO digested with EcoRI-Dral (Pharmacia). The 1.1 kb EcoRI-Dral fragment of the resulting construct was inserted into pRTlOl, digested with EcoRI-HincII, thereby replacing the 35S CaMV promoter of pRTIOl with the CoYMV promoter. Some restriction sites, including BamHI and BglII, were removed by partially digesting this plasmid with HindIII and recirculating it with T4-DNA ligase to produce p? ICO35.
Example 1.4 Synthesis of the expression vector pRTIN The expression vector pRtIN of Table 1A is a derivative of pRTlOle and pCOIN, in which the 35S terminator of pRTlOle replaced with the terminator / polyadenylation site of the protease inhibitor gene (TIHH) of pCOIN. To produce pCOIN, the fragment of
P1681 / 98MX HindIII-Xbal 760 bp pTPI-1 (Dr. C. Ryan, Washington State University, Pullman, WA) containing T ^ was inserted into Bluescript II KS + (Stratagene) digested with HindlII-Xbal. The 770 bp Xbal-Kpnl fragment of the resulting construct was inserted into pUCl9 digested with Xbal-Kpnl. The 800 bp Pstl-Kpnl terminator fragment of the resulting plasmid was ligated with the Kpnl-Pstl fragment of p? C035 to produce pCOIN. The 805 bp Shpl-SspI fragment of pCOIN was inserted into pRTl013 digested with Sphl-Smal, thereby replacing the 35S terminator of pRT1013 with TINH. The resulting plasmid was further modified by the insertion into an EcoRI site of a DNA fragment with the EcoRI ends and the internal restriction sites including BamHI to produce pRTIN.
EXAMPLE 1.5 Synthesis of the expression vector eP mas-T p. Nae m The vector of exp-ression ePmas p "nas of Table 1A was assembled by combining an activation sequence in the 5 'direction of octopine-synthase (oes) UAS) and a promoter of mannopine synthase (mas2 '). The UAS was excised from p5050 (Dr. Paul J.J. Hooykaas, State Univ. Of Leiden, 2333 AL Leiden, The Netherlands), which was isolated from the LBA4404 strain of Agrobacterium.
P1681 / 98MX tumafaciens (Dr. P.J.J. Hooykaas). An EcoRI fragment of 2.8 kb of pALl050, containing nt 13362-16202 from oes UAS was inserted into the EcoRI site of pSLH80 (Pharmacia). A 311 bp SacI-BamHI fragment of the resulting plasmid, containing nt 13774-14085 of the UAS, was ligated into pBluescript II KS + digested with SacI-BamHI. A 285 bp Xhol-Mfel fragment containing the UAS was ligated with the EcoRI-XhoI fragment of pBluescript II KS + with OAS to produce a plasmid having a structure of UAS oes elements repeated in tandem. An EcoRI-XhoI fragment of the recombinant plasmid was ligated with another Mfel-XhoI fragment to produce a recombinant plasmid, pBluescript 9 UAS3, having three or more UAS elements, repeated in tandem. The promoter element mas2 'was isolated as follows.
Plasmid pE93 (Dr. stan Gelvin, Purdue University) was derived from pRK290 (Ditta et al., 1980). The # 13 fragment of EcoRI of pE93 contains nt 16020-21634 of the Ti plasmid of octopine, which lacks an internal Clal fragment at nt 8672-20128 (Ecol3? Cla). A 4kb EcoRI-Xhol fragment was ligated with the SalI-EcorI fragment of pBR322 to produce pJTS213. This plasmid was introduced into E. coli GM119 (Dr. Gurnam Gili, Pharmacia &Upjohn, Kalamazoo, MI), which is deficient in DNA adenine methylation. In this way, the Clal site normally
P1681 / 98MX indigestible starting at nt 20128 on Ecol3? Cla is split by Clal. A 951 bp Clal-Ncol fragment of pJTS213 containing nt 21079-20128 was isolated and ligated with the ClaI-Ncol fragment of pSL1180 to produce A cartridge of the mannopin synthase promoter, enhanced with UAS, was assembled as follows. A fragment of Clal-FspI mas2 'of 365 bp from pSLUSO + P ^. ligated with the ClaI-EcoRV fragment of pBluescript + VAS3 the clones in which the mas2 'was inserted in the 3' direction of the UAS repeat were identified by restriction, to facilitate the addition of the phaseolin transcription terminator , an Xhol-Sall fragment from the 250 bp (month) multiple cloning site was ligated from pSLH80, into the recombinant plasmid digested with
Xhol. The two plasmids, pBluescript + UAS3 + Pmas + mcs
(guidelines I and II), which contain a construction with the month inserted in the two possible orientations was isolated. The phaseolin terminator was added to pBluescript + UAS3 + P? Nas + mcs, ending the assembly of Epmas-mcs-Tphas, as follows. A 1.1 kb PstI-EcoRi fragment of pUC19-hph-Tphas (described later in the assembly of Pga482? + HYGR), which contains the phaseolin transcription terminator (Tphas), was ligated with pBluescript II KS + digested
P1681 / 98MX with SacII-Clal. A 1.2 kb SacII-Clal fragment of the resulting plasmid was ligated with the SacII-Clal fragment of pBluescript + UAS3 + Pmas + mcs (orientation I) to produce a plasmid having the ePmas-Tphas insert.
Example 1.6 Synthesis of the expression vector pLAT The expression vector pLAT Table 1A was produced as follows. The promoter of LAT52 (Twell et al., Development, 1990, 109: 705-13) was used to construct an ACl gene construct in the homosense orientation that is not expressed in vegetative tissue. A 600 bp Ncol-Sall fragment from pLAT52-7a (Dr. S. McCormick, Plant Gene Expression Center, USDA ARS, Albany, CA), which contains the LAT52 promoter, was ligated with pSLH80 digested with Ncol-Sall for produce pLAT.
Example 2.1 synthesis of the binary vector pJTS246? The binary vector pJTS246? of Table 1A was produced as a derivative of pGA482 (Dr. G. An. Washington State University, Pullman, WN), by replacing the NPTII sequence controlled by nopaline-synthase with a selectable marker of the 35S-NTPII-terminator promoter. phaseolin, from CaMV. The selected marker was placed on the left border of T-Adn to ensure that the passenger gene, inserted in the right border of T-DNA, was seen
P1681 / 98MX transferred to the plant cell. A BamHI fragment of pUC8-CaMVCAT was ligated with a 2.2 kb BamHI fragment of pDOB513ro4.6K (JL Slinghton, Pharmacia &Upjohn), which contains the coding region of NPTII and the transcription terminator of No. 26 of T-DNA from octopine T plasmid, to produce pJTS228. The construction of pJTS228 has the 2.2 kb fragment, inserted as a transcription fusion unit immediately in the 3 'direction of the CaMV 35S promoter of pUC8-CaMVCAT. The majority of the CAT gene of pUC8-CaMVCAT was deleted from pJTS228 when digested with EcoRI to produce pJTS228 ?. A 4.0 kb BamHI-NcoI fragment from pJTS228 was ligated with a 1.55 kb BamHI-NcoI fragment from pkanPhas (JL Slingtom, Pharmacia &Upjohn) which contains the 5 'coding sequences of NPTII distant from the Ncol site of the phaseolin terminator. A resulting plasmid, in which the T-DNA transcription terminator was fused to the NPTII ORF was replaced with the phaseolin storage protein terminator of Phaseolus vulgaris, was designated pJTS233. The PJTS233 was digested with HindIII and the blunt ends were made. The EcoRI fragment of 2.8 kb containing the 35S promoter, the region and coding of NPTII and the phaseolin terminator was isolated and ligated in a 3 part reaction with the Smal-BamHI fragment of
P1681 / 98MX pUC9 and a 8.0 kb BamHI-EcoRI fragment of pGA482 containing the host wide range replicon. The boundaries of T-DNA from nopaline Ti, right and left, and the nopaline-synthase promoter. The desired construction, pHJTS246 was cloned and isolated. PJTS246 was modified to eliminate drug resistance of amplicillin contributed by pUC9. The plasmid was digested with Seal and HindIII, and treated with the HindIII linkers followed by digestion with HindIII. The resulting plasmid, pGTS246 ?, had 1730 bp of the deleted pUC sequence of PJTS246.
Example 2.2 Synthesis of the binary vector pJTL222 pJTS222 is pGA492 (Dr. G. An) in which a 2.2 kb BamHI-HindIII fragment was replaced by the 430 bp BamHI-HindIII fragment of pUC8-CamVCAT containing the promoter 35S of CaMV.
Example 2.3 Synthesis of binary vector pJTS235 pJTS235 was a binary plasmid derived from pGA492 in which the coding sequence of NPTII and its transcription terminator were removed and replaced with the selectable marker of CaMV 35S promoter-NPTII coding sequence. Phaseolysis erminator. PJTS235 was constructed by ligating a fragment of BamHI from
P1681 / 98MX 2.1 kb of pJTS233 containing the coding sequence of NPTII and the phaseolin terminator in the BamHI fragment of pJTS222. The resulting plasmid, pJTS235, bore the NPTII structural gene under the control of the 35S promoter.
Example 2.4 Synthesis of the binary vector pJTS250. The pJTS250 was mounted as follows. A 353 bp PstI-BamHI fragment of pLG90 (provided by Dr. L. Gritz, Biogen, SA, 46 Route des Acacias, Geneva, Switzerland), which includes the coding region of the complete hygromycin-phosphotransferase (hph) gene of the translation initiation codon of ATG at 15 bp distant from the translation terminator was ligated with the PstI-BamHI digestion of pUC9 to produce pUC9 + hph-a. Another aliquot of pLG90 digested with Aval with blunt ends of Aval. The 670 bp PstI fragment was cloned into the Smal-Pstl fragment of pUC9 to produce pUC9 + hph-b, creating a 670 bp fragment of PstI-EcoRI. The 1.18 kb Nael-BamHI fragment containing the phaseolin terminator (J.L. Slington) was cloned into the BamHI-Smal fragment of pUC9 to create pUC9 + Tphas. The three above fragments (353-, 670-, and 1180-pb) were ligated with the Ba HI digestion of pJTS222. The resulting binary plasmid, pJTS250, was produced comprised of a selected marker of plant P356-hph-Tphas and the ability to transform the plant tissue to
P1681 / 98MX hygromycin resistance via gene transfer mediated by Agrobacterium.
Example 2.5 Synthesis of the binary vector pGA482? + HYGR. PGA482? + HYGR was produced from the following plasmids: pGA470 (Dr. G. An); pJTS262, which includes the full T-DNA of pGA470 and a broad host range replicon; pJTS222; pJTS250, a binary plasmid including HYGR constructed by the ligation of four fragments, which includes the 353 bp pstl-BamHI fragment encoding part of the hph coding region, the 670 bp PstI fragment encoding the rest php coding, the 1180 bp Nael-BamHI fragment constituting Tphas and pJTS222 digested with BamHI; pUC19B2-Pnos; pUCl9B2 + hph-Tphas; Pnos-hph-Tphas and pGA482G (Dr. G. An). PGA482? + HYGR was constructed as follows: SalI fragments from pGA470 were ligated into pBR322 digested with SalI. The resulting construct, pJTS262, is comprised of the complete T-DNA of pGA470 (from the right to left boundary, and a second fragment containing part of the wide host range replicon.) The BclI-BamHI fragment of 345 bp of the resulting plasmid, which has the nopaline-synthase promoter (Pnos) fused
P1681 / 98MX 42 bp 5 'nopaline synthase (14 n-terminal amino acids) was inserted into the BamHI site of pUC19B2, which has the Smal site of pUC19 converted to a BglII site. The resulting recombinant plasmid, pUC19B2 + Pnos, had the Pnos segment within the BamHI-BglII fragment. A 2.2 kb BamHI fragment containing the region and coding of hph from the bacterial transposon Tn5 and the phaseolin transcription terminator (hph-Tphas) was isolated from pJTS250. The 2.2 kb hph-Tphas fragment was inserted into the BamHI site of pUC19B2. PUC19B2-Pnos was digested with BamHI and HindIII. PUC19B2 + hph-Tphas was partially digested with BamHI and completely with HindIII to produce the 2.2 kb fragment with BamHI-HindIII ends. The fragment was ligated with the p-UCI9B2-Pnos p-lásmid digired with BamHI-HindIII. The resulting construction, a pressure cartridge of Pnos-hph-tPhs v pUC19B2 + HYGR, was partially digested with BamHI; The resulting 5.3 kb fragment was digested with BglII to produce a 2.6 kb fragment. Separately, pGA482 digested with HindIII-EcoRI was freed with pSLH80 digested with HindIII-EcoRI that lacks a month. The resulting construction was further restricted to suppress 2.5 kb of the original T-DNA that contains the month. This binary construct was digested with BglII and ligated with the 2.6 kb BamHI-BglII fragment with ends of
P1681 / 98MX BamHI-BglII, to produce pn? S_PhP_Th_s fragment to produce pGa482? + HYGR.
EXAMPLE 3 Insertion of Geminivirus DNA in the Expression Vector Constructs Example 3.1 Synthesis of ToMoV-FL ACL ORF, wild type The ToMoV was collected from the infected tomato plants in Bradenton, Florida and inoculated into Nicotiana benthamiana and tomato. The DNA of the infected plants was isolated and the viral DNA was isolated by preparative agarose gels. Viral DNA was digested with BglII, pSP72 digested with BglII was inserted to produce a clone of full-length component A (SEQ ID No. 17). Similarly, a full-length B-DNA clone was produced from the viral DNA digested with BamHI and inserted into pBluescript II KS + (SEQ ID No. 18). The DNA of any clone inoculated in N. benthamiana produced symptoms similar to the original virus. A dimer clone in which DNA-A was inserted as a tandem duplicate, direct, into the cloning vector was made by removing the individual insert from its original vector with BglII and reinserting it into pSP72 digested with BglII. The Apal fragment of the resulting plasmid comprising the cloned DNA-A was inserted into the
P1681 / 98MX Apal site from pBluescript II KS +
EXAMPLE 3.3 Synthesis of ToMoV-Acl The wild-type ACL homosense ORF and the antisense ORF (as) of Table 1A were constructed from the ACL ORF (SEQ ID No. 1 and 2) and part of the intergenic region was amplified by PCR of the N-benthamiana DNA infected with ToMoV, using the primers PFL-2449B SEQ ID 9) (5 '-GGATCCGAGTAACTCATCTGGAGTACC-3') and PFL-1108B (SEQ ID 10) (5'-GGATCCGGAAGTAGATGGAGCACCCGC-3 '). The 1.1 kb PCR product was digested with Ba HI and inserted into the pBluescript II KS + of the BamHI site to produce pTFAcl.
Example 3.4 Synthesis of ToMoV-ACldl For the production of the mutated ORF, the ACL ORF and part of the intragenic region was amplified by PCR of the N. benthamiana DN infected with ToMoV by PCR using primers PFL-2549H (SEQ ID 16) ( 5'-TATCAAAGCTTGAGTAACTCATCTGGAGTACC-3 ') and PFL-1108B (SEQ ID 10) (5'-TATCGGATCCGGAAGTAGATGGAGCACCCGC-3') to produce a HindIII site near the translation initiation codon and a BamHI site near the translation terminator codon. The product digested with HindIII-BamHI was ligated with pBluescript II KS + digested with HindIII-BamHI, in a
P1681 / 98MX orientation homosentido in relation to the origin fl of replication. Mutations were generated in the NTP binding portions of ACl of this clone. The trans-dominant lethal mutants (dlm) of the ACl protein (SEQ ID 3 and 4) were created by Kunel mutagenesis. The plasmid of pBluescript above was transformed into CJ236 (Invitrogen Co.), a dut-, ung-strain, so that the amplified plasmid DNA contains the bacillus. The individual strand DNA was produced by transfecting the cells transformed previously with the auxiliary phage M13-K07. The complementary sense strand of the ssDNA was synthesized in vitro using deoxynucleotides, including dTTP, and two mutagenic primers. (SEQ ID 11) (5'-CAAGAACAGGGcAcACGATGTGGG-3 '). The lowercase letters indicate the altered nucleotides. The product was treated with T4 in DNA ligase and transformed into E. coli XLl-Blue (Stratagene) to amplify the plasmids containing mutations produced by mutagenic buffers, which resulted in mutations 2 (a), 3 (b) and 3 (c), described above.
Example 3.5 Synthesis of ToMoV-ACldlml The BamHI 1.1 kb fragment of pTFACl, containing wild-type ACL ORF, was inserted into the BamHI site of pRTlOle to produce a homosent construct
P1681 / 98MX (pRTACl-S). The triple mutant ORF of Acl (ACl dlm) was removed as a 1.1 kb XhoI-BamHI fragment of its vector and inserted into the homosense orientation in pRTIOle digested with XhoI-BamHI to produce pRTlOle + ACldlm. Plasmids pRTACl-S and pRTlOle + ACldlm were excised at the unique Pmll site. After further digestion with Seal, the 1.6 and 3.2 kb fragments of each digestion were isolated. The 1.6 kb fragment of pRTAACl-S was ligated with the 3.2 kb fragment of pRTlOle + ACldlm to produce a construct comprising the sequence designated ToMoV-ACldlml (SEQ ID 5) in Table 1A, mutation 2A described above .
Example 3.6 Synthesis of ToMoV-ACldlm23 Plasmids pRTACl-s and PrtlOlE + ACldlm were excised at the unique Pmll site. After further digestion with Seal, 1.6 and 3.2 kb fragments were isolated from each digestion. The 3.2 kb fragment of pRTACl-S was ligated with the 1.6 kb fragment of pRTlOle + ACldlm to produce a construct comprising the sequence designated ToMoV-ACldlm23 (SEQ ID 7 and 8) in Table 1A, double mutations 3 ( b) and 3 (c) described above.
P1681 / 98MX Example 3.7 Synthesis of ToMoV-ACl-AC2-AC3 A construct containing the AC1-AC2-AC3 fragments was produced by ligating a BamHI-HindlII fragment from a binary plasmid comprised of a dimer of the infectious ToMoV component, full length, with PJTS222 digested with BamHI-HindIII. The BamHI-HindlII fragment of this construct was inserted into pBluescript II KS + digested with BamHI-HindIII. A 1.24 kb BglII-Sphl fragment of the resulting plasmid, containing the complete coding sequences of AC2 and AC3 and two thirds C-terminal of the ACl ORF (SEQ ID 15) was ligated into pSL1180 digested with BglII-Sphl. The resulting plasmid contained the? AC1-AC2-AC3 fragment of ToMoV-A.
Example 4 Mutant wild-type sequences of
TYLCV-IS-EG Example 4.1 Synthesis of TYLCV-Cl Tomato leaves with symptoms of TYLCV were harvested in Fayoum, Giza and Ismailia, Egypt. Tomatoes from Geneva 80 and N. benthamiana were injected. Tomatoes and tobacco developed typical symptoms of TYLCV. The infectious TYLCV DNA (TYLCV-IS-EG1) was isolated from infected N-benthamine. The Cl OFR of TYLCV-IS-EG1 (SEQ ID 19 and 20) was produced as a 1.1 kb fragment by PCR amplification of the DNA of the infected plant. The used primers
P1681 / 98MX were pTYIRc4 SEQ ID 21) (5 '-GGCCATAGAGCTTTGAGGGATCC CGATTCATTTC-3') and PTYC2vl679. After digestion with BamHI, the PCR products were ligated with pBluescript II KS + digested with BamHI to produce pEGALI-ASl, which contained the Cl ORF, as TYLCV-C1.
Example 4.2 Synthesis of TYLCV-? C2As A truncated C2 ORF (? C2) was produced as a 365 bp fragment for PCR amplification of N. benthamiana DNA infection with TYLCV-IS-EG1. The primers PTYC2vl499 (SEQ ID 32) (5'- ATTTGTGGATCCTGATTACCTTCCTGATGTTGTGG-3 ') and PTYC2cl814 (SEQ ID 35) (5'-AAACGGATCCTTTGAAAAATTGGGC-3') were used. The primers were digested with BamHI and ligated into pBluescript II KS + digested with BamHI to produce pTYC2-25-1, which contained the? C2 ORF in the antisense orientation.
Example 4.3 Synthesis of TYLCV-V1 A truncated VI ORF was produced as a 625 bp fragment by PCR amplification of N. benthamiana DNA from TYLCV-IS-EG1. The primers used were PTYARlv466 (SEQ ID 33) (5 '-TTAGGATCCTATATCTGTTGTAAGGGC-3') and PTYARlcl046 (SEQ ID 34) (5'-TTAACTAATGCAGGATCCTACATTCCAGAGGGC-3 '). The primers were digested with BamHI and
P1681 / 98MX ligated into pBluescript II KS + digested with BamHI to produce pTYVl-6-1, which contains the Vl ORF.
Example 4.4 Synthesis of TYLCV-C1-? C2-? C3 A 1.3 kb fragment of the genotype TYLCV-IS-EG1 nt 1471 to nt 20 via nt 2787 (Navot et al. 1991) was produced by PCR amplification of N-DNA. infected benthamiana. The primers used were PTYIRc4 and PTYC2vl499. The primers were digested with BamHI and inserted into pBluescript II KS + digested with BamHI to produce pTYEGC4.
Example 4.5 Synthesis of TYLCV ORF mutations A full-length infectious clone of TYLCV-IS-EG1 (pTYEG14) was created to serve as the basis for TYLCV ORF constructs and for agroinoculation (see below). The DNA of a tomato infected with TYLCV-IS-EG1 was used as a template in two PCR amplification reactions. The first primers used were PTYClc2196 (SEQ ID 37) (5 '-AAATCTGCAGATGAACTAGAAGAGTGGG-3') and PTYVlvll64 (SEQ ID 36) (5'-GTACGAGAACCATACTGAAAACGCCT-3 ') to amplify a fragment. The fragment digested by PstI-SphI was ligated with pGEM-5zf + digested with PstI-SphI to produce the pEGUA plasmid. The second amplification reaction used the
P1681 / 98MX primers PTYClv2182 (SEQ ID 39) (5'-TAGGCCATGGCCGCGCAGCGGAATACACG-3 ') and PTYC3cl320 (SEQ ID 38) (5'-GGTTCTGCAGCAGAGCAGTTGATCATGTATTG-3'). the fragment digested with PstI-NcoI was ligated with pGEm-5zf + digested with PstI-NcoI to produce Pegil-7B. To assemble or assemble the full length virus, the PstI-NcoI fragment of pEGIl-7B was ligated with the PstI-NcoI fragment of pEGIlA to produce a construct comprising the full length 2.7 kb viral DNA. The full-length construct was tested for infectivity by biolistic distribution in tobacco cells and was found to create symptoms identical to the original disease. This clone was called pTYEG14. The orientation of the insertion with respect to the origin fl of the replication was confirmed. Three mutant Cl ORFs were constructed, each having one or two base changes that alter the amino acid specificity of a codon by Kunkel mutagenesis using a plasmid representing the full-length infectious clone of TYLCV-IS-EG1 (pTYEG14) as template. Mutagenic primers (all viral homosense) were: PClv2467 (SEQ ID 25) (5'-GTTTCCGTCTcgCTCCACGTAGG-3 '); PClv2101 (SEQ ID 28) (5'-GGCCCACATTGTTgCGCCTGTTCTGC-3 '); and PClv2000 (SEQ ID 31) (5'-
P1681 / 98MX GGGTCTACGTCTctAATGACGTTGTACC-3 ') - (The lowercase letters indicate altered nucleotides). The resulting DNA was treated with T4-DNA ligase and transformed into E. coli XLI cells to produce the following constructs: pTYKl04R # 1 (SEQ ID 23 and 24), mutation I (c); pTYK235A # 4 (SEQ ID 26 and 27), mutation 2 (b); and pTYD259R # 5 (SEQ ID 29 and 30), mutation 3 (a), as described above. The three mutant Cl ORFs were cloned into pCRII (Invitrogen). Cl ORF for each mutant was amplified by PCR using primers PTYIRc4 (SEQ ID 21) (5'-GGCCATAGAGCTTTGAGGATCCCGATTCATTTC-3 ') and PTYCvl707 (SEQ ID 42) (5'GGTAGTATGAGGATCCACAGTCTAGGTCT-3'). The amplified fragments were ligated with pCRII to produce: pClKl04R # 2, mutation l (c); pClK225A # 4, mutation 2 (b); and pClD259R # 2, mutation 3 (a), as described above. These three ORFs in the BamHI fragments of their respective vectors provided the Cl ORF mutants for the expression cartridges for the Agrobacterium-mediated transformation.
Example 5 Constructions of BGMV Wild type and mutated versions have been prepared in BGMV Cl (replication protein) ORF. The wild type sequence (SEQ ID 43 and 44) was mutated by Kunkel mutagenesis. The mutations in BGMV-Cl described
P1681 / 98MX herein include:
SEQ ID Nos. 45, 48, 51 and 54, refer to the mutagenized BGMV-C1 ORF DNA sequences presented in the sequence listing. These code for protein sequences 46, 49, 52 and 55, respectively. The mutant sequences were derived from the wild type DNA by Kulken mutagenesis with the mutagenic primers 47, 50, 53 and 56, respectively. A 1.8 kb BamHI-XhoI fragment containing the 35S promoter transcriptionally fused to a mutated ACL ORF of BGMV-GA followed by the nopaline synthase transcription terminator was removed from WRG2398 (Dr. DR Russell, Agracetus Corp., Middleton , Wl). The ACl coding sequence was mutated in vitro using Kunkel mutagenesis to produce double mutations 2 (c) and 2 (a). this fragment was ligated with pRTlOle digested with the same enzymes and the ligation mixture was used to transform DH5 cells of E. coli. Some
P1681 / 98MX transformants produced the desired recombinant plasmids that had the complete expression cartridge of WRG2398 inserted into PRTlOle (pJTS364). The new expression cartridge was removed as a 2.9 kb fragment from one of the recombinant plasmids by partial digestion with HindIII. Was it linked with pJTS246? which was digested with HindIII was treated with CIAP. After transformation of the DH5 cells, a recombinant was identified among the transformants that the expression cartridge inserted into the binary vector had. The DNA of this binary vector was transformed into LBA4404 of A. tumefaciens and a transformant containing the binary was called strain At364. Plasmid pJTS364 was digested with EcoRV to remove the duplicated 35S promoters (P355) and the excised DNA ligated. A fraction of the pooled molecules had a deletion for the fragment between the EcoRV sites containing the 35S (e35s) enhancers of WRG2398 and P35S of pRTlOle. The ligation mixture was used to transform the DH5 cells. Among the transformants, the desired, deleted plasmid was found and named pJTS365. The 2.5 kb expression cartridge was removed and ligated with pJTS246? treated with CIAP, digested with HindIII. The ligation mixture was used to transform DH5 cells. The recombinant binary plasmid was identified among the resonants and one of these was used as
P1681 / 98MX a source of DNA that was transformed into LBA4404 from A. tumefaciens. The transformed Agrobacterium that has the recombinant binary was called AT365. The BGMV ORF listing was installed in appropriate promoter vectors and then in binary plasmids for the transformation measured by Agrobacterium in Phaseolus plants. Additionally, expression vector was distributed in plants by biolistic acceleration and other methods with which plants can be transformed. Transformed, regenerated plants are evaluated for levels of transgenic RNA accumulation by RNA transfer analysis to build transgene activity. Subsequently, the progeny were evaluated for the ability to resist infection by BGMV.
Example 6 Expression cartridges and strains of Agrobacterium The following constructions of ToMoV were produced. Example 6.1 RTSC and RTAC The 1.1 kb BamHI fragment of pTFACl, which contains wild-type ACL ORF was inserted into the BamHI site of pRTl013. The antisense constructs (pRTACl-A) and homosentide (pRTACl-S) were produced. The HindIII fragments of each plasmid were each inserted into the site
P1681 / 98MX HindIII from pJTS246? in the same transcriptional direction as the selectable NPTII marker. The binary vectors were transformed into LBA4404 to produce RTAC (antisense) and RTSC (homosentide).
Example 6.2 DHSC and DHAC The wild type ACL ORF was also inserted as a BamHI fragment in pDH51 digested with BamHI in both orientations creating pDHACl-S (homosentide) and pDHACl-AS (antisense). The recombinant expression cartridge was removed with EcoRI and inserted into pJTS235 digested with EcoRI. The recombinant binary plasmids were selected to have the expression cartridge inserted such that the directions of the transcript as the selectable marker. These binary plasmids were introduced into LBA4404 by transformation to produce DHSC (homosentide) and DHAC (antisense).
Example 6.3 RTSFS pRTACl-S was digested with Bgll and blunt ends were made by filling. The resulting plasmid, pRTACl-S? BglII, lacked a BglII site but had a Sau3A site of 4 core bases. This mutation changed in translation reading frame when adding four nucleotides, thus producing a finalizing codon
Translation P1681 / 98MX, and truncating the polypeptide (SEQ ID 13 and 14). A 2.1 kb HindIII fragment of pRTACl-S? BglII, which contains the expression cartridge, was inserted in both orientations into the HindIII site of pJTS246 ?, unidirectional or divergent with respect to the homosense of the selectable marker. A plasmid having a unidirectional orientation was introduced into LBA4404 by transformation to produce RTSFS.
Example 6.4 RT3AA The 1.24 kb BglII-Kpnl fragment of PSL1180 +? AC1-AC2-AC3 was ligated into pRTACl-A digested with BglII-Kpnl to produce, pRT3AA, a construct similar to pRTlOle with the ACL, AC2 and AC3 ORF inserted in the antisense orientation. A 2.7 kb HindIII fragment of pRT3AA was inserted into the HindIII site of pJTS246? in the unidirectional orientation. The construction was introduced into LBA4404 by transformation to produce RT3AA.
Example 6.5 LASD and LASU The 600 bp EcoRI-HincIl fragment of psL1180 + PLAT52 was ligated with pRTACl-S digested with EcoRI-HincII to replace the 800 bp EcoRI-HincII promoter fragment by the EcoRI promoter fragment. HincII of LAT52 of 600 bp. After the alignment of the plasmid
P1681 / 98MX EcoRI, the start codon of ATG was instructed by bean nuclease. The resulting plasmid contained an EcoRI and HindIII, but lacked an Ncol site. Accordingly, the sequences flanking the Ncol site mutated therefor as in the 5'-untranslated, promoter, guideline of LAT52, original. The untransformed 5 'guide was lengthened to 181 bp and included 60% nucleotides A / T. The HindIII plasmid fragment containing the expression cartridge was inserted into the HindIII site of pJTS246? in both unidirectional and divergent orientations with respect to the homosense of the selectable marker. A binary plasmid of each type was transformed into LBA4404 creating the LASU and lasd strains, respectively.
Example 6.6 MEU and MEU2 The triple mutant ACL ORl (dlmACl) was removed as a 1.1 kb XhoI-BamHI fragment of its vector and inserted in homosense orientation in pRTlOle digested with XhoI-BamHI. The 2.1 kb expression cartridge created in this way was removed from pRTlOle + ACldlm by incompletely digesting the recombinant vector with HindIII and by isolating a 2.1 kb fragment. This fragment was inserted into the HindIII site of pJTS246? to produce an improved, mutated unidirectional vector (MEU). A second binary plasmid comprising the expression cartridge that is
P1681 / 98MX doubled in tandem in the unidirectional orientation was called MEU2. Both of the above binary vectors were transformed into LBA4404 to produce MEU and MEU2, respectively.
Example 6.7 MUA and MUB ToMoV-ACldlml was partially digested with HindIII and the 2.1 kb expression cartridge was isolated. ToMoV-ACldlm23 was completely digested with HindIII and the 2.1 cartridge was isolated. Each cartridge was inserted into the HindIII site of pJTS246 ?. The recombinants were transformed into LBA4404 creating the strains of Agrobacterium MUA and MUB, respectively.
Example 6.8 MUAIN and MUBIN The 1.2 kb XhoI-BamHI fragment of PRT1013 + AC1 dlml containing the ACldlml ORF was ligated with the XhoI-BamHI fragment of pRTIN + Geneblock in a homosense orientation. This construct was incompletely digested with HindIII followed by complete digestion with Seal to produce a 2.6 kb fragment comprising the expression cartridge. Are they linked with pJTS246? digested with HindIII in a divergent orientation with respect to the selectable marker. The resulting Agrobacterium strain was called MUAIN. P1681 / 98MX The 2.1 kb BamHI fragment of pRTl01e + ACldlm23, which contains the ACldlm23, was ligated with the OpRTIN + Geneblock plasmid digested with BamHI in the homosense orientation. This plasmid was digested with HindIII and Seal that produces a 2.6 kb expression cartridge fragment that was inserted into pJTS246? digested with HindIII in a unidirectional orientation. The plasmid DNA of this clone was transformed into LBA4404 to produce MUBIN.
Example 6.9 CODLM The 1.1 kb BamHI fragment containing the wild type ACL ORF was inserted into the BamHI site of p? LC035 in a homosense orientation to produce p? L035 + AClS. the 4.5 kb Apal-BglII fragment of P? 1C035 + AC1S was restricted to suppress a wild-type ACL ORF comprising 475 bp and ligated to the Apal-BglII fragment of pRTlOlei + ACl dlml to replace the wild-type internal fragment by the mutated fragment. The recombinant (p? LC035 + ACl dlm) was incompletely digested with HindIII, the 2.4 kb fragment containing the isolated expression cartridge and inserted into the HindIII site of pJTS246? in a unidirectional orientation. The plasmid was transformed into LBA4404 cells to produce CODLM.
P1681 / 98MX Example 7 Constructs Containing TYLCV-IS-EG1 Example 7.1 LCA A 4.1 kb BamHI fragment of pEGALl-ASl containing Cl ORF was inserted into the BamHI site of pRTlOle in an antisense orientation to produce pRTLACl-A . A 2.1 kb HindIII fragment of pRTLCAl-A was inserted into the HindIII of pJTS246? in unidirectional orientation (U) with respect to the directions of transcription. The LBA4404 cells were transformed into the resulting plasmid to produce LCA.
Example 7.2 LCR 'The 350 bp BamHI fragment encoding part of the C2 ORF of TYLCV-IS-EG was removed from pTYC2-25-l and ligated into the BamHI site of pRTIOle. The resulting construct contained the truncated C2 ORF inserted in an antisense orientation with respect to P35S. The 1.3 kb expression cartridge was removed by digestion of HindIII and inserted into the HindIII site of pJTS246 ?. The resulting recombinant plasmid DNA was partially digested with the HindIII and ligated with the Cl antisense expression cartridge. The desired plasmid had a copy of each of each inserted expression cartridge such that the directions of the transcription of all the
P1681 / 98MX cartridges were unidirectional. The DNA of this binary plasmid was transformed into LBA4404 to produce a strain, LCR 'comprising the binary, recombinant, two-cartridge plasmid.
Example 7.3 LCR '' A 620 bp BamHI fragment of pTYVl-6 encoding part of the VL ORF of TYLCV-IS-EG1 was ligated into the BamHI site of pRTlOle in an antisense orientation with respect to the 35S promoter. A HindIII fragment of the resulting plasmid was ligated into the HindIII site of pJTS246? in a unidirectional direction with respect to the selected marker. The plasmid DNA of this recombinant was transformed into LBA4404 to produce LCR ".
Example 7.4 RT3CA The 1.3 kb BamHI fragment of pTYEGC4 containing the C1 +? C2 +? C3C structure was inserted into the BamHI site of pRTlOle in an antisense manner with respect to transcription of the 35S promoter. The 2.3 kb HindIII fragment from pTYEGC4, the resulting plasmid containing the expression cartridge was inserted into the HindIII site of pJTS246? in a direction in transcription unidirectional LBA4404 was transformed with this plasmid to produce RT3CA. P1681798MX The 1.3 kb BamHI fragment of pTYEGC4 comprising the C1 +? C2 +? C3 DNA was ligated into the BamHI site of p? LCo35 in an antisense orientation with respect to the promoter of the yellow spot virus of Commelina. The 2.8 kb HindIII fragment the resulting plasmid containing an expression cartridge was inserted into the HindIII site of pJTS246? in a unidirectional orientation with respect to the selected marker. The transformed plasmid DNA of LBA4404 produced C03CA. 1.2 kb mutated fragments containing Cl
ORF were removed from their pCRII vectors and ligated directionally into the EcoRV-HindIII fragment of eP as-mcs-Tphas. The resulting constructions were digested with Xhol and Nael. The HindIII fragments of pGA482? 9 + HYGR were blunt-ended with filler, and were digested with XhoI. The Xhol-Nael expression cartridges were ligated into the binary vector which had a Xhol cohesive end and a blunt end to produce three C104, C325 and C259 constructs. The DNA of each of the constructs was transformed into LBA4404 to produce the strains LC104, LC225 and LC259 and in the Agrobacterium strain EHA105 (Mogen International, N.V.) to produce strains EC104, EC225 and EC259.
P1681 / 98MX Example 7.5 RT3CS The 1.3 kb BamHI fragment of pTYEGC4 containing the C1 +? C2 +? C3C structure was inserted into the BamHI site of pRTlOle in a manner of homosense with respect to the transcription direction of the 35S promoter. The 2.3 kb HindIII fragment from pTYEGC4, the resulting plasmid containing the expression cartridge was inserted into the HindIII site of pJTS246? in a unidirectional transcription direction. LBA4404 was transformed with this plasmid to produce strain RT3CS.
Example 8. Production of Transgenic Plants Containing the Described Constructs and Analysis of Transgenic Expression. Transgenic plants were produced by Agrobacterium culture methods, well known to those skilled in the art. The media of the compositions used are defined herein, for 1 liter: 1 / 2X basal: l / 2x MS salts (Gibco), 10 g sucrose, 7 g agar, pH 5.8; TCM: salts IX MS, 3 g30 G of sucrose, 0.2 g of KH2P04, vitamins of IX N & N (Gibco), 0.1 mg of 2,4-D, 0.05 mg of kinetin, 20 mg of acetosyringone, 7 g of agar , pH 5.8.
P1681 / 98MX 1Z: Sales IX MS, 30 g of sucrose, vitamins IX N &N, 1 mg of zeatin, 10 mg of kanamycin sulfate, 500 mg of carbenicillin, 7 g of agar, pH 5.8; TRI: Sales IX MS, 30 g of sucrose, vitamins IX N & N, 3 mg of glycine, 0.17 g of NaH2PO4.H20, 40 mg of acetosyringone, pH 5.8; MK 5: 1 / 2X MS salts, 10 g of sucrose, vitamins IX N & N, 3 mg of glycine, 0.17 g of NaH2P04.H20, 50 mg of kanamycin sulfate, 500 mg of carbenicillin, 7 g of agar, pH 5.8; C: Sales IX MS, 30 g of sucrose, vitamin IX N &N, 3 mg of glycine, 0.8 g of NH4N03, 2 mg of BAP, 0.5 mg of IAA, 100 mg of kanamycin sulfate, 250 mg of carbenicillin, 7 g of agar, pH 5.8. The seeds were sterilized by rinsing again in 70% EtOH and then in a solution of chlorox plus 20% Tween 20. The seeds were dried in vacuo and then rinsed several times with sterile water. The washed seeds were transferred into the 1 / 2X basal medium and incubated in a Magenta box for approximately 7 days in the 16-hour photoperiod daily. Fully extended cotyledons were cut aseptically under water. Two cuts were made at the end, and the tip of the cotyledonous piece and the
P1681 / 98MX center piece was retained and used. An agrobacterium culture containing the appropriate binary plasmid was started 24 hours before co-culture. The bacteria in 4-5 ml of culture were collected by centrifugation and redispersed in the TRI liquid medium. The solution was poured onto cut cotyledon pieces and incubated for approximately 25 minutes. The cotyledonary pieces were placed on sterile filter papers and placed compactly in the TMC medium. The plates were kept in the dark at room temperature for approximately 48 hours, after which they were placed on plates containing the 1Z medium. The plates were incubated in 16 hours of light daily at approximately 242C for approximately 21 days. The calluses that formed in the cotyledonary pieces were transferred to fresh 1Z plates and the shoots were removed as they were formed at 1 / 2X MK-5 for the implant. A 4 mm piece of shoot leaf was also placed in medium C for callus formation. From 12 to 14 days after planting in medium C, callus "-" or "+" were marked. Approximately 60 to 70% of the suckers with callus + were implanted in the MK5 medium. Those who had the callus + but were not implanted were cut out at the end and re-implanted in fresh MK5 tubes. Approximately 80% of these will be re-implanted
P1681 / 98MX on the second attempt. The suckers implanted to pot soil were removed when a strong root system has been developed, usually about 3 weeks after implantation. The plants were kept in a closed plastic bag for approximately 3 days, the bags opened directly after acclimatizing the young plant. A 6 to 8 mm piece of leaf tissue was collected for the NPTII ELISA assay. NPTII-positive plants were transferred to the greenhouse for seed production. Approximately 4 to 5 weeks in the greenhouse, leaf tissues were collected for RNA isolation and Northern blots were performed for these plants.
Example 9. Analysis of Transgenic Plants The expression of the transgene RNA in the transgenic tomato lines was achieved by maximizing the levels of transcription in steady state using Northern blot hybridization. The expression level of the transgene was used to select lines for inoculation. Total RNA was isolated from the leaves and stems of young plants and subjected to agarose gel electrophoresis. The appropriate ORF DNA probe was radiolabelled and hybridized to RNA transferred on paper. After the
P1681 / 98MX washed, the RNA was visualized for autoradiography on X-ray film. The following Tables 2, 3, and 4 summarize the results showing plants produced with geminivirus constructions described above. The following symbols are used: No + or No-, Positive or negative to the Northern blot; So + or So-, positive or negative to the Northern blot; *, no data; R0 and R., primary and progeny lines. Table 2 summarizes the transgenic tomato plants produced by transfer of the wild type ToMoV ORF DNA into the plant by Agrobacterium infection. For example, several tomato plants (TGM-1 to -17, -20, -24, -28, -29, -33 to -41, -47 to 4-49, -53, -54, -59 to - 67, -70 to -131; TTGV92-1 to -5, -10, -13 to -20) were produced by Agrobacterium containing the construction of RTAC. As shown in Table 1A, this construction is comprised of ToMoV OFL ACL in an antisense configuration. The predominant characteristics of these plants that contain RTAC in the presence of ToMoV in the plant tissue (ie So +), transcribed RNA (ie, No +), and transmitted these
P1681 / 98MX traits its progeny (R. RNA). Table 2 also describes transgenic plants with DHAC and RT3AA constructs, comprised of ToMoV ACl and antisense OR1-AC2-AC3 ORF, respectively (Table 1A). Table 3 describes transgenic tomato plants containing mutant ToMoV ORF. These include the constructions meu, meu2, Codlm, mub, mua, mubin, mauin, rtsfs, lasu, and lasd described in Table 1A. Table 4 describes transgenic tomato plants containing TYLCV ORF. These include the LCA, LCR, RT3CA, RT3CS and Co3CA constructs, of Table 1, which comprise Cl, C2 and C3 ORF of TYLCV. These results establish that the methods described herein produce transgenic plants that use DNA constructs that contain the ORF of the geminivirus.
P1681 / 98MX Table 2 TOMATO PLANTS TRANSFORMED WITH ToMoV 81 / 98MX Table 3 TOMATO PLANTS TRANSFORMED WITH CONSTRUCTIONS
MUTANTS, LETHAL, KINNERS, OF REP ORF OF TOMoV
MX
P1681 / 98MX Table 4 TRANSGENIC TOMATO PLANTS TRANSFORMED WITH
TYLCV GEN CONSTRUCTIONS
Example 10. Viral Stimulation of Plants
Transgenic Example 10.1 ToMoV Agro-Inoculation Vector
P1681 / 98MX A fragment of 5.6 kb composed of an infectious, full-length DNA-A dimer was ligated with the binary plasmid pJTS222 digested with BamHI-HindIII to produce the construct comprising the ToMoV-A dimer. The resulting plasmid produced transformed LBA4404 cells, used as component A in the agroinoculation experiments. A fragment of 6.9 kb Xbal including a full-length infectious clone of B-DNA and the complete pBluescript II KS + plasmid was inserted into the Xbal site of pJTS222. The resulting plasmid produced transformed the LBA4404 cells used as the B component in the agroinoculation experiments.
Example 10.2 Agroinoculation vector of TYLC-IS-EG1 The full-length TYLCV-IS-EG1 DNA of the infectious clone pTYEG14 was removed and the plasmid by digestion with Sphl was inserted in excess in high, molar in the Sphl site of pGEM5Zf +. The resulting plasmid, pTYEG7, contained an infectious TYLCV-IS-EG1 DNA dimer. The 6.7 kg fragment of the Scal-Pstl fragment of pTYEG7 comprised the dimer and part of pGEMZAf +. The 1.9 kb PstI-Scal fragment of psLH80 was ligated with the 6.7 kb fragment of pTYEG to produce a 8.7 kb construct
P1681 / 98MX with a single BglII site. The 7.0 kb Scal-BamHI fragment of the resulting recombinant plasmid was ligated with pJTS22 of Hpal-BamHI.
A resultant construct was made to transform LBA4404 cells to produce AtLCl, which was used in the TYLCV agroinoculation experiments.
. 3 Agro-inoculation procedure The R. plants of the self-pollinated R0 primary regenerants were inoculated 3 weeks after sowing. For bi-partite geminiviruses, agroinoculation comprised the distribution of ToMoV DNA-A and DNA-B of greater than full length (they must contain 2 common regions, in the small trees using agrobacterium.) A small amount of a mixture of Agrobacterium each containing a binary vector having in its T-DNA a tandem duplication, partial or complete, of the infectious geminivirus DNA was injected into the plant.For monopartite geminivirus, only one agrobacterial strain was required if it has a binary vector comprising a complete duplication partial of a full-length infectious DNA During the night, the cultures of agrobacteria were diluted and injected into the stems of one-month-old tomato trees Approximately 100 hours more
P1681 / 98 X late, a second inoculation identical to the first was performed. The detection of NPTII by ELISA was taken as an indication of the presence of the transgene. The agroinoculation experiments summarized in Tables 5 to 10 show an arrangement of the resistance phenotypes. The data show several transgenic tomatoes resistant to ToMoV infection, including DLM12, TTGV92-42, CODLM6, CODLM8, CODLM13, CODLM14, MUA9, MUB20, MUA8, MUA18, MUA8, and MUA41.
P1681 / 98 X Table 5. Agroinoculations with transgenic ToMoV-DLM
/ 98MX Table 5 (continued)
P1681 / 98MX Table 6. ToMoV Agroinoculations: 3AA Transgenic Plant Fraction free of symptoms and free of viruses
Line Positive Observation to NPTp Negative to NPT? (Generation) DPI vinal stain vteual stain
3AA3 (Rl) 25 1/14 1/14 0/6 0/6
3AA7 (Rl) 25 3/18 3/18 0/2 0/2
3AA9 (RÍ) 25 1/19 1/19 0/1 0/1
3AA12 (Rl) 25 0/14 0/14 0/6 0/6
3A? 13 (Rl) 25 1/4 0/4 0/4 0/4
3AA16 (Rl) 25 1/11 1/11 0/9 0/9
3AA18 (Rl) 25 0/4 0/4 2/20 2/20 transformed 25 0/15 0/15
3AA13 (Rl) 22 2/19 2/19 0/1 0 / i
3AA21 (Rl) 25 3/13 3/13 0/7 0/7 AA22 (R1) 22 6/16 9/16 0/4 0/4 AA23 (R1) 25 3/9 4/9 0 1 0/1 AA26 (R1) 25 0/16 0 16 0/4 0/4 AA27 (Rl) 25 0/10 0 10 04 0/4 AA30 (Rl) 25 2/18 5/18 0/2 0/2 ransformed 25 0 / 15 0/15
98MX Table 7. Agroinoculations with ToMoV: The Transgenic Table 8. Agroinoculations with ToMoV: Transgenic CODLM 681 / 98MX Table 9. Agroinoculations with ToMoV: MUA and MUB Transgenic 8MX Table 10. Agroinoculations with ToMoV: of RTFS Transgenics
Example 10.4. Transfer Test by
Crushing of Geminivirus Approximately 3 weeks after agro-inoculation, visible symptoms were inspected and compared to tomato lines. Not transformed. At the same time, two leaf extract plant samples were applied to a hybridization membrane. This was done by crushing a sheet disc about 1/8 inch in diameter in the membrane such that the leaf blade completely impregnated the membrane. After the membrane was treated to denature the DNA in the extract, it was hybridized according to the same protocol as used
P1681 / 98MX for Northern blots with a radioactive probe that detected the DNA-B component of ToMoV and the Cl ORF of TYLCV. The presence of viral DNA in the plant cell could be detected by autoradiography in the presence of viral DNA highly correlated with the appearance of symptoms, an indication of susceptibility to infection. The free phenotype of the virus was correlated with the presence of the marker in families of transgenic tomatoes that secrete the NPTII marker. Figure 1 shows that the expression of the transgene
ToMoV ACldlm is required to give resistance to ToMoV infection mediated by agroinoculation. High expression is necessary but in itself it does not ensure resistance.
Example 10.4 Inoculations with White Fly Viruliferosa Ten white flies that have ToMoV were placed on each eight-day-old tree. Twenty-five saplings were used per family. In those families of small trees that were not homozygous for the transgene, the NPTII trials were correlated with the results of the transfer by crushing. Forty-one to thirty-one days after inoculation, the samples from each plant were taken by molecular undulation assays and
P1681 / 98MX biochemicals. The results are summarized in Table 11. The visual proportion gives on average of the plants, in which "0" there are no symptoms and "4" are the most marked symptoms. The results of the crush transfer gave the fraction of plants that were free of viruses.
P1681 / 98MX Table 11. Inoculations with ToMoV of the Florida Greenhouse Whitefly
Example 11. Transdomain in plant cell lines A mutated form of the ACL protein of BGMV inhibits the replication of DNA-A in a cellular system of tobacco suspension. To evaluate the mutants of
P1681 / 98MX protein from ACl for its potential to interfere with viral replication, a transgene assay was used to detect the trans-dominant interference activity of the viral ORF, mutant (Table 12 and Figure 2).
Table 12. Effects of BGMV ACl Mutations in Replication and Trans-domination
NT-1 cells were inoculated with wild-type DNA-A or a lethal mutant of BGMV-GA DNA-A (ADM, double mutations 2 (a) and (c) in combination with carrier DNA (PBS) or vectors of trans-expression of ACl containing the mutated forms of ACl ORF.The total DNA was collected from the NT-1 snuff areas at 72 hours of inoculation, subjected to electrophoresis in a gel of
P1681 / 98MX agarose, the paper was transferred and probed with a radiolabeled DNA probe corresponding to the DNA-A coat protein of BGMV-GA. The results demonstrate that the wild type ACl protein produced in trans can replicate in a lethal ACl mutant of DNA-A. More importantly, the results show that the codon changes in the cutting portion of the ACL ORF canceling the ineffectiveness and replication. In the transient assay for trans-domination interference, the double mutations l (a) and l (c) showed trans-domination interference (Table 12). Additional experimental treatments included: A + PBS: BGMV-wild-type DNA-A was introduced into NT-1 cells with PBS at DNA weight ratios of 1: 100 and 5:95 wild type: PBS; A + TDM: BGMV-DNA-A was introduced in NT-1 with transexpression vector coding for double mutations 2 (a) and 2 (c) ratios of 1: 100 and 5:95; A + TD262R: BGMV-DNA-A was introduced with trans-expression vector coding for mutation 3 (a) at ratios of 1: 100 and 5:95; ADM + PBS: DNA-A containing double mutations 2 (a) and 2 (c) with PBS at 5:95;
P1681 / 98MX ADM + TAC1: DNA-A containing double mutation 2 (a) and 2 (c) with the trans-expression vector encoding wild-type ACl at a ratio of 5:95. Transepression vectors used in these experiments expressed AC1 in the appropriate context for replication. Figure 1 represents the results of these experiments. Mutations created in the ACL ORF driven by the 35S promoter are listed in the first column. These ORFs are used in trans with the wild-type DNA-A of BGMV-GA to determine transdomination interference. Replication was carried out in an NT-1 test system. Replication is presented as the amount of reduction in replication compared to the level of wild-type replication. Transdomination was determined by engineering each mutation in a transliteration vector of ACl that contained the ACL ORF under control of the CaMV 35S promoter. The expression promoters of ACl mutants were co-inoculated into NT-1 cells together with WT DNA-A and productions in DNA-A replication were estimated from autoradiograms. The trans-domination data were expressed as the reduction observed in DNA-A replication when co-inoculated with each ACl mutant. Mutation 2 (c) confers a phenotype
P1681 / 98MX temperature sensitive for replication, which supports replication at 232C, but not at 282C. Replication was observed in inoculations with GGMV-DNA-A plus wild-type carrier DNA (A + PBS) (Figure 1). No replication was observed in inoculations with a DNA-A mutant containing double mutations 2 (a) and 2 (c) co-inoculated with the carrier DNA (DNA + PBS). Replication of double mutations 2 (a) and 2 (c) was complemented, however, by the trans expression of wild-type ACl in the transient expression vector (ADM + TAC1). The replication of BGMV-DNA-A in the presence of two different ACl mutants, the A + TDM and A + TD362R treatments produced the DNA-A replication of the virus compared to the A + PBS treatments. Accordingly, trans-expressing the ACl mutants can inhibit the replication of BGMV-DNA-A. The additional lethal mutants of AC1 inhibit replication when expressed in trans to DNA-A. The results show that non-lethal mutants do not exhibit detectable transdominant activity. While the trans-domination levels varied among different ACl mutants, only mutants lethal to replication exhibited transdominant interference. The levels of expression of ACl were directly related to trans-domination levels and
P1681 / 98MX replication (Figure 1). In this way, the expression of AC1 results in the production of a protein that mediates "trans" -efective suppression. That is, this protein probably binds to the CR region that mediates its suppressive effect by inhibiting the binding of wild-type ACl protein.
P1681 / 98MX LIST OF SEQUENCES
(1) GENERAL INFORMATION (i) APPLICANT: Stout, John T Luu, Hang T Maxwell, Douglas Ahlquist, Paul Hanson, Steve (ii) TITLE OF THE INVENTION: TRANSGENIC PLANTS THAT
EXPRESS GENES OF GEMINIVIRUS (iii) NUMBER OF SEQUENCES: 63 (iv) ADDRESS OF CORRESPONDENCE: (A) RECIPIENT: Dressler, Rockey, Milnamow & Kats (B) STREET: Two Prudential Plaza, Suite 4700 (C) CITY: Chicago (D) STATE: Illinois (E) COUNTRY: USA (F) POSTAL CODE: 60601 (v) COMPUTER READABLE FORM: (A) TYPE OF MEDIUM: Flexible disk (B) COMPUTER: compatible with IBM PC (C) OPERATING SYSTEM: PC-DOS / MS-DOS (D) PROGRAM: Patentln Relay # 1.0, version # 1.30 (vi) CURRENT APPLICATION DATA:
P1681 / 98MX (A) APPLICATION NUMBER: US (B) SUBMISSION DATE: (C) CLASSIFICATION: (viii) INFORMATION OF THE AGENT / LAWYER (A) NAME: Mueller, Lisa V (B) REGISTRATION NUMBER: 38,978 (C) ) ORDER NUMBER / REFERENCE: SVS3801P0260 (ix) TELECOMMUNICATION INFORMATION (A) TELEPHONE: 312-616-5400 (B) TELEFAX: 312-616-5460) INFORMATION FOR SEQ ID NO: l: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1162 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: circular (ii) TYPE OF MOLECULE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: Giminivirus of the tomato spot (C) INDIVIDUAL ISOLATED: Florida (ix) FEATURE: (A) NAME / KEY: CDS
681 / 98MX (B) LOCATION: 44..1127 (x) PUBLICATION INFORMATION: (A) AUTHORS: Gilbertson, RL Hidayat, SH Paplomatas, EJ Rojas, MR Hou, YM Maxwell, DP (B) TITLE: Pseudorrecombination between DNA components, cloned, infectious of the dwarf bean mosaic geminivirus and the tomato spot. (C) GACETA: Jour. General Virol. (D) VOLUME: 74 (F) PAGES: 23-31 (G) DATE: 1993 (xi) DESCRIPTION FOR SEQ. ID No: 1: GGATCCGAGT AACTCATCTG GAGTACCCCT TCTTATTACA AAA ATG CCC CCA CCA 55 Met Pro Pro i AAG AAA TTT AGA GTT CAG TCA AAG AAC TAT TTC CTC ACT TAT CCA CAG 103 Lys Lys Phe Arg Val Gln Ser Lys Asn Tyr Phe Leu Thr Tyr Pro Gln 5 10 15 20 TGC TCT TTT TCT AAA GAA GAA GCA CTT TCC CAA TTA CAA AAC CTA AAT 151 Cys Ser Leu Ser Lys Glu Glu Ala Leu Ser ßln Leu Gln Asn Leu Asn 25 30 35 ACC CCA GTC AAT AAG AAA TTC ATC AAA ATT TGC AGA GAG CTT CAT GAA 199 Thr Pro Va ^ Asn Lys Lys Phe lie Lys lie Cys Arg Glu Leu His Glu 40 45 50 AAT GGG QAA CCT CAT CTC CAT GTG CTT GTT CAG TTC GAA GGA AAG TAC 247 Handle Gly Glu Pro His Leu His Val Leu Val Gln Phe Glu Gly Lyß Tyr 55 60 65
P1S81 / 98MX CAG TGC ACG AAT AAC AGA TTC TTC GAC CTG GTC TCC CCA ACC CGG TCA 295 Gln Cys Thr Asn Asn Arg Phe Phe Asp Leu Val Ser Pro Thr Arg Ser 70 75 80 GCA CAT TTC CAT CCG AAT ATT CAG GGA GCT AAA TCG AGC TCC GAC GTC 343 Wing His Phe His Pro Asn lie Gln ßly Wing Lys Ser Ser As Asp Val 85 90 95 100 AAA TCG TAC ATC GAC AAG GAC GQA GAT ACA ATC GAA TGG GGA GAT TTC 391 Lys Ser Tyr He Asp Lys Asp Gly Asp Thr He Glu Trp Gly Asp Phe 105 110 115 CAG ATC GAC GGC AGA TCT GCC AGA GGA GGC CAG CAG TCT GCT AAT GAT 439 Gln He Asp Gly Arg Ser Wing Arg Gly Gly Gln Glp Ser Wing Asn Asp 120 125 130 TCA TAT GCG AAA GCG TTA AAT GCA AGT TCG GTT CAA TCT GCC TTA GCA 487 Ser Tyr Ala Lys Ala Leu Asn Ala Ser Ser Val Gln Ser Ala Leu Ala 135 140 145 GTT CTA AGG GAA GAA CAA CCA AAA GAT TTT OTA TTA CAA AAT CAT AAC 535 Val Leu Arg Glu Glu Gln Pro Lys Aßp Phe Val Leu Gln.Asn His Asn 150 155 160 ATC CGC TCT AAC CTA GAA CGA ATA TTC GCA AAG GCT CCG GAA CCG TGG 583 lie Arg Ser Asn Leu Glu Arg He Phe Ala Lys Ala Pro Glu Pro Trp 165 170 175 180 GTT CCT CCA TTT CAA GTC TCT TCT TTC ACT AAC GTT CCT GAC GAG ATG 631 Val Pro Pro Phe Gln Val Ser Ser Phe Thr Asn Val Pro Asp slu Met 185 190 195 CAG GAA TGG GCG GAT AAT TAT TTC GGG ACG GGT GCA GCT GCT GCG CG CCA 679 Gln Glu Trp Wing Asp Asn Tyr Phe Gly Thr Gly Wing Wing Arg Pro 200 205 210 GAG AGA CCT GTA AGT ATC ATC GTC GAG GGT GAT TCA AGA ACÁ GGG AAG 727 Glu Arg Pro Val Ser He He Val Val Glu Gly Asp Ser Arg Thr Gly Lys 215 220 225 ACG ATG TGG GCA CGT GCG TTA GGC CCA CAT AAC TAT CTC AGT GGA CAC 775 Thr Met Trp Wing Arg Wing Leu Gly Pro His Aßn Tyr Leu Ser Gly His 230 235 240 CTA GAC TTC AAT GGT CGA GTC TTC TCG AAT GAT GTG CAG TAT AAC GTC 823 Leu Asp Phe Aßn Gly Arg Val Phe Ser Asn Asp Val Gln Tyr Asn Val 245 250 255 260 ATT GAT G &C ATC GCA CCG CAT TAT CTA AAG CTA AAG CAC TGG AAA GAA 871 He Asp Asp He Wing Pro His Tyr Leu Lys Leu Lys His Trp Lyß Glu 265 270 275 TTs CTA QGG GCC CAG AAA GAT TGG CAA TCA AAT TGC AAG TAC GGT AAG 919
Leu Leu Gly Wing Gln Lys Asp Trp Gln Ser Asn Cys Lys Tyr Gly Lys 280 285 290
P1681 / 98MX CCA GTT CAA ATT AAA GGC GGA ATC CCA GCA ATC GTG CTT TGC AAT CCT 967
Pro Val Gln He Lyß Gly Gly He Pro Wing He Val Leu Cys Asn Pro 295 300 305 GGT GAG GGT GCC AGC TAT AAA GAG TTC TTA GAC AAA GCA GAA AAT ACÁ 1015
Gly Glu Gly Wing Ser Tyx Lys Glu Phe Leu Asp Lyß Wing Glu Asn Thr 310 315 320 GGT CTA AAG AAC TGG ACT GTC AAG AAT GCG ATC TTC ATC ACC CTC ACÁ 1063
Gly Leu Lys Asn Trp Thr Val Lys Asn Wing He Phe He Thr Leu Thr 325 330 335 340 GCC CCC CTC TAT CAA GAC AGC ACÁ CAG GCA AGC CAA GAA ACG GGC AAT 1111
Ala Pro Leu Tyr Gln Aßp Ser Thr Gln Ala Ser Gln Glu Thr Gly Aßn 345 350 355 CAG AAG GCG CAG GGT T GATCTACAGT GCGGGTGCTC CATCTACTTC CTAGG 1162
Gln Lys Ala Gln Gly 360 (2) INFORMATION FOR SEQ ID NO: 2: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 361 amino acids (B) TI PO: amino acids (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: protein (xi) DESCRI PCION FOR THE SEQUENCE: SEQ. ID No: 2: Met Pro Pro Pro Lys Lys Phe Arg Val Gln Ser Lys Asn Tyr Phe Leu 1 5 10 15
Thr Tyr Pro Gln Cys Ser Leu Ser Lys Glu Glu Wing Leu Ser Gln Leu 20 25 30 Gln Asn Leu Asn Thr Pro Val Asn Lys Lys Phe He Lys He Cys Arg 35 40 45 Glu Leu His Glu Asn Gly Glu Pro His Leu His Val Leu Val .ln Phe
50 55 60 Glu Gly Lys Tyr sln Cys Thr Asn Asn Arg Phe Phe Asp Leu Val Ser 65 70 75 80
P1681 / 98 X Pro Thr Arg Be Wing Hls Phe His Pro Asn He Gln Gly Wing Lys Ser 85 90 95
Be Ser Asp Val Lys Ser Tyr He Asp Lys Asp Gly Asp Thr He Glu 100 105 110 Trp Gly Aap Phe Gln He Asp Gly Arg Be Wing Arg Gly Gly Gln Gln 115 120 125 Ser Wing Asn Asp Ser Tyr. Ala Lys Ala Leu Asn Ala Ser Ser Val Gln 130 135 140 Ser Ala Leu Ala Val Leu Arg Glu ßlu Gln Pro Lys Asp Phe Val Leu 145 150 1S5 160
Gln Asn His Asn He Arg Ser Asn Leu Glu Arg He Phe Wing Lys Wing 165 170 175
Pro Glu Pro Trp Val Pro Pro Phe Gln Val Ser Ser Phe Thr Asn Val 180 185 190 Pro Asp Glu Met Gln Glu Trp Wing Asp Asn Tyr Phe Gly Thr Gly Wing 195 200 205 Wing Wing Arg Pro Glu Arg Pro Val Ser He He Val Glu Gly Asp Ser 210 215 220 Arg Thr Gly Lys Thr Met Trp Wing Arg Wing Leu Gly Pro His Asn Tyr 225 230 235 240
Leu Ser Gly His Leu Aßp Phe Asn Gly Arg Val Phe Ser Asn Asp Val 245 250 255
Gln Tyr Asn Val He Asp Asp He Wing Pro His Tyr Leu Lys Leu Lys 260 265 270 His Trp Lys Glu Leu Leu Gly Wing Gln Lys Asp Trp Gln Ser Asn Cys 275 280 285 Lys Tyr Gly Lys Pro Val Qln He Lys Gly Gly He Pro Ala He Val 290 295 300 Leu Cys Aßn Pro Gly Glu Gly Ala Ser Tyr Lys Glu Phe Leu Asp Lys 305 310 315 320
P1681 / 98MX Wing Glu Asn Thr Gly Leu Lys Asn Trp Thr Val Lys Asn Wing He Phe 325 330 335 He Thr Leu Thr Wing Pro Leu Tyr Gln Asp Ser Thr Gln Wing Be sln 340 345 350 Glu Thr Gly Asn Gln Lys Wing Gln Gly 355 360
(2) INFORMATION FOR SEQ ID NO: 3: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1169 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: singular (D) TOPOLOGY: circular (ii) TYPE OF MOLECULE: DNA (genomic) (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: Geminivirus of tomato spot (B) CEPA: Florida (ix) CHARACTERISTICS: (A) NAME / KEY: CDS (B) LOCATION: 44..1127 (x) PUBLICATION INFORMATION: (A) AUTHORS: Gilbertson, RL et al. (B) TITLE: Pseudorecombination between the DNA components, cloned, infectious of the
P1 £ 81/98 X geminivirus of bean dwarf mosaic and tomato blotch. (C) GACETA: Jour. General Virol. (D) VOLUME: 74 (F) PAGES: 23-31 (G) DATE: 1993 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 3: GGATCCGAGT AACTCATCTG GAGTACCCCT TCTTATTACA AAA ATG CCC CCA CCA 55 Met Pro Pro Pro 1 AAG AAA TTT AGA GTT CAG TCA AAG AAC TAT TTC CTC ACT TAT CCA CAG 103 Lys Lys Phe Arg Val sln Ser Lys Asn Tyr Phe Leu Thr Tyr Pro Gln 5 10 15 20 TGC TCT TTG TCT AAA GAA GAA GCA CTT TCC CAA TTA CAA AAC CTA AAT 151 Cys Ser Leu Ser Lys Glu Glu Ala Leu Ser Gln Leu ßln Asn Leu Asn 25 30 35 ACC CCA GTC AAT AAA TTC ATC AAA ATT TGC AGA GAG CTT CAT GAA 199 Thr Pro Val Asn Lys Lys Phe He Lys He Cys Arg ßlu Leu His Glu 40 45 50 AAT GGG GAA CCT CAT CTC CAT GTG CTT ÜTT CAG TTC GAA GOA AAG TAC 247 Asn Gly slu Pro His Leu His Val Leu Val Gln Phe Glu Gly Lys Tyr 55 60 65 CAG TGC ACG AAT AAC AGA TTC TTC GAC CTG GTC TCC CCA ACC CGG TCA 295 61n Cys Thr Aßn Asn Arg Phe Phe Aßp Leu Val Ser Pro Thr Arg Ser 70 75 80 scA CAT TTC- CAT ccs AAT ATT CAG GGA GCT AAA TCG AGC TCC GAC GTC 343 Wing His Phe His Pro Asn He sln Gly Wing Lys Ser Ser Asp Val 85 90 95 100 AAA TCG TAC ATC GAC AAG GAC GGA GAT ACA ATC GAA TGG GGA GAT TTC 391 Lys Ser Tyr He Asp Lys Asp Gly Asp Thr He Glu Trp Gly Asp Phe 105 110 115 CAG ATC GAC GGC AGA TCT GCC AGA GGA GGC CAG CAG TCT GCT AAT GAT 439 Gln He Asp Gly Arg Ser Ala Arg ßly ßly Gln Gln Ser Ala Asn Asp 120 125 130
1S81 / 98 X TCA TAT GCG AAA GCG TTA AAT GCA AGT TCG GTT CAA TCT GCC TTA GCA 487 Ser Tyr Ala Lys Ala Leu Asn Ala Ser Ser Val Gln Ser Ala Leu Ala 135 140 145 GTT CTA AGG GAA GAA CAA CCA AAA GAT TTT GTA TTA CAA AAT CAT AUC 535 Val Leu Arg Glu Glu Gln Pro Lys Asp Phe Val Leu Gln Asn His Aßn 150 155 160 ATC CGC TCT AAC CTA GAA CGA ATA TTC GCA AAG GCT CCG GAA CCG TGG 583 He Arg Ser Asn Leu Glu Arg He Phe Wing Lys Wing Pro Glu Pro Trp 165 170 175 180 GTT CCT CCA TTT CAA GTC TCT TCT TTC ACT AAC GTT CCT GAC GAG ATG 631 Val Pro Pro Phe Gln Val Ser Ser Phe Thr Asn Val Pro Asp Glu Met 185 180 195 CAG GAA TGG GCG GAT AAT TAT TTC GGG ACG GGT GCA GCT GCG CGG CCA 679 Gln Glu Trp Wing Asp Asn Tyr Phe Gly Thr Gly Wing Wing, Wing Arg Pro 200 205 210 GAG AGA CCT GTA AGT ATC ATC GTC GAG GGT GAT TCA AGA ACÁ GGG CAC 727 Glu Arg Pro Val Ser He He Val Val Glu Gly Asp Ser Arg Thr Gly His 215 220 225 ACG ATG TGG GCA CGT GCG TTA GGC CCA CAT AAC TAT CTC AGT GGA CAC 775 Thr Met Trp Ala Arg Ala Leu Gly Pro His As n Tyr Leu Ser Gly His 230 235 240 CTA GAC TTC AAT GGT CGA GTC TTC TCG AAT GAT GTG CAG TAT AAC GTC 823 Leu Asp Phe Asn Gly Arg Val Phe Ser Asn Asp Val ßln Tyr Asn Val 245 250 255 260 ATT AAA TAC ATC GCA CCG CAT TAT CTA AAG CTA AAG CAC TGG AAA GAA 871 He Lys Tyr He Wing Pro His Tyr Leu Lys Leu Lys His Trp Lyß Glu 265 270 275 TTG CTA GGG GCC CAG AAA GAT TGG CAA TCA AAT TGC AAG TAC GGT AAG 919 Leu Leu Gly Ala sln Lys Asp Trp Gln Ser Asn Cys Lys Tyr Gly Lys 280 265 290 CCA GTT CAA ATT AAA GGC GGA ATC CCA GCA ATC GTG CTT TGC AAT CCT 967 Pro Val Gln He Lys Gly Gly He Pro Ala lie Val Leu Cyß Asn Pro 295 300 305 GGT GAG GGSI GCC AGC TAT AAA GAG TTC TTA GAC AAA GCA GAA AAT ACÁ 1015 ßly ßlu Gly Ala Ser Tyr Lys Glu Phe Leu Asp Lys Ala Qlu Asn Thr 310 315 320
P1681 / 98MX GGT CTA AAG AAC TGG ACT GTC AAG AAT GCG ATC TTC ATC ACC CTC ACA 1063 ßly Leu Lys Aßn Tzp Thr Val Lys Asn Wing He Phe He Thr Leu Thr 325 330 335 340 GCC CCC CTC TAT CAA GAC AGC AC CAG GCA AGC CAA GAA ACG GGC AAT 1111
Wing Pro Leu Tyr Gln Asp Ser Thr Gln Wing Ser Gln Glu Thr Gly Asn 345 350 355 CAG AAG GCG CAG GGT T GATCTACAGT GCGGGTGCTC CATCTACTTC 1157 360 CACTTAGACT GT 1169
2) INFORMATION FOR SEQ ID NO: 4: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 361 amino acids (B) TYPE: amino acids (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: protein (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 4: Met Pro Pro Pro Lys Lys Phe. Arg Val Gln Ser Lys Asn Tyr Phe Lev 1 5 10 15
Thr Tyr Pro Gln Cys Ser Leu Ser Lys Glu Glu Wing Leu Ser Gln Leu 20 25 30 Gln Asn Leu Asn Thr Pro Val Asn Lys Lys Phe He Lys He Cys Arg 35 40 45 Glu Leu His Glu Asn Gly Glu Pro His Leu His Val Leu Val Gln Phe 50 55 60 Glu Gly Lys Tyr sln Cys Thr Asn Asn Arg Phe Phe Asp Leu Val Ser 65 70 75 80
Pro Thr Arg Ser Wing His Phe His Pro Asn He Gln Gly Wing Lys Ser 85 90 95
Be Ser Asp Val Lys Ser Tyr He Asp Lys Asp Gly Asp Thr He Glu 100 105 110
P1681 / 98MX Trp Gly Asp Phe Gln He Asp sly Arg Ser Wing Arg Oly sly sln sln 115 120 125 Be Ala Asn Asp Ser Tyr Ala Lys Ala Leu Asn Ala Ser Ser Val sln 130 135 140 Ser Ala Leu Ala Val Leu Arg slu Glu sln Pro Lys Asp Phe Val Leu 145 150 155 160 sln Asn His Asn He Arg Ser Asn Leu slu Arg He Phe Wing Lys Wing 165 170 175
Pro Glu Pro Trp Val Pro Pro Phe Gln Val Ser Ser Phe Thr Asn Val
180 185 190 Pro Aßp slu Met Oln slu Trp Wing Asp Asn Tyr Phe sly Thr Oly Wing 195 200 205 Wing Wing Arg Pro slu Arg Pro Val Ser He He Val Olu sly Asp Ser 210 215 220 Arg Thr Gly His Thr Met Trp Wing Arg Ala Leu Gly Pro His Asn Tyr 225 230 235 240
Leu Ser Gly His Leu Asp Phe Asn Gly Arg Val Phe Ser Asn Asp Val 245 250 255
Gln Tyr Asn Val He Lys Tyr He Wing Pro His Tyr Leu Lys Leu Lys 260 265 270 His Trp Lys slu Leu Leu Gly Wing Sln Lys Asp Trp sln Ser Asn Cys 275 280 285 Lys Tyr Gly Lys Pro Val Gln He Lys Gly Gly He Pro Ala He Val 290 295 300 Leu Cys Asn Pro Gly Olu sly Wing Ser Tyr Lys Olu Phe Leu Asp Lys
305 310 315 320
Ala slu Asn Thr Gly Leu Lys Asn Trp Thr Val Lys Asn Ala He Phe 325 330 335
He Thr Leu Thr Ala Pro Leu Tyr Gln Asp Ser Thr Gln Ala Ser sln 340 345 350
P1681 / 98MX Glu Thr Gly Asn Gln Lys Wing Gln Gly 355 360
2) INFORMATION FOR SEQ ID NO: 5: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1169 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: singular (D) TOPOLOGY: circular ( ii) TYPE OF MOLECULE: DNA (genomic) (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: Geminivirus of the tomato spot (B) CEPA: Florida (ix) FEATURE: ( A) NAME / KEY: CDS (B) LOCATION: 44..1127 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 5: GGATCCQAGT AACTCATCTG GAGTACCCCT TCTTATTACA AAA ATG CCC CCA CCA 55 Met Pro Pro Pro 1 AAG AAA TTT AGA GTT CAG TCA AAG AAC TAT TTC CTC ACT TAT CCA CAG 103
Lys Lys Phe Arg Val Gln Ser Lyß Asn Tyr Phe Leu Thr Tyr Pro Gln 5 10 15 20 TGC TCT TTG TCT AAA GAA GAA OCA CTT TCC CAA TTA CAA AAC CTA AAT 151
Cys Ser Leu Ser Lys Glu Qlu Wing Leu Ser Gln Leu Gln Asn Leu Asn 25 30 35
P1681 / 98MX ACC CCA GTC AAT AAG AAA TTC ATC AAA ATT TQC AGA GAG CTT CAT GAA 199 Thr Pro Val Asn Lys Lys Phe He Lys He Cys Arg Glu Leu His ßlu 40 45 50 AAT ßGG GAA CCT CAT CTC CAT GTG CTT GTT CAG TTC GAA GGA AAss TAC 247 Asn Gly Glu Pro His Leu His Val Leu Val Gln Phe Glu Gly Lys Tyr 55 60 65 CAG TGC ACG AAT AAC AGA TTC TTC GAC CTG GTC TCC CCA ACC CGG TCA 295 Gln Cys Thr Asn Asn Arg Phe Phe 'Aßp Leu Val Ser Pro Thr Arg Ser 70 75 80 GCA CAT TTC CAT CCG AAT ATT CAG GGA GCT AAA TCG AGC TCC GAC GTC 343
Ala His Phe His Pro Asn He Gln Gly Ala Lys Ser Ser As Asp Val 85 90 95 100 AAA TCG TAC ATC GAC AAG GAC GGA GAT ACA ATC GAA TGG GGA GAT TTC 391 Lys Ser Tyr He Asp Lys Aßp Gly Asp Thr He Glu Trp Gly Asp Phe 105 110 115 CAG ATC GAC GGC AGA TCT GCC AGA GGA GGC CAG TCG GCT AAT GAT 439 Gln He Asp Gly Arg Be Ala Arg ßly ßly Gln ßln Ser Ala Aßn Asp 120 125 130 TCA TAT ßCC AAA GCG TTA AAT GCA AGT TCG GTT CAA TCT GCC TTA GCA 487 Be Tyr Ala Lyß Ala Leu Aßn Ala Ser Ser Val Gln'Ser Ala Leu Ala 135 140 145 GTT CTA AGG GAA GAA CAA CCA AAA GAT TTT GTA TTA CAA AAT CAT AAC 535 Val Leu Arg Glu Glu ßln Pro Lys Asp Phe Val Leu Gln Asn His Aßn 150 155 160 ATC CGC TGT AAC CTA GAA CGA ATA TTC OCA AAG GCT CCG GAA CCG TGG 583 He Arg Ser Asn Leu Glu Arg He Phe Ala Lys Ala Pro ßlu Pro Trp 165 170 175 180 GTT CCT CCA TTT CAA GTC TCT TCT TTC ACT AAC GTT CCT GAC GAG ATG 631 Val Pro Pro Phe Gln Val Ser Ser Phe Thr Asn Val Pro Asp Glu Met 185 190 195 CAG GAA TGG GCß GAT AAT TAT TTC GGG ACG GGT GCA GCT GCG Cßß CCA 679 ßln ßlu Trp Wing Asp Asn Tyr Phe Gly Thr Gly Wing Wing Wing Arg Pro 200 205 210 GAG AQA CCT GTA AGT ATC ATC GTC GGT GAT TCA AGA ACÁ GGG CAC 727 ßlu Arg Pro Val Ser He He Val ßlu ßly Asp Ser Arg Thr ßly His 215 220 225
P1681 / 98MX ACG ATG TGG GCA CGT GCC TTA GGC CCA CAT AAC TAT CTC AGT GGA CAC 775 Thr Met Trp Wing Arg Wing Leu Gly Pro His Asn Tyr Leu Ser Gly His 230 235 240 CTA GAC TTC AAT GGT CGA GTC TTC TCG AAT GAT GTG CAG TAT AAC GTC 823 Leu Asp Phe Asn Gly Arg Val Phe Ser Asn Asp Val Gln Tyr Asn Val 245 250 255 260 ATT GAT GAC ATC CCA CCG CAT TAT CTA AAG CTA AAG CAC TGG AAA GAA 871 He Asp Asp He Ala Pro His Tyr Leu Lys Leu Lys His Trp Lys Glu 265 270 275 TTG CTA ßGG ßCC CAß AAA ßAT TGß CAA TCA AAT TGC AAG TAC CGT AAG 919 Leu Leu Gly Wing Gln Lys Asp Trp Gln Ser Aßn Cys Lys Tyr sly Lys 280 285 290 CCA OTT CAA ATT AAA GGC GGA ATC CCA GCA ATC GTG CTT TGC AAT CCT 967 Pro Val Gln He Lys Gly Gly He Pro Wing He Val Leu Cys Asn Pro 295 300 305 GGT GAG GGT GCC AGC TAT AAA GAG TTC TTA GAC AAA GCA GAA AAT ACÁ 1015 Gly Glu Gly Ala Ser Tyr Lys Glu Phe Leu Asp Lys Ala slu Asn Thr 310 315 320 GGT CTA AAG AAC TGG ACT GTC AAG AAT GCG ATC TTC ATC ACC CTC ACÁ 1063 Gly Leu Lyß Asn Trp Thr Val Lys As n Wing He Phe He Thr Leu Thr 325 330 335 340 GCC CCC CTC TAT CAA GAC AGC ACA CAO GCA AGC CAA GAA ACG GGC AAT 1111 Wing Pro Leu Tyr sln Asp Ser Thr Gln Wing Gln Glu Thr Gly Asn 345 350 355 CAG AAG GCG CAG GGT T GATCTACAGT GCGGGTGCTC CATCTACTTC 1157
Gln Lys Ala sln Gly 360 CACTTAGACT GT 1169
) INFORMATION FOR SEQ ID NO: 6: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 361 amino acids (B) TYPE: amino acids (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: protein (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 6:
1S81 / 98MX Met Pro Pro Pro Lys Lys Phe Arg Val Gln Ser Lys Asn Tyr Phe Leu 1 5? O 15
Thr Tyr Pro Gln Cys Ser Leu Ser Lys Glu Glu Ala Leu Ser Gln Leu 2 ° 25 3rd Gln Asn Leu Asn Thr Pro Val Asn Lys Lys Phe He Lys He Cys Arg 35 40 45 slu Leu His Glu Asn Gly Glu Pro His Leu His Val Leu Val sln Phe 50 55 60 Glu Gly Lyß Tyr Gln Cys Thr Asn Asn Arg Phe Phe Asp Leu Val Ser 65 70 75 80
Pro Thr Arg Ser Wing His Phe His Pro Asn He Gln ßly Wing Lyß Ser 85 90 95
Be Ser Asp Val Lys Ser Tyr He Asp Lys Asp sly Asp Thr He slu 100 105 110 Trp Gly Asp Phe Gln He Asp Gly Arg Be Wing Arg Gly ßly Gln Gln 115 120. 125 Ser Ala Asn Asp Ser Tyr Ala Lys Ala Leu Asn Ala Ser Ser Val Gln 130 135 140 'Ser Ala Leu Ala Val Leu Arg slu slu sln Pro Lys Asp Phe Val Leu 145 150 155 160
Gln Asn His Asn He Arg Ser Asn Leu Glu Arg He Phe Wing Lys Wing 165 170 175
Pro slu Pro Trp Val Pro Pro Phe sln Val Ser Ser Phe Thr Asn Val 180 185 190 Pro Asp slu Met sln Glu Trp Wing Asp Asn Tyr Phe Gly Thr Oly Wing 195 200 205 Wing Wing Arg Pro Glu Arg Pro Val Ser He He Val slu sly Asp Ser 210 215 220 Arg Thr sly His Thr Met Trp Wing Arg Wing Leu sly Pro His Asn Tyr 225 230 235 240
Leu Ser sly His Leu Asp Phe Asn sly Arg Val Phe Ser Asn Asp Val 245 250 255 ßln Tyr Asn Val He Asp Asp He Wing Pro His Tyr Leu Lys Leu Lys 260 265 270 His Trp Lys Glu Leu Leu Gly Wing Oln Lys Asp Trp Oln Ser Asn Cys 275 280 285
1 / 98MX Lys Tyr Gly Lys Pro Val Gln He Lys Gly Gly He Pro Wing He Val 290 295 300 Leu Cys Asn Pro sly Glu Gly Wing Ser Tyr Lys Glu Phe Leu Asp Lys 305 310 315 320
Wing Glu Asn Thr Gly Leu Lys Asn Trp Thr Val Lys Asn Wing He Phe 325 330 335. He Thr Leu Thr Wing Pro Leu Tyr Gln Asp Being Thr Gln Wing Being Gln 340 345 350 Glu Thr Gly Asn Gln Lys Wing Gln Gly 355 360
) INFORMATION FOR SEQ ID NO: 7: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1169 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: singular (D) TOPOLOGY: circular (ii) ) TYPE OF MOLECULE: DNA (genomic) (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: Geminivirus of the tomato spot (B) CEPA: Florida (ix) FEATURE: (A) ) NAME / KEY: CDS (B) LOCATION: 44..1127 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 7:
681 / 98MX GGATCCGAGT AACTCATCTG GA-TACCCCT TCTTATTACA AAA ATG CCC CCA CCA. 55 Met Pro Pro 1 AAG AAA TTT AGA GTT CAG TCA AAG AAC TAT TTC CTC ACT TAT CCA CAG 103 Lys Lys Phe Arg Val Gln Ser Lyß Asn Tyr Phe Leu Thr Tyr Pro Gln 5 10 15 20 TGC TCT TTG TCT AAA QAA GAA GCA CTT TCC CAA TTA CAA AAC CTA AAT 151 Cys Ser Leu Ser Lys Glu Glu Ala Leu Ser sln Leu Gln Asn Leu Asn 25 30 35 ACC CCA GTC AAT AAG AAA TTC ATC AAA ATT TGC AGA OAG CTT CAT GAA 199 Thr Pro Val Asn Lys Lys Phe He Lys He Cys Arg Olu Leu His Glu 40 45 50 AAT GGs GAA CCT CAT CTC CAT GTG CTT GTT CAG TTC GAA GGA AAG TAC 247 Asn Gly Glu Pro His Leu His Val Leu Val Gln Phe Glu Lymph Tyr 55 60 65 CAG TGC ACG AAT AAC AGA TTC TTC GAC CTG GTC TCC CCA ACC CGG TCA 2 5 Gln Cys Thr Asn Asn Arg Phe Phe Asp Leu Val Ser Pro Thr Arg Ser 70 75 80 s_A CAT TTC CAT ccs AAT ATT CAO s A scT AAA TCO AOC TCC GAC GTC 343 Wing His Phe His Pro Asn He sln Gly Wing Lys Ser Ser As Asp Val 85 90 95 100 AAA TCG TAC ATC GAC AAG GAC GGA GAT AC ATC GAA TGG GGA GAT TTC 391 Lys Ser Tyr He Asp Lys Asp Gly Asp Thr He slu Trp sly Asp Phe 105 110 115 CAO ATC GAC ssc AGA TCT OCC AGA GGA GGC CAO CAO TCT GCT AAT GAT 439 sln He Asp sly Arg Ser Ala Arg Gly Gly Gln sln Ser Wing Asn Asp 120 125 130 TCA TAT scs AAA scs TTA AAT OCA ACT TCC CTT CAA TCT scc TTA OCA 487
Be Tyr Ala Lys Ala Leu Aßn Ala Be Ser Val Gln Be Ala Leu Ala 135 140 145 GTT CTA AGG GAA GAA CAA CCA AAA GAT TTT GTA TTA CAA AAT CAT AAC 535 Val Leu Arg Glu slu Oln Pro Lys Aßp Phe Val Leu sln Aßn His Aßn 150 155 160 ATC CGC TCT AAC CTA GAA CGA ATA TTC OCA AAO OCT CCO GAA CCG TGG 583 He Arg Ser Asn Leu Glu Arg He Phe Ala Lys Ala Pro slu Pro Trp 165 170 175 180 GTT CCT CCA TTT CAA GTC TCT TCT TTC ACT AAC GTT CCT GAC GAG ATG 631 Val Pro Pro Phe Gln Val Ser Ser Phe Thr Asn Val Pro Asp Glu Met 185 190 195
P1681 / 98MX CAG GAA TGG OCO AAT TAT TTC sss ACO GGT GCA GCT GCG CGG CCA 679 Gln Glu Trp Wing Asp Asn Tyr Phe sly Thr Gly Wing Wing Wing Arg Pro 200 205 210 GAG AGA CCT GTA AGT ATC ATC GTC GAO GGT GAT TCA AGA ACÁ GGG AAG 727 slu Arg Pro Val Ser He He Val Glu sly Asp Ser Arg Thr Gly Lys 215 220 225 ACG ATG tso OCA c_t scs TTA sac CCA CAT AAC TAT CTC AGT GGA CAC 775
Thr Met Trp Wing Arg Wing Leu sly Pro His Asn Tyr Leu Ser Gly His 230 - 235 240 CTA OAC TTC AAT GOT COA GTC TTC TCG AAT OAT OTC CAG TAT AAC GTC 823 Leu Asp Phe Asn Gly Arg Val Phe Ser Asn Asp Val sln Tyr Asn Val 245 250 255 260 ATT AAA TAC ATC OCA CCG CAT TAT CTA AAß CTA AAO CAC TGß AAA GAA 871 He Lys Tyr He Wing Pro His Tyr Leu Lyß Leu Lys His Trp Lys slu 265 270 275 TTG CTA ßßß GCC CAG AAA GAT TGG CAA TCA AAT TGC AAG TAC GGT AAG 919 Leu Leu Gly Wing Sln Lys Asp Trp sln Ser Aßn Cys Lys Tyr sly Lys 280 285 290 CCA OTT CAA ATT AAA OOC OOA ATC CCA OCA ATC OTO CTT TCC AAT CCT 967 Pro Val sln He Lys ßly ßly He Pro Ala Wing Val Leu Cys Asn Pro 295 300 305 GGT GAO GGT GCC AGC TAT AAA GAG TTC TTA GAC AAA GCA GAA AAT ACÁ.1015 Gly Glu Gly Ala Ser Tyr Lys Glu Phe Leu Asp Lys Ala Glu Asn Thr 310 315 320 GGT CTA AAG AAC TGG ACT GTC AAG AAT GCG ATC TTC ATC ACC CTC ACA 1063 sly Leu Lys Asn Trp Thr Val Lys Asn Wing He Phe He Thr Leu Thr 325 330 335 340 OCC CCC CTC TAT CAA GAC AGC AC CAG GCA AGC CAA GAA ACG GGC AAT 1111 Wing Pro Leu Tyr Gln Asp Ser Thr Gln Wing Ser Gln slu Thr sly Aßn 345 350 355 CAO AAG GCs CAG QGT T GATCTACAGT GCGGGTGCTC CATCTACTTC 1157
Gln Lys Ala sln Gly 360 CACTTAGACT GT 1169
2) INFORMATION FOR SEQ ID NO: 8: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 361 amino acids
P1681 / 98MX (B) TYPE: amino acids (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: protein (xi) DESCRIPTION FOR SEQUENCE: SEQ. ID No: 8: Met Pro Pro Pro Lys Lys Phe Arg Val sln Ser Lys Asn Tyr Phe Leu 1 5 10 15
Thr Tyr Pro Gln Cys Ser Leu Ser Lys slu Olu Ala Leu Ser sln Leu 20 25 30 Gln Asn Leu Asn Thr Pro Val Asn Lys Lys Phe He Lys He Cys Arg 35 40 45 slu Leu His slu Asn sly slu Pro His Leu His Val Leu Val sln Phe 50 55 60 slu Oly Lys Tyr Oln Cys Thr Asn Asn Arg Phe Phe Asp Leu Val Ser 65 70 75 80
Pro Thr Arg Ser Wing His Phe His Pro Asn He sln sly Wing Lys Ser 85 90 95
Be Ser Asp Val Lys Ser Tyr He Asp Lys Asp sly Asp Thr He slu 100 105 110 Trp sly Asp Phe Gln He Asp Gly Arg Be Wing Arg Gly Gly sln Gln 115 120 125 Be Ala Asn Asp Ser Tyr Ala Lys Ala Leu Aen Ala Ser Ser Val Gln 130 135 140 Ser Ala Leu Ala Val Leu Arg Glu Glu Gln Pro Lys Asp Phe Val Leu
145 150 155 160
Gln Asn His Asn He Arg Ser Asn Leu Glu Arg He Phe Wing Lys Wing .165 170 175 Pro slu Pro Trp Val Pro Pro Phe Gln Val Ser Ser Phe Thr Asn Val 180 185 190
P1681 / 98MX Pro Asp Glu Met Gln Glu Trp Wing Asp Asn Tyr Phe Gly Thr Gly Wing 195 200 205 Wing Wing Arg Pro Glu Arg Pro Val Ser He He Val Glu sly Asp Ser 210 215 220 Arg Thr Gly Lys Thr Met Trp Wing Arg Ala Leu Gly Pro His Asn Tyr 225 230 235 240
Leu Ser Gly His Leu Asp Phe Asn Gly Arg Val Phe Ser Asn Asp Val 245 25C 255 sln Tyr Asn Val He Lys Tyr He Wing Pro His Tyr Leu Lys Leu Lys 260 265 270 His Trp Lys Glu Leu Leu Gly Ala sln Lys Asp Trp Gln Ser Asn Cys 275 280 285 Lys Tyr Gly Lys Pro Val Gln He Lys Gly Gly He Pro Wing He Val 290 295 300 Leu Cys Asn Pro Gly Glu Gly Wing Ser Tyr Lys Glu Phe Leu Asp Lys 305 310 315 320
Wing Glu Asn Thr Gly Leu Lys Asn Trp Thr Val Lys Asn Wing He Phe 325 330 335
He Thr Leu Thr Ala Pro Leu Tyr sln Asp Ser Thr Gln Ala Ser sln * 340 345 350 slu Thr Gly Asn Gln Lys Wing Gln Gly 355 360 2) INFORMATION FOR SEQ ID NO: 9: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 31 base pairs (B) TYPE: nucleic acid (C) CHAIN TYPE: simple (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: other nucleic acid
P1681 / 98MX (A) DESCRIPTION: / desc. = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: FL2549B (xi) DESCRIPTION FOR SEQUENCE: SEQ. ID No: 9: TATCGGATCC GAGTAACTCA TCTGGAGTAC C 31
2) INFORMATION FOR SEQ ID NO: 10: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 31 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear ( ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: FL1108B (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 10: TATCGGATCC GGAAGTAGAT GGAGCACCCG C 31
2) INFORMATION FOR SEQ ID NO: 11:
P1681 / 98MX (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) CHAIN TYPE: simple (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: other acid nucleic (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: PFAC1680 (xi) DESCRIPTION FOR SEQUENCE: SEQ. ID No: 11: CAAGAACAGG GCACACGATG TGGG 24
2) INFORMATION FOR SEQ ID NO: 12: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 29 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear ( ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO
P1681 / 98MX (vi) ORIGINAL SOURCE (A) ORGANISM: PFAC1781 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 12: GTATAACGTC ATTAAATACA TCßCACCGC '29
2) INFORMATION FOR SEQ ID NO: 13: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1166 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: circular ( ii) TYPE OF MOLECULE: DNA (genomic) (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: Geminivirus of the tomato spot '(C) INDIVIDUAL ISOLATED: Florida (ix) CHARACTERISTICS : (A) NAME / KEY: CDS (B) LOCATION: 44..436 (xi) DESCRI PTION FOR THE SEQUENCE: SEQ. ID No: 13:
GGATCCGAGT AACTCATCTG GAGTACCCCT TCTTATTACA AAA ATG CCC CCA CCA 55 Met Pro Pro Pro 1
P1681 / 98MX AAG AAA TTT AGA GTT CAG TCA AAG AAC TAT TTC CTC ACT TAT CCA CAG 103 Lys Lyß Phe Arg Val Gln Ser Lys Aßn Tyr Phe Leu Thr Tyr Pro Gln 5 10. 15 20 TGC TCT TTG TCT AAA GAA GAA GCA CTT TCC CAA TTA CAA AAC CTA AAT 151 Cys Ser Leu Ser Lys Glu Glu Ala Leu Ser Gln Leu ßln Aßn Leu Aßp 25 30 35 ACC CCA ßTC AAT AA ß AAA TTC ATC AAA ATT TGC AGA ßAß CTT CAT GAA 199 Thr Pro Val Asn Lys Lyß Phe He Lyß He Cys Arg ßlu Leu His Glu 40 45 50 AAT GGG GAA CCT CAT CTC CAT GTß CTT GTT CAß TTC GAA GGA AAß TAC 247 Aßn Gly Glu Pro Hiß Leu His Val Leu Val Gln Phe Glu Gly Lys Tyr 55 60 65 CAG TGC ACG AAT AAC AGA TTC TTC GAC CTG GTC TCC CCA ACC CGG TCA 295 Gln Cys Thr Asn Asn Arg Phe Phe 'Asp Leu Val Ser Pro Thr Arg Ser 70 75 t 80 GCA CAT TTC CAT CCG AAT ATT CAG GGA GCT AAA TCG ACC TCC GAC GTC 3 3 Wing His Phe His Pro Aßn He Gln Gly Wing Lys Ser Ser As Asp Val 85 90 95 100 AAA TCG TAC ATC GAC AAG GAC GGA GAT ATAT GAA TGß GGA GAT TTC 391 Lys Ser Tyr He Asp Lys Asp Gly Asp Thr He Glu Trp sly? Sp Phe 105 110 115 CAG ATC GAC GGC AGA TCG ATC TGC CAG AGG AGG CCA GCA GTC TGC 436
Gln He Asp Gly Arg Ser He Cys Gln Arg Arg Pro Wing Val Cys 120 125 130 TAATGATTCA TATGCGAAAG CGTTAAATGC AAGTTCGGTT CAATCTGCCT TAGCAGTTCT 496
AAGGGAAßAA CAACCAAAAG ATTTTGTATT ACAAAATCAT AACATCCGCT CTAACCTAGA 556
ACGAATATTC GCAAAGGCTC CGGAACCGTG GGTTCCTCCA TTTCAAGTCT CTTCTTTCAC 616
TAACGTTCCT GACGAGATGC AGGAATGGGC GGATAATTAT TTCGGGACGG GTGCAGCTGC 676
GCGGCCAGAO-AGACCTGTAA GTATCATCGT CGAGGGTGAT TCAAGAACAG GGAAGACGAT 736
GTGGGCACGT GCGTTAGGCC CACATAACTA TCTCAGTGGA CACCTAGACT TCAATGGTCG 796
AGTCTTCTCG AATGATGTGC AGTATAACGT "CATTGATGAC ATCGCACCGC ATTATCTAAA 856
GCTAAAGCAC TGGAAAGAAT TGCTAGGGGC CCAGAAAGAT TGGCAATCAA ATTGCAAGTA 916
CGGTAAGCCA GTTCAAATTA AAGGCGGAAT CCCAGCAATC GTGCTTTGCA ATCCTGGTGA 976
GGGTGCCAGC TATAAAGAGT TCTTAGACAA AGCAGAAAAT ACAGGTCTAA AGAACTGGAC 1036
TGTCAAGAAT GCGATCTTCA TCACCCTCAC AGCCCCCCTC TATCAAGACA GCACACAGGC 1096
AAGCCAAGAA ACGGGCAATC AGAAGGCGCA GGGTTGATCT ACAGTGCGGG TGCTCCATCT 1156 ACTTCCTAGG 1 66
) INFORMATION FOR SEQ ID NO: 14: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 131 amino acids (B) TYPE: amino acids (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: protein (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 14:
Met Pro Pro Pro Lys Lys Phe Arg Val Gln Ser Lys Asn Tyr Phe Leu 1 5 10 15
Thr Tyr Pro sln Cys Ser Leu Ser Lys slu slu Ala Leu Ser sln Leu 20 25 30 sln Asn Leu Asn Thr Pro Val Asn Lys Lys Phe He Lys He Cys Arg 35 40 45 Olu Leu His slu Asn sly slu Pro His Leu His Val Leu Val sln Phe
50 55 60 slu sly Lys Tyr sln Cys Thr Asn Asn Arg Phe Phe Asp Leu Val Ser 65 70 75 80
Pro Thr Arg Ser Wing His Phe His Pro Asp He sln Gly Wing Lys Ser 85 90 95
Be Ser Asp Val Lys Ser Tyr He Asp Lys Asp Gly Asp Thr He Glu 100 105 110 Trp sly Asp Phe sln He Asp sly Arg Ser He Cys sln Arg Arg Pro 115 120 125 Wing Val Cys 130) INFORMATION FOR SEQ ID NO: 15: (i) CHARACTERISTICS OF THE SEQUENCE:
681/98 X (A) LENGTH: 1246 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: circular (ii) TYPE OF MOLECULE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: Geminivirus of tomato spot (B) CEPA: Florida (x) PUBLICATION INFORMATION: (A) AUTHORS: Gilbertson, RL Hidayat, SH Paplomatas, EJ Roj as, MR Hou, YM Maxwell, DP (B) TITLE: Pseudorecombination between DNA components, cloned, infectious of dwarf bean mosaic geminiviruses and tomato spot (C) GACETA: Jour. General Virol. (D) VOLUME: 74 (F) PAGES: 23-31 (G) DATE: 1993 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 15:
MX AGATCTGCCA GAGGAGGCCA GCAGTCTGCT AATGATTCAT ATGCsAAAGC GTTAAATGCA 60
AGTTCGGTTC AATCTGCCTT AsCAGTTCTA AGGGAAGAAC AACCAAAAGA TTTTGTATTA 120
CAAAATCATA ACATCCGCTC TAACCTAGAA CGAATATTCG CAAAGGCTCC GsAACCsTGs 1B0
GTTCCTCCAT TTCAAGTCTC TTCTTTCACT AACGTTCCTG ACGAGATGCA GGAATGssCG 240
GATAATTATT TCGGGACGGG TGCAGCTCCG CGGCCAGAGA GACCTsTAAs TATCATCGTC 300
GAGGGTGATT CAAGAACAGG GAAGACGATG TGGGCACGTs CGTTAGGCCC ACATAACTAT 360
CTCAGTGGAC ACCTAGACTT CAATGßTCGA GTCTTCTCGA ATGATGTGCA GTATAACGTC 420
ATTGATGACA TCCCACCGCA TTATCTAAAG CTAAAsCACT GGAAAsAATT OCTAOOOGCC 80
CAGAAAGATT GGCAATCAAA TTGCAAGTAC GGTAAGCCAG TTCAAATTAA AGGCsGAATC 540
CCAGCAATCG TßCTTTßCAA TCCTCßTGAG GGTGCCAGCT ATAAAGAGTT CTTAGACAAA 600
GCAGAAAATA CAGGTCTAAA QAACTssACT OTCAACAATO CGATCTTCAT CACCCTCACA 660
GCCCCCCTCT ATCAAGACAG CACACAssCA AGCCAAGAAA CGGßCAATCA GAAGGCGCAG 720
GGTTGATCTA CAQTßCßGGT GCTCCATCTA CTTCCACTTA ßACTGTGCGG GACATGGATT 780
CACGCACACG GGAACTCATC ACTGCACATC AGßCßßAßAA TGGCGTßTAT ATCTGGßAßC 8 0
TAAAAAATCC CCTTTATTTC AAGATACACA GGGTAGAßßA ACCACTGTAT ACCAßAACGA 900
GGGTATACCA COTACAGATA CGGTTCAACC ACAACCTßAG GAAAsCßTTs CATCTCCACA 960
AAGCCTACCT GAACTTCCAA GTTTGGACsA CsTGGATsAC AGCTTCTssA TCAATTTATT 1020
TAGCTAGATT TAGATATTTA GTCAACATGT ATCTAGATCA ATTAGGTGTT ATTTCAATAA 1080
ACAATGTAAT TAGAGCTGTA CGTTTCGCAA CAAACAGAGT GTATGTAAAT CATGTATTGG 1140
AGAATCATTC AATAAAATTC AAATTTTATT AATTCATGAT CGAATCATAA AAATAGATTC 1200
GAATTTTCAA AGTTGCATAT ACACGGTTAG ACGCATGAßT GCATGC 1246
2) INFORMATION FOR SEQ ID NO: 16: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 32 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple
P1681 / 98 X (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (vi) ORIGINAL SOURCE: (A) ORGANISM: FL-2549H (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 16: TATCAAAsCT TOAOTAACTC ATCTG_AsTA CC 32
2) INFORMATION FOR SEQ ID NO: 17: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 2602 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: circular ( ii) TYPE OF MOLECULE: DNA (genomic) (iii) HYPOTHETIC: NO (vi) ORIGINAL SOURCE: (A) ORGANISM: Geminivirus of the tomato spot (B) CEPA: Florida (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 17:
P1681 / 98MX GGGGCATTTT TGTAATAAGA AGGGGTACTC CAGATGAGTT ACTCCAATTG AGCCTTCTCA 60
AACTTGCTCA TTCAATTGGA GTATTAGAGT AACTTATATA TAAGAACCCT CTATAGAACT 120
ATTAATCTGß TTCATACACG TGGCGGCCAT CCGATATAAT ATTACCGGAT GGCCGCGCGC 180
TTTTTTTTTAA TCCGTACAGT CCAATACTCT CACATCCAAT CATAATGCGT CGTACAAGCC 240
TATATATTTC CAACAACTTG GßCCTTAAGT TGTTGGAGsC CCATTATAAA TTAAAGTGAT 300
CTTGGCCCAA TGTCTTTAAC TCAAAATGCC TAAGCGTGAT TTGCCATGGC GATCGATGGC 360
GGGAACCTCA AAGGTTAGCC GCAATGCTAA TTATTCTCCT CGTGCAGGTA TTAGGCCAAG 420
AATTAACAAG CCCCCTOAAT G-GTOAATCG GCCCATGTAT AGGAAsCCCA GGATCTATCG 480
GACTCTTAGT ACAACTGACG TGCCCACßßO CTGTGAAGGC CCATßTAAßG TCCAßTCTTT 540
COAACACCßC CATGACATCT CACATATCGG TAAGOTCATC TGCATATCCC ATOTOACACO 600
TGGTAATGGC ATAACCCACC sTGTTssTAA GCGTTTCTGT GTTAAGTCTG TGTATATCCT 660
TGGTAAGATT TGGATGOATs AGAACATCAA GCTCAAGAAC CACACGAATA ßTGTCATGTT 720
CTGGTTGGTC AGAGATCGTA GACCCTATGG TACTCCAATß GATTTTGGAC AGGTGTTCAA 780
CATGTTCGAT AACGAGCCTA sCACTGCTAC TGTCAAAAAC GATCTACsCs ATCGTTACCA 840
GGTCATGCAT AAGTTCTATG GCAAGGTGAC AGGTGGACAG TATGCCAQCA ACGAGCAOGC 900
TATAOTTAAG AGsTTCTGGA AGOTOAACAA TCATGTAGTC TATAATCATC AAGAsGCTGG 960
CAAGTACGAG AATCACACAG AGAACGCCTT GTTATTsTAT ATGGCATsCA CTCATGCGTC 1020
TAACCCTGTA TATGCAACTT TsAAAATTCs AATCTATTTT TATßATTCGA TCATGAATTA 1080
ATAAAATTTG AATTTTATTG AATGATTCTC CAATACATGA TTTACATACA CTCTGTTTGT 1140
TGCGAAACGT ACAGCTCTAA TTACATTGTT TATTGAAATA ACACCTAATT GATCTAGATA 1200
CATGTTGACT AAATATCTAA ATCTACCTAA ATAAATTGAT CCACAAGCTG TCATCCACGT 1260
CGTCCAAACT TGGAAGTTCA ssTAGGCTTT OTGGAGATGC AACGCTTTCC TCAGGTTGTG 1320
GTTGAACCGT ATCTGTACGT GGTATACCCT CGTTCTGGTA TACACTßßTT CCTCTACCCT 1380 ßTßTATCTTß AAATAAAGGG GATTTTTTAG CTCCCAGATA TACACCCCAT TCTCCCCCTG 1440
ATGTGCAGTG ATGAGTTCCC CTGTGCGTGA ATCCATGTCC CGCACAGTCT AAGTGGAAßT 1500
AGATGGAGCA CCCßCACTCT AßATCAACCC TGCGCCTTCT GATTGCCCGT TTCTTGßCTT 1560
P1681 / 98MX GCCTGTGTGC TGTCTTGATA GAGGGGGGCT GTGAGGGTGA TGAAGATCGC ATTCTTGACA 1620
GTCCAGTTCT TTAGACCTGT ATTTTCTGCT TTGTCTAAGA ACTCTTTATA GCTGGCACCC 1680
TCACCAGGAT TGCAAAGCAC GATTGCTGGG ATTCCGCCTT TAATTTGAAC TGGCTTACCG 1740
TACTTGCAAT TTGATTGCCA ATCTTTCTGG GCCCCTAGCA ATTCTTTCCA GTGCTTTAGC 1800
TTTAGATAAT GCGGTGCGAT GTCATCAATG ACßTTATACT ßCACATCATT CßAGAAGACT 1860
CGACCATTGA AGTCTAGGTG TCCACTßAGA TAOTTATßTO OOCCTAACGC ACGTGCCCAC 1920
ATCGTCTTCC CTGTTCTTGA ATCACCCTCG ACGATGATAC TTACAGGTCT CTCTGGCCGC 1980 aCAGCTGCAC CCGTCCCGAA ATAATTATCC GCCCATTCCT GCATCTCGTC AGGAACGTTA 2040 sTGAAAGAAs AGACTTGAAA TGGAGGAACC CACGGTTCCG GAGCCTTTGC GAATATTCßT 2100
TCTAGßTTAO AGCGGATGTT ATGATTTTGT AATACAAAAT CTTTTGßTTG TTCTTCCCTT 2160
AGAACTGCTA AGGCAGATTG AACCGAACTT GCATTTAACG CTTTCGCATA TGAATCATTA 2220
GCAGACTßCT GGCCTCCTCT ssCAGATCTs CCCTCGATCT sGAAATCTCC CCATTCGATT 2280
GTATCTCCGT CCTTOTCGAT sTACGATTTG ACGTCGGAGC TCGATTTAßC TCCCTCAATA 2340
TTCGßATGGA AATGTGCTGA CCGGßTTGGG GAGACCAGßT CGAAGAATCT GTTATTCGTG 2400
CACTGsTACT TTCCTTCGAA CTGAACAAGC- ACATGGAGAT GAGGTTCCCC ATTTTCATGA 2460
AGCTCTCTGC AAATTTTGAT sAATTTCTTA TTGACTssGG TATTTAGGTT TTGTAATTGG 2520 sAAAGTGCTT CTTCTTTAGA CAAAGAGCAC TGTGGATAAG TGAGGAAATA GTTCTTTGAC 2580
TGAACTCTAA ATTTCTTTGG TG 2602
2) INFORMATION FOR SEQ ID NO: 18: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 2540 base pairs (B) TYPE: nucleic acid (C) CHAIN TYPE: simple (D) TOPOLOGY: circular ( ii) TYPE OF MOLECULE: DNA (genomic)
P1681 / 98MX (iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (A) ORGANISM: Geminivirus of tomato spot (B) CEPA: Florida (xi) DESCRIPTION FOR SEQUENCE: SEQ. ID No: 18:
GTßGCATTTT TGTAATAAGA AGßGGTACTC CAGATGAGTT ACTCCAATTG AGCCTTCTCA 60
AACTTGCTCA TTCAATTGGA GTATTAGAGT AACTTATATA TAAGAACCCT CTATAGAATT 120
ATTAATCTGG TTCATACACG TGGCGGCCAT CCGATATAAT ATTACCGGAT GGCCGCGCCC 180
CCCCCCCTTT TATACGCGCs CCTCTTTGGT CßTATTTCCA CGCTTCTTCC TGTTGGTGCG 240
TATCCTTCAC TTCCCATCTT TTTGAGTAßC CTTTAATTTG AATTAAAGGT TAAAACTTTA 300
TCGCGATGAC TAATCATATC ACATTGACCA TGTGAAGGAC GTGGCATTAT TTCGACCATG 360
CTGCTGAGTT TATTTGCTAT TATTGTTCTA TCCATAATCT ATATATTGGA TTGGTCAGGA 420
ATATTTTGTT TATCCAACTC AGCTGCATAC CACGTTTATA TCGTTAGCTA AAT1TTGATT 480
AATCTTAGTT AAGTGTTTGA CTATGTATCC TTTAAAGTGT AAACGTGGTT TATCATATTC 540
AAATCGAAGA TTTAACTCAC GTAATAATGT GTTTAACCGT CCAGTTTCTG ßTAAGAGACA 600
TGATssAAAG CGTCGGGGAG GTAATTTCGT GAAGCCCAAT GATGAGCCCA AGATGTTAGC 660
CCAACGCATA CATGAGAATC AGTATGGGCC TGAATTTGTA TTGGCCCATA ACTCAGCTAT 720
CTCCACATTT ATCAGTTATC CCATCTTGGG CAAGTCCGAA GCCAGTCGAA GTAßGTCCTA 780
TATCAAGTTG AAACGTCTTC ßTTTCAAAßß GACTGTßAAß ATTQAGCGTG TTCAATCTGA 840
TTTGAACATG GATGGCTTTA TGCCTAAAßT CGAAGßAßTA TTCTCTATGG TTGTTßTTGT 900
GßATCGTAAA CCACACTTGG ßTCCCTCCGG GTGTTTGCAT ACATTCGACG AGCTATTTGG 960
TGCAAGGATC AATAGTCATG GCAACCTCAC TATAGTACCT TCTCTGAAAG ACCGCTTCTA 1020
CATTAGACAT GTGTTCAAGC GAGTGCTCTC AGTTGAGAAG GATACGTTGA TGGTGGACGT 1080
TGAAGGATCC ACAACACTCT CTAACAGsCG TTACAACT-C TGsTCTACGT TTAAAGACCT 1140
P1681 / 98MX TGATCGTGAA TCATGCAAGG GTGTTTATGA TAACATTAGC AAGAACGCCT TßTTAßTTTA 1200
TTATTGCTGG ATGTCTßACA CGCCTGCGAA TGCATCCTCT TTTGTATCTT TTGATCTTGA 1260
TTATATTGGT TAACTTAACG AAGTGTGTTT GTCTAAAGAT GATTAAAAAA ATGAAAATGT 1320
AAAAATAAAA TTTTATTTTA ATGGTTTCGT CTGAGACGCC TTACAATTAC TATTAATACA 1380
TTCATGGACC GTAGTCCGTA TTAATTCATT CAACTGTCCC ATAGACATTG TAATGTTGGA 1440
CTCTGTTCTC TGGsCCCCCA CAATAGAAGC AGACTCTCCC GGGTCCAGTA TGCCTGTTCC 1500
TAGCCTGTTT AGATGTCTGT ACGGGTGGAG TTCGTTCTCC ACATCTGAGT CCGCATCTGA 1560
ATGCCCTATG CCTATTGTAC TCCTTGAAGC CCATGACTCA CCAGQCCTGA TCTCAATTGG 1620
ACCTCTAAGß CCAAGTCTGG ACATGGACGC GCATCTAATG GGCTTCCTCT CCCATTTACC 1680
GTAATCCACA TGGGAAAAGT CCACATCTTT ATCTGTGAAC TQTTTOGACA GGATTTTTAC 1740
TGTTGGTGCC CGGAAGGGGA TGTCTACTGA GTGTTTTGCT QTQGACAATT TCAGCTTCCC 1800
CTTAAACTTG GCGAAGTGGG TCCGTTGATG AACATTCGTA TCGCAAACCC TGTAATACAA 1860
TTTCCATGGA ATTGGGTCTT TCAAGGAGAA GAAGGAAGCT GAGAAATAGT GGAGATCTAT 1920
GTTGCACCTG ATCGsAAATG TCCATGATGC CTGTAAAGAC TCATTCTCCQ T € ATTCTTTT 1980
GTCGTGAATC TCCACTATTA CCGACCCAGT GGCGTTTATT GGTACTTGTT GTCTGTACTC 2040
TATGACACAG TGGTCGATTT TCATGCAGCT ACGGCTGAGC CTAGCGGTTA ACTGCGACGC 2100 CGTGGACGGA AATTGCAGTA TTATTTCAGT TAGGTCATGA GAAAGCTGAT ATTCGTCACG 2160
GTGTGCATCT ATGTAGTTGA ATGCGCTAGG AOGATTAACT AACTGAGAAT CCATATGAAG 2220
AAAATAAGGC CGCGCAGGAC TGATTGCTGA AGTTGAATCA GAAAGAAGTC GAACAAGCTA 2280
TQAAACßßCA GTTTCQAACT CGAAGAAGAA AGACAGCCAA CTATATTTTC TTTTTTCCAA 2340
GAATTCAGCT GTGCTGAATA TAAAGTTTAT GAAGAGCGGA AATGAAAAAA AGTATATCAG 2400
GATTCGAAGT GTTTGAsAAA GAAAAGAAAT ATGAAAGAsA ATTTTGGAGA AATTTGAGTA 2460
AGAAGGAATT TGTATATGAA CTAAGAAACC TAGGGTTGAT GßGTATTTAA ATTGGTAAAG 2520
TGTTCATCCC ATGAGATAGA 2540
2) INFORMATION FOR SEQ I D NO: 1 9:
P1681 / 98MX (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1145 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: circular (ii) TYPE OF MOLECULE: DNA ( genomic) (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (v) TYPE OF FRAGMENT: N-terminal (vi) ORIGINAL SOURCE (A) ORGANISM: Geminivirus of the tomato spot (B) CEPA: Israel (ix) FEATURE : (A) NAME / KEY: CDS (B) LOCATION: 37..1107 (x) PUBLICATION INFORMATION: (A) AUTHORS: Navot, N Pichersky, R Zeidan, D Zamir, D Czosnek (B) TITLE: Virus wrinkling of the tomato yellow leaf: A geminivirus transmitted by the whitefly with an individual genomic component
X (C) GACETA: Virology (D) VOLUME: 185 (F) PAGES: 151-168 (G) DATE: 1991 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 19:
GCCATAGAGC TTTGAGGGAT CCCGATTCAT TTCAAC ATG CCT CGT TTA TTT AAA 54 Met Pro Arg Leu Phe Lys l 5 ATA TAT GCC AAA AAT TAT TTC CTA ACAT TAT CCC AAT TGT TCT CTC TCT 102 He Tyr Ala Lyß Asn Tyr Phe Leu Thr Tyr Pro Asn Cys Ser Leu Ser 10 15 20 AAA GAG GAA GCA CTT TCC CAA TTA AAA AAA CTA GAA ACC CCA ACA AAT 150 Lyß Glu Qlu Ala Leu Ser Gln Leu Lyß Lys Leu Glu Thr Pro Thr Asn 25 30 35 AAA AAA TAC ATC AAA GTT TGC AAA GAA CTC CAC GAG AAT GGG GAA CCA 198 Lys Lys Tyr He Lys Val Cys Lys Glu Leu His slu Aßn sly slu Pro 40 45 50 CAT CTC CAT GTG CTT ATC CAA TTC GAA GGC AAA TAC CAA TGT AAG AAC 246 His Leu His Val Leu He Gln Phe Glu Gly Lys Tyr Gln Cys Lys Aβn 55 60 65 70 CAA CGG TTC TAC GAC TTG QTA TCC CC AAC AQG TCA GCA CAT TTC CAT 294 G n Arg Phe Phe Asp Leu Val Ser Pro Asn Arg Ser Wing His Phe His 75 80 85 CCG AAC ATT CAG OCA QCT AAQ AGC TCA ACA GAT GTC AAG ACC TAC GTG 3 2
Pro Asn He Gln Wing Wing Lys Ser Ser Thr Aßp Val Lyß Thr Tyr Val 90 95 100 GAG AAA GAC GGA AAC TTC ATT GAT TTT GGA GTT TCC CAA ATC GAT GGC 390 Glu Lyß Asp Gly Aßn Phe He Asp Phe Gly Val Ser Gln He Aßp Gly 105 110 115 AGA TCA GCT AGA GGA GGT CAG CAA TCT QCC AAC QAC QCA TAT OCC GAA 438 Arg Ser Wing Arg Gly Qly Qln Gln Ser Wing Aßn Aßp Wing Tyr Wing Glu 120 125 130 GCA CTC AAT TCA G_C AOT ATA TCC OAG GCC CTC AAT ATA TTA AAA GAO 486 Wing Leu Aßn Ser Gly Ser He Ser Glu Wing Leu Aßn He Leu Lyß Glu 135 - 140 145 150 AAG GCC CCA AAG GAC TAT ATT TTA CAA TTT CAT AAT TTA AOT TCA AAT 534 Lys Wing Pro Lys Aßp Tyr He Leu Gln Phe His Aßn Leu Ser Ser Aßn 155 160 165
P1681 / 98MX TTA GAT AGs ATT TTT AOT CCT CCT TTA GAA GTT TAT OTT TCT CCA TTT 582 Leu Asp Arg lie Phe Ser Pro Pro Leu Glu Val Tyr Val Ser Pro Phe 170 175 180 CTT TCT TCT TCT TTT AAT CAA GTT CCA GAT GAA CTT GAA GAG TGG GTC 630 Leu Ser Be Ser Phe Asn Gln Val Pro Aßp Glu Leu Glu Glu Trp Val 185 190 195 GCC GAG AAC GTC GTG TAT TCC GCT GCQ CGG CCA AGO ATA CCC AGT 678
Wing Glu Asn Val Val Tyr Wing Wing Arg Pro Trp Arg Pro He Ser 200 205 210 ATT GTC ATT GAG GGT GAT AGC AGA ACA GGC AAA ACA ATG TGG GCC AGG 726 He Val He Glu Gly Asp Ser Arg Thr Gly Lyß Thr Met Trp Wing Arg 215 220 225 230 TCT CTA ssc CCA CAT AAT TAT TTA TGT G_A CAT CTA GAC CTA AGC CCA 774 Ser Leu Oly Pro Hiß Asn Tyr Leu Cys Gly His Leu Asp Leu Ser Pro 235 240 245 AAß OTO TAC AGT AAT ßAT GCG TGG TAC AAC OTC ATT OAT GAC AAC 822 Lys Val Tyr Ser Asn Asp Wing Trp Tyr Aßn Val He Aßp Aßp Val Aßn 250 255 • 260 CCG CAT TAT TTA AAO CAC TTC AAO GAA TTC ATT TGG GCC CAO AGO OAC 870 Pro His Tyr Leu Lyß His Phe Lys ßlu Phe He Trp Ala ßln Arg Asp 265 270 275 TGG CAA AGC AAC ACA AAG TAC GGG AAG CCC ATT CAA ATT AAA GGG GGA 918 Trp ßln Ser Asn Thr Lys Tyr ßly Lys Pro He ßln He Lys ßly ßly 280 285 290 ATT CCC ACT ATC TTC CTC TCC AAT CCA COA CCT ACC TCC TCA TAT AGG 966 He Pro Thr He Phe Leu Cys Asn Pro Gly Pro Thr Ser Ser Tyr Arg 295 300 305 310 ßAA TAT CTA GAC GAA GAA AA A AAC ATA TCC TTG AAA AAT TCG ßCT CTC 1014 ßlu Tyr Leu Asp Glu ßlu Lys Asn He Ser Leu Lys Asn Trp Wing Leu 315 320 325 AAG AAT GCA ACC TTC GTC ACC CTC TAC GAQ CCA CTG TTC GCA AGT ATC 1062 Lys Asn Wing Thr Phe Val Thr Leu Tyr slu Pro Leu Phe Wing Ser He 330 335 340 AAT CAA GGT CCA ACA CAA ßAT AßC CAA GAA ßAA ACC AAT AAß GCG 1107
Asn sln Gly Pro Thr Gln Asp Ser Gln Glu Glu Thr Asn Lyß Ala 345 345 355 TAAGCCTßTA GACCTAßACT GTGsCTßCTC ATACTACC 1145
2) INFORMATION FOR SEQ ID NO: 20: P1S81 / 98MX (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 357 amino acids (B) TYPE: amino acids (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: protein ( xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 20: Met Pro Arg Leu Phe Lys He Tyr Wing Lys Asn Tyr Phe Leu Thr Tyr 1 5 10 15 Pro Asn Cys Ssx Leu Ser Lys Glu Glu Wing Leu Ser Gl Leu Lys Lys 20 25 30 Leu Glu Thr Pro Thr Asn Lys Lys Tyr He Lys Val Cys Lys Glu Leu 35 40 45 His ßlu Asn ßly Glu Pro His Leu His Val Leu Xle Gln Phe Glu Gly 50 55 60 Lys Tyr Gln Cys Lys Asn Gln Arg Phe Phe Asp Leu Val Ser Pro Asn 65 70 75 80
Arg Ser Ala His Phe His Pro Aßn Xle Gln Ala Ala Lys Ser Ser Thr 85 90 95
Asp Val Lys Thr Tyr Val Glu Lys Asp Gly Asn Phe lie Asp Phe Qly 100 105 lio Val Ser Gln He Asp Gly Arg Ser Wing Arg Gly Gln Gln Gln Ser Wing 115 120 125 Asn Asp Ala Tyr Ala Glu Ala Leu Asn Ser Gly Ser lie Ser Glu Wing 130 135 140 Leu Aßn He Leu Lys Glu Lys Wing Pro Lys Asp Tyr He Leu Gln Phe 145 150 155 160
His Asn Leu Being Ser Asn Leu Aßp Arg Xle Phe Ser Pro Pro Leu Glu 165 170 175
Val Tyr Val Pro Pro Phe Leu Ser Ser Phe Asn Gln Val Pro Asp 180 1T5 190 Glu Leu Glu Glu Trp Val Wing Glu Asn Val Val Tyr Ser Ala Wing Arg 195 200 205 Pro Trp Arg Pro lie Ser lie Val lie Glu Gly Asp Being Arg Thr Gly 210 215 220
P1681 / 98 X Lys Thr Met Trp Wing Arg Ser Leu Gly Pro His Asn Tyr Leu Cya Oly 225 ^ 230 235 240
His Leu Asp Leu Ser Pro Lys Val Tyr Ser Asn Aßp Wing Trp Tyr Asn 245 250 255
Val He Asp Asp Val Asn Pro His Tyr Leu Lyß His Phe Lys Glu Plus 260 265 270 lie Trp Wing Gln Arg Asp Trp Gln Ser Aßn Thr Lys Tyr Gly Lys Pro 275 280 285 He Gln He Lys Gly Gly He Pro Thr He Phe Leu Cys Asn Pro Gly 290 295 300 Pro Thr Ser Ser Tyr Arg Glu Tyr Leu Aßp Glu Glu Lys Asn lie Ser 305 310 315 320
Leu Lys Asn Trp Wing Leu Lys Asn Wing Thr Phe Val Thr Leu Tyr slu 325 330 335
Pro Leu Phe Ala Ser He A shan Sln Oly Pro Thr Gln Asp Ser sln slu 340 345 350 Qlu Thr Asn Lys Ala 355 2) INFORMATION FOR SEQ ID NO: 21: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 base pairs (B) TYPE: nucleic acid (C) CHAIN TYPE: simple (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETICAL: NO (iv) ANTICIPATION: NO (vi) ORIGINAL SOURCE
P1681 / 98MX (A) ORGANISM: PTYIRC4 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 21: GCCATAOAsC TTTGAGssAT CCCGATTCAT TTC 33
) INFORMATION FOR SEQ ID NO: 22: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 39 base pairs (B) TYPE: nucleic acid (C) CHAIN TYPE: simple (D) TOPOLOGY: linear (ii) ) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: (A) ORGANISM: PTYC1V1679 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 22: GOTAGTATGA OGATCCACAG TCTAGGTCTA CACGCTTAC 39) INFORMATION FOR SEQ ID NO: 23: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1145 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN : simple (D) TOPOLOGY: circular (ii) TYPE OF MOLECULE: DNA (genomic)
1S81 / 98MX (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: Geminivirus of yellow leaf crumpling of tomato (B) CEPA: Israel (ix) CHARACTERISTIC: (A) NAME / KEY: CDS (B) LOCATION 37 ... 1107 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 23: accATAGAsc TTTGAsssAT CCCGATTCAT TTCAAC ATO CCT CCT TTA TTT AAA 54 Met Pro Arg Leu Phe Lys
ATA TAT OCC AAA AAT TAT TTC CTA ACA TAT CCC AAT TGT TCT TCT TCT 102 He Tyr Wing Lyß Aßn Tyr Phe Leu Thr Tyr Pro Asn Cyß Ser Leu Ser 10 15 20 AAA GAG GAA GCA CTT TCC CAA TTA AAA AAA CTA GAA ACC CCA ACA AAT 150 Lys Glu Glu Ala Leu Ser Gln Leu Lys Lyß Leu ßlu Thr Pro Thr Aßn 25 30 35 AAA AAA TAC ATC AAA GTT TGA AAA GAA CTC CAC GAG AAT 0__ GAA CCA 198 Lys Lys Tyr He Lys Val Cys Lys Glu Leu His ßlu Asn ßly Glu Pro 40 45 50 CAT CTC CAT GTG CTT ATC CAA TTC GAA GGC AAA TAC CAA TGT AAG AAC 246
His Leu His Val Leu He Gln Phe Glu Gly Lys Tyr Gln Cys Lys Asn 55 60 65 70 CAA CGG TTC TTC GAC TT ßTA TCC CCA AAC Aßß TCA CCA CAT TTC CAT 294 ßln Arg Phe Phe Aßp Leu Val Ser Pro Asn Arg Ser Ala His Phe His 75 80 85 CCG AAC ATT CAG GCA CCT AAG AGC TCA ACA ßAT (3TC AAO ACC TAC GTG 342 Pro Asn He sln Ala Ala Lys Ser Ser Thr Aßp Val Lyß Thr Tyr Val 90 95 100 GAG COA ßAC ßßA AAC TTC ATT ßAT TTT ßGA GTT TCC CAA ATC GAT OOC 390 Glu Arg Asp Gly Asn Phe He Asp Phe Gly Val Ser Gln He Asp Gly 105 110 115
P1S81 / 98MX AGA TCA GCT AGA GGA GGT CAG CAA TCT QCC AAC GAC GCA TAT GCC GAA 438 Arg Wing Arg Gly Gly Gln Gln Ser Wing Aßn Asp Wing Tyr Wing Glu 120 125 130 QCA CTC AAT TCA GGC AGT AAA TCC GAG GCC CTC AAT ATA TTA AAA GAG 486 Wing Leu Asn Ser Gly Ser Lys Ser Glu Wing Leu Asn He Leu Lys Olu 135 140 145 150 AAQ GCC CCA AAQ QAC TAT ATT TTA CAA TTT CAT AAT TTA AGT TCA AAT 534 Lyß Ala Pro Lys Aßp Tyr He Leu sln Phe His Asn Leu Ser Ser Aßn 155 160 165 TTA GAT AGG ATT TTT AGT CCT CCT TTA GAA GTT TAT OTT TCT CCA TTT 582 Leu Aßp Arg lie Phe Ser Pro Pro Leu slu Val Tyr Val -Ser Pro Phe 170 175 180 CTT TCT TCT TCT TTT AAT CAA OTT CCA GAT GAA CTT GAA TGG GTC 630 Leu Ser Ser Phe Aßn sln Val Pro Aßp slu Leu Glu Glu Trp Val 185 190 195 QCC QAG QAC AQ QQ QQ QQQ QQQ QQQ QQQQ QQG QQQ QQQ QQQ AGT 678
Wing Glu Asn Val Val Tyr Wing Wing Arg Pro Trp Arg Pro He Ser 200 205 210 ATT QTC ATT GAQ GGT GAT AGC AGA ACA GGC AAA ACA ATG TGG GCC AGG 726 lie Val lie Qlu Qly Aßp Ser Arg Thr Gly Lys Thr Met Trp Wing Arg 215 220 225 230 TCT CTA GsC CCA CAT AAT TAT TTA TCT OCA CAT CTA GAC CTA AGC CCA 774 Ser Leu Gly Pro Hiß Aßn Tyr Leu Cyß Gly His Leu Aßp Leu Ser Pro 235 240 245 AAQ QTQ TAC AQT AAT GAT GCG TGG TAC AAC QTC ATT GAT QAC QTA QAC 822
Lyß Val Tyr Ser Asn Aßp Wing Trp Tyr Aßn Val He Aßp Aßp Val Aßp 250 255 260 CCG CAT TAT TTA AAG CAC TTC AAG OAA TTC ATG GGG GCC CAO AGQ GAC 870
Pro Hiß Tyr Leu Lyß His Phe Lys Qlu Phe Met Qly Wing Gln Arg Asp 265 270 275 TGG CAA AOC AAC ACA AAG TAC GGG AAG CCC ATT CAA ATT AAA GGG GGA 918 Trp Gln Ser Aßn Thr Lyß Tyr Gly Lyß Pro He Gln lie Lyß Qly Qly 280 285 290 ATT CCC ACT ATC TTC CTC TGC AAT CCA GGA CCT ACC TCC TCA TAT AQG 966
He Pro Thr He Phe Leu Cys Asn Pro Gly Pro Thr Ser Ser Tyr Arg 295 300 305 310 ßAA TAT CTA GAC OAA GAA AAA AAC ATA TCC TTG AAA AAT TGG GCT CTC 1014
Glu Tyr Leu Asp Glu slu Lyß Aßn He Ser Leu Lys Asn Trp Wing Leu 315 320 325 AAG AAT OCA ACC TTC OTC ACC CTC TAC CAO CCA CTG TTC GCA AOT ATC 1062 Lys Asn Wing Thr Phe Val Thr Leu Tyr ßlu Pro Leu Phe Wing Ser Xle 330 335 340
P1581 / 98MX AAT CAA GGT CCA ACA CAA GAT AGC CAA GAA GAA ACC AAT AAG GCG 1107
Asn Gln Gly Pro Thr Gln Asp Ser Gln Glu Glu Thr Asn Lys Wing 345 350 355 TAAGCGTGTA GACCTAGACT GTGGCTGCTC ATACTACC 1145
) INFORMATION FOR SEQ ID NO: 24: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 357 amino acids (B) TYPE: amino acids (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: protein (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 24 Met Pro Arg Leu Phe Lys He Tyr Wing Lys Asn Tyr Phe Leu Thr Tyr 1 5 10 15 Pro Asn Cys Ser Leu Ser Lys Olu Olu Wing Leu Being sln Leu Lys Lys 20 25 30 Leu Glu Thr Pro Thr Asn Lys Lys Tyr He Lys Val Cys Lys slu Leu 35 40 45 His Glu Asn sly slu Pro His Leu His Val Leu He sln Phe Glu Gly 50 55 60 Lys Tyr sln Cys Lys Asn sln Arg Phe Phe Asp Leu Val Ser Pro Asn
65 70 75 80
Arg Ser Wing His Phe His Pro Asn He sln Wing Wing Lys Ser Ser Thr 85 90 95 Asp Val Lys Thr Tyr Val Olu Arg Asp sly Asn Phe He Asp Phe sly 100 105 110 Val Ser sln He Asp sly Arg Ser Wing Arg sly sly sln sln Ser Ala 115 »120 125 Asn Asp Ala Tyr Ala slu Ala Leu Asn Ser sly Ser Lys Ser slu Ala 130 135 140
1S81 / 98MX Leu Asn He Leu Lys slu Lys Wing Pro Lys Asp Tyr He Leu sln Phß "5 150 155 160
His Aßn Leu Ser Ser Asn Leu Aßp Arg He Phe Ser Pro Pro Leu slu 165 170 175
Val Tyr Val Pro Pro Phe Leu Ser Being Phe Aan Oln Val Pro Asp 180 185 190 Olu Leu Olu slu Trp Val Ala slu Asn Val Val Tyr Ser Ala Ala Arg 195 200 205 Pro Trp Arg Pro He Ser He Val II3 slu sly Asp Being Arg Thr sly 210 215 220 Lys Thr Met Trp Wing Arg Ser Leu sly Pro His Asn Tyr Leu Cys Oly
225 230 235 240
His Leu Asp Leu Ser Pro Lys Val Tyr Ser Asn Asp Wing Trp Tyr Asn 245 250 255
Val He Asp Asp Val Asp Pro His Tyr Leu Lys His Phe Lys slu Phe 260 265 270 Met Gly Ala sln Arg Asp Trp Gln Ser Asn Thr Lys Tyr Oly Lys Pro 275 280 2B5 He sln He Lys sly sly He Pro Thr He Phe Leu Cys Asn Pro sly 290 295 300 Pro Thr Ser Ser Tyr Arg slu Tyr Leu Asp Glu Glu Lys Asn He Ser 305 310 315 320
Leu Lys Asn Trp Wing Leu Lys Asn Wing Thr Phe Val Thr Leu Tyr Glu 325 330 335
Pro Leu Phe Wing Being He Asn sln Gly Pro Thr Gln Asp Being Gln Olu 340 345 350 355
P1681 / 98MX) INFORMATION FOR SEQ ID NO: 25: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: C1V2467 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 25 GTTTCCGTCT CGCTCCACGT AGG 23) INFORMATION FOR SEQ ID NO: 26: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1145 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: circular (ii) TYPE OF MOLECULE: DNA (genomic) (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (v) TYPE OF FRAGMENT: N-terminal
1S81 / 98 X (vi) ORIGINAL SOURCE (A) ORGANISM: Tomato yellow leaf wrinkling virus (B) CEPA: Israel (ix) CHARACTERISTICS: (A) NAME / KEY: CDS (B) LOCATION 37 ... 1107 (x) PUBLICATION INFORMATION: (A) AUTHORS: Mavot, N Pichersky, R Zeidan, D Zamir, D Czosne (B) TITLE: Yellow leaf crumpling virus of the tomato: A geminivirus transmitted by the whitefly with an individual geomomic component. (C) GACETA: Virology (D) VOLUME: 185 (F) PAGES: 151-168 (G) DATE: 1991 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 26 sccATAsAGc-TTTGAsssAT CCCGATTCAT TTCAAC ATG CCT COT TTA TTT AAA 54 Met Pro Arg Leu Phe Lys 1 5 ATA TAT sCC AAA AAT TAT TTC CTA ACA TAT CCC AAT TGT TCT CTC TCT 102 He Tyr Ala Lys Aßn Tyr Phe Leu Thr Tyr Pro Asn Cys Ser Leu Ser 10 15 20
P1681 / 98MX AAA GAG GAA GCA CTT TCC CAA TTA AAA AAA CTA GAA ACC CCA ACA AAT 150 Lys Glu Glu Ala Leu Ser ßln Leu Lys Lys Leu Glu Thr Pro Thr Asn 25 30 35 AAA AAA TAC ATC AAA GTT TGC AAA GAA CTC CAC GAO AAT GGG GAA CCA 198 Lyß Lyß Tyr He Lyß Val Cys Lyß ßlu Leu Hiß ßlu Aßn ßly and Glu Pro 40 45 50 CAT CTC CAT GTG CTT ATC CAA TTC GAA GGC AAA TAC CAA TGT AAG AAC 246
His Leu His Val Leu He Gln Phe Glu Gly Lys Tyr sln Cys Lys Asn 55 60 65 70 CA CGG TTC TTC GAC TTG GTA TCC CCA AAC AGG TCA OCA CAT TTC CAT 294 Gln Arg Phe Phe Asp Leu Val Ser Pro Asn Arg Ser Ala His Phe Hiß 75 80 85 CCG AAC ATT CAO OCA GCT AAG AGC TCA ACA GAT GTC AAG ACC TAC GTG 342 Pro Aßn He Qln Wing Ala Lyß Ser Ser Thr Asp Val Lyß Thr Tyr Val 90 95 100 GAG AAA GAC GQA AAC TTC ATT GAT TTT GGA GTT TCC CAA ATC GAT GGC 390 Glu Lyß Asp Gly Asn Phe He Aßp Phe Oly Val Ser sln He Asp Gly 105 110 115 AGA TCA GCT AGA GGA GGT CAO CAA- TCT GCC AAC GAC QCA TAT QCC QAA 438 Arg Ser Ala Arg Gly Gly Gln Gln Be Wing Aßn Aßp Wing Tyr Wing Glu 120 125 130 GCA CTC AAT TCA GGC AGT AAA TCC GAG QCC CTC AAT ATA TTA AAA GAG 486 Wing Leu Aßn Ser Qly Ser Lys Ser slu Wing Leu Aßn He Leu Lyß Qlu 135 140 145 150 • AAG GCC CCA AAG GAC TAT ATT TTA CAA TTT CAT AAT TTA AGT TCA AAT 534 Lyß Ala Pro Lyß Aßp Tyr He Leu Gln Phe His Asn Leu Ser Ser Asn 155 160 165 TTA OAT AGG ATT TTT AQT CCT CCT TTA OAA QTT TAT QTT TCT CCA TTT 582 Leu Asp Arg He Phe Ser Pro Pro Leu Glu Val Tyr Val Ser Pro Phe 170 175 180 CTT TCT TCT TCT TTT AAT CAA GTT CCA GAT GAA CTT GAA GAO TGG GTC 630 Leu Being Being Phe Asn sln Val Pro Asp slu Leu Qlu slu Trp Val 185 190 195 GCC QAQ AAC GTC QTQ TAT TCCGCT QCG CGQ CCA TGG AGA CCC ATA AGT 678 Wing Glu Aßn Val Val Tyr Ser Wing Wing Arg Pro Trp Arg Pro He Ser 200 205 210 ATT GTC ATT GAG GGT GAT AQC AGA ACA GGC OCA ACA ATG TGG GCC AGG 726 Ile Val He Qlu Gly Aßp Ser Arg Thr Gly Ala Thr Met Trp Ala Arg 215 220 225 230
1S81 / 98MX TCT CTA GGC CCA CAT AAT TAT TTA TGT GGA CAT CTA GAC CTA AGC CCA 774 Ser Leu sly Pro His Aßn Tyr Leu Cys Gly His Leu Asp Leu Ser Pro 235 240 245 AAG GTG TAC AGT AAT GAT GCG TGG TAC AAC GTC ATT GAT GAC GTA OTA 822 Lya Val Tyr Ser Aßn Asp Wing Trp Tyr Asn Val He Aßp Aßp Val Aßp 250 255 260 CCG CAT TAT TTA AAG CAC TTC AAG GAA TTC ATG GGG GCC CAG AGG GAC 870 Pro Hiß Tyr Leu Lys His Phe Lyß Glu Phe Met Gly Wing Gln Arg Asp 265 270 275 TGG CAA AGC AAC ACA AAG TAC CGG AAQ CCC ATT CAA ATT AAA GGG GGA 918 Trp Gln Ser Asn Thr Lys Tyr Gly Lys Pro He Gln He Lys Gly Gly 280 285 290 ATT CCC ACT ATC TTC CTC TCC AAT CCA GGA CCT ACC TCC TCA TAT AGO 966 He Pro Thr He Phe Leu Cys Aßn Pro Gly Pro Thr Ser Ser Tyr Arg 295 300 305 310 GAA TAT CTA GAC OAA GAA AAA AAC ATA TCC TTO AAA AAT TOO GCT CTC 1014 slu Tyr Leu Aßp Glu Glu Lys Asn He Ser Leu Lyß Aßn Trp Ala Leu 315 320 325 AAG AAT GCA ACC TTC GTC ACC CTC TAC GAO CCA CTQ TTC OCA AOT ATC 1062 Lys Asn Wing Thr Phe Val Thr Leu Ty r Glu Pro Leu Phe Wing Ser He 330 335 340 AAT CAA GGT CCA ACA CAA GAT AGC CAA GAA GAA ACC AAT AAG ßCß 1107
Asn Gln ßly Pro Thr Gln Asp Ser sln Glu Glu Thr Asn Lys Wing 345 350 355 TAAGCGTGTA GACCTAGACT GTßßCTGCTC ATACTACC 1145
2) INFORMATION FOR SEQ ID NO: 27: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 357 amino acids (B) TYPE: amino acids (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: protein (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID N? : 27:
P1681 / 98MX Met Pro Arg Leu Phe Lys He Tyr Ala Lys Asn Tyr Phe Leu Thr Tyr 1 5 10 15
Pro Asn Cys Ser Leu Ser Lys slu slu Ala Leu Ser sln Leu Lys Lys 20 25 30 Leu Qlu Thr Pro Thr Asn Lys Lys Tyr He Lyß Val Cys Lys slu Leu 35 40 45 His Qlu Asn sly slu Pro His Leu His Val Leu He sln Phe slu sly 50 55 60 Lys Tyr sln Cys Lys Asn sln Arg Phe Phe Asp Leu Val Ser Pro Asn 65 70 75 80
Arg Ser Ala His Phe His Pro Asn He sln Ala Ala Lys Ser Ser Thr 85 90 95
Aßp Val Lyß Thr Tyr Val slu Lys Aßp sly Aßn Phe He Asp Phe sly 100 105 110 Val Ser sln He Asp sly Arg Ser Wing Arg Gly Gln Gln Gln Ser Wing 115 120 125 Asn Aep Ala Tyr Ala slu Ala Leu Asn Ser Gly Ser Lys Ser slu Wing 130 135 140 Leu Asn He Leu Lys slu Lys Wing Pro Lys Asp Tyr He Leu Oln Phe 145 150 155 160
His Asn Leu Being Ser Asn Leu Asp Arg He Phe Ser Pro Pro Leu siu 165 170 175
Val Tyr Val Pro Pro Phe Leu Ser Ser Phe Asn Gln Val Pro Asp 180 185 190 slu Leu slu slu Trp Val Ala slu Asn Val Val Tyr Ser Ala Ala Arg 195 200 205 Pro Trp Arg Pro He Ser He Val He slu sly Asp Being Arg Thr sly 210 215 220 Wing Thr Met Trp Wing Arg Ser Leu sly Pro His Asn Tyr Leu Cys Oly
225 230 235 240
P1681 / 98MX His Leu Asp Leu Ser Pro Lys Val Tyr Ser Asn Asp Wing Trp Tyr Aßn 245 250 255
Val He Asp Aßp Val Asp Pro His Tyr Leu Lys His Phe Lys slu Phe 260 265 270 Met Oly Ala sln Arg Asp Trp sln Ser Asn Thr Lys Tyr sly Lys Pro 275 280 285 He Oln He Lyiss sly sly He Pro Thr He Phe Leu Cys Aßn Pro sly 290 295 300 Pro Thr Ser Ser Tyr Arg slu Tyr Leu Aßp slu slu Lyß Asn He Ser 305 310 315 320
Leu Lys Asn Trp Wing Leu Lys Asn Wing Thr Phe Val Thr Leu Tyr Olu ~ 325 330 335
Pro Leu Phe Wing Be He Asn sln sly Pro Thr sln Asp Ser sln Olu 340 345 350
355 2) INFORMATION FOR SEQ ID NO: 28: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 26 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: »NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE
P1681 / 98MX (A) ORGANISM: C1V2101 (xi) DESCRI PTION FOR THE SEQUENCE: SEQ. ID No: 28: GGCCCACATT GTTGCGCCTG TTCTGC 26
2) INFORMATION FOR SEQ ID NO: 29: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1145 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: circular ( ii) TYPE OF MOLECULE: genomic DNA (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (v) TYPE OF FRAGMENT: N-terminal (vi) ORIGINAL SOURCE (A) ORGANISM: Tomato spot virus (C) CEPA : Israel (ix) CHARACTERISTICS: (A) NAME / KEY: CDS (B) LOCATION: 37..1107 (x) PUBLICATION INFORMATION: (A) AUTHORS: Navot, N Pichersky, R Zeidan, D Zamir, D Czosnek ( B) TITLE: Leaf crumpling virus
P1681 / 98MX tomato yellow: A geminivirus transmitted by the whitefly with an individual genomic component (C) GACETA: Virology. (D) VOLUME: 185 (F) PAGES: 151-168 (G) DATE: 1991 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 29: GCCATAGAGC TTTGAGGGAT CCCGATTCAT TTCAAC ATG CCT CGT TTA TTT AAA 54 Met Pro Arg Leu Phe Lys 1 5 ATA TAT GCC AAA AAT TAT TTC CTA ACA TAT CCC AAT TGT TCT CTC TCT 102 He Tyr Ala Lys Asn Tyr Phe Leu Thr Tyr Pro Asn Cys Ser Leu Ser 10 15 20 AAA GAG GAA GCA CTT TCC CAA TTA AAA AAA CTA GAA ACC CCA AAT 150 Lyß ßlu Glu Ala Leu Ser Gln Leu Lyß Lyß Leu Glu Thr Pro Thr Aßn 25 30 35 AAA AAA TAC ATC AAA GTT TGC AAA GAA CTC CAC GAG AAT GGG GAA CCA 198 Lyß Lys Tyr He Lyß Val Cys Lyß Glu Leu His slu Aßn ßly Glu Pro 40 45 50 CAT CTC CAT GTG CTT ATC CAA TTC OAA ßGC AAA TAC CAA TGT AAG AAC 246 His Leu His Val Leu He Gln Phe slu Gly Lys Tyr Gln Cys Lys Asn 55 60 65 70 CAA CGG TTC TTC GAC TTG GTA TCC CCA AAC AGß TCA GCA CAT TTC CAT 294 Gln Arg Phe Phe Asp Leu Val Ser Pro Aßn Arg Ser Ala His Phe Hiß 75 80 85 CCG AAC ATT CAG GCA GCT AAG AGC TCA ACA GAT GTC AAG ACC TAC GTG 342 Pro Aßn He Gln Wing Wing Lyß Ser Ser Thr Asp Val Lys Thr Tyr Val 90 95 100 GAG AAA GAC GGA AAC TTC ATT GAT TTT GGA GTT TCC CAA ATC GAT GGC 390 Glu Lyß Asp ßly Asn Phe He Asp Phe Gly Val Ser sln He Asp sly 105 110 115 AßA TCA OCT AGA GGA GGT CAG CAA TCT GCC AAC GAC GCA TAT GCC ßAA 438 Arg Be Ala Arg ßly ßly ßln ßln Ser Ala Asn Aßp Ala Tyr Ala ßlu 120 125 130
P1681 / 98MX GCA CTC AAT TCA GGC AGT AAA TCC GAG GCC CTC AAT TTA AAA GAG 486 Wing Leu Asn Ser Gly Ser Lys Ser Glu Wing Leu Asn He Leu Lys Glu 135 140 145 150 AAG GCC CCA AAG GAC TAT ATT TTA CAA TTT CAT AAT TTA AGT TCA AAT 534 Lys Wing Pro Lys Aßp Tyr He Leu Gln Phe His Asn Leu Ser Ser Asn 155 160 165 TTA GAT AGß ATT TTT AßT CCT CCT TTA ßAA ßTT TAT GTT TCT CCA TTT 582
Leu Asp Arg He Phe Ser Pro Pro Leu Glu Val Tyr Val Ser Pro Phe 170 175 180 CTT TCT TCT TCT TTT AAT CAA GTT CCA GAT GAA CTT GAA TGG GTC 630 Leu Ser Ser Phe Asn Gln Val Pro Aßp Glu Leu Glu Glu Trp Val 185 190 195 GCC GAG AAC GTC GTG TAT TCC GCT GCG CGG CCA TGG AGA CCC ATA AGT 678
Wing Glu Asn Val Val Tyr Wing Wing Arg Pro Trp Arg Pro He Ser 200 205 210 ATT GTC ATT GAG GGT GAT AGC AGA ACA GGC AAA ACA ATG TGG GCC AGG 726 He Val He Glu Gly Asp Ser Arg Thr Gly Lys Thr Met Trp Wing Arg 215 220 225 230 TCT CTA GGC CCA CAT AAT TAT TTA TGT GGA CAT CTA GAC CTA AGC CCA 774 Ser Leu Gly Pro His Asn Tyr Leu Cys Gly His Leu Aßp Leu Ser Pro 235 240 245 AAG QTG TAC AGT AAT GAT GCG TGG TAC AAC GTC ATT AGA GAC GTA GAC 822 Lys Val Tyr Ser Asn Asp Wing Trp Tyr Aßn Val He Arg Asp Val Asp 250 255 260 CCG CAT TAT TTA AAG CAC TTC AAC GAA TTC ATG GGG GCC CAG AGG GAC 870 Pro His Tyr Leu Lys His Phe Lys Glu Phe Met Gly Ala sln Arg Asp 265 270 275 TGG CAA AGC AAC ACA AAG TAC GGG AAG CCC ATT CAA ATT AAA GGG GGA 918 Trp sln Ser Asn Thr Lys Tyr sly Lys Pro He Gln He Lys Gly Gly 280 285 290 ATT CCC ACT ATC TTC CTC TGC AAT CCA GGA CCT ACC TCC TCA TAT AGG 966 He Pro Thr He Phe Leu Cys Asn Pro Gly Pro Thr Ser Ser Tyr Arg 295 300 305 310 GAA TAT CTA GAC GAA GAA AAA AAC ATA TCC TTG AAA AAT TGG GCT CTC 1014 Glu Tyr Leu Asp Glu Glu Lys Asn He Ser Leu Lys Asn Trp Ala Leu 315 320 325
P1681 / 98MX AAA AAT OCA ACC TTC CTC ACC CTC TAC CAG CCA CTG TTC GCA ACT ATC 1062
Lys Aßn Wing Thr Phe Val Thr Leu Tyr Glu Pro Leu Phe Wing Ser He 330 335 340 AAT CAA CGT CCA ACA CAA GAT AGC CAA GAA GAA ACC AAT AAß ßCß 1107
Asn sln sly Pro Thr Gln Asp Ser Gln slu slu Thr Asn Lys Wing 345 350 355 TAAGCGTGTA GACCTAGACT GTGGCTGCTC ATACTACC 1145
2) INFORMATION FOR SEQ ID NO: 30: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 357 amino acids (B) TYPE: amino acids (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: protein (xi) DESCION FOR THE SEQUENCE: SEQ. I D No: 30:
Met Pro Arg Leu Phe Lys lie Tyr Ala Lyß Aßn Tyr Phe Leu Thr Tyr 1 5 10 15
Pro Asn Cys Ser Leu Ser Lys Glu Glu Ala Leu Ser Gln Leu Lys Lys 20 25 30 Leu Glu Thr Pro Thr Aen Lys Lys Tyr He Lys Val Cys Lys Olu Leu 35 40 45 His slu Aßn sly slu Pro His Leu His Val Leu He Oln Phe Glu sly 50 55 60 Lys Tyr Gln Cys Lys Aßn Gln Arg Phe Phe Asp Leu Val Ser Pro Aßn 65 70 75 80
Arg Ser Ala His Phe His Pro Asn He Oln Ala Ala Lys Ser Ser Thr 85 90 95
Asp Val Lys Thr Tyr Val slu Lys Asp sly Asn Phe He Asp Phe Oly 100 105 110
P1681 / 98MX Val Ser Gln He Asp sly Arg Ser Wing Arg sly Oly sln sln Ser Wing 115 120 125 Asn Asp Ala Tyr Ala slu Ala Leu Asn Ser sly Ser Lys Ser slu Ala 130 135 140 Leu Asn He Leu Lys slu Lys Ala Pro Lys Asp Tyr He Leu sln Phe 145 150 155 160
His Asn Leu Being Ser Asn Leu Asp Arg He Phe Ser Pro Pro Leu slu 165 170 175
Val Tyr Val Pro Pro Phe Leu Ser Be Ser Phe Asn sln Val Pro Asp 180 185 190 Glu Leu Olu Olu Trp Val Ala slu Asn Val Val Tyr Ser Ala Ala Arg 195 200 205 Pro Trp Arg Pro He Ser He Val He slu sly Asp Ser Arg Thr Oly 210 215 220 Lys Thr Met Trp Wing Arg Ser Leu Qly Pro His Asn Tyr Leu Cys Gly
225 - 230 235 240
His Leu Asp Leu Ser Pro Lys Val Tyr Ser Asn Aßp Wing Trp Tyr Aßn 245 250 255
Val He Arg Asp Val Asp Pro His Tyr Leu Lys His Phe Lys Glu Phe 260 265 270 Met sly Wing Oln Arg Asp Trp Oln Ser Asn Thr Lys Tyr sly Lys Pro 275 280 285 He Oln He Lys Oly Gly He Pro Thr He Phe Leu Cyß Aßn Pro Gly 290 295 300 Pro Thr Ser Ser Tyr Arg Glu Tyr Leu Asp Glu Glu Lyß Asn He Ser
P1681 / 98MX 305 310 315 320
Leu Lys Asn Trp Wing Leu Lys Asn Wing Thr Phe Val Thr Leu Tyr slu 325 330 335
Pro Leu Phe Wing Be As Asn Gln Gly Pro Thr Gln Asp Ser sln slu 340 345 350 slu Thr Asn Lys Wing 355 2) INFORMATION FOR SEQ ID NO: 31: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 28 base pairs (B) TYPE: nucleic acid (C) CHAIN TYPE: simple (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: other nucleic acid (A) DESCION: / desc = "Oligonucleotide primer" (iii) HYPOTHETICAL: NO (iv) ANTICIPATE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: C1V2000 (xi) DESCION FOR THE SEQUENCE: SEQ. ID No: 31: GGGTCTACGT CTCTAATGAC GTTGTACC 28
2) INFORMATION FOR SEQ ID NO: 32: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 34 base pairs (B) TYPE: nucleic acid
P1681 / 98MX (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (vi) SOURCE ORIGINAL (TO) ORGANISM: PTYC2V1499 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID N ?: 32: ATTTGTGGAT CCCATTACCT TCCTGATGTT GTGG 34
2) INFORMATION FOR SEQ ID NO: 33: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear ( ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (vi) ORIGINAL SOURCE (A) ORGANISM: PTYARIv466 (xi) DESCRIPTION FOR SEQUENCE: SEQ. ID No: 33: TTAGGATCCT ATATCTGTTG TAAGGGC 27
2) INFORMATION FOR SEQ ID NO: 34:
P1681 / 98MX (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: other acid nucleic (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (vi) ORIGINAL SOURCE (A) ORGANISM: PTYARIcl046 (xi) DESCRIPTION FOR SEQUENCE: SEQ. ID N? : 34: TTAACTAATG CAGGATCCTA CATTCCAGAG CGC 33
2) INFORMATION FOR SEQ ID NO: 35: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear ( ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (vi) ORIGINAL SOURCE (A) ORGANISM: PTYC2cl814
P1681 / 98MX (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 35: AAACGGATCC TTGAAAAATT GGGC 24
2) INFORMATION FOR SEQ ID NO: 36: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 26 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear ( ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (vi) ORIGINAL SOURCE (A) ORGANISM: PYTVlvll64 (xi) DESCRI PTION FOR THE SEQUENCE: SEQ. I D No: 36: ßTACG-AGAAC CATACTGAAA ACGCCT 26
2) INFORMATION FOR SEQ ID NO: 37: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 28 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear ( ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer"
P1681 / 98MX (iii) HYPOTHETIC: NO (vi) ORIGINAL SOURCE (A) ORGANISM: PTYClc2196 (xi) DESCRIPTION FOR SEQUENCE: SEQ. ID No: 37: AAATCTsCAG ATGAACTAGA AGAGTGGG 28
2) INFORMATION FOR SEQ ID NO: 38: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 32 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear ( ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (xi) DESCRIPTION FOR SEQUENCE: SEQ. ID No: 38: GGTTCTscAs CA_AscAstt OATCATOTAT TO 32
2) INFORMATION FOR SEQ ID NO: 39: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 29 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear ( ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Primer of
P1681 / 98MX oligonucleotide PTYClv2182"(iii) HYPOTHETIC: NO (xi) DESCRI PTION FOR SEQUENCE: SEQ ID No: 39: TAssccATss ccßCGCACcs GAATACACG 29
2) INFORMATION FOR SEQ ID NO: 40: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 28 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear ( ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer PTYC1V2406" (iii) HYPOTHETIC: NO (xi) DESCRIPTION FOR SEQUENCE: SEQ. ID No: 40: osttctscAs CTTCGCCATA TGCGTCCT 28
2) INFORMATION FOR SEQ ID NO: 41: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 28 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear ( ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer PTYClc2140"
P1681 / 98MX (iii) HYPOTHETIC: NO (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 41:
ATTTCCATGß AGACCCATAA GTATTßTC 28
2) INFORMATION FOR SEQ ID NO: 42: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 29 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear ( ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer PTYCvl707" (iii) HYPOTHETIC: NO (xi) DESCRIPTION FOR SEQUENCE: SEQ. ID No: 42: GGTAGTATsA OGATCCACAG TCTACßTCT 29
2) INFORMATION FOR SEQ ID NO: 43: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1183 base pairs (B) TYPE: nucleic acid (C) CHAIN TYPE: simple (D) TOPOLOGY: circular ( ii) TYPE OF MOLECULE: DNA (genomic) (iii) HYPOTHETIC: NO (iv) ANTICIPATION: NO (vi) ORIGINAL SOURCE
P1681 / 98MX (A) ORGANISM: Geminivirus of golden bean mosaic (B) CEPA: Isolated Type II (C) INDIVIDUAL ISOLATED: Guatemala (ix) CHARACTERISTICS: (A) NAME / KEY: CDS (B) LOCATION: 1 .. 1059 (x) PUBLICATION INFORMATION: (A) AUTHORS: Faria, JC Gilbertson, RL Hanson, SF Morales, FJ Ahlquist, P Loniello, AO Maxwell, D (B) TITLE: Type II isolates of bean dwarf mosaic geminivirus of the Dominican Republic and Guatemala: Nucleotide sequences, infectious pseudorecombinants and phylogenetic relationships. (C) GACETA: Phytopathology (D) VOLUME: 84 (E) EXAMPLE: 3 (F) PAGES: 321-329 (G) DATE: 1994 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 43:
MX ATO CCA CCA CCT CAA AGA TTT AGA OTT CAO TCG AAA AAC TAT TTC CTC 48 Mét Pro Pro Pro Gln Arg Phe Arg Val sln Ser Lyß Asn Tyr Phe Leu 1 5 10 15 ACT TAT CCT csr toc CCT ATA ccs AAA GAA GAA GTT CTT TCG CAÁ CTT 96 Thr Tyr Pro Arg Cys Pro He Pro Lys Glu Glu Val Leu Ser sln Leu 20 25 30 CAG AAG ATT CAT ACA GCC ACG AAT AAA AAA TTC ATC AAA GTC TGT GAG 144 Gln Lys He Hiß Thr Ala Thr Aßn Lys Lys Phe He Lys Val Cys Glu 35 40 45 GAA CGT CAC GAG AAT GGT GAA CCT CAT CTT CAT GCG CTT ATT CAA TTC 192 Glu Arg His Glu Aßn ßly Glu Pro Hiß Leu Hiß Ala Leu He Gln Phe 50 55 60 OAA aOT AAA TTC GTC TGC ACA AAT AAA AGA TTG TTC GAC CTG GTA TCC 240
Glu Gly Ly # Phe Val Cyß Thr Asn Lys Arg Leu Phe Asp Leu Val Ser 65 70 75 80 TCA ACC AGG TCA GCA CCT TTC CAT CCG AAC ATT CAO CJGA GCT AAA TCA 288 Ser Thr Arg Ser Wing Pro Phe His Pro Aßn He Gln Gly Ala Lys Ser 85 90 95 AßT TCA ßAC ßTC AAG GCA TAC ATC GAC AAA GAT GGA GTC ACA ATC GAA 336 Ser Ser Asp Val Lyß Ala Tyr He Aßp Lyß Aßp Gly Val Thr He Glu 100 105 110 Tßß COA CAA TTC CAA CTC CAC GGC AGA TCT GCA AGA GGA GGT CAG CAG 384 Trp Gly ßln Phe Gln Val Asp Gly Arg Ser Wing Arg Gly Gln Gln Gln 115 120 125 TCT sCC AAC OAC TCA TAT GCA AAG OCA TTA AAC OCA GAT TCA ATT OAA 432 Ser Ala Aan Asp Be Tyr Ala Lyß Ala Leu Asn Ala Asp Ser He Glu 130 135 140 TCT GCC TTG ACA ATA TTG AAG GAA GAA CAA CCG AAA GAT TAC GTC CTT 480 Ser Ala Leu Thr He Leu Lys slu slu Gln Pro Lys Asp Tyr Val Leu 145 150 155 160 CAA CAT CAC AAC ATC CGT TCT AAT CTC GAA CGG ATC TTC GTC AAA GTG 528
Gln His His Asn He Arg Ser Asn Leu slu Arg He Phe Val Lys Val 165 170 175 CCQ GAA CCA TGG GTT CCT CCA TTT CCG TTG TCA TCA TTC ATC AAT GTT 576 Pro Glu Pro Trp Val Pro Pro Phe Pro Leu Ser Ser Phe Xle Asn Val 180 185 190 ccs sTT CTT ATO CAA OAA TGO GTT OAC OAC TAT TTC ssA Ass sot TCC 624
Pro Val Val Met Gln Glu Trp Val Aßp Aßp Tyr Phe Gly Arg Gly Ser 195 200 205
P1681 / 98MX GCT GCG CGG CCG GAA AGA CCT ATT AGT ATC ATC GTC GAA GGT GAT TCA 672 Wing Wing Arg Pro Glu Arg Pro lie Ser He He Val Qlu Qly Asp Ser 210 215 220 CGA ACC COA AAO ACA ATC TGG GCT COT OCA TTA GGA CCA CAT AAT TAT 720 Arg Thr Gly Lys Thr Met Trp Wing Arg Wing Leu Gly Pro His Aßn Tyr 225 230 235 240 TTG AGC GGT CAT TTG GAC TTT AAT TCA CGT GTC TAT TCC AAC GCA GTQ 768 Leu Ser Gly His Leu Asp Phe Asn Ser Arg Val Tyr Ser Asn Ala Val 245 250 255 GAA TAC AAC GTC ATT GAT GAC ATA AGC CCC AAT TAT TTG AAG TTA AAG 816
Glu Tyr Asn Val He Asp Asp As Pro Asn Tyr Leu Lys Leu Lys 260 265 270 CAC TGG AAA GAA CTA ATT GGG GCA CAA AAG GAC TGG CAA TCT AAC TGT 864 His Trp Lys Glu Leu He Gly Wing Gln Lys Aßp Trp sln Ser Aßn Cys 275 280 285 AAA TAT ss¿_ AAs ccs ott CAÁ ATT AAA OGA ssA ATA CCA TCA ATC ots 912
Lys Tyr Gly Lys Pro Val Gln He Lys sly Gly He Pro Ser He Val 290 295 300 TTG TC5C AAT CCA G_T GAO OOT TCC AOT TAT AAA GAC TTC CTC GAC AAA 960 Leu Cys Aßn Pro Qly slu Qly Ser Ser Tyr Lyß Asp Phe Leu Asp Lyß 305 310 315 320 QAA QAA AAC CGA QCT TAC CAC AAC TGG ACT ATT CAT AAT GCG ATC TTC 1008
Glu slu Asn Arg Ala Leu His Asn Trp Thr He His Asn Wing He Phe 325 330 335 GTC ACC CTC ACA GCC CCC CTC TAT CAA AGC ACA ACA CAG GAT TGC CAA 1056 Val Thr Leu Thr Ala Pro Leu Tyr Gln Ser Thr Thr Gln Asp Cys sln 340 345 350 ACß TAßAGCCATT CGTCGACGAC GCATTGACTT GAACTGCGGC TGTTCCATAT 1109
Thr TTTACCATAT CAAGTGCGCA GATCATGGAT TCACGCACAG GßGAGAACAT CACTGCGCAT 1169
CAGGCAGAGA ATTC i * 83
2) INFORMATION FOR SEQ ID NO: 44: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 353 amino acids (B) TYPE: amino acids (D) TOPOLOGY: linear
P1681 / 98MX (ii) TYPE OF MOLECULE: protein (xi) DESCRIPTION FOR SEQUENCE: SEQ. ID No: 44:
Met Pro Pro Pro sln Arg Phe Arg Val sln Ser Lys Asn Tyr Phe Leu 1 5 10 15
Thr Tyr Pro Arg Cys Pro He Pro Lys slu slu Val Leu Ser sln Leu 20 25 30 Gln Lys He His Thr Wing Thr Asn Lys Lys Phe He Lyß Val Cyß Qlu
40 45 Glu Arg His Glu Asn Gly slu Pro His Leu His Wing Leu He sln Phe 50 55 60 Olu Gly Lys Phe Val Cys Thr Asn Lys A.rg Leu Phe Asp Leu Val Ser 65 70 75 80
Being Thr Arg Being Wing Pro Phe His Pro Asn He Oln Gly Wing Lys Ser 85 90 95
Be Ser Aßp Val Lyß Wing Tyr He Asp Lys Aßp Gly Val Thr He Glu
100 105 110 Trp Oly OIA Phe Gln Val Asp Gly Arg Be Wing Arg Gly Gly Gln Oln 115 120 125 Be Wing Asn Asp Ser Tyr Wing Lys Wing Leu Asn Wing Asp Ser He Olu 130 135 140 Be Ala Leu Thr He Leu Lys slu Glu Gln Pro Lys Aßp Tyr Val Leu 145 150 155 160 sln His His Asn He Arg Ser Asn Leu slu Arg He Phe Val Lyß Val 165 170 175
Pro Olu Pro Trp Val Pro Pro Phe Pro Leu Ser Ser Phe He Asn Val 180 185 190
P1681 / 98MX Pro Val Val Met Gln Glu Trp Val Asp Asp Tyr Phe Gly Arg Gly Ser 195 200 205 Wing Wing Arg Pro Glu Arg Pro W He Ser He He Val Glu Gly Asp Ser
21C 215 220 Arg Thr Gly Lys Thr Met Trp Wing Arg Wing Leu Gly Pro His Asn Tyr 225 230 235 240
Leu Ser Gly His Leu Asp Phe Asn Ser Arg Val Tyr Ser Asn Ala Val 245 250 255
Glu Tyr Asn Val He Asp Asp He Ser Pro Aßn Tyr Leu Lys Leu Lys
260 265 270 His Trp Lys Glu Leu He Oly Wing Oln Lys Asp Trp Gln Ser Asn Cys 275 280 285 Lys Tyr Gly Lys Pro Val Gln He Lys Oly sly He Pro Ser He Val 290 295 300 Leu Cys Asn Pro Oly Olu Oly Ser Ser Tyr Lys Asp Phe Leu Asp Lys 305 310 315 320
Glu Glu Asn Arg Ala Leu His Asn Trp Thr He His Asn Ala He Phe 325 330 335
Val Thr Leu Thr Ala Pro Leu Tyr Gln Ser Thr Thr Qln Aßp Cys Qln 340 345 350 Thr
2) INFORMATION FOR SEQ ID NO: 45: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1183 base pairs (B) TYPE: nucleic acid (C) CHAIN TYPE: simple (D) TOPOLOGY: circular ( ii) TYPE OF MOLECULE: DNA (genomic)
P1681 / 9-8MX (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: Geminivirus of golden bean mosaic (B) CEPA: Type II (C) INDIVIDUAL ISOLATED: Guatemala (ix) CHARACTERISTICS: (A) NAME / KEY: CDS (B) LOCATION: 1..1059 (xi) DESCRIPTION OF THE SEQUENCE: SEQ. ID No: 45: ATO CCA CCA CCT CAA AGA TTT AOA GTT CAG TCG AAA AAC TAT TTC CTC 4B Met Pro Pro Pro Gln Arg Phe Arg Val Gln Ser Lys Asn Tyr Phe Leu 1 5 10 15 ACT TAT CCT CGT TGC CCT ATA CCG AAA CAA C3AA sTT CTT CTG CAA CTT 96 Thr Tyr Pro Arg Cyß Pro He Pro Lyß Glu Glu Val Leu Ser Gln Leu 20 25 30 CAG AAG ATT CAT ACA GCC ACG AAT AAA AAA TTC ATC AAA GTC TGT GAO 144 ßln Lyß He His Thr Ala Thr Asn Lys Lys Phe He Lys Val Cys Qlu 35 40 45 OAA COT CAC GAG AAT GGT OAA CCT CAT CTT CAT GCG CTT ATT CAA TTC 192 ßlu Arg His slu Asn sly Glu Pro His Leu His Ala Leu He ßln Phe 50 55 60 ßAA OGT AAA TTC OTC TCC ACA AAT AAA AßA TTβ TTC GAC CTß ßTA TCC 240 ßlu ßly Lys Phe Val Cys Thr Asn Lys Arg Leu Phe Asp Leu Val Ser 65 70 75 80 TCA ACC AOG TCA OCA CCT TTC CAT ccs AAC ATT CAO ßßA GCT AAA TCA 288 Ser Thr Arg Ser Wing Pro Phe Hiß Pro Asn He Gln Gly Wing Lyß Ser 85 90 95 AGT TCA GAC OTC AAQ GCA TAC ATC GAC AAA OAT GGA GTC ACA ATC GAA 336 Ser Ser Asp Val Lys Wing Tyr He Asp Lys Asp sl and Val Thr He siu 100 105 110
681/98 X TGG GGA CAA TTC CAA CTC GAC OGC AGA TCT GCA AGA GGA GGT CAQ CAG 384 Trp Gly sln Phe sln Val Asp sly Arg Ser Wing Arg sly sly sln sln 115 120 125 TCT GCC AAC GAC TCA TAT GCA AAO OCA TTA AAC OCA GAT TCA ATT GAA 432 Be Ala Asn Asp Ser Tyr Ala Lys Ala Leu Aßn Ala Aßp Ser He slu 130 135 140 TCT GCC TTG ACA ATA TTß AAG OAA GAA CAA CCG AAA CAT TAC OTC CTT 480 Ser Ala Leu Thr He Leu Lys slu slu Gln Pro Lys Asp Tyr Val Leu 1 5 • _ 150 155 160 CAA CAT CAC AAC ATC COT TCT AAT CTC GAA CGβ ATC TTC ßTC AAA ßTβ 528 ßln His His Asn He Arg Ser Asn Leu Glu Arg He Phe Val Lys Val 165 170 175 CCG GAA CCA TGG GTT CCT CCA TTT CCG TST TCA TCA TTC CGC AAT OTT 576 Pro slu Pro Trp Val Pro Pro Phe Pro Leu Ser Ser Phe Arg Asn Val 180 185 190 CCG GTT GTT ATG CAA GAA TGG GTT GAC CAC TAT TTC GGA AGG GGT TCC 624 Pro Val Val Met Gln Glu Trp Val Asp Asp Tyr Phe Gly Arg Gly Ser 195 200 205 QCT QCß CGG CCG OAA ATS CCT ATT ACT ATC ATC ßTC OAA OOT OAT TCA 672 Ala Wing Arg Pro Olu Arg Pro H e Ser -He He Val Val Glu Gly Aßp Ser 210 215 220 CGA ACC GGA AAG ACA ATG TGG GCT CGT GCA TTA GGA CCA CAT AAT TAT 720 Arg Thr Gly Lys Thr Met Trp Wing Arg Wing Leu Gly Pro His Asn Tyr 225 230 235 240 TTG AGC GGT CAT TTG GAC TTT AAT TC ~ CGT GTC TAT TCC AAC GCA OTG 768 Leu Ser Gly His Leu Asp Phe Asn Ser Arg Val Tyr Ser Asn Ala Val 245 250 255 GAA TAC AAC GTC ATT ßAT ßAC ATA AGC CCC AAT TAT TTG AAG TTA AAG 816 Glu Tyr Asn Val He Asp Asp He Ser Pro Asn Tyr Leu Lys Leu Lys 260 265 270 CAC TGG AAA (3AA CTA ATT QGG GCA CAA AAG GAC TGG CAA TCT AAC TGT 864 His Trp Lys slu Leu He sly Wing s Lys Asp Trp sln Ser Asn Cys 275 280 285 AAA TAT GGA AAG CCG GTT CAA ATT AAA OGA GGA ATA CCA TCA ATC GTG 912 Lys Tyr Gly Lys Pro Val Oln He Lys sly Gly He Pro Ser He Val 290 295 300 TTG TGC AAT CCA GGT GAO 0_T TCC AOT TAT AAA GAC TTC CTC GAC AAA 960 Leu Cys Asn Pro Gly Glu sly Ser Ser Tyr Lys Asp Phe Leu Asp Lys 305 310 315 320
P1681 / 98MX ßAA GAA AAC COA ßCT TTA CAC AAC TGG ACT ATT CAT AAT GCG ATC TTC 1008
Glu slu Asn Arg Ala Leu His Asn Trp Thr He His Asn Wing He Phe 325 330 335 3TC ACC CTC ACA GCC CCC CTC TAT CAA AGC ACA ACA CAG GAT TGC CAA 1056
/ al Thr Leu Thr Ala Pro Leu Tyr sln aer Thr Thr sln Asp Cys sln 340 345 350 -CG TAGAGCCATT CGTCGACGAC GCATTGACTT GAACTOCßGC TGTTCCATAT 1109
? hr
TTACCATAT CAAßTßCCCA GATCATGGAT TCACGCACAG GßGAsAACAT CACTGCGCAT 1169
AGGCAGAGA ATTC H83
2) INFORMATION FOR SEQ ID NO: 46: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 353 amino acids (B) TYPE: amino acids (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: protein (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 46:
Met Pro Pro Pro sln Arg Phe Arg Val Oln Ser Lys Asn Tyr Phe Leu 1 5 10 15
Thr Tyr Pro Arg Cys Pro He Pro Lys slu Olu Val Leu Ser sln Leu 20 25 30 sln Lys He His Thr Wing Thr Asn Lys Lys Phe He Lys Val Cys Olu 35 40 45 Qlu Arg His slu Asn sly Olu Pro His Leu His Wing Leu He Oln Phe 50 55 60 slu sly Lys Phe Val Cys Thr Aßn Lys Arg Leu Phe Aép Leu Val Ser 65 70 75 80
Being Thr Arg Being Wing Pro Phe His Pro Asn He Oln sly Wing Lys Ser 85 - 90 95
P1681 / 98MX Ser Ser Asp Val Lys Wing Tyr He Asp Lys Asp Oly Val Thr He Olu 100 105 110 Trp Oly Oln Phe sln Val Asp sly Arg Ser Wing Arg Gly Gln Gln sln 115 120 125 Ser Wing Asn Asp Ser Tyr Wing Lys Wing Leu Asn Ala Asp Ser He Glu 130 135 140 Be Ala Leu Thr He Leu Lys Glu Glu Oln Pro Lys Asp Tyr Val Leu 145 150 155 160
Oln His His Asn lie Arg Ser Asn Leu Olu Arg He Phe Val Lys Val 165 170 175
Pro Glu Pro Trp Val Pro Pro Phe Pro Leu Ser Ser Phe Arg Asn Val 180 185 190 Pro Val Val Met Gln Glu Trp Val Asp Asp Tyr Phe Gly Arg Gly Ser 195 200 205 Wing Wing Arg Pro Glu Arg Pro He Ser He He Val Glu Oly Aßp Ser 210 * 215 220 Arg Thr Oly Lys Thr Met Trp Ala Arg Ala Leu Oly Pro His Asn Tyr 225 230 235 240
Leu Ser Oly His Leu Asp Phe Asn Ser Arg Val Tyr Ser Asn Ala Val 245 250 255
Glu Tyr Asn Val He Asp Asp He Ser Pro Asn Tyr Leu Lys Leu Lys 260 265 270 His Trp Lys Olu Leu He Oly Wing Qln Lyß Asp Trp Oln Ser Asn Cys 275 280 285 Lys Tyr Gly Lys Pro Val Gln He Lys sly sly He Pro Ser He Val 290 295 300 Leu Cys Asn Pro sly slu Oly Ser Ser Tyr Lys Asp Phe Leu Asp Lys 3 ° 5 310 315 320
P1681 / 98MX Glu Glu Asn Arg Ala Leu His Asn Trp Thr He His Asn Ala He Phe 325 330 335
Val Thr Leu Thr Ala Pro Leu Tyr Gln Ser Thr Thr sln Asp Cys sln 340 345 350 Thr
2) INFORMATION FOR SEQ ID NO: 47: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear ( ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: SHGA191 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 47: CATCATTCCs CAATCTTCCs GT 22
2) INFORMATION FOR SEQ ID NO: 48: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1062 base pairs (B) TYPE: nucleic acid (C) CHAIN TYPE: simple
P1681 / 98 X (D) TOPOLOGY: circular (ii) TYPE OF MOLECULE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: Geminivirus of golden bean mosaic ( B): CEPA: Type II (C): INDIVIDUAL ISOLATED: Guatemala (ix) CHARACTERISTICS: (A) NAME / KEY: CDS (B) LOCATION: 1 ... 1059 (xi) DESCRI PTION FOR THE SEQUENCE: SEQ. I D No: 48: ATG CCA CCA CCT CA AGA TTT AGA GTT CAG TCG AAA AAC TAT TTC CTC 48
Met Pro Pro Pro Gln Arg Phe Arg Val Gln Ser Lys Asn Tyr Phe Leu 1 5 10 15 ACT TAT CCT CGT TGC CCT ATA CCG AAA GAA GAA GTT CTT TCG CAA CTT 96
Thr Tyr Pro Arg Cys Pro He Pro Lys Glu slu Val Leu Ser Gln Leu 20 25 30 CAG AAG ATT CAT ACA GCC ACG AAT AAA AAA TTC ATC AAA GTC TGT QAG 144 Gln Lys He His Thr Ala Thr Asn Lys Lys Phe He Lys Val Cys Glu 35 40 45 GAA CGT CAC GAG AAT GGT GAA CCT CAT CTT CAT GCG CTT ATT CAA TTC 192 Glu Arg His Glu Asn Gly Pro His Leu His Ala Leu He ßln Phe 50 55 60 GAA GGT AAA TTC GTC TGC ACA AAT AAA AGA TTG TTC GAC CTG GTA TCC 240
Glu Gly Lys Phe Val Cys Thr Asn Ly§ Arg Leu Phe Aßp Leu Val Ser 65 70 75 80
P1681 / 98MX TCA ACC AGG TCA GCA CCT TTC CAT CCG AAC ATT CAG GGA GCT AAA TCA 288
Be Thr Arg Be Wing Pro Phe His Pro Asn He Gln Gly Wing Lys Ser 85 90 95 AGT TCA GAC GTC AAG GCA TAC ATC GAC AAA GAT GGA GTC ACA ATC GAA 336 Ser Ser Aßp Val Lys Wing Tyr He Asp Lys Asp Gly Val Thr He Glu 100 105 110 TGG GGA CA CA TTC GTC GAC GGC AGA TCT GCA AGA GGA GGT CAG CAG 384 Trp Gly Gln Phe Gln Val Asp Gly Arg Ser Wing Arg Gly Gln Gln Gln 115 120 125 TCT GCC AAC GAC TCA TAT GCA AAG GCA TTA AAC GCA GAT TCA ATT GAA 432 Be Wing Aßn Asp Ser Tyr Wing Lys Ala Leu Asn Wing Asp Ser He Glu 130 - «135 140 TCT GCC TTG ACA ATA TTG AAG GAA GAA CAA CCG AAA GAT TAC GTC CTT 480 Ser Ala Leu Thr He Leu Lys slu Glu Gln Pro Lyß Asp Tyr Val Leu 145 150 155 160 CAA CAT CAC AAC ATC CGT TCT AAT CTC GAA ATC TTC GTC AAA GTG 528 Gln His His Asn He Arg Ser Asn Leu Glu Arg He Phe Val Lyß Val 165 170 175 CCG GAA CCA TGG GTT CCT CCA TTT CCG TTG TCA TCA TTC ATC AAT GTT 576 Pro Glu Pro Trp Val Pro Pro Phe Pro Leu Ser Ser Phe He Asn Val 180 185 190 CCG GTT GTT ATG CA GAA TGG GTT GAC GAC TAT TTC GGA AGG GGT TCC 624 Pro Val Val Met Gln Glu Trp Val Aßp Asp Tyr Phe Gly Arg sly Ser 195 200 205 GCT GCG CCG GAA AGA CCT ATT AGT ATC ATC GTC AGA GGT GAT TCA 672 Ala Wing Arg Pro Glu Arg Pro He Ser He He Val Arg Gly Asp Ser 210 215 220 CGA ACC GGA AAG ACA ATG TGG GCT CGT GCA TTA GGA CCA CAT AAT TAT 720 Arg Thr Gly Lys Thr Met Trp Wing Arg Wing Leu Gly Pro His Asn Tyr 225 230 235 240 tto Asc ost CAT TTO GAC TTT AAT TCA CCT OTC TAT TCC AAC GCA GTß 768 Leu Ser ßly His Leu Asp Phe Asn Ser Arg Val Tyr Ser Asn Wing Val 245 250 255 ßAA TAC AAC OTC ATT GAT _AC ATA AGC CCC AAT TAT TTG AAO TTA AAO 816 slu Tyr Asn Val He Asp Asp He Ser Pro Asn Tyr Leu Lys Leu Lyß 260 • 265 270
P1681 / 98MX CAC TGG AAA GAA CTA ATT GGG GCA CAA AAG GAC TGG CAA TCT AAC TGT 864 His Trp Lys slu Leu He sly wing sln Lys Asp Trp sln Ser Asn Cys 275 280 285 AAA TAT COA AAO COT OTT CAA ATT AAA GGA GGA ATA CCA TCA ATC GTß 912 Lys Tyr ßly Lys Pro Val ßln He Lys Gly Gly He Pro Ser He Val 290 295 300 TTG TGC AAT CCA OßT GAß ßßT TCC ACT TAT AAA OAC TTC CTC GAC AAA 960 Leu Cys Asn Pro ßly slu Gly Ser Ser Tyr Lys Asp Phe Leu Asp Lys 305 310 315 320 GAA OAA AAC CGA GCT TTA CAC AAC TGG ACT ATT CAT AAT GCO ATC TTC 1008 Glu Glu Asn Arg Ala Leu His Asn Trp Thr He His Asn Wing He Phe 325 330 335 ßTC ACC CTC ACA GCC CCC CTC TAT CAA AGC ACA CAA CAO OAT TGC CAA 1056 Val Thr Leu Thr Ala Pro Leu Tyr Gln Ser Thr Gln Aßp Cys ßln 340 345 350 ACG TAG * 1062
Thr
2) INFORMATION FOR SEQ ID NO: 49: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 353 amino acids (B) TYPE: amino acids (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: protein (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 49: Met Pro Pro Pro Gln Arg Phe Arg Val Gln Ser Lys Asn Tyr Phe Leu 1 5 10 15
P1681 / 98MX Thr Tyr Pro Arg Cys Pro He Pro Lys Glu Glu Val Leu Ser Gln Leu 20 25 30 sln Lys He His Thr Ala Thr Asn Lys Lys Phe lie Lys Val Cys Glu 35 40 45 Glu Arg His Glu Asn Gly Glu Pro His Leu His Wing Leu Xle Gln Phe 50 55 60 Glu Gly Lys Phe Val Cys Thr Asn Lys Arg Leu Phe Asp Leu Val Ser 65 70 75 80
Being Thr Arg Being Wing Pro Phe His Pro Asn lie Gln Gly Wing Lys Ser 85 90 95
Be Ser Asp Val Lyß Wing Tyr lie Asp Lys Asp Gly Val Thr He Glu 100 105 110 Trp Gly Gln Phe Gln Val Asp Gly Arg Be Wing Arg Gly Gln Gln Gln 115 120 125 Ser Wing Asn Asp Ser Tyr Wing Lys Wing Leu Asn Wing Asp Ser Xle Glu 130 135 140 Be Ala Leu Thr He Leu Lys Glu slu Gln Pro Lys Asp Tyr Val Leu 145 150 155 160
Gln His His Asn He Arg Ser Asn Leu Glu Arg He Phe Val * Lys Val 165 170 175
Pro Glu Pro Trp Val Pro Pro Phe Pro Leu Ser Ser Phe lie Asn Val 180 185 190 Pro Val Val Met Gln Glu Trp Val Asp Asp Tyr Phe Gly Arg Gly Ser 195 200 205 Wing Wing Arg Pro Glu Arg Pro lie Ser lie He Val Arg Gly Asp Ser 210 215 220 Arg Thr sly Lys Thr Met Trp Wing Arg Wing Leu Gly Pro His Asn Tyr 225 - 230 235 240
Leu Ser Gly His Leu Asp Phe Asn Ser Arg Val Tyr Ser Asn Ala Val 245 250 255
Glu Tyr Asn Val lie Aßp Asp He Ser Pro Asn Tyr Leu Lys Leu Lys 260 265 270 His Trp Lys Glu Leu Xle Gly Ala ßln Lys Asp Trp Gln Ser Asn Cys 275 280 285 Lys Tyr Gly Lys Pro Val Gln He Lys Gly Gly He Pro Ser He Val 290 295 3O0 Leu Cys Asn Pro Gly Glu Gly Ser Ser Tyr Lys Asp Phe Leu Asp Lys 681 / 98MX 305 310 315 320 slu slu Asn Arg Ala Leu His Asn Trp Thr He His Asn Wing He Phe 325 330 335
Val Thr Leu Thr Ala Pro Leu Tyr sln Ser Thr Thr Gln Asp Cyß sln 340 345 350 Thr 2) INFORMATION FOR SEQ ID NO: 50: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 21 base pairs (B) ) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (iv) ANTICIPATION: NO (vi) ORIGINAL SOURCE (A) ORGANISM: SHGA221 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 50:
TCATCGTCAG AGßTOATTCA C 21
2) INFORMATION FOR SEQ ID NO: 51: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1062 base pairs (B) TYPE: nucleic acid (C) CHAIN TYPE: simple
P1681 / 98MX (D) TOPOLOGY: circular (ii) TYPE OF MOLECULE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: Geminivirus of golden bean mosaic (B) ) CEPA: Type II (C) ISOLATED INDIVIDUAL: Guatemala (ix) CHARACTERISTICS: (A) NAME / KEY: CDS (B) LOCATION: 1..1059 (xi) DESCRI PTION FOR THE SEQUENCE: SEQ. I D No: 51:
ATG CCA CCA CCT CAA AOA TTT AGA GTT CAG TCG AAA AAC TAT TTC CTC 48 Met Pro Pro Pro Gln Arg Phe Arg Val Gln Ser Lyß Aßn Tyr Phe Leu 1 5 10 15 ACT TAT CCT CGT TGC CCT ATA CCG AAA GAA GAA GTT CTT TCG CAA CTT 96 Thr Tyr Pro Arg Cys Pro He Pro Lys Glu Glu Val Leu Ser Gln Leu 20 25 30 CAG AAG ATT CAT ACA GCC ACG AAT AAA AAA TTC ATC AAA GTC TGT GAG 144 Qln Lys He His Thr Wing Thr Asn Lyß Lys Phe He Lys Val Cys Glu 35 40 45 'GAA CGT CAC GAG AAT GGT GAA CCT CAT CTT CAT GCG CTT ATT CAA TTC 192 Glu Arg His Glu Aßn Gly Glu Pro His Leu Hiß Ala Leu He Gln Phe 50 55 60 GAA GGT AAA TTC GTC TGC ACA AAT AAA AGA TTG TTC GAC CTG GTA TCC 240
Glu Gly Lyß Phe Val Cyß Thr Asn Lys Arg Leu Phe Asp Leu Val Ser 65 70 75 80
P1681 / 98MX TCA ACC AGG TCA GCA CCT TTC CAT CCG AAC ATT CAG GGA GCT AAA TCA 288 Ser Thr Arg Ser Wing Pro Phe His Pro Aßn He Qln Gly Wing Lyß Ser 85 90 95 AGT TCA GAC GTC AAG GCA TAC ATC GAC AAA GAT GQA GTC ACA ATC GAA 336 Ser Ser Asp Val Lys Wing Tyr He Asp Lys Asp Gly Val Thr He Glu 100 105 110 TGG GGA CAA TTC CAA GTC GAC GGC AGA TCT GCA AGA GGA GGT CAG CAG 384 Trp Gly Gln Phe Gln Val Asp Qly Arg Be Wing Arg Gly Gly Gln Gln 115 120 125 TCT GCC AAC GAC TCA TAT GCA AAG GCA TTA AAC GCA GAT TCA ATT GAA 432
Be Ala Asn Asp Ser Tyr Ala Lys Ala Leu Asn Ala Asp Ser He Glu 130 135 140 TCT GCC TTQ. ACA ATA TTG AAG GAA GAA CAA CCG AAA QAT TAC QTC CTT 480 Be Ala Leu Thr He Leu Lys Glu Gllu Pro Lyß Asp Tyr Val Leu 145 ISO 155 160 CAA CAT CAC AAC ATC CGT TCT AAT CTC GAA CGG ATC TTC GTC AAA QTG 528 Gln His His Asn He Arg Ser Aßn Leu Glu Arg He Phe Val Lyß Val 165 170 175 CCG GAA CCA TGG GTT CCT CCA TTT CCG TTG TCA TCA TTC ATC AAT GTT 576 Pro Qlu Pro Pro Val Pro Pro Phe Pro Leu Ser Ser Phe lie Asn Val 180 185 190 CCG GTT GTT ATG CAA GAA TGG GTT GAC GAC TAT TTC GGA AGG GGT TCC 624
Pro Val Val Met Gln Glu Trp Val Asp Asp Tyr Phe Gly Arg Gly Ser 195 200 205 GCT GCG CGG CCG GAA AGA CCT ATT AGT ATC ATC CTC GAA OOT GAT TCA 672
Wing Wing Arg Pro Glu Arg Pro W hese He He Val Glu Gly Aßp Ser 210 215 220 CGA ACC COA CAC ATG TGO OCT COT OCA TTA GGA CCA CAT AAT TAT 720 Arg Thr Gly His Thr Met Trp Ala Arg Ala Leu Gly Pro His Asn Tyr 225 230 235 240 TTG AGC G3T CAT TTC _AC TTT AAT TCA COT GTC TAT TCC AAC OCA OTO 768 Leu Ser Gly Hiß Leu Aßp Phe Aßn Ser Arg Val Tyr Ser Asn Ala Val 245 250 255 GAA TAC AAC GTC ATT OAT _AC ATA ACC CCC AAT TAT TTC AAO TTA AAG 816 Glu Tyr Asn Val He Aßp Asp He Ser Pro Aßn Tyr Leu Lys Leu Lyß 260 265 270
P1681 / 98MX CAC TGG AAA GAA CTA ATT OßQ GCA CAA AAC C3AC TOO CAA TCT AAC TOT 864
His Trp Lys Glu Leu He ßly Wing Gln Lys Asp Trp Gln Ser Asn Cys 275 280 285 AAA TAT ß_A AAO CCG OTT CAA ATT AAA COA GGA ATA CCA TCA ATC OTG 912
Lyß Tyr Gly Lys Pro Val Gln He Lys Gly Gly He Pro Ser He Val 290 295 300 TTG TOC AAT CCA CGT C3AG ßßT TCC AGT TAT AAA GAC TTC CTC GAC AAA 960
Leu Cys Aßn Pro Gly Glu Gly Ser Ser Tyr Lys Aßp Phe Leu Aßp Lyß 305 310 315 320 GAA ßAA AAC CGA GCT TTA CAC AAC Tßß ACT ATT CAT AAT ßCO ATC TTC 1008 ßlu ßlu Aßn Arg Ala Leu His Aßn Trp Thr He His Aßn Wing He Phe 325 330 335 GTC ACC CTC ACA GCC CCC CTC TAT CAA AOC ACA ACA CAO sAT TCC CAA 1056
Val Thr Leu Thr Ala Pro Leu Tyr Gln Ser Thr Thr Gln Aßp Cys Gln 340 '345 350 ACO TAO 1062
Thr
2) INFORMATION FOR SEQ ID NO: 52: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 353 amino acids (B) TYPE: amino acids (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: protein (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 52: Met Pro Pro Pro Gln Arg Phe Arg Val Oln Ser Lys Asn Tyr Phe Leu 1 5 10 15 Thr Tyr Pro Arg Cys Pro He Pro Lys Olu slu Val Leu Ser Oln Leu 20 25 30 sln Lys He His Thr Ala Thr Asn Lys Lys Phe He Lys Val Cys Glu 35 40 45 Glu Arg His Olu Asn Gly Glu Pro His Leu His Wing Leu He Gln Phe 50 55 60
P1681 / 98MX Glu sly Lys Phe Val Cys Thr Asn Lys Arg Leu Phe Asp Leu Val Ser 65 70 75 80 be Thr Arg Ser Wing Pro Phe His Pro Asn He Gln Gly Wing Lys Ser 85 90 95
Be Ser Asp Val Lys Wing Tyr He Asp Lys Asp Gly Val Thr He Glu 100 105 110 Trp Gly Gln Phe Gln Val Asp Gly Arg Be Wing Arg Gly Gly sln Gln 115 120. 125 Ser Wing Asn Asp Ser Tyr Wing Lys Wing Leu Asn Wing Asp Ser lie Glu 130 135 140 Be Ala Leu Thr lie Leu Lys Glu Glu Gln Pro Lys Asp Tyr Val Leu 145 150 155 160
Gln His His Asn lie Arg Ser Asn Leu Glu Arg lie Phe Val Lys Val 165 170 175
Pro Glu Pro Trp Val Pro Pro Phe Pro Leu Ser Ser Phe He Asn Val 180 185 190 Pro Val Val Met Gln Glu Trp Val Asp Asp Tyr Phe Gly Arg Gly Ser 195 200 205 Wing Wing Arg Pro Glu Arg Pro He Ser He He Val Glu aiy Asp Ser
210 215 220 Arg Thr Gly His Thr Mßt Trp Wing Arg Wing Leu Gly Pro His Asn Tyr 225 230 235 240
Leu Ser Gly His Leu Asp Phe Asn Ser Arg Val Tyr Ser Asn Ala Val -. 245 250 255
Glu Tyr Asn Val lie Asp Asp He Ser Pro Asn Tyr Leu Lyß Leu Lys 260 265 270 His Trp Lys Glu Leu He Gly Wing Gln Lys Asp Trp Gln Ser Aßn Cys 275 280 285 Lys Tyr Gly Lys Pro Val Gln He Lys Gly Gly He Pro Ser He Val 290 295 300 Leu Cys Asn Pro Gly Glu Gly Ser Ser Tyr Lys Asp Phe Leu Asp Lys 305 310 315 320
Glu Glu Asn Arg Ala Leu His Asn Trp Thr He His Asn Ala Xle Phe
P1681 / 98MX 325 330 335
Val Thr Leu Thr Ala Pro Leu Tyr Gln Ser Thr Thr Gln Asp Cys Gln 340 345 350 Thr 2) INFORMATION FOR SEQ ID NO: 53: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 22 base pairs (B) ) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) HYPOTHETIC: NO (iv) ANTICIPATION: NO (vi) ORIGINAL SOURCE (TO) ORGANISM: SHGA228 (xi) DESCRI PTION FOR THE SEQUENCE: SEQ. I D No: 53: GAACCCOACA CA_AAtstßs sc 22ION FOR SEQ ID NO: 54: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1062 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: circular
P1681 / 98MX (ii) TYPE OF MOLECULE: DNA (genomic) (iii) HYPOTHETIC: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: Golden bean mosaic geminivirus (B) CEPA: Type II ( C) INDIVIDUAL ISOLATED: Guatemala (ix) CHARACTERISTICS: (A) NAME / KEY: CDS (B) LOCATION: 1..1059 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 54: ATß CCA CCA CCT CAA AßA TTT AßA GTT CAG TCG AAA AAC TAT TTC CTC 48 Met Pro Pro Pro ßln Arg Phe Arg Val ßln Ser Lys Aßn Tyr Phe Leu 1 5 10 15 ACT TAT CCT CGT TGC CCT ATA CCG AAA GAA GAA GTT CTT CTG CAA CTT 96 Thr Tyr Pro Arg Cyß Pro He Pro Lyß ßlu ßlu Val Leu Ser ßln Leu 20 25 30 CAß AAG ATT CAT ACA CCC ACO AAT AAA AAA TTC ATC AAA GTC TGT GAß 144 Gln Lys He His Thr Ala Thr Asn Lys Lys Phe He Lys Val Cys slu 35 40 45 OAA COT CAC (SAO AAT GGT GAA CCT CAT CTT CAT GCG CTT ATT CAA TTC 192 slu Arg His slu Asn Gly Glu Pro His Leu His Ala Leu He ßln Phe 50 55 60 ßAA OOT AAA TTC GTC TGC ACA AAT AAA AGA TTG TTC OAC CTG GTA TCC 240 Glu Oly Lys Phe Val Cys Thr Asn Lys Arg Leu Phe Asp Leu Val Ser 65 70 75 80 TCA ACC AGG TCA GCA CCT TTC CAT CCO AAC ATT CAO OOA OCT AAA TCA 288 Ser Thr Arg Ser Wing Pro Phe His Pro Asn He sln ßly Wing Lys Ser 85 90 95
P1681 / 98MX AGT TCA GAC OTC AAO OCA TAC ATC GAC AAA OAT C3GA < 3TC ACA ATC GAA 336 Ser Ser Asp Val Lys Wing Tyr He Asp Lys Asp sly Val Thr He Olu 100 105 110 TGG GGA CAA TTC CAA GTC GAC GGC AGA TCT GCA AGA GGA GGT CAG CAG 384 Trp Oly sln Phe sln Val Asp sly Arg Be Wing Arg Gly Gly sln 115 120 125 TCT GCC AAC CAC TCA TAT GCA AAß ßCA TTA AAC GCA GAT TCA ATT GAA 432 Ser Ala Aßn Asp Ser Tyr Ala Lys Ala Leu Asn Ala Asp Ser He Glu 130 135 140 TCT GCC TTO ACA ATA TTß AAß ßAA ßAA CAA CCß AAA ßAT TAC ßTC CTT 480 Be Ala Leu Thr He Leu Lys Glu Glu Gln Pro Lys Asp Tyr Val Leu 145 * 150 155 160 CAA CAT CAC AAC ATC CGT TCT AAT CTC ßAA CG ß ATC TTC OTC AAA OT ß 528 Gln His His Asn He Arg Ser Asn Leu ßlu Arg He Phe Val Lys Val 165 170 175 CCG GAA CCA TGG GTT CCT CCA TTT CCG TTG TCA TCA TTC ATC AAT GTT 576 Pro Olu Pro Pro Val Pro Pro Phe Pro Leu Ser Ser Phe He Asn Val 180 185 190 CCß ßTT GTT ATG CAA ßAA Tßß ßTT ßAC ßAC TAT TTC ßßA Aßß ßßT TCC 624 Pro Val Val Met ßln ßlu Trp Val Asp Asp Tyr Phe ßly Arg ßly Ser 195 200 205 GCT ßCß CßG CCG GAA AGA CCT ATT AGT ATC ATC GTC QAA ßGT ßAT TCA 672
Wing Wing Arg Pro ßlu Arg Pro He Ser He He Val Olu sly Asp Ser 210 215 220 COA ACC ßGA AAG ACA ATG TGG GCT CGT GCA TTA GGA CCA CAT AAT TAT 720 Arg Thr sly Lys Thr Met Trp Wing Arg Ala Leu sly Pro His Asn Tyr 2 5 230 235 240 TTG AGC GGT CAT TTG GAC TTT AAT TCA CGT GTC TAT TCC AAC GCA GTß 768 Leu Ser sly His Leu Asp Phe Asn Ser Arg Val Tyr Ser Asn Ala Val 245 250 255 GAA TAC AAC GTC ATT AGA GAC ATA AGC CCC AAT TAT TTG AAG TTA AAG 816 slu Tyr Asn Val He Arg Asp He Ser Pro Asn Tyr Leu Lys Leu Lys 260 265 270 CAC TCG AAA GAA CTA ATT GGG GCA CAA AAG GAC TGG CAA TCT AAC TGT 864 His Trp Lys Glu Leu He Gly Wing Gln Lys Asp Trp sln Ser Asn Cys 275 280 • 285 AAA TAT GGA AAG CCG GTT CAA ATT AAA GGA GGA ATA CCA TCA ATC GTß 912 Lys Tyr sly Lys Pro Val sln He Lys Gly Gly He Pro Ser He Val 290 295 300
P1681 / 98MX TTG TGC AAT CCA GßT ßAG GßT TCC AßT TAT AAA ßAC TTC CTC GAC AAA 960 Leu Cys Asn Pro Gly slu sly Ser Ser Tyr Lys Asp Phe Leu Asp Lys 305 310 315 320 GAA GAA AAC CGA GCT TTA CAC AAC TGG ACT ATT CAT AAT GCG ATC TTC 1008 ßlu slu Asn Arg Ala Leu His Asn Trp Thr He His Asn Wing He Phe 325 330 335 ßTC ACC CTC ACA GCC CCC CTC TAT CAA AGC ACA ACA CAG GAT TGC CAA 1056 Val Thr Leu Thr Ala Pro Leu Tyr Gln s < r Thr Thr Gln Asp Cys sln 340 345 350 ACG TAG_1062_Thr
2) INFORMATION FOR SEQ ID NO: 55: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 353 amino acids (B) TYPE: amino acids (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: protein (xi) DESCRI PTION FOR THE SEQUENCE: SEQ. ID No: 55: Met Pro Pro Pro Gln Arg Phe Arg Val Gln Ser Lys Aßn Tyr Phe Leu 1 5 10 15 Thr Tyr Pro Arg Cys Pro He Pro Lys Glu Glu Val Leu Ser Gln Leu 20 25 30 sln Lys He His Thr Ala Thr Asn Lys Lys Phe He Lys Val Cys Glu 35 40 45 Glu Arg His Glu Asn Gly Glu Pro His Leu His Wing Leu He Gln Phe 50 55 60 Glu Gly Lys Phe Val Cys Thr Asn Lys Arg Leu Phe Asp Leu Val Ser 65 70 75 80
Being Thr Arg Being Wing Pro Phe His Pro Asn He Gln Gly Wing Lys Ser 85 90 95 Being Ser Aßp Val Lyß Wing Tyr He Asp Lys Asp Gly Val Thr lie Glu 100 105 -10
P1681 / 98MX Trp sly Gln Phe Qln Val Asp Gly Arg Ser Wing Arg Gly Gly Gln Qln 115 120 125 Ser Wing Asn Asp Ser Tyr Wing Lys Wing Leu Asn Wing Asp Ser He slu 130 135 140 Ser Wing Leu Thr lie Leu Lys Glu Glu Gln Pro Lys Asp Tyr Val Leu 145 150 155 160
Gln His His Asn He Arg Ser Asn Leu Glu Arg He Phß Val Lys Val 165 170 175
Pro Glu Pro Trp Val Pro Pro Phe Pro Leu Ser Ph Phi lie Asn Val 180 185 190 Pro Val Val Met sln Glu Trp Val Asp Asp Tyr Phe Gly Arg Gly Ser 195 200 205 Wing Wing Arg Pro Glu Arg Pro He Ser He He Val Glu Gly Asp Ser 210 215 220 Arg Thr sly Lys Thr Met Trp Wing Arg Wing Leu Gly Pro His Asn Tyr 225 230 235 240
Leu Ser Gly His Leu Asp Phe Asn Ser Arg Val Tyr Ser Asn Ala Val • 245 250 255
Glu Tyr Asn Val lie Arg Asp lie Ser Pro Aßn Tyr Leu Lys Leu Lys 260 265 270 His Trp Lys Glu Leu He Gly Wing Oln Lyß Asp Trp Gln Ser Asn Cys 275 280 285 Lys Tyr sly Lys Pro Val Gln lie Lys Gly Gly He Pro Ser He Val 290 295 300 Leu Cys Asn Pro Gly Glu sly Ser Ser Tyr Lys Asp Phß Leu Asp Lys 305 310 315 320
Glu Glu Asn Arg Ala Leu His Aßn Trp Thr He Hiß Aßn Ala Xle Phe 325 330 335
Val Thr Leu Thr Ala Pro Leu Tyr Gln Ser Thr Thr Gln Asp Cys Gln 340 345 350 Thr 2) INFORMATION FOR SEQ ID NO: 56: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 20 base pairs
P1681 / 98MX (B) TYPE: nucleic acid (C) TYPE OF 'CHAIN: simple (D) TOPOLOGY: linear (ii) TYPE OF MOLECULE: other nucleic acid (A) DESCRIPTION: / desc = "Oligonucleotide primer" (iii) ) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE (A) ORGANISM: SHGA262 (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 56: ACOTCATTAO AGACATAACC 20
2) INFORMATION FOR SEQ ID NO: 57: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 198 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN: simple (D) TOPOLOGY: circular ( ii) TYPE OF MOLECULE: DNA (genomic) (iii) HYPOTHETIC: NO (vi) ORIGINAL SOURCE (A) ORGANISM: Geminivirus that infects the tomato of Guatemala (TGV-GA1) (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 57:
P1681 / 98MX AGATGAAGCT ATTGAAATGC TTCAAAATCT GCCATGQTCA QTCGTCAAAC CAACGTACAT 60
ACGAGTCGCC AGAGAGQAAC ACGCAGATGß ATTTCCGCAC CTCCACTGTC TCATCCAACT 120
CTCCGGGAAG TCCAACATCA AGGATGCTAG ATTTTTCGAC CTCACTCACC CAGAAGOTCT 180 sCCAATTTTC ATCCAAAC 198) INFORMATION FOR SEQ ID NO: 58: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 379 base pairs (B) TYPE: nucleic acid (C) TYPE OF CHAIN : simple (D) TOPOLOGY: circular (ii) TYPE OF MOLECULE: DNA (genomic) (iii) HYPOTHETIC: NO (vi) ORIGINAL SOURCE (A) ORGANISM: Geminivirus that infects the tomato of Guatemala (TGV-GA1) (xi) DESCRIPTION FOR THE SEQUENCE: SEQ. ID No: 58: AGGGTTCCGT (3sCATTTTTs CAAATATGAG CCAGGACACC AGGGGsAsCT CTCTCTAAAA 60 CTTTATTTTG CTssTGTCCT GGTGTCCCAT TTATACTAAA ACCCTCTTGG GGACACCAAG 120 sßCAAATTCs ßCCATCCßCA ATAATATTAC CGGATGCCCG CGATTTTTTT TGGACCTGGC 180 CCACTATCAG AAATTsCGTT GGGCCTTTCT GGATAAGTTA ACCAATCAAT ACACGTTTGG 240 GTAßTCTAAT TATTACAACT TGGTCACCAA GTTßTTTTAT GßTCTATAAA TTTGTCGTTA 300 TGTGTGTßßT CCAACCACßT AAATATTGAT AATGCCTAAß CßTGATCCCC CATßßCßCTT 360 AATGGCGGGT ACCCTAAGs 379
) INFORMATION FOR SEQ ID NO: 59: (i) CHARACTERISTICS OF THE SEQUENCE:
681 / 98MX
096 XVXWVOODX ODDVDWODV OOWXVXXO XXOXVXXWX WXXDDDVXX OXXXDWOW
006 DXDVXXVODD VDWDD VD XXDDOVXDVO VDDXDDDDVO XDXDWDOXV DDDXDWDV
Ot-8 DXXDXDDVOX VXXDOXVDVD VOYODXVXW DWXXDXOVX ODXOVODVXO OD-DOWOVD
08 ¿XXIDXVXWD XXDXDDVDVD OXXXOVDWO VODDWOVDX XDXODXODXO OOVDXOOXXD
0_ ¿XXOXXXXXXD XVDXDXOVOV VDOVXVDXW DWOOVD W XVXWWDOV DDXVDDXVXV
099 OWDDODXXX XDDVXXXDXO XDWXXOXDX XXXXDODWX ODXXDVDVXV OOOVDXODOO
009 DDWODXDDX D DXDXVOXO VXXVXOXDXV DXDDWDDV XVXVDDXDVX DXVODVDVDV
Ot-S ODXOVOXXXD DXOVODXDDV VXDXVOOODD WDXOXVODV DVXOOOXOXV OVOOXDWOV
08 * DVXOXVODVD VXDXVDOVDD OD WDVDW OXVOOODDVO WDODDDXVO OWWDDVW
0 £ * DDVDXODODD XOXXVDDDDD DXXDXDDXDD DDVDXOXVXV DDDOVDVODX XDWDXDXOD
09 € DDDXDDVXDD WDDXXDDD DXOVOOXXXV DXWXVXVDV DDVODVDDV VDOXDXVXXV
OOE WDVXDDDDV XXVXXDVOXX XWXXXYDDD VXXWXXXVD DVXXDDVXXV DVDVDDDDXD
0VZ XXVXDDVDW WDVXDVXXV VDSXDXVX-V VXXDVDDVXX DXVXDXVDVX XXXDDDVDXX
08 DDD VDXDD XXXDVD0WV XXOXXVDDXV OODXDXVXDV DDXXXXVXDX XVXOWDDVD
OZX DDDXXDVWX XXVXDDDDXD DXDXXXXOXX WXXDDW-0 XDDDDDVDXV DVDXWDDW
09 XDDXVDVODV VOVOXDWXD VD03WDIDD DDDODXDXXX XXXXXVDDDD DDDXVDDDV
A X O ai Xas: ¥ i? Nanoas vi v_vd Noiodiposaa (t?) (IVS-? OI) ETPUI eT TP ms I = S ^ HUIO:]. ? p - Coq _? ? p o ^ u? tuipiio-ru? s sn-tt? tutui? g: 0WSINV3_0 (Y) IVNISIHO aiNana (J?) ON: 00I! 3! 0dIH (TTT) (ootuiou? D) NQV: v _O03 _0W 30 Odll (TT). Thinojto: vi90 _0_0_. (a)? d? ts:? N3_VD 30 Odll O) oot? onu optoe rodil (3)? s_q? p s? _? _ d f ^ i? : _O_.I9N0_ (V)
39t
OZS- VOOVDOXXXV OOWXXXXXD XV-WDVOVD OXVDVOXVXX XWXWVXVO XOXOVXDDDD 09 * 3 XXDXDXXXOX DDIXDXVDDD VDVXVDDXVX VDDXVDDDDV OXVDDXXWD XXDDDVXXXV oot-and DDXXD DDD XXXODDDXDX DXWDDXVD VOXDVDXODD VXXVXXDVDD XDDDXXWDD or * ez XVODXXXDW DDXDDDXDDV XXDDVDXDDV DDDXVDVOXX DVDDVXOXVD DXDXXDDXOX
08ZZ DVDXWXDXV ODXOVDDDV DXDV OXDX VDDXVODXXD XVDVDDXDDX DDXDOVDXXD ozzz XDXDXOQVXX VDXVDDVDDD DDXDXDDDV VXXXVDDXDD WDXDDVDXX DDXDD WXD
09XZ DXXVXXWXD DDXXXXXOVD DDVXXXDXW WX WDX XVXVDXVXXV WXXXXVDVX ooxz IVWXDIVID DXWVWXOX DDVODXXVDD XDDWVXVDD WDVDDWW VOWDWDW-oe heard OOVWDXVDX XOWDDXDXV DXXOWDXXO XDVODDDVDD ODXDXXDDVD DVQ1DDVDOD m 086X ODODV OXD DDDDDVXXDV XVXDVDXWD XDDDVDXDXO VXDDXDDDDD XXDXOVXVDV
0-61 DDDDDDXDXVD XDVDDDXDOX DXVXXWXW VDVDVDDVDX VOVXDXDDW XDDDDXXXDD 098X VDVXDXDVXX VDXVDODVDD VXVXXODVXX WDX DXDDV XDXODDDXXO VXDOVXXXDD
0081 XOVWXXDDX XWDXVDDDO DDDDXXXOOD XOVDDDXXVD XXXVXDXXXD VXDXXVXXDD or * t OXDWDXXXV VXXXDDDDDX XWDDDXOXX VDWDDVDVD DXXVODXDDX DODXXDVDDX
089T ODVXVXXXDX XWDWOOXV OXXOXXXXDX XWOVXXXOV XXXXWO DD ODVXVXXXDX 0Z9X XVDDDXDDW DDYDXDDDYD DXXDDDDDXD VOVXO VOXDD VXODXXVOXX XDVDDXDDXD
XDDDVXODDX XDXDDDDXD OXDXDOVDDX DDXDWDXVD VXDDOVDDDDDWLRQVWDWXD OOT WOXVXDDXV VXDDDVDDDXDDXDDXVOOX WDXD3DXDX DDOOXXDVOX VDXDVDOXDD
OM-t VDXXDDXDXD XXVDDODVDV WXVDVDDDX DXWOXXXXV DDDDVXVXW • VDXXDXVXXD Osset XDXDDXVDXD XXOODDOOXV VDDVOXVOXX DDXXDXDXVD XVDXDDDXDO XXDVXDXDW Özet DXXVOXDXXD DVDXDDXXXD VDWDDDDXV XDXDXXXVDV WDOVXXVOV XODDXXVDDD
09et DXVDXDDVDX DXODDVDDDX DVDDVDDDW DWXXDODVX VXXDXDDD XXOVODWXX oozt OVXDWXXXV XXVWXDDXO WXVDXDVXV VXXDXXVOVX XVDXOXDDDD WOXWXVOV or * xx XWXVDDXXV DVXWXDXDD XODWDVX? X YDVXOXDDXX XOXVXWDXX VXXXXWDXX oßot YXYWXWXX WDDXVXDVD XXVDXVXXXX XVXDXVODDV XVOWDXXXD VXODXVXDXO oeox XDDOWXDXD DODVDXDVXO XDDDDXVXVX DXXOXVOXXV ODXWWDXD VXVDXWDYD
e9t XW86 / t891 < -
09e DDDXWWW DDXDVXVOVD DDDDXODXOV DXDXXVXDXD DDDVDXDXW XXXDXDXVXX
00 XDXDXVDDDX DDXODXXWX WVDXWWD XOVDXDDXXV OXDXVDDDDD DVDVDXXDVD '
O? -Z VXDXVXWVD VDODWODVD XDOXXDDVOV DDVDDVXVXX XVDDXODVDD XVDVDDXVW
08T XVXVOVWXX WDXXXXVDD DDXXDDDDDX VDDDDDXVDD DO WXXVDD DDDDOXDDVD OZX DXXDDVWOO ODODVDDVXX WXWDXODV DDOXODDDVX VDDDDDDVXX XWODXVXXD 09 DDODDVDDW XXXWXXDXD DDDDXVDYXD XDDXXDDDDX DDXVXVXVDD DXDDXDDWV
A9: ° ai "Das: YI3N30.D3S ¥ 1 Y_Yd NOIOdlHOSaa (t?) E07t" "02st: NOIOVZI_VOOI (a) ojn?? Sut: 3? Y_0 / 3_aW0N (V): YDI! SI_3_, DYHY0 (xt) (outu t j s _ tts s_u or .utg) OOTXTW p ^ -??????.???? io i P ^ T nb snat OTJ tuturao AWSINVSHO (V) TVNISI O aiNana (A) ON AOIíaiOdlH (TTT) ( ? ootuiou b) ^ IflOaiOR NOV 30 O ll (TT); ri2tnD to VlOOlOdOl (a) lduits: VN30 ¥ 0 30 OdLL (D) oot ?? onu optoE rodil (a) IPD ps eo seq.????? l: an_.I9N0_ (V): VI0N3O03S ¥ 1 30 SY0I_, SI-3¿_DYW0 (T) A9: ON 01 Das VI Y_Yd NOID -? - OaNI (z? tLZ XXVX WXVXDDOXV DDDDDDWW VDDXDODVDX DDXWWDXX
00_Z WXDV DDD XXXWXOIXX XXVDDDX V XDDDXODOOV VXVXXXVDOD VDDDDDDXDV
0V9Z DDDDDXWDD -DXDXDWW DXOVOVODDD XXWDXXVXV DOOVODVXW XXOVD WW
08SZ O OWXXVOD DXXXXXVX? D WODWXOW XVDDDXXXVO OODXXVXDDD XXXDXXDXXX
fr9T.
XM86 / I89Td
ON AOIISIOd lH (TTT) (? Ootuiou fj) NOY: YinOSTOW 30 OdLL (TT) _retno_rto YlSOlOdOI l (a) tdu ts I YN30Y0 30 OdLL (O) oot Ionu optoe: _ >?? ai_. (9)? Seq? P s? J _d 08 _ AniIONOT (Y): YI0N3Í_D3S Yl 30 SYOIISIHSIOYHYO (T) A9 AN 01 03S VI YHYd N0I0YWH03NI (Z eOfrX XDX X DXVDDXX VDDDDDDD
08et XXVWOXXDV DDWVDODXV WXOXOWOO XOVODDXOVD DDVWDDDDe DDXXVXXODV
OZ t DWVDDXDDV WDXXDWVX OXVXXVDODD DVDXVXVDXV OXXVDXDOW XVXOVDDXDX
09ZX VDDVWDXXV XDXDW DX XWDXXXVOX XDDVDXDXDV DXXXVXXWX VDVDDDODVX
00ZI XVDOXDDVDD ODXDXWDVD WDDDWDDD OXXVDDDDDV OXXVDXDXXV XDVXXVDDOV
0 «I DVXVDVDODD ODDDXODOOX XOVODOXDDX XOXVXWOXO DXXDDDXXOV WODXVXVDD
080X VDXDDXXDDV VXWXXXDDX DDXXXWDDX VXXDDXDDXD VOOXVDDWV XDDXDDXDDV
OZOX WDXXOXDDD DDVODXVDW XDXDWXXOD WXVDXVXDD VDXDXXVDXI DDWDWDDX
096 VDVDDXDDV VXVXXVDVDD XDDDDDVDVD VDDDXOWDD YDDXWVXOX DODWXVDV
006 XDXODVODVO WXWVDDV WDDXXODDD DDOXXXDVW DVXOYWDDX DXWDDDXXV
0 * 8 XXVDDODXVD VWXDVDXVX VXXWDVWX DDDDXVWDV DVD VXDDD DDDVDXXDDV
08WDDXVDXXX OVDDDDDXDXVDDDDDDDV DXDVXXVDVD DXXDXXODDX DDXVDWVDX
OZ? VDWDDW VDXDWXDD VDDXVDXDXO XDVDDIIDVD ODDXXXDDOX VDDDIIVDDV 099 DDVDODDVOX VXOODWXVO VXVXVOOOW WXDDXOXDX DDXVWWXX ODXOVDDXDD
009 XVWODXDXD DWOXVDDW YDDIVIVDD OXWDVDDXV XVDWXXDXX XXVDVWWX
0 * S DXWDVXXXD DXXXVDXOW VDDDDDVXDV ODDXWWDD DIIIVIVDDV DVXDDXDVDD
08 * XDDXDXDVDD OVDXVXXXXO ODVXOVXVDV WDVDDXDVD OVDDXDVDDO WXXXVXDVX
OZ * DXXOXOXDXD DDODXXOVDD XDDDDOXXXX DDDDDDXVDD VXVXXVXWX DODDIVDDDD
S9T- XW86 / _89.a
0 * S DVDDODXDDV DVDDXSDDV VXXDVXXDDV VODDXDXDDD XDDDVDXWD DXVXDDXVXX
08 * XXDVXXDXVD VWXXWOXX XXVXXXODVD VXDOOXVXDX VXXDDVXXDD DOVDDDDDXV
OZ * ODDOVXXVXV VXVXXODDXV DDDDVDDXDD VDVXDDXXDX VDDVXDXVXX OVDDDXVXVX
09 e OVXVXVXWD DDVXDOXXVX DXODDXVXDX XVXXDDVOXD DWXXXDXDD VODXXVODDV
00 e DVXDXDDVOV DWXWXOXX XVXVDDDXW XDDVXXXDDX VWODWVXX XVDDDXXDXX
0 * Z WXVWDOVD XDDVXVOODO XXVDWDDXV WODXXXDXX DXDDDVOVOD DVOXXOVXDX
08X XXXDOVDVOD XODDDDDVXX XOXXDXXDVX DXVDXXXXW VDOXXXDXXV OVDXVDXXDX
OZX VDDDDXXOOV DXVDVDDXDD VDDWXWDX DWDDXXDDX XXXVXDDXXV DDXDDXXDXX 09 DXDXWDYW DXDDVDDVXV DDVDXXDDXD XVDXDDXDXY WDDXVODDX XDXWOXDDD A9 = ° N d -03S: Y_ YI0N3n03S YHYd NOIDdlHDSSO (p E661: YH033 () tezz-_2Z3 AYNIOYd (3) PL: Nawn_0 (a) • I TOJTA -JTUT9 T:? _, 30Y0 (0) • oo? .s_nq o ^ u? Tuitd i? P srut? T_ pn .t? T? Ts _un?? Uoo OOTXTH '_o? _uts sp? ._u? O ^ T ^ _q.??jut? Nb sn_rt? Tutui? ß o ?? nu un? Ptou? No? S roiülll (a) _? .ueuieq.sng-'ea - ?? tH 1 Ati? _i-i. sa BJTIITH op uzt,?, - uozj: _9 I Oo? qoPd-s? jo,: saHOIflY (Y) NOioYopand sa OiovwHoa í (x) OOTXTH AoiHuts: _Ynai? IONI OOYISIY (0) ooeq.s_nt [oq.u ? tu? td? _ _r _- [you? ts? .eu? o ^ TT e? jut? nb sn_rt? tutui? 9 AWSINYSHO (Y) TYNIOIHO 3IN3O3 (T?)
991 XW86 / T89.a
08 * DXD-DX XVO DOODWXDOO VOXX xXDXX OXXOOXDOOV OXXXXDVDDD VOXXVDWVO
0Z * DXDWXXDDQ OODDXYX W XXDVDXODDD OXXDXOOWD XXVDWXXDO OWDDDWXX
09e XXVDOXXXDX WWXXDWX XVXVXDDXOD VDDVXDDXW DDDDVDXDVD WXDXDXVW
00e DYDDDXXXW DXVDXXDDVX DXVXVYDXVD YOWXVODDD OXWXOXXXX VDVDDXDDXV
OfrZ XDYOXVWVO X? ODXVXOOD VODOXVOXDX VWWXX0VD WDXVDOOW WDWDXDW
081 DDWDVDDDD DDDXWXDDX VDXXXXDDDY VXXDDDDSXX VDVDDDWVX XXXDDXVXXV
0ZT VXDDVDDXW XDXXDDXODV DDXDDXVXSD XVXXDDDXDD VOVXXXDDVX DVDDXXVXOX
09 VXWXXXWO OWXVWOVX XWODVXXVX WXXXXVDDX XXVXVD W OXXVXVXXXX
A9 = ON 01 O3S: YI0N3fl03S VI YHYd N0IDdI_DS30 (Tx) OOTXTH _o? Euts ^ vnOI? IONI OdVISIY (O) oo? ^ S.nq oq.u? Tu? Td? _ _r_? Tu? Ts? -Huioq. XT p .o? Jut? Nb sn t? Tutui? O: 0WSINY9H0 (Y) TYNIOI-O aiNana (T?) ON ADIJ-SlOdlH (TTT) (ootmou? ß) NOY rviODa'IOW 30 OdII (TT) iE- [no? to:? i30T: 0d0_, (O) ?? du? ts:? N30Y0 30 OdII O) oot ?? onu optoe odil (a)? spq? ps? -ied 9? 2? : 0O1I3N01 (Y): YI0N3Í03S VI 30 SY0II, SIH310Y_Y0 (?) A9AN 01 03S VI Y-Yd N0I0YWH03NI (Z
OBL VXOOXVXXXV ODOOOODWD 0V0VX0XW0 0DD0XWXX0 DOXXODVOXO ODDDVODVXO
0Z_ VXOVDDDDD DXVDXVXXDX DDDDDVDXXV VXWDDDXDV VDDDDVXXVD WDDDDVDDD
099 DDDDDWXXV DDODXIDDXD DXVDXODDW XDDDXWWD XXWXXIDDV-XVXVDDXDDX
009 DWXVDVDW XXVWOXXW VXVXXXDDVD XDXDVDXXVX XXDVDDVDXV XDXDVXOWD
_9T XW86 / I89ta
: £ 9: 0N 01 03S VI YHYd N0IDYWH03NI (Z
9XZX ODX D XDDVDXDDXV
OOZI OXXYDVDXXD DWDDDDXW XDXDDDDDXX DXXVVXXX XDDDDDVDXX VDXVDDDOXV
0 * XX DVXDXXDDXD DXVXDDXXOD XOVDXOXDXX XVDDDDXDVD DDXXDDXOXV XDDDDXWXD
080X XVDDDVXVXX XDXYDWOXV DXDWXDDDD DXXDVDWDV XDXODDXDDV DX DXDDXV
OZOT DXOXXDWOO VDVXXDXDOX XDDDXDVDXX XVXVXXDVXV OVDDWOODO DXXWXXXVD
096 XXDWDXXDD XDYXODDVDD XVDXXVOVXV VXXDXDVXXV VOVXXVDDDX DXOXODDVXX
006 DVDOWOXXX VXXXVXVXW XWWOVDXD XXXXVXXXDX WVDXDDXXX DVDDDDXXDV * 8 WXWXWVD VDXIDDliDX VXOVOOiXiV DiüVOXVXD XXIVOVOOXV ODOWXOXOi 8¿? XOOXVODOI OXVDDXODXX VXXVXXIDVX IDXIOOOOW VWXDWXVX WOVDXVXOX ZL DODDOWXDX VOXVOXXOOX VODXOOVODV VXXXVOVOOX DDXXOXXWX XXXDODDOXV 99 VXOXDXOODX DOVDOXVDOV VOVXOXVD? X OOXVOXVIOV OVDDWDVDX XOOOXOXVXX 09 DYDVOWDXX OXDXVOXDOX XVOVXOXXDD OWDOWDXX OOXXOOOOVX XDXODVXXOV * S VXODXVOXDV XVOXX DW ODXODXXIVX XDVDXVOXXX VOVOVODXXO WODDXOVOO
89T.
Claims (14)
- CLAIMS: 1. A transgenic plant comprising chromosomal DNA, the plant harbors the DNA of the geminivirus integrated in the chromosomal DNA - the DNA of geminivirus codes for a protein required for the replication of the geminivirus - and the geminivirus DNA confers the resistance to the Viral infection.
- 2. Transgenic plants according to claim 1, wherein the DNA of the geminivirus is wild-type DNA.
- 3. The transgenic plant according to claim 1, wherein the DNA of the geminivirus comprises an ORF selected from the group consisting of ACl and Cl. The transgenic plant according to claim 2, wherein the DNA of the geminivirus comprises the DNA coding for an amino acid sequence selected from the group consisting of FLTYpxC; pHlHvliQ; vKxYxdKd; FHPNlQxak; EGx2RTGKt; and NviDDi. 5. The transgenic plant according to claim 1, wherein the geminivirus DNA is a transdominant interference mutant of a geminivirus gene. 6. The transgenic plant according to P1681 / 98MX claim 5 in which the DNA of the geminivirus comprises an ORF selected from the group consisting of ACl and Cl. The transgenic plant according to claim 5 in which the geminivirus DNA codes for a selected sequence portion. of the group consisting of a DNA cut domain and an NTP binding domain. The transgenic plant according to claim 7, wherein at least one mutation region of the transdominant interference mutant of the geminivirus DNA codes for an amino acid sequence comprising FLTYpxC; pHlJvliQ; vKxYxdKd; FHPNIQxak; EGx2RTGKt; and NviDDi. The transgenic plant according to claims 7 or 8- wherein the DNA of the geminivirus consists of at least one mutation of FLTYpxC; NpHlJvliQ; vKxYxdKd; FHPNlQxak; EGx2RTGKt; and NviDDi in ACL ORF. The transgenic plant according to claims 7 or 8 in which the geminivirus DNA consists of at least one FLTYpxC mutation; XHUvliQ; vKxYxdKd; FHPNIQxak; EGx2RTGKt; and NviDDi. 11. A transgenic plant according to claim 8, wherein the geminivirus DNA is an AcM mutant of ToMoV selected from the group consisting of P1681 / 98MX of SEQ ID Nos. 3- 5, 7- 13, 14 and 15. 12. A transgenic plant according to claim 8, wherein the geminivirus DNA is a mutant of TYLCV selected from the group consisting of SEQ ID Nos. 23, 26 and 29. 13. A transgenic plant according to claim 8, wherein the DNA of the geminivirus is an ACL mutant of BGMV selected from the group consisting of SEQ ID Nos. 45, 48, 51 and 54. A method for interfering with the geminivirus infection of a transgenic plant, comprising: selecting a transgenic plant according to claim 1; and cultivate the transgenic plant. P1681 / 98MX
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US015517 | 1996-04-16 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| MXPA98008627A true MXPA98008627A (en) | 1999-09-01 |
Family
ID=
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CA2130454C (en) | Production of viral resistant plants via introduction of untranslatable plus sense viral rna | |
| JP3281512B2 (en) | Methods for producing virus-resistant plants | |
| Gal et al. | Agroinfection of transgenic plants leads to viable cauliflower mosaic virus by intermolecular recombination | |
| US4970168A (en) | Virus-resistant plants | |
| JPH09505461A (en) | DNA virus ribozyme | |
| CA2132323A1 (en) | Fungus-responsive chimaeric gene | |
| AU2761395A (en) | Transgenic plants expressing dna constructs containing a plurality of genes to impart virus resistance | |
| JP2014057593A (en) | Improvement relating to specificity of gene expression | |
| WO1985004899A1 (en) | Methods and vectors for transformation of plant cells | |
| AU661359B2 (en) | Improvements in or relating to organic compounds | |
| US5877403A (en) | Papaya ringspot virus protease gene | |
| US6291743B2 (en) | Transgenic plants expressing mutant geminivirus AC1 or C1 genes | |
| WO1997039110A9 (en) | Transgenic plants expressing geminivirus genes | |
| US6395963B1 (en) | Nematode-inducible regulatory DNA sequences | |
| EP0938574A1 (en) | Method for inducing viral resistance into a plant | |
| AU707563B2 (en) | Nematode-inducible plant gene promoter | |
| US6262344B1 (en) | Nematode-inducible plant gene promoter | |
| MXPA98008627A (en) | Transgenic plants expressing geminivi genes | |
| US6448471B1 (en) | Nematode-feeding structure specific gene and its application to produce nematode resistant plants | |
| Hackland | The development of transgenic plants resistant to cucumber mosaic virus and tobacco necrosis virus | |
| MXPA00005493A (en) | Ribozymes capable of conferring resistance to potyvirus infection, and plants expressing said ribozymes |