US20030145345A1 - LexA DNA binding domain optimized for arabidopsis species - Google Patents
LexA DNA binding domain optimized for arabidopsis species Download PDFInfo
- Publication number
- US20030145345A1 US20030145345A1 US10/160,508 US16050802A US2003145345A1 US 20030145345 A1 US20030145345 A1 US 20030145345A1 US 16050802 A US16050802 A US 16050802A US 2003145345 A1 US2003145345 A1 US 2003145345A1
- Authority
- US
- United States
- Prior art keywords
- plant
- dna
- binding domain
- gene
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 230000004568 DNA-binding Effects 0.000 title claims abstract description 55
- 241000219194 Arabidopsis Species 0.000 title claims description 13
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 177
- 241000196324 Embryophyta Species 0.000 claims abstract description 164
- 108020004414 DNA Proteins 0.000 claims abstract description 60
- 239000002773 nucleotide Substances 0.000 claims abstract description 27
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 27
- 230000009261 transgenic effect Effects 0.000 claims abstract description 26
- 241000219195 Arabidopsis thaliana Species 0.000 claims abstract description 25
- 210000004027 cell Anatomy 0.000 claims description 78
- 108020004705 Codon Proteins 0.000 claims description 63
- 102000004169 proteins and genes Human genes 0.000 claims description 51
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 39
- 150000007523 nucleic acids Chemical group 0.000 claims description 22
- 241001233957 eudicotyledons Species 0.000 claims description 10
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 3
- 210000003527 eukaryotic cell Anatomy 0.000 claims 6
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 108700010070 Codon Usage Proteins 0.000 abstract description 33
- 230000014509 gene expression Effects 0.000 description 51
- 235000018102 proteins Nutrition 0.000 description 43
- 150000001413 amino acids Chemical class 0.000 description 29
- 238000000034 method Methods 0.000 description 29
- 241000588724 Escherichia coli Species 0.000 description 28
- 210000001519 tissue Anatomy 0.000 description 27
- 239000003550 marker Substances 0.000 description 20
- 238000013518 transcription Methods 0.000 description 20
- 230000035897 transcription Effects 0.000 description 20
- 229940024606 amino acid Drugs 0.000 description 17
- 108091026890 Coding region Proteins 0.000 description 14
- 239000013598 vector Substances 0.000 description 14
- 102000039446 nucleic acids Human genes 0.000 description 12
- 108020004707 nucleic acids Proteins 0.000 description 12
- 230000027455 binding Effects 0.000 description 10
- 239000003795 chemical substances by application Substances 0.000 description 10
- 230000001105 regulatory effect Effects 0.000 description 10
- 230000009466 transformation Effects 0.000 description 10
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 9
- 230000004913 activation Effects 0.000 description 9
- 239000004009 herbicide Substances 0.000 description 9
- 230000001965 increasing effect Effects 0.000 description 9
- 230000004927 fusion Effects 0.000 description 8
- 108090000765 processed proteins & peptides Proteins 0.000 description 8
- 238000013519 translation Methods 0.000 description 8
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 7
- 108091034117 Oligonucleotide Proteins 0.000 description 7
- 238000003556 assay Methods 0.000 description 7
- 230000002363 herbicidal effect Effects 0.000 description 7
- 238000004519 manufacturing process Methods 0.000 description 7
- 102000004196 processed proteins & peptides Human genes 0.000 description 7
- 241000894007 species Species 0.000 description 7
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 6
- 230000001580 bacterial effect Effects 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 108020004999 messenger RNA Proteins 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 229920001184 polypeptide Polymers 0.000 description 6
- 230000023603 positive regulation of transcription initiation, DNA-dependent Effects 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- IAJOBQBIJHVGMQ-UHFFFAOYSA-N 2-amino-4-[hydroxy(methyl)phosphoryl]butanoic acid Chemical compound CP(O)(=O)CCC(N)C(O)=O IAJOBQBIJHVGMQ-UHFFFAOYSA-N 0.000 description 5
- 108010025815 Kanamycin Kinase Proteins 0.000 description 5
- 108020001507 fusion proteins Proteins 0.000 description 5
- 230000008929 regeneration Effects 0.000 description 5
- 238000011069 regeneration method Methods 0.000 description 5
- 241000894006 Bacteria Species 0.000 description 4
- 241000206602 Eukaryota Species 0.000 description 4
- 241000209510 Liliopsida Species 0.000 description 4
- 108700001094 Plant Genes Proteins 0.000 description 4
- 240000008042 Zea mays Species 0.000 description 4
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 4
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 102000037865 fusion proteins Human genes 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- IAJOBQBIJHVGMQ-BYPYZUCNSA-N glufosinate-P Chemical compound CP(O)(=O)CC[C@H](N)C(O)=O IAJOBQBIJHVGMQ-BYPYZUCNSA-N 0.000 description 4
- 108010002685 hygromycin-B kinase Proteins 0.000 description 4
- 230000001976 improved effect Effects 0.000 description 4
- 235000009973 maize Nutrition 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 210000001938 protoplast Anatomy 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 108010020183 3-phosphoshikimate 1-carboxyvinyltransferase Proteins 0.000 description 3
- 108010000700 Acetolactate synthase Proteins 0.000 description 3
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 3
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 3
- 241000209094 Oryza Species 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 240000003768 Solanum lycopersicum Species 0.000 description 3
- 108700005078 Synthetic Genes Proteins 0.000 description 3
- 108700019146 Transgenes Proteins 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 230000008488 polyadenylation Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical class CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- OVSKIKFHRZPJSS-UHFFFAOYSA-N 2,4-D Chemical compound OC(=O)COC1=CC=C(Cl)C=C1Cl OVSKIKFHRZPJSS-UHFFFAOYSA-N 0.000 description 2
- 239000005631 2,4-Dichlorophenoxyacetic acid Substances 0.000 description 2
- 229940087195 2,4-dichlorophenoxyacetate Drugs 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 2
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 2
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 2
- 240000002791 Brassica napus Species 0.000 description 2
- 101710151559 Crystal protein Proteins 0.000 description 2
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 2
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 2
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 2
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 2
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 2
- 108010068250 Herpes Simplex Virus Protein Vmw65 Proteins 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 108091000041 Phosphoenolpyruvate Carboxylase Proteins 0.000 description 2
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 2
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 2
- 102000009661 Repressor Proteins Human genes 0.000 description 2
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 2
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 238000002105 Southern blotting Methods 0.000 description 2
- 108010044281 TATA-Box Binding Protein Proteins 0.000 description 2
- 102000006467 TATA-Box Binding Protein Human genes 0.000 description 2
- 108010022394 Threonine synthase Proteins 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- 108090000848 Ubiquitin Proteins 0.000 description 2
- 102000044159 Ubiquitin Human genes 0.000 description 2
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 229960000643 adenine Drugs 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 101150103518 bar gene Proteins 0.000 description 2
- GINJFDRNADDBIN-FXQIFTODSA-N bilanafos Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCP(C)(O)=O GINJFDRNADDBIN-FXQIFTODSA-N 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- -1 boromoxynil Chemical compound 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 102000004419 dihydrofolate reductase Human genes 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 230000001747 exhibiting effect Effects 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000000749 insecticidal effect Effects 0.000 description 2
- 235000021374 legumes Nutrition 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 108010058731 nopaline synthase Proteins 0.000 description 2
- 235000016709 nutrition Nutrition 0.000 description 2
- 108010082527 phosphinothricin N-acetyltransferase Proteins 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 231100000654 protein toxin Toxicity 0.000 description 2
- 229960000268 spectinomycin Drugs 0.000 description 2
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 239000003053 toxin Substances 0.000 description 2
- 108700012359 toxins Proteins 0.000 description 2
- 230000005026 transcription initiation Effects 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- NDUPDOJHUQKPAG-UHFFFAOYSA-M 2,2-Dichloropropanoate Chemical compound CC(Cl)(Cl)C([O-])=O NDUPDOJHUQKPAG-UHFFFAOYSA-M 0.000 description 1
- ZBMRKNMTMPPMMK-UHFFFAOYSA-N 2-amino-4-[hydroxy(methyl)phosphoryl]butanoic acid;azane Chemical compound [NH4+].CP(O)(=O)CCC(N)C([O-])=O ZBMRKNMTMPPMMK-UHFFFAOYSA-N 0.000 description 1
- UPMXNNIRAGDFEH-UHFFFAOYSA-N 3,5-dibromo-4-hydroxybenzonitrile Chemical compound OC1=C(Br)C=C(C#N)C=C1Br UPMXNNIRAGDFEH-UHFFFAOYSA-N 0.000 description 1
- CAAMSDWKXXPUJR-UHFFFAOYSA-N 3,5-dihydro-4H-imidazol-4-one Chemical compound O=C1CNC=N1 CAAMSDWKXXPUJR-UHFFFAOYSA-N 0.000 description 1
- 108091000044 4-hydroxy-tetrahydrodipicolinate synthase Proteins 0.000 description 1
- 101150001232 ALS gene Proteins 0.000 description 1
- 108010016219 Acetyl-CoA carboxylase Proteins 0.000 description 1
- 102000000452 Acetyl-CoA carboxylase Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- 241000208173 Apiaceae Species 0.000 description 1
- 235000017060 Arachis glabrata Nutrition 0.000 description 1
- 244000105624 Arachis hypogaea Species 0.000 description 1
- 235000010777 Arachis hypogaea Nutrition 0.000 description 1
- 235000018262 Arachis monticola Nutrition 0.000 description 1
- 241001167018 Aroa Species 0.000 description 1
- 108010055400 Aspartate kinase Proteins 0.000 description 1
- 241000193388 Bacillus thuringiensis Species 0.000 description 1
- 108700003860 Bacterial Genes Proteins 0.000 description 1
- 235000016068 Berberis vulgaris Nutrition 0.000 description 1
- 241000335053 Beta vulgaris Species 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 235000011293 Brassica napus Nutrition 0.000 description 1
- 240000007124 Brassica oleracea Species 0.000 description 1
- 235000003899 Brassica oleracea var acephala Nutrition 0.000 description 1
- 235000011301 Brassica oleracea var capitata Nutrition 0.000 description 1
- 235000001169 Brassica oleracea var oleracea Nutrition 0.000 description 1
- 235000000540 Brassica rapa subsp rapa Nutrition 0.000 description 1
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 1
- 241000219193 Brassicaceae Species 0.000 description 1
- 239000005489 Bromoxynil Substances 0.000 description 1
- 108090000489 Carboxy-Lyases Proteins 0.000 description 1
- 241000871189 Chenopodiaceae Species 0.000 description 1
- 240000008067 Cucumis sativus Species 0.000 description 1
- 235000009849 Cucumis sativus Nutrition 0.000 description 1
- 241000219104 Cucurbitaceae Species 0.000 description 1
- 108030005585 Cyanamide hydratases Proteins 0.000 description 1
- ZAQJHHRNXZUBTE-NQXXGFSBSA-N D-ribulose Chemical compound OC[C@@H](O)[C@@H](O)C(=O)CO ZAQJHHRNXZUBTE-NQXXGFSBSA-N 0.000 description 1
- ZAQJHHRNXZUBTE-UHFFFAOYSA-N D-threo-2-Pentulose Natural products OCC(O)C(O)C(=O)CO ZAQJHHRNXZUBTE-UHFFFAOYSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 1
- 101710096438 DNA-binding protein Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 244000000626 Daucus carota Species 0.000 description 1
- 235000002767 Daucus carota Nutrition 0.000 description 1
- 208000005156 Dehydration Diseases 0.000 description 1
- 108700016256 Dihydropteroate synthases Proteins 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 238000012286 ELISA Assay Methods 0.000 description 1
- 101100491986 Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) aromA gene Proteins 0.000 description 1
- 241000220485 Fabaceae Species 0.000 description 1
- 241000701484 Figwort mosaic virus Species 0.000 description 1
- 102100039556 Galectin-4 Human genes 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 239000005562 Glyphosate Substances 0.000 description 1
- 244000020551 Helianthus annuus Species 0.000 description 1
- 235000003222 Helianthus annuus Nutrition 0.000 description 1
- 208000009889 Herpes Simplex Diseases 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- 101001125540 Homo sapiens 26S proteasome regulatory subunit 6A Proteins 0.000 description 1
- 101000608765 Homo sapiens Galectin-4 Proteins 0.000 description 1
- 101000891654 Homo sapiens TATA-box-binding protein Proteins 0.000 description 1
- 206010020649 Hyperkeratosis Diseases 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 101710196632 LexA repressor Proteins 0.000 description 1
- 244000070406 Malus silvestris Species 0.000 description 1
- 240000004658 Medicago sativa Species 0.000 description 1
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 1
- 102000008109 Mixed Function Oxygenases Human genes 0.000 description 1
- 108010074633 Mixed Function Oxygenases Proteins 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 108010033272 Nitrilase Proteins 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- UOZODPSAJZTQNH-UHFFFAOYSA-N Paromomycin II Natural products NC1C(O)C(O)C(CN)OC1OC1C(O)C(OC2C(C(N)CC(N)C2O)OC2C(C(O)C(O)C(CO)O2)N)OC1CO UOZODPSAJZTQNH-UHFFFAOYSA-N 0.000 description 1
- 108010060806 Photosystem II Protein Complex Proteins 0.000 description 1
- 108010064851 Plant Proteins Proteins 0.000 description 1
- 101710142009 Protein insensitive Proteins 0.000 description 1
- 241000220324 Pyrus Species 0.000 description 1
- 108020005067 RNA Splice Sites Proteins 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 108010034634 Repressor Proteins Proteins 0.000 description 1
- 108091027981 Response element Proteins 0.000 description 1
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 1
- 235000004789 Rosa xanthina Nutrition 0.000 description 1
- 241000220222 Rosaceae Species 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 241000208292 Solanaceae Species 0.000 description 1
- 244000061458 Solanum melongena Species 0.000 description 1
- 235000002597 Solanum melongena Nutrition 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 229940100389 Sulfonylurea Drugs 0.000 description 1
- 102100040296 TATA-box-binding protein Human genes 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 239000006035 Tryptophane Substances 0.000 description 1
- 108091026828 U2 spliceosomal RNA Proteins 0.000 description 1
- 108091026837 U5 spliceosomal RNA Proteins 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 230000000843 anti-fungal effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 230000000840 anti-viral effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 229940121375 antifungal agent Drugs 0.000 description 1
- 239000000074 antisense oligonucleotide Substances 0.000 description 1
- 238000012230 antisense oligonucleotides Methods 0.000 description 1
- 235000021016 apples Nutrition 0.000 description 1
- 101150037081 aroA gene Proteins 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 229940097012 bacillus thuringiensis Drugs 0.000 description 1
- 235000021028 berry Nutrition 0.000 description 1
- 102000023732 binding proteins Human genes 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 238000010256 biochemical assay Methods 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 229930002868 chlorophyll a Natural products 0.000 description 1
- ATNHDLDRLWWWCB-AENOIHSZSA-M chlorophyll a Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 ATNHDLDRLWWWCB-AENOIHSZSA-M 0.000 description 1
- 229930002869 chlorophyll b Natural products 0.000 description 1
- NSMUHPMZFPKNMZ-VBYMZDBQSA-M chlorophyll b Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C=O)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 NSMUHPMZFPKNMZ-VBYMZDBQSA-M 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 238000003936 denaturing gel electrophoresis Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000001687 destabilization Effects 0.000 description 1
- 235000021186 dishes Nutrition 0.000 description 1
- 230000024346 drought recovery Effects 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 230000009088 enzymatic function Effects 0.000 description 1
- 230000009483 enzymatic pathway Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000002270 exclusion chromatography Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 229940097068 glyphosate Drugs 0.000 description 1
- 230000008642 heat stress Effects 0.000 description 1
- 101150029559 hph gene Proteins 0.000 description 1
- 101150062015 hyg gene Proteins 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 238000001155 isoelectric focusing Methods 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 101150047523 lexA gene Proteins 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 108010083942 mannopine synthase Proteins 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 230000000442 meristematic effect Effects 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- UOZODPSAJZTQNH-LSWIJEOBSA-N paromomycin Chemical compound N[C@@H]1[C@@H](O)[C@H](O)[C@H](CN)O[C@@H]1O[C@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](N)C[C@@H](N)[C@@H]2O)O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)N)O[C@@H]1CO UOZODPSAJZTQNH-LSWIJEOBSA-N 0.000 description 1
- 229960001914 paromomycin Drugs 0.000 description 1
- 101150113864 pat gene Proteins 0.000 description 1
- 235000020232 peanut Nutrition 0.000 description 1
- 235000021017 pears Nutrition 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 235000021118 plant-derived protein Nutrition 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 101150075980 psbA gene Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000001172 regenerating effect Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 230000021749 root development Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000011869 shoot development Effects 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 101150090749 sulI gene Proteins 0.000 description 1
- 229940124530 sulfonamide Drugs 0.000 description 1
- 150000003456 sulfonamides Chemical class 0.000 description 1
- YROXIXLRRCOBKF-UHFFFAOYSA-N sulfonylurea Chemical class OC(=N)N=S(=O)=O YROXIXLRRCOBKF-UHFFFAOYSA-N 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- 229960004799 tryptophan Drugs 0.000 description 1
- 239000003744 tubulin modulator Substances 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/24—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
- C07K14/245—Escherichia (G)
Definitions
- This invention relates to increasing the expression of heterologous or chimeric proteins in plants, and more specifically to increasing protein expression in Arabidopsis species and other dicotyledons.
- One of the primary goals of plant genetic research and development is the production of transgenic plants that express a heterologous gene (i.e., produce a “foreign” protein) in an amount sufficient to confer a desired phenotype to the plant. While significant advances have been made in pursuit of this goal, the expression of certain heterologous genes in transgenic plants remains problematic. It is thought that numerous factors are involved in determining the ultimate level of expression of a heterologous gene in a plant. The amount of protein that is synthesized from a gene is a function of several complex and interrelated events, including transcription, RNA maturation, translation, and post-translational modification. Each of these processes is comprised of a large number of events, all of which are potentially regulated either independently or in concert.
- the genetic code is considered “degenerate” in that more than one nucleic acid triplet (i.e., a “codon”) encodes the same amino acid.
- Many amino acids may be coded for by several different codons.
- genes within a taxonomic group exhibit similarities in codon choice, regardless of the function of these genes.
- an estimate of the overall use of the genetic code by a taxonomic group can be obtained by summing codon frequencies of all its sequenced genes. Variation between degenerate base frequencies is not a neutral phenomenon, since systematic codon preferences have been reported for bacterial, yeast, plant and mammalian genes. Bias in codon choice within genes in a single species appears related to the level of expression of protein encoded by any particular gene.
- Codon bias is most extreme in highly expressed proteins of bacteria (e.g., E. coli ) and yeast.
- highly expressed genes use a smaller subset of codons than do weakly expressed genes, although the codons preferred are distinct in some cases.
- Sharp and Li Nucl. Acids Res. 14, 7734-7749 (1986), report that codon usage in 165 E. coli genes reveals a positive correlation between high expression and increased codon bias. In these organisms, a strong positive correlation has been reported between the abundance of an isoaccepting tRNA species and the favored synonymous codon. For example, in one group of highly expressed proteins in yeast, over 96% of the amino acids are encoded by only 25 of the 61 available codons.
- codon usages for codons XCG and XUA are 1.8% and 3.2% in dicotyledonous plants and 6.3% and 1.4% in monocotyledonous plants.
- the combined codon usage for codons XXC and XXG (hereinafter, referred to as the codon XXC/G usage, wherein each of the two Xs is independently selected from the group consisting of A, G, C and T) is 45% in dicotyledon and 73.5% in monocotyledon. It is well established that GC content in genes which can be translated is higher in monocotyledon such as gramineous plants, e.g., rice plants, than in dicotyledons. As to bacteria, the codon usage apparently varies by strain
- U.S. Pat. No. 5,380,831 to Adang et al. describes synthetic Bt genes designed to be expressed at a level higher than naturally-occurring Bt genes.
- the genes utilize codons preferred in highly expressed monocot or dicot protein.
- the synthetic genes while about 85% homologous to the native bacterial sequence, are chemically modified to contain codons that are preferred by highly expressed plant genes, and to eliminate undesirable sequences that cause destabilization, termination of RNA, secondary structures and RNA splice sites.
- U.S. Pat. No. 5,436,391 to Fujimoto et al. describes a synthetic gene encoding the insecticidal protein Bt.
- the gene is provided having a base sequence which has been modified to bring the codon usage in conformity with the genes of graminaceous plants, particularly rice plants (e.g., oryza).
- U.S. Pat. No. 5,689,052 to Brown et al. describes a method for modifying a foreign nucleotide sequence for enhanced accumulation of its protein product in a monocotyledonous plant, and/or increasing the frequency of obtaining transgenic monocotyledonous plants which accumulate useful amounts of a transgenic protein, by reducing the frequency of the rare and semi-rare monocotyledonous codons in the foreign gene and replacing them with more preferred monocotyledonous codons.
- heterologous structural genes i.e., the genes encoding a desired protein product, such as Bt toxin
- the modification of regulatory elements that control transcription by optimizing codon usage in the host plant has not been emphasized. It is widely recognized that the upstream regulatory elements that control transcription and translation have very significant roles in determining the quantity, timing, and tissue specificity of gene expression.
- Various nucleotide sequences other than the heterologous structural coding sequence affect the expression levels of a foreign DNA sequence introduced into a plant, including promoter sequences, intron sequences, 3′ untranslated sequences, polyadenylation sites, and other regulatory sequences.
- activation and control of transcription are processes that may desirably be manipulated in order to achieve altered (i.e., increased or decreased) expression of a heterologous structural gene in a plant cell.
- Transcription can be activated through the use of two functional domains of a transcription activation moiety: a domain (i.e., sequence of amino acids) that recognizes and binds to a specific site or sequence of nucleotides on a target DNA, (the DNA binding domain); and a domain that is capable of activating transcription of the DNA when physically associated with the DNA-binding domain and which may be necessary for activation of the target gene (the activation domain).
- a domain i.e., sequence of amino acids
- the two functional domains may be derived from a single transcription activation protein. Alternatively, it has been shown that these two functions can also reside on separate proteins. See McKnight et al., Proc. Natl. Acad. Sci. USA 89, 7061-7065 (1987); Curran et al. 55, 395-397 (1988).
- the transcription activation domains may also be derived from synthetic DNA-binding and transcription activation proteins.
- Additional flexibility in controlling heterologous gene expression in plants may be obtained by using DNA binding domains and response elements from heterologous sources (i.e., DNA binding domains from non-plant sources).
- heterologous DNA binding domains include the LexA and GAL4 DNA binding domains.
- the LexA DNA-binding domain is part of the repressor protein LexA from Escherichia coli ( E. Coli ) (Brent and Ptashne, Cell 43:729-736 (1985)).
- the LexA DNA binding domain functions as an efficient DNA binding domain in its natural bacterial host, when transferred by recombinant DNA technology into higher eukaryotes such as plants, the domain is not efficiently expressed. Accordingly, it would be desirable to alter the LexA DNA binding domain to increase its expression in higher eukaryotes such as plants.
- the invention relates to adapting the codons of the DNA binding domain of the LexA gene from E. Coli to the codon usage of Arabidopsis thaliana. This method is advantageous in that it allows for the increased expression of heterologous and chimeric proteins containing this artificial DNA binding domain.
- Additional aspects of this invention include constructs (i.e., vectors, DNA fusions and polynucleotides), comprising the synthetic DNA sequence of the present invention. These constructs are useful for increasing heterologous protein expression in plant cells. Further aspects of the invention are cells, plant lines, and transgenic plants transformed with the described constructs. Methods of increasing expression of heterologous proteins in a cell or a transgenic plant are an additional aspect of the present invention.
- FIG. 1 sets forth the nucleotide and amino acid sequences of the LexA DNA binding domain optimized for Arabidopsis thaliana codon usage.
- FIG. 2 sets forth the nucleotide sequences of the seven oligonucleotides (SEQ ID NO: 3-9) that were annealed together to arrive at the LexA DNA binding domain optimized for Arabidopsis thaliana codon usage.
- SEQ ID NO: 10 is the LexA activation domain to which the LexA binding domain normally binds.
- FIG. 3 is a schematic representation of the plasmid pUCNLSTBP1.
- Amino acid sequences disclosed herein are presented in the amino to carboxy direction, from left to right. The amino and carboxy groups are not presented in the sequence. Nucleotide sequences are presented herein by single strand only, in the 5′ to 3′ direction, from left to right. Nucleotides and amino acids are represented herein in the manner recommended by the IUPAC-IUB Biochemical Nomenclature Commission, or (for amino acids) by three letter code, in accordance with 37 CFR ⁇ 1.822 and established usage. See, e.g., Patent In User Manual, 99-102 (November 1990) (U.S. Patent and Trademark Office).
- amino acid sequence refers to either a naturally occurring or a synthetic oligopeptide, peptide, polypeptide, or protein sequence, and fragments thereof. Where “amino acid sequence” is recited herein to refer to an amino acid sequence of a naturally occurring protein molecule, the term “amino acid sequence,” and like terms, are not meant to limit the amino acid sequence to the complete, native amino acid sequence associated with the recited protein molecule.
- nucleic acid sequence refers to a nucleotide, oligonucleotide, or polynucleotide, and fragments thereof, and to DNA or RNA of genomic or synthetic origin which may be single- or double-stranded, and which may represent a sense or antisense strand.
- “Chemically synthesized,” as related to a sequence of DNA, means that the component nucleotides are assembled in vitro.
- Manual chemical synthesis of DNA may be accomplished using well established procedures (Caruthers, M., in Methodology of DNA and RNA Sequencing, Chapter 1 (Weissman (ed.), Praeger Publishers, New York, (1983)).
- automated chemical synthesis of DNA can be performed using one of a number of commercially available apparatus.
- LexA or “LexA binding domain” refers to a protein or domain that is naturally encoded in E. coli by the lexA gene, which domain normally binds to the DNA sequence CATACTGTATGAGCATACAG (the LexA activation domain) [SEQ ID NO: 10].
- the term “LexA binding domain adapted (or optimized) for Arabidopsis thaliana usage” means a protein domain in which the codons encoding the domain have been modified from the natural E. Coli sequence in order to maximize DNA binding (and accordingly, transcription) in Arabidopsis thaliana.
- expression refers to the transcription and translation of a heterologous or homologous gene to yield the protein encoded by the gene.
- Heterologous is used to indicate that a nucleic acid sequence (e.g., a gene) or a protein has a different natural origin or source with respect to its current host. “Heterologous” is also used to indicate that one or more of the domains present in a protein differ in their natural origin with respect to other domains present.
- a “structural gene” is that portion of a gene comprising a DNA segment encoding a protein, polypeptide or a portion thereof, and excluding the 5′ sequence which drives the initiation of transcription.
- the structural gene may be one which is normally found in the cell or one which is not normally found in the cellular location wherein it is introduced, in which case it is termed a heterologous gene.
- a heterologous gene may be derived in whole or in part from any source known to the art, including a bacterial genome or episome, eukaryotic, nuclear or plasmid DNA, cDNA, viral DNA or chemically synthesized DNA.
- a structural gene may contain one or more modifications in either the coding or the untranslated regions which could affect the biological activity or the chemical structure of the expression product, the rate of expression or the manner of expression control. Such modifications include, but are not limited to, mutations, insertions, deletions and substitutions of one or more nucleotides.
- the structural gene may constitute an uninterrupted coding sequence or it may include one or more introns, bounded by the appropriate splice junctions.
- the structural gene may be a composite of segments derived from a plurality of sources, either naturally occurring or synthetic, or both. When synthesizing a gene for improved expression in a host cell it is desirable to design the gene such that its frequency of codon usage approaches the frequency of preferred codon usage of the host cell.
- chimeric protein is used to indicate that the protein is comprised of domains, at least one of which has an origin or source that is heterologous with respect to the other domains present.
- Chimeric proteins are encoded by nucleotide sequences that have been fused or ligated together, resulting in a coding sequence that does not occur naturally.
- “Chimeric sequences” or genes refer to nucleic acid sequences containing at least two heterologous parts, e.g., parts derived from naturally occurring nucleic acid sequences that are not associated in their naturally occurring states, or containing at least one part that is of synthetic origin and not found in nature.
- the amino acid sequence of the E. Coli LexA DNA binding domain is determined and back-translated into all the available codon choices for each amino acid.
- the amino acid sequence of the protein can be analyzed using commercially available computer software such as the BACKTRANSLATE® program of the GCG Sequence Analysis software package. After the sequence is back-translated, the codons that are preferentially and optimally used in Arabidposis thaliana are substituted for the naturally occurring codons of the E Coli. LexA binding domain.
- the codon preferred by Arabidopsis thaliana is placed in the sequence in place of the original codon.
- the 258 nucleotides representing one strand of the DNA is arbitrarily divided into three oligonucleotide sequences.
- the 258 nucleotides representing the other, complementary strand of the DNA is arbitrarily divided into four oligonucleotide sequences.
- the oligonucleotide sequences are then chemically synthesized into seven oligomers, each being phosphorylated at their 5′ end.
- oligomers have additional nucleotides added to the ends in order to create “sticky ends” (restriction sites) for use in ligating the DNA into recombinant vectors.
- the oligomers are synthesized, the oligomers are annealed together, to yield the following synthetic DNA sequence encoding the LexA DNA binding domain codon optimized for usage in Arabidopsis thaliana:
- Every codon encoding the normal, E. Coli LexA DNA binding domain is substituted with a corresponding codon that is preferred by A. thaliana (i.e., is completely or 100 percent optimized for usage by A. thaliana ), when the two codons are different (i.e., when a particular amino acid may be encoded by more than one codon, and a codon preferred by E. Coli differs from one preferred by A. thaliana ).
- the invention also encompasses synthetic nucleotide sequences and proteins that are partially (i.e., less than 100 percent) optimized for usage by A. thaliana, and the uses therefor.
- less than all of amino acids of the LexA DNA binding domain that may be encoded for by more than one codon is replaced in the synthetic sequence with a codon preferred by A. thaliana.
- more than about 50 percent of the codons of the LexA binding domain are replaced in the synthetic sequence with a codon preferred by A. thaliana; more preferably, more than about 80 percent of the codons of the LexA binding domain are replaced in the synthetic sequence with a codon preferred by A. thaliana; most preferably, 100 percent of the codons of the LexA binding domain are codons that are preferred by A. thaliana.
- the LexA DNA binding domain set forth herein will be more efficiently expressed in higher eukaryotes (e.g., plants and animals), and more specifically will be more efficiently expressed in dicotyledenous plants, which include but are by no means limited to species of legumes (from the family Fabaceae), including soybean, peanut, and alfalfa; species of the Solanaceae family such as tomato, eggplant and potato; species of the family Brassicaceae such as cabbage, turnips and rapeseed; species of the family Rosaceae such as apples, pears and berries; and members of the families Cucurbitaceae (cucumbers), Chenopodiaceae (beets) and Umbelliferae (carrots).
- legumes from the family Fabaceae
- Solanaceae family such as tomato, eggplant and potato
- species of the family Brassicaceae such as cabbage, turnips and rapeseed
- species of the family Rosaceae such as apples, pears and berries
- the present invention provides an advantageously modified DNA binding domain for the enhanced expression of desired heterologous protein genes in transgenic plants.
- one embodiment of the present invention is a DNA construct comprising a DNA sequence encoding the LexA synthetic DNA binding domain of the invention.
- Such DNA constructs accordingly provide for the preparation of stably transformed cells expressing heterologous protein, which transformed cells are also an aspect of the invention.
- the synthetic DNA binding domain of the present invention provides for the subsequent regeneration of fertile, transgenic plants and progeny containing desired heterologous protein genes.
- DNA constructs (also referred to herein as DNA vectors) of the present invention comprise the nucleotide sequence of the synthetic Lex A binding domain described herein, which nucleotide sequence is preferably the sequence provided herein as SEQ ID NO: 1.
- the preparation of DNA constructs is well known in the art. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual (1989).
- the DNA constructs of the present invention are useful in the transformation of cells (e.g., plant cells), and thus useful in the expression of heterologous genes in the cells.
- a heterologous DNA sequence i.e., gene
- a DNA construct preferably contains some or all of the necessary elements to permit the transcription and ultimate expression of the coding sequence in the host plant.
- DNA constructs of the present invention may contain suitable promoters for the expression of heterologous genes in plants.
- promoter refers to the nucleotide sequences at the 5′ end of a structural gene which direct the initiation of transcription. Generally, promoter sequences are necessary, but not always sufficient, to drive the expression of a downstream gene.
- the structural gene is placed under the regulatory control of a promoter such that the expression of the gene is controlled by promoter sequences. The promoter is positioned preferentially upstream to the structural gene and at a distance from the transcription start site that approximates the distance between the promoter and the gene it controls in its natural setting.
- operatively linked means that a promoter is connected to a coding region in such a way that the transcription of that coding region is controlled and regulated by that promoter.
- Means for operatively linking a promoter to a coding region are well known in the art.
- promoters For expression in plants, suitable promoters must be chosen for the host cell, the selection of which promoters is well within the skill of one knowledgeable in the art. Promoters useful in the practice of the present invention include, but are not limited to, constitutive, inducible, temporally regulated, developmentally regulated, chemically regulated, tissue-preferred and tissue-specific promoters.
- promoters are known or are found to facilitate transcription of RNA in plant cells and can be used in the DNA construct of the present invention.
- suitable promoters include the nopaline synthase (NOS) and octopine synthase (OCS) promoters, the light-inducible promoter from the small subunit of ribulose bis-phosphate carboxylase promoters, the CaMV 35S and 19S promoters, the full-length transcript promoter from Figwort mosaic virus, histone promoters, tubulin promoters, or the mannopine synthase promoter (MAS).
- NOS nopaline synthase
- OCS octopine synthase
- MAS mannopine synthase promoter
- the promoter may also be one that causes preferential expression in a particular tissue, such as leaves, stems, roots, or meristematic tissue, or the promoter may be inducible, such as by light, heat stress, water stress or chemical application or production by the plant.
- exemplary green tissue-specific promoters include the maize phosphoenol pyruvate carboxylase (PEPC) promoter, small submit ribulose bis-carboxylase promoters (ssRUBISCO) and the chlorophyll a/b binding protein promoters.
- Additional promoters useful in the present invention include but are not limited to one of several of the actin genes, which are known to be expressed in most cell types.
- Yet another constitutive promoter useful in the practice of the present invention is derived from ubiquitin, which is another gene product known to accumulate in many cell types.
- the ubiquitin promoter has been cloned from several species for use in transgenic plants (e.g., sunflower (Binet et al., Plant Science 79: 87-94 (1991); and maize (Christensen et al., Plant Molec. Biol. 12, 619-632 (1989)).
- U2 and U5 snRNA promoters from maize (Brown et al., Nucleic Acids Res. 17, 8991 (1989)) and the promoter from alcohol dehydrogenase (Dennis et al., Nucleic Acids Res. 12, 3983 (1984)).
- Tissue-specific or tissue-preferential promoters useful in the present invention in plants are those which direct expression in root, pith, leaf or pollen. Such promoters are disclosed in U.S. Pat. No. 5,625,136 (herein incorporated by reference in its entirety). Also useful are promoters which confer seed-specific expression, such as those disclosed by Schernthaner et al., EMBO J. 7: 1249 (1988); anther-specific promoters ant32 and ant43D; anther (tapetal) specific promoter B6 (Huffman et al., J. Cell. Biochem. 17B, Abstract #D209 (1993)); and pistil-specific promoters such as a modified S13 promoter (Dzelkalns et al., Plant Cell 5,855 (1993)).
- plant promoters may be obtained, preferably from plants or plant viruses, and may be utilized so long as the selected promoter is capable of causing sufficient expression in a plant resulting in the production of an effective amount of the desired protein.
- Preferred constitutive promoters include but are not limited to the CaMV 35S and 19S promoters (see U.S. Pat. No. 5,352,605, the disclosure of which is incorporated herein in its entirety). Any promoter used in the present invention may be modified, if desired, to alter their control characteristics. For example, the CaMV 35S or 19S promoters may be enhanced by the method described in Kay, et al. Science (1987) Vol. 236, pp.1299-1302.
- the DNA sequences that comprise the DNA constructs of the present invention are preferably carried on suitable vectors, which are known in the art.
- Preferred vectors for are plasmids that may be propagated in a plant cell.
- Particularly preferred vectors for transformation are those useful for transformation of plant cells or of Agrobacteria, as described further below.
- the preferred vector is a Ti-plasmid derived vector.
- Other appropriate vectors which can be utilized as starting materials are known in the art. Suitable vectors for transforming plant tissue and protoplasts have been described by deFramond, A. et al., Bio/Technology 1, 263 (1983); An, G. et al., EMBO J. 4, 277 (1985); and Rothstein, S. J. et al., Gene 53, 153 (1987). In addition to these, many other vectors have been described in the art which are suitable for use as starting materials in the present invention.
- the DNA encoding the synthetic LexA binding domain of the present invention, and the DNA constructs comprising them, have applicability to any structural gene that is desired to be introduced into a plant to provide any desired characteristic in the plant, such as herbicide tolerance, virus tolerance, insect tolerance, disease tolerance, drought tolerance, or enhanced or improved phenotypic characteristics such as improved nutritional or processing characteristics.
- DNA constructs of the present invention also comprise DNA sequences encoding transactivation domains.
- Transactivation domains can be defined as amino acid sequences that, when combined with the DNA binding domain, increase productive transcription initiation by RNA polymerases. (See generally Ptashne, Nature 335, 683-689 (1988)). Different transactivation domains are known to have different degrees of effectiveness in their ability to increase transcription initiation. In the present invention it is desirable to use transactivation domains that have superior transactivating effectiveness in plant cells in order to create a high level of target polypeptide expression in response to the presence of chemical ligand.
- Transactivation domains that have been shown to be particularly effective in the method of the present invention include but are not limited to VP16 (isolated from the herpes simplex virus), C1 (isolated from maize), and Thm18 (isolated from tomato).
- VP16 isolated from the herpes simplex virus
- C1 isolated from maize
- Thm18 isolated from tomato
- codon-optimized LexA DNA-binding domain is the in-frame fusion of this DNA-binding domain to other domains that have other functions; for example, the fusion to transcription activator domains like VP16 from Herpes simplex or Thm18 from tomato, or the fusion to TATA-box binding proteins (TBPs) such as the TBP1 protein from Arabidopsis thaliana.
- TBPs TATA-box binding proteins
- Other transactivation domains may also be effective.
- Transgenes (heterologous genes to be transformed into a plant cell) will often be genes that direct the expression of a particular protein or polypeptide product, but they may also be non-expressible DNA segments, e.g., transposons that do not direct their own transposition.
- an “expressible gene” is any gene that is capable of being transcribed into RNA (e.g., mRNA, antisense RNA, etc.) or translated into a protein, expressed as a trait of interest, or the like, etc., and is not limited to selectable, screenable or non-selectable marker genes.
- the invention also contemplates that, where both an expressible gene that is not necessarily a marker gene is employed in combination with a marker gene, one may employ the separate genes on either the same or different DNA segments for transformation. In the latter case, the different vectors are delivered concurrently to recipient cells to maximize cotransformation
- heterologous gene or nucleic acid that is desired to be expressed in a plant is suitable for the practice of the present invention.
- Heterologous genes to be transformed and expressed in the plants of the present invention include but are not limited to genes that encode resistance to diseases and insects, genes conferring nutritional value, genes conferring antifungal, antibacterial or antiviral activity, and the like.
- therapeutic e.g., for veterinary or medical uses
- immunogenic e.g., for vaccination
- the transfer of any nucleic acid for controlling gene expression in a plant is contemplated as an aspect of the present invention.
- the nucleic acid to be transferred can encode an antisense oligonucleotide.
- plants may be transformed with one or more genes to reproduce enzymatic pathways for chemical synthesis or other industrial processes.
- Marker genes are genes that impart a distinct phenotype to cells expressing the marker gene and thus allow such transformed cells to be distinguished from cells that do not have the marker.
- Such genes may encode either a selectable or screenable marker, depending on whether the marker confers a trait which one can ‘select’ for by chemical means, i.e., through the use of a selective agent (e.g., a herbicide, antibiotic, or the like), or whether it is simply a trait that one can identify through observation or testing, i.e., by ‘screening’ (e.g., the R-locus trait).
- a selective agent e.g., a herbicide, antibiotic, or the like
- screening e.g., the R-locus trait
- suitable marker genes are known to the art and can be employed in the practice of the invention.
- the selectable marker gene may be the only heterologous gene expressed by a transformed cell, or may be expressed in addition to another heterologous gene transformed into and expressed in the transformed cell. Selectable marker genes are utilized for the identification and selection of transformed cells or tissues.
- Selectable marker genes include genes encoding antibiotic resistance, such as those encoding neomycin phosphotransferase II (NEO) and hygromycin phosphotransferase (HPT), as well as genes conferring resistance to herbicidal compounds.
- Herbicide resistance genes generally code for a modified target protein insensitive to the herbicide or for an enzyme that degrades or detoxifies the herbicide in the plant before it can act. See, DeBlock et al., EMBO J. 6, 2513 (1987); DeBlock et al., Plant Physiol. 91, 691 (1989); Fromm et al., BioTechnology 8, 833 (1990); Gordon-Kamm et al., Plant Cell 2, 603 (1990).
- glyphosphate or sulfonylurea herbicides has been obtained using genes coding for the mutant target enzymes, 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) and acetolactate synthase (ALS).
- EPSPS 5-enolpyruvylshikimate-3-phosphate synthase
- ALS acetolactate synthase
- Resistance to glufosinate ammonium, boromoxynil, and 2,4-dichlorophenoxyacetate (2,4-D) have been obtained by using bacterial genes encoding phosphinothricin acetyltransferase, a nitrilase, or a 2,4-dichlorophenoxyacetate monooxygenase, which detoxify the respective herbicides.
- Selectable marker genes include, but are not limited to, genes encoding: neomycin phosphotransferase II (Fraley et al., CRC Critical Reviews in Plant Science 4, 1 (1986)); cyanamide hydratase (Maier-Greiner et al., Proc. Natl. Acad. Sci. USA 88, 4250 (1991)); aspartate kinase; dihydrodipicolinate synthase (Perl et al., BioTechnology 11, 715 (1993)); bar gene (Toki et al., Plant Physiol. 100, 1503 (1992); Meagher et al., Crop Sci.
- the bar gene confers herbicide resistance to glufosinate-type herbicides, such as phosphinothricin (PPT) or bialaphos, and the like.
- PPT phosphinothricin
- other selectable markers that could be used in the vector constructs include, but are not limited to, the pat gene, also for bialaphos and phosphinothricin resistance, the ALS gene for imidazolinone resistance, the HPH or HYG gene for hygromycin resistance, the EPSP synthase gene for glyphosate resistance, the Hm1 gene for resistance to the Hc-toxin, and other selective agents used routinely and known to one of ordinary skill in the art. See generally, Yarranton, Curr. Opin.
- selectable marker genes are not meant to be limiting. Any selectable marker gene can be used in the present invention.
- transformation describes a process by which heterologous nucleic acid enters and changes a recipient cell. It may occur under natural or artificial conditions using various methods well known in the art. Transformation may rely on any known method for the insertion of foreign nucleic acid sequences into a eukaryotic host cell.
- Such “transformed” cells include stably transformed cells in which the inserted DNA is capable of replication either as an autonomously replicating plasmid or as part of the host chromosome. They also include cells which transiently express the inserted DNA or RNA for limited periods of time.
- recipient cells for transformation are plant cells, more preferably dicot plant cells, even more preferably Arabidopsis species plant cells, and most preferably Arabidopsis thaliana plant cells.
- Plant cells as used herein includes plant cells in plant tissue or plant tissue and plant cells and protoplasts in culture. Plant tissue includes differentiated and undifferentiated tissues of plants, including but not limited to, roots, shoots, leaves, pollen, seeds, tumor tissue and various forms of cells in culture, such as single cells, protoplasts, embryos and callus tissue. The plant tissue may be in plant, or in organ, tissue or cell culture.
- the recombinant DNA molecule carrying the synthetic LexA DNA binding domain of the invention and optionally a structural gene under promoter control can be introduced into plant tissue by any means known to those skilled in the art.
- the technique used for a given plant species or specific type of plant tissue depends on the known successful techniques. As novel means are developed for the stable insertion of foreign genes into plant cells and for manipulating the modified cells, skilled artisans will be able to select from known means to achieve a desired result.
- Means for introducing recombinant DNA into plant tissue include, but are not limited to, direct DNA uptake (Paszkowski, J. et al. (1984) EMBO J. 3,2717), electroporation (Fromm, M., et al. Proc. Natl.
- T-DNA mediated transfer from Agrobacterium tumefaciens to the plant tissue which techniques are known in the art.
- T-DNA transformation There appears to be no fundamental limitation of T-DNA transformation to the natural host range of Agrobacterium. Representative T-DNA vector systems are described in the following references: An, G. et al. EMBO J. 4, 277 (1985); Herrera-Estrella, L. et al., Nature 303, 209 (1983); Herrera-Estrella, L. et al. EMBO J.
- the expression of the structural gene may be assayed by any means known to the art, and expression may be measured as mRNA transcribed or as protein synthesized, as provided herein.
- Transgenic plants comprising the synthetic LexA DNA binding domain of the present invention (as present, for example, in a DNA construct of the present invention, or a transformed cells of the present invention) are also an aspect of the present invention.
- Procedures for cultivating transformed cells to useful cultivars are known to those skilled in the art. Techniques are known for the in vitro culture of plant tissue, and in a number of cases, for regeneration into whole plants.
- a further aspect of the invention are plant tissue, plants or seeds containing the chimeric DNA sequences described above. Preferred are plant tissues, plants or seeds containing those chimeric DNA sequences which are mentioned as being preferred.
- transgenic plants comprising the synthetic LexA DNA binding domain of the present invention.
- transgenic plants is intended to refer to plants that have incorporated DNA sequences, including but not limited to genes which are perhaps not normally present, DNA sequences not normally transcribed into RNA or translated into a protein (“expressed” ), or any other genes or DNA sequences which one desires to introduce into the non-transformed plant, such as genes which may normally be present in the non-transformed plant but which one desires to either genetically engineer or to have altered expression. It is contemplated that in some instances the genome of transgenic plants of the present invention will have been augmented through the stable introduction of the transgene. However, in other instances, the introduced gene will replace an endogenous sequence.
- transgenic plant of the present invention may be prepared by the process of creating a translational fusion protein gene and introducing the fusion into a plant.
- a plant to be transformed by the methods of the invention is a wild-type Arabidopsis.
- a specific illustration of the method of the invention would involve constructing gene fusions comprising the LexA DNA binding domain optimized for usage in Arabidopsis thaliana and a DNA segment encoding an exogenous protein one desires to express, and introducing the fusion into wild-type Arabidopsis.
- the gene introduced into the transformed plant line will comprise an exogenous protein gene operatively linked to its own promoter or another promoter that is active in plants.
- This gene will contain the cloned LexA DNA binding domain optimized for Arabidposis thaliana in a DNA construct for the purpose of increasing the expression of that exogenous protein gene.
- the transformed cells identified by selection or screening and cultured in an appropriate medium that supports regeneration as provided herein, will then be allowed to mature into plants.
- Plants are preferably matured either in a growth chamber or greenhouse. Plants are regenerated from about 6 weeks to 10 months after a transformant is identified, depending on the initial tissue. During regeneration, cells are grown on solid media in tissue culture vessels. Illustrative embodiments of such vessels are petri dishes and Plant Con®s. After the regenerating plants have reached the stage of shoot and root development, they may be transferred to a greenhouse for further growth and testing. Progeny may be recovered from the transformed plants and tested for expression of the exogenous expressible gene by localized application of an appropriate substrate to plant parts such as leaves.
- the regenerated plants are screened for transformation by standard methods illustrated below. Progeny of the regenerated plants is continuously screened and selected for the continued presence of the integrated DNA sequence in order to develop improved plant and seed lines.
- the DNA sequence can be moved into other genetic lines by a variety of techniques, including classical breeding, protoplast fusion, nuclear transfer and chromosome transfer.
- identifying the cells exhibiting successful or enhanced expression of a heterologous gene for further culturing and plant regeneration generally occurs.
- a selectable or screenable marker gene as, or in addition to, the expressible gene of interest.
- “Screening” generally refers to identifying the cells exhibiting expression of a heterologous gene that has been transformed into the plant. Usually, screening is carried out to select successfully transformed seeds (i.e., transgenic seeds) for further cultivation and plant generation (i.e., for the production of transgenic plants). As mentioned above, in order to improve the ability to identify transformants, one may desire to employ a selectable or screenable marker gene as, or in addition to, the heterologous gene of interest. In this case, one would then generally assay the potentially transformed cells, seeds or plants by exposing the cells, seeds, plants, or seedlings to a selective agent or agents, or one would screen the cells, seeds, plants or tissues of the plants for the desired marker gene.
- transgenic cells, seeds or plants may be screened under selective conditions, such as by growing the seeds or seedlings on media containing selective agents, such as antibiotics (e.g., hygromycin, kanamycin, paromomycin or BASTA®), the successfully transformed plants having been transformed with genes encoding resistance to such selective agents.
- selective agents such as antibiotics (e.g., hygromycin, kanamycin, paromomycin or BASTA®), the successfully transformed plants having been transformed with genes encoding resistance to such selective agents.
- assays include, for example, molecular biological assays, such as Southern and Northern blotting and PCR; biochemical assays, such as detecting the presence of a protein product, e.g., by immunological means (ELISAs and Western blots) or by enzymatic function; by plant part assays, such as leaf or root assays; and also, by analyzing the phenotype of the whole regenerated plant.
- molecular biological assays such as Southern and Northern blotting and PCR
- biochemical assays such as detecting the presence of a protein product, e.g., by immunological means (ELISAs and Western blots) or by enzymatic function
- plant part assays such as leaf or root assays
- analyzing the phenotype of the whole regenerated plant include, for example, molecular biological assays, such as Southern and Northern blotting and PCR; biochemical assays, such as detecting the presence of a protein product, e.
- Assays for the production and identification of specific proteins may make use of physical-chemical, structural, functional, or other properties of the proteins.
- Unique physical-chemical or structural properties allow the proteins to be separated and identified by electrophoretic procedures, such as native or denaturing gel electrophoresis or isoelectric focusing, or by chromatographic techniques such as ion exchange or gel exclusion chromatography.
- the unique structures of individual proteins offer opportunities for use of specific antibodies to detect their presence in formats such as an ELISA assay. Combinations of approaches may be employed with even greater specificity such as Western blotting in which antibodies are used to locate individual gene products that have been separated by electrophoretic techniques. Additional techniques may be employed to absolutely confirm the identity of the product of interest such as evaluation by amino acid sequencing following purification. Although these techniques are among the most commonly employed, other procedures are known in the art and may be additionally used.
- the result of the translation back was a DNA sequence of 258 nucleic acid residues.
- the sequence representing one strand of a double-stranded DNA encoding the LexA DNA binding domain was divided into three segments of 84, 90 and 84 nucleotides, respectively.
- the DNA sequence of 258 nucleic acid residues representing the other (i.e., complementary) strand of the double-stranded DNA encoding the LexA DNA binding domain was divided into four segments of 51, 81, 81 and 45 nucleic acid residues.
- the DNA sequences of the three segments of the first strand were as follows: AGCTTCATATGAAGGCTCTTACCGCTAGACAGCAGGAGGTTTTCGATCTTATC [SEQ ID NO:3] AGAGATCACATCTCTCAGACCGGAATGCCACCAACCAGA GCTGAGATCGCTCAGAGACTTGGATTCAGATCTCCAAACGCTGCTGAGGAGC [SEQ ID NO:4] ACCTTAAGGCTCTTGCTAGAAAGGGAGTTATCGAGATC GTTTCTGGAGCTTCTAGAGGAATCAGACTTCTTCAGGAGGAGGAGGAGGGA [SEQ ID NO:5] CTTCCACTTGTTGGAAGAGTTGCTGCTGGAGAGG
- the DNA sequences of the four segments of the second strand were as follows: ATCTCTGATAAGATCGAAAACCTGCTGCTGTCTAGCGGTAAGAGCCTTCATATGA [SEQ ID NO:6] CTCAGCAGCGTTTGGAGATCTGAATCCAAGTCTCTGAGCGATCTCAGCTCTG [
- the seven oligomers were then annealed together.
- the desiccated oligomers were dissolved at high concentration (10 OD 260 units/100 ⁇ l) in STE buffer (50 mM NaCl, 10 mM Tris pH 8.0, 1 mM EDTA).
- Equal volumes (12 ⁇ l each) were mixed in a 1.5 ml centrifuge tube and heated in a water bath to 94° C., then slowly cooled down to room temperature over about 3 hours by “unplugging” the water bath (i.e., by allowing the water bath to reach room temperature naturally).
- the DNA was precipitated by adding 0.5 volumes (42 ⁇ l) of 7.5 M ammonium acetate and 0.03 volumes of 1 M MgCl 2 and 2.5 volumes of ethanol with mixing, followed by incubation at 4° C. for 12 hours, and centrifugation for 15 minutes at room temperature (RT) at 14,000 rpm.
- the DNA pellet was washed once with 500 ⁇ l of 70% ethanol, then air-dried and resuspended in 100 ⁇ l TE (10 mM Tris pH 8.0, 1 mM EDTA).
- Annealing of the seven oligonucleotides yielded the artificial gene sequence provided above as SEQ ID NO: 1, which translate into the artificial protein (amino acid sequence) set forth above as SEQ ID NO: 2.
- the annealed DNA was then used for cloning into HindIII and BamHI sites of the plasmid pUCNLSTBP1 (SEQ ID NO: 11), illustrated in FIG. 3.
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Molecular Biology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biophysics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Cell Biology (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
A synthetic nucleotide sequence encodes the LexA DNA binding domain, the nucleotide sequence having been modified to bring the codon usage in conformity with the preferred codon usage of Arabidopsis thaliana. The preferred sequence of the gene is provided as SEQ ID NO: 1. DNA constructs, transformed host cells and transgenic plants comprising the synthetic nucleotide sequence are also provided.
Description
- This application is a divisional of U.S. patent application Ser. No. 09/401,171, filed on Sept. 22, 1999, the contents of which is incorporated by reference.
- This invention relates to increasing the expression of heterologous or chimeric proteins in plants, and more specifically to increasing protein expression in Arabidopsis species and other dicotyledons.
- One of the primary goals of plant genetic research and development is the production of transgenic plants that express a heterologous gene (i.e., produce a “foreign” protein) in an amount sufficient to confer a desired phenotype to the plant. While significant advances have been made in pursuit of this goal, the expression of certain heterologous genes in transgenic plants remains problematic. It is thought that numerous factors are involved in determining the ultimate level of expression of a heterologous gene in a plant. The amount of protein that is synthesized from a gene is a function of several complex and interrelated events, including transcription, RNA maturation, translation, and post-translational modification. Each of these processes is comprised of a large number of events, all of which are potentially regulated either independently or in concert.
- The genetic code is considered “degenerate” in that more than one nucleic acid triplet (i.e., a “codon”) encodes the same amino acid. Many amino acids may be coded for by several different codons. In general, genes within a taxonomic group exhibit similarities in codon choice, regardless of the function of these genes. Thus, an estimate of the overall use of the genetic code by a taxonomic group can be obtained by summing codon frequencies of all its sequenced genes. Variation between degenerate base frequencies is not a neutral phenomenon, since systematic codon preferences have been reported for bacterial, yeast, plant and mammalian genes. Bias in codon choice within genes in a single species appears related to the level of expression of protein encoded by any particular gene.
- Codon bias is most extreme in highly expressed proteins of bacteria (e.g., E. coli) and yeast. In unicellular organisms, highly expressed genes use a smaller subset of codons than do weakly expressed genes, although the codons preferred are distinct in some cases. Sharp and Li, Nucl. Acids Res. 14, 7734-7749 (1986), report that codon usage in 165 E. coli genes reveals a positive correlation between high expression and increased codon bias. In these organisms, a strong positive correlation has been reported between the abundance of an isoaccepting tRNA species and the favored synonymous codon. For example, in one group of highly expressed proteins in yeast, over 96% of the amino acids are encoded by only 25 of the 61 available codons. See Bennetzen and Hall, J. Biol. Chem. 257, 3026-3031. (1982). These 25 codons are preferred in all sequenced yeast genes, but the degree of preference varies with the level of expression of the genes. Biased codon choice in highly expressed genes appears to enhance translation, and is required for maintaining mRNA stability in yeast. It has been proposed that the good fit of abundant yeast and E. coli mRNA codon usage to isoacceptor tRNA abundance promotes high translation levels and high steady state levels of these proteins. These results strongly suggest that the potential for high levels of expression of plant genes in yeast or E. coli is limited by their codon usage, and conversely that high levels of expression of E. Coli or yeast genes in plant cells is similarly limited by the preferred codon usage of the different organisms.
- Although plant codon usage patterns are distinct from those reported for bacteria, yeast, and animals, in general plant codon usage pattern more closely resembles that of higher eukaryotes than unicellular organisms, due to the overall preference for G+C content in codon position III. Moreover, analysis of a large group of plant gene sequences indicates that synonymous codons are used differently by monocots and dicots. Wilbur et al., Plant Physiol. 92: 1-11 (1990), describes the difference in codon usage between bacteria and higher plants such as dicotyledonous and monocotyledonous plants. For example, the codon usages for codons XCG and XUA are 1.8% and 3.2% in dicotyledonous plants and 6.3% and 1.4% in monocotyledonous plants. The combined codon usage for codons XXC and XXG (hereinafter, referred to as the codon XXC/G usage, wherein each of the two Xs is independently selected from the group consisting of A, G, C and T) is 45% in dicotyledon and 73.5% in monocotyledon. It is well established that GC content in genes which can be translated is higher in monocotyledon such as gramineous plants, e.g., rice plants, than in dicotyledons. As to bacteria, the codon usage apparently varies by strain
- In this regard, investigators have determined that typical plant structural coding sequences preferentially utilize certain codons to encode certain amino acids in a different frequency than the frequency of usage appearing in bacterial or other non-plant coding sequences. Thus, it has been suggested that the differences between the typical codon usage present in plant coding sequences as compared to the typical codon usage present in non-plant coding sequence is a factor contributing to the low levels of non-plant mRNA and non-plant protein produced in transgenic plants. These differences in codon usage may contribute to the low levels of mRNA or protein expressed by the non-plant coding sequence in a transgenic plant by affecting the transcription or translation of the coding sequence or proper mRNA processing.
- Recently, attempts have been made to alter the structural coding sequence of a desired polypeptide or protein in an effort to enhance its expression in the plant. In particular, investigators have altered the codon usage of heterologous, structural coding sequences (i.e., heterologous genes) in an attempt to enhance their expression in plants. Most notably, the sequence encoding insecticidal crystal proteins of Bacillus thuringiensis (Bt) has been modified in various ways to enhance its expression in a plant, particularly monocotyledonous plants, to produce commercially viable insect-tolerant plants.
- U.S. Pat. No. 5,380,831 to Adang et al. describes synthetic Bt genes designed to be expressed at a level higher than naturally-occurring Bt genes. The genes utilize codons preferred in highly expressed monocot or dicot protein. Specifically, the synthetic genes, while about 85% homologous to the native bacterial sequence, are chemically modified to contain codons that are preferred by highly expressed plant genes, and to eliminate undesirable sequences that cause destabilization, termination of RNA, secondary structures and RNA splice sites.
- U.S. Pat. No. 5,436,391 to Fujimoto et al. describes a synthetic gene encoding the insecticidal protein Bt. The gene is provided having a base sequence which has been modified to bring the codon usage in conformity with the genes of graminaceous plants, particularly rice plants (e.g., oryza).
- U.S. Pat. No. 5,689,052 to Brown et al. describes a method for modifying a foreign nucleotide sequence for enhanced accumulation of its protein product in a monocotyledonous plant, and/or increasing the frequency of obtaining transgenic monocotyledonous plants which accumulate useful amounts of a transgenic protein, by reducing the frequency of the rare and semi-rare monocotyledonous codons in the foreign gene and replacing them with more preferred monocotyledonous codons.
- Another approach to altering the codon usage of a Bt toxin gene to enhance its expression in plants is described in U.S. Pat. No. 5,500,365 to Fischhoff et al. Here, the synthetic plant gene was prepared by modifying the coding sequence to remove all ATTTA sequences and certain identified putative polyadenylation signals. Moreover, the gene sequence was scanned to identify regions with greater than four consecutive adenine or thymine nucleotides. If there were more than one of the minor polyadenylation signals identified within ten nucleotides of each other, then the nucleotide sequence of this region was altered to remove these signals while maintaining the original encoded amino acid sequence. The overall G+C content was also adjusted to provide a final sequence having a G+C ratio of about 50%. Similarly, U.S. Pat. No. 5,877,306 Cornelissen et al. discloses a method of modifying a DNA sequence encoding a Bt crystal protein toxin wherein the gene was modified by reducing the A+T content. This was accomplished by changing the adenine and thymine bases to cytosine and guanine, while maintaining a coding sequence for the original protein toxin.
- While the foregoing examples emphasize the modification or optimization of codon usage in heterologous structural genes (i.e., the genes encoding a desired protein product, such as Bt toxin), the modification of regulatory elements that control transcription by optimizing codon usage in the host plant has not been emphasized. It is widely recognized that the upstream regulatory elements that control transcription and translation have very significant roles in determining the quantity, timing, and tissue specificity of gene expression. Various nucleotide sequences other than the heterologous structural coding sequence affect the expression levels of a foreign DNA sequence introduced into a plant, including promoter sequences, intron sequences, 3′ untranslated sequences, polyadenylation sites, and other regulatory sequences.
- In view of the foregoing, activation and control of transcription are processes that may desirably be manipulated in order to achieve altered (i.e., increased or decreased) expression of a heterologous structural gene in a plant cell. Transcription can be activated through the use of two functional domains of a transcription activation moiety: a domain (i.e., sequence of amino acids) that recognizes and binds to a specific site or sequence of nucleotides on a target DNA, (the DNA binding domain); and a domain that is capable of activating transcription of the DNA when physically associated with the DNA-binding domain and which may be necessary for activation of the target gene (the activation domain). See Keegan, et al., Science 231, 669-704 (1986); Ma and Ptashne, Cell 48, 847-853 (1987). The two functional domains may be derived from a single transcription activation protein. Alternatively, it has been shown that these two functions can also reside on separate proteins. See McKnight et al., Proc. Natl. Acad. Sci. USA 89, 7061-7065 (1987); Curran et al. 55, 395-397 (1988). The transcription activation domains may also be derived from synthetic DNA-binding and transcription activation proteins.
- Additional flexibility in controlling heterologous gene expression in plants may be obtained by using DNA binding domains and response elements from heterologous sources (i.e., DNA binding domains from non-plant sources). Some examples of such heterologous DNA binding domains include the LexA and GAL4 DNA binding domains. The LexA DNA-binding domain is part of the repressor protein LexA from Escherichia coli (E. Coli) (Brent and Ptashne, Cell 43:729-736 (1985)).
- Although the LexA DNA binding domain functions as an efficient DNA binding domain in its natural bacterial host, when transferred by recombinant DNA technology into higher eukaryotes such as plants, the domain is not efficiently expressed. Accordingly, it would be desirable to alter the LexA DNA binding domain to increase its expression in higher eukaryotes such as plants.
- Certain objects, advantages and novel features of the invention will be set forth in the description that follows, and will become apparent to those skilled in the art upon examination of the following, or may be learned with the practice of the invention.
- It is an object of the invention to provide a synthetic LexA DNA binding domain optimized for codon usage in plants, more specifically in dicots, and most specifically in Arabidopsis thaliana.
- The invention relates to adapting the codons of the DNA binding domain of the LexA gene from E. Coli to the codon usage of Arabidopsis thaliana. This method is advantageous in that it allows for the increased expression of heterologous and chimeric proteins containing this artificial DNA binding domain.
- Additional aspects of this invention include constructs (i.e., vectors, DNA fusions and polynucleotides), comprising the synthetic DNA sequence of the present invention. These constructs are useful for increasing heterologous protein expression in plant cells. Further aspects of the invention are cells, plant lines, and transgenic plants transformed with the described constructs. Methods of increasing expression of heterologous proteins in a cell or a transgenic plant are an additional aspect of the present invention.
- The foregoing and other aspects of the present invention are explained in detail in the specification set forth below.
- FIG. 1 sets forth the nucleotide and amino acid sequences of the LexA DNA binding domain optimized for Arabidopsis thaliana codon usage.
- FIG. 2 sets forth the nucleotide sequences of the seven oligonucleotides (SEQ ID NO: 3-9) that were annealed together to arrive at the LexA DNA binding domain optimized for Arabidopsis thaliana codon usage. SEQ ID NO: 10 is the LexA activation domain to which the LexA binding domain normally binds.
- FIG. 3 is a schematic representation of the plasmid pUCNLSTBP1.
- The present invention will now be described more fully hereinafter with reference to the accompanying figures, in which preferred embodiments of the invention are shown. This invention may, however, be embodied in different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.
- Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety.
- Amino acid sequences disclosed herein are presented in the amino to carboxy direction, from left to right. The amino and carboxy groups are not presented in the sequence. Nucleotide sequences are presented herein by single strand only, in the 5′ to 3′ direction, from left to right. Nucleotides and amino acids are represented herein in the manner recommended by the IUPAC-IUB Biochemical Nomenclature Commission, or (for amino acids) by three letter code, in accordance with 37 CFR §1.822 and established usage. See, e.g., Patent In User Manual, 99-102 (November 1990) (U.S. Patent and Trademark Office).
- The term “amino acid sequence,” as used herein, refers to either a naturally occurring or a synthetic oligopeptide, peptide, polypeptide, or protein sequence, and fragments thereof. Where “amino acid sequence” is recited herein to refer to an amino acid sequence of a naturally occurring protein molecule, the term “amino acid sequence,” and like terms, are not meant to limit the amino acid sequence to the complete, native amino acid sequence associated with the recited protein molecule.
- The term “nucleic acid sequence” as used herein refers to a nucleotide, oligonucleotide, or polynucleotide, and fragments thereof, and to DNA or RNA of genomic or synthetic origin which may be single- or double-stranded, and which may represent a sense or antisense strand.
- “Chemically synthesized,” as related to a sequence of DNA, means that the component nucleotides are assembled in vitro. Manual chemical synthesis of DNA may be accomplished using well established procedures (Caruthers, M., in Methodology of DNA and RNA Sequencing, Chapter 1 (Weissman (ed.), Praeger Publishers, New York, (1983)). Alternatively, automated chemical synthesis of DNA can be performed using one of a number of commercially available apparatus.
- As used herein, the term “LexA” or “LexA binding domain” refers to a protein or domain that is naturally encoded in E. coli by the lexA gene, which domain normally binds to the DNA sequence CATACTGTATGAGCATACAG (the LexA activation domain) [SEQ ID NO: 10]. The term “LexA binding domain adapted (or optimized) for Arabidopsis thaliana usage” means a protein domain in which the codons encoding the domain have been modified from the natural E. Coli sequence in order to maximize DNA binding (and accordingly, transcription) in Arabidopsis thaliana.
- As used herein, the term “expression” refers to the transcription and translation of a heterologous or homologous gene to yield the protein encoded by the gene.
- “Heterologous” is used to indicate that a nucleic acid sequence (e.g., a gene) or a protein has a different natural origin or source with respect to its current host. “Heterologous” is also used to indicate that one or more of the domains present in a protein differ in their natural origin with respect to other domains present.
- As used herein, a “structural gene” is that portion of a gene comprising a DNA segment encoding a protein, polypeptide or a portion thereof, and excluding the 5′ sequence which drives the initiation of transcription. The structural gene may be one which is normally found in the cell or one which is not normally found in the cellular location wherein it is introduced, in which case it is termed a heterologous gene. A heterologous gene may be derived in whole or in part from any source known to the art, including a bacterial genome or episome, eukaryotic, nuclear or plasmid DNA, cDNA, viral DNA or chemically synthesized DNA. A structural gene may contain one or more modifications in either the coding or the untranslated regions which could affect the biological activity or the chemical structure of the expression product, the rate of expression or the manner of expression control. Such modifications include, but are not limited to, mutations, insertions, deletions and substitutions of one or more nucleotides. The structural gene may constitute an uninterrupted coding sequence or it may include one or more introns, bounded by the appropriate splice junctions. The structural gene may be a composite of segments derived from a plurality of sources, either naturally occurring or synthetic, or both. When synthesizing a gene for improved expression in a host cell it is desirable to design the gene such that its frequency of codon usage approaches the frequency of preferred codon usage of the host cell.
- As used herein, the term “chimeric protein” is used to indicate that the protein is comprised of domains, at least one of which has an origin or source that is heterologous with respect to the other domains present. Chimeric proteins are encoded by nucleotide sequences that have been fused or ligated together, resulting in a coding sequence that does not occur naturally. “Chimeric sequences” or genes refer to nucleic acid sequences containing at least two heterologous parts, e.g., parts derived from naturally occurring nucleic acid sequences that are not associated in their naturally occurring states, or containing at least one part that is of synthetic origin and not found in nature.
- To improve expression of the DNA-binding domain of the LexA repressor protein in plants, the codon usage of the natural, E. Coli LexA DNA binding domain was modified and optimized to the codon usage of Arabidopsis thaliana. The characteristics of codon usage for Arabidopsis thaliana is based on the report of Wada et al., “Codon Usage Tabulated From The GenBank Genetic Sequence Data,” Nucleic Acids Research 19 (Supp.) 1981-1986 (1991). Table 1, below, based upon the data provided in the Wada et al. article, illustrates the differences between the codons preferred by E. Coli for each amino acid vs. the codons preferred by A. thaliana for each amino acid (the codon listed in the table being the most preferred codon, according to the Wada et al. data).
- In order to construct a synthetic DNA sequence in accordance with the present invention, the amino acid sequence of the E. Coli LexA DNA binding domain is determined and back-translated into all the available codon choices for each amino acid. The amino acid sequence of the protein can be analyzed using commercially available computer software such as the BACKTRANSLATE® program of the GCG Sequence Analysis software package. After the sequence is back-translated, the codons that are preferentially and optimally used in Arabidposis thaliana are substituted for the naturally occurring codons of the E Coli. LexA binding domain. In other words, for each amino acid of the LexA DNA binding domain that may be encoded for by more than one codon, the codon preferred by Arabidopsis thaliana is placed in the sequence in place of the original codon. The result of the back-translation of the LexA DNA binding protein into codons that are preferred by Arabidopsis thaliana, when all codons of the E. Coli LexA binding domain are replaced with the preferred codons of A. thaliana, results in a DNA sequence of 258 nucleotides that encodes the LexA DNA binding domain optimized for use in Arabidopsis thaliana.
TABLE 1 Preferred Codon Usage of E. Coli vs. A. thaliana. Data taken from Wada et al., “Codon Usage Tabulated From The GenBank Genetic Sequence Data,” Nucleic Acids Research 19 (Supp.) 1981-1986 (1991) Preferred Preferred Amino Acid Codon For E. Coli Codon for A. thaliana ARG CGT AGA LEU CTG CTT SER AGC TCT THR ACC ACC PRO CCG CCA ALA GCG GCT GLY GGC GGA VAL GTT GTT LYS AAA AAG ASN AAC AAC GLN CAG CAG HIS CAT CAC GLU GAA GAG ASP GAT GAT TYR TAT TAC CYS TGC TGC PHE TTT TTC ILE ATC ATC MET ATG ATG TRP TGG TGG - As further explained herein in the Examples, and in one preferred embodiment of the invention, after the DNA sequence encoding the LexA DNA binding domain completely optimized for use in Arabidopsis thaliana is determined, the 258 nucleotides representing one strand of the DNA is arbitrarily divided into three oligonucleotide sequences. The 258 nucleotides representing the other, complementary strand of the DNA is arbitrarily divided into four oligonucleotide sequences. The oligonucleotide sequences are then chemically synthesized into seven oligomers, each being phosphorylated at their 5′ end. In addition, four of the oligomers have additional nucleotides added to the ends in order to create “sticky ends” (restriction sites) for use in ligating the DNA into recombinant vectors. After the seven oligomers are synthesized, the oligomers are annealed together, to yield the following synthetic DNA sequence encoding the LexA DNA binding domain codon optimized for usage in Arabidopsis thaliana:
- ATGAAGGCTCTTACCGCTACCGCTAGACAGCAGGAGGTTTTCGATCTTATCAGAGATCACATCT CTCAGACCGGAATGCCACCACCAACCAGAGCTGAGATCGCTCAGAGACTTGGATTCAGAT CTCCAAACGCTGCTGAGGAGCACCTTAAGGCTCTTGCTAGAAAGGGAGTTATCGAGA TCGTTTCTGGAGCTTCTAGAGGAATCAGACTTCTTCAGGAGGAGGAGGAGGGACTTC CACTTGTTGGAAGAGTTGCTGCTGGAGAG (SEQ ID NO: 1, also set forth in FIG. 1)
- which DNA sequence translates into the following amino acid sequence of the LexA DNA binding domain codon optimized for Arabidopsis thaliana:
- Met-Lys-Ala-Leu-Thr-Ala-Arg-Gln-Gln-Glu-Val-Phe-Asp-Leu-Ile-Arg-Asp-His-Ile-Ser-Gln-Thr -Gly-Met-Pro-Pro-Thr-Arg-Ala-Glu-Ile-Ala-Gln-Arg-Leu-Gly-Phe-Arg-Ser-Pro-Asn-Ala-Ala -Glu-Glu-His-Leu-Lys-Ala-Leu-Ala-Arg-Lys-Gly-Val-Ile-Glu-Ile-Val-Ser-Gly-Ala-Ser-Arg-Gly -Ile-Arg-Leu-Leu-Gln-Glu-Glu-Glu-Glu-Gly-Leu-Pro-Leu-Va-IGly-Arg-Val-Ala-Ala-Gly-Glu (SEQ ID NO: 2, also set forth in FIG. 1).
- The foregoing description describes a particularly preferred embodiment of the invention in which every codon encoding the normal, E. Coli LexA DNA binding domain is substituted with a corresponding codon that is preferred by A. thaliana (i.e., is completely or 100 percent optimized for usage by A. thaliana), when the two codons are different (i.e., when a particular amino acid may be encoded by more than one codon, and a codon preferred by E. Coli differs from one preferred by A. thaliana). The invention also encompasses synthetic nucleotide sequences and proteins that are partially (i.e., less than 100 percent) optimized for usage by A. thaliana, and the uses therefor. In this alternative embodiment of the invention, using the methods described above, less than all of amino acids of the LexA DNA binding domain that may be encoded for by more than one codon is replaced in the synthetic sequence with a codon preferred by A. thaliana. Preferably, more than about 50 percent of the codons of the LexA binding domain are replaced in the synthetic sequence with a codon preferred by A. thaliana; more preferably, more than about 80 percent of the codons of the LexA binding domain are replaced in the synthetic sequence with a codon preferred by A. thaliana; most preferably, 100 percent of the codons of the LexA binding domain are codons that are preferred by A. thaliana.
- The synthetic nucleotide sequences and proteins set forth in the present invention are described as being optimized for usage in Arabidopsis species, particularly Arabidopsis thaliana; however, the skilled artisan will appreciate that the synthetic nucleotide sequences and proteins encoding the Lex A DNA binding domain optimized for Arabidopsis usage are useful in non-bacterial species other than Arabidopsis thaliana. The LexA DNA binding domain set forth herein will be more efficiently expressed in higher eukaryotes (e.g., plants and animals), and more specifically will be more efficiently expressed in dicotyledenous plants, which include but are by no means limited to species of legumes (from the family Fabaceae), including soybean, peanut, and alfalfa; species of the Solanaceae family such as tomato, eggplant and potato; species of the family Brassicaceae such as cabbage, turnips and rapeseed; species of the family Rosaceae such as apples, pears and berries; and members of the families Cucurbitaceae (cucumbers), Chenopodiaceae (beets) and Umbelliferae (carrots).
- The present invention provides an advantageously modified DNA binding domain for the enhanced expression of desired heterologous protein genes in transgenic plants. To this end, one embodiment of the present invention is a DNA construct comprising a DNA sequence encoding the LexA synthetic DNA binding domain of the invention. Such DNA constructs accordingly provide for the preparation of stably transformed cells expressing heterologous protein, which transformed cells are also an aspect of the invention. Still further, the synthetic DNA binding domain of the present invention provides for the subsequent regeneration of fertile, transgenic plants and progeny containing desired heterologous protein genes. These aspects of the invention are further described herein below.
- DNA constructs (also referred to herein as DNA vectors) of the present invention comprise the nucleotide sequence of the synthetic Lex A binding domain described herein, which nucleotide sequence is preferably the sequence provided herein as SEQ ID NO: 1. The preparation of DNA constructs is well known in the art. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual (1989). The DNA constructs of the present invention are useful in the transformation of cells (e.g., plant cells), and thus useful in the expression of heterologous genes in the cells. The expression of a heterologous DNA sequence (i.e., gene) in a plant requires proper transcriptional initiation regulatory regions that are recognized in the host plant to be transformed, with the regions linked in a manner which permits the transcription of the coding sequence and subsequent processing in the nucleus. Thus, a DNA construct preferably contains some or all of the necessary elements to permit the transcription and ultimate expression of the coding sequence in the host plant.
- DNA constructs of the present invention may contain suitable promoters for the expression of heterologous genes in plants. The term “promoter” refers to the nucleotide sequences at the 5′ end of a structural gene which direct the initiation of transcription. Generally, promoter sequences are necessary, but not always sufficient, to drive the expression of a downstream gene. In the construction of heterologous promoter/structural gene combinations, the structural gene is placed under the regulatory control of a promoter such that the expression of the gene is controlled by promoter sequences. The promoter is positioned preferentially upstream to the structural gene and at a distance from the transcription start site that approximates the distance between the promoter and the gene it controls in its natural setting. As is known in the art, some variation in this distance can be tolerated without loss of promoter function. As used herein, the term “operatively linked” means that a promoter is connected to a coding region in such a way that the transcription of that coding region is controlled and regulated by that promoter. Means for operatively linking a promoter to a coding region are well known in the art.
- For expression in plants, suitable promoters must be chosen for the host cell, the selection of which promoters is well within the skill of one knowledgeable in the art. Promoters useful in the practice of the present invention include, but are not limited to, constitutive, inducible, temporally regulated, developmentally regulated, chemically regulated, tissue-preferred and tissue-specific promoters.
- Numerous promoters are known or are found to facilitate transcription of RNA in plant cells and can be used in the DNA construct of the present invention. Examples of suitable promoters include the nopaline synthase (NOS) and octopine synthase (OCS) promoters, the light-inducible promoter from the small subunit of ribulose bis-phosphate carboxylase promoters, the CaMV 35S and 19S promoters, the full-length transcript promoter from Figwort mosaic virus, histone promoters, tubulin promoters, or the mannopine synthase promoter (MAS). The promoter may also be one that causes preferential expression in a particular tissue, such as leaves, stems, roots, or meristematic tissue, or the promoter may be inducible, such as by light, heat stress, water stress or chemical application or production by the plant. Exemplary green tissue-specific promoters include the maize phosphoenol pyruvate carboxylase (PEPC) promoter, small submit ribulose bis-carboxylase promoters (ssRUBISCO) and the chlorophyll a/b binding protein promoters.
- Additional promoters useful in the present invention include but are not limited to one of several of the actin genes, which are known to be expressed in most cell types. Yet another constitutive promoter useful in the practice of the present invention is derived from ubiquitin, which is another gene product known to accumulate in many cell types. The ubiquitin promoter has been cloned from several species for use in transgenic plants (e.g., sunflower (Binet et al., Plant Science 79: 87-94 (1991); and maize (Christensen et al., Plant Molec. Biol. 12, 619-632 (1989)). Further useful promoters are the U2 and U5 snRNA promoters from maize (Brown et al., Nucleic Acids Res. 17, 8991 (1989)) and the promoter from alcohol dehydrogenase (Dennis et al., Nucleic Acids Res. 12, 3983 (1984)).
- Tissue-specific or tissue-preferential promoters useful in the present invention in plants are those which direct expression in root, pith, leaf or pollen. Such promoters are disclosed in U.S. Pat. No. 5,625,136 (herein incorporated by reference in its entirety). Also useful are promoters which confer seed-specific expression, such as those disclosed by Schernthaner et al., EMBO J. 7: 1249 (1988); anther-specific promoters ant32 and ant43D; anther (tapetal) specific promoter B6 (Huffman et al., J. Cell. Biochem. 17B, Abstract #D209 (1993)); and pistil-specific promoters such as a modified S13 promoter (Dzelkalns et al., Plant Cell 5,855 (1993)).
- Other plant promoters may be obtained, preferably from plants or plant viruses, and may be utilized so long as the selected promoter is capable of causing sufficient expression in a plant resulting in the production of an effective amount of the desired protein. Preferred constitutive promoters include but are not limited to the CaMV 35S and 19S promoters (see U.S. Pat. No. 5,352,605, the disclosure of which is incorporated herein in its entirety). Any promoter used in the present invention may be modified, if desired, to alter their control characteristics. For example, the CaMV 35S or 19S promoters may be enhanced by the method described in Kay, et al. Science (1987) Vol. 236, pp.1299-1302.
- The DNA sequences that comprise the DNA constructs of the present invention are preferably carried on suitable vectors, which are known in the art. Preferred vectors for are plasmids that may be propagated in a plant cell. Particularly preferred vectors for transformation are those useful for transformation of plant cells or of Agrobacteria, as described further below. For Agrobacterium-mediated transformation, the preferred vector is a Ti-plasmid derived vector. Other appropriate vectors which can be utilized as starting materials are known in the art. Suitable vectors for transforming plant tissue and protoplasts have been described by deFramond, A. et al., Bio/Technology 1, 263 (1983); An, G. et al., EMBO J. 4, 277 (1985); and Rothstein, S. J. et al., Gene 53, 153 (1987). In addition to these, many other vectors have been described in the art which are suitable for use as starting materials in the present invention.
- The DNA encoding the synthetic LexA binding domain of the present invention, and the DNA constructs comprising them, have applicability to any structural gene that is desired to be introduced into a plant to provide any desired characteristic in the plant, such as herbicide tolerance, virus tolerance, insect tolerance, disease tolerance, drought tolerance, or enhanced or improved phenotypic characteristics such as improved nutritional or processing characteristics.
- In a particularly preferred embodiment of the invention, DNA constructs of the present invention also comprise DNA sequences encoding transactivation domains. Transactivation domains can be defined as amino acid sequences that, when combined with the DNA binding domain, increase productive transcription initiation by RNA polymerases. (See generally Ptashne, Nature 335, 683-689 (1988)). Different transactivation domains are known to have different degrees of effectiveness in their ability to increase transcription initiation. In the present invention it is desirable to use transactivation domains that have superior transactivating effectiveness in plant cells in order to create a high level of target polypeptide expression in response to the presence of chemical ligand.
- Transactivation domains that have been shown to be particularly effective in the method of the present invention include but are not limited to VP16 (isolated from the herpes simplex virus), C1 (isolated from maize), and Thm18 (isolated from tomato). One preferred example for the use of codon-optimized LexA DNA-binding domain is the in-frame fusion of this DNA-binding domain to other domains that have other functions; for example, the fusion to transcription activator domains like VP16 from Herpes simplex or Thm18 from tomato, or the fusion to TATA-box binding proteins (TBPs) such as the TBP1 protein from Arabidopsis thaliana. Other transactivation domains may also be effective.
- Transgenes (heterologous genes to be transformed into a plant cell) will often be genes that direct the expression of a particular protein or polypeptide product, but they may also be non-expressible DNA segments, e.g., transposons that do not direct their own transposition. As used herein, an “expressible gene” is any gene that is capable of being transcribed into RNA (e.g., mRNA, antisense RNA, etc.) or translated into a protein, expressed as a trait of interest, or the like, etc., and is not limited to selectable, screenable or non-selectable marker genes. The invention also contemplates that, where both an expressible gene that is not necessarily a marker gene is employed in combination with a marker gene, one may employ the separate genes on either the same or different DNA segments for transformation. In the latter case, the different vectors are delivered concurrently to recipient cells to maximize cotransformation
- Any heterologous gene or nucleic acid that is desired to be expressed in a plant is suitable for the practice of the present invention. Heterologous genes to be transformed and expressed in the plants of the present invention include but are not limited to genes that encode resistance to diseases and insects, genes conferring nutritional value, genes conferring antifungal, antibacterial or antiviral activity, and the like. Alternatively, therapeutic (e.g., for veterinary or medical uses) or immunogenic (e.g., for vaccination) peptides and proteins can be expressed in plants transformed with the synthetic DNA LexA DNA binding domain of the present invention. Likewise, the transfer of any nucleic acid for controlling gene expression in a plant is contemplated as an aspect of the present invention. For example, the nucleic acid to be transferred can encode an antisense oligonucleotide. Alternately, plants may be transformed with one or more genes to reproduce enzymatic pathways for chemical synthesis or other industrial processes.
- In order to improve the ability to identify transformants, one may desire to employ a selectable or screenable marker gene as, or in addition to, the expressible gene of interest. “Marker genes” are genes that impart a distinct phenotype to cells expressing the marker gene and thus allow such transformed cells to be distinguished from cells that do not have the marker. Such genes may encode either a selectable or screenable marker, depending on whether the marker confers a trait which one can ‘select’ for by chemical means, i.e., through the use of a selective agent (e.g., a herbicide, antibiotic, or the like), or whether it is simply a trait that one can identify through observation or testing, i.e., by ‘screening’ (e.g., the R-locus trait). Of course, many examples of suitable marker genes are known to the art and can be employed in the practice of the invention. The selectable marker gene may be the only heterologous gene expressed by a transformed cell, or may be expressed in addition to another heterologous gene transformed into and expressed in the transformed cell. Selectable marker genes are utilized for the identification and selection of transformed cells or tissues. Selectable marker genes include genes encoding antibiotic resistance, such as those encoding neomycin phosphotransferase II (NEO) and hygromycin phosphotransferase (HPT), as well as genes conferring resistance to herbicidal compounds. Herbicide resistance genes generally code for a modified target protein insensitive to the herbicide or for an enzyme that degrades or detoxifies the herbicide in the plant before it can act. See, DeBlock et al., EMBO J. 6, 2513 (1987); DeBlock et al., Plant Physiol. 91, 691 (1989); Fromm et al.,
BioTechnology 8, 833 (1990); Gordon-Kamm et al.,Plant Cell 2, 603 (1990). For example, resistance to glyphosphate or sulfonylurea herbicides has been obtained using genes coding for the mutant target enzymes, 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) and acetolactate synthase (ALS). Resistance to glufosinate ammonium, boromoxynil, and 2,4-dichlorophenoxyacetate (2,4-D) have been obtained by using bacterial genes encoding phosphinothricin acetyltransferase, a nitrilase, or a 2,4-dichlorophenoxyacetate monooxygenase, which detoxify the respective herbicides. - Selectable marker genes include, but are not limited to, genes encoding: neomycin phosphotransferase II (Fraley et al., CRC Critical Reviews in
Plant Science 4, 1 (1986)); cyanamide hydratase (Maier-Greiner et al., Proc. Natl. Acad. Sci. USA 88, 4250 (1991)); aspartate kinase; dihydrodipicolinate synthase (Perl et al., BioTechnology 11, 715 (1993)); bar gene (Toki et al., Plant Physiol. 100, 1503 (1992); Meagher et al., Crop Sci. 36, 1367 (1996)); tryptophane decarboxylase (Goddijn et al., Plant Mol. Biol. 22, 907 (1993)); neomycin phosphotransferase (NEO; Southern et al., J. Mol. Appl. Gen. 1, 327 (1982)); hygromycin phosphotransferase (HPT or HYG; Shimizu et al., Mol. Cell. Biol. 6, 1074 (1986)); dihydrofolate reductase (DHFR); phosphinothricin acetyltransferase (DeBlock et al., EMBO J. 6, 2513 (1987)); 2,2-dichloropropionic acid dehalogenase (Buchanan-Wollatron et al., J. Cell. Biochem. 13D, 330 (1989)); acetohydroxyacid synthase (U.S. Pat. No. 4,761,373 to Anderson et al.; Haughn et al., Mol. Gen. Genet. 221, 266 (1988)); 5-enolpyruvyl-shikimate-phosphate synthase (aroA; Comai et al., Nature 317, 741 (1985)); haloarylnitrilase (WO 87/04181 to Stalker et al.); acetyl-coenzyme A carboxylase (Parker et al., Plant Physiol. 92, 1220 (1990)); dihydropteroate synthase (sulI; Guerineau et al., Plant Mol. Biol. 15, 127 (1990)); and 32 kDa photosystem II polypeptide (psbA; Hirschberg et al., Science 222, 1346 (1983)). - Also included are genes encoding resistance to: chloramphenicol (Herrera-Estrella et al., EMBO J. 2, 987 (1983)); methotrexate (Herrera-Estrella et al., Nature 303, 209 (1983); Meijer et al., Plant Mol. Biol. 16, 807 (1991)); hygromycin (Waldron et al., Plant Mol. Biol. 5, 103 (1985); Zhijian et al., Plant Science 108, 219 (1995); Meijer et al., Plant Mol. Bio. 16, 807 (1991)); streptomycin (Jones et al., Mol. Gen. Genet. 210, 86 (1987)); spectinomycin (Bretagne-Sagnard et al., Transgenic Res. 5, 131 (1996)); bleomycin (Hille et al., Plant Mol. Biol. 7, 171 (1986)); sulfonamide (Guerineau et al., Plant Mol. Bio. 15, 127 (1990); bromoxynil (Stalker et al., Science 242, 419 (1988)); 2,4-D (Streber et al., Bio/
Technology 7, 811 (1989)); phosphinothricin (DeBlock et al., EMBO J. 6, 2513 (1987)); spectinomycin (Bretagne-Sagnard and Chupeau,Transgenic Research 5, 131 (1996)). - The bar gene confers herbicide resistance to glufosinate-type herbicides, such as phosphinothricin (PPT) or bialaphos, and the like. As noted above, other selectable markers that could be used in the vector constructs include, but are not limited to, the pat gene, also for bialaphos and phosphinothricin resistance, the ALS gene for imidazolinone resistance, the HPH or HYG gene for hygromycin resistance, the EPSP synthase gene for glyphosate resistance, the Hm1 gene for resistance to the Hc-toxin, and other selective agents used routinely and known to one of ordinary skill in the art. See generally, Yarranton, Curr. Opin. Biotech. 3, 506 (1992); Chistopherson et al., Proc. Natl. Acad. Sci. USA 89, 6314 (1992); Yao et al., Cell 71, 63 (1992); Reznikoff, Mol. Microbiol. 6, 2419 (1992); Barkley, et al., The Operon 177-220 (1980); Hu et al., Cell 48, 555 (1987); Brown et al., Cell 49, 603 (1987); Figge et al., Cell 52, 713 (1988); Deuschle et al., Proc. Natl. Acad. Sci. USA 86, 5400 (1989); Fuerst et al., Proc. Natl. Acad. Sci. USA 86, 2549 (1989); Deuschle et al., Science 248, 480 (1990); Labow et al., Mol. Cell. Biol. 10, 3343 (1990); Zambretti et al., Proc. Natl. Acad. Sci. USA 89, 3952 (1992); Baim et al., Proc. Natl. Acad. Sci. USA 88, 5072 (1991); Wyborski et al., Nuc. Acids Res. 19, 4647 (1991); Hillenand-Wissman, Topics in Mol And Struc. Biol. 10, 143 (1989); Degenkolb et al., Antimicrob. Agents Chemother. 35, 1591 (1991); Kleinschnidt et al., Biochemistry 27, 1094 (1988); Gatz et al., Plant J. 2, 397 (1992); Gossen et al., Proc. Natl. Acad. Sci. USA 89, 5547 (1992); Oliva et al., Antimicrob. Agents Chemother. 36, 913 (1992); Hlavka et al., Handbook of Experimental Pharmacology 78, (1985); and Gill et al., Nature 334, 721 (1988). The disclosures described herein are incorporated by reference.
- The above list of selectable marker genes are not meant to be limiting. Any selectable marker gene can be used in the present invention.
- In view of the foregoing, it is apparent that one aspect of the present invention are transformed plant cells comprising the synthetic LexA DNA binding domain of the present invention. “Transformation”, as defined herein, describes a process by which heterologous nucleic acid enters and changes a recipient cell. It may occur under natural or artificial conditions using various methods well known in the art. Transformation may rely on any known method for the insertion of foreign nucleic acid sequences into a eukaryotic host cell. Such “transformed” cells include stably transformed cells in which the inserted DNA is capable of replication either as an autonomously replicating plasmid or as part of the host chromosome. They also include cells which transiently express the inserted DNA or RNA for limited periods of time.
- In a preferred embodiment of the invention, recipient cells for transformation are plant cells, more preferably dicot plant cells, even more preferably Arabidopsis species plant cells, and most preferably Arabidopsis thaliana plant cells. “Plant cells” as used herein includes plant cells in plant tissue or plant tissue and plant cells and protoplasts in culture. Plant tissue includes differentiated and undifferentiated tissues of plants, including but not limited to, roots, shoots, leaves, pollen, seeds, tumor tissue and various forms of cells in culture, such as single cells, protoplasts, embryos and callus tissue. The plant tissue may be in plant, or in organ, tissue or cell culture.
- The recombinant DNA molecule carrying the synthetic LexA DNA binding domain of the invention and optionally a structural gene under promoter control can be introduced into plant tissue by any means known to those skilled in the art. The technique used for a given plant species or specific type of plant tissue depends on the known successful techniques. As novel means are developed for the stable insertion of foreign genes into plant cells and for manipulating the modified cells, skilled artisans will be able to select from known means to achieve a desired result. Means for introducing recombinant DNA into plant tissue include, but are not limited to, direct DNA uptake (Paszkowski, J. et al. (1984) EMBO J. 3,2717), electroporation (Fromm, M., et al. Proc. Natl. Acad. Sci. USA 82,5824 (1985), microinjection (Crossway, A. et al. Mol. Gen. Genet. 202, 179 (1986)), or T-DNA mediated transfer from Agrobacterium tumefaciens to the plant tissue, which techniques are known in the art. There appears to be no fundamental limitation of T-DNA transformation to the natural host range of Agrobacterium. Representative T-DNA vector systems are described in the following references: An, G. et al. EMBO J. 4, 277 (1985); Herrera-Estrella, L. et al., Nature 303, 209 (1983); Herrera-Estrella, L. et al. EMBO J. 2, 987 (1983); Herrera-Estrella, L. et al. in Plant Genetic Engineering, New York: Cambridge University Press, p. 63 (1985). Once introduced into the plant tissue, the expression of the structural gene may be assayed by any means known to the art, and expression may be measured as mRNA transcribed or as protein synthesized, as provided herein.
- Transgenic plants comprising the synthetic LexA DNA binding domain of the present invention (as present, for example, in a DNA construct of the present invention, or a transformed cells of the present invention) are also an aspect of the present invention. Procedures for cultivating transformed cells to useful cultivars are known to those skilled in the art. Techniques are known for the in vitro culture of plant tissue, and in a number of cases, for regeneration into whole plants. A further aspect of the invention are plant tissue, plants or seeds containing the chimeric DNA sequences described above. Preferred are plant tissues, plants or seeds containing those chimeric DNA sequences which are mentioned as being preferred.
- The invention thus relates, in certain embodiments, to transgenic plants comprising the synthetic LexA DNA binding domain of the present invention. As used herein, the term “transgenic plants” is intended to refer to plants that have incorporated DNA sequences, including but not limited to genes which are perhaps not normally present, DNA sequences not normally transcribed into RNA or translated into a protein (“expressed” ), or any other genes or DNA sequences which one desires to introduce into the non-transformed plant, such as genes which may normally be present in the non-transformed plant but which one desires to either genetically engineer or to have altered expression. It is contemplated that in some instances the genome of transgenic plants of the present invention will have been augmented through the stable introduction of the transgene. However, in other instances, the introduced gene will replace an endogenous sequence.
- One example of a transgenic plant of the present invention may be prepared by the process of creating a translational fusion protein gene and introducing the fusion into a plant. One example of a plant to be transformed by the methods of the invention is a wild-type Arabidopsis. A specific illustration of the method of the invention would involve constructing gene fusions comprising the LexA DNA binding domain optimized for usage in Arabidopsis thaliana and a DNA segment encoding an exogenous protein one desires to express, and introducing the fusion into wild-type Arabidopsis.
- Alternatively, the gene introduced into the transformed plant line will comprise an exogenous protein gene operatively linked to its own promoter or another promoter that is active in plants. This gene will contain the cloned LexA DNA binding domain optimized for Arabidposis thaliana in a DNA construct for the purpose of increasing the expression of that exogenous protein gene.
- The transformed cells, identified by selection or screening and cultured in an appropriate medium that supports regeneration as provided herein, will then be allowed to mature into plants. Plants are preferably matured either in a growth chamber or greenhouse. Plants are regenerated from about 6 weeks to 10 months after a transformant is identified, depending on the initial tissue. During regeneration, cells are grown on solid media in tissue culture vessels. Illustrative embodiments of such vessels are petri dishes and Plant Con®s. After the regenerating plants have reached the stage of shoot and root development, they may be transferred to a greenhouse for further growth and testing. Progeny may be recovered from the transformed plants and tested for expression of the exogenous expressible gene by localized application of an appropriate substrate to plant parts such as leaves.
- The regenerated plants are screened for transformation by standard methods illustrated below. Progeny of the regenerated plants is continuously screened and selected for the continued presence of the integrated DNA sequence in order to develop improved plant and seed lines. The DNA sequence can be moved into other genetic lines by a variety of techniques, including classical breeding, protoplast fusion, nuclear transfer and chromosome transfer.
- After effecting delivery of heterologous DNA to recipient cells and plants by any of the methods discussed above, identifying the cells exhibiting successful or enhanced expression of a heterologous gene for further culturing and plant regeneration generally occurs. As mentioned above, in order to improve the ability to identify transformants, one may desire to employ a selectable or screenable marker gene as, or in addition to, the expressible gene of interest. In this case, one would then generally assay the potentially transformed cell population by exposing the cells to a selective agent or agents, or one would screen the cells for the desired marker gene.
- “Screening” generally refers to identifying the cells exhibiting expression of a heterologous gene that has been transformed into the plant. Usually, screening is carried out to select successfully transformed seeds (i.e., transgenic seeds) for further cultivation and plant generation (i.e., for the production of transgenic plants). As mentioned above, in order to improve the ability to identify transformants, one may desire to employ a selectable or screenable marker gene as, or in addition to, the heterologous gene of interest. In this case, one would then generally assay the potentially transformed cells, seeds or plants by exposing the cells, seeds, plants, or seedlings to a selective agent or agents, or one would screen the cells, seeds, plants or tissues of the plants for the desired marker gene. For example, transgenic cells, seeds or plants may be screened under selective conditions, such as by growing the seeds or seedlings on media containing selective agents, such as antibiotics (e.g., hygromycin, kanamycin, paromomycin or BASTA®), the successfully transformed plants having been transformed with genes encoding resistance to such selective agents.
- To additionally confirm the presence of the heterologous nucleic acid or “transgene(s)” in the seeds of the cultivated plant or the in the regenerated plants produced from those seeds, a variety of assays may be performed. Such assays include, for example, molecular biological assays, such as Southern and Northern blotting and PCR; biochemical assays, such as detecting the presence of a protein product, e.g., by immunological means (ELISAs and Western blots) or by enzymatic function; by plant part assays, such as leaf or root assays; and also, by analyzing the phenotype of the whole regenerated plant.
- While Southern blotting and PCR may be used to detect the gene(s) in question, they do not provide information as to whether the gene is being expressed. Expression of the heterologous gene may be evaluated by specifically identifying the protein products of the introduced genes or evaluating the phenotypic changes brought about by their expression.
- Assays for the production and identification of specific proteins may make use of physical-chemical, structural, functional, or other properties of the proteins. Unique physical-chemical or structural properties allow the proteins to be separated and identified by electrophoretic procedures, such as native or denaturing gel electrophoresis or isoelectric focusing, or by chromatographic techniques such as ion exchange or gel exclusion chromatography. The unique structures of individual proteins offer opportunities for use of specific antibodies to detect their presence in formats such as an ELISA assay. Combinations of approaches may be employed with even greater specificity such as Western blotting in which antibodies are used to locate individual gene products that have been separated by electrophoretic techniques. Additional techniques may be employed to absolutely confirm the identity of the product of interest such as evaluation by amino acid sequencing following purification. Although these techniques are among the most commonly employed, other procedures are known in the art and may be additionally used.
- The following Examples are provided to illustrate the present invention, and should not be construed as limiting thereof.
- The 86-amino acid residues of the E. Coli DNA-binding domain of LexA were translated back to a sequence of nucleic acids, substituting the E. Coli codon with the codons that are most frequently utilized in Arabidopsis thaliana, according to K. Wada, et al., Nucleic Acids Res. 19, 1981-1986 (1991).
- The result of the translation back was a DNA sequence of 258 nucleic acid residues. The sequence representing one strand of a double-stranded DNA encoding the LexA DNA binding domain was divided into three segments of 84, 90 and 84 nucleotides, respectively. The DNA sequence of 258 nucleic acid residues representing the other (i.e., complementary) strand of the double-stranded DNA encoding the LexA DNA binding domain was divided into four segments of 51, 81, 81 and 45 nucleic acid residues.
The DNA sequences of the three segments of the first strand were as follows: AGCTTCATATGAAGGCTCTTACCGCTAGACAGCAGGAGGTTTTCGATCTTATC [SEQ ID NO:3] AGAGATCACATCTCTCAGACCGGAATGCCACCAACCAGA GCTGAGATCGCTCAGAGACTTGGATTCAGATCTCCAAACGCTGCTGAGGAGC [SEQ ID NO:4] ACCTTAAGGCTCTTGCTAGAAAGGGAGTTATCGAGATC GTTTCTGGAGCTTCTAGAGGAATCAGACTTCTTCAGGAGGAGGAGGAGGGA [SEQ ID NO:5] CTTCCACTTGTTGGAAGAGTTGCTGCTGGAGAGG The DNA sequences of the four segments of the second strand were as follows: ATCTCTGATAAGATCGAAAACCTGCTGCTGTCTAGCGGTAAGAGCCTTCATATGA [SEQ ID NO:6] CTCAGCAGCGTTTGGAGATCTGAATCCAAGTCTCTGAGCGATCTCAGCTCTG [SEQ ID NO:7] GTTGGTGGCATTCCGGTCTGAGAGATGTG CTCCTGAAGAAGTCTGATTCCTCTAGAAGCTCCAGAAACGATCTCGATAACTC [SEQ ID NO:8] CCTTTCTAGCAAGAGCCTTAAGGTGCTC GATCCCTCTCCAGCAGCAACTCTTCCAACAAGTGGAAGTCCCTGCTCCTC [SEQ ID NO:9] - The seven sequences described above were chemically synthesized as DNA oligomers on a modified ABI 391 machine, using standard β-cyano-ethyl chemistry, each oligomer being phosphorylated at their 5′ ends. Four of the oligomers also had additional nucleotides added to their ends to create restrictions sites (“sticky ends”) for the purpose of ligating an annealed DNA product into a cloning vector that has been digested with the restriction enzymes HindIII and BamHI.
- The seven oligomers were then annealed together. For annealing of the seven oligomers with each other, the desiccated oligomers were dissolved at high concentration (10 OD 260 units/100 μl) in STE buffer (50 mM NaCl, 10 mM Tris pH 8.0, 1 mM EDTA). Equal volumes (12 μl each) were mixed in a 1.5 ml centrifuge tube and heated in a water bath to 94° C., then slowly cooled down to room temperature over about 3 hours by “unplugging” the water bath (i.e., by allowing the water bath to reach room temperature naturally). The DNA was precipitated by adding 0.5 volumes (42 μl) of 7.5 M ammonium acetate and 0.03 volumes of 1 M MgCl2 and 2.5 volumes of ethanol with mixing, followed by incubation at 4° C. for 12 hours, and centrifugation for 15 minutes at room temperature (RT) at 14,000 rpm. The DNA pellet was washed once with 500 μl of 70% ethanol, then air-dried and resuspended in 100 μl TE (10 mM Tris pH 8.0, 1 mM EDTA).
- Annealing of the seven oligonucleotides yielded the artificial gene sequence provided above as SEQ ID NO: 1, which translate into the artificial protein (amino acid sequence) set forth above as SEQ ID NO: 2. The annealed DNA was then used for cloning into HindIII and BamHI sites of the plasmid pUCNLSTBP1 (SEQ ID NO: 11), illustrated in FIG. 3.
- The foregoing is illustrative of the present invention and is not to be construed as limiting thereof. The invention is defined by the following claims, with equivalents of the claims to be included therein.
-
1 11 1 258 DNA Artificial Sequence LexA DNA binding domain of E. coli using Arabidopsis-optimized codons 1 atg aag gct ctt acc gct aga cag cag gag gtt ttc gat ctt atc aga 48 Met Lys Ala Leu Thr Ala Arg Gln Gln Glu Val Phe Asp Leu Ile Arg 1 5 10 15 gat cac atc tct cag acc gga atg cca cca acc aga gct gag atc gct 96 Asp His Ile Ser Gln Thr Gly Met Pro Pro Thr Arg Ala Glu Ile Ala 20 25 30 cag aga ctt gga ttc aga tct cca aac gct gct gag gag cac ctt aag 144 Gln Arg Leu Gly Phe Arg Ser Pro Asn Ala Ala Glu Glu His Leu Lys 35 40 45 gct ctt gct aga aag gga gtt atc gag atc gtt tct gga gct tct aga 192 Ala Leu Ala Arg Lys Gly Val Ile Glu Ile Val Ser Gly Ala Ser Arg 50 55 60 gga atc aga ctt ctt cag gag gag gag gag gga ctt cca ctt gtt gga 240 Gly Ile Arg Leu Leu Gln Glu Glu Glu Glu Gly Leu Pro Leu Val Gly 65 70 75 80 aga gtt gct gct gga gag 258 Arg Val Ala Ala Gly Glu 85 2 86 PRT Artificial Sequence LexA DNA binding domain protein encoded by E. coli, containing codons optimized for Arabidopsis 2 Met Lys Ala Leu Thr Ala Arg Gln Gln Glu Val Phe Asp Leu Ile Arg 1 5 10 15 Asp His Ile Ser Gln Thr Gly Met Pro Pro Thr Arg Ala Glu Ile Ala 20 25 30 Gln Arg Leu Gly Phe Arg Ser Pro Asn Ala Ala Glu Glu His Leu Lys 35 40 45 Ala Leu Ala Arg Lys Gly Val Ile Glu Ile Val Ser Gly Ala Ser Arg 50 55 60 Gly Ile Arg Leu Leu Gln Glu Glu Glu Glu Gly Leu Pro Leu Val Gly 65 70 75 80 Arg Val Ala Ala Gly Glu 85 3 92 DNA E. coli 3 agcttcatat gaaggctctt accgctagac agcaggaggt tttcgatctt atcagagatc 60 acatctctca gaccggaatg ccaccaacca ga 92 4 90 DNA E. coli 4 gctgagatcg ctcagagact tggattcaga tctccaaacg ctgctgagga gcaccttaag 60 gctcttgcta gaaagggagt tatcgagatc 90 5 85 DNA E. coli 5 gtttctggag cttctagagg aatcagactt cttcaggagg aggaggaggg acttccactt 60 gttggaagag ttgctgctgg agagg 85 6 55 DNA E. coli 6 atctctgata agatcgaaaa cctcctgctg tctagcggta agagccttca tatga 55 7 81 DNA E. coli 7 ctcagcagcg tttggagatc tgaatccaag tctctgagcg atctcagctc tggttggtgg 60 cattccggtc tgagagatgt g 81 8 81 DNA E. coli 8 ctcctgaaga agtctgattc ctctagaagc tccagaaacg atctcgataa ctccctttct 60 agcaagagcc ttaaggtgct c 81 9 51 DNA E. coli 9 gatccctctc cagcagcaac tcttccaaca agtggaagtc cctccctcct c 51 10 20 DNA E. coli 10 catactgtat gagcatacag 20 11 3307 DNA Artificial Sequence cloning vector 11 gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60 cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120 tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180 aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240 ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300 ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360 tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420 tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480 actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540 gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600 acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660 gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720 acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780 gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840 ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900 gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960 cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020 agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080 catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140 tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200 cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260 gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320 taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 1380 ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440 tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500 ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560 cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620 agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980 cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040 cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100 acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160 cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220 accatgatta cgccaagctt atcgatcgga tccgctcttg ctccaaagaa gaagagaaag 2280 gttgctcttg ctggtaccat gactgatcaa ggattggaag ggagtaatcc agttgatctt 2340 agcaagcatc cttcagggat tgttcctact cttcaaaaca ttgtctccac ggtgaactta 2400 gactgcaagc tagatcttaa agccatagct ttgcaggctc ggaatgctga atataatccc 2460 aagcgttttg ctgcggtgat aatgaggatc agagaaccga agactacagc attaatattc 2520 gcctcaggga aaatggtctg tactggagct aagagcgagg acttttcgaa gatggctgct 2580 agaaagtatg ctaggattgt gcagaaattg ggattccctg caaaattcaa ggatttcaag 2640 attcagaata ttgtaggttc ttgtgatgtc aaattcccta taagacttga aggtcttgct 2700 tactctcacg ctgctttctc aagttatgag cccgagctct tcccagggct gatttatagg 2760 atgaaagtcc caaaaatcgt ccttctaatc tttgtctctg ggaagatcgt aataacagga 2820 gccaagatga gagatgagac ctacaaagcc tttgagaata tataccccgt gctctcggaa 2880 ttcagaaaga tacagcaata gcctaggaat tcactggccg tcgttttaca acgtcgtgac 2940 tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc 3000 tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat 3060 ggcgaatggc gcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc 3120 atatggtgca ctctcagtac aatctgctct gatgccgcat agttaagcca gccccgacac 3180 ccgccaacac ccgctgacgc gccctgacgg gcttgtctgc tcccggcatc cgcttacaga 3240 caagctgtga ccgtctccgg gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa 3300 cgcgcga 3307
Claims (20)
1. A synthetic nucleotide sequence encoding the LexA DNA binding domain, comprising at least one codon optimized for usage by an Arabidopsis species.
2. A synthetic nucleotide sequence according to claim 1 , wherein over about 50% of the codons of the sequence are optimized for usage by an Arabidopsis species.
3. A synthetic nucleotide sequence according to claim 1 , wherein over about 80% of the codons of the sequence are optimized for usage by an Arabidopsis species.
4. A synthetic nucleotide sequence according to claim 1 , wherein the nucleotide sequence has the sequence SEQ ID NO: 1.
5. A synthetic nucleotide sequence according to claim 1 , wherein the synthetic nucleotide sequence is optimized for usage by Arabidopsis thaliana.
6. A synthetic LexA DNA binding domain protein codon optimized for usage by an Arabidopsis species.
7. A synthetic LexA DNA binding domain protein according to claim 6 , wherein the synthetic LexA DNA binding domain protein is codon optimized for usage by Arabidopsis thaliana.
8. A synthetic LexA DNA binding domain protein according to claim 6 that has the amino acid sequence SEQ ID NO: 2.
9. A DNA construct comprising the synthetic nucleic acid sequence of claim 1 .
10. A DNA construct according to claim 9 , wherein the synthetic nucleic acid sequence has the sequence of SEQ ID NO: 1.
11. A DNA construct according to claim 9 , further comprising a heterologous nucleic acid sequence.
12. An eukaryotic cell comprising the DNA construct of claim 10 .
13. An eukaryotic cell comprising the DNA construct of claim 11 .
14. An eukaryotic cell according to claim 12 , wherein the eukaryotic cell is a plant cell.
15. An eukaryotic cell according to claim 14 , wherein the plant cell is a dicot cell.
16. An eukaryotic cell according to claim 15 , wherein the dicot cell is an Arabidopsis thaliana cell.
17. A transgenic plant comprising a cell of claim 13 .
18. A transgenic plant according to claim 17 , wherein the plant is a dicot.
19. A transgenic plant according to claim 18 , wherein the plant is an Arabidopsis thaliana plant.
20. A transgenic seed produced by the plant of claim 17.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/160,508 US20030145345A1 (en) | 1999-09-22 | 2002-06-03 | LexA DNA binding domain optimized for arabidopsis species |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US09/401,171 US6399857B1 (en) | 1999-09-22 | 1999-09-22 | Modified nucleotide sequence encoding a LexA DNA binding domain optimized for arabidopsis species |
| US10/160,508 US20030145345A1 (en) | 1999-09-22 | 2002-06-03 | LexA DNA binding domain optimized for arabidopsis species |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US09/401,171 Division US6399857B1 (en) | 1999-09-22 | 1999-09-22 | Modified nucleotide sequence encoding a LexA DNA binding domain optimized for arabidopsis species |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20030145345A1 true US20030145345A1 (en) | 2003-07-31 |
Family
ID=23586626
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US09/401,171 Expired - Lifetime US6399857B1 (en) | 1999-09-22 | 1999-09-22 | Modified nucleotide sequence encoding a LexA DNA binding domain optimized for arabidopsis species |
| US10/160,508 Abandoned US20030145345A1 (en) | 1999-09-22 | 2002-06-03 | LexA DNA binding domain optimized for arabidopsis species |
Family Applications Before (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US09/401,171 Expired - Lifetime US6399857B1 (en) | 1999-09-22 | 1999-09-22 | Modified nucleotide sequence encoding a LexA DNA binding domain optimized for arabidopsis species |
Country Status (1)
| Country | Link |
|---|---|
| US (2) | US6399857B1 (en) |
Families Citing this family (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7825233B2 (en) * | 2004-06-30 | 2010-11-02 | Allergan, Inc. | Optimizing expression of active Botulinum Toxin type E |
| US20080057575A1 (en) * | 2004-08-04 | 2008-03-06 | Allergan, Inc. | Optimizing Expression of Active Botulinum Toxin Type A |
| AU2007325838B2 (en) | 2006-11-22 | 2013-09-19 | Bristol-Myers Squibb Company | Targeted therapeutics based on engineered proteins for tyrosine kinases receptors, including IGF-IR |
| MX2010012963A (en) * | 2008-05-30 | 2011-04-05 | Xbiotech Inc | Il-1î± abs and methods of use. |
| TWI496582B (en) | 2008-11-24 | 2015-08-21 | 必治妥美雅史谷比公司 | Bispecific egfr/igfir binding molecules |
| PL2580236T3 (en) * | 2010-06-08 | 2019-09-30 | Pieris Pharmaceuticals Gmbh | Tear lipocalin muteins binding il-4 r alpha |
| CA2825986C (en) | 2010-06-18 | 2019-10-29 | Xbiotech Inc. | Arthritis treatment |
| CN104540851B (en) * | 2012-03-29 | 2017-09-15 | 诺夫免疫股份有限公司 | Anti-TLR4 antibodies and uses thereof |
| EP2935330B1 (en) | 2012-12-19 | 2019-04-24 | Aveo Pharmaceuticals, Inc. | Anti-notch3 antibodies |
| JP6706578B2 (en) * | 2013-12-30 | 2020-06-10 | エピムアブ バイオセラピューティクス インコーポレイテッド | Tandem Fab immunoglobulins and uses thereof |
| WO2015164815A1 (en) | 2014-04-24 | 2015-10-29 | The Board Of Trustees Of The Leland Stanford Junior University | Superagonists, partial agonists and antagonists of interleukin-2 |
| US9101100B1 (en) | 2014-04-30 | 2015-08-11 | Ceres, Inc. | Methods and materials for high throughput testing of transgene combinations |
| DK3448420T3 (en) | 2016-04-29 | 2022-12-12 | Aveo Pharmaceuticals Inc | ANTI-NOTCH3 ANTIBODY |
| JP7194104B2 (en) * | 2016-10-26 | 2022-12-21 | シーダーズ―シナイ メディカル センター | Neutralizing anti-TL1A monoclonal antibody |
| BR112019012691A2 (en) * | 2016-12-19 | 2019-11-19 | Fundacio Privada Inst Catalana De Recerca I Estudis Avancats | antibodies against lif and its uses |
| WO2018234862A1 (en) * | 2017-06-19 | 2018-12-27 | Medicenna Therapeutics Inc. | USES AND METHODS FOR IL-2 SUPERAGONISTS AND AGONISTS AND FUSIONS THEREOF |
| CN112585165B8 (en) * | 2018-04-25 | 2025-02-14 | 普罗米修斯生物科学公司 | Optimized anti-TL1A antibody |
| EP3986920B1 (en) * | 2019-06-24 | 2025-04-16 | Universität Stuttgart | Tnfr2 agonists with improved stability |
| CA3154649A1 (en) * | 2019-09-16 | 2021-03-25 | Opsidio, LLC | Anti-stem cell factor antibodies and methods of use thereof in renal disease |
| KR20220103721A (en) | 2019-10-24 | 2022-07-22 | 프로메테우스 바이오사이언시즈, 인크. | Humanized Antibodies to TNF-Like Ligand 1A (TL1A) and Uses Thereof |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5380831A (en) | 1986-04-04 | 1995-01-10 | Mycogen Plant Science, Inc. | Synthetic insecticidal crystal protein gene |
| JP3364616B2 (en) | 1989-02-24 | 2003-01-08 | モンサント テクノロジー エルエルシー | Synthetic plant genes and preparation methods |
| US5633446A (en) | 1990-04-18 | 1997-05-27 | Plant Genetic Systems, N.V. | Modified Bacillus thuringiensis insecticidal-crystal protein genes and their expression in plant cells |
| TW261517B (en) | 1991-11-29 | 1995-11-01 | Mitsubishi Shozi Kk | |
| US5689052A (en) | 1993-12-22 | 1997-11-18 | Monsanto Company | Synthetic DNA sequences having enhanced expression in monocotyledonous plants and method for preparation thereof |
-
1999
- 1999-09-22 US US09/401,171 patent/US6399857B1/en not_active Expired - Lifetime
-
2002
- 2002-06-03 US US10/160,508 patent/US20030145345A1/en not_active Abandoned
Also Published As
| Publication number | Publication date |
|---|---|
| US6399857B1 (en) | 2002-06-04 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20030145345A1 (en) | LexA DNA binding domain optimized for arabidopsis species | |
| CN102146371B (en) | High glyphosate resistant variant gene and improvement method and application of high glyphosate resistant variant gene | |
| CN112375748B (en) | Novel coronavirus chimeric recombinant vaccine based on vesicular stomatitis virus vector and its preparation method and application | |
| CN108949721B (en) | Recombinant strain expressing phospholipase D and its application | |
| CA2350398C (en) | Chimeric promoters capable of mediating gene expression in plants upon pathogen infection and uses thereof | |
| CN112941038B (en) | Novel recombinant coronavirus based on vesicular stomatitis virus vector, and preparation method and application thereof | |
| CN112251464B (en) | Gene point mutation induction method | |
| CN111748546B (en) | Fusion protein for generating gene point mutation and induction method of gene point mutation | |
| CN111139261B (en) | Method for reducing polyphenol oxidase content of wheat grains by using gene editing | |
| CN107058367A (en) | The construction method of the recombined bacillus subtilis of high yield Pullulanase | |
| WO2021110993A1 (en) | An efficient shuttle vector system for the expression of heterologous and homologous proteins for the genus zymomonas | |
| CN111926009B (en) | Method for improving rice grain traits by blocking or weakening rice OsMIR394 gene expression | |
| CN111118060B (en) | BnALS1 mutant gene and protein based on gene editing and its application | |
| CN113481114B (en) | An explosive visualization biosensor based on yeast cell surface display technology and its preparation method and application | |
| CN112250743A (en) | Wheat drought stress related protein TaWRKY-A and coding gene and application thereof | |
| CN109880837B (en) | A kind of method for degrading tobacco straw lignin | |
| CN1335891A (en) | Clean synthetic vectors, plasmids, transgenic plants and plant parts containing them, and methods for obtaining them | |
| KR101443937B1 (en) | Production method of TRACP5b | |
| CN113774047B (en) | Fish source protease gene and application thereof | |
| US7179957B1 (en) | Method for altering nitrogen or oil content of seeds by down regulating AGL11 expression or activity | |
| CN108504656A (en) | Promoter enhances system and its application in bacterium | |
| AU758623B2 (en) | Kinase wee1 fusion protein compositions, nucleotide sequences, expression systems, and methods of use | |
| CN114989282B (en) | A rice mutant protein resistant to dinitroaniline herbicides and its application | |
| EP1242605B1 (en) | Artificial promoters for protein expression in plants | |
| CN108410898A (en) | A kind of artificial reconstructed DNA molecular, plasmid vector and preparation method thereof |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: PARADIGM GENETICS, INC., NORTH CAROLINA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:KLOTI, ANDREAS;REEL/FRAME:012970/0423 Effective date: 19990922 |
|
| AS | Assignment |
Owner name: ICORIA, INC., NORTH CAROLINA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:PARADIGM GENETICS, INC.;REEL/FRAME:015065/0876 Effective date: 20040417 |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |