US20160116457A1 - Transmembrane proteins as biosensors - Google Patents
Transmembrane proteins as biosensors Download PDFInfo
- Publication number
- US20160116457A1 US20160116457A1 US14/930,395 US201514930395A US2016116457A1 US 20160116457 A1 US20160116457 A1 US 20160116457A1 US 201514930395 A US201514930395 A US 201514930395A US 2016116457 A1 US2016116457 A1 US 2016116457A1
- Authority
- US
- United States
- Prior art keywords
- protein
- engineered
- proteins
- transporter protein
- fluorescent
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 108091005703 transmembrane proteins Proteins 0.000 title description 2
- 102000035160 transmembrane proteins Human genes 0.000 title description 2
- 108010078791 Carrier Proteins Proteins 0.000 claims abstract description 112
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 75
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 48
- 229920001184 polypeptide Polymers 0.000 claims abstract description 42
- 210000004027 cell Anatomy 0.000 claims description 84
- 150000007523 nucleic acids Chemical class 0.000 claims description 58
- 102000039446 nucleic acids Human genes 0.000 claims description 55
- 108020004707 nucleic acids Proteins 0.000 claims description 55
- 238000000034 method Methods 0.000 claims description 42
- 239000013598 vector Substances 0.000 claims description 40
- 150000001413 amino acids Chemical class 0.000 claims description 36
- 239000012491 analyte Substances 0.000 claims description 25
- 108091006047 fluorescent proteins Proteins 0.000 claims description 25
- 102000034287 fluorescent proteins Human genes 0.000 claims description 25
- 230000008859 change Effects 0.000 claims description 20
- 239000005090 green fluorescent protein Substances 0.000 claims description 15
- 210000000170 cell membrane Anatomy 0.000 claims description 10
- 239000012528 membrane Substances 0.000 claims description 8
- 108010043121 Green Fluorescent Proteins Proteins 0.000 claims description 6
- 102000004144 Green Fluorescent Proteins Human genes 0.000 claims description 6
- 238000012544 monitoring process Methods 0.000 claims description 6
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 5
- 210000004102 animal cell Anatomy 0.000 claims 1
- 230000004941 influx Effects 0.000 claims 1
- 108090000623 proteins and genes Proteins 0.000 description 144
- 102000004169 proteins and genes Human genes 0.000 description 125
- 235000018102 proteins Nutrition 0.000 description 119
- 241000196324 Embryophyta Species 0.000 description 37
- 235000001014 amino acid Nutrition 0.000 description 36
- 230000004044 response Effects 0.000 description 35
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonia chloride Chemical compound [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 description 34
- QGZKDVFQNNGYKY-UHFFFAOYSA-O Ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 33
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 33
- 125000003275 alpha amino acid group Chemical group 0.000 description 33
- 230000032258 transport Effects 0.000 description 30
- 238000007792 addition Methods 0.000 description 27
- 230000014509 gene expression Effects 0.000 description 23
- 238000003780 insertion Methods 0.000 description 22
- 230000037431 insertion Effects 0.000 description 22
- 102100039338 Aminomethyltransferase, mitochondrial Human genes 0.000 description 21
- 230000009261 transgenic effect Effects 0.000 description 21
- 150000001875 compounds Chemical class 0.000 description 20
- 230000000875 corresponding effect Effects 0.000 description 20
- 239000012099 Alexa Fluor family Substances 0.000 description 19
- 230000000694 effects Effects 0.000 description 18
- 230000012010 growth Effects 0.000 description 18
- AUWFXYNRJHALTA-CCMAZBEPSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1h-indol-3-yl)propanoyl]amino]-3-(1h-indol-3-yl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amin Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)N)C(O)=O)C1=CC=CC=C1 AUWFXYNRJHALTA-CCMAZBEPSA-N 0.000 description 17
- 101100482664 Arabidopsis thaliana ASA1 gene Proteins 0.000 description 17
- 101100216036 Oryza sativa subsp. japonica AMT1-1 gene Proteins 0.000 description 17
- 101100076556 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) MEP1 gene Proteins 0.000 description 17
- 101150077112 amt1 gene Proteins 0.000 description 17
- 238000002866 fluorescence resonance energy transfer Methods 0.000 description 16
- 239000002773 nucleotide Substances 0.000 description 16
- 125000003729 nucleotide group Chemical group 0.000 description 16
- 230000004927 fusion Effects 0.000 description 15
- 108020001507 fusion proteins Proteins 0.000 description 15
- 102000037865 fusion proteins Human genes 0.000 description 15
- 238000009396 hybridization Methods 0.000 description 15
- 230000035772 mutation Effects 0.000 description 15
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 14
- 230000037430 deletion Effects 0.000 description 14
- 238000012217 deletion Methods 0.000 description 14
- 210000005253 yeast cell Anatomy 0.000 description 14
- 108020004414 DNA Proteins 0.000 description 13
- 108091028043 Nucleic acid sequence Proteins 0.000 description 13
- 125000000539 amino acid group Chemical group 0.000 description 13
- 102000040430 polynucleotide Human genes 0.000 description 13
- 108091033319 polynucleotide Proteins 0.000 description 13
- 239000002157 polynucleotide Substances 0.000 description 13
- 241001465754 Metazoa Species 0.000 description 12
- 230000005284 excitation Effects 0.000 description 12
- 239000000523 sample Substances 0.000 description 12
- 238000006467 substitution reaction Methods 0.000 description 12
- -1 6-amino-2,3-dihydro-2-(2-((iodoacetyl)amino)ethyl)-1,3-dioxo-1H-benz(de)isoquinoline-5,8-disulfonic acid salt Chemical class 0.000 description 11
- 238000001727 in vivo Methods 0.000 description 11
- 239000004475 Arginine Substances 0.000 description 10
- 239000007987 MES buffer Substances 0.000 description 10
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 10
- 239000013604 expression vector Substances 0.000 description 10
- 239000007787 solid Substances 0.000 description 10
- 210000001519 tissue Anatomy 0.000 description 10
- 108050001492 Ammonium transporters Proteins 0.000 description 9
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 9
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 9
- 239000011575 calcium Substances 0.000 description 9
- 229910052791 calcium Inorganic materials 0.000 description 9
- 238000005259 measurement Methods 0.000 description 9
- 125000006853 reporter group Chemical group 0.000 description 9
- 230000003595 spectral effect Effects 0.000 description 9
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 230000033001 locomotion Effects 0.000 description 8
- 239000000758 substrate Substances 0.000 description 8
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 8
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 7
- 238000010521 absorption reaction Methods 0.000 description 7
- 239000000872 buffer Substances 0.000 description 7
- 229910052757 nitrogen Inorganic materials 0.000 description 7
- 238000012360 testing method Methods 0.000 description 7
- 238000012546 transfer Methods 0.000 description 7
- 238000005406 washing Methods 0.000 description 7
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 6
- 241000893512 Aquifex aeolicus Species 0.000 description 6
- 241000219194 Arabidopsis Species 0.000 description 6
- 244000062793 Sorghum vulgare Species 0.000 description 6
- 230000000295 complement effect Effects 0.000 description 6
- 230000001086 cytosolic effect Effects 0.000 description 6
- 230000001419 dependent effect Effects 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- BBEAQIROQSPTKN-UHFFFAOYSA-N pyrene Chemical compound C1=CC=C2C=CC3=CC=CC4=CC=C1C2=C43 BBEAQIROQSPTKN-UHFFFAOYSA-N 0.000 description 6
- 241000588724 Escherichia coli Species 0.000 description 5
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 5
- 125000003118 aryl group Chemical group 0.000 description 5
- 238000012512 characterization method Methods 0.000 description 5
- 239000013078 crystal Substances 0.000 description 5
- 108010082025 cyan fluorescent protein Proteins 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 238000000338 in vitro Methods 0.000 description 5
- QDLAGTHXVHQKRE-UHFFFAOYSA-N lichenxanthone Natural products COC1=CC(O)=C2C(=O)C3=C(C)C=C(OC)C=C3OC2=C1 QDLAGTHXVHQKRE-UHFFFAOYSA-N 0.000 description 5
- 150000002632 lipids Chemical class 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 238000002864 sequence alignment Methods 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- 150000003732 xanthenes Chemical class 0.000 description 5
- HMWAJFNEGAJETK-UHFFFAOYSA-N 1-[6-(dimethylamino)naphthalen-2-yl]prop-2-en-1-one Chemical compound C1=C(C(=O)C=C)C=CC2=CC(N(C)C)=CC=C21 HMWAJFNEGAJETK-UHFFFAOYSA-N 0.000 description 4
- YMHOBZXQZVXHBM-UHFFFAOYSA-N 2,5-dimethoxy-4-bromophenethylamine Chemical compound COC1=CC(CCN)=C(OC)C=C1Br YMHOBZXQZVXHBM-UHFFFAOYSA-N 0.000 description 4
- GJCOSYZMQJWQCA-UHFFFAOYSA-N 9H-xanthene Chemical compound C1=CC=C2CC3=CC=CC=C3OC2=C1 GJCOSYZMQJWQCA-UHFFFAOYSA-N 0.000 description 4
- 210000003311 CFU-EM Anatomy 0.000 description 4
- 101100001642 Caenorhabditis elegans amt-1 gene Proteins 0.000 description 4
- 102000000584 Calmodulin Human genes 0.000 description 4
- 108010041952 Calmodulin Proteins 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- 108010070675 Glutathione transferase Proteins 0.000 description 4
- 102000005720 Glutathione transferase Human genes 0.000 description 4
- 241000238631 Hexapoda Species 0.000 description 4
- 102100023072 Neurolysin, mitochondrial Human genes 0.000 description 4
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 4
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 4
- 238000010459 TALEN Methods 0.000 description 4
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 4
- 241000209140 Triticum Species 0.000 description 4
- 235000021307 Triticum Nutrition 0.000 description 4
- 241000545067 Venus Species 0.000 description 4
- 150000001720 carbohydrates Chemical class 0.000 description 4
- 235000014633 carbohydrates Nutrition 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- ZYGHJZDHTFUPRJ-UHFFFAOYSA-N coumarin Chemical compound C1=CC=C2OC(=O)C=CC2=C1 ZYGHJZDHTFUPRJ-UHFFFAOYSA-N 0.000 description 4
- 150000004775 coumarins Chemical class 0.000 description 4
- 229940079593 drug Drugs 0.000 description 4
- 239000003814 drug Substances 0.000 description 4
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 4
- 210000004209 hair Anatomy 0.000 description 4
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 4
- 230000003993 interaction Effects 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 230000001404 mediated effect Effects 0.000 description 4
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 239000011148 porous material Substances 0.000 description 4
- 235000021251 pulses Nutrition 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 150000003839 salts Chemical class 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 239000002904 solvent Substances 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 230000014616 translation Effects 0.000 description 4
- 238000011282 treatment Methods 0.000 description 4
- 108091005957 yellow fluorescent proteins Proteins 0.000 description 4
- QGKMIGUHVLGJBR-UHFFFAOYSA-M (4z)-1-(3-methylbutyl)-4-[[1-(3-methylbutyl)quinolin-1-ium-4-yl]methylidene]quinoline;iodide Chemical compound [I-].C12=CC=CC=C2N(CCC(C)C)C=CC1=CC1=CC=[N+](CCC(C)C)C2=CC=CC=C12 QGKMIGUHVLGJBR-UHFFFAOYSA-M 0.000 description 3
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 3
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 3
- 244000075850 Avena orientalis Species 0.000 description 3
- 240000002791 Brassica napus Species 0.000 description 3
- 235000006008 Brassica napus var napus Nutrition 0.000 description 3
- 108700010706 GCaMP2 Proteins 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- 244000020551 Helianthus annuus Species 0.000 description 3
- 235000003222 Helianthus annuus Nutrition 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 3
- 241000209056 Secale Species 0.000 description 3
- 102000003673 Symporters Human genes 0.000 description 3
- 108090000088 Symporters Proteins 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical compound C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 3
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 3
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 235000001671 coumarin Nutrition 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 239000000539 dimer Substances 0.000 description 3
- 239000000975 dye Substances 0.000 description 3
- 235000013601 eggs Nutrition 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 3
- GVEPBJHOBDJJJI-UHFFFAOYSA-N fluoranthrene Natural products C1=CC(C2=CC=CC=C22)=C3C2=CC=CC3=C1 GVEPBJHOBDJJJI-UHFFFAOYSA-N 0.000 description 3
- 230000004907 flux Effects 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- 230000006801 homologous recombination Effects 0.000 description 3
- 238000002744 homologous recombination Methods 0.000 description 3
- 229910052739 hydrogen Inorganic materials 0.000 description 3
- 239000001257 hydrogen Substances 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 150000002500 ions Chemical class 0.000 description 3
- 238000002372 labelling Methods 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 238000002703 mutagenesis Methods 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 230000010412 perfusion Effects 0.000 description 3
- 108060006184 phycobiliprotein Proteins 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 210000001236 prokaryotic cell Anatomy 0.000 description 3
- 230000005588 protonation Effects 0.000 description 3
- 108010054624 red fluorescent protein Proteins 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 230000005945 translocation Effects 0.000 description 3
- 241000701447 unidentified baculovirus Species 0.000 description 3
- IANQTJSKSUMEQM-UHFFFAOYSA-N 1-benzofuran Chemical compound C1=CC=C2OC=CC2=C1 IANQTJSKSUMEQM-UHFFFAOYSA-N 0.000 description 2
- BEDDXHFONHIYPB-UHFFFAOYSA-N 2-[methyl-(4-nitro-2,1,3-benzoxadiazol-7-yl)amino]ethyl 2-iodoacetate Chemical compound ICC(=O)OCCN(C)C1=CC=C([N+]([O-])=O)C2=NON=C12 BEDDXHFONHIYPB-UHFFFAOYSA-N 0.000 description 2
- BCHZICNRHXRCHY-UHFFFAOYSA-N 2h-oxazine Chemical compound N1OC=CC=C1 BCHZICNRHXRCHY-UHFFFAOYSA-N 0.000 description 2
- HJCUTNIGJHJGCF-UHFFFAOYSA-N 9,10-dihydroacridine Chemical compound C1=CC=C2CC3=CC=CC=C3NC2=C1 HJCUTNIGJHJGCF-UHFFFAOYSA-N 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 2
- 241000207875 Antirrhinum Species 0.000 description 2
- 240000001436 Antirrhinum majus Species 0.000 description 2
- 241000219195 Arabidopsis thaliana Species 0.000 description 2
- 241000205042 Archaeoglobus fulgidus Species 0.000 description 2
- 235000005781 Avena Nutrition 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 240000007124 Brassica oleracea Species 0.000 description 2
- 235000003899 Brassica oleracea var acephala Nutrition 0.000 description 2
- 235000011301 Brassica oleracea var capitata Nutrition 0.000 description 2
- 235000001169 Brassica oleracea var oleracea Nutrition 0.000 description 2
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 2
- 241000219193 Brassicaceae Species 0.000 description 2
- 241000209200 Bromus Species 0.000 description 2
- 108091006146 Channels Proteins 0.000 description 2
- 235000007516 Chrysanthemum Nutrition 0.000 description 2
- 244000189548 Chrysanthemum x morifolium Species 0.000 description 2
- 241000218158 Clematis Species 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 240000002395 Euphorbia pulcherrima Species 0.000 description 2
- 241000220485 Fabaceae Species 0.000 description 2
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 2
- 244000068988 Glycine max Species 0.000 description 2
- 235000010469 Glycine max Nutrition 0.000 description 2
- 241000209219 Hordeum Species 0.000 description 2
- 240000005979 Hordeum vulgare Species 0.000 description 2
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 2
- SIKJAQJRHWYJAI-UHFFFAOYSA-N Indole Chemical compound C1=CC=C2NC=CC2=C1 SIKJAQJRHWYJAI-UHFFFAOYSA-N 0.000 description 2
- 241000235058 Komagataella pastoris Species 0.000 description 2
- 241000209082 Lolium Species 0.000 description 2
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 2
- 102000003939 Membrane transport proteins Human genes 0.000 description 2
- 108090000301 Membrane transport proteins Proteins 0.000 description 2
- BAVYZALUXZFZLV-UHFFFAOYSA-O Methylammonium ion Chemical compound [NH3+]C BAVYZALUXZFZLV-UHFFFAOYSA-O 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- UFWIBTONFRDIAS-UHFFFAOYSA-N Naphthalene Chemical compound C1=CC=CC2=CC=CC=C21 UFWIBTONFRDIAS-UHFFFAOYSA-N 0.000 description 2
- 240000007594 Oryza sativa Species 0.000 description 2
- 235000007164 Oryza sativa Nutrition 0.000 description 2
- 244000046052 Phaseolus vulgaris Species 0.000 description 2
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 2
- 241000235648 Pichia Species 0.000 description 2
- JUJWROOIHBZHMG-UHFFFAOYSA-N Pyridine Chemical compound C1=CC=NC=C1 JUJWROOIHBZHMG-UHFFFAOYSA-N 0.000 description 2
- SMWDFEZZVXVKRB-UHFFFAOYSA-N Quinoline Chemical compound N1=CC=CC2=CC=CC=C21 SMWDFEZZVXVKRB-UHFFFAOYSA-N 0.000 description 2
- 241000109329 Rosa xanthina Species 0.000 description 2
- 235000004789 Rosa xanthina Nutrition 0.000 description 2
- 241000209051 Saccharum Species 0.000 description 2
- 235000005775 Setaria Nutrition 0.000 description 2
- 241000232088 Setaria <nematode> Species 0.000 description 2
- 241000207763 Solanum Species 0.000 description 2
- 235000002634 Solanum Nutrition 0.000 description 2
- 240000003768 Solanum lycopersicum Species 0.000 description 2
- 244000061456 Solanum tuberosum Species 0.000 description 2
- 235000002595 Solanum tuberosum Nutrition 0.000 description 2
- 108700019146 Transgenes Proteins 0.000 description 2
- 235000019714 Triticale Nutrition 0.000 description 2
- 241000209149 Zea Species 0.000 description 2
- 240000008042 Zea mays Species 0.000 description 2
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 2
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 2
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 125000001931 aliphatic group Chemical group 0.000 description 2
- 235000019270 ammonium chloride Nutrition 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 150000001450 anions Chemical class 0.000 description 2
- MWPLVEDNUUSJAV-UHFFFAOYSA-N anthracene Chemical compound C1=CC=CC2=CC3=CC=CC=C3C=C21 MWPLVEDNUUSJAV-UHFFFAOYSA-N 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 238000003491 array Methods 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- CUFNKYGDVFVPHO-UHFFFAOYSA-N azulene Chemical compound C1=CC=CC2=CC=CC2=C1 CUFNKYGDVFVPHO-UHFFFAOYSA-N 0.000 description 2
- IOJUPLGTWVMSFF-UHFFFAOYSA-N benzothiazole Chemical compound C1=CC=C2SC=NC2=C1 IOJUPLGTWVMSFF-UHFFFAOYSA-N 0.000 description 2
- 108091005948 blue fluorescent proteins Proteins 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 125000004432 carbon atom Chemical group C* 0.000 description 2
- 150000001768 cations Chemical class 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 229940106189 ceramide Drugs 0.000 description 2
- 150000001783 ceramides Chemical class 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 244000038559 crop plants Species 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 239000002359 drug metabolite Substances 0.000 description 2
- 235000013399 edible fruits Nutrition 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000002189 fluorescence spectrum Methods 0.000 description 2
- KWIUHFFTVRNATP-UHFFFAOYSA-N glycine betaine Chemical compound C[N+](C)(C)CC([O-])=O KWIUHFFTVRNATP-UHFFFAOYSA-N 0.000 description 2
- 230000005283 ground state Effects 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 238000003384 imaging method Methods 0.000 description 2
- 238000001597 immobilized metal affinity chromatography Methods 0.000 description 2
- 238000011065 in-situ storage Methods 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- CIPFCGZLFXVXBG-ZIQZFLOESA-N inositol 1,3,4,5-tetrakisphosphate Chemical compound O[C@H]1[C@H](OP(O)(O)=O)[C@@H](O)[C@@H](OP(O)(O)=O)[C@H](OP(O)(O)=O)[C@@H]1OP(O)(O)=O CIPFCGZLFXVXBG-ZIQZFLOESA-N 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 235000009973 maize Nutrition 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 229910021645 metal ion Inorganic materials 0.000 description 2
- 235000019713 millet Nutrition 0.000 description 2
- 239000002858 neurotransmitter agent Substances 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 230000000704 physical effect Effects 0.000 description 2
- 235000012015 potatoes Nutrition 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000007115 recruitment Effects 0.000 description 2
- 238000002165 resonance energy transfer Methods 0.000 description 2
- 230000004043 responsiveness Effects 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 2
- 235000009566 rice Nutrition 0.000 description 2
- 238000009738 saturating Methods 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 238000004062 sedimentation Methods 0.000 description 2
- 239000000377 silicon dioxide Substances 0.000 description 2
- 239000011734 sodium Substances 0.000 description 2
- 229910052708 sodium Inorganic materials 0.000 description 2
- 239000004289 sodium hydrogen sulphite Substances 0.000 description 2
- 150000003408 sphingolipids Chemical class 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 125000001424 substituent group Chemical group 0.000 description 2
- 238000004448 titration Methods 0.000 description 2
- 241001515965 unidentified phage Species 0.000 description 2
- 241000228158 x Triticosecale Species 0.000 description 2
- 229910052725 zinc Inorganic materials 0.000 description 2
- 239000011701 zinc Substances 0.000 description 2
- PHIQHXFUZVPYII-ZCFIWIBFSA-N (R)-carnitine Chemical compound C[N+](C)(C)C[C@H](O)CC([O-])=O PHIQHXFUZVPYII-ZCFIWIBFSA-N 0.000 description 1
- PORPENFLTBBHSG-MGBGTMOVSA-N 1,2-dihexadecanoyl-sn-glycerol-3-phosphate Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP(O)(O)=O)OC(=O)CCCCCCCCCCCCCCC PORPENFLTBBHSG-MGBGTMOVSA-N 0.000 description 1
- TZCPCKNHXULUIY-RGULYWFUSA-N 1,2-distearoyl-sn-glycero-3-phosphoserine Chemical compound CCCCCCCCCCCCCCCCCC(=O)OC[C@H](COP(O)(=O)OC[C@H](N)C(O)=O)OC(=O)CCCCCCCCCCCCCCCCC TZCPCKNHXULUIY-RGULYWFUSA-N 0.000 description 1
- BCMCBBGGLRIHSE-UHFFFAOYSA-N 1,3-benzoxazole Chemical compound C1=CC=C2OC=NC2=C1 BCMCBBGGLRIHSE-UHFFFAOYSA-N 0.000 description 1
- HIYWOHBEPVGIQN-UHFFFAOYSA-N 1h-benzo[g]indole Chemical compound C1=CC=CC2=C(NC=C3)C3=CC=C21 HIYWOHBEPVGIQN-UHFFFAOYSA-N 0.000 description 1
- LRYZPFWEZHSTHD-HEFFAWAOSA-O 2-[[(e,2s,3r)-2-formamido-3-hydroxyoctadec-4-enoxy]-hydroxyphosphoryl]oxyethyl-trimethylazanium Chemical class CCCCCCCCCCCCC\C=C\[C@@H](O)[C@@H](NC=O)COP(O)(=O)OCC[N+](C)(C)C LRYZPFWEZHSTHD-HEFFAWAOSA-O 0.000 description 1
- QQSQAOKMDCMDKR-UHFFFAOYSA-N 2-bromo-n-(2-sulfamoylethyl)acetamide Chemical compound NS(=O)(=O)CCNC(=O)CBr QQSQAOKMDCMDKR-UHFFFAOYSA-N 0.000 description 1
- KXWFXUJVBMIZPU-UHFFFAOYSA-N 2-bromo-n-[2-[[4-[5-[4-(dimethylamino)phenyl]-1,3-oxazol-2-yl]phenyl]sulfonylamino]ethyl]acetamide Chemical compound C1=CC(N(C)C)=CC=C1C1=CN=C(C=2C=CC(=CC=2)S(=O)(=O)NCCNC(=O)CBr)O1 KXWFXUJVBMIZPU-UHFFFAOYSA-N 0.000 description 1
- JUIKUQOUMZUFQT-UHFFFAOYSA-N 2-bromoacetamide Chemical compound NC(=O)CBr JUIKUQOUMZUFQT-UHFFFAOYSA-N 0.000 description 1
- CMLFRMDBDNHMRA-UHFFFAOYSA-N 2h-1,2-benzoxazine Chemical compound C1=CC=C2C=CNOC2=C1 CMLFRMDBDNHMRA-UHFFFAOYSA-N 0.000 description 1
- WCICUBWFIOLNSV-UHFFFAOYSA-N 2h-oxazine-3,4-diamine Chemical class NC1=C(N)C=CON1 WCICUBWFIOLNSV-UHFFFAOYSA-N 0.000 description 1
- GOLORTLGFDVFDW-UHFFFAOYSA-N 3-(1h-benzimidazol-2-yl)-7-(diethylamino)chromen-2-one Chemical compound C1=CC=C2NC(C3=CC4=CC=C(C=C4OC3=O)N(CC)CC)=NC2=C1 GOLORTLGFDVFDW-UHFFFAOYSA-N 0.000 description 1
- JZUWLOGLONEFKL-UHFFFAOYSA-N 4-[(4-hydroxyphenyl)methylidene]imidazolidin-2-one Chemical compound C1=CC(O)=CC=C1C=C1NC(=O)NC1 JZUWLOGLONEFKL-UHFFFAOYSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- BHOXPDZNQUGRRH-UHFFFAOYSA-N 4-amino-4h-oxazin-3-one Chemical class NC1C=CONC1=O BHOXPDZNQUGRRH-UHFFFAOYSA-N 0.000 description 1
- 125000003143 4-hydroxybenzyl group Chemical group [H]C([*])([H])C1=C([H])C([H])=C(O[H])C([H])=C1[H] 0.000 description 1
- ZMERMCRYYFRELX-UHFFFAOYSA-N 5-{[2-(iodoacetamido)ethyl]amino}naphthalene-1-sulfonic acid Chemical compound C1=CC=C2C(S(=O)(=O)O)=CC=CC2=C1NCCNC(=O)CI ZMERMCRYYFRELX-UHFFFAOYSA-N 0.000 description 1
- JVKVVXNSKFDAIB-UHFFFAOYSA-N 6-aminoxanthen-3-one Chemical class C1=CC(=O)C=C2OC3=CC(N)=CC=C3C=C21 JVKVVXNSKFDAIB-UHFFFAOYSA-N 0.000 description 1
- HXYAESYPQJYEBJ-UHFFFAOYSA-N 6-hydroxyxanthen-3-one Chemical class C1=CC(=O)C=C2OC3=CC(O)=CC=C3C=C21 HXYAESYPQJYEBJ-UHFFFAOYSA-N 0.000 description 1
- DJXIADRXORLNJX-UHFFFAOYSA-N 6-iminoxanthen-3-amine Chemical class C1=CC(=N)C=C2OC3=CC(N)=CC=C3C=C21 DJXIADRXORLNJX-UHFFFAOYSA-N 0.000 description 1
- WPWUFUBLGADILS-WDSKDSINSA-N Ala-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WPWUFUBLGADILS-WDSKDSINSA-N 0.000 description 1
- 102000034263 Amino acid transporters Human genes 0.000 description 1
- 108050005273 Amino acid transporters Proteins 0.000 description 1
- 101710190609 Ammonium transporter MEP2 Proteins 0.000 description 1
- 108091006986 Amt proteins Proteins 0.000 description 1
- RJUHZPRQRQLCFL-IMJSIDKUSA-N Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O RJUHZPRQRQLCFL-IMJSIDKUSA-N 0.000 description 1
- IIFDPDVJAHQFSR-WHFBIAKZSA-N Asn-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O IIFDPDVJAHQFSR-WHFBIAKZSA-N 0.000 description 1
- IQTUDDBANZYMAR-WDSKDSINSA-N Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O IQTUDDBANZYMAR-WDSKDSINSA-N 0.000 description 1
- HSPSXROIMXIJQW-BQBZGAKWSA-N Asp-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 HSPSXROIMXIJQW-BQBZGAKWSA-N 0.000 description 1
- 229930192334 Auxin Natural products 0.000 description 1
- 235000007319 Avena orientalis Nutrition 0.000 description 1
- 235000007558 Avena sp Nutrition 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000219310 Beta vulgaris subsp. vulgaris Species 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 241000167854 Bourreria succulenta Species 0.000 description 1
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 description 1
- 240000000385 Brassica napus var. napus Species 0.000 description 1
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 241000700198 Cavia Species 0.000 description 1
- 108091005944 Cerulean Proteins 0.000 description 1
- 241000282552 Chlorocebus aethiops Species 0.000 description 1
- 108091005960 Citrine Proteins 0.000 description 1
- 241000207199 Citrus Species 0.000 description 1
- 235000005979 Citrus limon Nutrition 0.000 description 1
- 244000131522 Citrus pyriformis Species 0.000 description 1
- 240000000560 Citrus x paradisi Species 0.000 description 1
- 241000218631 Coniferophyta Species 0.000 description 1
- 102000034534 Cotransporters Human genes 0.000 description 1
- 108020003264 Cotransporters Proteins 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- 241000700662 Fowlpox virus Species 0.000 description 1
- 240000009088 Fragaria x ananassa Species 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- PABVKUJVLNMOJP-WHFBIAKZSA-N Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(O)=O PABVKUJVLNMOJP-WHFBIAKZSA-N 0.000 description 1
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 1
- ZWZWYGMENQVNFU-UHFFFAOYSA-N Glycerophosphorylserin Natural products OC(=O)C(N)COP(O)(=O)OCC(O)CO ZWZWYGMENQVNFU-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 241000219146 Gossypium Species 0.000 description 1
- 235000007340 Hordeum vulgare Nutrition 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- WMDZARSFSMZOQO-DRZSPHRISA-N Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WMDZARSFSMZOQO-DRZSPHRISA-N 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 241000235649 Kluyveromyces Species 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- 240000008415 Lactuca sativa Species 0.000 description 1
- 235000003228 Lactuca sativa Nutrition 0.000 description 1
- 108090001090 Lectins Proteins 0.000 description 1
- 102000004856 Lectins Human genes 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 102000004895 Lipoproteins Human genes 0.000 description 1
- PEEHTFAAVSWFBL-UHFFFAOYSA-N Maleimide Chemical compound O=C1NC(=O)C=C1 PEEHTFAAVSWFBL-UHFFFAOYSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 240000003183 Manihot esculenta Species 0.000 description 1
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 description 1
- 240000004658 Medicago sativa Species 0.000 description 1
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 244000202563 Meum athamanticum Species 0.000 description 1
- 238000006957 Michael reaction Methods 0.000 description 1
- 102220624132 Minor histocompatibility antigen H13_S88G_mutation Human genes 0.000 description 1
- 240000003433 Miscanthus floridulus Species 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- NQTADLQHYWFPDB-UHFFFAOYSA-N N-Hydroxysuccinimide Chemical class ON1C(=O)CCC1=O NQTADLQHYWFPDB-UHFFFAOYSA-N 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- ZCQWOFVYLHDMMC-UHFFFAOYSA-N Oxazole Chemical compound C1=COC=N1 ZCQWOFVYLHDMMC-UHFFFAOYSA-N 0.000 description 1
- 102220503462 PX domain-containing protein kinase-like protein_T116V_mutation Human genes 0.000 description 1
- 241001520808 Panicum virgatum Species 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- FADYJNXDPBKVCA-STQMWFEESA-N Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FADYJNXDPBKVCA-STQMWFEESA-N 0.000 description 1
- 241000195887 Physcomitrella patens Species 0.000 description 1
- 108010039918 Polylysine Proteins 0.000 description 1
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 1
- 102000004257 Potassium Channel Human genes 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 108010067787 Proteoglycans Proteins 0.000 description 1
- 102000016611 Proteoglycans Human genes 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 240000000111 Saccharum officinarum Species 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- 235000007238 Secale cereale Nutrition 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- UJTZHGHXJKIAOS-WHFBIAKZSA-N Ser-Gln Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O UJTZHGHXJKIAOS-WHFBIAKZSA-N 0.000 description 1
- LZLREEUGSYITMX-JQWIXIFHSA-N Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(O)=O)=CNC2=C1 LZLREEUGSYITMX-JQWIXIFHSA-N 0.000 description 1
- ILVGMCVCQBJPSH-WDSKDSINSA-N Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO ILVGMCVCQBJPSH-WDSKDSINSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- PJANXHGTPQOBST-VAWYXSNFSA-N Stilbene Natural products C=1C=CC=CC=1/C=C/C1=CC=CC=C1 PJANXHGTPQOBST-VAWYXSNFSA-N 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 235000021536 Sugar beet Nutrition 0.000 description 1
- 241000701093 Suid alphaherpesvirus 1 Species 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- FZWLAAWBMGSTSO-UHFFFAOYSA-N Thiazole Chemical compound C1=CSC=N1 FZWLAAWBMGSTSO-UHFFFAOYSA-N 0.000 description 1
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- JAQGKXUEKGKTKX-HOTGVXAUSA-N Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 JAQGKXUEKGKTKX-HOTGVXAUSA-N 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 240000006677 Vicia faba Species 0.000 description 1
- 235000010749 Vicia faba Nutrition 0.000 description 1
- 235000002098 Vicia faba var. major Nutrition 0.000 description 1
- 240000006365 Vitis vinifera Species 0.000 description 1
- 235000014787 Vitis vinifera Nutrition 0.000 description 1
- ATBOMIWRCZXYSZ-XZBBILGWSA-N [1-[2,3-dihydroxypropoxy(hydroxy)phosphoryl]oxy-3-hexadecanoyloxypropan-2-yl] (9e,12e)-octadeca-9,12-dienoate Chemical compound CCCCCCCCCCCCCCCC(=O)OCC(COP(O)(=O)OCC(O)CO)OC(=O)CCCCCCC\C=C\C\C=C\CCCCC ATBOMIWRCZXYSZ-XZBBILGWSA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- DPKHZNPWBDQZCN-UHFFFAOYSA-N acridine orange free base Chemical compound C1=CC(N(C)C)=CC2=NC3=CC(N(C)C)=CC=C3C=C21 DPKHZNPWBDQZCN-UHFFFAOYSA-N 0.000 description 1
- 150000001252 acrylic acid derivatives Chemical class 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000036982 action potential Effects 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 150000001299 aldehydes Chemical class 0.000 description 1
- 125000003342 alkenyl group Chemical group 0.000 description 1
- 125000003545 alkoxy group Chemical group 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 125000000304 alkynyl group Chemical group 0.000 description 1
- 230000003281 allosteric effect Effects 0.000 description 1
- AWUCVROLDVIAJX-UHFFFAOYSA-N alpha-glycerophosphate Natural products OCC(O)COP(O)(O)=O AWUCVROLDVIAJX-UHFFFAOYSA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- RQVYBGPQFYCBGX-UHFFFAOYSA-N ametryn Chemical compound CCNC1=NC(NC(C)C)=NC(SC)=N1 RQVYBGPQFYCBGX-UHFFFAOYSA-N 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 150000001412 amines Chemical group 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- RWZYAGGXGHYGMB-UHFFFAOYSA-N anthranilic acid Chemical compound NC1=CC=CC=C1C(O)=O RWZYAGGXGHYGMB-UHFFFAOYSA-N 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000003125 aqueous solvent Substances 0.000 description 1
- 125000003710 aryl alkyl group Chemical group 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 238000005844 autocatalytic reaction Methods 0.000 description 1
- 239000002363 auxin Substances 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 125000005605 benzo group Chemical group 0.000 description 1
- SJCPQBRQOOJBFM-UHFFFAOYSA-N benzo[a]phenalen-1-one Chemical compound C1=CC=C2C(C(=O)C=C3)=C4C3=CC=CC4=CC2=C1 SJCPQBRQOOJBFM-UHFFFAOYSA-N 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 229960003237 betaine Drugs 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 239000002551 biofuel Substances 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 238000004061 bleaching Methods 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 102220358592 c.277G>T Human genes 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 239000000298 carbocyanine Substances 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 125000005606 carbostyryl group Chemical group 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 229960004203 carnitine Drugs 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000005277 cation exchange chromatography Methods 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 229930183167 cerebroside Natural products 0.000 description 1
- 150000001784 cerebrosides Chemical class 0.000 description 1
- 125000003636 chemical group Chemical group 0.000 description 1
- 239000012707 chemical precursor Substances 0.000 description 1
- 235000019693 cherries Nutrition 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- OEYIOHPDSNJKLS-UHFFFAOYSA-N choline Chemical compound C[N+](C)(C)CCO OEYIOHPDSNJKLS-UHFFFAOYSA-N 0.000 description 1
- 229960001231 choline Drugs 0.000 description 1
- 239000011035 citrine Substances 0.000 description 1
- 235000020971 citrus fruits Nutrition 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 238000004624 confocal microscopy Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 229960000956 coumarin Drugs 0.000 description 1
- 239000000287 crude extract Substances 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 125000004093 cyano group Chemical group *C#N 0.000 description 1
- 125000000753 cycloalkyl group Chemical group 0.000 description 1
- 210000000172 cytosol Anatomy 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- 235000015872 dietary supplement Nutrition 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 238000006471 dimerization reaction Methods 0.000 description 1
- 150000002016 disaccharides Chemical class 0.000 description 1
- 229940000406 drug candidate Drugs 0.000 description 1
- 238000009510 drug design Methods 0.000 description 1
- 238000000295 emission spectrum Methods 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- YQGOJNYOYNNSMM-UHFFFAOYSA-N eosin Chemical compound [Na+].OC(=O)C1=CC=CC=C1C1=C2C=C(Br)C(=O)C(Br)=C2OC2=C(Br)C(O)=C(Br)C=C21 YQGOJNYOYNNSMM-UHFFFAOYSA-N 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 238000000695 excitation spectrum Methods 0.000 description 1
- 230000005281 excited state Effects 0.000 description 1
- 230000029142 excretion Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 150000004665 fatty acids Chemical class 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 238000002875 fluorescence polarization Methods 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 150000002270 gangliosides Chemical class 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 210000004602 germ cell Anatomy 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 150000002327 glycerophospholipids Chemical class 0.000 description 1
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 229910052736 halogen Inorganic materials 0.000 description 1
- 150000002367 halogens Chemical class 0.000 description 1
- 125000001072 heteroaryl group Chemical group 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000004191 hydrophobic interaction chromatography Methods 0.000 description 1
- 238000012872 hydroxylapatite chromatography Methods 0.000 description 1
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Substances C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 1
- YAMHXTCMCPHKLN-UHFFFAOYSA-N imidazolidin-2-one Chemical compound O=C1NCCN1 YAMHXTCMCPHKLN-UHFFFAOYSA-N 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- PZOUSPYUWWUPPK-UHFFFAOYSA-N indole Natural products CC1=CC=CC2=C1C=CN2 PZOUSPYUWWUPPK-UHFFFAOYSA-N 0.000 description 1
- SEOVTRFCIGRIMH-UHFFFAOYSA-N indole-3-acetic acid Chemical compound C1=CC=C2C(CC(=O)O)=CNC2=C1 SEOVTRFCIGRIMH-UHFFFAOYSA-N 0.000 description 1
- RKJUIXBNRJVNHR-UHFFFAOYSA-N indolenine Natural products C1=CC=C2CC=NC2=C1 RKJUIXBNRJVNHR-UHFFFAOYSA-N 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 238000013101 initial test Methods 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- PGLTVOMIXTUURA-UHFFFAOYSA-N iodoacetamide Chemical compound NC(=O)CI PGLTVOMIXTUURA-UHFFFAOYSA-N 0.000 description 1
- 230000037427 ion transport Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 239000002523 lectin Substances 0.000 description 1
- 235000021374 legumes Nutrition 0.000 description 1
- 229920006008 lipopolysaccharide Polymers 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- DLBFLQKQABVKGT-UHFFFAOYSA-L lucifer yellow dye Chemical compound [Li+].[Li+].[O-]S(=O)(=O)C1=CC(C(N(C(=O)NN)C2=O)=O)=C3C2=CC(S([O-])(=O)=O)=CC3=C1N DLBFLQKQABVKGT-UHFFFAOYSA-L 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 230000028161 membrane depolarization Effects 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 229910044991 metal oxide Inorganic materials 0.000 description 1
- 150000004706 metal oxides Chemical class 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 239000003068 molecular probe Substances 0.000 description 1
- 238000012895 mono-exponential function Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 150000002772 monosaccharides Chemical class 0.000 description 1
- 229920005615 natural polymer Polymers 0.000 description 1
- 230000006491 negative regulation of transport Effects 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- VOFUROIFQGPCGE-UHFFFAOYSA-N nile red Chemical compound C1=CC=C2C3=NC4=CC=C(N(CC)CC)C=C4OC3=CC(=O)C2=C1 VOFUROIFQGPCGE-UHFFFAOYSA-N 0.000 description 1
- 125000000449 nitro group Chemical group [O-][N+](*)=O 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 229920001542 oligosaccharide Polymers 0.000 description 1
- 150000002482 oligosaccharides Chemical class 0.000 description 1
- 150000002894 organic compounds Chemical class 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 210000003101 oviduct Anatomy 0.000 description 1
- 150000004893 oxazines Chemical class 0.000 description 1
- 230000020477 pH reduction Effects 0.000 description 1
- 125000005010 perfluoroalkyl group Chemical group 0.000 description 1
- 230000002688 persistence Effects 0.000 description 1
- 125000002080 perylenyl group Chemical group C1(=CC=C2C=CC=C3C4=CC=CC5=CC=CC(C1=C23)=C45)* 0.000 description 1
- CSHWQDPOILHKBI-UHFFFAOYSA-N peryrene Natural products C1=CC(C2=CC=CC=3C2=C2C=CC=3)=C3C2=CC=CC3=C1 CSHWQDPOILHKBI-UHFFFAOYSA-N 0.000 description 1
- WWBGWPHHLRSTFI-UHFFFAOYSA-N phenalen-1-one Chemical compound C1=CC(C(=O)C=C2)=C3C2=CC=CC3=C1 WWBGWPHHLRSTFI-UHFFFAOYSA-N 0.000 description 1
- XQYMIMUDVJCMLU-UHFFFAOYSA-N phenoxyperoxybenzene Chemical compound C=1C=CC=CC=1OOOC1=CC=CC=C1 XQYMIMUDVJCMLU-UHFFFAOYSA-N 0.000 description 1
- WTJKGGKOPKCXLL-RRHRGVEJSA-N phosphatidylcholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCC=CCCCCCCCC WTJKGGKOPKCXLL-RRHRGVEJSA-N 0.000 description 1
- 150000008104 phosphatidylethanolamines Chemical class 0.000 description 1
- 150000003905 phosphatidylinositols Chemical class 0.000 description 1
- 229940080469 phosphocellulose Drugs 0.000 description 1
- 150000003904 phospholipids Chemical class 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- QWYZFXLSWMXLDM-UHFFFAOYSA-M pinacyanol iodide Chemical compound [I-].C1=CC2=CC=CC=C2N(CC)C1=CC=CC1=CC=C(C=CC=C2)C2=[N+]1CC QWYZFXLSWMXLDM-UHFFFAOYSA-M 0.000 description 1
- 230000010287 polarization Effects 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920000768 polyamine Polymers 0.000 description 1
- 229920000656 polylysine Polymers 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 150000004032 porphyrins Chemical class 0.000 description 1
- 235000020004 porter Nutrition 0.000 description 1
- 230000023603 positive regulation of transcription initiation, DNA-dependent Effects 0.000 description 1
- 229910052700 potassium Inorganic materials 0.000 description 1
- 239000011591 potassium Substances 0.000 description 1
- 108020001213 potassium channel Proteins 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- UMJSCPRVCHMLSP-UHFFFAOYSA-N pyridine Natural products COC1=CC=CN=C1 UMJSCPRVCHMLSP-UHFFFAOYSA-N 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000010791 quenching Methods 0.000 description 1
- 230000000171 quenching effect Effects 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000001172 regenerating effect Effects 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000008844 regulatory mechanism Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 102220024545 rs58907919 Human genes 0.000 description 1
- YGSDEFSMJLZEOE-UHFFFAOYSA-M salicylate Chemical compound OC1=CC=CC=C1C([O-])=O YGSDEFSMJLZEOE-UHFFFAOYSA-M 0.000 description 1
- 229960001860 salicylate Drugs 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 239000013605 shuttle vector Substances 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- QGMRQYFBGABWDR-UHFFFAOYSA-N sodium;5-ethyl-5-pentan-2-yl-1,3-diazinane-2,4,6-trione Chemical compound [Na+].CCCC(C)C1(CC)C(=O)NC(=O)NC1=O QGMRQYFBGABWDR-UHFFFAOYSA-N 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 238000009987 spinning Methods 0.000 description 1
- 238000012289 standard assay Methods 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- PJANXHGTPQOBST-UHFFFAOYSA-N stilbene Chemical compound C=1C=CC=CC=1C=CC1=CC=CC=C1 PJANXHGTPQOBST-UHFFFAOYSA-N 0.000 description 1
- 235000021286 stilbenes Nutrition 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 235000021012 strawberries Nutrition 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 239000011593 sulfur Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 229920001059 synthetic polymer Polymers 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- 229910052723 transition metal Inorganic materials 0.000 description 1
- 150000003624 transition metals Chemical class 0.000 description 1
- 150000003626 triacylglycerols Chemical class 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- 239000013638 trimer Substances 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 239000010981 turquoise Substances 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- 238000002424 x-ray crystallography Methods 0.000 description 1
- 125000001834 xanthenyl group Chemical group C1=CC=CC=2OC3=CC=CC=C3C(C12)* 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/5005—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells
- G01N33/5008—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics
- G01N33/502—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics for testing non-proliferative effects
- G01N33/5035—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics for testing non-proliferative effects on sub-cellular localization
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/43504—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates
- C07K14/43595—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates from coelenteratae, e.g. medusae
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6872—Intracellular protein regulatory factors and their receptors, e.g. including ion channels
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/20—Fusion polypeptide containing a tag with affinity for a non-protein ligand
- C07K2319/21—Fusion polypeptide containing a tag with affinity for a non-protein ligand containing a His-tag
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/20—Fusion polypeptide containing a tag with affinity for a non-protein ligand
- C07K2319/23—Fusion polypeptide containing a tag with affinity for a non-protein ligand containing a GST-tag
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/40—Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation
- C07K2319/43—Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation containing a FLAG-tag
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/60—Fusion polypeptide containing spectroscopic/fluorescent detection, e.g. green fluorescent protein [GFP]
Definitions
- a computer readable text file entitled “056100-5085-WO-SequenceListing.txt,” created on or about 12 Sep. 2012 with a file size of about 18 kb contains the sequence listing for this application and is hereby incorporated by reference in its entirety.
- the invention relates to engineered transporter proteins comprising at least one fluorescent reporter covalently bound to the transporter protein, wherein the transporter proteins of the present invention comprise a structural inverted repeat motif, with the motif comprising a first and second subunit that are structural inverted repeats of one another and that are joined to one another by a polypeptide loop.
- Transporter proteins are membrane bound proteins that often utilize potential gradients to drive transport of molecules and ions into and out of cells. Traditionally, these proteins are classified based on the similarity of their amino acid sequence, i.e., the primary structure of the protein.
- Voltage sensors have been generated by others based on, for example, potassium channels, but these voltage sensors do not directly measure potassium transport. Instead these voltage sensors merely measure membrane potential using the voltage sensor of the transporter. To date, there has been no way to monitor or directly measure molecule or ion transport.
- the present invention solves the problems by providing transport proteins that are capable of signaling a conformational change during the transport process.
- the invention thus provides sensors that are based on virtually any transporter with the required three-dimensional conformation, which, in turn, provides a new means of monitoring their activity or modifications by the regulatory machinery in real time in vivo
- the invention relates to engineered transporter proteins comprising at least one fluorescent reporter covalently bound to the transporter protein.
- the transporter proteins of the present invention comprise a structural inverted repeat motif, with the motif comprising a first and second subunit that are structural inverted repeats of one another and that are joined to one another by a polypeptide loop.
- the fluorescent reporter typically is covalently bound to the polypeptide loop that joins the two subunits of the structural inverted repeat.
- the invention also relates to methods of using these engineered transporter proteins to monitor analyte movement across a membrane, such as a cell membrane.
- the invention also relates to nucleic acids encoding the engineered transporter proteins.
- FIGS. 1A, 1B, 1C, 1D, 1E and 1F depict the design and characterization of one specific ammonium transporter named “AmTrac.”
- FIG. 1A Topological representation of AMT1; 3 by HMMTOP.
- 11 Transmembrane helices (TMH) are organized in a pseudo-symmetric structure, i.e., structural inverted repeats, (TMH I-V and TMH VI-X) with an extra terminal TMH-XI that directs the C-terminus to the cytosol.
- TMH I-V and TMH VI-X Transmembrane helices
- the position of the residues preceding the insertion points of FPs in L5-6, 7-8, 9-10 and in the C-tail are indicated.
- FIG. 1B Schematic representation of AmTrac transporter. Linkers between AMT1; 3 and mcpGFP are indicated.
- FIG. 1C Three-dimensional model of AmTrac based on the crystal structures of Af-AMT1 (2B2H) and cpGFP (3evp). One monomer is shown in cartoon and the rest of the trimer complex is represented as a shaded surface in the background. mcpGFP (bottom) was inserted in position 233 of L5-6 of AMT1; 3, connecting TMH-V and -VI.
- FIG. 1D Substrate specificity of the fluorescent response of AmTrac.
- FIG. 1F Response of a single yeast cell expressing AmTrac to square pulses of NH 4 Cl.
- FIGS. 2A, 2B, 2C, 2D, 2E and 2F depict AmTrac mutant analysis.
- FIG. 2B Growth of the ⁇ mep1,2,3 or wt yeast expressing AmTrac or its inactive variants on solid media containing 2 mM NH 4 Cl or 1 mM arginine (growth control) as sole nitrogen source for 3 days. Endogenous MEPs in wt strain are not affected by expression of mutant variants.
- FIG. 2B Growth of the ⁇ mep1,2,3 or wt yeast expressing AmTrac or its inactive variants on solid media containing 2 mM NH 4 Cl or 1 mM arginine
- FIG. 2D Growth complementation of ⁇ mep1,2,3 expressing suppressor mutants grown on solid media containing the indicated concentrations of NH 4 Cl or 1 mM arginine. AmTrac-T464D-A141E expressing cells grow poorly at high ammonium concentrations.
- FIG. 2D Growth complementation of ⁇ mep1,2,3 expressing suppressor mutants grown on solid media containing the indicated concentrations of NH 4 Cl or 1 mM arginine. AmTrac-T464D-A141E expressing cells grow poorly at high ammonium concentrations.
- FIGS. 3A and 3B depict a model for AmTrac sensor and AMT transport.
- AMT switches between at least two distinct states during transport of ammonium: an outward, open state ( FIG. 3A ) and an inward, open state ( FIG. 3B ).
- the movement of TMH-V (left helix) and TMH-VI (right helix) is transmitted to the connecting loop, affecting the inserted mcpGFP and resulting in a change in fluorescence emission.
- FIGS. 4A and 4B depict the characterization of constructs with FP insertions.
- FIG. 4A The functionality of the transporters was measured as growth of the yeast ⁇ mep1,2,3 mutant transformed with AMT-FP fusions and grown on solid media containing 2 mM NH 4 Cl or 1 mM arginine (growth control) as the sole nitrogen source for three days. Numbers indicate the position in AtAMT1; 3 preceding the insertion site. Control: empty vector.
- FIG. 4B Fluorescence emission spectra of the fusion construct AMT1; 3-mcpGFP-233 expressed in yeast.
- FIG. 5 depicts a model of the AtAMT1; 3-mcpGFP fusion.
- mcpGFP bottom portion
- AtAMT1 3 in position 233 of the cytoplasmic L5-6 bridging TMH-V to TMH-VI.
- the connection point between the two proteins is shown as dashed line.
- Residues shown in sticks highlight the location of several residues relevant for transport function, in particular TMH-V-H157 from the twin-His (H157-H305) pair, TMH-VI-5208 (forming the ammonium selectivity and recruitment site together with W137), TMH-VI-P204 from the twin Phe-gate (F96-F204) that close the pore in all known structures.
- the peptide loop connecting TMH V-VI, L5-6 also connects the two pseudo-symmetric halves of the structural inverted repeat of the protein (TMH I-V and TMH VI-X).
- FIGS. 6A and 6B depict a screen of linker variants for identifying a AMT1; 3-mcpGFP sensor.
- the composition and length of the peptide linker was varied.
- FIG. 6A Growth of the yeast ⁇ mep1,2,3 mutant transformed with fusion variants on solid media containing 2 mM NH 4 Cl or 1 mM arginine (growth control) as the sole nitrogen source for three days. All variants retained transport activity. Composition of linkers connecting AtAMT1; 3 and mcpGFP are indicated.
- Linkers at the N- and C-termini of mcpGFP are indicated in letter code and separated by a slash. In the cases of variants 1-6, no linkers were inserted between the C-terminal sequence of mcpGFP and the second part of AtAMT1; 3. Control, empty vector.
- FIG. 8 depicts the sequence of AmTrac. Protein sequence of AmTrac. Underlined residues constitute synthetic linker segments. Bolded residues correspond to the mcpGFP moiety. Numbers indicate amino acid position in AtAMT1; 3.
- FIGS. 9A, 9B and 9C depict the Influence of the insertion position of mcpGFP in L5-6 of AtAMT1; 3. To probe the tolerance of L5-6 of AMT1; 3 to insertion, the insertion position was varied within the peptide loop.
- FIG. 9A Schematic representation of the L5-6 region of AtAMT1; 3. Underlined residues correspond to TMH 5 (left) and TMH 6 (right).
- FIG. 9B Growth assay of the yeast ⁇ mep1,2,3 mutant transformed with insertion mutants on solid media containing 2 mM NH 4 Cl or 1 mM arginine (growth control) as the sole nitrogen source for three days.
- FIG. 10 depicts the growth and response of AmTrac variants with deletions in L5-6.
- incremental deletions of the peptide loop L5-6 were generated around the insertion point of mcpGFP. Growth was analyzed as described in FIGS. 9A-9C . Numbers in the left column indicate the position of the insertion in AtAMT1; 3; two numbers indicate residues preceding and following the mcpGFP insertion.
- Right column indicates whether the corresponding variant responded to addition of 1 mM NH 4 Cl with a fluorescence change. The original AmTrac is highlighted in bold. Note that deletions >4 residues abolished transport activity and deletions >3 impaired the fluorescence response.
- FIGS. 12A, 12B and 12C depict the reconstitution of the fluorescent response in cis-suppressor mutants of the transport-inactive AmTrac-T464D mutant.
- FIG. 12A Frequency of the occurrence of individual cis-suppressors of the T464D-inactivating mutation in the multicopy screen. A total of 56 colonies were retrieved. Almost half of them carried the A141E mutation. Three suppressors had a pseudo-reversion mutation: D464V.
- FIG. 12B Growth was analyzed as described in FIGS. 9A-9C , with varying concentrations of NH 4 Cl, (NH 4 ) 2 SO 4 (as anion control) or 1 mM arginine.
- FIGS. 13A, 13B and 13C depict the position of suppressor mutations in AMTs.
- FIG. 13A Sequence alignment between AtAMT1; 3 from Arabidopsis and AfAmt-1 from A. fulgidus .
- the residues belonging to TMH domains of the two halves of the structural inverted repeat of AfAMT1 are shown as underlined and speckled, respectively.
- the corresponding residues identified in the suppressor screen of AmTrac-T464D are indicated in both sequences as bold residues.
- FIG. 13B Lateral view and
- FIG. 13C cytoplasmic side view of AfAMT1 according to the crystal structure.
- the corresponding residues in AfAMT1 that suppress the T464D mutation in AmTrac are indicated by spheres.
- the connecting L5-6 is peptide loop labeled.
- FIGS. 14A and 14B depict a screen for sensor variants.
- FIG. 14A Fluorescence intensity and fluorescence response of ⁇ 350 random variants of the C-terminal linker of mcpGFP to addition of 1 mM NH 4 Cl. The brightest and most responsive variants (open squares) were sequenced and composition of the linker is reported.
- FIG. 16 depicts the responsiveness of AmTrac in root hairs of Arabidopsis . Seedlings grown in a microfluidic perfusion system (RootChip) were perfused with 50 mM MES buffer pH 6.0, or a square pulse of 1 mM NH 4 Cl in buffer.
- FIGS. 17A and 17B depict the characterization of an AmTrac version (named MEPtrac) employing the ammonium transporter MEP2 (methylammonium permease 2) from the yeast Saccharomyces caerevisiae instead of AMT1; 3 of Arabidopsis thaliana .
- MEPtrac ammonium transporter
- MEP2 methylammonium permease 2
- the mcpGFP was inserted in the middle of loop 5 of MEP2 (after amino acid 217) and connected by the linkers coding for amino acids LS (preceding mcpGFP) and FN (following mcpGFP).
- FIG. 17A Growth complementation of ⁇ mep1,2,3 expressing suppressor mutants grown on solid media containing the indicated concentrations of NH 4 Cl or 1 mM arginine or 5 mM methylammonium plus 0.1% proline. MEPtrac is able to complement growth on ammonium as well as the parent protein MEP2.
- FIGS. 18A and 18B depict the time-resolved fluorescence lifetime measurements and analysis of intact yeast cells expressing three improved AmTrac variants, AmTrac-GS, AmTrac-LS and AmTrac-IS with and without addition of 1 mM NH 4 Cl.
- FIGS. 19A and 19B depict the time-resolved anisotropy measurements of intact yeast cells expressing AmTrac-GS to elucidate the relevance of homo-FRET in the sensor mechanism.
- the invention relates to engineered transporter proteins comprising at least one fluorescent reporter covalently bound to the transporter protein.
- the transporter proteins of the present invention must comprise at least one “structural inverted repeat motif.”
- a structural inverted repeat motif refers to a specific arrangement of secondary and tertiary structures of a protein.
- a transporter protein with a “structural inverted repeat” is a protein that has multiple alpha helices as part of its secondary structure, and many, but not necessarily all, of these alpha helices are arranged in such a manner that they span a plasma membrane (“transmembrane helices”) when the protein occurs in its natural environment.
- a “structural inverted repeat” indicates that there is an “approximate axis of symmetry” dividing the transmembrane helices into two set of helices. Each set of transmembrane helices on either half of the approximate axis of symmetry is considered a “subunit” for the purposes of the present invention.
- first subunit the alpha-carbon atoms of the residues making up the first set (“first subunit”) of transmembrane helices can be superimposed on the alpha-carbon atoms of the residues making up the second set (“second subunit”) of transmembrane helices by a rotation of about 180 degrees.
- the ability to superimpose the alpha-carbon atoms of the transmembrane helices between the two subunits provides the “structural repeat” aspect of the transporter protein.
- the approximate 180 degree rotation of one subunit relative to the other subunit such that the alpha-carbon atoms between the two subunits can be superimposed on one another provides “inverted” aspect of the inverted structural repeat.
- the first and second subunits that make up the structural inverted repeat need not be rotated exactly 180 degrees relative to one another.
- the second subunit is rotated about 176.5 degrees relative to the first subunit.
- the Mhp1 transporter indolyl-methyl- and benzyl-hydantions
- NCS1 nucleobase-cation-symport
- the liquifaciens is a transporter protein with a structural inverted repeat in which the second subunit is rotated about 168 degree relative to the first subunit. See Weyand, S. et al., Science, 322:709-713 (2008), which is incorporated by reference.
- the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 90 degrees to about 270 degrees.
- the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 100 degrees to about 260 degrees.
- the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 110 degrees to about 250 degrees. In another specific embodiment, the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 120 degrees to about 240 degrees. In another specific embodiment, the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 130 degrees to about 230 degrees.
- the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 140 degrees to about 220 degrees. In another specific embodiment, the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 150 degrees to about 210 degrees. In another specific embodiment, the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 160 degrees to about 200 degrees. In another specific embodiment, the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 170 degrees to about 190 degrees.
- Proteins said to have a structural inverted repeat can have any number transmembrane helices in each subunit, and the number of transmembrane helices need not be the same in the two subunits.
- the first and second subunits can each independently have 2, 3, 4, 5, 6, 7, 8, 9 or even 10 transmembrane helices.
- one subunit may have 5 transmembrane helices in its first subunit and 5 transmembrane helices in its second subunit.
- one subunit may have 5 transmembrane helices in its first subunit and 6 transmembrane helices in its second subunit.
- the number of transmembrane helices that are rotated in the second subunit, relative to the first subunit, will define the number of transmembrane helices in the subunits.
- the Na + /Cl ⁇ transporter protein in Aquifex aeolicus has 5 transmembrane helices in the first subunit and 5 transmembrane helices in the second subunit.
- the transporter porter protein may have additional secondary structures that are not part of the structural inverted repeat motif.
- the transporter proteins may have additional alpha helical structures, and these alpha helices may even be transmembrane helices. But these additional transmembrane helices would not necessarily be part of the structural inverted repeat motif.
- the Na + /Cl ⁇ transporter protein in Aquifex aeolicus has 12 total transmembrane alpha helices, numbered 1-12.
- alpha helices numbered 1-5 in the Na + /Cl ⁇ transporter protein in Aquifex aeolicus are considered to make up the first subunit of the structural inverted repeat motif, and helices 6-10 make up the second subunit of the structural inverted repeat motif.
- Alpha helices 11 and 12 in the Na + /Cl ⁇ transporter protein in Aquifex aeolicus span the membrane but do not belong to either the first or second subunit of the structural inverted repeat motif.
- transmembrane helices in the subunits need not be continuous such that there may be a break helical structure along the transmembrane helix.
- the transmembrane helices numbered 1 and 6 of the Na + /Cl ⁇ transporter protein in Aquifex aeolicus are both discontinuous transmembrane helices, yet each helix would be considered a single transmembrane helix that is part of their respective subunit.
- the transmembrane helices are connected to one another by a chain of amino acids. Often, two helices are connected to one another with a chain of amino acids that do not possess a secondary structure.
- the chain of amino acids connecting two transmembrane helices may have a secondary structure, e.g., an alpha helix, occurring along its length.
- one subunit of the structural inverted repeat may have a transmembrane helix connected to a second transmembrane helix through a chain of amino acids (“polypeptide loop”).
- the transmembrane helices connected by the polypeptide loop may or may not be physically located beside each other in the protein structure.
- the helices are said to be “connected to one another” based upon the overall amino acid sequence of the protein. For example, amino acid residues 41-70 of the linear amino acid sequence may form one transmembrane helix and amino acid residues 88-124 may form another transmembrane helix.
- the two helices would be connected to one another through the polypeptide chain on amino acid residues 71-87, although the two transmembrane helices are necessarily physically beside one another in the overall three-dimensional structure of the protein.
- amino acid residues 71-87 of the polypeptide loop connecting the two transmembrane helices may themselves form an alpha helical structure, but this alpha-helical structure would not be considered one of the transmembrane helices of that form the structural inverted repeat structure if the alpha helix of the polypeptide loop did not span the membrane when normally expressed.
- polypeptide loop connecting the two helices may be external to the plasma membrane (“extracellular”) or in may be internal to the plasma membrane (“intracellular”).
- extracellular may be external to the plasma membrane
- intracellular may be internal to the plasma membrane
- any reference herein to any portion of the transporter protein relative to a cell or plasma membrane is for illustrative purposes only.
- the proteins may be synthesized and formed ex vivo and subsequently folded into a three dimensional structure that resembles or is the same as the properly folded protein if it were to be expressed natively in a cell.
- FIG. 1A shows a cartoon diagram of an ammonium transporter protein comprising a structural inverted repeat motif.
- Each subunit of the structural inverted repeat in the ammonium transporter comprises 5 transmembrane helices, with helices 1-5 belonging to the first subunit and helices 6-10 belonging to the second subunit.
- Each helix is connected to the “next” helix in the amino acid chain through a polypeptide loop as defined herein.
- Transmembrane helix 5 is joined to transmembrane helix 6 through a polypeptide loop, and helix 5 and helix 6 belong to two different subunits of the structural inverted repeat motif.
- the polypeptide loop that connects the first and second subunits of the structural inverted repeat is the loop that joins transmembrane helices 5 and 6 to one another.
- the polypeptide loop that connects the first and second subunits of the structural inverted repeat can be extracellular or it can be intracellular.
- transporter proteins include but are not limited to members of the APC (amino acid, polyamine and organocation) superfamily of transporter proteins that have a structural inverted repeat, the AMT/MEP/RH superfamily of transporters, the Neurotransmitter:Sodium Symporter (NSS) superfamily of transporters, such as but not limited to the LeuT, Tyt1 and TnaT transporters, the Betaine/Carnitine/Choline superfamily of transporters (BCCT), such as but not limited to BetP, the The Amino Acid/Auxin Permease (AAAP) superfamily of transports, the Solute:Sodium Symporter (SSS) superfamily of transporters, the Alanine or Glycine:Cation Symporter (AGCS) superfamily of transporters, the Cation-Chloride Cotransporter (CCC) superfamily of transporters, the Nucleobase:Cation Symporter-1 (NCS1) superfamily of transporters and the Hydroxy/Aromatic Amino Acid Permease
- Determining if a transporter protein contains a structural inverted repeat can be performed by resolving the crystal structure of the proteins through for example, X-ray crystallography and NMR-spectroscopy.
- the topology of protein transporters can efficiently and reliably be predicted using programs such as TMHMM (available on the internet at www.cbs.dtu.dk/services/TMHMM/) the current version of which is “Server 2.0.”
- Determining if a transporter protein contains repeated elements can be performed by alignment of the protein sequence to itself, or by algorithms such as MotifScan, Radar (de novo repeat detection in protein sequences), REP (searches a protein sequence for repeats), REPRO (de novo repeat detection in protein sequences, T-REKS (de novo detection and alignment of repeats in protein sequences), TRUST (de novo repeat detection in protein sequences), XSTREAM (de novo tandem repeat detection and architecture modeling in protein sequences.
- the fluorophore i.e., the fluorescent reporter
- the fluorescent reporter can be covalently bound to any one of the polypeptide loops that connect two transmembrane helices.
- the fluorescent reporter is bound to the polypeptide loop that connects the first and second subunits of the structural inverted repeat of the transporter peptide.
- the fluorescent reporter is bound to the polypeptide loop that does not connect the first and second subunits of the structural inverted repeat of the transporter peptide.
- fluorophore is used as it is in the art and refers to a molecule that emits light upon the absorption of energy.
- a fluorophore of the present invention is any chemical moiety that exhibits an absorption maximum at or beyond 280 nm, and when covalently attached to a protein or other reagent retains its spectral properties.
- Fluorophores of the present invention include, without limitation; a pyrene (including any of the corresponding derivative compounds disclosed in U.S. Pat. No.
- an anthracene a naphthalene, an acridine, a stilbene, an indole or benzindole, an oxazole or benzoxazole, a thiazole or benzothiazole, a 4-amino-7-nitrobenz-2-oxa-1,3-diazole (NBD), a cyanine, a carbocyanine (including any corresponding compounds in U.S. Pat. Nos.
- oxazines include resorufins (including any corresponding compounds disclosed in U.S. Pat. No.
- the fluorophore is optionally a fluorescein, a rhodol (including any corresponding compounds disclosed in U.S. Pat. Nos. 5,227,487 and 5,442,045, incorporated by reference), or a rhodamine (including any corresponding compounds in U.S. Pat. Nos. 5,798,276; 5,846,737 and 6,562,632, incorporated by reference).
- fluorescein includes benzo- or dibenzofluoresceins, seminaphthofluoresceins, or naphthofluoresceins.
- rhodol includes seminaphthorhodafluors (including any corresponding compounds disclosed in U.S. Pat. No. 4,945,171, incorporated by reference).
- the fluorophore is a xanthene that is bound via a linkage that is a single covalent bond at the 9-position of the xanthene.
- Xanthenes also include derivatives of 3H-xanthen-6-ol-3-one attached at the 9-position, derivatives of 6-amino-3H-xanthen-3-one attached at the 9-position, or derivatives of 6-amino-3H-xanthen-3-imine attached at the 9-position.
- Fluorophores for use in the present invention include, but are not limited to, xanthene (rhodol, rhodamine, fluorescein and derivatives thereof) coumarin, cyanine, pyrene, oxazine and borapolyazaindacene.
- xanthenes are sulfonated xanthenes, fluorinated xanthenes, sulfonated coumarins, fluorinated coumarins and sulfonated cyanines.
- the choice of the fluorophore will determine the absorption and fluorescence emission properties of the transporter protein or other labeling reagent complex. Physical properties of a fluorophore label include spectral characteristics (absorption, emission and stokes shift), fluorescence intensity, lifetime, polarization and photo-bleaching rate all of which can be used to distinguish one fluorophore from another.
- the fluorophore contains one or more aromatic or heteroaromatic rings, that are optionally substituted one or more times by a variety of substituents, including without limitation, halogen, nitro, cyano, alkyl, perfluoroalkyl, alkoxy, alkenyl, alkynyl, cycloalkyl, arylalkyl, acyl, aryl or heteroaryl ring system, benzo, or other substituents typically present on fluorophores known in the art.
- substituents including without limitation, halogen, nitro, cyano, alkyl, perfluoroalkyl, alkoxy, alkenyl, alkynyl, cycloalkyl, arylalkyl, acyl, aryl or heteroaryl ring system, benzo, or other substituents typically present on fluorophores known in the art.
- Non-limiting examples of fluorophores useful as reporter groups in this invention include acrylodan, fluorescein, coumarins, rhodamines, 5-TMRIA (tetramethylrhodamine-5-iodoacetamide), Quantum RedTM (R-phycoerythrin coupled to (9-(2(or4)-(N-(2-maleimdylethyl)-sulfonamidyl)-4(or 2)-sulfophenyl)-2,3,6,7,12,13,16,17-octahydro-(1H,5H,11H,15H-xantheno(2,3,4-ij:5,6,7-i′j′)diquinolizin-18-ium salt), Texas RedTM (9-(2(or4)-(N-(2-maleimdylethyl)-sulfonamidyl)-4(or 2)-sulfophenyl)-2,3,6,7,12,13,16,17-octa
- fluorophores that exhibit environmentally-sensitive fluorescence properties include squaraines, coumarins, aza-coumarins, IAZCO, benzodiaxoazoles, and dyes derived from Nile Red such as INR. These fluorophores are described in pending U.S. Pregrant Publication No. 2006/0280652, which is incorporated herein by reference. Additional fluorophores are described in U.S. application Ser. No. 12/250,953, Filed 14 Oct. 2008 and Ser. No. 12/124,553, Filed 21 May 2008, respectively, both of which are incorporated by reference.
- the fluorescent reporter is a fluorescent protein.
- the invention can also provide for fusion proteins comprising the transporter protein and a fluorescent protein as the reporter.
- fluorescent protein is readily understood in the art and simply means a protein that emits fluorescence at a detectable wavelength.
- Fluorescent proteins as well as enhanced versions thereof are well known in the art and are commercially available.
- “enhancement” indicates optimization of emission by increasing the protein's brightness, creating proteins that have faster chromophore maturation and/or alteration of dimerization properties. These enhancements can be achieved through engineering mutations into the fluorescent proteins.
- the reporter group may be attached to the transporter protein by any conventional means known in the art.
- the reporter group may be attached via amines or carboxyl residues on the protein.
- N-hydroxy-succinimide (NHS) esters are used to crosslink the reporter group to primary amino groups on the transporter protein.
- cysteine or other amino acid groups may be engineered into the transporter protein to provide sites of attachment for the reporter group.
- Any thiol-reactive group known in the art may be used for attaching reporter groups such as fluorophores to a native, engineered, or mutated protein's cysteine.
- acrylates, an iodoacetamide, bromoacetamide, or maleimide are well known thiol-reactive moieties that may be used for this purpose.
- the fluorescent reporters may be particularly useful for creating tandem dye labeled labeling reagents.
- the measurable signal of the fusion protein is actually a transfer of excitation energy (resonance energy transfer) from a donor molecule (e.g., a first fluorescent protein) to an acceptor molecule (e.g., a second fluorescent protein).
- the resonance energy transfer is in the form of fluorescence resonance energy transfer (FRET).
- FRET fluorescence resonance energy transfer
- donor and “acceptor,” when used in relation to FRET, are readily understood in the art. Namely, a donor is the molecule that will absorb a photon of light and subsequently initiate energy transfer to the acceptor molecule.
- the acceptor molecule is the molecule that receives the energy transfer initiated by the donor and, in turn, emits a photon of light.
- the Förster distance which can be determined experimentally by readily available techniques in the art, is the distance at which FRET is half of the maximum possible FRET value for a given donor/acceptor pair.
- a particularly useful combination is the phycobiliproteins disclosed in U.S. Pat. Nos. 4,520,110; 4,859,582; 5,055,556, incorporated by reference, and the sulforhodamine fluorophores disclosed in U.S. Pat. No. 5,798,276, or the sulfonated cyanine fluorophores disclosed in U.S. Pat. Nos. 6,977,305 and 6,974,873; or the sulfonated xanthene derivatives disclosed in U.S. Pat. No. 6,130,101, incorporated by reference and those combinations disclosed in U.S. Pat. No. 4,542,104, incorporated by reference.
- a “spectral change” in a fluorophore reporter group may be monitored to detect analyte movement through the transporter protein.
- the “spectral change” that occurs upon analyte movement can be, but is not limited to, a change in fluorescence lifetime, fluorescence intensity, fluorescence polarization, and spectral shifts of fluorescence emission.
- Such spectral changes may result from changes in the local environment of the fluorophore, such as those resulting from changes in protein conformation.
- Environmentally-sensitive dyes such as acrylodan and IANBD are particularly useful in this respect.
- Other spectral changes may result from interactions with the analyte itself or from interactions with a second reporter group, for example when FRET (fluorescence resonance energy transfer) is used to monitor changes in distance between two fluorophores.
- FRET fluorescence resonance energy transfer
- the term “detector” as used herein refers to any component, portion thereof, or system of components that can detect the detectable signal generated by the reporter group.
- the detector can detect at least one property of light including, but not limited to, a charged coupled device (CCD), back-side thin-cooled CCD, front-side illuminated CCD, a CCD array, a photodiode, a photodiode array, a photo-multiplier tube (PMT), a PMT array, complimentary metal-oxide semiconductor (CMOS) sensors, CMOS arrays, a charge-injection device (CID), CID arrays, etc.
- the detector can be adapted to relay information to a data collection device for storage, correlation, and/or manipulation of data, for example, a computer, or other signal processing system.
- the detected signal from the fluorescent reporter may simply be the measured signal, e.g., fluorescence, without any additional measurements or manipulations.
- the signal may be expressed as a difference, percentage or ratio of the measured value to a different measured value such as, but not limited to, a standard, baseline or response to another analyte.
- the signal when compared to another value may be negative, indicating a decrease in the amount of measured analyte(s).
- the quantities may also be expressed as a difference or ratio of the analyte(s) to itself, measured at a different point in time.
- the quantities of analytes may be determined directly from a generated signal, or the generated signal may be used in an algorithm, with the algorithm designed to correlate the value of the generated signals to the quantity of analyte(s) in the sample.
- the polypeptide loop connecting two transmembrane helices is linked to a fluorescent reporter without a linker peptide.
- the polypeptide loop connecting two transmembrane helices is linked to the fluorescent reporter with at least one linker peptide, for example one or two peptide linkers.
- a peptide linker is used to mean a polypeptide typically ranging from about 1 to about 120 amino acids in length that is designed to facilitate the functional connection of two distinct entities, such as the transporter protein and a fluorescent reporter. To be clear, a single amino acid can be considered a peptide linker peptide for the purposes of the present invention.
- the peptide linker comprises or in the alternative consists of amino acids numbering 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111,
- the peptide linkers used in the fusion proteins of the present invention may comprise or in the alternative consist of amino acids numbering more than 120 residues in length.
- the length of the peptide linker(s), if present, are not necessarily critical to the function of the engineered protein, provided that the peptide linker permits a functional connection between the transporter protein and any fusion partner thereof, e.g., a fluorescent protein.
- linker peptide indicates a connection that facilitates folding of the polypeptides into a three dimensional structure that allows the linked fusion polypeptide to mimic some or all of the functional aspects or biological activities of the protein or portion thereof.
- the linker may be used to create a single-chain fusion of a three dimensional structure that mimics the structure of the transporter and the fluorescent reporter.
- the term functional connection also indicates that the linked portions possess at least a minimal degree of stability, flexibility and/or tension that would be required for the transporter protein to function as desired.
- each of the linker peptides comprises or consists of the same amino acid sequence. In another embodiment, when more than one linker peptide is used, each of the amino acid sequences of the peptide linkers are different from one another.
- the transporter proteins that contain a structural inverted repeat can be from any plant source and the invention is not limited by the source of the transporter protein, i.e., the invention is not limited to the plant species from which transporter proteins that contain a structural inverted repeat normally occurs or is obtained.
- sources from which the transporter proteins that contain a structural inverted repeat may be derived include but are not limited to monocotyledonous plants that include, for example, Lolium, Zea, Triticum, Sorghum, Triticale, Saccharum, Bromus, Oryzae, Avena, Hordeum, Secale and Setaria .
- transporter proteins that contain a structural inverted repeat include but are not limited to maize, wheat, barley, rye, rice, oat, sorghum and millet.
- Additional sources from which the transporter proteins that contain a structural inverted repeat may be derived include but are not limited to dicotyledenous plants that include but are not limited to Fabaceae, Solanum , Brassicaceae, especially potatoes, beans, cabbages, forest trees, roses, clematis , oilseed rape, sunflower, chrysanthemum , poinsettia, arabidopsis , tobacco, tomato, and antirrhinum (snapdragon), soybean, canola, sunflower and even basal land plant species, (the moss Physcomitrella patens ). Additional sources also include gymnosperms.
- orthologous genes are genes from different species that perform the same or similar function and are believed to descend from a common ancestral gene. Proteins from orthologous genes, in turn, are the proteins encoded by the orthologs. As such the term “ortholog” may be to refer to a gene or a protein. Often, proteins encoded by orthologous genes have similar or nearly identical amino acid sequence identities to one another, and the orthologous genes themselves have similar nucleotide sequences, particularly when the redundancy of the genetic code is taken into account. Thus, by way of example, the ortholog of the Amt1 transporter would be an Amt1 transporter in another species of plant, regardless of the amino acid sequence of the two proteins.
- the invention provides deletion variants wherein one or more amino acid residues in the transporter proteins that contain a structural inverted repeat or one or more fluorescent protein(s) are removed.
- Deletions can be effected at one or both termini of the transporter protein or one or more fluorescent protein(s), or with removal of one or more non-terminal amino acid residues of the transporter protein or one or more fluorescent protein(s).
- the proteins of the present invention may also comprise substitution variants of a transporter protein that contain a structural inverted repeat of the fluorescent proteins used herein.
- substitution variants include those polypeptides wherein one or more amino acid residues of the transporter proteins that contain a structural inverted repeat are removed and replaced with alternative residues.
- the substitutions are conservative in nature. Conservative substitutions for this purpose may be defined as set out in the tables below. Amino acids can be classified according to physical properties and contribution to secondary and tertiary protein structure. A conservative substitution is recognized in the art as a substitution of one amino acid for another amino acid that has similar properties. Exemplary conservative substitutions are set out in below.
- conservative amino acids can be grouped as described in Lehninger (1975) Biochemistry, Second Edition; Worth Publishers, pp. 71-77, as set forth below.
- the invention therefore provides isolated peptides, with the peptides comprising an amino acid sequence at least about 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the amino acid sequences disclosed herein.
- the invention provides for polypeptides comprising or consist of amino acid sequences that are 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the amino acid sequence of SEQ ID NO:31.
- a polypeptide having an amino acid sequence at least, for example, about 95% “identical” to a reference an amino acid sequence is understood to mean that the amino acid sequence of the polypeptide is identical to the reference sequence except that the amino acid sequence may include up to about five modifications per each 100 amino acids of the reference amino acid sequence.
- up to about 5% of the amino acid residues of the reference sequence may be deleted or substituted with another amino acid or a number of amino acids up to about 5% of the total amino acids in the reference sequence may be inserted into the reference sequence.
- These modifications of the reference sequence may occur at the N-terminus or C-terminus positions of the reference amino acid sequence or anywhere between those terminal positions, interspersed either individually among amino acids in the reference sequence or in one or more contiguous groups within the reference sequence.
- identity is a measure of the identity of nucleotide sequences or amino acid sequences compared to a reference nucleotide or amino acid sequence. In general, the sequences are aligned so that the highest order match is obtained. “Identity” per se has an art-recognized meaning and can be calculated using well known techniques. While there are several methods to measure identity between two polynucleotide or polypeptide sequences, the term “identity” is well known to skilled artisans (Carillo (1988) J. Applied Math. 48, 1073).
- Examples of computer program methods to determine identity and similarity between two sequences include, but are not limited to, GCG program package (Devereux (1984) Nucleic Acids Research 12, 387), BLASTP, ExPASy, BLASTN, FASTA (Atschul (1990) J. Mol. Biol. 215, 403) and FASTDB. Examples of methods to determine identity and similarity are discussed in Michaels (2011) Current Protocols in Protein Science, Vol. 1, John Wiley & Sons.
- the algorithm used to determine identity between two or more polypeptides is BLASTP.
- the algorithm used to determine identity between two or more polypeptides is FASTDB, which is based upon the algorithm of Brutlag (1990) Comp. App. Biosci. 6, 237-245).
- FASTDB sequence alignment the query and reference sequences are amino sequences. The result of sequence alignment is in percent identity.
- the reference sequence is shorter or longer than the query sequence because of N-terminus or C-terminus additions or deletions, but not because of internal additions or deletions, a manual correction can be made, because the FASTDB program does not account for N-terminus and C-terminus truncations or additions of the reference sequence when calculating percent identity.
- the percent identity is corrected by calculating the number of residues of the query sequence that are N- and C-terminus to the reference sequence that are not matched/aligned, as a percent of the total bases of the query sequence.
- the results of the FASTDB sequence alignment determine matching/alignment.
- the alignment percentage is then subtracted from the percent identity, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score.
- This corrected score can be used for the purposes of determining how alignments “correspond” to each other, as well as percentage identity. Residues of the reference sequence that extend past the N- or C-termini of the query sequence may be considered for the purposes of manually adjusting the percent identity score. That is, residues that are not matched/aligned with the N- or C-termini of the comparison sequence may be counted when manually adjusting the percent identity score or alignment numbering.
- a 90 amino acid residue query sequence is aligned with a 100 residue reference sequence to determine percent identity.
- the deletion occurs at the N-terminus of the query sequence and therefore, the FASTDB alignment does not show a match/alignment of the first 10 residues at the N-terminus.
- the 10 unpaired residues represent 10% of the reference sequence (number of residues at the N- and C-termini not matched/total number of residues in the reference sequence) so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining 90 residues were perfectly matched (100% alignment) the final percent identity would be 90% (100% alignment-10% unmatched overhang).
- a 90 residue query sequence is compared with a 100 reference sequence, except that the deletions are internal deletions.
- the percent identity calculated by FASTDB is not manually corrected, since there are no residues at the N- or C-termini of the subject sequence that are not matched/aligned with the query.
- a 110 amino acid query sequence is aligned with a 100 residue reference sequence to determine percent identity. The addition in the query occurs at the N-terminus of the query sequence and therefore, the FASTDB alignment may not show a match/alignment of the first 10 residues at the N-terminus. If the remaining 100 amino acid residues of the query sequence have 95% identity to the entire length of the reference sequence, the N-terminal addition of the query would be ignored and the percent identity of the query to the reference sequence would be 95%.
- the terms “correspond(s) to” and “corresponding to,” as they relate to sequence alignment, are intended to mean enumerated positions within a reference protein, e.g., wild-type Amt1, and those positions in a modified Amt1 that align with the positions on the reference protein.
- a reference protein e.g., wild-type Amt1
- the amino acids in the subject sequence that “correspond to” certain enumerated positions of the reference sequence are those that align with these positions of the reference sequence, but are not necessarily in these exact numerical positions of the reference sequence.
- GFP is an extraordinarily stable protein, and it is protected from bulk medium by a soda can-like cage formed by a ⁇ -barrel that consists of 11 ⁇ -strands.
- the chromophore interacts with residues inside the cage for example by hydrogen bonding (H148 interaction with the hydroxyl of Y66, R96 with the imidazolidone, Y145 stabilizes the chromophore by an edge-face interaction with Y66, and E222 with the hydroxyl of S65).
- GFP exists in two ground-state forms, called A and B, and these states interconvert depending on the environment such as local pH.
- the two states have different spectral properties, i.e., have two visible absorption bands at 478 and 398 nm (508 nm emission). In the ground state, interconversion of the states is relatively slow, but occurs much faster in the excited state. Further analysis indicates that the interconversion involves proton transfer followed by structural rearrangements.
- mutation at S65 or E222 result in a loss of the absorption maximum at 398 nm, supporting a role of protonation of the chromophore or adjacent residues in determining the ‘state’ of the chromophores and thus its absorption properties.
- GFP protein is enhanced GFP (“eGFP”) from eGFP.
- a calmodulin sensor was develop with a single fluorophore to detect calcium.
- the same group also constructed a functional zinc sensor by fusing the circular permuted GFP to a zinc finger domain.
- CpGFP sensor can be used for monitoring local calcium levels, e.g. GCaMP2 has been targeted to plasma membrane subdomains by fusion to Na+ pump isoforms (Lee M Y, Song H, Nakai J, Ohkura M, Kotlikoff M I, et al. 2006. Local subplasma membrane Ca2+ signals detected by a tethered Ca2+ sensor. Proc. Natl. Acad. Sci. USA 103:13232-7).
- Circular permutation of EGFP and deletion of several residues resulted in a significant opening of ⁇ 5 ⁇ 10 ⁇ in the barrel providing bulk solvent access.
- the tyrosine side chain at the tip of the chromophore pointed towards the opening.
- a dimer was discovered in which the M13 calmodulin-binding peptides interact with the calcium-loaded calmodulin of the other subunit in the dimer, respectively.
- Biochemical characterization showed that in the absence of calcium GCaMP2 predominantly existed in the monomeric form while in the presence of calcium, both monomeric and dimeric were observed. While in the dimer, channels provide solvent access, the opening is occluded in the monomeric conformation by the linkers of the fusion with calmodulin. As discussed above the main differences between open and closed forms will affect solvent access and thus affect protonation of the phenolate oxygen of the chromophores.
- red cpTomato variants are available expanding the possibility to develop such sensors and for multiplexing analysis, e.g., for the parallel use of affinity mutants in a single cells or for measuring multiple analytes in parallel (Li Y, Sierra A M, Ai H W, Campbell R E. 2008. Identification of sites within a monomeric red fluorescent protein that tolerate peptide insertion and testing of corresponding circular permutations. Photochem. Photobiol. 84:111-9). Circular permuted fluorescent reporters can thus be used in the methods of the present invention.
- SEQ ID NO:28 is an engineered ammonium transporter protein fused with a cpGFP.
- the portion with the sequence below is the cpGFP portion of the engineered protein.
- the invention also provides isolated nucleic acids, with the nucleic acids comprising polynucleotide sequence at least about 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the polynucleotide sequences disclosed herein.
- nucleic acid molecule is at least 80%, 85%, 90%, 92%, 95%, 96%, 97%, 98% or 99% identical to a disclosed nucleic acid can be determined conventionally using known computer programs a discussed herein. For example, percent identity can be determined using the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 53711. Bestfit uses the local homology algorithm of Smith and Waterman, Advances in Applied Mathematics 2: 482-489 (1981), to find the best segment of homology between two sequences.
- Bestfit program Wiconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 53711. Bestfit uses the local homology algorithm of Smith and Waterman, Advances in Applied Mathematics 2: 482-489 (1981), to find the best segment of homology between two sequences.
- the parameters are set, of course, such that the percentage of identity is calculated over the full length of the reference nucleotide sequence and that gaps in homology of up to 5% of the total number of nucleotides in the reference sequence are allowed.
- Methods for correcting percent identity of the of polynucleotides are the same as those described and disclosed herein with respect to polypeptides.
- the engineered proteins of the present invention may or may not contain additional elements that, for example, may include but are not limited to regions to facilitate purification.
- “histidine tags” (“his tags”) or “lysine tags” may be appended to the engineered protein.
- histidine tags include, but are not limited to hexaH, heptaH and hexaHN.
- lysine tags include, but are not limited to pentaL, heptaL and FLAG.
- Such regions may be removed prior to final preparation of the engineered protein.
- Other examples of a fusion partner for the engineered proteins of the present invention include, but are not limited to, glutathione S-transferase (GST) and alkaline phosphatase (AP).
- peptide moieties to engineered proteins, whether to engender secretion or excretion, to improve stability and to facilitate purification or translocation, among others, is a familiar and routine technique in the art and may include modifying amino acids at the terminus to accommodate the tags.
- the N-terminus amino acid may be modified to, for example, arginine and/or serine to accommodate a tag.
- the amino acid residues of the C-terminus may also be modified to accommodate tags.
- One particularly useful fusion protein comprises a heterologous region from immunoglobulin that can be used solubilize proteins.
- fusion proteins include but are not limited to, fusions with secretion signals and other heterologous functional regions.
- a region of additional amino acids, particularly charged amino acids may be added to the N-terminus of the engineered protein to improve stability and persistence in the host cell, during purification or during subsequent handling and storage.
- the engineered proteins of the current invention may be recovered and purified from recombinant cell cultures by well-known methods including, but not limited to, ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography, e.g., immobilized metal affinity chromatography (IMAC), hydroxylapatite chromatography and lectin chromatography. High performance liquid chromatography (“HPLC”) may also be employed for purification.
- Well-known techniques for refolding protein may be employed to regenerate active conformation when the fusion protein is denatured during isolation and/or purification.
- Engineered proteins of the present invention include, but are not limited to, products of chemical synthetic procedures and products produced by recombinant techniques from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant, insect and mammalian cells. Depending upon the host employed in a recombinant production procedure, the engineered proteins of the present invention may be glycosylated or may be non-glycosylated. In addition, engineered proteins of the invention may also include an initial modified methionine residue, in some cases as a result of host-mediated processes.
- the present invention also provides for nucleic acids encoding some of the engineered proteins of the present invention.
- the invention also relates to isolated nucleic acids and to constructs comprising these nucleic acids.
- the nucleic acids of the invention can be DNA or RNA, for example, mRNA.
- the nucleic acid molecules can be double-stranded or single-stranded; single stranded RNA or DNA can be the coding, or sense, strand or the non-coding, or antisense, strand.
- the nucleic acids may encode any engineered protein of the invention.
- the nucleic acids of the invention include polynucleotide sequences that encode the engineered proteins that contain or comprise glutathione-S-transferase (GST) fusion protein, poly-histidine (e.g., His 6 ), poly-HN, poly-lysine, etc.
- GST glutathione-S-transferase
- the nucleotide sequence of the isolated nucleic acid can include additional non-coding sequences such as non-coding 3′ and 5′ sequences (including regulatory sequences, for example).
- nucleic acids of the present invention include bit are not limited to those sequences exemplified in the nucleic acid sequences of SEQ ID NO:29 and SEQ ID NO:30, where the nucleic acid sequence of SEQ ID NO:29 encodes the amino acid sequence of SEQ ID NO:28.
- the present invention also comprises vectors containing the nucleic acids encoding the engineered proteins of the present invention.
- a “vector” may be any of a number of nucleic acids into which a desired sequence may be inserted by restriction and ligation for transport between different genetic environments or for expression in a host cell.
- Vectors are typically composed of DNA although RNA vectors are also available.
- Vectors include, but are not limited to, plasmids and phagemids.
- a cloning vector is one which is able to replicate in a host cell, and which is further characterized by one or more endonuclease restriction sites at which the vector may be cut in a determinable fashion and into which a desired DNA sequence may be ligated such that the new recombinant vector retains its ability to replicate in the host cell.
- An expression vector is one into which a desired DNA sequence may be inserted by restriction and ligation such that it is operably joined to regulatory sequences and may be expressed as an RNA transcript.
- Vectors may further contain one or more marker sequences suitable for use in the identification and selection of cells which have been transformed or transfected with the vector.
- Markers include, for example, genes encoding proteins which increase or decrease either resistance or sensitivity to antibiotics or other compounds, genes which encode enzymes whose activities are detectable by standard assays known in the art (e.g., ⁇ -galactosidase or alkaline phosphatase), and genes which visibly affect the phenotype of transformed or transfected cells, hosts, colonies or plaques.
- vectors include but are not limited to those capable of autonomous replication and expression of the structural gene products present in the DNA segments to which they are operably joined.
- the vectors to be used are those for expression of polynucleotides and proteins of the present invention.
- such vectors comprise cis-acting control regions effective for expression in a host operatively linked to the polynucleotide to be expressed.
- Appropriate trans-acting factors are supplied by the host, supplied by a complementing vector or supplied by the vector itself upon introduction into the host.
- vectors can be used to express the proteins of the invention.
- Such vectors include chromosomal, episomal and virus-derived vectors, e.g., vectors derived from bacterial plasmids, from bacteriophage, from yeast episomes, from yeast chromosomal elements, from viruses such as adeno-associated virus, lentivirus, baculoviruses, papova viruses, such as SV40, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, such as those derived from plasmid and bacteriophage genetic elements, such as cosmids and phagemids. All may be used for expression in accordance with this aspect of the present invention.
- any vector suitable to maintain, propagate or the fusion proteins in a host may be used for expression in this regard.
- the DNA sequence in the expression vector is operatively linked to appropriate expression control sequence(s) including, for instance, a promoter to direct mRNA transcription.
- promoters include, but are not limited to, the phage lambda PL promoter, the E. coli lac, trp and tac promoters, HIV promoters, the SV40 early and late promoters and promoters of retroviral LTRs, to name just a few of the well-known promoters.
- expression constructs will contain sites for transcription, initiation and termination and, in the transcribed region, a ribosome binding site for translation.
- the coding portion of the mature transcripts expressed by the constructs will include a translation initiating AUG at the beginning and a termination codon (UAA, UGA or UAG) appropriately positioned at the end of the polypeptide to be translated.
- constructs may contain control regions that regulate, as well as engender expression. Generally, such regions will operate by controlling transcription, such as repressor binding sites and enhancers, among others.
- Vectors for propagation and expression generally will include selectable markers. Such markers also may be suitable for amplification or the vectors may contain additional markers for this purpose.
- the expression vectors may contain one or more selectable marker genes to provide a phenotypic trait for selection of transformed host cells.
- Preferred markers include dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, and tetracycline, kanamycin or ampicillin resistance genes for culturing E. coli and other bacteria.
- yeast S. cerevisiae examples include pDRFLIP, s, pDR196, pYepSecl (Baldari (1987) EMBO J. 6, 229-234), pMFa (Kurjan (1982) Cell 30, 933-943), pJRY88 (Schultz (1987) Gene 54, 115-123), pYES2 (Invitrogen) and picZ (Invitrogen).
- the engineered proteins can be expressed in insect cells using baculovirus expression vectors.
- Baculovirus vectors available for expression of proteins in cultured insect cells include the pAc series (Smith (1983) Mol. Cell. Biol. 3, 2156 2165) and the pVL series (Lucklow (1989) Virology 170, 31-39).
- nucleic acid molecules of the invention can be “isolated.”
- an “isolated” nucleic acid molecule or nucleotide sequence is intended to mean a nucleic acid molecule or nucleotide sequence that is not flanked by nucleotide sequences normally flanking the gene or nucleotide sequence (as in genomic sequences) and/or has been completely or partially removed from its native environment (e.g., a cell, tissue).
- nucleic acid molecules that have been removed or purified from cells are considered isolated.
- the isolated material will form part of a composition (for example, a crude extract containing other substances), buffer system or reagent mix.
- an isolated nucleic acid molecule or nucleotide sequence can includes a nucleic acid molecule or nucleotide sequence which is synthesized chemically, using recombinant DNA technology or using any other suitable method.
- a nucleic acid contained in a vector would be included in the definition of “isolated” as used herein.
- isolated nucleotide sequences include recombinant nucleic acid molecules (e.g., DNA, RNA) in heterologous organisms, as well as partially or substantially purified nucleic acids in solution.
- nucleic acid molecules of the present invention may be isolated or purified. Both in vivo and in vitro RNA transcripts of a DNA molecule of the present invention are also encompassed by “isolated” nucleotide sequences.
- the invention also provides nucleic acid molecules that hybridize under high stringency hybridization conditions, such as for selective hybridization, to the nucleotide sequences described herein (e.g., nucleic acid molecules which specifically hybridize to a nucleotide sequence encoding engineered proteins described herein).
- Hybridization probes include synthetic oligonucleotides which bind in a base-specific manner to a complementary strand of nucleic acid.
- nucleic acid molecules can be detected and/or isolated by specific hybridization e.g., under high stringency conditions.
- “Stringency conditions” for hybridization is a term of art that refers to the incubation and wash conditions, e.g., conditions of temperature and buffer concentration, which permit hybridization of a particular nucleic acid to a second nucleic acid; the first nucleic acid may be perfectly complementary, i.e., 100%, to the second, or the first and second may share some degree of complementarity, which is less than perfect, e.g., 60%, 75%, 85%, 95% or more. For example, certain high stringency conditions can be used which distinguish perfectly complementary nucleic acids from those of less complementarity.
- High stringency conditions “moderate stringency conditions” and “low stringency conditions” for nucleic acid hybridizations are explained in Current Protocols in Molecular Biology, John Wiley & Sons).
- the exact conditions which determine the stringency of hybridization depend not only on ionic strength, e.g., 0.2 ⁇ SSC, 0.1 ⁇ SSC of the wash buffers, temperature, e.g., room temperature, 42° C., 68° C., etc., and the concentration of destabilizing agents such as formamide or denaturing agents such as SDS, but also on factors such as the length of the nucleic acid sequence, base composition, percent mismatch between hybridizing sequences and the frequency of occurrence of subsets of that sequence within other non-identical sequences.
- high, moderate or low stringency conditions may be determined empirically.
- hybridization conditions By varying hybridization conditions from a level of stringency at which no hybridization occurs to a level at which hybridization is first observed, conditions which will allow a given sequence to hybridize with the most similar sequences in the sample can be determined. Exemplary conditions are described in Krause (1991) Methods in Enzymology, 200:546-556. Washing is the step in which conditions are usually set so as to determine a minimum level of complementarity of the hybrids. Generally, starting from the lowest temperature at which only homologous hybridization occurs, each degree (° C.) by which the final wash temperature is reduced, while holding SSC concentration constant, allows an increase by 1% in the maximum extent of mismatching among the sequences that hybridize. Generally, doubling the concentration of SSC results in an increase in Tm.
- the washing temperature can be determined empirically for high, moderate or low stringency, depending on the level of mismatch sought.
- Exemplary high stringency conditions include, but are not limited to, hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37° C., and a wash in 0.1 ⁇ SSC at 60° C.
- Example of progressively higher stringency conditions include, after hybridization, washing with 0.2 ⁇ SSC and 0.1% SDS at about room temperature (low stringency conditions); washing with 0.2 ⁇ SSC, and 0.1% SDS at about 42° C. (moderate stringency conditions); and washing with 0.1 ⁇ SSC at about 68° C. (high stringency conditions).
- Washing can be carried out using only one of these conditions, e.g., high stringency conditions, washing may encompass two or more of the stringency conditions in order of increasing stringency.
- Optimal conditions will vary, depending on the particular hybridization reaction involved, and can be determined empirically.
- Equivalent conditions can be determined by varying one or more of the parameters given as an example, as known in the art, while maintaining a similar degree of identity or similarity between the target nucleic acid molecule and the primer or probe used.
- Hybridizable nucleotide sequences are useful as probes and primers for identification of organisms comprising a nucleic acid of the invention and/or to isolate a nucleic acid of the invention, for example.
- the term “primer” is used herein as it is in the art and refers to a single-stranded oligonucleotide which acts as a point of initiation of template-directed DNA synthesis under appropriate conditions in an appropriate buffer and at a suitable temperature.
- primer site refers to the area of the target DNA to which a primer hybridizes.
- primer pair refers to a set of primers including a 5′ (upstream) primer that hybridizes with the 5′ end of the DNA sequence to be amplified and a 3′ (downstream) primer that hybridizes with the complement of the 3′ end of the sequence to be amplified.
- the present invention also relates to host cells containing the above-described constructs.
- the host cell can be a eukaryotic cell, such as a plant cell or yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell.
- the host cell can be stably or transiently transfected with the construct.
- the polynucleotides may be introduced alone or with other polynucleotides. Such other polynucleotides may be introduced independently, co-introduced or introduced joined to the polynucleotides of the invention.
- a “host cell” is a cell that normally does not contain any of the nucleotides of the present invention and contains at least one copy of the nucleotides of the present invention.
- a host cell as used herein can be a cell in a culture setting or the host cell can be in an organism setting where the host cell is part of an organism, organ or tissue.
- prokaryotic expression vector a prokaryotic expression vector
- the appropriate host cell would be any prokaryotic cell capable of expressing the cloned sequences.
- Suitable prokaryotic cells include, but are not limited to, bacteria of the genera Escherichia, Bacillus, Pseudomonas, Staphylococcus , and Streptomyces.
- eukaryotic cells are the host cells.
- Eukaryotic host cells include, but are not limited to, insect cells, HeLa cells, Chinese hamster ovary cells (CHO cells), African green monkey kidney cells (COS cells), human 293 cells, and murine 3T3 fibroblasts.
- a yeast cell may be employed as a host cell.
- Yeast cells include, but are not limited to, the genera Saccharomyces, Pichia and Kluyveromyces .
- the yeast hosts are S. cerevisiae or P. pastoris .
- Yeast vectors may contain an origin of replication sequence from a 2T yeast plasmid, an autonomously replication sequence (ARS), a promoter region, sequences for polyadenylation, sequences for transcription termination and a selectable marker gene.
- ARS autonomously replication sequence
- Shuttle vectors for replication in both yeast and E. coli are also included herein.
- Introduction of a construct into the host cell can be affected by calcium phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, electroporation, transduction, infection or other methods.
- TALENs are proteins that have been engineered to cleave nucleic acids at a specific site in the sequence.
- the cleavage sites of TALENs are extremely customizable and pairs of TALENs can be generated to create double-stranded breaks (DSBs) in nucleic acids at virtually any site in the nucleic acid. See Bogdanove and Voytas, Scienc, 333:1843-1846 (2011), which incorporated by reference herein
- Transformants carrying the expression vectors are selected based on the above-mentioned selectable markers. Repeated clonal selection of the transformants using the selectable markers allows selection of stable cell lines expressing the fusion proteins constructs. Increased concentrations in the selection medium allows gene amplification and greater expression of the desired fusion proteins.
- the host cells for example E. coli cells, containing the recombinant fusion proteins can be produced by cultivating the cells containing the fusion proteins expression vectors constitutively expressing the engineered proteins constructs.
- the present invention also provides for transgenic organisms, including but not limited to plants, animals and prokaryotic organisms.
- the transgenic animals all of whose germ and somatic cells would contain the DNA construct of the invention, would include vertebrates in general. Examples of transgenic animals include but are not limited to mammals such as non-human primates, mice, sheep, pigs, cattle, goats, guinea pigs, rodents, e.g., rats, and the like.
- the term transgenic animal also includes animals in all stages of development, including embryonic and fetal stages.
- transgenic animals may be obtained, for example, by injecting the DNA constructs of the present invention into a fertilized egg which is allowed to develop into an adult animal.
- a transgenic animal a few hundred DNA molecules are injected into the pro-nucleus of a fertilized one cell egg.
- the micro injected eggs are then transferred into the oviducts of pseudopregnant foster mothers and allowed to develop. It has been reported by Brinster et al., Proc. Natl. Acad. Sci. USA 82:4438-4442 (1985), that about 25% of mice which develop will inherit one or more copies of the micro injected DNA.
- the transgenic animals may be obtained by utilizing recombinant ES cells for the generation of the transgenes, as described by Gossler et al., Proc. Natl. Acad. Sci. USA 83:9065-9069 (1986).
- the offspring may be analyzed for the integration of the transgene by isolating genomic DNA from tail tissue and the fragment coding for the engineered proteins identified by conventional DNA-hybridization techniques (Southern, J. Mol. Biol. 98:503-517 (1975)). Animals positive for the nucleic acid encoding the engineered protein are further bred to expand the colonies of transgenic mice.
- General and specific examples of methods of preparing transgenic animals are disclosed in U.S. Pat. Nos.
- transgenic animals which contain the nucleic acid encoding the engineered protein may be analyzed by immunohistology for evidence of the engineered protein expression.
- the present invention also provides for transgenic plants or plant tissue comprising transgenic plant cells, i.e. comprising stably integrated into their genome, an above-described nucleic acid molecule, expression cassette or vector of the invention.
- the present invention also provides transgenic plants, plant cells or plant tissue obtainable by a method for their production as outlined below.
- the present invention provides a method for producing transgenic plants, plant tissue or plant cells comprising the introduction of a nucleic acid molecule, expression cassette or vector of the invention into a plant cell and, optionally, regenerating a transgenic plant or plant tissue therefrom.
- the transgenic plants expressing the engineered protein can be of use in monitoring the transport or movement of analytes throughout and between the organs of an organism, such as to or from the soil.
- the transgenic plants expressing transporters of the invention can be of use for investigating metabolic or transport processes of, e.g., organic compounds with a timely and spatial resolution.
- transgenic plants examples include but are not limited to monocotyledonous plants including seed and the progeny or propagules thereof, for example Lolium, Zea, Triticum, Sorghum, Triticale, Saccharum, Bromus, Oryzae, Avena, Hordeum, Secale and Setaria .
- Especially useful transgenic plants are maize, wheat, barley plants and seed thereof.
- Dicotyledenous plants are also within the scope of the present invention include but are not limited to the species Fabaceae, Solanum , Brassicaceae, especially potatoes, beans, cabbages, forest trees, roses, clematis , oilseed rape, sunflower, chrysanthemum , poinsettia and antirrhinum (snapdragon).
- the plant may be crops, such as a food crops, feed crops or biofuels crops.
- Exemplary important crops may include soybean, cotton, rice, millet, sorghum, sugarcane, sugar beet, tomato, grapevine, citrus (orange, lemon, grapefruit, etc), lettuce, alfalfa, fava bean and strawberries, rapeseed, cassava, miscanthus and switchgrass to name a few.
- plant transformation may be carried out using Agrobacterium -mediated gene transfer, microinjection, electroporation or biolistic methods as it is, e.g., described in Potrykus and Spangenberg (Eds.), Gene Transfer to Plants. Springer Verlag, Berlin, New York, 1995.
- useful plant transformation vectors, selection methods for transformed cells and tissue as well as regeneration techniques are described which are known to the person skilled in the art and may be applied for the purposes of the present invention.
- the invention provides harvestable parts and methods to propagation material of the transgenic plants according to the invention which contain transgenic plant cells as described above.
- Harvestable parts can be in principle any useful part of a plant, for example, leaves, stems, fruit, seeds, roots etc.
- Propagation material includes, for example, seeds, fruits, cuttings, seedlings, tubers, rootstocks etc.
- the present invention also provides methods of producing any of the engineered proteins of the present invention, the method comprising culturing a host cell in conditions that promote protein expression and recovering the engineered protein from the culture, wherein the host cell comprises a vector encoding the engineered protein.
- the protein production methods generally comprise culturing the host cells of the invention under conditions such that the engineered protein is expressed, and recovering said protein.
- the culture conditions required to express the proteins of the current invention are dependent upon the host cells that are harboring the polynucleotides of the current invention.
- the culture conditions for each cell type are well-known in the art and can be easily optimized, if necessary.
- a nucleic acid encoding an engineered protein of the invention, or a construct comprising such nucleic acid can be introduced into a suitable host cell by a method appropriate to the host cell selected, e.g., transformation, transfection, electroporation, infection, such that the nucleic acid is operably linked to one or more expression control elements as described herein.
- Host cells can be maintained under conditions suitable for expression in vitro or in vivo, whereby the encoded engineered protein is produced.
- host cells may be maintained in the presence of an inducer, suitable media supplemented with appropriate salts, growth factors, antibiotic, nutritional supplements, etc., which may facilitate protein expression.
- the engineered proteins of the invention can be produced by in vitro translation of a nucleic acid that encodes the engineered protein, by chemical synthesis or by any other suitable method. If desired, the engineered protein can be isolated from the host cell or other environment in which the protein is produced or secreted. It should therefore be appreciated that the methods of producing the engineered proteins encompass expression of the polypeptides in a host cell of a transgenic plant. See U.S. Pat. Nos. 6,013,857, 5,990,385, and 5,994,616.
- the invention also provides for methods of monitoring analyte movement in a sample, comprising contacting the sample with an engineered protein of the present invention and subsequently measuring the fluorescent signal, which may or may not include FRET.
- the engineered proteins can be used in sensors for measuring a target analytes in a sample, with the sensors comprising the engineered proteins of the present invention.
- the target analytes can be any molecule or compound where the movement or concentration is desired to be measured or monitored. Whichever molecule or ion that the transporter peptide normally transports is capable of being monitored or measured. For example, if ammonium transporter is being engineered, then ammonium can be monitored or measured.
- classes of analytes that might be measured, depending on the transporter protein being engineered, include, but are not limited to amino acids, peptides, polypeptides, proteins, carbohydrates, lipids, nucleotides, oligonucleotides, polynucleotides, glycoproteins or proteoglycans, lipoproteins, lipopolysaccharides, drugs, drug metabolites, small organic molecules, inorganic molecules and natural or synthetic polymers.
- “carbohydrate” includes, but is not limited to monosaccharides, disaccharides, oligosaccharides and polysaccharides.
- Carbohydrate also includes, but is not limited to, molecules comprising carbon, hydrogen and oxygen that do not fall within the traditional definition of a saccharide—i.e., an aldehyde or ketone derivative of a straight chain polyhydroxyl alcohol, containing at least three carbon atoms. Thus, for example, a carbohydrate may contain fewer than three carbon atoms.
- lipid is used it is in the art, i.e., substances of biological origin that are made up primarily or exclusively of nonpolar chemical groups such that they are readily soluble in most organic solvents, but only sparingly soluble in aqueous solvents.
- lipids include, but are not limited to, fatty acids, triacylglycerols, glycerophospholipids, sphingolipids, cholesterol, steroids and derivatives thereof.
- lipids include but are not limited to, the ceramides, which are derivatives of sphingolipids and derivatives of ceramides, such as sphingomyelins, cerebrosides and gangliosides.
- “Lipids” also include, but are not limited to, the common classes of glycerophospholipds (or phospholipids), such as phosphatidic acid, phosphatidylethanolamine, phosphatidylcholine, phosphatidylserine, phosphatidylinositol, phosphatidylglycerol and the like.
- a “drug” can be a known drug or a drug candidate, whose activity or effects on a particular cell type are not yet known.
- a “drug metabolite” is any of the by-products or the breakdown products of a drug that is changed chemically into another compound or compounds.
- small organic molecule includes, but is not limited to, an organic molecule or compound that does not fit precisely into other classifications highlighted herein.
- the engineered proteins of the current invention can be used to assess or measure the concentrations of more than one target analyte.
- concentration is used as it is in the art.
- the concentration may be expressed as a qualitative value, or more likely as a quantitative value.
- the quantification of the analytes can be a relative or absolute quantity.
- the quantity (concentration) of any of the analytes may be equal to zero, indicating the absence of the particular analyte sought.
- the quantity may simply be the measured signal, e.g., fluorescence, without any additional measurements or manipulations.
- the quantity may be expressed as a difference, percentage or ratio of the measured value of the particular analyte to a measured value of another compound including, but not limited to, a standard or another analyte.
- the difference may be negative, indicating a decrease in the amount of measured analyte(s).
- the quantities may also be expressed as a difference or ratio of the analyte(s) to itself, measured at a different point in time.
- the quantities of analytes may be determined directly from a generated signal, or the generated signal may be used in an algorithm, with the algorithm designed to correlate the value of the generated signals to the quantity of analyte(s) in the sample.
- the engineered proteins of the current invention are designed to possess capabilities of continuously measuring the concentrations an analyte.
- the term “continuously,” in conjunction with the measuring of an analyte is used to mean the engineered protein either generates or is capable of generating a detectable signal at any time during the life span of the engineered protein.
- the detectable signal may be constant in that the engineered protein is always generating a signal, even if the signal is not detected.
- the engineered protein may be used episodically, such that a detectable signal may be generated, and detected, at any desired time.
- the engineered proteins are particularly useful in an in vivo setting for measuring target analytes as they occur or appear in a host organism or tissue or cell.
- the target analytes need not be labeled.
- unlabeled target analytes may also be measured in an in vitro or in situ setting as well.
- the target analytes may be labeled. Labeled target analytes can be measured in an in vivo, in vitro or in situ setting.
- engineered proteins allow high-throughput measurements in complex samples using an appropriate plate fluorometer (e.g. TECAN M1000).
- TECAN M1000 plate fluorometer
- This type of analysis can be used to measure the analyte content in different tissues, different individuals or different populations of, for example, crop plants experiencing drought. Purification of bulk amounts of engineered proteins can be achieved after expression in Pichia pastoris , using pPinkFLIP vectors and a protease deficient strain of Pichia.
- FP conformation-sensitive fluorescent proteins
- AMT1 Arabidopsis thaliana ammonium transporter
- FIGS. 1A-1F and 4A-4B Monomeric teal fluorescent protein (mTFP), yellow fluorescent protein (YFP) Venus or a modified circularly permuted GFP (mcpGFP) were inserted into intracellular loops of AMT1; 3.
- the constructs were inserted in the yeast expression vector pDRf1-GW, containing the f1 replication origin, GATEWAYTM cassette, PMA1 promoter fragment, ADH terminator, and the URA cassette for selection in yeast.
- the Xbal restriction site (tctaga) was inserted in different positions of AtAMT1; 3 (after amino acids 233, 312, 364 and 448) via Kunkel mutagenesis.
- mcpGFP was generated by amplifying the domains of EGFP corresponding to amino acids 150-239 and 1-144 with the primers:
- EGFP-150-for (SEQ ID NO: 1) aacgtctatatcatggcc EGFP-239-rev: (SEQ ID NO: 2) ttttt accggt accacc cttgtacagctcgtcca EGFP-1-for: (SEQ ID NO: 3) ttttt accggt ggatct atggtgagcaagggcg EGFP-144-rev: (SEQ ID NO: 4) agttgtactccagcttgtgc
- Primers EGFP-239-rev and EGFP-1-for contained an Agel restriction site (in bold) and additional nucleotides coding for Gly-Gly and Gly-Ser, respectively (underlined).
- the two amplified bands were gel-purified by a commercial kit (Machery-Nagel), digested by Agel (New England Biolabs) and ligated by T4 DNA ligase (New England Biolabs).
- the resulting cpGFP where the domains 150-239 and 1-144 were connected by the linker coding GGTGGS (SEQ ID NO: 32), was cloned into a pGEM-Teasy (Promega).
- the additional internal mutations M66K, V76A, S88G, D93Y, T116V, A119K, V2511, which were known to improve stability of cpGFP, were introduced by Kunkel mutagenesis.
- mcpGFP was amplified with the primers below and contained the Xbal restriction site (in bold):
- cpGFP-for (SEQ ID NO: 5) ttttt tctaga aacgtctatatcatggcc cpGFP-rev: (SEQ ID NO: 6) ttttt tctaga agttgtactccagcttgtgc
- the purified mcpGFP, mTFP and Venus encoding fragments were digested by Xbal (New England Biolabs) and ligated into digested pDRf1-GW vectors containing AtAMT1; 3, to generate the fusion constructs AtAMT1; 3-mcpGFP, AtAMT1; 3-TFP and AtAMT1; 3-Venus in positions 233, 312, 364 and 448 ( FIGS. 4A-4B ).
- cpGFP-for-SH (SEQ ID NO: 11) ggtcctcgtcgtggtcggttcgagaaa tctcat aacgtctatatcaag cGFP-for-GGSH: (SEQ ID NO: 12) ggtcctcgtcgtggtcggttcgagaaa ggtggttctcat aacgtctatatcaag cGFP-for-GGGGSH: (SEQ ID NO: 13) ggtcctcgtcgtggtcggttcgagaaa ggtggtggtggttctcat aacgtctatatcaag cGFP-for-LE: (SEQ ID NO: 14) ggtcctcgtcgtggtcggttcgagaaa ctcgag aacgtctatatcaag cGFP-for-LE
- the amplification products contained the mcpGFP flanked by the variable linker sequences (underlined) and about 30 by homologous to the region around the insertion point 233 of AtAMT1; 3 (in bold).
- the transformed yeast contained the pDR-AMT-mcpGFP vectors resulting from insertion of the ⁇ 800 by mcpGFPs with linkers into the vector backbone, as confirmed by DNA sequencing.
- mcpGFP Homologous recombination was also used to generate variants of the linker preceding mcpGFP ( FIGS. 14A-14B ).
- mcpGFP was amplified with the primers below (where N indicates any nucleotide).
- cGFP-for-deg (SEQ ID NO: 21) ggtcctcgtcgtggtcggttcgagaaa NNNNNN aacgtctatatcaag cGFP-rev-FN: (SEQ ID NO: 22) gtggccgcgcagagcaatagcgcgaccacc attaaa gttgtactccagcttg
- Homologous recombination was also used to insert the mcpGFP in different positions along L5-6 ( FIGS. 9A-9C ) and to generate deletions in the loop.
- Point mutations for inactivation of AtAMT1; 3 and AmTrac were generated by we used Kunkel mutagenesis (D202N, G460D, T464D; FIGS. 2A-2F and 11A-11B ).
- Yeast strains 31019b [mep1 ⁇ mep2 ⁇ ::LEU2 mep3 ⁇ ::KanMX2 ura3], a strain in which all three endogenous MEP ammonium transporter genes had been deleted, and its parental strain 23344c [ura3], were transformed using the lithium acetate method and selected on solid YNB (minimal yeast medium without nitrogen; Difco) supplemented with 3% glucose and 1 mM arginine. Single colonies were grown in 5 mL liquid YNB supplemented with 3% glucose and 0.1% proline under agitation (230 rpm) at 30° C.
- liquid cultures were diluted 10 ⁇ 1 , 10 ⁇ 2 , 10 ⁇ 3 , 10 ⁇ 4 , 10 ⁇ 5 and 10 ⁇ 6 in water and 5 ⁇ L of each dilution were spotted on solid YNB medium buffered with 50 mM MES/Tris, pH 5.2 and supplemented with 3% glucose and either NH 4 Cl, (NH 4 ) 2 SO 4 or 1 mM arginine as the sole nitrogen source.
- yeast cultures were washed twice in 50 mM MES buffer, pH 6.0, and resuspended to OD 600 nm ⁇ 0.5 in MES buffer supplemented with 5% glycerol to delay cell sedimentation. Fluorescence was measured by a fluorescence plate reader (Safire, Tecan), in bottom reading mode using a 7.5 nm bandwidth for both excitation and emission.
- Yeast cultures were washed twice in 50 mM MES buffer, pH 6.0, and resuspended to OD 600 nm ⁇ 0.5 in MES buffer supplemented with 5% glycerol to delay cell sedimentation. Fluorescence was measured by a fluorescence plate reader (Safire, Tecan), in bottom reading mode using a 7.5 nm bandwidth for both excitation and emission. To measure fluorescence response to substrate addition, 50 ⁇ L of substrate (dissolved in water as 500% stock solution) were added to 200 ⁇ L of cells in a 96-well plate (Greiner). For the spectral recordings of AmTrac ( FIGS.
- excitation scans were performed at emission wavelength of 520 nm; emissions scans were recorded with excitation at 488 nm. In all other cases, fluorescence was measured as emission at 513 nm using excitation at 488 nm. Response data are presented as (F water ⁇ F treatment )/F treatment .
- AMT1; 3 was extremely sensitive to any manipulation within the loops 7-8 and 9-10 or the cytosolic C-terminus ( FIGS. 4A-4B ).
- modification of loop 5-6 (L5-6, position 233) by insertion of either two amino acid residues (encoded by the restriction site Xbal) or mcpGFP was tolerated ( FIGS. 4A-4B ).
- the L5-6 is located between the two pseudo-symmetric halves, i.e., between the two structural inverted repeats, of the protein and connects two transmembrane helices (TMH-V and -VI) that contain residues postulated to be directly involved in recruitment, gating and substrate translocation ( FIG. 5 ).
- yeast cells expressing the sensors were acquired on an inverted confocal laser scanning microscope (SP5, Leica).
- SP5, Leica inverted confocal laser scanning microscope
- yeast cells were trapped as a single cell layer in a microfluidic perfusion system (Y04C plate, Onyx, Cellasic) and perfused with either 50 mM MES buffer, pH 6.0, or buffer supplemented with NH 4 Cl.
- the setup was imaged at a spinning disk confocal microscope (Yokogawa CSU-X1; Leica DMI6000) equipped with a motorized stage (ASI). Fluorescence was excited by a solid state laser at 488 nm; emission was detected using a 525/50 nm filter set (Semrock) and an electron multiplying charge coupled device (EMCCD) camera (Evolve, Photometrics). Measurements were taken every 2 min, with 100 ms exposure time using Slidebook 5.0 image acquisition software (Intelligent Imaging Innovations). To account for lateral shift during imaging, the image stacks were post-registered using the StackReg plugin for ImageJ.
- Fluorescence pixel intensity was quantified using Fiji software; single cells were selected and analyzed with the help of the ROI manager tool.
- Kymograph analysis was performed using the MultipleKymograph plugin for ImageJ (Rietdorf and Seitz) by measuring pixel intensities over time along a 3 pixel wide line.
- AMT1 acts as a trans-activation domain in the trimeric AMT1 complex. Mutations in the cytosolic C-terminus block AMT1 activity, which can be restored by suppressor mutations either in the cytosolic loops or in the pore region.
- AmTrac-LS a sensor with a Leu-Ser linker, was as responsive to ammonium as AmTrac ( ⁇ 40% FI change), yet it was 517% brighter ( FIGS. 14A-14B ).
- AmTrac-LS responses were detectable in single cells, were concentration-dependent, and were readily reversible, demonstrating that AmTrac-LS can be used in vivo to measure transport of ammonium and to observe conformational change kinetics of the chimeric transporter in response to substrate availability ( FIG. 15 ).
- the fluorescence response of AmTrac transporter demonstrates that AMT1; 3 undergoes conformational changes during the transport cycle.
- the pseudo-symmetry of AMT with an inverted repeat of five TMH is similar to that of the LeuT transporter.
- LeuT carries a substrate-binding site at the interface of the two repeats and undergoes a transport cycle involving multiple states.
- the transport pore of the AMTs is located between the two pseudo-symmetric halves (or structural inverted repeats), with TMH-V and -VI (connected by L5-6) carrying key residues for ammonium translocation.
- AmTrac was expressed in Arabidopsis plants stably transformed via Agrobacterium by the method of floral dip, by using the vector pEARLEY100 (Earley et al. 2006). AmTrac was expressed under control of the strong promoter 35S. Seeds of heterozygous Arabidopsis were surface-sterilized and germinated in growth medium deprived of ammonium in 16 h-8 h light-dark conditions. After 5 days, seedlings were transferred in the microfluidic perfusion system RootChip and grown in liquid medium deprived of ammonium for additional 24 h.
- Roots and root hair from the seedling were then analyzed by inverted confocal microscopy (Yokogawa CSU-X1) and roots were perfused with 50 mM MES buffer or a square pulse of 1 mM ammonium chloride in buffer for 30 minutes. Fluorescence was excited by a solid state laser at 488 nm; emission was detected using a 525/50 nm filter set (Semrock) and an electron multiplying charge coupled device (EMCCD) camera (Evolve, Photometrics).
- FIG. 16 shows that the engineered proteins were functional in Arabidopsis root hairs. Fluorescence pixel intensity from single root hairs was quantified using Fiji software.
- MEP2 was also used as scaffold for creating a new sensor (MEPtrac). MEP2 is only 44% identical to AMT1; 3, demonstrating that the invention is not limited by the amino acid sequence of the transporters.
- MepTrac was constructed in a single step based on the fluorophore insertion point determined in AMT as well as using the optimized linkers developed for AmTrac. Structural and phylogenetic studies have shown that members of the AMT/MEP/RH (AMT/MEP/RH superfamily (PFAM # PF00909) family have highly similar structures and that regulatory mechanisms are highly conserved.
- MEP2 which is evolutionary very distant, but structurally highly conserved, into an engineered protein that is responsive to analyte movement through the sensor demonstrates that transporter proteins with a structural inverted repeat can be converted into sensors using the methods provided herein.
- the mcpGFP was inserted in the middle of loop 5 of MEP2 (after amino acid 217) and connected by the linkers coding for amino acids LS (preceding mcpGFP) and FN (following mcpGFP) by using overlapping PCR cloning.
- MEPtrac The functionality of MEPtrac was tested by growth complementation of the of ⁇ mep1,2,3 yeast strain on ammonium and the response of the sensor was tested by fluorimetric analysis after addition of different concentrations of ammonium chloride or sodium chloride as a control. See FIGS. 17A-17B .
- FIG. 18A is a plot of fluorescence lifetime decays, plotted with a logarithmic scale, and shows a single exponential decay for all the variants tested with and without addition of saturating ammonium concentrations. While the lifetime was not altered, the FI decrease triggered by ammonium addition was visible.
- FIG. 18B shows normalized fluorescence lifetime decays of all the variants tested with and without the addition of ammonium. Data were plotted to show that lifetime decays in a similar manner after addition of ammonium.
- ⁇ 0 background (noise)
- a 1 amplitude (max Int.)
- ⁇ fluorescence decay.
- the measurement of flux through a transporter by reporting state changes is applicable to other transporters or enzymes for monitoring in vivo fluxes, e.g., in the context of neurotransmitter transport, for transporters that have structural inverted repeats as the AMT protein does.
- FIG. 19A shows the single exponential decay of AmTrac-GS's lifetime which is not altered after the addition of ammonium.
- the lack of an additional fast depolarization in the VV- and VH-decays leads to the conclusion, that homo-FRET cannot be detected for the system tested.
- FIG. 19B shows a similar decay in anisotropy decay curves for AmTrac-GS, with and without addition of ammonium, which indicates a similar rotation time of the fluorophore mcpGFP in AmTrac-GS.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Molecular Biology (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Immunology (AREA)
- Biochemistry (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biomedical Technology (AREA)
- Cell Biology (AREA)
- Urology & Nephrology (AREA)
- Hematology (AREA)
- Biophysics (AREA)
- Genetics & Genomics (AREA)
- Gastroenterology & Hepatology (AREA)
- Toxicology (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Zoology (AREA)
- Food Science & Technology (AREA)
- Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- General Physics & Mathematics (AREA)
- Pathology (AREA)
- Botany (AREA)
- Tropical Medicine & Parasitology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Peptides Or Proteins (AREA)
Abstract
The invention relates to engineered transporter proteins comprising at least one fluorescent reporter covalently bound to the transporter protein, wherein the transporter proteins of the present invention comprise a structural inverted repeat motif, with the motif comprising a first and second subunit that are structural inverted repeats of one another and that are joined to one another by a polypeptide loop.
Description
- The application claims priority to U.S. Provisional Application No. 61/536,005, filed 18 Sep. 2011.
- Part of the work performed during development of this invention utilized U.S. Government funds from National Science Foundation Grant No. 1021677. The U.S. Government has certain rights in this invention.
- A computer readable text file, entitled “056100-5085-WO-SequenceListing.txt,” created on or about 12 Sep. 2012 with a file size of about 18 kb contains the sequence listing for this application and is hereby incorporated by reference in its entirety.
- 1. Field of the Invention
- The invention relates to engineered transporter proteins comprising at least one fluorescent reporter covalently bound to the transporter protein, wherein the transporter proteins of the present invention comprise a structural inverted repeat motif, with the motif comprising a first and second subunit that are structural inverted repeats of one another and that are joined to one another by a polypeptide loop.
- 2. Background of the Invention
- Transporter proteins are membrane bound proteins that often utilize potential gradients to drive transport of molecules and ions into and out of cells. Traditionally, these proteins are classified based on the similarity of their amino acid sequence, i.e., the primary structure of the protein.
- There is, however, an understanding of the three-dimensional structure of some of these transporters that is emerging. Indeed, some of these transporter proteins possess a structural inverted repeat, and transporters proteins with apparently disparate amino acid sequences have been shown to assume such a structure. It was not at all understood how this structural inverted repeat may affect the molecule or ion flux that is associated with these proteins.
- Voltage sensors have been generated by others based on, for example, potassium channels, but these voltage sensors do not directly measure potassium transport. Instead these voltage sensors merely measure membrane potential using the voltage sensor of the transporter. To date, there has been no way to monitor or directly measure molecule or ion transport.
- The present invention solves the problems by providing transport proteins that are capable of signaling a conformational change during the transport process. The invention thus provides sensors that are based on virtually any transporter with the required three-dimensional conformation, which, in turn, provides a new means of monitoring their activity or modifications by the regulatory machinery in real time in vivo
- The invention relates to engineered transporter proteins comprising at least one fluorescent reporter covalently bound to the transporter protein. The transporter proteins of the present invention comprise a structural inverted repeat motif, with the motif comprising a first and second subunit that are structural inverted repeats of one another and that are joined to one another by a polypeptide loop. The fluorescent reporter typically is covalently bound to the polypeptide loop that joins the two subunits of the structural inverted repeat.
- The invention also relates to methods of using these engineered transporter proteins to monitor analyte movement across a membrane, such as a cell membrane.
- The invention also relates to nucleic acids encoding the engineered transporter proteins.
-
FIGS. 1A, 1B, 1C, 1D, 1E and 1F depict the design and characterization of one specific ammonium transporter named “AmTrac.” (FIG. 1A ) Topological representation of AMT1; 3 by HMMTOP. 11 Transmembrane helices (TMH) are organized in a pseudo-symmetric structure, i.e., structural inverted repeats, (TMH I-V and TMH VI-X) with an extra terminal TMH-XI that directs the C-terminus to the cytosol. The position of the residues preceding the insertion points of FPs in L5-6, 7-8, 9-10 and in the C-tail are indicated. Residues D202, G460 and T464 are important for the activity of the transporter and are also shown. (FIG. 1B ) Schematic representation of AmTrac transporter. Linkers between AMT1; 3 and mcpGFP are indicated. (FIG. 1C ) Three-dimensional model of AmTrac based on the crystal structures of Af-AMT1 (2B2H) and cpGFP (3evp). One monomer is shown in cartoon and the rest of the trimer complex is represented as a shaded surface in the background. mcpGFP (bottom) was inserted inposition 233 of L5-6 of AMT1; 3, connecting TMH-V and -VI. (FIG. 1D ) Substrate specificity of the fluorescent response of AmTrac. Yeast cells expressing the sensor were treated with the indicated salts at 1 mM concentration. Data are normalized to water-treated control (0) (mean±SD; n=3). Only the ammonium treatments were significantly different from control (SNK test: *P<0.01). (FIG. 1E ) Titration of the fluorescent response of AmTrac (circles and left y-axis) and of ammonium uptake of AMT1; 3 (squares right y-axis). Data are normalized to water-treated controls (0) (mean±s.d.; n=3). (FIG. 1F ) Response of a single yeast cell expressing AmTrac to square pulses of NH4Cl. -
FIGS. 2A, 2B, 2C, 2D, 2E and 2F depict AmTrac mutant analysis. (FIG. 2A ) Confocal section of yeast expressing AmTrac or its inactive variants D202N, G460D or T464D. Bar=10 μm. (FIG. 2B ) Growth of the Δmep1,2,3 or wt yeast expressing AmTrac or its inactive variants on solid media containing 2 mM NH4Cl or 1 mM arginine (growth control) as sole nitrogen source for 3 days. Endogenous MEPs in wt strain are not affected by expression of mutant variants. (FIG. 2C ) Fluorescence response of Δmep1,2,3 and wt yeast expressing AmTrac or its inactive variants to 1 mM NH4Cl (mean±s.d.; n=3). Only the yeast cells expressing AmTrac showed a response significantly different from control (SNK test: *P<0.01). (FIG. 2D ) Growth complementation of Δmep1,2,3 expressing suppressor mutants grown on solid media containing the indicated concentrations of NH4Cl or 1 mM arginine. AmTrac-T464D-A141E expressing cells grow poorly at high ammonium concentrations. (FIG. 2E ) Correlation between transport efficiency (growth in 2 mM NH4Cl) and fluorescence change after addition of 1 mM NH4Cl of the suppressor mutants. Data are normalized by values of AmTrac (=100) (mean±s.d.; n=3). (FIG. 2F ) Titration of the fluorescent response of AmTrac (circles) and of the high capacity variant AmTrac-100μ. (squares). Data are normalized to water-treated controls (0) (mean±s.d.: n=3). -
FIGS. 3A and 3B depict a model for AmTrac sensor and AMT transport. AMT switches between at least two distinct states during transport of ammonium: an outward, open state (FIG. 3A ) and an inward, open state (FIG. 3B ). The movement of TMH-V (left helix) and TMH-VI (right helix) is transmitted to the connecting loop, affecting the inserted mcpGFP and resulting in a change in fluorescence emission. -
FIGS. 4A and 4B depict the characterization of constructs with FP insertions. (FIG. 4A ) The functionality of the transporters was measured as growth of the yeast Δmep1,2,3 mutant transformed with AMT-FP fusions and grown on solid media containing 2 mM NH4Cl or 1 mM arginine (growth control) as the sole nitrogen source for three days. Numbers indicate the position in AtAMT1; 3 preceding the insertion site. Control: empty vector. (FIG. 4B ) Fluorescence emission spectra of the fusion construct AMT1; 3-mcpGFP-233 expressed in yeast. -
FIG. 5 depicts a model of the AtAMT1; 3-mcpGFP fusion. mcpGFP (bottom portion) was fused to the integral membrane protein AtAMT1; 3 inposition 233 of the cytoplasmic L5-6 bridging TMH-V to TMH-VI. The connection point between the two proteins is shown as dashed line. Residues shown in sticks highlight the location of several residues relevant for transport function, in particular TMH-V-H157 from the twin-His (H157-H305) pair, TMH-VI-5208 (forming the ammonium selectivity and recruitment site together with W137), TMH-VI-P204 from the twin Phe-gate (F96-F204) that close the pore in all known structures. The peptide loop connecting TMH V-VI, L5-6 also connects the two pseudo-symmetric halves of the structural inverted repeat of the protein (TMH I-V and TMH VI-X). -
FIGS. 6A and 6B depict a screen of linker variants for identifying a AMT1; 3-mcpGFP sensor. To create an AMT-mcpGFP fusion that responded to addition of ammonium with a fluorescence change, the composition and length of the peptide linker was varied. (FIG. 6A ) Growth of the yeast Δmep1,2,3 mutant transformed with fusion variants on solid media containing 2 mM NH4Cl or 1 mM arginine (growth control) as the sole nitrogen source for three days. All variants retained transport activity. Composition of linkers connecting AtAMT1; 3 and mcpGFP are indicated. Linkers at the N- and C-termini of mcpGFP are indicated in letter code and separated by a slash. In the cases of variants 1-6, no linkers were inserted between the C-terminal sequence of mcpGFP and the second part of AtAMT1; 3. Control, empty vector. (FIG. 6B ) Fluorescence intensity before addition of ammonium and fluorescence intensity change after addition of 1 mM NH4Cl for the 24 linker variants (mean±s.d.; n=3). Most variants showed at least some change in intensity, butvariant 16, carrying LE/FN as linkers, named AmTrac, showed the highest change in fluorescence intensity. -
FIG. 7 depicts the fluorescence spectra of AmTrac (variant 16 fromFIG. 6 ). Excitation spectra (left curves) were recorded at an emission wavelength of 520 nm. Emission spectra (right curves) were recorded with excitation of 488 nm. Data were background-subtracted using data from yeast cells carrying an empty vector. Exc=excitation scan, Em=emission scan. Note that ammonium treatment triggers a decrease in fluorescence. -
FIG. 8 depicts the sequence of AmTrac. Protein sequence of AmTrac. Underlined residues constitute synthetic linker segments. Bolded residues correspond to the mcpGFP moiety. Numbers indicate amino acid position in AtAMT1; 3. -
FIGS. 9A, 9B and 9C depict the Influence of the insertion position of mcpGFP in L5-6 of AtAMT1; 3. To probe the tolerance of L5-6 of AMT1; 3 to insertion, the insertion position was varied within the peptide loop. (FIG. 9A ) Schematic representation of the L5-6 region of AtAMT1; 3. Underlined residues correspond to TMH 5 (left) and TMH 6 (right). (FIG. 9B ) Growth assay of the yeast Δmep1,2,3 mutant transformed with insertion mutants on solid media containing 2 mM NH4Cl or 1 mM arginine (growth control) as the sole nitrogen source for three days. Numbers indicate the insertion site within AtAMT1; 3 (residue preceding the point of insertion of mcpGFP). The center of L5-6 (position 233) tolerated the insertion with minimal effects on growth. (FIG. 9C ) Fluorescence response of the variants to addition of the indicated concentrations of NH4Cl. Data were normalized to the water-treated control (0) (mean±s.d.; n=2). Variants with mcpGFP inserted into the central positions showed the strongest ammonium-induced response. -
FIG. 10 depicts the growth and response of AmTrac variants with deletions in L5-6. To probe whether the loop length was critical for transport activity and fluorescence response, incremental deletions of the peptide loop L5-6 were generated around the insertion point of mcpGFP. Growth was analyzed as described inFIGS. 9A-9C . Numbers in the left column indicate the position of the insertion in AtAMT1; 3; two numbers indicate residues preceding and following the mcpGFP insertion. Right column indicates whether the corresponding variant responded to addition of 1 mM NH4Cl with a fluorescence change. The original AmTrac is highlighted in bold. Note that deletions >4 residues abolished transport activity and deletions >3 impaired the fluorescence response. -
FIGS. 11A and 11B depict the fluorescence response of transport-deficient AmTrac mutants to increasing ammonium concentrations. Fluorescence response of Δmep1,2,3 (FIG. 11A ) or wt (FIG. 11B ) yeast expressing AmTrac or the transport-inactive variants D202N, G460D or T464D. Data were normalized to water-treated controls (0) (mean±s.d.; n=3). Only yeast cells expressing AmTrac showed significantly different responses (SNK test: P<0.01). -
FIGS. 12A, 12B and 12C depict the reconstitution of the fluorescent response in cis-suppressor mutants of the transport-inactive AmTrac-T464D mutant. (FIG. 12A ) Frequency of the occurrence of individual cis-suppressors of the T464D-inactivating mutation in the multicopy screen. A total of 56 colonies were retrieved. Almost half of them carried the A141E mutation. Three suppressors had a pseudo-reversion mutation: D464V. (FIG. 12B ) Growth was analyzed as described inFIGS. 9A-9C , with varying concentrations of NH4Cl, (NH4)2SO4 (as anion control) or 1 mM arginine. The yeast expressing AmTrac-T464D-A141E grew poorly at high concentrations of ammonium, suggesting high capacity transport activity leading to ammonium toxicity. (FIG. 12C ) Fluorescence response of selected suppressors to addition of the indicated concentrations of NH4Cl. Data were normalized to water-treated controls (0) (mean±s.d.; n=3). -
FIGS. 13A, 13B and 13C depict the position of suppressor mutations in AMTs. (FIG. 13A ) Sequence alignment between AtAMT1; 3 from Arabidopsis and AfAmt-1 from A. fulgidus. The residues belonging to TMH domains of the two halves of the structural inverted repeat of AfAMT1 are shown as underlined and speckled, respectively. The corresponding residues identified in the suppressor screen of AmTrac-T464D are indicated in both sequences as bold residues. (FIG. 13B ) Lateral view and (FIG. 13C ) cytoplasmic side view of AfAMT1 according to the crystal structure. The corresponding residues in AfAMT1 that suppress the T464D mutation in AmTrac are indicated by spheres. The connecting L5-6 is peptide loop labeled. -
FIGS. 14A and 14B depict a screen for sensor variants. (FIG. 14A ) Fluorescence intensity and fluorescence response of ˜350 random variants of the C-terminal linker of mcpGFP to addition of 1 mM NH4Cl. The brightest and most responsive variants (open squares) were sequenced and composition of the linker is reported. (FIG. 14B ) Fluorescence intensity and fluorescence response after addition of 1 mM NH4Cl, normalized to values of AmTrac (100%) of yeast expressing the best variants identified in (A) (mean±s.d.; n=3). -
FIG. 15 depicts the single cell responsiveness of AmTrac-LS. Individual cells trapped in a microfluidic plate were perfused with 50 mM MES buffer pH 6.0, or a square pulse of 50 μM NH4Cl in buffer (framed region). Data were normalized to the initial value (mean±s.d.; n=3). -
FIG. 16 depicts the responsiveness of AmTrac in root hairs of Arabidopsis. Seedlings grown in a microfluidic perfusion system (RootChip) were perfused with 50 mM MES buffer pH 6.0, or a square pulse of 1 mM NH4Cl in buffer. -
FIGS. 17A and 17B depict the characterization of an AmTrac version (named MEPtrac) employing the ammonium transporter MEP2 (methylammonium permease 2) from the yeast Saccharomyces caerevisiae instead of AMT1; 3 of Arabidopsis thaliana. The mcpGFP was inserted in the middle ofloop 5 of MEP2 (after amino acid 217) and connected by the linkers coding for amino acids LS (preceding mcpGFP) and FN (following mcpGFP). (FIG. 17A ) Growth complementation of Δmep1,2,3 expressing suppressor mutants grown on solid media containing the indicated concentrations of NH4Cl or 1 mM arginine or 5 mM methylammonium plus 0.1% proline. MEPtrac is able to complement growth on ammonium as well as the parent protein MEP2. (FIG. 17B ) Fluorescence response of Δmep1,2,3 and wt yeast expressing MEPtrac to the indicated concentrations of NH4Cl or NaCl (mean±s.d.; n=3). -
FIGS. 18A and 18B depict the time-resolved fluorescence lifetime measurements and analysis of intact yeast cells expressing three improved AmTrac variants, AmTrac-GS, AmTrac-LS and AmTrac-IS with and without addition of 1 mM NH4Cl. -
FIGS. 19A and 19B depict the time-resolved anisotropy measurements of intact yeast cells expressing AmTrac-GS to elucidate the relevance of homo-FRET in the sensor mechanism. - The invention relates to engineered transporter proteins comprising at least one fluorescent reporter covalently bound to the transporter protein. The transporter proteins of the present invention must comprise at least one “structural inverted repeat motif.” As used herein, a structural inverted repeat motif refers to a specific arrangement of secondary and tertiary structures of a protein. In particular a transporter protein with a “structural inverted repeat” is a protein that has multiple alpha helices as part of its secondary structure, and many, but not necessarily all, of these alpha helices are arranged in such a manner that they span a plasma membrane (“transmembrane helices”) when the protein occurs in its natural environment. Furthermore, a “structural inverted repeat” indicates that there is an “approximate axis of symmetry” dividing the transmembrane helices into two set of helices. Each set of transmembrane helices on either half of the approximate axis of symmetry is considered a “subunit” for the purposes of the present invention. In transporter proteins containing an inverted structural repeat, the alpha-carbon atoms of the residues making up the first set (“first subunit”) of transmembrane helices can be superimposed on the alpha-carbon atoms of the residues making up the second set (“second subunit”) of transmembrane helices by a rotation of about 180 degrees. The ability to superimpose the alpha-carbon atoms of the transmembrane helices between the two subunits provides the “structural repeat” aspect of the transporter protein. The approximate 180 degree rotation of one subunit relative to the other subunit such that the alpha-carbon atoms between the two subunits can be superimposed on one another provides “inverted” aspect of the inverted structural repeat.
- The first and second subunits that make up the structural inverted repeat need not be rotated exactly 180 degrees relative to one another. For example, in the Na+/Cl− transporter protein in Aquifex aeolicus, the second subunit is rotated about 176.5 degrees relative to the first subunit. See Yamashita, A., et al. Nature, 437:215-223 (2005), which is incorporated by reference. In another example, the Mhp1 transporter (indolyl-methyl- and benzyl-hydantions), which is a member of the nucleobase-cation-symport (NCS1) family of proteins, from M. liquifaciens is a transporter protein with a structural inverted repeat in which the second subunit is rotated about 168 degree relative to the first subunit. See Weyand, S. et al., Science, 322:709-713 (2008), which is incorporated by reference. For the purposes of the present invention, the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 90 degrees to about 270 degrees. In one specific embodiment, the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 100 degrees to about 260 degrees. In another specific embodiment, the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 110 degrees to about 250 degrees. In another specific embodiment, the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 120 degrees to about 240 degrees. In another specific embodiment, the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 130 degrees to about 230 degrees. In another specific embodiment, the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 140 degrees to about 220 degrees. In another specific embodiment, the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 150 degrees to about 210 degrees. In another specific embodiment, the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 160 degrees to about 200 degrees. In another specific embodiment, the rotation needed to superimpose the two halves of the repeat structure such that the protein is said to have a “structural inverted repeat” can be any angle from about 170 degrees to about 190 degrees.
- Proteins said to have a structural inverted repeat can have any number transmembrane helices in each subunit, and the number of transmembrane helices need not be the same in the two subunits. In general, the first and second subunits can each independently have 2, 3, 4, 5, 6, 7, 8, 9 or even 10 transmembrane helices. Thus, as an example, one subunit may have 5 transmembrane helices in its first subunit and 5 transmembrane helices in its second subunit. As another example, one subunit may have 5 transmembrane helices in its first subunit and 6 transmembrane helices in its second subunit. In general, however, the number of transmembrane helices that are rotated in the second subunit, relative to the first subunit, will define the number of transmembrane helices in the subunits. For example, the Na+/Cl− transporter protein in Aquifex aeolicus has 5 transmembrane helices in the first subunit and 5 transmembrane helices in the second subunit.
- In addition, the transporter porter protein may have additional secondary structures that are not part of the structural inverted repeat motif. For example, the transporter proteins may have additional alpha helical structures, and these alpha helices may even be transmembrane helices. But these additional transmembrane helices would not necessarily be part of the structural inverted repeat motif. For example, the Na+/Cl− transporter protein in Aquifex aeolicus has 12 total transmembrane alpha helices, numbered 1-12. The alpha helices numbered 1-5 in the Na+/Cl− transporter protein in Aquifex aeolicus are considered to make up the first subunit of the structural inverted repeat motif, and helices 6-10 make up the second subunit of the structural inverted repeat motif.
11 and 12 in the Na+/Cl− transporter protein in Aquifex aeolicus span the membrane but do not belong to either the first or second subunit of the structural inverted repeat motif.Alpha helices - Moreover, the transmembrane helices in the subunits need not be continuous such that there may be a break helical structure along the transmembrane helix. For example, the transmembrane helices numbered 1 and 6 of the Na+/Cl− transporter protein in Aquifex aeolicus are both discontinuous transmembrane helices, yet each helix would be considered a single transmembrane helix that is part of their respective subunit.
- The transmembrane helices are connected to one another by a chain of amino acids. Often, two helices are connected to one another with a chain of amino acids that do not possess a secondary structure. On the other hand, the chain of amino acids connecting two transmembrane helices may have a secondary structure, e.g., an alpha helix, occurring along its length. For example, one subunit of the structural inverted repeat may have a transmembrane helix connected to a second transmembrane helix through a chain of amino acids (“polypeptide loop”). The transmembrane helices connected by the polypeptide loop may or may not be physically located beside each other in the protein structure. The helices are said to be “connected to one another” based upon the overall amino acid sequence of the protein. For example, amino acid residues 41-70 of the linear amino acid sequence may form one transmembrane helix and amino acid residues 88-124 may form another transmembrane helix. The two helices would be connected to one another through the polypeptide chain on amino acid residues 71-87, although the two transmembrane helices are necessarily physically beside one another in the overall three-dimensional structure of the protein. Continuing the example, amino acid residues 71-87 of the polypeptide loop connecting the two transmembrane helices may themselves form an alpha helical structure, but this alpha-helical structure would not be considered one of the transmembrane helices of that form the structural inverted repeat structure if the alpha helix of the polypeptide loop did not span the membrane when normally expressed.
- Moreover, the polypeptide loop connecting the two helices may be external to the plasma membrane (“extracellular”) or in may be internal to the plasma membrane (“intracellular”). Of course, any reference herein to any portion of the transporter protein relative to a cell or plasma membrane is for illustrative purposes only. The proteins may be synthesized and formed ex vivo and subsequently folded into a three dimensional structure that resembles or is the same as the properly folded protein if it were to be expressed natively in a cell.
- Based on the three dimensional arrangement of the transporter protein and the linearity of the amino acid sequence, there will be a polypeptide loop that connects two transmembrane helices, where the two transmembrane helices are in different subunits of the structural inverted repeat. This polypeptide loop is considered to be the polypeptide loop that connects the two subunits of the structural inverted repeat motif of the transporter peptide. For example,
FIG. 1A shows a cartoon diagram of an ammonium transporter protein comprising a structural inverted repeat motif. Each subunit of the structural inverted repeat in the ammonium transporter comprises 5 transmembrane helices, with helices 1-5 belonging to the first subunit and helices 6-10 belonging to the second subunit. Each helix is connected to the “next” helix in the amino acid chain through a polypeptide loop as defined herein.Transmembrane helix 5 is joined totransmembrane helix 6 through a polypeptide loop, andhelix 5 andhelix 6 belong to two different subunits of the structural inverted repeat motif. The polypeptide loop that connects the first and second subunits of the structural inverted repeat is the loop that joins 5 and 6 to one another. The polypeptide loop that connects the first and second subunits of the structural inverted repeat can be extracellular or it can be intracellular.transmembrane helices - Examples of transporter proteins include but are not limited to members of the APC (amino acid, polyamine and organocation) superfamily of transporter proteins that have a structural inverted repeat, the AMT/MEP/RH superfamily of transporters, the Neurotransmitter:Sodium Symporter (NSS) superfamily of transporters, such as but not limited to the LeuT, Tyt1 and TnaT transporters, the Betaine/Carnitine/Choline superfamily of transporters (BCCT), such as but not limited to BetP, the The Amino Acid/Auxin Permease (AAAP) superfamily of transports, the Solute:Sodium Symporter (SSS) superfamily of transporters, the Alanine or Glycine:Cation Symporter (AGCS) superfamily of transporters, the Cation-Chloride Cotransporter (CCC) superfamily of transporters, the Nucleobase:Cation Symporter-1 (NCS1) superfamily of transporters and the Hydroxy/Aromatic Amino Acid Permease (HAAAP) superfamily of transporters.
- Determining if a transporter protein contains a structural inverted repeat can be performed by resolving the crystal structure of the proteins through for example, X-ray crystallography and NMR-spectroscopy. The topology of protein transporters can efficiently and reliably be predicted using programs such as TMHMM (available on the internet at www.cbs.dtu.dk/services/TMHMM/) the current version of which is “Server 2.0.” Determining if a transporter protein contains repeated elements can be performed by alignment of the protein sequence to itself, or by algorithms such as MotifScan, Radar (de novo repeat detection in protein sequences), REP (searches a protein sequence for repeats), REPRO (de novo repeat detection in protein sequences, T-REKS (de novo detection and alignment of repeats in protein sequences), TRUST (de novo repeat detection in protein sequences), XSTREAM (de novo tandem repeat detection and architecture modeling in protein sequences. All such tools are accessible through the internet at www.expasy.org/tools/. Other systems may include but are not limited to automated systems for modeling proteins based on its amino acid sequence, for example, ModPipe (available on the internet at: www.salilab.org), CPH models (available on the internet at: www.cbs.dtu.dk/services/CPHmodels/), 3D-JIGSAW (available on the internet at: www.bmm.icnet.uk/˜3 djigsaw/), ESyPred3D (available on the internet at: www.fundp.ac.be/urbm/bioinfo/esypred/), or SDSC1 (available on the internet at: cl.sdsc.edu/hm.html). The invention is not limited to the specific methods used to of determine the three dimensional structure of the transporter protein.
- The fluorophore, i.e., the fluorescent reporter, can be covalently bound to any one of the polypeptide loops that connect two transmembrane helices. In one embodiment, the fluorescent reporter is bound to the polypeptide loop that connects the first and second subunits of the structural inverted repeat of the transporter peptide. In another embodiment, the fluorescent reporter is bound to the polypeptide loop that does not connect the first and second subunits of the structural inverted repeat of the transporter peptide.
- As used herein, “fluorophore” is used as it is in the art and refers to a molecule that emits light upon the absorption of energy. In general, a fluorophore of the present invention is any chemical moiety that exhibits an absorption maximum at or beyond 280 nm, and when covalently attached to a protein or other reagent retains its spectral properties. Fluorophores of the present invention include, without limitation; a pyrene (including any of the corresponding derivative compounds disclosed in U.S. Pat. No. 5,132,432, incorporated by reference), an anthracene, a naphthalene, an acridine, a stilbene, an indole or benzindole, an oxazole or benzoxazole, a thiazole or benzothiazole, a 4-amino-7-nitrobenz-2-oxa-1,3-diazole (NBD), a cyanine, a carbocyanine (including any corresponding compounds in U.S. Pat. Nos. 4,981,977; 5,268,486; 5,569,587; 5,569,766; 5,486,616; 5,627,027; 5,808,044; 5,877,310; 6,002,003; 6,004,536; 6,008,373; 6,043,025; 6,127,134; 6,130,094; 6,133,445; 6,664,047; 6,974,873 and 6,977,305; and publications WO 02/26891, WO 97/40104, WO 99/51702, WO 01/21624;
EP 1 065 250 A1, incorporated by reference), a carbostyryl, a porphyrin, a salicylate, an anthranilate, an azulene, a perylene, a pyridine, a quinoline, a borapolyazaindacene (including any corresponding compounds disclosed in U.S. Pat. Nos. 4,774,339; 5,187,288; 5,248,782; 5,274,113; and 5,433,896, incorporated by reference), a xanthene (including any corresponding compounds disclosed in U.S. Pat. Nos. 6,162,931; 6,130,101; 6,229,055; 6,339,392; 5,451,343 and 6,716,979, incorporated by reference), an oxazine (including any corresponding compounds disclosed in U.S. Pat. No. 4,714,763, incorporated by reference) or a benzoxazine, a carbazine (including any corresponding compounds disclosed in U.S. Pat. No. 4,810,636, incorporated by reference), a phenalenone, a coumarin (including an corresponding compounds disclosed in U.S. Pat. Nos. 5,696,157; 5,459,276; 5,501,980 and 5,830,912, incorporated by reference), a benzofuran (including an corresponding compounds disclosed in U.S. Pat. Nos. 4,603,209 and 4,849,362, incorporated by reference) and benzphenalenone (including any corresponding compounds disclosed in U.S. Pat. No. 4,812,409, incorporated by reference) and derivatives thereof. As used herein, oxazines include resorufins (including any corresponding compounds disclosed in U.S. Pat. No. 5,242,805, incorporated by reference), aminooxazinones, diaminooxazines, and their benzo-substituted analogs. Additional labeling moieties include, but are not limited to, those compounds that are described in United States Patent Publication No. 2006/0280652, published 14 Dec. 2006 and PCT Publication No. WO 2006/025887, which are incorporated by reference. Other fluorophores are described in RICHARD P. HAUGLAND, MOLECULAR PROBES HANDBOOK OF FLUORESCENT PROBES AND RESEARCH PRODUCTS (9th edition, CD-ROM, (September 2002), which is herein incorporated by reference. - When the fluorophore is a xanthene, the fluorophore is optionally a fluorescein, a rhodol (including any corresponding compounds disclosed in U.S. Pat. Nos. 5,227,487 and 5,442,045, incorporated by reference), or a rhodamine (including any corresponding compounds in U.S. Pat. Nos. 5,798,276; 5,846,737 and 6,562,632, incorporated by reference). As used herein, fluorescein includes benzo- or dibenzofluoresceins, seminaphthofluoresceins, or naphthofluoresceins. Similarly, as used herein rhodol includes seminaphthorhodafluors (including any corresponding compounds disclosed in U.S. Pat. No. 4,945,171, incorporated by reference). Alternatively, the fluorophore is a xanthene that is bound via a linkage that is a single covalent bond at the 9-position of the xanthene. Xanthenes also include derivatives of 3H-xanthen-6-ol-3-one attached at the 9-position, derivatives of 6-amino-3H-xanthen-3-one attached at the 9-position, or derivatives of 6-amino-3H-xanthen-3-imine attached at the 9-position.
- Fluorophores for use in the present invention include, but are not limited to, xanthene (rhodol, rhodamine, fluorescein and derivatives thereof) coumarin, cyanine, pyrene, oxazine and borapolyazaindacene. Examples of xanthenes are sulfonated xanthenes, fluorinated xanthenes, sulfonated coumarins, fluorinated coumarins and sulfonated cyanines. The choice of the fluorophore will determine the absorption and fluorescence emission properties of the transporter protein or other labeling reagent complex. Physical properties of a fluorophore label include spectral characteristics (absorption, emission and stokes shift), fluorescence intensity, lifetime, polarization and photo-bleaching rate all of which can be used to distinguish one fluorophore from another.
- Typically the fluorophore contains one or more aromatic or heteroaromatic rings, that are optionally substituted one or more times by a variety of substituents, including without limitation, halogen, nitro, cyano, alkyl, perfluoroalkyl, alkoxy, alkenyl, alkynyl, cycloalkyl, arylalkyl, acyl, aryl or heteroaryl ring system, benzo, or other substituents typically present on fluorophores known in the art.
- Non-limiting examples of fluorophores useful as reporter groups in this invention include acrylodan, fluorescein, coumarins, rhodamines, 5-TMRIA (tetramethylrhodamine-5-iodoacetamide), Quantum Red™ (R-phycoerythrin coupled to (9-(2(or4)-(N-(2-maleimdylethyl)-sulfonamidyl)-4(or 2)-sulfophenyl)-2,3,6,7,12,13,16,17-octahydro-(1H,5H,11H,15H-xantheno(2,3,4-ij:5,6,7-i′j′)diquinolizin-18-ium salt), Texas Red™ (9-(2(or4)-(N-(2-maleimdylethyl)-sulfonamidyl)-4(or 2)-sulfophenyl)-2,3,6,7,12,13,16,17-octahydro-(1H,5H,11H,15H-xantheno(2,3,4-ij:5,6,7-i′j′)diquinolizin-18-ium salt), Cy™3 (2-(5-(1-(6-(N-(2-maleimdylethyl)-amino)-6-oxohexyl)-1,3-dihydro-3,3-dimethyl-5-sulfo-2H-indol-2-ylidene)-1,3-propyldienyl)-1-ethyl-3,3-dimethyl-5-sulfo-3H-indolium salt), N-((2-iodoacetoxy)ethyl)-N-methyl)amino-7-nitrobenzoxadiazo-le (IANBD), 6-acryloyl-2-dimethylaminonaphthalene (acrylodan), pyrene, Lucifer Yellow (6-amino-2,3-dihydro-2-(2-((iodoacetyl)amino)ethyl)-1,3-dioxo-1H-benz(de)isoquinoline-5,8-disulfonic acid salt), Cy™5 (2-(5-(1-(6-(N-(2-maleimdylethyl)-amino)-6-oxohexyl)-1,3-dihydro-3,3-dimethyl-5-sulfo-2H-indol-2-ylidene)-1,3-pentadienyl)-1-ethyl-3,3-dimethyl-5-sulfo-3H-indolium salt), Dapoxyl® (2-bromoacetamidoethyl)sulfonamide(4-(5-(4-dimethylaminophenyl)oxazol-2-yl)phenyl-N-(2-bromoacetamidoethyl)sulfonamide), (N-(4,4-difluoro-1,3,5,7-tetramethyl-4-bora-3a,4a-diaza-s-indacene-2-yl)-iodoacetamide (Bodipy507/545 IA), N-(4,4-difluoro-5,7-diphenyl-4-bora-3a,4a-diaza-s-indacene-3-propionyl)-N′-iodoacetylethylenediamine (BODIPY®. 530/550 IA), 5-((((2-iodoacetyl)amino)ethyl)amino)naphthalene-1-sulfonic acid (1,5-IAEDANS), carboxy-X-rhodamine, 5/6-iodoacetamide (
XRIA 5,6), eosin, acridine orange, Alexa Fluor 350™, Alexa Fluor 405™, Alexa Fluor 430™, Alexa Fluor 488™,Alexa Fluor 500™, Alexa Fluor 514™, Alexa Fluor 532™, Alexa Fluor 546™, Alexa Fluor 555™, Alexa Fluor 568™, Alexa Fluor 594™, Alexa Fluor 610™, Alexa Fluor 633™, Alexa Fluor 635™, Alexa Fluor 647™, Alexa Fluor 660™, Alexa Fluor 680™, Alexa Fluor 700™ and Alexa Fluor 750™. Other fluorophores that exhibit environmentally-sensitive fluorescence properties include squaraines, coumarins, aza-coumarins, IAZCO, benzodiaxoazoles, and dyes derived from Nile Red such as INR. These fluorophores are described in pending U.S. Pregrant Publication No. 2006/0280652, which is incorporated herein by reference. Additional fluorophores are described in U.S. application Ser. No. 12/250,953, Filed 14 Oct. 2008 and Ser. No. 12/124,553, Filed 21 May 2008, respectively, both of which are incorporated by reference. - In one embodiment, the fluorescent reporter is a fluorescent protein. In a more specific embodiment, when a fluorescent protein is used as the reporter, the invention can also provide for fusion proteins comprising the transporter protein and a fluorescent protein as the reporter. The term “fluorescent protein” is readily understood in the art and simply means a protein that emits fluorescence at a detectable wavelength. Examples of fluorescent proteins that are part of fusion proteins of the current invention include, but are not limited to, green fluorescent proteins (GFP, AcGFP, ZsGreen), red-shifted GFP (rs-GFP), red fluorescent proteins (RFP, including DsRed2, HcRed1, dsRed-Express, cherry, tdTomato), yellow fluorescent proteins (YFP, Zsyellow), cyan fluorescent proteins (CFP, AmCyan), a blue fluorescent protein (BFP), ametrine, citrine, cerulean, turquoise, VENUS, teal fluorescent protein (TFP), LOV (light, oxygen or voltage) domains, and the phycobiliproteins, as well as the enhanced versions and mutations of these proteins. Fluorescent proteins as well as enhanced versions thereof are well known in the art and are commercially available. For some fluorescent proteins, “enhancement” indicates optimization of emission by increasing the protein's brightness, creating proteins that have faster chromophore maturation and/or alteration of dimerization properties. These enhancements can be achieved through engineering mutations into the fluorescent proteins.
- The reporter group may be attached to the transporter protein by any conventional means known in the art. For example, the reporter group may be attached via amines or carboxyl residues on the protein. In one embodiment, N-hydroxy-succinimide (NHS) esters are used to crosslink the reporter group to primary amino groups on the transporter protein. Alternatively, cysteine or other amino acid groups may be engineered into the transporter protein to provide sites of attachment for the reporter group. Any thiol-reactive group known in the art may be used for attaching reporter groups such as fluorophores to a native, engineered, or mutated protein's cysteine. For example, acrylates, an iodoacetamide, bromoacetamide, or maleimide are well known thiol-reactive moieties that may be used for this purpose.
- The fluorescent reporters, for example the phycobiliproteins, may be particularly useful for creating tandem dye labeled labeling reagents. In one embodiment of the current invention, therefore, the measurable signal of the fusion protein is actually a transfer of excitation energy (resonance energy transfer) from a donor molecule (e.g., a first fluorescent protein) to an acceptor molecule (e.g., a second fluorescent protein). In particular, the resonance energy transfer is in the form of fluorescence resonance energy transfer (FRET). When the fusion proteins of the present invention utilize FRET to measure or quantify analyte(s), one fluorescent protein of the fusion protein construct can be the donor, and the second fluorescent protein of the fusion protein construct can be the acceptor. The terms “donor” and “acceptor,” when used in relation to FRET, are readily understood in the art. Namely, a donor is the molecule that will absorb a photon of light and subsequently initiate energy transfer to the acceptor molecule. The acceptor molecule is the molecule that receives the energy transfer initiated by the donor and, in turn, emits a photon of light. The efficiency of FRET is dependent upon the distance between the two fluorescent partners and can be expressed mathematically by: E=R0 6/(R0 6+r6), where “E” is the efficiency of energy transfer, “r” is the distance (in Angstroms) between the fluorescent donor/acceptor pair and “R0” is the Förster distance (in Angstroms). The Förster distance, which can be determined experimentally by readily available techniques in the art, is the distance at which FRET is half of the maximum possible FRET value for a given donor/acceptor pair. A particularly useful combination is the phycobiliproteins disclosed in U.S. Pat. Nos. 4,520,110; 4,859,582; 5,055,556, incorporated by reference, and the sulforhodamine fluorophores disclosed in U.S. Pat. No. 5,798,276, or the sulfonated cyanine fluorophores disclosed in U.S. Pat. Nos. 6,977,305 and 6,974,873; or the sulfonated xanthene derivatives disclosed in U.S. Pat. No. 6,130,101, incorporated by reference and those combinations disclosed in U.S. Pat. No. 4,542,104, incorporated by reference.
- A “spectral change” in a fluorophore reporter group may be monitored to detect analyte movement through the transporter protein. The “spectral change” that occurs upon analyte movement can be, but is not limited to, a change in fluorescence lifetime, fluorescence intensity, fluorescence polarization, and spectral shifts of fluorescence emission. Such spectral changes may result from changes in the local environment of the fluorophore, such as those resulting from changes in protein conformation. Environmentally-sensitive dyes such as acrylodan and IANBD are particularly useful in this respect. Other spectral changes may result from interactions with the analyte itself or from interactions with a second reporter group, for example when FRET (fluorescence resonance energy transfer) is used to monitor changes in distance between two fluorophores.
- The term “detector” as used herein refers to any component, portion thereof, or system of components that can detect the detectable signal generated by the reporter group. In one specific embodiment, the detector can detect at least one property of light including, but not limited to, a charged coupled device (CCD), back-side thin-cooled CCD, front-side illuminated CCD, a CCD array, a photodiode, a photodiode array, a photo-multiplier tube (PMT), a PMT array, complimentary metal-oxide semiconductor (CMOS) sensors, CMOS arrays, a charge-injection device (CID), CID arrays, etc. The detector can be adapted to relay information to a data collection device for storage, correlation, and/or manipulation of data, for example, a computer, or other signal processing system.
- The detected signal from the fluorescent reporter may simply be the measured signal, e.g., fluorescence, without any additional measurements or manipulations. Alternatively, the signal may be expressed as a difference, percentage or ratio of the measured value to a different measured value such as, but not limited to, a standard, baseline or response to another analyte. The signal when compared to another value may be negative, indicating a decrease in the amount of measured analyte(s). The quantities may also be expressed as a difference or ratio of the analyte(s) to itself, measured at a different point in time. The quantities of analytes may be determined directly from a generated signal, or the generated signal may be used in an algorithm, with the algorithm designed to correlate the value of the generated signals to the quantity of analyte(s) in the sample.
- In one embodiment, the polypeptide loop connecting two transmembrane helices is linked to a fluorescent reporter without a linker peptide. In another embodiment, the polypeptide loop connecting two transmembrane helices is linked to the fluorescent reporter with at least one linker peptide, for example one or two peptide linkers. As used herein, a peptide linker is used to mean a polypeptide typically ranging from about 1 to about 120 amino acids in length that is designed to facilitate the functional connection of two distinct entities, such as the transporter protein and a fluorescent reporter. To be clear, a single amino acid can be considered a peptide linker peptide for the purposes of the present invention. In specific embodiments, the peptide linker comprises or in the alternative consists of amino acids numbering 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119 or 120 residues in length. Of course, the peptide linkers used in the fusion proteins of the present invention may comprise or in the alternative consist of amino acids numbering more than 120 residues in length. The length of the peptide linker(s), if present, are not necessarily critical to the function of the engineered protein, provided that the peptide linker permits a functional connection between the transporter protein and any fusion partner thereof, e.g., a fluorescent protein.
- The term “functional connection” in the context of a linker peptide indicates a connection that facilitates folding of the polypeptides into a three dimensional structure that allows the linked fusion polypeptide to mimic some or all of the functional aspects or biological activities of the protein or portion thereof. For example, in the case of a labeled transporter protein, the linker may be used to create a single-chain fusion of a three dimensional structure that mimics the structure of the transporter and the fluorescent reporter. The term functional connection also indicates that the linked portions possess at least a minimal degree of stability, flexibility and/or tension that would be required for the transporter protein to function as desired.
- In one embodiment of the present invention, when more than one linker peptide is used, each of the linker peptides comprises or consists of the same amino acid sequence. In another embodiment, when more than one linker peptide is used, each of the amino acid sequences of the peptide linkers are different from one another.
- The transporter proteins that contain a structural inverted repeat can be from any plant source and the invention is not limited by the source of the transporter protein, i.e., the invention is not limited to the plant species from which transporter proteins that contain a structural inverted repeat normally occurs or is obtained. Examples of sources from which the transporter proteins that contain a structural inverted repeat may be derived include but are not limited to monocotyledonous plants that include, for example, Lolium, Zea, Triticum, Sorghum, Triticale, Saccharum, Bromus, Oryzae, Avena, Hordeum, Secale and Setaria. Other sources from which the transporter proteins that contain a structural inverted repeat may be derived include but are not limited to maize, wheat, barley, rye, rice, oat, sorghum and millet. Additional sources from which the transporter proteins that contain a structural inverted repeat may be derived include but are not limited to dicotyledenous plants that include but are not limited to Fabaceae, Solanum, Brassicaceae, especially potatoes, beans, cabbages, forest trees, roses, clematis, oilseed rape, sunflower, chrysanthemum, poinsettia, arabidopsis, tobacco, tomato, and antirrhinum (snapdragon), soybean, canola, sunflower and even basal land plant species, (the moss Physcomitrella patens). Additional sources also include gymnosperms.
- It is understood that the invention is not limited to transporter proteins that contain a structural inverted repeat from the plant species listed herein, and that the invention encompasses proteins encoded by orthologous of genes in other species. As used herein, orthologous genes are genes from different species that perform the same or similar function and are believed to descend from a common ancestral gene. Proteins from orthologous genes, in turn, are the proteins encoded by the orthologs. As such the term “ortholog” may be to refer to a gene or a protein. Often, proteins encoded by orthologous genes have similar or nearly identical amino acid sequence identities to one another, and the orthologous genes themselves have similar nucleotide sequences, particularly when the redundancy of the genetic code is taken into account. Thus, by way of example, the ortholog of the Amt1 transporter would be an Amt1 transporter in another species of plant, regardless of the amino acid sequence of the two proteins.
- In another aspect, the invention provides deletion variants wherein one or more amino acid residues in the transporter proteins that contain a structural inverted repeat or one or more fluorescent protein(s) are removed. Deletions can be effected at one or both termini of the transporter protein or one or more fluorescent protein(s), or with removal of one or more non-terminal amino acid residues of the transporter protein or one or more fluorescent protein(s).
- The proteins of the present invention may also comprise substitution variants of a transporter protein that contain a structural inverted repeat of the fluorescent proteins used herein. Substitution variants include those polypeptides wherein one or more amino acid residues of the transporter proteins that contain a structural inverted repeat are removed and replaced with alternative residues. In general, the substitutions are conservative in nature. Conservative substitutions for this purpose may be defined as set out in the tables below. Amino acids can be classified according to physical properties and contribution to secondary and tertiary protein structure. A conservative substitution is recognized in the art as a substitution of one amino acid for another amino acid that has similar properties. Exemplary conservative substitutions are set out in below.
-
TABLE I Conservative Substitutions Side Chain Characteristic Amino Acid Aliphatic Non-polar Gly, Ala, Pro, Iso, Leu, Val Polar-uncharged Cys, Ser, Thr, Met, Asn, Gln Polar-charged Asp, Glu, Lys, Arg Aromatic His, Phe, Trp, Tyr Other Asn, Gln, Asp, Glu - Alternatively, conservative amino acids can be grouped as described in Lehninger (1975) Biochemistry, Second Edition; Worth Publishers, pp. 71-77, as set forth below.
-
TABLE II Conservative Substitutions Side Chain Characteristic Amino Acid Non-polar (hydrophobic) Aliphatic: Ala, Leu, Iso, Val, Pro Aromatic: Phe, Trp Sulfur-containing: Met Borderline: Gly Uncharged-polar Hydroxyl: Ser, Thr, Tyr Amides: Asn, Gln Sulfhydryl: Cys Borderline: Gly Positively Charged (Basic): Lys, Arg, His Negatively Charged (Acidic) Asp, Glu - And still other alternative, exemplary conservative substitutions are set out below.
-
TABLE III Conservative Substitutions Original Residue Exemplary Substitution Ala (A) Val, Leu, Ile Arg (R) Lys, Gln, Asn Asn (N) Gln, His, Lys, Arg Asp (D) Glu Cys (C) Ser Gln (Q) Asn Glu (E) Asp His (H) Asn, Gln, Lys, Arg Ile (I) Leu, Val, Met, Ala, Phe Leu (L) Ile, Val, Met, Ala, Phe Lys (K) Arg, Gln, Asn Met (M) Leu, Phe, Ile Phe (F) Leu, Val, Ile, Ala Pro (P) Gly Ser (S) Thr Thr (T) Ser Trp (W) Tyr Tyr (Y) Trp, Phe, Thr, Ser Val (V) Ile, Leu, Met, Phe, Ala - The invention therefore provides isolated peptides, with the peptides comprising an amino acid sequence at least about 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the amino acid sequences disclosed herein. For example, the invention provides for polypeptides comprising or consist of amino acid sequences that are 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the amino acid sequence of SEQ ID NO:31.
- A polypeptide having an amino acid sequence at least, for example, about 95% “identical” to a reference an amino acid sequence is understood to mean that the amino acid sequence of the polypeptide is identical to the reference sequence except that the amino acid sequence may include up to about five modifications per each 100 amino acids of the reference amino acid sequence. In other words, to obtain a peptide having an amino acid sequence at least about 95% identical to a reference amino acid sequence, up to about 5% of the amino acid residues of the reference sequence may be deleted or substituted with another amino acid or a number of amino acids up to about 5% of the total amino acids in the reference sequence may be inserted into the reference sequence. These modifications of the reference sequence may occur at the N-terminus or C-terminus positions of the reference amino acid sequence or anywhere between those terminal positions, interspersed either individually among amino acids in the reference sequence or in one or more contiguous groups within the reference sequence.
- As used herein, “identity” is a measure of the identity of nucleotide sequences or amino acid sequences compared to a reference nucleotide or amino acid sequence. In general, the sequences are aligned so that the highest order match is obtained. “Identity” per se has an art-recognized meaning and can be calculated using well known techniques. While there are several methods to measure identity between two polynucleotide or polypeptide sequences, the term “identity” is well known to skilled artisans (Carillo (1988) J. Applied Math. 48, 1073). Examples of computer program methods to determine identity and similarity between two sequences include, but are not limited to, GCG program package (Devereux (1984)
Nucleic Acids Research 12, 387), BLASTP, ExPASy, BLASTN, FASTA (Atschul (1990) J. Mol. Biol. 215, 403) and FASTDB. Examples of methods to determine identity and similarity are discussed in Michaels (2011) Current Protocols in Protein Science, Vol. 1, John Wiley & Sons. - In one embodiment of the present invention, the algorithm used to determine identity between two or more polypeptides is BLASTP. In another embodiment of the present invention, the algorithm used to determine identity between two or more polypeptides is FASTDB, which is based upon the algorithm of Brutlag (1990) Comp. App. Biosci. 6, 237-245). In a FASTDB sequence alignment, the query and reference sequences are amino sequences. The result of sequence alignment is in percent identity. In one embodiment, parameters that may be used in a FASTDB alignment of amino acid sequences to calculate percent identity include, but are not limited to: Matrix=PAM, k-tuple=2, Mismatch Penalty=1, Joining Penalty=20, Randomization Group Length=0, Cutoff Score=1, Gap Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the length of the subject amino sequence, whichever is shorter.
- If the reference sequence is shorter or longer than the query sequence because of N-terminus or C-terminus additions or deletions, but not because of internal additions or deletions, a manual correction can be made, because the FASTDB program does not account for N-terminus and C-terminus truncations or additions of the reference sequence when calculating percent identity. For query sequences truncated at the N- or C-termini, relative to the reference sequence, the percent identity is corrected by calculating the number of residues of the query sequence that are N- and C-terminus to the reference sequence that are not matched/aligned, as a percent of the total bases of the query sequence. The results of the FASTDB sequence alignment determine matching/alignment. The alignment percentage is then subtracted from the percent identity, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score. This corrected score can be used for the purposes of determining how alignments “correspond” to each other, as well as percentage identity. Residues of the reference sequence that extend past the N- or C-termini of the query sequence may be considered for the purposes of manually adjusting the percent identity score. That is, residues that are not matched/aligned with the N- or C-termini of the comparison sequence may be counted when manually adjusting the percent identity score or alignment numbering.
- For example, a 90 amino acid residue query sequence is aligned with a 100 residue reference sequence to determine percent identity. The deletion occurs at the N-terminus of the query sequence and therefore, the FASTDB alignment does not show a match/alignment of the first 10 residues at the N-terminus. The 10 unpaired residues represent 10% of the reference sequence (number of residues at the N- and C-termini not matched/total number of residues in the reference sequence) so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining 90 residues were perfectly matched (100% alignment) the final percent identity would be 90% (100% alignment-10% unmatched overhang). In another example, a 90 residue query sequence is compared with a 100 reference sequence, except that the deletions are internal deletions. In this case the percent identity calculated by FASTDB is not manually corrected, since there are no residues at the N- or C-termini of the subject sequence that are not matched/aligned with the query. In still another example, a 110 amino acid query sequence is aligned with a 100 residue reference sequence to determine percent identity. The addition in the query occurs at the N-terminus of the query sequence and therefore, the FASTDB alignment may not show a match/alignment of the first 10 residues at the N-terminus. If the remaining 100 amino acid residues of the query sequence have 95% identity to the entire length of the reference sequence, the N-terminal addition of the query would be ignored and the percent identity of the query to the reference sequence would be 95%.
- As used herein, the terms “correspond(s) to” and “corresponding to,” as they relate to sequence alignment, are intended to mean enumerated positions within a reference protein, e.g., wild-type Amt1, and those positions in a modified Amt1 that align with the positions on the reference protein. Thus, when the amino acid sequence of a subject protein is aligned with the amino acid sequence of a reference protein, the amino acids in the subject sequence that “correspond to” certain enumerated positions of the reference sequence are those that align with these positions of the reference sequence, but are not necessarily in these exact numerical positions of the reference sequence. Methods for aligning sequences for determining corresponding amino acids between sequences are described herein.
- The use of ‘tweakable’ fluorophores that are sensitive to conformational changes in the recognition element may also be used. For example, GFP is an extraordinarily stable protein, and it is protected from bulk medium by a soda can-like cage formed by a β-barrel that consists of 11 β-strands. The chromophore, a p-hydroxybenzylidene-imidazolidone, develops by autocatalysis from a hexapeptide. The chromophore interacts with residues inside the cage for example by hydrogen bonding (H148 interaction with the hydroxyl of Y66, R96 with the imidazolidone, Y145 stabilizes the chromophore by an edge-face interaction with Y66, and E222 with the hydroxyl of S65). GFP exists in two ground-state forms, called A and B, and these states interconvert depending on the environment such as local pH. The two states have different spectral properties, i.e., have two visible absorption bands at 478 and 398 nm (508 nm emission). In the ground state, interconversion of the states is relatively slow, but occurs much faster in the excited state. Further analysis indicates that the interconversion involves proton transfer followed by structural rearrangements. Interestingly, mutation at S65 or E222 result in a loss of the absorption maximum at 398 nm, supporting a role of protonation of the chromophore or adjacent residues in determining the ‘state’ of the chromophores and thus its absorption properties.
- One example of a GFP protein is enhanced GFP (“eGFP”) from
-
(SEQ ID NO: 32) MVSKGEELFT GVVPILVELD GDVNGHKFSV SGEGEGDATY GKLTLKFICT TGKLPVPWPT LVTTLTYGVQ CFSRYPDHMK QHDFFKSAMP EGYVQERTIF FKDDGNYKTR AEVKFEGDTL VNRIELKGID FKEDGNILGH KLEYNYNSHN VYIMADKQKN GIKVNFKIRH NIEDGSVQLA DHYQQNTPIG DGPVLLPDNH YLSTQSALSK DPNEKRDHMV LLEFVTAAGI TLGMDELYK - This observation opens the possibility to generating fluorophores that are environmentally sensitive and that can report conformational changes in a recognition element. Interestingly, an earlier reported mutant has in the positions of Y145 in the β-barrel contained six additional residues, yet the mutant protein retained fluorescence. Previous studies generated circularly permuted GFP and fused various polypeptides at this site. The circular permuted forms were sensitive to acidification, potentially caused by protonation of a non-chromophoric site leading to a conformational change, changes in hydrogen bonds, or electrostatic repulsion displacing a proton from the phenolic hydroxyl of the chromophores. With the idea that fusion of a recognition element acid quenching may be affected by the presence of polypeptides that depending on their conformation may protect the chromophores from bulk medium access, a calmodulin sensor was develop with a single fluorophore to detect calcium. In a similar approach, the same group also constructed a functional zinc sensor by fusing the circular permuted GFP to a zinc finger domain.
- Many groups further improved this type of sensor in several steps. See for example, Nagai T, Sawano A, Park E S, Miyawaki A. 2001. Circularly permuted green fluorescent proteins engineered to sense Ca2+. Proc. Natl. Acad. Sci. USA 98:3197-202. These optimizations have lead to calcium sensors with a significantly improved dynamic range (ΔF/F0) of over 10-fold. The dynamic range is important in the context of high signal-to-noise ratio (SNR), which is required for in vivo applications. The SNR of these sensors is ˜N1/2 ΔF/F. SNR increases with the square root of the number of photons collected and with the dynamic range ΔF/F. One group lab carried out a careful in vivo comparison of FRET and cpGFP calcium sensors and showed that for example the FRET sensor TN-XL was significantly brighter compared with GCaMP2, thus providing higher SNR (See, Mao T, O'Connor D H, Scheuss V, Nakai J, Svoboda K. 2008. Characterization and subcellular targeting of GCaMP-type genetically-encoded calcium indicators. PLoS ONE 3:e1796). Comparison of performance indicates that in order to achieve high signal to noise and being able to detect action potentials with high confidence further improvements is necessary. SNR is apparently a moving target since the sensors are continuously improved regarding both brightness and dynamic range. Similar as for several FRET sensors, CpGFP sensor can be used for monitoring local calcium levels, e.g. GCaMP2 has been targeted to plasma membrane subdomains by fusion to Na+ pump isoforms (Lee M Y, Song H, Nakai J, Ohkura M, Kotlikoff M I, et al. 2006. Local subplasma membrane Ca2+ signals detected by a tethered Ca2+ sensor. Proc. Natl. Acad. Sci. USA 103:13232-7).
- Circular permutation of EGFP and deletion of several residues resulted in a significant opening of ˜5 Å×10 Å in the barrel providing bulk solvent access. (Akerboom J, Rivera J D, Guilbe M M, Malave E C, Hernandez H H, et al. 2009. Crystal structures of the GCaMP calcium sensor reveal the mechanism of fluorescence signal change and aid rational design. J. Biol. Chem. 284:6455-64). The tyrosine side chain at the tip of the chromophore pointed towards the opening. A dimer was discovered in which the M13 calmodulin-binding peptides interact with the calcium-loaded calmodulin of the other subunit in the dimer, respectively. Biochemical characterization showed that in the absence of calcium GCaMP2 predominantly existed in the monomeric form while in the presence of calcium, both monomeric and dimeric were observed. While in the dimer, channels provide solvent access, the opening is occluded in the monomeric conformation by the linkers of the fusion with calmodulin. As discussed above the main differences between open and closed forms will affect solvent access and thus affect protonation of the phenolate oxygen of the chromophores.
- The concept of using circular permuted GFP variants has successfully been used to construct sensors for inositol-1,3,4,5-tetrakisphosphate (Sakaguchi R, Endoh T, Yamamoto S, Tainaka K, Sugimoto K, et al. 2009. A single circularly permuted GFP sensor for inositol-1,3,4,5-tetrakisphosphate based on a split PH domain. Bioorg Med Chem 17:7381-6), for the ATP/ADP ratio (Berg J, Hung Y P, Yellen G. 2009. A genetically encoded fluorescent reporter of ATP:ADP ratio. Nat Methods 6:161-6) as well as for the transition metals Cu2+/Zn2+(Mizuno T, Murao K, Tanabe Y, Oda M, Tanaka T. 2007. Metal-ion-dependent GFP emission in vivo by combining a circularly permutated green fluorescent protein with an engineered metal-ion-binding coiled-coil. J. Am. Chem. Soc. 129:11378-83). Another group used a similar approach to construct cpGFP-based voltage sensors (Gautam S G, Perron A, Mutoh H, Knöpfel T. 2009. Exploration of fluorescent protein voltage probes based on circularly permuted fluorescent proteins. Front Neuroeng 2:14). Besides green cpGFP, cyan cpCFP, and yellow cpYFP-Venus variants, now also red cpTomato variants are available expanding the possibility to develop such sensors and for multiplexing analysis, e.g., for the parallel use of affinity mutants in a single cells or for measuring multiple analytes in parallel (Li Y, Sierra A M, Ai H W, Campbell R E. 2008. Identification of sites within a monomeric red fluorescent protein that tolerate peptide insertion and testing of corresponding circular permutations. Photochem. Photobiol. 84:111-9). Circular permuted fluorescent reporters can thus be used in the methods of the present invention.
- One example of the engineered proteins of the present invention is seen in the amino acid sequence of SEQ ID NO:28. SEQ ID NO:28 is an engineered ammonium transporter protein fused with a cpGFP. In the amino acid sequence of SEQ ID NO:28, the portion with the sequence below is the cpGFP portion of the engineered protein.
-
(SEQ ID NO: 31) NVYIKADKQK NGIKANFKIR HNIEDGGVQL AYHYQQNTPI GDGPVLLPDN HYLSVQSKLS KDPNEKRDHM VLLEFVTAAG ITLGMDELYK GGTGGSMVSK GEELFTGVVP ILVELDGDVN GHKFSVSGEG EGDATYGKLT LKFICTTGKL PVPWPTLVTT LTYGVQCFSR YPDHMKQHDF FKSAMPEGYI QERTIFFKDD GNYKTRAEVK FEGDTLVNRI ELKGIDFKED GNILGHKLEY N - The invention also provides isolated nucleic acids, with the nucleic acids comprising polynucleotide sequence at least about 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the polynucleotide sequences disclosed herein.
- As a practical matter, whether any particular nucleic acid molecule is at least 80%, 85%, 90%, 92%, 95%, 96%, 97%, 98% or 99% identical to a disclosed nucleic acid can be determined conventionally using known computer programs a discussed herein. For example, percent identity can be determined using the Bestfit program (Wisconsin Sequence Analysis Package,
Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 53711. Bestfit uses the local homology algorithm of Smith and Waterman, Advances in Applied Mathematics 2: 482-489 (1981), to find the best segment of homology between two sequences. When using Bestfit or any other sequence alignment program to determine whether a particular sequence is, for instance, 95% identical to a reference sequence according to the present invention, the parameters are set, of course, such that the percentage of identity is calculated over the full length of the reference nucleotide sequence and that gaps in homology of up to 5% of the total number of nucleotides in the reference sequence are allowed. Methods for correcting percent identity of the of polynucleotides are the same as those described and disclosed herein with respect to polypeptides. - The engineered proteins of the present invention may or may not contain additional elements that, for example, may include but are not limited to regions to facilitate purification. For example, “histidine tags” (“his tags”) or “lysine tags” may be appended to the engineered protein. Examples of histidine tags include, but are not limited to hexaH, heptaH and hexaHN. Examples of lysine tags include, but are not limited to pentaL, heptaL and FLAG. Such regions may be removed prior to final preparation of the engineered protein. Other examples of a fusion partner for the engineered proteins of the present invention include, but are not limited to, glutathione S-transferase (GST) and alkaline phosphatase (AP).
- The addition of peptide moieties to engineered proteins, whether to engender secretion or excretion, to improve stability and to facilitate purification or translocation, among others, is a familiar and routine technique in the art and may include modifying amino acids at the terminus to accommodate the tags. For example the N-terminus amino acid may be modified to, for example, arginine and/or serine to accommodate a tag. Of course, the amino acid residues of the C-terminus may also be modified to accommodate tags. One particularly useful fusion protein comprises a heterologous region from immunoglobulin that can be used solubilize proteins.
- Other types of fusion proteins provided by the present invention include but are not limited to, fusions with secretion signals and other heterologous functional regions. Thus, for instance, a region of additional amino acids, particularly charged amino acids, may be added to the N-terminus of the engineered protein to improve stability and persistence in the host cell, during purification or during subsequent handling and storage.
- The engineered proteins of the current invention may be recovered and purified from recombinant cell cultures by well-known methods including, but not limited to, ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography, e.g., immobilized metal affinity chromatography (IMAC), hydroxylapatite chromatography and lectin chromatography. High performance liquid chromatography (“HPLC”) may also be employed for purification. Well-known techniques for refolding protein may be employed to regenerate active conformation when the fusion protein is denatured during isolation and/or purification.
- Engineered proteins of the present invention include, but are not limited to, products of chemical synthetic procedures and products produced by recombinant techniques from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant, insect and mammalian cells. Depending upon the host employed in a recombinant production procedure, the engineered proteins of the present invention may be glycosylated or may be non-glycosylated. In addition, engineered proteins of the invention may also include an initial modified methionine residue, in some cases as a result of host-mediated processes.
- The present invention also provides for nucleic acids encoding some of the engineered proteins of the present invention.
- The invention also relates to isolated nucleic acids and to constructs comprising these nucleic acids. The nucleic acids of the invention can be DNA or RNA, for example, mRNA. The nucleic acid molecules can be double-stranded or single-stranded; single stranded RNA or DNA can be the coding, or sense, strand or the non-coding, or antisense, strand. In particular, the nucleic acids may encode any engineered protein of the invention. For example, the nucleic acids of the invention include polynucleotide sequences that encode the engineered proteins that contain or comprise glutathione-S-transferase (GST) fusion protein, poly-histidine (e.g., His6), poly-HN, poly-lysine, etc. If desired, the nucleotide sequence of the isolated nucleic acid can include additional non-coding sequences such as
non-coding 3′ and 5′ sequences (including regulatory sequences, for example). - Examples of some of the nucleic acids of the present invention include bit are not limited to those sequences exemplified in the nucleic acid sequences of SEQ ID NO:29 and SEQ ID NO:30, where the nucleic acid sequence of SEQ ID NO:29 encodes the amino acid sequence of SEQ ID NO:28.
- The present invention also comprises vectors containing the nucleic acids encoding the engineered proteins of the present invention. As used herein, a “vector” may be any of a number of nucleic acids into which a desired sequence may be inserted by restriction and ligation for transport between different genetic environments or for expression in a host cell. Vectors are typically composed of DNA although RNA vectors are also available. Vectors include, but are not limited to, plasmids and phagemids. A cloning vector is one which is able to replicate in a host cell, and which is further characterized by one or more endonuclease restriction sites at which the vector may be cut in a determinable fashion and into which a desired DNA sequence may be ligated such that the new recombinant vector retains its ability to replicate in the host cell. An expression vector is one into which a desired DNA sequence may be inserted by restriction and ligation such that it is operably joined to regulatory sequences and may be expressed as an RNA transcript. Vectors may further contain one or more marker sequences suitable for use in the identification and selection of cells which have been transformed or transfected with the vector. Markers include, for example, genes encoding proteins which increase or decrease either resistance or sensitivity to antibiotics or other compounds, genes which encode enzymes whose activities are detectable by standard assays known in the art (e.g., β-galactosidase or alkaline phosphatase), and genes which visibly affect the phenotype of transformed or transfected cells, hosts, colonies or plaques. Examples of vectors include but are not limited to those capable of autonomous replication and expression of the structural gene products present in the DNA segments to which they are operably joined.
- In certain respects, the vectors to be used are those for expression of polynucleotides and proteins of the present invention. Generally, such vectors comprise cis-acting control regions effective for expression in a host operatively linked to the polynucleotide to be expressed. Appropriate trans-acting factors are supplied by the host, supplied by a complementing vector or supplied by the vector itself upon introduction into the host.
- A great variety of expression vectors can be used to express the proteins of the invention. Such vectors include chromosomal, episomal and virus-derived vectors, e.g., vectors derived from bacterial plasmids, from bacteriophage, from yeast episomes, from yeast chromosomal elements, from viruses such as adeno-associated virus, lentivirus, baculoviruses, papova viruses, such as SV40, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, such as those derived from plasmid and bacteriophage genetic elements, such as cosmids and phagemids. All may be used for expression in accordance with this aspect of the present invention. Generally, any vector suitable to maintain, propagate or the fusion proteins in a host may be used for expression in this regard.
- The DNA sequence in the expression vector is operatively linked to appropriate expression control sequence(s) including, for instance, a promoter to direct mRNA transcription. Representatives of such promoters include, but are not limited to, the phage lambda PL promoter, the E. coli lac, trp and tac promoters, HIV promoters, the SV40 early and late promoters and promoters of retroviral LTRs, to name just a few of the well-known promoters. In general, expression constructs will contain sites for transcription, initiation and termination and, in the transcribed region, a ribosome binding site for translation. The coding portion of the mature transcripts expressed by the constructs will include a translation initiating AUG at the beginning and a termination codon (UAA, UGA or UAG) appropriately positioned at the end of the polypeptide to be translated.
- In addition, the constructs may contain control regions that regulate, as well as engender expression. Generally, such regions will operate by controlling transcription, such as repressor binding sites and enhancers, among others.
- Vectors for propagation and expression generally will include selectable markers. Such markers also may be suitable for amplification or the vectors may contain additional markers for this purpose. In this regard, the expression vectors may contain one or more selectable marker genes to provide a phenotypic trait for selection of transformed host cells. Preferred markers include dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, and tetracycline, kanamycin or ampicillin resistance genes for culturing E. coli and other bacteria.
- Examples of vectors that may be useful for engineered proteins include, but are not limited to, pPZP, pZPuFLIPs, pCAMBIA, and pRT to name a few.
- Examples of vectors for expression in yeast S. cerevisiae include pDRFLIP, s, pDR196, pYepSecl (Baldari (1987) EMBO J. 6, 229-234), pMFa (Kurjan (1982)
Cell 30, 933-943), pJRY88 (Schultz (1987) Gene 54, 115-123), pYES2 (Invitrogen) and picZ (Invitrogen). - Alternatively, the engineered proteins can be expressed in insect cells using baculovirus expression vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., SF9 cells) include the pAc series (Smith (1983) Mol. Cell. Biol. 3, 2156 2165) and the pVL series (Lucklow (1989) Virology 170, 31-39).
- The nucleic acid molecules of the invention can be “isolated.” As used herein, an “isolated” nucleic acid molecule or nucleotide sequence is intended to mean a nucleic acid molecule or nucleotide sequence that is not flanked by nucleotide sequences normally flanking the gene or nucleotide sequence (as in genomic sequences) and/or has been completely or partially removed from its native environment (e.g., a cell, tissue). For example, nucleic acid molecules that have been removed or purified from cells are considered isolated. In some instances, the isolated material will form part of a composition (for example, a crude extract containing other substances), buffer system or reagent mix. In other circumstances, the material may be purified to near homogeneity, for example as determined by PAGE or column chromatography such as HPLC. Thus, an isolated nucleic acid molecule or nucleotide sequence can includes a nucleic acid molecule or nucleotide sequence which is synthesized chemically, using recombinant DNA technology or using any other suitable method. To be clear, a nucleic acid contained in a vector would be included in the definition of “isolated” as used herein. Also, isolated nucleotide sequences include recombinant nucleic acid molecules (e.g., DNA, RNA) in heterologous organisms, as well as partially or substantially purified nucleic acids in solution. “Purified,” on the other hand is well understood in the art and generally means that the nucleic acid molecules are substantially free of cellular material, cellular components, chemical precursors or other chemicals beyond, perhaps, buffer or solvent. “Substantially free” is not intended to mean that other components beyond the novel nucleic acid molecules are undetectable. The nucleic acid molecules of the present invention may be isolated or purified. Both in vivo and in vitro RNA transcripts of a DNA molecule of the present invention are also encompassed by “isolated” nucleotide sequences.
- The invention also provides nucleic acid molecules that hybridize under high stringency hybridization conditions, such as for selective hybridization, to the nucleotide sequences described herein (e.g., nucleic acid molecules which specifically hybridize to a nucleotide sequence encoding engineered proteins described herein). Hybridization probes include synthetic oligonucleotides which bind in a base-specific manner to a complementary strand of nucleic acid.
- Such nucleic acid molecules can be detected and/or isolated by specific hybridization e.g., under high stringency conditions. “Stringency conditions” for hybridization is a term of art that refers to the incubation and wash conditions, e.g., conditions of temperature and buffer concentration, which permit hybridization of a particular nucleic acid to a second nucleic acid; the first nucleic acid may be perfectly complementary, i.e., 100%, to the second, or the first and second may share some degree of complementarity, which is less than perfect, e.g., 60%, 75%, 85%, 95% or more. For example, certain high stringency conditions can be used which distinguish perfectly complementary nucleic acids from those of less complementarity.
- “High stringency conditions”, “moderate stringency conditions” and “low stringency conditions” for nucleic acid hybridizations are explained in Current Protocols in Molecular Biology, John Wiley & Sons). The exact conditions which determine the stringency of hybridization depend not only on ionic strength, e.g., 0.2×SSC, 0.1×SSC of the wash buffers, temperature, e.g., room temperature, 42° C., 68° C., etc., and the concentration of destabilizing agents such as formamide or denaturing agents such as SDS, but also on factors such as the length of the nucleic acid sequence, base composition, percent mismatch between hybridizing sequences and the frequency of occurrence of subsets of that sequence within other non-identical sequences. Thus, high, moderate or low stringency conditions may be determined empirically.
- By varying hybridization conditions from a level of stringency at which no hybridization occurs to a level at which hybridization is first observed, conditions which will allow a given sequence to hybridize with the most similar sequences in the sample can be determined. Exemplary conditions are described in Krause (1991) Methods in Enzymology, 200:546-556. Washing is the step in which conditions are usually set so as to determine a minimum level of complementarity of the hybrids. Generally, starting from the lowest temperature at which only homologous hybridization occurs, each degree (° C.) by which the final wash temperature is reduced, while holding SSC concentration constant, allows an increase by 1% in the maximum extent of mismatching among the sequences that hybridize. Generally, doubling the concentration of SSC results in an increase in Tm. Using these guidelines, the washing temperature can be determined empirically for high, moderate or low stringency, depending on the level of mismatch sought. Exemplary high stringency conditions include, but are not limited to, hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37° C., and a wash in 0.1×SSC at 60° C. Example of progressively higher stringency conditions include, after hybridization, washing with 0.2×SSC and 0.1% SDS at about room temperature (low stringency conditions); washing with 0.2×SSC, and 0.1% SDS at about 42° C. (moderate stringency conditions); and washing with 0.1×SSC at about 68° C. (high stringency conditions). Washing can be carried out using only one of these conditions, e.g., high stringency conditions, washing may encompass two or more of the stringency conditions in order of increasing stringency. Optimal conditions will vary, depending on the particular hybridization reaction involved, and can be determined empirically.
- Equivalent conditions can be determined by varying one or more of the parameters given as an example, as known in the art, while maintaining a similar degree of identity or similarity between the target nucleic acid molecule and the primer or probe used. Hybridizable nucleotide sequences are useful as probes and primers for identification of organisms comprising a nucleic acid of the invention and/or to isolate a nucleic acid of the invention, for example. The term “primer” is used herein as it is in the art and refers to a single-stranded oligonucleotide which acts as a point of initiation of template-directed DNA synthesis under appropriate conditions in an appropriate buffer and at a suitable temperature. The appropriate length of a primer depends on the intended use of the primer, but typically ranges from about 15 to about 30 nucleotides. Short primer molecules generally require cooler temperatures to form sufficiently stable hybrid complexes with the template. A primer need not reflect the exact sequence of the template, but must be sufficiently complementary to hybridize with a template. The term “primer site” refers to the area of the target DNA to which a primer hybridizes. The term “primer pair” refers to a set of primers including a 5′ (upstream) primer that hybridizes with the 5′ end of the DNA sequence to be amplified and a 3′ (downstream) primer that hybridizes with the complement of the 3′ end of the sequence to be amplified.
- The present invention also relates to host cells containing the above-described constructs. The host cell can be a eukaryotic cell, such as a plant cell or yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell. The host cell can be stably or transiently transfected with the construct. The polynucleotides may be introduced alone or with other polynucleotides. Such other polynucleotides may be introduced independently, co-introduced or introduced joined to the polynucleotides of the invention. As used herein, a “host cell” is a cell that normally does not contain any of the nucleotides of the present invention and contains at least one copy of the nucleotides of the present invention. Thus, a host cell as used herein can be a cell in a culture setting or the host cell can be in an organism setting where the host cell is part of an organism, organ or tissue.
- If a prokaryotic expression vector is employed, then the appropriate host cell would be any prokaryotic cell capable of expressing the cloned sequences. Suitable prokaryotic cells include, but are not limited to, bacteria of the genera Escherichia, Bacillus, Pseudomonas, Staphylococcus, and Streptomyces.
- If a eukaryotic expression vector is employed, then the appropriate host cell would be any eukaryotic cell capable of expressing the cloned sequence. In one embodiment, eukaryotic cells are the host cells. Eukaryotic host cells include, but are not limited to, insect cells, HeLa cells, Chinese hamster ovary cells (CHO cells), African green monkey kidney cells (COS cells), human 293 cells, and murine 3T3 fibroblasts.
- In addition, a yeast cell may be employed as a host cell. Yeast cells include, but are not limited to, the genera Saccharomyces, Pichia and Kluyveromyces. In one embodiment, the yeast hosts are S. cerevisiae or P. pastoris. Yeast vectors may contain an origin of replication sequence from a 2T yeast plasmid, an autonomously replication sequence (ARS), a promoter region, sequences for polyadenylation, sequences for transcription termination and a selectable marker gene. Shuttle vectors for replication in both yeast and E. coli are also included herein.
- Introduction of a construct into the host cell can be affected by calcium phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, electroporation, transduction, infection or other methods.
- Other examples of methods of introducing nucleic acids into host organisms take advantage TALEN technology to effectuate site-specific insertion of nucleic actions. TALENs are proteins that have been engineered to cleave nucleic acids at a specific site in the sequence. The cleavage sites of TALENs are extremely customizable and pairs of TALENs can be generated to create double-stranded breaks (DSBs) in nucleic acids at virtually any site in the nucleic acid. See Bogdanove and Voytas, Scienc, 333:1843-1846 (2011), which incorporated by reference herein
- Transformants carrying the expression vectors are selected based on the above-mentioned selectable markers. Repeated clonal selection of the transformants using the selectable markers allows selection of stable cell lines expressing the fusion proteins constructs. Increased concentrations in the selection medium allows gene amplification and greater expression of the desired fusion proteins. The host cells, for example E. coli cells, containing the recombinant fusion proteins can be produced by cultivating the cells containing the fusion proteins expression vectors constitutively expressing the engineered proteins constructs.
- The present invention also provides for transgenic organisms, including but not limited to plants, animals and prokaryotic organisms. The transgenic animals, all of whose germ and somatic cells would contain the DNA construct of the invention, would include vertebrates in general. Examples of transgenic animals include but are not limited to mammals such as non-human primates, mice, sheep, pigs, cattle, goats, guinea pigs, rodents, e.g., rats, and the like. The term transgenic animal also includes animals in all stages of development, including embryonic and fetal stages.
- Such transgenic animals may be obtained, for example, by injecting the DNA constructs of the present invention into a fertilized egg which is allowed to develop into an adult animal. To prepare a transgenic animal, a few hundred DNA molecules are injected into the pro-nucleus of a fertilized one cell egg. The micro injected eggs are then transferred into the oviducts of pseudopregnant foster mothers and allowed to develop. It has been reported by Brinster et al., Proc. Natl. Acad. Sci. USA 82:4438-4442 (1985), that about 25% of mice which develop will inherit one or more copies of the micro injected DNA. Alternatively, the transgenic animals may be obtained by utilizing recombinant ES cells for the generation of the transgenes, as described by Gossler et al., Proc. Natl. Acad. Sci. USA 83:9065-9069 (1986). The offspring may be analyzed for the integration of the transgene by isolating genomic DNA from tail tissue and the fragment coding for the engineered proteins identified by conventional DNA-hybridization techniques (Southern, J. Mol. Biol. 98:503-517 (1975)). Animals positive for the nucleic acid encoding the engineered protein are further bred to expand the colonies of transgenic mice. General and specific examples of methods of preparing transgenic animals are disclosed in U.S. Pat. Nos. 5,602,299, 5,366,894, 5,464,758, 5,569,827, WO96/40896 (U.S. application Ser. No. 08/480,653); WO96/40895 (U.S. application Ser. Nos. 08/486,018 and 08/486,536); WO93/14200 (U.S. application Ser. Nos. 07/817,584 and 07/915,469); WO95/03397 (U.S. application Ser. No. 08/096,944); WO95/25792 (U.S. application Ser. No. 08/215,083);
EP 0 717 105 (U.S. application Ser. No. 08/358,627); and Hogan et al., Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory, Cold Spring Harbor, N. Y., 1986); Hammer et al., Cell 63:1099-1112 (1990). Once obtained, the transgenic animals which contain the nucleic acid encoding the engineered protein may be analyzed by immunohistology for evidence of the engineered protein expression. - The present invention also provides for transgenic plants or plant tissue comprising transgenic plant cells, i.e. comprising stably integrated into their genome, an above-described nucleic acid molecule, expression cassette or vector of the invention. The present invention also provides transgenic plants, plant cells or plant tissue obtainable by a method for their production as outlined below.
- In one embodiment, the present invention provides a method for producing transgenic plants, plant tissue or plant cells comprising the introduction of a nucleic acid molecule, expression cassette or vector of the invention into a plant cell and, optionally, regenerating a transgenic plant or plant tissue therefrom. The transgenic plants expressing the engineered protein can be of use in monitoring the transport or movement of analytes throughout and between the organs of an organism, such as to or from the soil. The transgenic plants expressing transporters of the invention can be of use for investigating metabolic or transport processes of, e.g., organic compounds with a timely and spatial resolution.
- Examples of species of plants that may be used for generating transgenic plants include but are not limited to monocotyledonous plants including seed and the progeny or propagules thereof, for example Lolium, Zea, Triticum, Sorghum, Triticale, Saccharum, Bromus, Oryzae, Avena, Hordeum, Secale and Setaria. Especially useful transgenic plants are maize, wheat, barley plants and seed thereof. Dicotyledenous plants are also within the scope of the present invention include but are not limited to the species Fabaceae, Solanum, Brassicaceae, especially potatoes, beans, cabbages, forest trees, roses, clematis, oilseed rape, sunflower, chrysanthemum, poinsettia and antirrhinum (snapdragon). The plant may be crops, such as a food crops, feed crops or biofuels crops. Exemplary important crops may include soybean, cotton, rice, millet, sorghum, sugarcane, sugar beet, tomato, grapevine, citrus (orange, lemon, grapefruit, etc), lettuce, alfalfa, fava bean and strawberries, rapeseed, cassava, miscanthus and switchgrass to name a few.
- Methods for the introduction of foreign nucleic acid molecules into plants are well-known in the art. For example, plant transformation may be carried out using Agrobacterium-mediated gene transfer, microinjection, electroporation or biolistic methods as it is, e.g., described in Potrykus and Spangenberg (Eds.), Gene Transfer to Plants. Springer Verlag, Berlin, New York, 1995. Therein, and in numerous other references, useful plant transformation vectors, selection methods for transformed cells and tissue as well as regeneration techniques are described which are known to the person skilled in the art and may be applied for the purposes of the present invention.
- In another aspect, the invention provides harvestable parts and methods to propagation material of the transgenic plants according to the invention which contain transgenic plant cells as described above. Harvestable parts can be in principle any useful part of a plant, for example, leaves, stems, fruit, seeds, roots etc. Propagation material includes, for example, seeds, fruits, cuttings, seedlings, tubers, rootstocks etc.
- The present invention also provides methods of producing any of the engineered proteins of the present invention, the method comprising culturing a host cell in conditions that promote protein expression and recovering the engineered protein from the culture, wherein the host cell comprises a vector encoding the engineered protein.
- The protein production methods generally comprise culturing the host cells of the invention under conditions such that the engineered protein is expressed, and recovering said protein. The culture conditions required to express the proteins of the current invention are dependent upon the host cells that are harboring the polynucleotides of the current invention. The culture conditions for each cell type are well-known in the art and can be easily optimized, if necessary. For example, a nucleic acid encoding an engineered protein of the invention, or a construct comprising such nucleic acid, can be introduced into a suitable host cell by a method appropriate to the host cell selected, e.g., transformation, transfection, electroporation, infection, such that the nucleic acid is operably linked to one or more expression control elements as described herein. Host cells can be maintained under conditions suitable for expression in vitro or in vivo, whereby the encoded engineered protein is produced. For example host cells may be maintained in the presence of an inducer, suitable media supplemented with appropriate salts, growth factors, antibiotic, nutritional supplements, etc., which may facilitate protein expression. In additional embodiments, the engineered proteins of the invention can be produced by in vitro translation of a nucleic acid that encodes the engineered protein, by chemical synthesis or by any other suitable method. If desired, the engineered protein can be isolated from the host cell or other environment in which the protein is produced or secreted. It should therefore be appreciated that the methods of producing the engineered proteins encompass expression of the polypeptides in a host cell of a transgenic plant. See U.S. Pat. Nos. 6,013,857, 5,990,385, and 5,994,616.
- The invention also provides for methods of monitoring analyte movement in a sample, comprising contacting the sample with an engineered protein of the present invention and subsequently measuring the fluorescent signal, which may or may not include FRET. Accordingly, the engineered proteins can be used in sensors for measuring a target analytes in a sample, with the sensors comprising the engineered proteins of the present invention.
- The target analytes can be any molecule or compound where the movement or concentration is desired to be measured or monitored. Whichever molecule or ion that the transporter peptide normally transports is capable of being monitored or measured. For example, if ammonium transporter is being engineered, then ammonium can be monitored or measured. Examples of classes of analytes that might be measured, depending on the transporter protein being engineered, include, but are not limited to amino acids, peptides, polypeptides, proteins, carbohydrates, lipids, nucleotides, oligonucleotides, polynucleotides, glycoproteins or proteoglycans, lipoproteins, lipopolysaccharides, drugs, drug metabolites, small organic molecules, inorganic molecules and natural or synthetic polymers. As used herein, “carbohydrate” includes, but is not limited to monosaccharides, disaccharides, oligosaccharides and polysaccharides. “Carbohydrate” also includes, but is not limited to, molecules comprising carbon, hydrogen and oxygen that do not fall within the traditional definition of a saccharide—i.e., an aldehyde or ketone derivative of a straight chain polyhydroxyl alcohol, containing at least three carbon atoms. Thus, for example, a carbohydrate may contain fewer than three carbon atoms. As used herein, the term “lipid” is used it is in the art, i.e., substances of biological origin that are made up primarily or exclusively of nonpolar chemical groups such that they are readily soluble in most organic solvents, but only sparingly soluble in aqueous solvents. Examples of lipids include, but are not limited to, fatty acids, triacylglycerols, glycerophospholipids, sphingolipids, cholesterol, steroids and derivatives thereof. For example, “lipids” include but are not limited to, the ceramides, which are derivatives of sphingolipids and derivatives of ceramides, such as sphingomyelins, cerebrosides and gangliosides. “Lipids” also include, but are not limited to, the common classes of glycerophospholipds (or phospholipids), such as phosphatidic acid, phosphatidylethanolamine, phosphatidylcholine, phosphatidylserine, phosphatidylinositol, phosphatidylglycerol and the like. As used herein, a “drug” can be a known drug or a drug candidate, whose activity or effects on a particular cell type are not yet known. A “drug metabolite” is any of the by-products or the breakdown products of a drug that is changed chemically into another compound or compounds. As used herein, “small organic molecule” includes, but is not limited to, an organic molecule or compound that does not fit precisely into other classifications highlighted herein.
- The engineered proteins of the current invention can be used to assess or measure the concentrations of more than one target analyte. As used herein, concentration is used as it is in the art. The concentration may be expressed as a qualitative value, or more likely as a quantitative value. As used herein, the quantification of the analytes can be a relative or absolute quantity. Of course, the quantity (concentration) of any of the analytes may be equal to zero, indicating the absence of the particular analyte sought. The quantity may simply be the measured signal, e.g., fluorescence, without any additional measurements or manipulations. Alternatively, the quantity may be expressed as a difference, percentage or ratio of the measured value of the particular analyte to a measured value of another compound including, but not limited to, a standard or another analyte. The difference may be negative, indicating a decrease in the amount of measured analyte(s). The quantities may also be expressed as a difference or ratio of the analyte(s) to itself, measured at a different point in time. The quantities of analytes may be determined directly from a generated signal, or the generated signal may be used in an algorithm, with the algorithm designed to correlate the value of the generated signals to the quantity of analyte(s) in the sample.
- The engineered proteins of the current invention are designed to possess capabilities of continuously measuring the concentrations an analyte. As used herein, the term “continuously,” in conjunction with the measuring of an analyte, is used to mean the engineered protein either generates or is capable of generating a detectable signal at any time during the life span of the engineered protein. The detectable signal may be constant in that the engineered protein is always generating a signal, even if the signal is not detected. Alternatively, the engineered protein may be used episodically, such that a detectable signal may be generated, and detected, at any desired time.
- While not a requirement of the present invention, the engineered proteins are particularly useful in an in vivo setting for measuring target analytes as they occur or appear in a host organism or tissue or cell. As such, the target analytes need not be labeled. Of course, unlabeled target analytes may also be measured in an in vitro or in situ setting as well. In another embodiment, the target analytes may be labeled. Labeled target analytes can be measured in an in vivo, in vitro or in situ setting.
- The samples would require minimal processing, thus the engineered proteins allow high-throughput measurements in complex samples using an appropriate plate fluorometer (e.g. TECAN M1000). This type of analysis can be used to measure the analyte content in different tissues, different individuals or different populations of, for example, crop plants experiencing drought. Purification of bulk amounts of engineered proteins can be achieved after expression in Pichia pastoris, using pPinkFLIP vectors and a protease deficient strain of Pichia.
- The examples herein are provided for illustrative purposed and are not intended to limit the scope of the invention in any way.
- Crystal structures of AMTs from Escherichia coli and Archaeoglobus fulgidus did not reveal obvious conformational changes, leading to the proposition that AMTs are rigid gas channels. Other studies, however, challenged this hypothesis by demonstrating an allosteric feedback inhibition of transport activity by phosphorylation of residues in the trans-activating cytosolic C-terminus. Thus the mode of operation of these transporters was entirely unclear and unsettled.
- To engineer a transporter that reports substrate-dependent changes in conformation, conformation-sensitive fluorescent proteins (FP) was inserted into Arabidopsis thaliana ammonium transporter (AMT1; 3) (
FIGS. 1A-1F and 4A-4B ). Monomeric teal fluorescent protein (mTFP), yellow fluorescent protein (YFP) Venus or a modified circularly permuted GFP (mcpGFP) were inserted into intracellular loops of AMT1; 3. The constructs were inserted in the yeast expression vector pDRf1-GW, containing the f1 replication origin, GATEWAY™ cassette, PMA1 promoter fragment, ADH terminator, and the URA cassette for selection in yeast. The Xbal restriction site (tctaga) was inserted in different positions of AtAMT1; 3 (after 233, 312, 364 and 448) via Kunkel mutagenesis.amino acids - mcpGFP was generated by amplifying the domains of EGFP corresponding to amino acids 150-239 and 1-144 with the primers:
-
EGFP-150-for: (SEQ ID NO: 1) aacgtctatatcatggcc EGFP-239-rev: (SEQ ID NO: 2) ttttt accggt accacc cttgtacagctcgtcca EGFP-1-for: (SEQ ID NO: 3) ttttt accggt ggatct atggtgagcaagggcg EGFP-144-rev: (SEQ ID NO: 4) agttgtactccagcttgtgc - Primers EGFP-239-rev and EGFP-1-for contained an Agel restriction site (in bold) and additional nucleotides coding for Gly-Gly and Gly-Ser, respectively (underlined). The two amplified bands were gel-purified by a commercial kit (Machery-Nagel), digested by Agel (New England Biolabs) and ligated by T4 DNA ligase (New England Biolabs). The resulting cpGFP, where the domains 150-239 and 1-144 were connected by the linker coding GGTGGS (SEQ ID NO: 32), was cloned into a pGEM-Teasy (Promega). The additional internal mutations M66K, V76A, S88G, D93Y, T116V, A119K, V2511, which were known to improve stability of cpGFP, were introduced by Kunkel mutagenesis.
- Finally, the mcpGFP was amplified with the primers below and contained the Xbal restriction site (in bold):
-
cpGFP-for: (SEQ ID NO: 5) ttttt tctaga aacgtctatatcatggcc cpGFP-rev: (SEQ ID NO: 6) ttttt tctaga agttgtactccagcttgtgc - Similarly, mTFP and Venus were amplified with primers containing the Xbal site:
-
mTFP-for: (SEQ ID NO: 7) ttttt tctaga atggtgagcaagggcgagg mTFP-rev: (SEQ ID NO: 8) ttttt tctaga cttgtacagctcgtccatg Venus-for: (SEQ ID NO: 9) ttttt tctaga aagggcgaggagctgttca Venus-rev: (SEQ ID NO: 10) ttttt tctaga cttgtacagctcgtccatg - The purified mcpGFP, mTFP and Venus encoding fragments were digested by Xbal (New England Biolabs) and ligated into digested pDRf1-GW vectors containing AtAMT1; 3, to generate the fusion constructs AtAMT1; 3-mcpGFP, AtAMT1; 3-TFP and AtAMT1; 3-Venus in
233, 312, 364 and 448 (positions FIGS. 4A-4B ). - To vary the linker regions between mcpGFP and AtAMT1; 3 in position 233 (
FIGS. 6A-6B ), homologous recombination was employed in the yeast between two DNA fragments sharing sequence homology. The yeast were co-transformed with the pDR-AtAMT1; 3 opened atposition 233 by Xbal digestion, and the mcpGFP fragments amplified by PCR with the primers: -
cpGFP-for-SH: (SEQ ID NO: 11) ggtcctcgtcgtggtcggttcgagaaa tctcat aacgtctatatcaag cGFP-for-GGSH: (SEQ ID NO: 12) ggtcctcgtcgtggtcggttcgagaaa ggtggttctcat aacgtctatatcaag cGFP-for-GGGGSH: (SEQ ID NO: 13) ggtcctcgtcgtggtcggttcgagaaa ggtggtggtggttctcat aacgtctatatcaag cGFP-for-LE: (SEQ ID NO: 14) ggtcctcgtcgtggtcggttcgagaaa ctcgag aacgtctatatcaag cGFP-for-GGLE: (SEQ ID NO: 15) ggtcctcgtcgtggtcggttcgagaaa ggtggtctcgag aacgtctatatcaag cGFP-for-GGGGLE: (SEQ ID NO: 16) ggtcctcgtcgtggtcggttcgagaaa ggtggtggtggtctcgag aacgtctatatcaag cGFP-rev: (SEQ ID NO: 17) gtggccgcgcagagcaatagcgcgaccacc gttgtactccagcttg cGFP-rev-GG: (SEQ ID NO: 18) gtggccgcgcagagcaatagcgcgaccacc tcctcc gttgtactccagcttg cGFP-rev-FN: (SEQ ID NO: 19) gtggccgcgcagagcaatagcgcgaccacc attaaa gttgtactccagcttg cGFP-rev-TR: (SEQ ID NO: 20) gtggccgcgcagagcaatagcgcgaccacc tcttgt gttgtactccagcttg - The amplification products contained the mcpGFP flanked by the variable linker sequences (underlined) and about 30 by homologous to the region around the
insertion point 233 of AtAMT1; 3 (in bold). The transformed yeast contained the pDR-AMT-mcpGFP vectors resulting from insertion of the ˜800 by mcpGFPs with linkers into the vector backbone, as confirmed by DNA sequencing. - Homologous recombination was also used to generate variants of the linker preceding mcpGFP (
FIGS. 14A-14B ). For this case, mcpGFP was amplified with the primers below (where N indicates any nucleotide). -
cGFP-for-deg: (SEQ ID NO: 21) ggtcctcgtcgtggtcggttcgagaaa NNNNNN aacgtctatatcaag cGFP-rev-FN: (SEQ ID NO: 22) gtggccgcgcagagcaatagcgcgaccacc attaaa gttgtactccagcttg - Homologous recombination was also used to insert the mcpGFP in different positions along L5-6 (
FIGS. 9A-9C ) and to generate deletions in the loop. In this case, the 30 bp-long regions of the primers that overlapped the AtAMT1; 3 sequence flanked the different insertion points (228 to 236) and contained the appropriate deletions. - Point mutations for inactivation of AtAMT1; 3 and AmTrac were generated by we used Kunkel mutagenesis (D202N, G460D, T464D;
FIGS. 2A-2F and 11A-11B ). - The functionality of the constructs was tested by complementation of ammonium uptake in a yeast mutant lacking endogenous ammonium transporters. Yeast strains 31019b [mep1Δ mep2Δ::LEU2 mep3Δ::KanMX2 ura3], a strain in which all three endogenous MEP ammonium transporter genes had been deleted, and its parental strain 23344c [ura3], were transformed using the lithium acetate method and selected on solid YNB (minimal yeast medium without nitrogen; Difco) supplemented with 3% glucose and 1 mM arginine. Single colonies were grown in 5 mL liquid YNB supplemented with 3% glucose and 0.1% proline under agitation (230 rpm) at 30° C. until OD600 nm ˜0.8. The liquid cultures were diluted 10−1, 10−2, 10−3, 10−4, 10−5 and 10−6 in water and 5 μL of each dilution were spotted on solid YNB medium buffered with 50 mM MES/Tris, pH 5.2 and supplemented with 3% glucose and either NH4Cl, (NH4)2SO4 or 1 mM arginine as the sole nitrogen source.
- After 3d of incubation at 30° C., cell growth was documented by scanning the plate at 300 dpi in grayscale mode. For fluorimetric analyses, yeast cultures were washed twice in 50 mM MES buffer, pH 6.0, and resuspended to OD600 nm˜0.5 in MES buffer supplemented with 5% glycerol to delay cell sedimentation. Fluorescence was measured by a fluorescence plate reader (Safire, Tecan), in bottom reading mode using a 7.5 nm bandwidth for both excitation and emission.
- Yeast cultures were washed twice in 50 mM MES buffer, pH 6.0, and resuspended to OD600 nm˜0.5 in MES buffer supplemented with 5% glycerol to delay cell sedimentation. Fluorescence was measured by a fluorescence plate reader (Safire, Tecan), in bottom reading mode using a 7.5 nm bandwidth for both excitation and emission. To measure fluorescence response to substrate addition, 50 μL of substrate (dissolved in water as 500% stock solution) were added to 200 μL of cells in a 96-well plate (Greiner). For the spectral recordings of AmTrac (
FIGS. 4A-4B and 7 ), excitation scans were performed at emission wavelength of 520 nm; emissions scans were recorded with excitation at 488 nm. In all other cases, fluorescence was measured as emission at 513 nm using excitation at 488 nm. Response data are presented as (Fwater−Ftreatment)/Ftreatment. - AMT1; 3 was extremely sensitive to any manipulation within the loops 7-8 and 9-10 or the cytosolic C-terminus (
FIGS. 4A-4B ). However, modification of loop 5-6 (L5-6, position 233) by insertion of either two amino acid residues (encoded by the restriction site Xbal) or mcpGFP was tolerated (FIGS. 4A-4B ). The L5-6 is located between the two pseudo-symmetric halves, i.e., between the two structural inverted repeats, of the protein and connects two transmembrane helices (TMH-V and -VI) that contain residues postulated to be directly involved in recruitment, gating and substrate translocation (FIG. 5 ). - In initial tests, addition of ammonium to yeast cells expressing the AMT-mcpGFP fusion did not lead to detectable changes in fluorescence intensity (FI;
FIG. 4A-4B ). Providing a linker connecting AMT and mcpGFP permitted detectable changes in FI. One specific linker variant demonstrated a strong response to 1 mM NH4Cl (40% FI change) (FIGS. 1A-1F, 6A-6B, 7 and 8 ). The position of the mcpGFP insertion in L5-6 and the loop-linker length affected changes in FI (FIGS. 9A-9C and 10 ), although the placement of the mcpGFP at several places in the L5-6 loop were effective. Importantly, all the variants that responded to ammonium with a FI change were functional transporters, indicating that the various constructs were able to accurately report transport activity (FIGS. 9A-9C and 10 ). - To further corroborate the correlation between transport and ammonium-induced FI change, mutations (D202N, G460D, T464D) known to inactivate AMT1 transporters were introduced into AmTrac. Confocal sections of yeast cells expressing the sensors (
FIGS. 2A-2F ) were acquired on an inverted confocal laser scanning microscope (SP5, Leica). To record fluorescence intensities in single cells over time, yeast cells were trapped as a single cell layer in a microfluidic perfusion system (Y04C plate, Onyx, Cellasic) and perfused with either 50 mM MES buffer, pH 6.0, or buffer supplemented with NH4Cl. The setup was imaged at a spinning disk confocal microscope (Yokogawa CSU-X1; Leica DMI6000) equipped with a motorized stage (ASI). Fluorescence was excited by a solid state laser at 488 nm; emission was detected using a 525/50 nm filter set (Semrock) and an electron multiplying charge coupled device (EMCCD) camera (Evolve, Photometrics). Measurements were taken every 2 min, with 100 ms exposure time using Slidebook 5.0 image acquisition software (Intelligent Imaging Innovations). To account for lateral shift during imaging, the image stacks were post-registered using the StackReg plugin for ImageJ. Fluorescence pixel intensity was quantified using Fiji software; single cells were selected and analyzed with the help of the ROI manager tool. Kymograph analysis was performed using the MultipleKymograph plugin for ImageJ (Rietdorf and Seitz) by measuring pixel intensities over time along a 3 pixel wide line. - While these mutations did not affect plasma membrane localization of one of the constructs, named “AmTrac” (
FIGS. 2A-2F ), both transport activity and fluorescence response were abolished (FIGS. 2A-2F and 11 ), demonstrating that transport activity was necessary for the FI response of AmTrac. - Previous studies have shown that the C-terminus of AMT1 acts as a trans-activation domain in the trimeric AMT1 complex. Mutations in the cytosolic C-terminus block AMT1 activity, which can be restored by suppressor mutations either in the cytosolic loops or in the pore region. A saturating multicopy suppressor screen with the inactive mutant AmTrac-T464D identified eight gain-of-function mutations (
FIGS. 2A-2F, 12A-12C and 13A-13C ), seven in the pore region and one pseudo-reversion, D464V. The extent to which the suppressors were able to restore transport activity, as measured by growth, correlated highly with the FI response (R2=0.72) (FIGS. 2A-2F ), giving direct proof of the strict link between transport activity and FI response. - To test the reversibility of the sensor response, the FI of single cells in microfluidic chambers was analyzed after withdrawal of ammonium. AmTrac responses were detectable in single cells, were concentration-dependent, and were readily reversible, demonstrating that AmTrac-LS can be used in vivo to measure transport of ammonium and to observe conformational change kinetics of the chimeric transporter in response to substrate availability (
FIGS. 1A-1F ). To create sensors with enhanced SNR, residues were randomly substituted into the two-amino acid linker directly preceding mcpGFP. Mutant versions of the AmTrac that maintained transport activity were selected, and each showed both high FI and high ammonium-induced FI response. Interestingly, the majority of the brightest variants carried a Ser residue instead of a Glu in the position immediately preceding the mcpGFP insertion (FIGS. 14A-14B ). AmTrac-LS, a sensor with a Leu-Ser linker, was as responsive to ammonium as AmTrac (˜40% FI change), yet it was 517% brighter (FIGS. 14A-14B ). AmTrac-LS responses were detectable in single cells, were concentration-dependent, and were readily reversible, demonstrating that AmTrac-LS can be used in vivo to measure transport of ammonium and to observe conformational change kinetics of the chimeric transporter in response to substrate availability (FIG. 15 ). - The fluorescence response of AmTrac transporter demonstrates that AMT1; 3 undergoes conformational changes during the transport cycle. Interestingly, the pseudo-symmetry of AMT with an inverted repeat of five TMH is similar to that of the LeuT transporter. LeuT carries a substrate-binding site at the interface of the two repeats and undergoes a transport cycle involving multiple states. Similarly, the transport pore of the AMTs is located between the two pseudo-symmetric halves (or structural inverted repeats), with TMH-V and -VI (connected by L5-6) carrying key residues for ammonium translocation.
- To test the functionality of AmTrac in systems other than yeast, AmTrac was expressed in Arabidopsis plants stably transformed via Agrobacterium by the method of floral dip, by using the vector pEARLEY100 (Earley et al. 2006). AmTrac was expressed under control of the strong promoter 35S. Seeds of heterozygous Arabidopsis were surface-sterilized and germinated in growth medium deprived of ammonium in 16 h-8 h light-dark conditions. After 5 days, seedlings were transferred in the microfluidic perfusion system RootChip and grown in liquid medium deprived of ammonium for additional 24 h. Roots and root hair from the seedling were then analyzed by inverted confocal microscopy (Yokogawa CSU-X1) and roots were perfused with 50 mM MES buffer or a square pulse of 1 mM ammonium chloride in buffer for 30 minutes. Fluorescence was excited by a solid state laser at 488 nm; emission was detected using a 525/50 nm filter set (Semrock) and an electron multiplying charge coupled device (EMCCD) camera (Evolve, Photometrics).
FIG. 16 shows that the engineered proteins were functional in Arabidopsis root hairs. Fluorescence pixel intensity from single root hairs was quantified using Fiji software. - To show that the strategy of creating sensors by inserting fluorophores in transmembrane proteins is not limited to AtAMT1; 3, the yeast MEP2 was also used as scaffold for creating a new sensor (MEPtrac). MEP2 is only 44% identical to AMT1; 3, demonstrating that the invention is not limited by the amino acid sequence of the transporters. Notably, MepTrac was constructed in a single step based on the fluorophore insertion point determined in AMT as well as using the optimized linkers developed for AmTrac. Structural and phylogenetic studies have shown that members of the AMT/MEP/RH (AMT/MEP/RH superfamily (PFAM # PF00909) family have highly similar structures and that regulatory mechanisms are highly conserved. The one step conversion of MEP2, which is evolutionary very distant, but structurally highly conserved, into an engineered protein that is responsive to analyte movement through the sensor demonstrates that transporter proteins with a structural inverted repeat can be converted into sensors using the methods provided herein. The mcpGFP was inserted in the middle of
loop 5 of MEP2 (after amino acid 217) and connected by the linkers coding for amino acids LS (preceding mcpGFP) and FN (following mcpGFP) by using overlapping PCR cloning. The functionality of MEPtrac was tested by growth complementation of the of Δmep1,2,3 yeast strain on ammonium and the response of the sensor was tested by fluorimetric analysis after addition of different concentrations of ammonium chloride or sodium chloride as a control. SeeFIGS. 17A-17B . - Fluorescence lifetime was also measured in AmTrac variations. Briefly, cells were cultivated with proline as sole source of nitrogen. Before measurements, the cells were washed 3× and resuspended in 50 mM MES,
pH 6+5% Glycerol. The total volume in each cuvette was 2.5 mL (2 ml cells of OD=0.5+0.5ml 5 mM NH4Cl). Measurements were performed for 5 minutes each. After addition of NH4Cl, sufficient time was given, to achieve saturation of the sensor. As control the same concentration of NaCl was used. Lifetime decays were recorded on a FL920 spectrometer (Edinburgh Instruments, UK), for 5 minutes each, using 3.5 mL silica cuvettes (Hellma Analytics). Excitation and emission were set to 475 nm and 514 nm, respectively. Graphs were created by the software Origin (OriginLab).FIG. 18A is a plot of fluorescence lifetime decays, plotted with a logarithmic scale, and shows a single exponential decay for all the variants tested with and without addition of saturating ammonium concentrations. While the lifetime was not altered, the FI decrease triggered by ammonium addition was visible.FIG. 18B shows normalized fluorescence lifetime decays of all the variants tested with and without the addition of ammonium. Data were plotted to show that lifetime decays in a similar manner after addition of ammonium. - Using a monoexponential function, the single fluorescence decay times can be calculated according to the formula below. The results are presented in Table IV, which shows that the decay times remain the same for each sensor tested.
-
γ=γ0 +A 1·exp(−t/τ) - γ0=background (noise)
A1=amplitude (max Int.)
τ=fluorescence decay. -
TABLE IV Variant y0 A1 τ [ns] AmTrac-GS 3.72 1607.22 2.5 AmTrac-GS + 3.61 825.40 2.5 1 mM NH4Cl AmTrac-LS 3.74 810.61 2.4 AmTrac-LS + 3.68 351.62 2.4 1 mM NH4Cl AmTrac-IS 3.57 664.76 2.4 AmTrac-IS + 3.71 250.97 2.4 1 mM NH4Cl - The measurement of flux through a transporter by reporting state changes, as shown here, is applicable to other transporters or enzymes for monitoring in vivo fluxes, e.g., in the context of neurotransmitter transport, for transporters that have structural inverted repeats as the AMT protein does.
- Time-resolved anisotropy was also measured in intact yeast cells expressing AmTrac-GS to elucidate the relevance of homo-FRET in the sensor mechanism. VV—(vertical excitation and vertical emission) and VH—(vertical excitation and horizontal emission) polarized decay curves with and without the addition of 1 mM NH4Cl were recorded for 15 minutes each on a FL920 spectrometer (Edinburgh Instruments, UK) using 3.5 mL silica cuvettes (Hellma Analytics). Excitation and emission were set to 475 nm and 514 nm, respectively. Graphs were created by the software Origin (OriginLab).
FIG. 19A shows the single exponential decay of AmTrac-GS's lifetime which is not altered after the addition of ammonium. The lack of an additional fast depolarization in the VV- and VH-decays leads to the conclusion, that homo-FRET cannot be detected for the system tested.FIG. 19B shows a similar decay in anisotropy decay curves for AmTrac-GS, with and without addition of ammonium, which indicates a similar rotation time of the fluorophore mcpGFP in AmTrac-GS.
Claims (14)
1. An engineered transporter protein comprising at least one fluorescent reporter covalently bound to the transporter protein, with the transporter protein comprising a structural inverted repeat motif comprising a first and second subunit that are structural inverted repeats of one another and that are joined to one another by a polypeptide loop, wherein the at least one fluorescent reporter is covalently bound to the polypeptide loop.
2. The engineered transporter protein of claim 1 , wherein the fluorescent reporter is a fluorescent protein.
3. The engineered transporter protein of claim 2 , wherein the fluorescent protein is selected from the group consisting of green fluorescent protein (GFP) and circular permutated green fluorescent protein (cpGFP)
4. The engineered transporter protein of claim 1 , further comprising at least one linker peptide that links the fluorescent reporter to the polypeptide loop.
5. The engineered transporter protein of claim 4 , comprising two linker peptides that link the fluorescent reporter to the polypeptide loop.
6. The engineered transporter protein of claim 5 , wherein each of the first and second subunits of the structural inverted repeats independently comprises three, four, five, six or seven alpha helices.
7. The engineered transporter protein of claim 6 , wherein the engineered transporter protein is a member of the AMT/MEP/RH superfamily of transporters.
8. A nucleic acid encoding the engineered transporter protein of claim 1 .
9. A vector comprising the nucleic acid of claim 8 .
10. A host cell comprising the vector of claim 9 .
11. The host cell of claim 10 selected from the group consisting of a plant cell and an animal cell.
12. A method of monitoring transport of a target analyte across a cell membrane, the method comprising inserting the engineered transporter protein of claim 1 into the cell membrane to control influx and efflux of the target analyte across the membrane and detecting changes in fluorescence of the fluorescent reporter, wherein a change in fluorescence indicates transport of the target analyte across the membrane through the engineered transporter protein.
13. The method of claim 12 , wherein the cell is a eukaryotic cell.
14. The method of claim 12 , wherein the target analyte is an amino acid.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US14/930,395 US20160116457A1 (en) | 2011-09-18 | 2015-11-02 | Transmembrane proteins as biosensors |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201161536005P | 2011-09-18 | 2011-09-18 | |
| PCT/US2012/055875 WO2013040604A1 (en) | 2011-09-18 | 2012-09-18 | Transmembrane proteins as biosensors |
| US201414345529A | 2014-03-18 | 2014-03-18 | |
| US14/930,395 US20160116457A1 (en) | 2011-09-18 | 2015-11-02 | Transmembrane proteins as biosensors |
Related Parent Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2012/055875 Continuation WO2013040604A1 (en) | 2011-09-18 | 2012-09-18 | Transmembrane proteins as biosensors |
| US14/345,529 Continuation US9176143B2 (en) | 2011-09-18 | 2012-09-18 | Transmembrane protein as biosensors |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20160116457A1 true US20160116457A1 (en) | 2016-04-28 |
Family
ID=47883836
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US14/345,529 Expired - Fee Related US9176143B2 (en) | 2011-09-18 | 2012-09-18 | Transmembrane protein as biosensors |
| US14/930,395 Abandoned US20160116457A1 (en) | 2011-09-18 | 2015-11-02 | Transmembrane proteins as biosensors |
Family Applications Before (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US14/345,529 Expired - Fee Related US9176143B2 (en) | 2011-09-18 | 2012-09-18 | Transmembrane protein as biosensors |
Country Status (2)
| Country | Link |
|---|---|
| US (2) | US9176143B2 (en) |
| WO (1) | WO2013040604A1 (en) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12162921B2 (en) | 2018-06-04 | 2024-12-10 | Howard Hughes Medical Institute | Inverted transporter polypeptides and methods of using |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CA2544688A1 (en) * | 2003-11-06 | 2005-05-26 | Discoverx Corporation | Cellular membrane protein assay |
| WO2008108994A1 (en) * | 2007-03-01 | 2008-09-12 | Carnegie Institution Of Washington | Trp/his exchange and kynurenine induced trp transport |
-
2012
- 2012-09-18 WO PCT/US2012/055875 patent/WO2013040604A1/en not_active Ceased
- 2012-09-18 US US14/345,529 patent/US9176143B2/en not_active Expired - Fee Related
-
2015
- 2015-11-02 US US14/930,395 patent/US20160116457A1/en not_active Abandoned
Also Published As
| Publication number | Publication date |
|---|---|
| US20140356896A1 (en) | 2014-12-04 |
| WO2013040604A1 (en) | 2013-03-21 |
| US9176143B2 (en) | 2015-11-03 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Klima et al. | Incorporation of sensing modalities into de novo designed fluorescence-activating proteins | |
| US6900304B2 (en) | Emission ratiometric indicators of phosphorylation | |
| US8669074B2 (en) | Chimeric phosphorylation indicator | |
| WO2014026136A2 (en) | Protease-resistant systems for polypeptide display and methods of making and using thereof | |
| US20170247769A1 (en) | Biosensors and methods of use | |
| AU2002312149A1 (en) | Emission ratiometric indicators of phosphorylation | |
| CN108164591A (en) | The method of metal ion sensor and detection metal ion | |
| JP4214206B2 (en) | Fluorescent indicator using FRET | |
| IL190706A (en) | Isolated nucleic acid which encodes a sucrose fluorescent indicator, sucrose biosensors and methods of using the same | |
| US20090203035A1 (en) | Fluorescent proteins with increased photostability | |
| JP2009153399A (en) | Single molecule real-time bioluminescence imaging probe | |
| US20160229899A1 (en) | LUCIGEN YELLOW (LucY), A YELLOW FLUORESCENT PROTEIN | |
| US9176143B2 (en) | Transmembrane protein as biosensors | |
| AU2018265070B2 (en) | Genetically encoded potassium ion indicators | |
| US8541558B2 (en) | Fluorescent proteins and uses thereof | |
| JP5148498B2 (en) | Phosphate biosensor and method using the same | |
| WO2013087921A1 (en) | Engineered fluorescent proteins for enhanced fret and uses thereof | |
| JP7324656B2 (en) | Method and kit for predicting drug cardiotoxicity | |
| US12098184B2 (en) | G protein-coupled receptor (GPCR) ligand assay | |
| WO2013087922A1 (en) | Methods for producing engineered fluorescent proteins for enhanced fret, products and uses thereof | |
| Kumaş | Detecting g-protein coupled receptor interactions using enhanced green fluorescent protein reassembly | |
| Ozawa et al. | Inteins for split-protein reconstitutions and their applications | |
| US20150125893A1 (en) | Transporter biosensors | |
| Metterlein et al. | ChromoTek GmbH, Am Klopferspitz 19, 82152 Planegg-Martinried, Germany | |
| Renna | The role of sec24 in protein export from the plant endoplasmic reticulum |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: CARNEGIE INSTITUTION OF WASHINGTON, DISTRICT OF CO Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:FROMMER, WOLF B.;DE MICHELE, ROBERTO;AST, CINDY;SIGNING DATES FROM 20140511 TO 20140718;REEL/FRAME:038602/0900 |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |