CA2369058A1 - Synthetic gene for expressing active retroviral protein in eukaryotes - Google Patents
Synthetic gene for expressing active retroviral protein in eukaryotes Download PDFInfo
- Publication number
- CA2369058A1 CA2369058A1 CA002369058A CA2369058A CA2369058A1 CA 2369058 A1 CA2369058 A1 CA 2369058A1 CA 002369058 A CA002369058 A CA 002369058A CA 2369058 A CA2369058 A CA 2369058A CA 2369058 A1 CA2369058 A1 CA 2369058A1
- Authority
- CA
- Canada
- Prior art keywords
- gene
- protein
- retroviral
- integrase
- gag
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 198
- 108700005078 Synthetic Genes Proteins 0.000 title claims abstract description 102
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 98
- 230000001177 retroviral effect Effects 0.000 title claims abstract description 92
- 241000206602 Eukaryota Species 0.000 title description 5
- 108010061833 Integrases Proteins 0.000 claims abstract description 155
- 230000014509 gene expression Effects 0.000 claims abstract description 97
- 108020004705 Codon Proteins 0.000 claims abstract description 52
- 210000003527 eukaryotic cell Anatomy 0.000 claims abstract description 50
- 102000004190 Enzymes Human genes 0.000 claims abstract description 42
- 108090000790 Enzymes Proteins 0.000 claims abstract description 42
- 108700010070 Codon Usage Proteins 0.000 claims abstract description 26
- 108700008625 Reporter Genes Proteins 0.000 claims abstract description 23
- 230000002255 enzymatic effect Effects 0.000 claims abstract description 23
- 238000001514 detection method Methods 0.000 claims abstract description 19
- 150000001413 amino acids Chemical class 0.000 claims abstract description 12
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 claims abstract description 8
- 230000003834 intracellular effect Effects 0.000 claims abstract description 8
- 108010002459 HIV Integrase Proteins 0.000 claims abstract description 7
- 108010027225 gag-pol Fusion Proteins Proteins 0.000 claims abstract description 7
- 108091005804 Peptidases Proteins 0.000 claims abstract description 5
- 239000004365 Protease Substances 0.000 claims abstract description 5
- 239000002243 precursor Substances 0.000 claims abstract description 5
- 102100034343 Integrase Human genes 0.000 claims abstract 19
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims abstract 3
- 210000004027 cell Anatomy 0.000 claims description 144
- 238000000034 method Methods 0.000 claims description 62
- 239000013598 vector Substances 0.000 claims description 62
- 238000004806 packaging method and process Methods 0.000 claims description 41
- 230000000694 effects Effects 0.000 claims description 38
- 239000002773 nucleotide Substances 0.000 claims description 33
- 125000003729 nucleotide group Chemical group 0.000 claims description 33
- 239000013604 expression vector Substances 0.000 claims description 30
- 239000012634 fragment Substances 0.000 claims description 30
- 108700004026 gag Genes Proteins 0.000 claims description 27
- 239000013612 plasmid Substances 0.000 claims description 27
- 210000004962 mammalian cell Anatomy 0.000 claims description 26
- 101150098622 gag gene Proteins 0.000 claims description 23
- 230000010354 integration Effects 0.000 claims description 23
- 241000700605 Viruses Species 0.000 claims description 19
- 238000001890 transfection Methods 0.000 claims description 18
- 108700004029 pol Genes Proteins 0.000 claims description 14
- 210000000349 chromosome Anatomy 0.000 claims description 13
- 108010089520 pol Gene Products Proteins 0.000 claims description 13
- 239000005089 Luciferase Substances 0.000 claims description 12
- 239000000758 substrate Substances 0.000 claims description 12
- 238000012546 transfer Methods 0.000 claims description 12
- 108060001084 Luciferase Proteins 0.000 claims description 11
- 101150088264 pol gene Proteins 0.000 claims description 11
- 210000001519 tissue Anatomy 0.000 claims description 11
- 230000006870 function Effects 0.000 claims description 9
- 101710177291 Gag polyprotein Proteins 0.000 claims description 8
- 230000001939 inductive effect Effects 0.000 claims description 8
- 229940124524 integrase inhibitor Drugs 0.000 claims description 7
- 239000002850 integrase inhibitor Substances 0.000 claims description 7
- 230000009261 transgenic effect Effects 0.000 claims description 7
- 238000004113 cell culture Methods 0.000 claims description 6
- 101710125418 Major capsid protein Proteins 0.000 claims description 5
- 239000003550 marker Substances 0.000 claims description 5
- 230000000638 stimulation Effects 0.000 claims description 5
- 241000238631 Hexapoda Species 0.000 claims description 4
- 241001465754 Metazoa Species 0.000 claims description 4
- 108091006104 gene-regulatory proteins Proteins 0.000 claims description 4
- 102000034356 gene-regulatory proteins Human genes 0.000 claims description 4
- 230000003115 biocidal effect Effects 0.000 claims description 3
- 229940127089 cytotoxic agent Drugs 0.000 claims description 3
- 239000002254 cytotoxic agent Substances 0.000 claims description 3
- 238000012216 screening Methods 0.000 claims description 3
- 239000003242 anti bacterial agent Substances 0.000 claims description 2
- 101150078890 POLG gene Proteins 0.000 claims 4
- 230000009088 enzymatic function Effects 0.000 claims 3
- 101100224481 Dictyostelium discoideum pole gene Proteins 0.000 claims 2
- 101100224484 Mus musculus Pole gene Proteins 0.000 claims 2
- 101150073374 Pole gene Proteins 0.000 claims 2
- 238000005457 optimization Methods 0.000 claims 2
- 206010059866 Drug resistance Diseases 0.000 claims 1
- 241000124008 Mammalia Species 0.000 claims 1
- 102100034349 Integrase Human genes 0.000 description 125
- 108020004414 DNA Proteins 0.000 description 37
- 108020004999 messenger RNA Proteins 0.000 description 26
- 239000002245 particle Substances 0.000 description 23
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 22
- 208000031886 HIV Infections Diseases 0.000 description 20
- 108700020129 Human immunodeficiency virus 1 p31 integrase Proteins 0.000 description 19
- 241000725303 Human immunodeficiency virus Species 0.000 description 15
- 102220036548 rs140382474 Human genes 0.000 description 15
- 230000003612 virological effect Effects 0.000 description 15
- 108700004025 env Genes Proteins 0.000 description 14
- 238000010361 transduction Methods 0.000 description 14
- 230000026683 transduction Effects 0.000 description 14
- 241000282414 Homo sapiens Species 0.000 description 13
- 238000010276 construction Methods 0.000 description 11
- 230000001965 increasing effect Effects 0.000 description 11
- 241001430294 unidentified retrovirus Species 0.000 description 11
- 238000003556 assay Methods 0.000 description 10
- 238000013461 design Methods 0.000 description 9
- 238000002474 experimental method Methods 0.000 description 9
- 230000035772 mutation Effects 0.000 description 9
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 8
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 8
- 239000012894 fetal calf serum Substances 0.000 description 8
- 108020004707 nucleic acids Proteins 0.000 description 8
- 102000039446 nucleic acids Human genes 0.000 description 8
- 150000007523 nucleic acids Chemical class 0.000 description 8
- 239000002953 phosphate buffered saline Substances 0.000 description 8
- 238000001262 western blot Methods 0.000 description 8
- 229920002873 Polyethylenimine Polymers 0.000 description 7
- 230000001413 cellular effect Effects 0.000 description 7
- 239000000284 extract Substances 0.000 description 7
- 208000015181 infectious disease Diseases 0.000 description 7
- 238000003780 insertion Methods 0.000 description 7
- 230000037431 insertion Effects 0.000 description 7
- 230000007246 mechanism Effects 0.000 description 7
- 239000002609 medium Substances 0.000 description 7
- 230000010076 replication Effects 0.000 description 7
- 238000012360 testing method Methods 0.000 description 7
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 6
- 241000701024 Human betaherpesvirus 5 Species 0.000 description 6
- 101150006655 INS gene Proteins 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- 241000714474 Rous sarcoma virus Species 0.000 description 6
- 230000003197 catalytic effect Effects 0.000 description 6
- 101150030339 env gene Proteins 0.000 description 6
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Chemical compound O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 description 6
- 230000005764 inhibitory process Effects 0.000 description 6
- 230000006798 recombination Effects 0.000 description 6
- 238000005215 recombination Methods 0.000 description 6
- 230000002463 transducing effect Effects 0.000 description 6
- 230000010474 transient expression Effects 0.000 description 6
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 5
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 5
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 5
- 108700026244 Open Reading Frames Proteins 0.000 description 5
- 238000013459 approach Methods 0.000 description 5
- 230000002950 deficient Effects 0.000 description 5
- 239000005090 green fluorescent protein Substances 0.000 description 5
- 230000003993 interaction Effects 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 230000002829 reductive effect Effects 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 241000894006 Bacteria Species 0.000 description 4
- 102000002464 Galactosidases Human genes 0.000 description 4
- 108010093031 Galactosidases Proteins 0.000 description 4
- 102000012330 Integrases Human genes 0.000 description 4
- 241000713666 Lentivirus Species 0.000 description 4
- 101710149951 Protein Tat Proteins 0.000 description 4
- 241000711975 Vesicular stomatitis virus Species 0.000 description 4
- 230000007541 cellular toxicity Effects 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- 210000005260 human cell Anatomy 0.000 description 4
- 238000010166 immunofluorescence Methods 0.000 description 4
- 230000002401 inhibitory effect Effects 0.000 description 4
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 238000003146 transient transfection Methods 0.000 description 4
- 101710205625 Capsid protein p24 Proteins 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- 230000004568 DNA-binding Effects 0.000 description 3
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 3
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 3
- 101100364969 Dictyostelium discoideum scai gene Proteins 0.000 description 3
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 3
- 101100364971 Mus musculus Scai gene Proteins 0.000 description 3
- 239000002033 PVDF binder Substances 0.000 description 3
- 101710177166 Phosphoprotein Proteins 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 3
- 241000714229 Simian retrovirus 1 Species 0.000 description 3
- 101710149279 Small delta antigen Proteins 0.000 description 3
- 108091036066 Three prime untranslated region Proteins 0.000 description 3
- 102100022563 Tubulin polymerization-promoting protein Human genes 0.000 description 3
- 108091023045 Untranslated Region Proteins 0.000 description 3
- 108020005202 Viral DNA Proteins 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- 230000004186 co-expression Effects 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 230000007423 decrease Effects 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 239000013613 expression plasmid Substances 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 230000002458 infectious effect Effects 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- 108010005942 methionylglycine Proteins 0.000 description 3
- 230000011987 methylation Effects 0.000 description 3
- 238000007069 methylation reaction Methods 0.000 description 3
- 108010073101 phenylalanylleucine Proteins 0.000 description 3
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 230000001052 transient effect Effects 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- HJCMDXDYPOUFDY-WHFBIAKZSA-N Ala-Gln Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O HJCMDXDYPOUFDY-WHFBIAKZSA-N 0.000 description 2
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 2
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 2
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 2
- 240000006108 Allium ampeloprasum Species 0.000 description 2
- 235000005254 Allium ampeloprasum Nutrition 0.000 description 2
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 2
- PYDIIVKGTBRIEL-SZMVWBNQSA-N Arg-Trp-Pro Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(O)=O PYDIIVKGTBRIEL-SZMVWBNQSA-N 0.000 description 2
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 2
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 2
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 2
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 2
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 2
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 2
- UKHNKRGNFKSHCG-CUJWVEQBSA-N Cys-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N)O UKHNKRGNFKSHCG-CUJWVEQBSA-N 0.000 description 2
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 2
- 102100038132 Endogenous retrovirus group K member 6 Pro protein Human genes 0.000 description 2
- 101710121417 Envelope glycoprotein Proteins 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 2
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 2
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 2
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 2
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 2
- JGHNIWVNCAOVRO-DCAQKATOSA-N Glu-His-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGHNIWVNCAOVRO-DCAQKATOSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 2
- UYPPAMNTTMJHJW-KCTSRDHCSA-N Gly-Ile-Trp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UYPPAMNTTMJHJW-KCTSRDHCSA-N 0.000 description 2
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 2
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 2
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 2
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 2
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- PSVAVKGDUAKZKU-BZSNNMDCSA-N Lys-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N)O PSVAVKGDUAKZKU-BZSNNMDCSA-N 0.000 description 2
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- 229930193140 Neomycin Natural products 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 2
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 2
- CTLVSHXLRVEILB-UBHSHLNASA-N Ser-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N CTLVSHXLRVEILB-UBHSHLNASA-N 0.000 description 2
- UIIMBOGNXHQVGW-UHFFFAOYSA-M Sodium bicarbonate Chemical compound [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 2
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 2
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 2
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 2
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 2
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 2
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 2
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- 108700005077 Viral Genes Proteins 0.000 description 2
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 230000002153 concerted effect Effects 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 108010078428 env Gene Products Proteins 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000000799 fluorescence microscopy Methods 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 238000001415 gene therapy Methods 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000003119 immunoblot Methods 0.000 description 2
- 238000010820 immunofluorescence microscopy Methods 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 229930014626 natural product Natural products 0.000 description 2
- 229960004927 neomycin Drugs 0.000 description 2
- 210000004940 nucleus Anatomy 0.000 description 2
- 230000030648 nucleus localization Effects 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 108090000765 processed proteins & peptides Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 230000010473 stable expression Effects 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- 230000029812 viral genome replication Effects 0.000 description 2
- 239000011701 zinc Substances 0.000 description 2
- 229910052725 zinc Inorganic materials 0.000 description 2
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- FWBHETKCLVMNFS-UHFFFAOYSA-N 4',6-Diamino-2-phenylindol Chemical compound C1=CC(C(=N)N)=CC=C1C1=CC2=CC=C(C(N)=N)C=C2N1 FWBHETKCLVMNFS-UHFFFAOYSA-N 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- 208000030507 AIDS Diseases 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- ZRGNRZLDMUACOW-HERUPUMHSA-N Ala-Cys-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N ZRGNRZLDMUACOW-HERUPUMHSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- MAEQBGQTDWDSJQ-LSJOCFKGSA-N Ala-Met-His Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MAEQBGQTDWDSJQ-LSJOCFKGSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- JNJHNBXBGNJESC-KKXDTOCCSA-N Ala-Tyr-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JNJHNBXBGNJESC-KKXDTOCCSA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 1
- WHLDJYNHXOMGMU-JYJNAYRXSA-N Arg-Val-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WHLDJYNHXOMGMU-JYJNAYRXSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- YVHGKXAOSVBGJV-CIUDSAMLSA-N Asp-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N YVHGKXAOSVBGJV-CIUDSAMLSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- 238000000035 BCA protein assay Methods 0.000 description 1
- 241001416152 Bos frontalis Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 238000009010 Bradford assay Methods 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 241001227713 Chiron Species 0.000 description 1
- 108010077544 Chromatin Proteins 0.000 description 1
- 241001559589 Cullen Species 0.000 description 1
- YMBAVNPKBWHDAW-CIUDSAMLSA-N Cys-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N YMBAVNPKBWHDAW-CIUDSAMLSA-N 0.000 description 1
- YZKOXEJTLWZOQL-GUBZILKMSA-N Cys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N YZKOXEJTLWZOQL-GUBZILKMSA-N 0.000 description 1
- 230000033616 DNA repair Effects 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 108010031111 EBV-encoded nuclear antigen 1 Proteins 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 101710168592 Gag-Pol polyprotein Proteins 0.000 description 1
- 229930182566 Gentamicin Natural products 0.000 description 1
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 1
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108060003393 Granulin Proteins 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- 229920000209 Hexadimethrine bromide Polymers 0.000 description 1
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 1
- 241000598436 Human T-cell lymphotropic virus Species 0.000 description 1
- 206010020460 Human T-cell lymphotropic virus type I infection Diseases 0.000 description 1
- 241000714260 Human T-lymphotropic virus 1 Species 0.000 description 1
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 1
- 108700020127 Human immunodeficiency virus 2 p31 integrase Proteins 0.000 description 1
- 241000711920 Human orthopneumovirus Species 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- ZXIGYKICRDFISM-DJFWLOJKSA-N Ile-His-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZXIGYKICRDFISM-DJFWLOJKSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- 102100034353 Integrase Human genes 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 101710128836 Large T antigen Proteins 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 1
- ZGGVHTQAPHVMKM-IHPCNDPISA-N Leu-Trp-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N ZGGVHTQAPHVMKM-IHPCNDPISA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 1
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- 241000713821 Mason-Pfizer monkey virus Species 0.000 description 1
- JHKXZYLNVJRAAJ-WDSKDSINSA-N Met-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(O)=O JHKXZYLNVJRAAJ-WDSKDSINSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- QXOHLNCNYLGICT-YFKPBYRVSA-N Met-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(O)=O QXOHLNCNYLGICT-YFKPBYRVSA-N 0.000 description 1
- WXJXYMFUTRXRGO-UWVGGRQHSA-N Met-His-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 WXJXYMFUTRXRGO-UWVGGRQHSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 108010051791 Nuclear Antigens Proteins 0.000 description 1
- 102000019040 Nuclear Antigens Human genes 0.000 description 1
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 1
- 102000011931 Nucleoproteins Human genes 0.000 description 1
- 108010061100 Nucleoproteins Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 101150093999 PLN gene Proteins 0.000 description 1
- 108010002747 Pfu DNA polymerase Proteins 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 1
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 1
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 1
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 1
- MTMJNKFZDQEVSY-BZSNNMDCSA-N Pro-Val-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MTMJNKFZDQEVSY-BZSNNMDCSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 101710150344 Protein Rev Proteins 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 241000205156 Pyrococcus furiosus Species 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 101150052863 THY1 gene Proteins 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 1
- YTYHAYZPOARHAP-HOCLYGCPSA-N Trp-Lys-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N YTYHAYZPOARHAP-HOCLYGCPSA-N 0.000 description 1
- CUHBVKUVJIXRFK-DVXDUOKCSA-N Trp-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC=3C4=CC=CC=C4NC=3)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CUHBVKUVJIXRFK-DVXDUOKCSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 1
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 1
- 108010067390 Viral Proteins Proteins 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000002730 additional effect Effects 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 230000031016 anaphase Effects 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 230000000840 anti-viral effect Effects 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 239000012298 atmosphere Substances 0.000 description 1
- 208000005266 avian sarcoma Diseases 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 1
- 230000004888 barrier function Effects 0.000 description 1
- 230000037429 base substitution Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 230000027455 binding Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 150000005829 chemical entities Chemical class 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 210000003483 chromatin Anatomy 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 210000000172 cytosol Anatomy 0.000 description 1
- 231100000599 cytotoxic agent Toxicity 0.000 description 1
- 230000003412 degenerative effect Effects 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 231100000676 disease causative agent Toxicity 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 241001493065 dsRNA viruses Species 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 239000002532 enzyme inhibitor Substances 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 101150047047 gag-pol gene Proteins 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 238000003125 immunofluorescent labeling Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 238000000464 low-speed centrifugation Methods 0.000 description 1
- 238000003670 luciferase enzyme activity assay Methods 0.000 description 1
- 238000004020 luminiscence type Methods 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 210000004779 membrane envelope Anatomy 0.000 description 1
- 230000031864 metaphase Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 230000011278 mitosis Effects 0.000 description 1
- 230000007498 myristoylation Effects 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 230000030147 nuclear export Effects 0.000 description 1
- 238000012758 nuclear staining Methods 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 229940056360 penicillin g Drugs 0.000 description 1
- JTJMJGYZQZDUJJ-UHFFFAOYSA-N phencyclidine Chemical compound C1CCCCN1C1(C=2C=CC=CC=2)CCCCC1 JTJMJGYZQZDUJJ-UHFFFAOYSA-N 0.000 description 1
- 239000000419 plant extract Substances 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 229910052700 potassium Inorganic materials 0.000 description 1
- 239000011591 potassium Substances 0.000 description 1
- 239000000276 potassium ferrocyanide Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 238000002731 protein assay Methods 0.000 description 1
- 230000006920 protein precipitation Effects 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 210000003370 receptor cell Anatomy 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000014493 regulation of gene expression Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 1
- 235000017557 sodium bicarbonate Nutrition 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- QTENRWWVYAAPBI-YCRXJPFRSA-N streptomycin sulfate Chemical compound OS(O)(=O)=O.OS(O)(=O)=O.OS(O)(=O)=O.CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](N=C(N)N)[C@H](O)[C@@H](N=C(N)N)[C@H](O)[C@H]1O.CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](N=C(N)N)[C@H](O)[C@@H](N=C(N)N)[C@H](O)[C@H]1O QTENRWWVYAAPBI-YCRXJPFRSA-N 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 108700004027 tat Genes Proteins 0.000 description 1
- 101150098170 tat gene Proteins 0.000 description 1
- XOGGUFAVLNCTRS-UHFFFAOYSA-N tetrapotassium;iron(2+);hexacyanide Chemical compound [K+].[K+].[K+].[K+].[Fe+2].N#[C-].N#[C-].N#[C-].N#[C-].N#[C-].N#[C-] XOGGUFAVLNCTRS-UHFFFAOYSA-N 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16041—Use of virus, viral particle or viral elements as a vector
- C12N2740/16043—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16051—Methods of production or purification of viral material
- C12N2740/16052—Methods of production or purification of viral material relating to complementing cells and packaging systems for producing virus or viral particles
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16111—Human Immunodeficiency Virus, HIV concerning HIV env
- C12N2740/16122—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/108—Plasmid DNA episomal vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2840/00—Vectors comprising a special translation-regulating system
- C12N2840/20—Vectors comprising a special translation-regulating system translation of more than one cistron
- C12N2840/203—Vectors comprising a special translation-regulating system translation of more than one cistron having an IRES
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Virology (AREA)
- Biophysics (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Immunology (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
The present invention features a synthetic gene or region of a gene which ha s an amended codon usage compared with the wild-type gene and which is for the high level expression of a retroviral protein in eukaryotic cells, the expressed retroviral protein having enzymatic activity in the eukaryotic cel l. In addition, the invention features a synthetic gene or region of a gene encoding a retroviral enzyme or part of a retroviral enzyme normally express ed in a mammalian or other eukaryotic cell wherein at least one non-preferred codon in the wild-type gene encoding the enzyme has been replaced by a preferred codon encoding the same amino acid. The retroviral protein may be a protease, reverse transcriptase, integrase protein or a polyprotein gag-pol precursor thereof. In one embodiment the retroviral protein with enzymatic activity is a lentiviral protein. In other embodiments the enzymatically active protein is a pol enzyme. In more preferred embodiments, the enzymatically active protein is a lentiviral integrase. In an even more preferred embodiment the enzyme is an HIV enzyme. In more preferred embodiments the enzymatically active protein is HIV integrase. The present invention also includes a detection method for intracellular integrase using a promoterless reporter gene.
Description
I
SYNTHETIC GENE FOR EXPRESSING ACTIVE RETROVIRAL PROTEIN IN EUKARYOTES
The present invention relates to the design of a synthetic gene for expressing retroviral proteins in eukaryotic cells especially mammalian cells as well as a synthetic gene, an expression vector containing the gene, eukaryotic cells stably harboring the gene, as well as methods of detection.
TECHNICAL BACKGROUND
Retroviruses are diploid positive strand RNA viruses that replicate through an integrated DNA intermediate. Typically, retroviruses comprise a protein-containing lipid envelope surrounding a protein-encapsulated core carrying the viral genome.
Within the infected cell the retroviral genome is reverse-transcribed into double stranded DNA by a virally encoded reverse transcriptase enzyme that is part of the retroviral particle. The particle also includes other enzymes such as integrase. Integrase is the virus-encoded enzyme that is responsible for inserting the viral DNA
copy into the chromosome of the host cell, a process referred to as retroviral integration. (For a review see Brown (1997), in Retroviruses, Cold Spring Harbor Laboratory Press USA, pp. 161-203). Integration is an essential step in the replication cycle of the human immunodeficiency virus type 1 (HIV-1), the causative agent of AIDS (LaFemina et al.
(1992), J. Virol. 66: 7414-7419). Since no human counterpart is known to exist, integration has attracted a lot of attention as a potential new antiviral target. However, integrase inhibitor development has suffered from the lack of a relevant cellular integration assay; integrase activity is typically evaluated using artificial oligonucleotide-based test tube reactions. There is therefore a need to provide an intracellular integration assay.
Wild-type retroviral genomes contain at least three genes known as the gag, pol and env genes. The gag gene encodes internal core structural proteins, the pol gene encodes for certain enzymes such as protease, reverse transcriptase and integrase, and the env gene encodes the retroviral envelope glycoproteins. Integrases from different retroviruses vary in size from 30 to 46 kDa, are encoded by the 3'-end of the pol gene and are released from a gag pol polyprotein precursor by proteolytic processing. The aminoterminal domain of integrase is characterized by a zinc finger (HHCC), is CONFIRM~I'ION COPT
SYNTHETIC GENE FOR EXPRESSING ACTIVE RETROVIRAL PROTEIN IN EUKARYOTES
The present invention relates to the design of a synthetic gene for expressing retroviral proteins in eukaryotic cells especially mammalian cells as well as a synthetic gene, an expression vector containing the gene, eukaryotic cells stably harboring the gene, as well as methods of detection.
TECHNICAL BACKGROUND
Retroviruses are diploid positive strand RNA viruses that replicate through an integrated DNA intermediate. Typically, retroviruses comprise a protein-containing lipid envelope surrounding a protein-encapsulated core carrying the viral genome.
Within the infected cell the retroviral genome is reverse-transcribed into double stranded DNA by a virally encoded reverse transcriptase enzyme that is part of the retroviral particle. The particle also includes other enzymes such as integrase. Integrase is the virus-encoded enzyme that is responsible for inserting the viral DNA
copy into the chromosome of the host cell, a process referred to as retroviral integration. (For a review see Brown (1997), in Retroviruses, Cold Spring Harbor Laboratory Press USA, pp. 161-203). Integration is an essential step in the replication cycle of the human immunodeficiency virus type 1 (HIV-1), the causative agent of AIDS (LaFemina et al.
(1992), J. Virol. 66: 7414-7419). Since no human counterpart is known to exist, integration has attracted a lot of attention as a potential new antiviral target. However, integrase inhibitor development has suffered from the lack of a relevant cellular integration assay; integrase activity is typically evaluated using artificial oligonucleotide-based test tube reactions. There is therefore a need to provide an intracellular integration assay.
Wild-type retroviral genomes contain at least three genes known as the gag, pol and env genes. The gag gene encodes internal core structural proteins, the pol gene encodes for certain enzymes such as protease, reverse transcriptase and integrase, and the env gene encodes the retroviral envelope glycoproteins. Integrases from different retroviruses vary in size from 30 to 46 kDa, are encoded by the 3'-end of the pol gene and are released from a gag pol polyprotein precursor by proteolytic processing. The aminoterminal domain of integrase is characterized by a zinc finger (HHCC), is CONFIRM~I'ION COPT
universally conserved among all retroviruses, and is essential for in vivo integration.
The central domain is the most conserved region with an essential DD35E motif involved in catalysis. This portion can catalyze the disintegration reaction in vitro. The carboxyterminal domain is referred to as DNA binding domain and shows the least sequence conservation. This fragment is required for 3'-end processing and integration.
The active enzyme is thought to exist as a multimer wherein active domains can transcomplement inactive domains.
Transient expression of avian sarcoma-leukosis virus (ASLV) integrase in COS
cells has been obtained previously (Morns-Vasios et al. (1988), J. Virol. 62:
349-353).
1o A mouse cell line stably expressing the integrase of Rous sarcoma virus (RSV) has also been reported (Mumm et al. (1992), Virology 189: 500-510). Expression levels were not specified but appeared rather low. The integrase (IN) of HIV-1 has been expressed in Escherichia coli (E. coli) (Sherman and Fyfe (1990), Proc. Natl. Acad. Sci.
USA 87, 5119-5123), insect cells using baculovirus (Bushman et al. (1991), Science 249: 1555-1558), and Saccharomyces cerevisiae (Caumont et al. ( 1996), Curr. Genet. 29:
510). In yeast integrase expression proved to be toxic in cells defective in DNA repair.
High level expression of HIV-1 integrase in mammalian cells has remained elusive, in large part because expression of HIV-1 gag and pol proteins in general is Rev-dependent (Cullen (1992), Microb. Rev. 56: 375-395). In mammalian cells Rev-dependent expression of HIV-IN or HIV-IN fused to (3-galactosidase or GFP has been reported previously (Faust et al. (1995), Biochem. Mol. Biol. Int. 36: 745-758; Kukolj et al. (1997), J. Virol. 71: 843-847; Pluymers et al., (1999), Virology 258:
327-332).
However, expression levels, even after transient transfection, were always low. In the absence of Rev, multiple inhibitory or instability sequences (INS), also referred to as cis-acting repressor elements (CRS), in the mRNA interfere with protein expression.
Potential mechanisms include: nuclear retention or mRNA instability. It was observed that mRNA containing CRS is trapped in the nuclei and that the inhibition of expression is at least partly due to the poor translocation of mRNA to the cytosol (Mikaelian et al. (1996), J. Mol. Biol. 257: 246-264; Borg et al. (1997), Virology 236:
95-103). Elements of the RNA processing machinery could be involved in nuclear trapping of mRNA that contains CRS. There is also evidence that several regions of the HIV-1 genome that contribute to the instability of the mRNA, have high AU
contents.
They may represent binding sites for cellular factors which contribute to mRNA
The central domain is the most conserved region with an essential DD35E motif involved in catalysis. This portion can catalyze the disintegration reaction in vitro. The carboxyterminal domain is referred to as DNA binding domain and shows the least sequence conservation. This fragment is required for 3'-end processing and integration.
The active enzyme is thought to exist as a multimer wherein active domains can transcomplement inactive domains.
Transient expression of avian sarcoma-leukosis virus (ASLV) integrase in COS
cells has been obtained previously (Morns-Vasios et al. (1988), J. Virol. 62:
349-353).
1o A mouse cell line stably expressing the integrase of Rous sarcoma virus (RSV) has also been reported (Mumm et al. (1992), Virology 189: 500-510). Expression levels were not specified but appeared rather low. The integrase (IN) of HIV-1 has been expressed in Escherichia coli (E. coli) (Sherman and Fyfe (1990), Proc. Natl. Acad. Sci.
USA 87, 5119-5123), insect cells using baculovirus (Bushman et al. (1991), Science 249: 1555-1558), and Saccharomyces cerevisiae (Caumont et al. ( 1996), Curr. Genet. 29:
510). In yeast integrase expression proved to be toxic in cells defective in DNA repair.
High level expression of HIV-1 integrase in mammalian cells has remained elusive, in large part because expression of HIV-1 gag and pol proteins in general is Rev-dependent (Cullen (1992), Microb. Rev. 56: 375-395). In mammalian cells Rev-dependent expression of HIV-IN or HIV-IN fused to (3-galactosidase or GFP has been reported previously (Faust et al. (1995), Biochem. Mol. Biol. Int. 36: 745-758; Kukolj et al. (1997), J. Virol. 71: 843-847; Pluymers et al., (1999), Virology 258:
327-332).
However, expression levels, even after transient transfection, were always low. In the absence of Rev, multiple inhibitory or instability sequences (INS), also referred to as cis-acting repressor elements (CRS), in the mRNA interfere with protein expression.
Potential mechanisms include: nuclear retention or mRNA instability. It was observed that mRNA containing CRS is trapped in the nuclei and that the inhibition of expression is at least partly due to the poor translocation of mRNA to the cytosol (Mikaelian et al. (1996), J. Mol. Biol. 257: 246-264; Borg et al. (1997), Virology 236:
95-103). Elements of the RNA processing machinery could be involved in nuclear trapping of mRNA that contains CRS. There is also evidence that several regions of the HIV-1 genome that contribute to the instability of the mRNA, have high AU
contents.
They may represent binding sites for cellular factors which contribute to mRNA
instability (Schneider et al. (1997), J. Virol. 71: 4892-4903). According to another hypothesis, mRNA containing inhibitory sequences fails to be translated efficiently without Rev. Whatever the mechanism of the observed inhibition, it is clear that inhibition occurs at the level of the mRNA and is due to some AU-rich regions.
During the HIV replication cycle Rev interaction with the Rev responsive element (RRE) relieves the inhibition in a regulated manner (Schwartz et al. (1992), J.
Virol. 66: 150-159). In this perspective, it is not surprising that by mutating some INS
while preserving the coding function for gag pol transcripts, efficient Rev-independent expression of viral particles has been obtained (Schneider et al. (1997), J.
Virol. 71:
l0 4892-4903). There is evidence that in the case of HIV gp120 mRNA poor translatability due to inefficient codon usage rather than mRNA instability is responsible for low level protein expression (Haas et al., 1996; Schneider et al. ( 1997), J. Virol. 71: 4892-4903).
US 5,811,270 (Grandgenett) describes a test tube method of analysis of concerted integration in which a viral integrase enzyme is first incubated with donor DNA molecules followed by incubation with target DNA molecules. The donor DNA
has at least one unique restriction site for analysis of the concerted integration product.
The described method is said to be useful for studying integrase such as screening of HIV-1 or HIV-2 integrase inhibitors as well as production of transgenic non-human animals and gene transfer. The integrase used is purified from virus particles and the activity is analyzed in the test tube, not intracellularly.
US x,795,737, WO 96/09378, WO 97/11086 and WO 98/12207 all describe methods of producing a synthetic gene encoding a protein normally expressed in a mammalian cell whereby the synthetic gene is reported to overexpress the encoded proteins in mammalian cells. The known synthetic genes are constructed by replacing non-preferred codons or less preferred codons with preferred codons which encode the same amino acid by utilising the redundancy of the genetic code. Examples are given of synthetic env genes which encode envelope glycoproteins but there is no discussion of expressing a protein with enzymatic activity in the host cell. A method of designing a 3o synthetic gene for the overexpression of a protein while maintaining its enzymatic activity is not derivable from the known teaching. There are a significant number of factors which may allow expression of a (retroviral) protein which fails to show intracellular enzymatic activity. The expressed enzyme may be defective for many reasons of which intracellular inhibition of the enzyme and the need for the presence of another viral protein at the same time are but a few. Further, it is not obvious that an enzyme can be overexpressed, for example there may be some limiting factor such as poor solubility or cellular toxicity. On the one hand high level expression of a retroviral enzyme will be required to detect the enzymatic activity, on the other hand levels which are too high may cause protein precipitation or cellular toxicity. For any retroviral enzyme to be active in the cell an optimal intracellular concentration will be required.
In case of failure the suggestion is to replace non-preferred codons to a certain percentage, e.g. 90%, 80%, 70% ..., but there is no precise teaching of how to select 1o which codons are to be replaced. In particular, there is no indication that a specific nucleotide pair frequency is of relevance to high level gene expression. - It is not conclusive that the mechanism of RRE-instability (in env) is the same as, or even related to the mRNA instability problem in gag and pol. In fact there is evidence that the mechanisms are different. Hence, it is not predictable that Rev-independent expression of an env gene may be extrapolated to cure the instability problem of gag and pol genes.
It is an object of the present invention to develop an efficient expression system for an enzymatically active retroviral protein, in particular HIV-1 integrase, in eukaryotic cells, especially mammalian cells.
2o It s a further object to provide a more efficient detection method for retroviral enzyme inhibitors.
It is a further object of the present invention to provide a design method for the construction of a gene encoding a retroviral protein with enzymatic activity.
A further object of the present invention is to provide an expression vector capable of delivering a gene to a target cell, in which cell the enzymatically active protein encoded by the gene is expressed.
SUMMARY OF THE INVENTION
The present invention features a synthetic gene or region of a gene which has an 3o amended codon usage compared with the wild-type gene and which is for the high level expression of a retroviral protein in eukaryotic cells, the expressed retroviral protein having enzymatic activity. in the eukaryotic cell. In addition, the invention features a synthetic gene or region of a gene encoding a retroviral enzyme or part of a retroviral enzyme normally expressed in a mammalian or other eukaryotic cell wherein at least one non-preferred codon in the wild-type gene encoding the enzyme has been replaced by a preferred codon encoding the same amino acid. By "region of a gene with amended codon usage" is meant that it can be sufficient to change codons only in those parts of a gene that normally produce instability sequences (INS) or cis-acting repressor elements (CRS) in the transcribed mRNA of the gene.
By "retroviral protein or enzyme normally expressed in a mammalian or eukaryotic cell" is meant a protein or enzyme which is expressed in a mammalian or eukaryotic cell under disease conditions. These are genes which are encoded by a 1o retrovirus (including a lentivirus) which are expressed in mammalian or eukaryotic cells post-infection.
In preferred embodiments, the synthetic gene is capable of expressing the retroviral enzyme at a level at least 200% of that expressed by the "natural"
(or "native") gene in a mammalian or eukaryotic cell culture system.
The retroviral protein may be a protease, reverse transcriptase, integrase protein or a polyprotein gag-pol precursor thereof. In one embodiment the retroviral protein with enzymatic activity is a lentiviral protein. In other embodiments the enzymatically active protein is a pol enzyme. In more preferred embodiments, the enzymatically active protein is a lentiviral integrase. In an even more preferred embodiment the 2o enzyme is an HIV enzyme. In more preferred embodiments the enzymatically active protein is HIV integrase. The enzymatic activity includes at least an integrase function, namely of promotion or stimulation of the integration of DNA fragments into host cell DNA, preferably the chromosome of the host cell. The integrase hereby is expressed on its own id est as a single component, independent of any retroviral components..
By "retroviral components" is meant the retroviral, specifically the lentiviral, and more specificially the HIV-1 regulatory and accessory proteins like Tat, Rev, Nef, Vpu, Vif, Vpr.
The invention also features a eukaryotic expression vector comprising the synthetic gene or region of a gene. The expression vector preferably includes a 3o constitutive or an inducible or a tissue-specific promoter. Expression from the eukaryotic expression vector can be transient after transfection of the vector in a eukaryotic cell by any of suitable, e.g. established, transfection procedures.
The vector may be any suitable vector such as a plasmid, a mammalian or insect virus.
Expression may also be permanent in a eukaryotic cell line stably harbouring the expression vector.
The expression vector may be comprised in a packaging construct for producing retroviral particles for gene transfer. The retroviral particle may be a lentiviral particle.
Another aspect of the present invention features a eukaryotic cell line that harbours the synthetic gene or region of a gene. The cell line preferably expresses the retroviral enzymatically active protein using a constitutive, inducible or tissue specific promoter. The expressed retroviral protein shows enzymatic activity that can be measured for example by complementation of enzyme-defective viruses or in the case of an integrase by stimulation or the promotion of the insertion of DNA
molecules into to another DNA molecule, preferably the chromosome of the cell.
The present invention also includes a transgenic non-human animal harboring the synthetic gene or region of a gene. The expression of the gene or region of a gene may be induced at any moment using an inducible promoter or, alternatively, in desired tissues using a tissue-specific promoter.
The present invention also features a method for preparing a synthetic gene or region of a gene encoding an enzymatically active retroviral protein or part of such a protein. The method not only identifies and uses preferred codon usage but also, and moreover mainly, seeks to increase mRNA stability during expression. The method includes identifying a small group of genes from the total set of genes of a target 2o eukaryotic cell which encode proteins which are naturally expressed easily and/or in high concentrations in the target cell. The small group may include 10 or less genes, more typically 5 or less genes. From the codon sequences of these identified genes, a prefeiTed codon usage and a preferred nucleotide relationship or nucleotide pair frequency is identified. By preferred codon usage is meant that for a specific amino acid a specific codon is chosen as the preferred codon to encode the amino acid based on the high use of the preferred codon within the select group of genes. By a preferred codon relationship is meant the ratios of the various nucleotides and combinations of nucleotides to each other which commonly appear in genes of the target eukaryotic cell.
One particular nucleotide relationship is the GC content or the GC nucleotide pair 3o frequency. Using the preferred codon usage, non-preferred codons are identified in the natural gene encoding the enzyme and one or more of the non-preferred codons is/are replaced with a preferred codon encoding the same amino acid as the replaced codon.
The replacement is biased to obtain the preferred nucleotide relationship or nucleotide pair frequency, resulting in even better optimized conditions for expression in eukaryotes compared to the use of preferred codon usage only. The replacement may be made based on a random choice between alternative codons encoding the same amino acid at each position using a random number generator and biasing the choice of alternative codons based on the preferred codon usage to obtain the preferred nucleotide relationship or nucleotide pair frequency. In addition, the synthetic gene sequence may be edited by removing potential splice sites and to reduce the number of CpG methylation sites while keeping the overall nucleotide relationship or the nucleotide pair frequency close to the preferred one, e.g. keeping the GC
content and codon usage close to the preferred one. GC content should be kept close to the preferred usage in the target cell, e.g. about 60% in mammalian cells. A
preferred range for the GC content is 53 to 63%, more preferably 55 to 61% for expression of the gene in human cells. To provide efficient initiation of translation the Kozak consensus sequence (ANNATGG) may be added.
It is not necessary to replace all non-preferred codons with preferred codons.
Increased expression may be accomplished even with partial non-preferred codon replacement with preferred codons. Under some circumstances it may be desirable to only partially replace non-preferred codons with preferred codons in order to obtain an intermediate level of expression.
By "synthetic gene" is meant a nucleotide sequence encoding a naturally occurring protein in which a portion of the naturally occurnng codons has been replaced by other codons. For example, a non-preferred codon is replaced with a preferred codon encoding the same amino acid. However, by replacing codons to create a synthetic gene the expression in eukaryotic, e.g. mammalian cells (especially human cells) of a wide variety of genes (of eukaryotic, mammalian, prokaryotic or viral origin) can be increased compared to the expression of the naturally occurring gene.
Thus, the invention includes improving the eukaryotic, especially a mammalian cell expression of a gene from any source by the codon replacement methods described herein.
By "vector" is meant a DNA molecule, derived, e.g., from a plasmid, or 3o mammalian or insect virus, into which fragments of DNA may be inserted or cloned. A
vector will contain one or more unique restriction sites and may be capable of autonomous replication in a defined host or vehicle organism such that the cloned sequence is reproducible. Thus, by "expression vector" is meant any autonomous element capable of directing the synthesis of a protein. Such DNA expression vectors include mammalian plasmids and viruses.
By retroviral "packaging construct" or "packaging vector" is meant a plasmid-based or virus-based vector or construct, configured to encode for the proteins necessary for producing virus particles that are devoid of genomic RNA. In general, this implies providing the gag, pol and env gene products. Lentiviral packaging constructs of interest contain changes to the coding sequences of gag or pol proteins (i.e. synthetic genes) to enhance lentiviral protein expression and to enhance safety. For biosafety reasons, the packaging functions are often divided into two genomes, one which expresses the gag and pol gene and another expressing the env gene product. In packaging constructs in accordance with the present invention, regulators of gene expression such as the Rev gene product would no longer be required. Increased biosafety of these packaging constructs is based on a reduced risk for (homologous) recombination of these synthetic genes with their wild-type counterparts.
The invention also features synthetic portion of a gene which encodes a desired portion of the protein. Such synthetic gene fragments are similar to the synthetic genes of the invention except that they encode only a portion of the protein. The portion of the gene encodes a portion of the enzyme which has some enzymatic activity, e.g. it may have catalytic activity, for example, the synthetic gene may encode a catalytic core of an enzyme, e.g. it may be a part of reverse transcriptase.
The present invention also includes a detection method for intracellular integrase using a promoterless reporter gene. The reporter gene may be luciferase, GFP
or an antibiotic selection marker (e.g. neomycin resistance). The reporter gene construct may be used as the substrate of the retroviral enzyme, e.g.
integrase expressed from the synthetic gene be it in a stable cell line or in a transient mode after transfection of the expression vector, the retroviral enzyme, e.g. integrase being in accordance with the present invention.
The present invention may provide a synthetic gene and a method of designing and constructing the same to obtain efficient expression of a retroviral, in particular lentiviral enzyme such as integrase of the human immunodeficiency virus type 1 (HIV-1), or part of a retroviral enzyme in mammalian cells. The synthetic gene circumvents mRNA instability by increasing the GC content of the wild type integrase gene from 40% to 59%. The synthetic gene, cloned in a eukaryotic expression vector, provides efficient expression of HIV-1 integrase in various mammalian cell lines. The amino terminus of the protein was as predicted by the sequence after removal of the first methionyl residue. Nuclear localization of the recombinant protein was evidenced by fluorescence microscopy. A 293T cell line stably expressing HIV-1 integrase was obtained. The functionality of integrase was proven by trans-complementation experiments. Lentiviral vector particles carrying the inactivating D64V
mutation in the integrase gene, were obtained capable of stably transducing 293T cells when complemented in the producer cell line with integrase expressed from the synthetic gene. When the cell line that stably expresses integrase was infected with the defective to virus particles, complementation of integrase function was observed.
Transfection with a linear promoterless DNA substrate that contains a reporter gene behind an IRES and is flanked by HIV LTR ends, resulted in a reproducibly higher reporter signal in cells that express integrase. Since the increase in reporter gene activity was stable upon passaging of the transfected cells, it can be concluded that the integrase promotes insertion of the linear DNA substrate in the cellular chromosome. The fold increase of reporter signal with integrase expressed from a mutant synthetic gene, containing the D64V mutation, was considerably lower, indicating that the enzymatic activity of the enzyme was required. The established cellular integration system in accordance with the present invention facilitates the study of the interplay between host and viral factors during integration, the development of specific HIV integration inhibitors as well as the design of gene transfer systems.
The present invention, its advantages and embodiments will now be described with reference to the following figures and drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1. Western blot analysis of transient expression of HIV-1 IN in 293T
cells using different expression strategies. 293T cells were transiently transfected with the various expression vectors. At 48 hrs post transfection cell extracts were made using 1 % SDS, 1 mM PMSF. Cell extracts representing 10 p.g of total protein were 3o separated by PAGE and blotted onto PVDF membranes. Detection was performed using polyclonal antibodies against HIV-1 integrase and the ECL+ detection system.
Lane 1 contains 2.5 ng of recombinant and purified His-tagged HIV-1 integrase (HT-IN). The other lanes contain extracts after transfections with equal amounts of the following plasmids: Lane 2, pCEP4; Lane 3, pCEP-IN; Lane 4, pCEP-IN-CTE; Lane 5, pCEP-IN-RRE + pEF-cREV; Lane 6 pCMV-INS.
Figure 2. Sequence and structure of the synthetic gene.
(A) Sequence of the synthetic DNA coding for pNL4-3 HIV-1 integrase. The amino 5 acid sequence is shown in the single letter code. The restriction sites used in construction are boxed. The translation initiation site is underlined.
(B) A schematic representation of the structure of the synthetic gene. The following regions are indicated : the 5'- and 3'- untranslated regions (UTR) derived from (3-globin mRNA, the Met-Gly dipeptide and the integrase open reading frame (ORF).
The 1o three domains of the integrase protein are shown: the Zinc finger motif (HHCC), the catalytic core and the DNA binding domain.
Figure 3. Western blot analysis of the 293T-derived cell line that stably expresses HIV-1 IN from the synthetic gene. 293T cells were transfected with pCMV-INs and a stable cell line was selected with HygromycinB. Cell extracts (10 pg of total protein) were separated by PAGE and blotted onto PVDF membrane.
Detection was performed using polyclonal antibodies against HIV-1 integrase and the ECL+
detection system. Lane 1, 2.5 ng recombinant His-tagged HIV-1 integrase; Lane 2, extract of 293T cells; Lane 3, extract of 293T cells stably expressing IN
(293T-INs).
Figure 4. Detection of integrase activity using a promoterless reporter construct (DIPR) Figs. 4A-C are schematic representations of the method of detection of integrase activity using a promoterless reporter gene.
DESCRIPTION OF THE ILLUSTRATED EMBODIMENTS
The present invention will mainly be described with reference to a synthetic gene for overexpressing HIV integrase in mammalian cells but the invention is not limited thereto but only by the claims.
It has long been known that expression of eukaryotic genes in prokaryotes can be optimised by designing synthetic genes with modified codon usage. Less established, although demonstrated, is the concept of increasing expression of 3o eukaryotic genes in eukaryotic cells by modified codon usage. From bacteria it is known that few general rules apply (Makrides F. (1996), Microb. Rev 60: 512-538).
A retroviral enzyme such as integrase does not normally, during the infectious cycle, work as a soluble protein in the cytoplasm of a host cell.
During the HIV replication cycle Rev interaction with the Rev responsive element (RRE) relieves the inhibition in a regulated manner (Schwartz et al. (1992), J.
Virol. 66: 150-159). In this perspective, it is not surprising that by mutating some INS
while preserving the coding function for gag pol transcripts, efficient Rev-independent expression of viral particles has been obtained (Schneider et al. (1997), J.
Virol. 71:
l0 4892-4903). There is evidence that in the case of HIV gp120 mRNA poor translatability due to inefficient codon usage rather than mRNA instability is responsible for low level protein expression (Haas et al., 1996; Schneider et al. ( 1997), J. Virol. 71: 4892-4903).
US 5,811,270 (Grandgenett) describes a test tube method of analysis of concerted integration in which a viral integrase enzyme is first incubated with donor DNA molecules followed by incubation with target DNA molecules. The donor DNA
has at least one unique restriction site for analysis of the concerted integration product.
The described method is said to be useful for studying integrase such as screening of HIV-1 or HIV-2 integrase inhibitors as well as production of transgenic non-human animals and gene transfer. The integrase used is purified from virus particles and the activity is analyzed in the test tube, not intracellularly.
US x,795,737, WO 96/09378, WO 97/11086 and WO 98/12207 all describe methods of producing a synthetic gene encoding a protein normally expressed in a mammalian cell whereby the synthetic gene is reported to overexpress the encoded proteins in mammalian cells. The known synthetic genes are constructed by replacing non-preferred codons or less preferred codons with preferred codons which encode the same amino acid by utilising the redundancy of the genetic code. Examples are given of synthetic env genes which encode envelope glycoproteins but there is no discussion of expressing a protein with enzymatic activity in the host cell. A method of designing a 3o synthetic gene for the overexpression of a protein while maintaining its enzymatic activity is not derivable from the known teaching. There are a significant number of factors which may allow expression of a (retroviral) protein which fails to show intracellular enzymatic activity. The expressed enzyme may be defective for many reasons of which intracellular inhibition of the enzyme and the need for the presence of another viral protein at the same time are but a few. Further, it is not obvious that an enzyme can be overexpressed, for example there may be some limiting factor such as poor solubility or cellular toxicity. On the one hand high level expression of a retroviral enzyme will be required to detect the enzymatic activity, on the other hand levels which are too high may cause protein precipitation or cellular toxicity. For any retroviral enzyme to be active in the cell an optimal intracellular concentration will be required.
In case of failure the suggestion is to replace non-preferred codons to a certain percentage, e.g. 90%, 80%, 70% ..., but there is no precise teaching of how to select 1o which codons are to be replaced. In particular, there is no indication that a specific nucleotide pair frequency is of relevance to high level gene expression. - It is not conclusive that the mechanism of RRE-instability (in env) is the same as, or even related to the mRNA instability problem in gag and pol. In fact there is evidence that the mechanisms are different. Hence, it is not predictable that Rev-independent expression of an env gene may be extrapolated to cure the instability problem of gag and pol genes.
It is an object of the present invention to develop an efficient expression system for an enzymatically active retroviral protein, in particular HIV-1 integrase, in eukaryotic cells, especially mammalian cells.
2o It s a further object to provide a more efficient detection method for retroviral enzyme inhibitors.
It is a further object of the present invention to provide a design method for the construction of a gene encoding a retroviral protein with enzymatic activity.
A further object of the present invention is to provide an expression vector capable of delivering a gene to a target cell, in which cell the enzymatically active protein encoded by the gene is expressed.
SUMMARY OF THE INVENTION
The present invention features a synthetic gene or region of a gene which has an 3o amended codon usage compared with the wild-type gene and which is for the high level expression of a retroviral protein in eukaryotic cells, the expressed retroviral protein having enzymatic activity. in the eukaryotic cell. In addition, the invention features a synthetic gene or region of a gene encoding a retroviral enzyme or part of a retroviral enzyme normally expressed in a mammalian or other eukaryotic cell wherein at least one non-preferred codon in the wild-type gene encoding the enzyme has been replaced by a preferred codon encoding the same amino acid. By "region of a gene with amended codon usage" is meant that it can be sufficient to change codons only in those parts of a gene that normally produce instability sequences (INS) or cis-acting repressor elements (CRS) in the transcribed mRNA of the gene.
By "retroviral protein or enzyme normally expressed in a mammalian or eukaryotic cell" is meant a protein or enzyme which is expressed in a mammalian or eukaryotic cell under disease conditions. These are genes which are encoded by a 1o retrovirus (including a lentivirus) which are expressed in mammalian or eukaryotic cells post-infection.
In preferred embodiments, the synthetic gene is capable of expressing the retroviral enzyme at a level at least 200% of that expressed by the "natural"
(or "native") gene in a mammalian or eukaryotic cell culture system.
The retroviral protein may be a protease, reverse transcriptase, integrase protein or a polyprotein gag-pol precursor thereof. In one embodiment the retroviral protein with enzymatic activity is a lentiviral protein. In other embodiments the enzymatically active protein is a pol enzyme. In more preferred embodiments, the enzymatically active protein is a lentiviral integrase. In an even more preferred embodiment the 2o enzyme is an HIV enzyme. In more preferred embodiments the enzymatically active protein is HIV integrase. The enzymatic activity includes at least an integrase function, namely of promotion or stimulation of the integration of DNA fragments into host cell DNA, preferably the chromosome of the host cell. The integrase hereby is expressed on its own id est as a single component, independent of any retroviral components..
By "retroviral components" is meant the retroviral, specifically the lentiviral, and more specificially the HIV-1 regulatory and accessory proteins like Tat, Rev, Nef, Vpu, Vif, Vpr.
The invention also features a eukaryotic expression vector comprising the synthetic gene or region of a gene. The expression vector preferably includes a 3o constitutive or an inducible or a tissue-specific promoter. Expression from the eukaryotic expression vector can be transient after transfection of the vector in a eukaryotic cell by any of suitable, e.g. established, transfection procedures.
The vector may be any suitable vector such as a plasmid, a mammalian or insect virus.
Expression may also be permanent in a eukaryotic cell line stably harbouring the expression vector.
The expression vector may be comprised in a packaging construct for producing retroviral particles for gene transfer. The retroviral particle may be a lentiviral particle.
Another aspect of the present invention features a eukaryotic cell line that harbours the synthetic gene or region of a gene. The cell line preferably expresses the retroviral enzymatically active protein using a constitutive, inducible or tissue specific promoter. The expressed retroviral protein shows enzymatic activity that can be measured for example by complementation of enzyme-defective viruses or in the case of an integrase by stimulation or the promotion of the insertion of DNA
molecules into to another DNA molecule, preferably the chromosome of the cell.
The present invention also includes a transgenic non-human animal harboring the synthetic gene or region of a gene. The expression of the gene or region of a gene may be induced at any moment using an inducible promoter or, alternatively, in desired tissues using a tissue-specific promoter.
The present invention also features a method for preparing a synthetic gene or region of a gene encoding an enzymatically active retroviral protein or part of such a protein. The method not only identifies and uses preferred codon usage but also, and moreover mainly, seeks to increase mRNA stability during expression. The method includes identifying a small group of genes from the total set of genes of a target 2o eukaryotic cell which encode proteins which are naturally expressed easily and/or in high concentrations in the target cell. The small group may include 10 or less genes, more typically 5 or less genes. From the codon sequences of these identified genes, a prefeiTed codon usage and a preferred nucleotide relationship or nucleotide pair frequency is identified. By preferred codon usage is meant that for a specific amino acid a specific codon is chosen as the preferred codon to encode the amino acid based on the high use of the preferred codon within the select group of genes. By a preferred codon relationship is meant the ratios of the various nucleotides and combinations of nucleotides to each other which commonly appear in genes of the target eukaryotic cell.
One particular nucleotide relationship is the GC content or the GC nucleotide pair 3o frequency. Using the preferred codon usage, non-preferred codons are identified in the natural gene encoding the enzyme and one or more of the non-preferred codons is/are replaced with a preferred codon encoding the same amino acid as the replaced codon.
The replacement is biased to obtain the preferred nucleotide relationship or nucleotide pair frequency, resulting in even better optimized conditions for expression in eukaryotes compared to the use of preferred codon usage only. The replacement may be made based on a random choice between alternative codons encoding the same amino acid at each position using a random number generator and biasing the choice of alternative codons based on the preferred codon usage to obtain the preferred nucleotide relationship or nucleotide pair frequency. In addition, the synthetic gene sequence may be edited by removing potential splice sites and to reduce the number of CpG methylation sites while keeping the overall nucleotide relationship or the nucleotide pair frequency close to the preferred one, e.g. keeping the GC
content and codon usage close to the preferred one. GC content should be kept close to the preferred usage in the target cell, e.g. about 60% in mammalian cells. A
preferred range for the GC content is 53 to 63%, more preferably 55 to 61% for expression of the gene in human cells. To provide efficient initiation of translation the Kozak consensus sequence (ANNATGG) may be added.
It is not necessary to replace all non-preferred codons with preferred codons.
Increased expression may be accomplished even with partial non-preferred codon replacement with preferred codons. Under some circumstances it may be desirable to only partially replace non-preferred codons with preferred codons in order to obtain an intermediate level of expression.
By "synthetic gene" is meant a nucleotide sequence encoding a naturally occurring protein in which a portion of the naturally occurnng codons has been replaced by other codons. For example, a non-preferred codon is replaced with a preferred codon encoding the same amino acid. However, by replacing codons to create a synthetic gene the expression in eukaryotic, e.g. mammalian cells (especially human cells) of a wide variety of genes (of eukaryotic, mammalian, prokaryotic or viral origin) can be increased compared to the expression of the naturally occurring gene.
Thus, the invention includes improving the eukaryotic, especially a mammalian cell expression of a gene from any source by the codon replacement methods described herein.
By "vector" is meant a DNA molecule, derived, e.g., from a plasmid, or 3o mammalian or insect virus, into which fragments of DNA may be inserted or cloned. A
vector will contain one or more unique restriction sites and may be capable of autonomous replication in a defined host or vehicle organism such that the cloned sequence is reproducible. Thus, by "expression vector" is meant any autonomous element capable of directing the synthesis of a protein. Such DNA expression vectors include mammalian plasmids and viruses.
By retroviral "packaging construct" or "packaging vector" is meant a plasmid-based or virus-based vector or construct, configured to encode for the proteins necessary for producing virus particles that are devoid of genomic RNA. In general, this implies providing the gag, pol and env gene products. Lentiviral packaging constructs of interest contain changes to the coding sequences of gag or pol proteins (i.e. synthetic genes) to enhance lentiviral protein expression and to enhance safety. For biosafety reasons, the packaging functions are often divided into two genomes, one which expresses the gag and pol gene and another expressing the env gene product. In packaging constructs in accordance with the present invention, regulators of gene expression such as the Rev gene product would no longer be required. Increased biosafety of these packaging constructs is based on a reduced risk for (homologous) recombination of these synthetic genes with their wild-type counterparts.
The invention also features synthetic portion of a gene which encodes a desired portion of the protein. Such synthetic gene fragments are similar to the synthetic genes of the invention except that they encode only a portion of the protein. The portion of the gene encodes a portion of the enzyme which has some enzymatic activity, e.g. it may have catalytic activity, for example, the synthetic gene may encode a catalytic core of an enzyme, e.g. it may be a part of reverse transcriptase.
The present invention also includes a detection method for intracellular integrase using a promoterless reporter gene. The reporter gene may be luciferase, GFP
or an antibiotic selection marker (e.g. neomycin resistance). The reporter gene construct may be used as the substrate of the retroviral enzyme, e.g.
integrase expressed from the synthetic gene be it in a stable cell line or in a transient mode after transfection of the expression vector, the retroviral enzyme, e.g. integrase being in accordance with the present invention.
The present invention may provide a synthetic gene and a method of designing and constructing the same to obtain efficient expression of a retroviral, in particular lentiviral enzyme such as integrase of the human immunodeficiency virus type 1 (HIV-1), or part of a retroviral enzyme in mammalian cells. The synthetic gene circumvents mRNA instability by increasing the GC content of the wild type integrase gene from 40% to 59%. The synthetic gene, cloned in a eukaryotic expression vector, provides efficient expression of HIV-1 integrase in various mammalian cell lines. The amino terminus of the protein was as predicted by the sequence after removal of the first methionyl residue. Nuclear localization of the recombinant protein was evidenced by fluorescence microscopy. A 293T cell line stably expressing HIV-1 integrase was obtained. The functionality of integrase was proven by trans-complementation experiments. Lentiviral vector particles carrying the inactivating D64V
mutation in the integrase gene, were obtained capable of stably transducing 293T cells when complemented in the producer cell line with integrase expressed from the synthetic gene. When the cell line that stably expresses integrase was infected with the defective to virus particles, complementation of integrase function was observed.
Transfection with a linear promoterless DNA substrate that contains a reporter gene behind an IRES and is flanked by HIV LTR ends, resulted in a reproducibly higher reporter signal in cells that express integrase. Since the increase in reporter gene activity was stable upon passaging of the transfected cells, it can be concluded that the integrase promotes insertion of the linear DNA substrate in the cellular chromosome. The fold increase of reporter signal with integrase expressed from a mutant synthetic gene, containing the D64V mutation, was considerably lower, indicating that the enzymatic activity of the enzyme was required. The established cellular integration system in accordance with the present invention facilitates the study of the interplay between host and viral factors during integration, the development of specific HIV integration inhibitors as well as the design of gene transfer systems.
The present invention, its advantages and embodiments will now be described with reference to the following figures and drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1. Western blot analysis of transient expression of HIV-1 IN in 293T
cells using different expression strategies. 293T cells were transiently transfected with the various expression vectors. At 48 hrs post transfection cell extracts were made using 1 % SDS, 1 mM PMSF. Cell extracts representing 10 p.g of total protein were 3o separated by PAGE and blotted onto PVDF membranes. Detection was performed using polyclonal antibodies against HIV-1 integrase and the ECL+ detection system.
Lane 1 contains 2.5 ng of recombinant and purified His-tagged HIV-1 integrase (HT-IN). The other lanes contain extracts after transfections with equal amounts of the following plasmids: Lane 2, pCEP4; Lane 3, pCEP-IN; Lane 4, pCEP-IN-CTE; Lane 5, pCEP-IN-RRE + pEF-cREV; Lane 6 pCMV-INS.
Figure 2. Sequence and structure of the synthetic gene.
(A) Sequence of the synthetic DNA coding for pNL4-3 HIV-1 integrase. The amino 5 acid sequence is shown in the single letter code. The restriction sites used in construction are boxed. The translation initiation site is underlined.
(B) A schematic representation of the structure of the synthetic gene. The following regions are indicated : the 5'- and 3'- untranslated regions (UTR) derived from (3-globin mRNA, the Met-Gly dipeptide and the integrase open reading frame (ORF).
The 1o three domains of the integrase protein are shown: the Zinc finger motif (HHCC), the catalytic core and the DNA binding domain.
Figure 3. Western blot analysis of the 293T-derived cell line that stably expresses HIV-1 IN from the synthetic gene. 293T cells were transfected with pCMV-INs and a stable cell line was selected with HygromycinB. Cell extracts (10 pg of total protein) were separated by PAGE and blotted onto PVDF membrane.
Detection was performed using polyclonal antibodies against HIV-1 integrase and the ECL+
detection system. Lane 1, 2.5 ng recombinant His-tagged HIV-1 integrase; Lane 2, extract of 293T cells; Lane 3, extract of 293T cells stably expressing IN
(293T-INs).
Figure 4. Detection of integrase activity using a promoterless reporter construct (DIPR) Figs. 4A-C are schematic representations of the method of detection of integrase activity using a promoterless reporter gene.
DESCRIPTION OF THE ILLUSTRATED EMBODIMENTS
The present invention will mainly be described with reference to a synthetic gene for overexpressing HIV integrase in mammalian cells but the invention is not limited thereto but only by the claims.
It has long been known that expression of eukaryotic genes in prokaryotes can be optimised by designing synthetic genes with modified codon usage. Less established, although demonstrated, is the concept of increasing expression of 3o eukaryotic genes in eukaryotic cells by modified codon usage. From bacteria it is known that few general rules apply (Makrides F. (1996), Microb. Rev 60: 512-538).
A retroviral enzyme such as integrase does not normally, during the infectious cycle, work as a soluble protein in the cytoplasm of a host cell.
Integrase is part of a large ill-defined nucleoprotein complex called the preintegration complex of which also reverse transcriptase, nucleocapsid, matrix protein, the viral DNA and other factors are part. It is not obvious that integrase on its own in the cytoplasm of a target cell is enzymatically active, for example, there may be cellular factors which inhibit activity or viral factors which are missing in this environment.
Further, it is not obvious that integrase expressed as such will interact with artificial DNA substrates (see DIPR below). One aspect of the present invention is dissecting the preintegration complex to obtain a simple integrase-linear DNA interaction.
One embodiment of the present invention is a method to detect and utilize the enzymatic activity of a retroviral, in particular a lentiviral enzyme, in particular integrase by itself in a eukaryotic cell.
Initially eukaryotic expression vectors encoding HIV-1 IN and IN-RRE were constructed based on the reasoning that co-expression of Rev in cells transfected with IN-RRE would increase expression levels of IN. However, in human cells transiently transfected with these expression vectors, little or no expression of IN was detected by either immunofluorescence microscopy or western blotting (Fig. 1). An alternative approach consisted of introducing the constitutive transport element (CTE) of simian retrovirus type 1 behind the integrase gene. Again, expression was barely detectable upon prolonged exposure of the blot and amounted to merely 40 ng per 10 x 106 2o transfected cells. The construction of a C-terminal fusion to green fluorescent protein (GFP) (GFP-IN) resulted in a more pronounced expression of wild-type HIV-1 integrase expressed in mammalian cells (Pluymers et al. (1999), Virology 258:
332). Rev co-expression was not required, in accord with Kukolj et al. (1997), J. Virol.
71: 843-847), who expressed integrase as a C-terminal fusion protein with (3-galactosidase in the absence of Rev. The impact of the INS in the IN gene on protein expression levels was illustrated by a 5-fold decrease in expression levels of the GFP-IN construct compared to the parental GFP (Pluymers et al. (1999), Virology 258: 327-332). The present invention is based on a synthetic gene for HIV-1 integrase with an increased intrinsic mRNA stability. The use of such a synthetic gene resulted in high expression levels and concurrent enzymatic integrase activity as could be demonstrated via complementation tests.
In accordance with the present invention, an integrase gene was synthesised with an increased GC content resulting in high level expression of HIV-1 IN in various mammalian cell lines. The enzyme was shown to complement defective integrase carried by HIV-1-derived vector particles and to act in trans on linear DNA
substrates that are flanked by LTR fragments and encode a reporter gene.
DESIGN AND CONSTRUCTION OF THE SYNTHETIC INTEGRASE GENE
Synthetic genes have been constructed in the past to optimize expression of eukaryotic genes in bacteria based on the knowledge that codon usage in prokaryotes is quite different from that in eukaryotes. HIV (lentiviral) genes are not optimal for high level expression in eukaryotic cells. This is related to the mechanism HIV
uses to to circumvent the mRNA instability, namely Rev. During the replication cycle early mRNA transcripts will be spliced which results in expression of regulatory proteins such as Tat and Rev. Only late in the cycle, does Rev accumulation and Rev-RRE
interaction block splicing and suppress AT-rich instability sequences resulting in unspliced transcripts encoding structural and enzymatic proteins. Whereas the synthetic gene in accordance with the present invention clearly augments protein expression in mammalian cells, which is a prerequisite to detect the functionality of the enzyme in the cell, in the context of replicating HIV the presence of a gene with an increased GC
content may well interfere with the mechanism of regulation of gene expression and be detrimental for viral replication.
2o In accordance with an embodiment of the present invention a synthetic viral gene was designed for better optimized and more efficient expression in mammalian cells. The HIV-1 integrase gene has a GC content of 40% whereas highly expressed human genes on average have a GC content of 55-61%. Hence, the GC content is one aspect of the preferred nucleotide relationship or nucleotide pair frequency in accordance with the present invention. By employing the degenerative nature of the genetic code and selecting for the preferred codon usage in the synthetic gene, the GC
code content of a synthetic gene encoding HIV integrase would be increased up to 66%
without altering the amino acid sequence. However, this is not preferred in accordance with the present invention. First of all, in accordance with the present invention, the 3o choice among the alternative codons was biased in favour of preferred triplets (codons) found in a small group of genes of the total human genome which express well/strongly, e.g. human (3-globin, a-, y-actin and EF2 genes (method of determining the preferred codon usage). In addition the bias was such as to approximate the preferred nucleotide relationship or nucleotide pair frequency, i.e. within the range 53 to 63%, more preferably 55-61% for the GC content. In fact a GC content of 59%
rather than 66% was achieved. The other rules for redesigning retroviral genes for eukaryotic expression are: (i) removal of potential splice sites, (ii) reduction of the number of CpG methylation sites, (iii) introduction of S' and 3'-untranslated regions (UTR) of a mammalian mRNA (in our case from human (3-globin), (iv) addition of an extra N-terminal peptide (Met-Gly for the examples given below) for efficient initiation of translation. As a result expression levels from the synthetic gene in various mammalian cell lines were at least 25-fold higher than from the natural integrase gene.
to Efficient expression was also obtained in yeast (Pichia pastoris) (data not shown).
In accordance with one embodiment of the present invention a gene is provided to achieve high level expression of HIV-1 integrase in human cell lines by maintaining the amino acid sequence of IN from the pNL4-3 clone of HIV-1 while adapting the nucleotide codon usage to the codon usage of constitutively and highly expressed human genes ("preferred codon usage"). A first version of an artificial IN
reading frame was based on random choice between alternative codons at each position using a random number generator, biasing in favour of preferred triplets as found in the human (3-globin, a-, y-actin and EF2 genes. Next, the DNA sequence was substantially edited to remove potential splice sites and to reduce the number of CpG methylation sites, but keeping the overall GC content and codon usage close to optimal ("preferred nucleotide relationship" or "nucleotide pair frequency"). The final version of the synthetic gene (Fig. 2 or SEQ ID NO: l ) contains fragments of the 5'- and 3'-untranslated regions from the (3-globin mRNA. This gene encodes for wild type HIV-1 integrase with addition of the N-terminal Met-Gly dipeptide. The extra glycine codon completes the Kozak's consensus sequence (ANNATGG) required for efficient initiation of translation.
In the synthetic gene the overall GC content is 59% compared to 40% in the wild type.
The gene was constructed from six synthetic DNA fragments, each approximately 1 SO
by long, by stepwise cloning. It should be understood that various homologs of the gene shown in Fig. 2 or SEQ ID NO: l are included within the scope of the present invention.
Reapplication of the random number biasing procedure in accordance with the present invention would generate alternative sequences all of them coding for the same protein and all having a similar preferred nucleotide relationship or nucleotide pair frequency.
All such synthetic gene homologs are included within the scope of the present invention.
The synthetic gene includes modification to those described above, the following modifications and improvements of the synthetic gene are included within the scope of the present invention. For example, the leader peptide can be replaced affecting the efficiency of translation and potential myristoylation (e.g. for example, a Met-Ala variant has been constructed). The 5' and 3'-UTRs may be replaced by UTRs from other mammalian mRNAs to optimize the stability of the transcript.
Mutations in the open reading frame are also included within the scope of the present invention whereby the canonical integrase sequences (e.g. HHCC and DD35E) are preferably left to unchanged. A more soluble version can be made by introducing for example the F 185K/ F 185H mutations. Other mutations may induce increased or altered catalytic activity of the enzyme in the eukaryotic cell. For example, the present invention includes a variant synthetic gene with the D64V mutation, known to reduce drastically the enzymatic activity of integrase. Synthetic genes of integrase are included within the scope of the present invention in which the genetic information of domains of other proteins are added. These domains preferably add additional properties to the enzyme such as sequence specificity in DNA binding. Examples of methods of providing specificity to a gene encoding integrase are described in WO 96/37626, US
5,811,270 without describing the specific innovative aspects of the present invention.
The synthetic gene for HIV-1 integrase was designed to circumvent inhibition of gene expression induced by instability sequences (INS) in the wild type integrase gene. This approach can be applied to retroviral integrases in general. In particular the aforementioned design method may be used to redesign any retroviral viral gene encoding a protein with enzymatic activity for efficient expression in eukaryotes. In particular, the design method of synthetic genes in accordance with the present invention will boost eukaryotic expression for retroviral genes encoding a protein with enzymatic activity, especially lentiviral integrases and pol proteins in general. The particular approach of the present invention could also be applied to redesign gag genes, in which mRNA instability due to the presence of INS elements and not poor translatability, like for env, would be the problem. Although the role of Rev in suppressing the effect of 1NS is only well studied in the case of HIV-l, all other lentiviruses are known to encode proteins analogous to Rev. Likewise the human T-lymphotropic and bovine lymphotropic viruses (HTLVs and BLV) encode Rev.
Simple retroviruses such as Mason-Pfizer monkey virus and simian retrovirus-1 (SRV-1) contain a constitutive transport element (CTE) that promotes nuclear export of unspliced mRNA. It has been shown that CTE can functionally substitute for Rev interacting with RRE. In fact, a low level transient expression from a wild type 5 integrase gene with a downstream CTE of SRV-1 has been obtained by us using the methods of the present invention. Since the design of a synthetic gene in accordance with the present invention abolishes any need for co-expression of Rev and presence of RRE or CTE in the construct, this approach can improve expression of retroviral enzymes in general and integrases in particular.
10 In creating mammalian expression vectors, various eukaryotic expression plasmids can be used. Expression can be under control of a constitutive promoter (for example hCMV and RSV) or an inducible promoter. Examples of (commercially available) inducible expression systems are the ecdysone-inducible and the tetracyclin-inducible (Tet-Off and Tet-On) expression systems. Tissue-specific promoters that 15 limit expression in specific tissues may also be envisaged. Examples are the established neuron-specific promoters Thy-1 and enolase. Inducible promoters may limit cellular toxicity, although a cell line that stably expresses integrase was obtained.
In transgenic non-human animals harbouring the synthetic gene, expression may be induced at a desired moment using an inducible promoter or in desired tissues using a tissue-specific 2o promoter.
Transient and stable expression of HIV-1 integrase in 293T and HeLa cells The synthetic gene for integrase (INS) was cloned into the expression vectors pCEP4 and pBK-RSV under control of the human cytomegalovirus (hCMV) and Rous sarcoma virus (RSV) promoters, respectively. Transient and stable expression of IN
was obtained in both 293T and HeLa cell lines, as verified by immunoblotting (Fig. l, 3) and indirect immunofluorescence (data not shown). In transfected 293T cells the expression levels from the hCMV promoter amounted to 10-20 ~g of IN per 10 x cells which is at least 25-fold higher than obtained with expression vectors that contain 3o the unfused wild type HIV-1 integrase gene.
Transfection of 293T cells with the episomal expression vector pCEP-INs followed by selection with hygromycinB, resulted in a stable cell line, referred to as 293T-lNs. Indirect immunofluorescence staining revealed that 80-90% of selected cells produce integrase at detectable levels. The expression level, as estimated by quantitative immunoblotting, was about 0.5 ~g of integrase per 10 x 106 cells.
The reduced cell growth kinetics of 293T-INs (30-50% as compared to the parental cell line) is suggestive of cellular toxicity of integrase in mammalian cells.
In HeLa cells integrase was found exclusively in the nuclei. In 293T cells transient transfections typically gave rise to an irregular, granular cytoplasmatic distribution of IN, probably due to precipitation of the protein. In the 293T cell line selected to stably express IN, nuclear localization of IN was evident. During the metaphase and anaphase steps of mitosis, IN remained stably associated with the chromosomes.
to Solid phase N-terminal sequencing of integrase purified from transiently transfected 293T cells, revealed the following amino terminus: Gly-Phe-Leu-Asp-Gly-Ile-Asp-Lys. This is the sequence predicted by the synthetic gene, the starting methionine being removed post-translationally.
Functionality of INs Complementation of IN-defective vector particles To verify whether the integrase expressed from the synthetic gene in mammalian cells is enzymatically active, the ability of IN to complement integrase-2o defective HIV-derived lentiviral vectors was tested. HIV-1-derived lentiviral vectors have been developed by Naldini et al. and Zufferey et al. (Naldini et al.
(1996), Science 272: 263-267; Zufferey et al. (1997), Nature Biotechnol. 15: 871-875).
Pseudotyped lentiviral vector particles are produced by transfecting 293T cells with a packaging plasmid encoding viral gag and pol proteins, a plasmid encoding the envelope of vesicular stomatitis virus and a plasmid encoding a reporter gene flanked by two long terminal repeats (LTRs). The first generation packaging plasmid pCMVOR8.2, containing all HIV genes except for env and the transfer vector pHR'-CMVLacZ
were used to produce wild type vector (WT vector). Integrase-defective virus particles (D64V vector) were produced using pCMVOR8.2IN(D64V) (Naldini et al. (1996), 3o Science 272: 263-267). The D64V mutation in the integrase gene is known to abolish integrase activity, without affecting any other step of the infection (Leavitt et al.
(1993), J. Biol. Chem. 268: 2113-2119; Leavitt et al. (1996), J. Virol. 70:
721-728).
The transducing titer of the D64V vector in 293T cells was 20-fold lower than the titer of WT vector (Table 1). This is in good agreement with previously reported results (Naldini et al. (1996), Science 272: 263-267). The observed "background"
expression after D64V transduction, is mostly due to transcription from non-integrated circularized viral DNA since (3-galactosidase expression after D64V transduction is reduced drastically upon passaging the cells (Table 1). Nevertheless, in some of the transduction experiments 1 or 2 galactosidase-positive colonies were observed. A residual transducing activity of D64V virus was observed before (Gaur and Leavitt ( 1998), J.
Virol. 72: 4678-4685). It is possible that this integration is independent of the viral integrase.
to Complemented vectors (C IN) were produced after quadruple transient transfection of producer cells, including pCEP-INS, the expression vector containing the synthetic gene. The transducing activity was restored up to 30% with C IN
(Table 1 ). Complementation was due to stable integration, since an equal proportion of galactosidase-positive colonies was counted after multiple passages of the transduced cells. The principle of trans-complementation of IN-defective virus was shown previously, using VPR-IN fusion expression constructs (Fletcher et al. (1997), EMBO
J. 16: 5123-5138). The transducing activity of catalytic domain mutants of IN
was restored up to 20% by transcomplementation with VPR-IN. However, in the absence of VPR, the expression construct for wild type integrase, only achieved 0.04%
2o complementation efficiency (Fletcher et al. (1997), EMBO J. 16: 5123-5138).
The synthetic gene in accordance with the present invention, in the absence of VPR, results in a complementation activity that is 750-fold more pronounced.
Moreover, evidence for trans-complementing activity of integrase expressed from the synthetic gene in target cells was also obtained (Table 1).
Transduction of IN
expressing 293T cells with IN-defective virus particles, resulted in a higher transduction efficiency as compared with the parental 293T cells. After passaging the transduced cells, the difference became even more pronounced. This points to a catalytic interaction of integrase present in the receptor cell with the pre-integration complex of the incoming vector. For the wild type and the complemented vectors increased transduction efficiencies were obtained as well. This may suggest that the amount of active integrase present in the viral particle is dose-limiting or that integrase present in the target cell neutralizes inhibitory host factors.
Detection of integrase activity using a promoterless reporter gene (DIPR).
Integration of HIV in the chromosome does not show strict sequence-specificity, although a weak consensus was found for the integration sites (Carteau et al.(1998), J. Virol. 72, 4005-4014). It is commonly accepted although not formally proven, that retroviral integration is favored in open chromatin near or within active transcription units (Rohdewohld et al. (1987), J. Virol. 61: 336-343; Scherdin et al.
(1990), J. Virol. 64, 907-912; Vijaya et al. (1986), J. Virol. 60: 683-692;
Carteau et al.
(1998), J. Virol. 72, 4005-4014). The design of a promoterless reporter substrate for measuring integrase activity in cell culture, is based on this finding (Figs.
4A - C). In to accordance with an embodiment of the present invention a method is proposed in which read-through transcription of the integrated promoterless reporter gene will occur when inserted within an actively transcribed region of the chromosome. The construct designed is a linear DNA fragment, flanked by the 200 by terminal fragments of the HIV LTRs that provide the integrase recognition sites. The marker gene may encode luciferase, for instance. The presence of an IRES (internal ribosome entry site) in front of the open reading frame of luciferase, directs cap-independent translation of mRNA
transcripts (Fig. 4A).
After transfection with this DIPR substrate (Fig. 4B, C), luciferase activity measured in 293T-INs cells was always 4 to 10 times higher than in the parental 293T
cells (Table 2). In the DIPR assay activity of the D64V mutant integrase was reduced compared to the wild type integrase (data not shown). These results point to an activity of the intracellularly expressed integrase (expressed by the synthetic gene in accordance with the present invention) (Fig. 4C). Sequencing of integrated linear DNA
molecules in 293T cells transiently expressing integrase from the synthetic gene using Alu-PCR, revealed the characteristic removal of the 3' GT dinucleotide in 10%
of integrants. In control cells not expressing integrase none of the DNA
insertions showed this hallmark.
Applications 3o An embodiment of the present invention includes the construction of an efficient eukaryotic expression vector for a retroviral enzyme, e.g. HIV-1 integrase, based on the creation of a synthetic gene. Expression from the eukaryotic expression vector can be transient after transfection of the plasmid in a eukaryotic cell by any of established transfection procedures. Expression may also be permanent in a cell line stably harbouring the expression vector. An important aspect of the present invention and its applications is the functionality of an expressed retroviral enzymatically active protein, as opposed to mere the high level expression of an enzymatically inactive retroviral protein.
Intracellular integrase test for the evaluation of integrase inhibitors An embodiment of the present invention includes assays for evaluating integrase activity in cells transfected with a DNA substrate that is flanked by fragments l0 of HIV LTR, a so-called mini-HIV. In both assays data point to enzymatic activity of IN.
In DIAS (detection of integrase activity through antibiotic selection) test, a resistance gene to a cytotoxic drug is present in the mini-HIV DNA. The presence of IN in the transfected cell augments stable insertion of the resistance gene in the chromosome. Scoring is performed by comparing the residual number of colonies resistant to the cytotoxic agent in comparison with cells transfected with heterologous DNA.
In DIPR (detection of integrase activity using a promoterless reporter gene), a reporter gene (luciferase) without promoter is present downstream of an internal ribosome entry site (IRES) in mini-HIV (Fig. 4A). The presence of IN in the transfected cell (Fig. 4B) augments stable insertion of the reporter construct in the host chromosome in close proximity to a cellular promoter (Fig. 4C). Scoring is performed by measuring enzyme activity expressed from the promoterless marker gene, e.g.
luciferase. The latter assay is highly amenable to evaluation of integrase inhibitors in cell culture in a microtiter plate format, adaptable for high throughput screening.
Potential integrase inhibitors would result in the absence of or a significant decrease in the level of detectable signal from the promoterless marker gene.
Such an assay in accordance with the present invention involves screening test inhibitory compounds from large libraries of synthetic or natural compounds.
Synthetic compound libraries are commercially available from, for example, Maybridge Chemical Co. (Trevillet, Cornwall, UK), Comgenex (Princeton, NJ), Brandon Associates (Merrimack, NH) and Microsource (New Milford, CT). A rare chemical library is available from Aldrich Chemical Company, Inc. (Milwaukee, WI).
Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant and animal extracts are available from, for example, New Chemical Entities, Pan Laboratories, Bothell, WA or MycoSearch (NC), Chiron, or are readily producible.
Plant extracts may also be obtained form the University of Ghent, Belgium.
5 Additionally, natural and synthetically produced libraries and compounds are readily modified through conventional chemical, physical, and biochemical means.
Tool for non-viral cellular gene delivery.
Cell lines that express integrase from a synthetic gene do have greater 1o propensity to integrate foreign DNA, flanked by LTR fragments. These cell lines are thus more transducible. An embodiment of this invention is the creation of eukaryotic cell lines (or cell culture systems) that are highly transducible (at least 200% compared to the parent cell). Embodiments of the present invention also include applications in transgene technology to increase the efficiency of (non)homologous recombination in 15 ES cells be it by transient expression from a plasmid or after induced expression of the retroviral integrase in ES cells transgenic for the synthetic gene. The synthetic gene in accordance with the present invention may be brought into cells by any transfection agent or method (e.g. electroporation or lipofection) and may result in the stable integration of DNA in the chromosome.
Retroviral (lentiviral) vector packaging construct From the complementation experiment it is clear that integrase expressed from the synthetic gene in the producer cell can complement integrase-defective lentiviral virus particles encoded by a packaging plasmid and thus can substitute for the protein expressed by the packaging construct. It follows that in an expression vector based on one or more synthetic genes) for a lentiviral gag pol gene, the synthetic genes) can substitute for the natural genes) in the packaging constructs resulting in Rev-independent high level protein expression. The present invention includes a packaging construct based on non-lentiviral complex retroviruses in which protein expression is 3o dependent on a Rev homologue such as Rex in the case of HTLV-I. Lentiviral vectors per se, capable of transducing a non-dividing cell, are known in the art (see Naldini et al. (1996), Science 272: 263-267, Zufferey et al. (1997), Nature Biotechnol.
15: 871-875). Generally the vectors are plasmid-based or virus-based, and are configured to carry the essential sequences for incorporating nucleic acid, for selection and for transfer of the nucleic acid in the host cell. Gag, pol and env genes of interest are known in the art. Briefly, a first vector can provide a nucleic acid encoding a viral gag and a viral pol, and a second vector can provide a nucleic acid encoding a viral env gene product to produce a packaging cell. Packaging cells or cell lines supply in trans the proteins necessary for producing infectious virions, themselves being incapable of packaging endogenous viral genomic nucleic acids (Watanabe & Temin (1983), Molec.
Cell Biol. 3(12): 2241-2249; Mann et al. (1983), Cell 33:153-159; Embretson &
Temin (1987), J. Virol. 61(9): 2675-2683). Introducing a vector providing a heterologous to gene, the transfer vector, into such packaging cells yields producer cells which release infectious particles carrying the foreign gene of interest. Methods for transfection or infection are well known by those skilled in the art. After cotransfection of the packaging vectors and the transfer vector to the packaging cell or cell line, the recombinant vector is recovered from the culture media and titered by standard methods used by those of skill in the art.
The foreign or heterologous gene carried by the transfer vector can be any nucleic acid of interest which can be transcribed, but preferably is a nucleic acid encoding for a polypeptide of therapeutic benefit or of interest for gene therapy. The env gene in the (second) packaging vector can be derived from any virus, including 2o retroviruses, and is preferably amphotropic, allowing transduction of cells of human and other species, and is preferably under control of non-endogenous regulatory sequences. Vectors can be made target-specific through linkage of the env protein with an antibody or a ligand for a particular receptor of a particular cell-type (cell-targeting).
Design of the gag-pol synthetic gene is based on a method to circumvent mRNA
instability associated with these wild-type genes. Preferentially, the method used by us to create an expression construct for high level and Rev independent eukaryotic expression of active HIV-1 integrase is employed. Further construction of the vectors of the present invention, whereby natural gag pol genes are replaced respectively by the synthetic genes of the present invention, employ standard ligation and restriction 3o techniques which are well understood in the art (see Maniatis et al, in Molecular Cloning: a Laboratory Manual, Cold Spring Harbor Laboratory, N.Y., 1982).
Biosafety requires measurements to reduce the risk of generating recombinant replication competent retroviruses (RCR) as much as possible. Dividing the packaging functions into two genomes, one which express the gag and pol gene and another expressing the env gene product help to minimize the likelihood of generating RCR.
That approach minimizes the ability for co-packaging and subsequent transfer of the two-genomes, as well as significantly decreases the frequency of recombination due to the presence of three retroviral genomes in the packaging cell to produce RCR.
To render any possible recombinations non-functional, mutations (Danos & Mulligan (1988), Proc Natl. Acad. Sci 85: 6460-6464) or deletions (Bosselman et al.
(1987), Molec. Cell Biol. 7(5):1797-1806; Markowitz et al. (1988), 62(4):1120-1124) can be configured within the undesired gene products. Deletion of the 3'LTR of both 1o packaging constructs will further reduce the likelihood to form functional recombinants. US5,994,136 describes the production of lentiviral vectors with an even more remote possibility of generating replication competent lentiviruses by functionally deleting the tat gene, which is encoding for a regulating protein that promotes viral expression through a transcriptional mechanism. The likelihood of recombination between the transfer vector, that still contains natural genetic information of the lentivirus like part of the gag gene, and synthetic packaging genes will be considerably reduced, further improving the biosafety of the lentiviral vectors. DNA
sequence mismatching, as induced by the replacement of nucleotides in the third position compared to the natural gene, seems to present a considerable barrier to homologous 2o recombination in a wide variety of species. It is therefore also unlikely that contaminating or endogenous HIV virus particles would exchange the natural integrase gene for the synthetic one through recombination.
Experimental Procedures DNA constructs Construction of integrase expression plasmids The open reading frame of IN from the HIV-1 clone HXB2 was PCR
amplified using Pfu DNA polymerase (Stratagene, Cambridge, UK) with the primers 5'-CCCCCAAGCTTGCCAGCCATGTTTTTAGATGGAATAGATAAGG and 5' CCCGCTCGAGGTTTCCTTGAAATATACATATGGTG and subcloned in pCEP4 (Invitrogen, Leek, The Netherlands), resulting in pCEP-IN. The absence of mutations was verified by DNA sequencing. The RRE sequence of HIV-1, clone HXB2, was PCR
amplified using the primers 5'-TTCCGCTCGAGTAGCACCCACCAAGGCAAAGAG
and 5'-TCGCGGATCCAAGGCACAGCAGTGGTGCAAATG. The PCR fragment was subcloned in the sense orientation downstream of the integrase gene in pCEP-IN to produce pCEP-IN-RRE. The CTE sequence (obtained from plasmid pS 12; Taberno et al., 1996, J. Virol. 70: 5998-6011) was cloned in pCEP4 in the correct orientation, followed by the insertion of the integrase gene upstream of the CTE. This resulted in the plasmid pCEP-IN-CTE. The construction of pGFP-IN is explained in Pluymers et al. (1999), (Virology 258: 327-332). The Rev expression plasmid, pEF321-cREV, was provided by Sandoz Forschungs Institut, Vienna, Austria. PCR amplification and 1o plasmid construction employed standard techniques like standard ligation and restriction techniques and conditions which are well understood in the art (see Maniatis et al, in Molecular Cloning: a Laboratory Manual, Cold Spring Harbor Laboratroy, N.Y., 1982).
Assembly of the synthetic gene The restriction sites NheI, PstI, BamHI, NaeI, NarI (indicated in Fig.2 ) divide the sequence of the synthetic gene into 6 fragments each approximately 150 by long that correspond to the sequences 1-149, 144-306, 301-456, 451-623, 618-776, (Fig. 2). Each of the fragments was constructed separately by annealing and extending 2o two partially complementary oligonucleotides (85-95 nt long, PAGE-purified and 5'-phosphorylated, synthesized by Gibco BRL Life Technologies, Merelbeke, Belgium) using Sequenase (Amersham-Pharmacia, Buckinghamshire, UK) . Each fragment was cloned into the EcoRV site of the vector pBluescript KS(+) (Stratagene, La Jolla, CA).
The sequence errors found in the resulting clones were repaired using either the Stratagene Quick Change procedure with Pyrococcus furiosus (Pfu) polymerise (for a base substitution in the fragment 451-623) or PCR (for deletions in the terminal regions of the fragment 1-149). The full 930 by sequence was built by stepwise assembly of the fragments similar to the method described in W098/12207. Choice of the cloning vector (pBluescript KS or SK) at each step was dictated by toxicity of the IN
coding 3o DNA. Finally, the two halves of the IN gene (1-451 and 452-930) were ligated together and cloned into pBluescript KS(+) resulting in plNs.
Construction of mammalian expression vectors for INs The plasmid pINs was digested by EcoRI and treated with T4 DNA polymerase followed by restriction with XhoI. The 1 kb fragment carrying the INs gene was cloned between the PvuII and XhoI sites of pCEP4 (Invitrogen, Leek, The Netherlands) resulting in pCMV-Ins. pCEP4 is an episomal mammalian expression vector containing the human cytomegalovirus (hCMV) immediate early enhancer/promoter. The Epstein Barr virus replication origin (oriP) and nuclear antigen (encoded by the EBNA-1 gene) permit extrachromosomal replication in human, primate and canine cells. A
hygromycin resistance gene is present, permitting selection of stably transduced clones by hygromycinB (GIBCO BRL). The same 1 kb fragment was also cloned between l0 NheI and XhoI sites of the pBK-RSV expression vector (Stratagene) (the NheI
cohesive end of the vector DNA was filled in using T4 DNA polymerase) resulting in pRSV-INs. In this vector expression of INs gene is driven by the promoter of Rous sarcoma virus (RSV). The presence of the neomycin resistance gene allows selection of stably transduced clones by geneticin (G418) (GIBCO BRL).
Construction of the substrate for the DIPR assay The DNA substrate for the DIPR assay was obtained by linearization of pLTR-IRES-Luc with ScaI. This plasmid was constructed in the following way. First, the 350 by KpnI/EcoRI fragment of pU3U5 (Cherepanov et al.(1999), Nucleic Acids Research 27: 2202-2210) containing the terminal U3 and U5 regions of the HXB2 HIV-1 LTRs was cloned between the Kpnl and EcoRI sites of pUC 19 resulting in pUC-LTR.
Then the ScaI site occurnng in the ampicilline resistance gene of pUC 19 was destroyed by partial digestion of pUC-LTR with ScaI and insertion of a fragment containing the kanamicine resistance gene from the Tn5 transposon yielding pUC-LTR-kan.
Finally, 7.5 kb pLTR-IRES-Luc was obtained by cloning the BamHI/PstI fragment of pBIR
(Martinez-Salas et al. (1993), J. Virol. 67, 3748-3755) carrying the IRES-luciferase gene cassette, made blunt with T4 DNA polymerase (Gibco BRL), into the SmaI
site of pUC-LTR-kan.
3o Cell culture HeLa and 293 cells were obtained from American Type Culture Collection.
HeLa and 293 cells were grown in Dulbecco's modified Eagle's medium (DMEM) (GibcoBRL) supplemented with 10 % FCS, 0.12 % (v/w) sodium bicarbonate (GibcoBRL), 2 mM glutamine (GibcoBRL) and 20 ~.g/ml gentamycin (GibcoBRL) at 37oC in 5 % C02 humidified atmosphere. 293T cells (obtained from Dr. O. Danos, Evry, France) express SV40 large T antigen and were grown in DMEM (GibcoBRL) with glutamax supplemented with 10% fetal calf serum, 45 U/ml penicillin G
(Serva, 5 Heidelberg, Germany) and 45 p.g/ml streptomycin sulphate (Sigma-Aldrich, Bornem, Belgium).
293 and 293T cells were transfected using polyethylenimine (PEI) (Abdallah et al. (1997), Hum. Gene Therapy 7:1947-1954). Polyethylenimine Mw ~ 25.000 was from Sigma-Aldrich (Bornem, Belgium). Cells were grown to 50-70 % confluency in 1o DMEM with glucose, glutamax and 10 % fetal calf serum (FCS) (Gibco BRL).
Medium was replaced by medium containing 1 % FCS 3 hours before transfection.
Mixture of DNA and PEI was added to cells in a minimal volume of medium. Next day the medium was changed to DMEM containing 25 mM HEPES. Transformation efficiency obtained in this way was 50-80 %. HeLa cells were routinely transfected by 15 electroporation. The cells were first trypsinized at 80 % confluency and pelleted by low speed centrifugation. The cells were then resuspended at a density of 2 x 106 cells/ml in growth medium; 0.5 ml of this solution was aliquoted into 4 mm cuvettes (Eurogentec, Seraing, Belgium) and 20 qg DNA was added to the cell suspension. After the electric pulse ( 10 qF, 250 V), cells were allowed to rest for 10 min at room temperature before 20 dilution into growth medium..
To establish stable cell lines expressing the INs gene, cells transfected with pRSV-INs or pCMV-1Ns were cultured in the presence of 500 ~g/ml geneticin (G418) or 200 ug/ml hygromicin B (both from GIBCO BRL), respectively. Expression of IN
was assessed by western blotting and/or indirect immunofluorescence.
Western blotting and immunofluorescence For western blotting and indirect immunofluorescence rabbit polyclonal antibodies directed against recombinant His-tagged HIV-1 IN were produced in house and were purified using a 1 HiTrap rProteinA column (Pharmacia Biotech, Uppsala, 3o Sweden) according to established procedures (Ausubel et al. (1987), Current protocols in molecular biology, John Wiley & Sons, New York). Western blotting was performed using PVDF membranes (Bio-Rad), the ECL+ chemiluminescent detection system (Amersham-Pharmacia) and HRP-conjugated goat anti-rabbit antibodies (Bio-rad).
Dilutions used were 1:30000 for the primary antibody and 1:20000 for the secondary antibody. Detection limit was 0.1-0.5 ng of recombinant integrase. Total protein concentration was determined on cells lysed with 1%SDS/1 mM PMSF (Sigma), using the BCA protein assay (Pierce, Illinois USA). For western blot analysis 10 ~g of total protein was evaluated.
For detection of IN expression in situ by indirect immunofluorescence microscopy, cells were grown on glass slides (HeLa cells) or in permanox chamber slides (GIBCO BRL) (293T cells). After 24-48 hrs, cells were washed with phosphate buffered saline (PBS) supplemented with 1 mM Mg2+ and 0.5 mM Ca2+(PBS+), fixed in 100% methanol and blocked with 10% foetal calf serum (FCS) in PBS+.
Incubations with antibodies were carried out at 37~C in blocking solution. The primary antibody (rabbit anti-IN) was diluted 1:20 to 1:80; the secondary FITC-conjugated swine anti-rabbit antibody from Dako (Glostrup, Denmark) was diluted 1:40. Nuclear staining was performed with 1 pg/ml 4', 6-diamidino-2-phenylindole (DAPI) (Sigma) in methanol.
Fluorescence microscopy was performed with a Leitz microscope (Wetzlar, Germany) using filter blocks I2 (FITC) or A (DAPI).
Detection of integrase activity using a promoterless reporter gene (DIPR) 293T and 293T-INs cells were seeded in six-well plates at a density of 106 cells/well 24 hr before transfection. Five pg of DNA was transfected per well using PEI. 48 hr post-transfection, 5 x 105 cells were lysed to determine the luciferase activity using the Luciferase Assay SystemTM (Promega Benelux, Leiden, The Netherlands) and the LumicountTM (Packard, Meriden, CT). The protein concentration of the lysate was determined using the Bradford method (Bio-Rad protein assay, Bio-Rad, Hercules, CA). The relative luciferase activity was calculated by dividing the luminescence values by the protein concentration.
Lentiviral vectors Lentiviral vector production 3o HIV-1-derived vector particles, pseudotyped with the envelope of vesicular stomatitis virus (VSV), were produced by transfecting 293T cells with a packaging plasmid encoding viral gag and pol proteins (pCMVOR8.2), a plasmid encoding the envelope of vesicular stomatitis virus (pMDG) and a plasmid encoding a reporter gene flanked by two long terminal repeats (LTRs) (pHR'-CMVLacZ). The first generation packaging plasmid, containing all HIV genes except. for env, and the transfer vector were a kind gift from Dr. O. Danos (Genethon, France). For transfection of a 10 cm dish of 293T cells, a 700 p.l mixture of three plasmids was made in 150 mM
NaCI: 20 qg of vector plasmid, 10 ~g of packaging construct and 5 pg of envelop plasmid. To this DNA solution 700 ql of a PEI solution (110 p.l of a 10 mM stock solution in 150 mM NaCI) was added slowly. After 15 min at room temperature, the DNA-PEI
complex was added dropwise to the 293T cells in DMEM medium with 1% FCS. After overnight incubation, medium was replaced with medium containing 10% FCS.
to Supernatants were collected from day two to five post-transfection. The vector particles were sedimented by ultracentrifugation in a swinging-bucket rotor (SW27 Beckman, Palo alto, CA) at 25, 000 rpm for 2 hr at 4°C. Pellets were redissolved in PBS resulting in a 100-fold concentration. Different viral stocks were normalized based on p24 antigen content (HIV-1 p24 Core Profile ELISA, DuPont, Dreieich, Germany) for use in complementation assays.
Complementation experiments Integrase-defective virus particles were produced using pCMVOR8.2IN(D64V), obtained from Dr. D. Trono, (Geneva, Switzerland) as packaging plasmid (Naldini et al. (1996), Science 272: 263-267). Complemented vectors were produced by expressing integrase from pCEP-INs in 293T cells after quadruple transient transfection.
Vector preparations were normalized for p24 antigen count Vector was added to target cells in the presence of 2 ~,g/ml polybrene and left overnight. After removal of vector, cells were incubated for an additional 36 hrs. Cells were washed with PBS, fixed with 0.75%
formaldehyde/0.05% glutaraldehyde in PBS, and stained with freshly prepared X-gal substrate (5 mM potassium ferrocyanide, 5 mM potassium ferncyanide, 2 mM MgClz and 100 pg/ml 5-bromo-4-chloro-3-indolyl-(3-D-galactopyranoside (x-gal) (Biotech Trade & Service Gmbh, St. Leon-Rot, Germany) in PBS) at 37°C
overnight. Each transduction experiment was done in duplicate in a 96-well plate. Transduction 3o efficiency was determined by counting the number of blue cells 48 hrs after infection in one of the wells, whereas the cells in the duplicate well were splitted 1:2.
Half of the sample remained in the well and was stained at confluency (passage 1) whereas the other half was cultured in a 48-well plate. At confluency, these cells were again splitted 1:2. Finally, cells were brought in a 24-well plate and grown to confluency (passage 3, dilution 1:8). After staining, the efficiency of stable transduction was measured by counting blue colonies.
Tables Table 1. Complementation of inte~rase-defective lentiviral vector particles Relative transduction efficiency' Cells Passage WT vector D64V C IN
vector # 0 1.00 0.048 0.303 # 3 1.00 0.007 0.320 293T INs # 0 1.565 0.09 0.510 # 3 1.88 0.045 0.75 lTransduction efficiency is determined by counting galactosidase-positive cells (# 0) or colonies of galactosidase-positive cells (# 3) relative to transduction efficiency obtained by WT vector in 293T cells. Results of transduction by WT vector, D64V IN-defective vector and D64V vectors complemented with IN in the producer cells, are shown.
Cells were infected with normalized amounts of vector. Transduction was done both in cells and in 293T cells that are stably expressing IN. Average numbers for two separate experiments are shown.
Table 2. Detection of inte~rase activity usin , a promoterless reporter gene (DIPR) Luciferase activity (Relative units) Experiment Cell line Blanks LTR-IRES- LTR-IRES-Luc Lucb + pCMV-INs 293T-INs 1 487 119 -293T-INs 1 499 38 990 183 aRelative background luciferase activity in cell lines b 293T and 293T-INs were transfected with equal amounts of linearized pLTR-IRES
Luc. under experimental conditions A. In experiment B total DNA concentration was equalized with parental vector pCEP4.
°293T and 293T-INs were transfected with linearized pLTR-IRES-Luc and 2 ~.g of pCMV-INs.
SEQUENCE LISTING
<110> K .U. Leuven Research & Development Debyser, Zeger De Clercq, Erik Cherepanov, Peter Pluymers, Wim <120> A synthetic gene for expression of a retroviral protein with wild type activity in eukaryotic cells <130> K1291-PCT
<140>
<141>
<150> EP99201306.0 <151> 1999-04-26 <150> EP00200171.7 <151> 2000-O1-18 <160> 2 <170> PatentIn Ver. 2.1 <210> 1 <211> 930 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic gene expressing HIV integrase <220>
<221> CDS
<222> (27)..(899) <220>
<221> misc_signal <222> (24)..(30) <223> Kozak sequence <400> 1 atcactagca acctcaaaca gacacc atg gga ttc ctg gac ggc att gac aag 53 Met Gly Phe Leu Asp Gly Ile Asp Lys get cag gag gag cac gag aag tac cac tcg aat tgg cgg gcc atg gcc 101 Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala Met Ala tcc gac ttc aac ctg cca ccc gtc gtc get aag gag atc gtt get agc 149 Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val Ala Ser tgc gac aag tgc cag ctg aaa ggc gag get atg cac ggg cag gtt gat 197 Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln Val Asp tgc tct ccc ggc atc tgg cag ctc gac tgt act cac ctg gag ggc aag 245 Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu Gly Lys gtc atc ctg gtc gcc gtg cac gtg gcc tct ggt tac atc gag get gag 293 Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu Ala Glu gtc atc cct gca gag act ggc cag gag act gcc tat ttc ctg ctg aaa 341 Val Ile Pro Ala Glu Thr Gly Gln Glu Thr Ala Tyr Phe Leu Leu Lys g0 95 100 105 ctg gcc ggc cgg tgg cct gtg aag aca gtg cac aca gat aac ggc tcc 389 Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn Gly Ser aac ttc acc tcc acc act gtg aag get gcc tgc tgg tgg get ggg atc 437 Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala Gly Ile aag cag gag ttc ggg atc ccc tat aac cca cag tct cag ggc gtg atc 485 Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly Val Ile gaa tcc atg aac aag gag ctg aag aag atc atc ggc cag gtt cgg gac 533 Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val Arg Asp cag gca gag cac ctg aag act gca gtg cag atg gcc gtg ttc atc cac 581 Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe Ile His aac ttc aag cga aag ggc ggc atc ggt ggc tac tca gcc ggc gag cgg 629 Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly Glu Arg atc gtg gac atc atc gcc act gac atc cag acc aaa gag ctg cag aag 677 Ile Val Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu Gln Lys cag atc acc aag atc cag aac ttc cgt gtg tac tac cgg gac tcc cgg 725 Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp Ser Arg gac cct gtg tgg aag ggc cct gcc aag ctg ctg tgg aag ggc gag ggc 773 Asp Pro Val Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly Glu Gly gcc gtg gtc att cag gac aac tct gac atc aag gtt gtg ccc agg cgc 821 Ala Val Val Ile Gln Asp Asn Ser Asp Ile Lys Val Val Pro Arg Arg aag gcc aag att atc cgg gac tac ggc aag cag atg get ggc gac gac 869 Lys A1a Lys Ile Ile Arg Rsp Tyr Gly Lys G1n Met Ala Gly Asp Asp tgt gtg gcc tct cgt caa gat gag gac taa gtccaactac taaactgggg 919 Cys Val Ala Ser Arg Gln Asp Glu Asp gatattatga t 930 <210> 2 <211> 290 <212> PRT
<213> Artificial Sequence <223> Description of Artificial Sequence: synthetic gene expressing HIV integrase <400> 2 Met Gly Phe Leu Asp Gly Ile Asp Lys Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val Ala Ser Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln Val Asp Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu Gly Lys Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu Ala Glu Val Ile Pro Ala Glu Thr Gly g5 90 95 Gln Glu Thr Ala Tyr Phe Leu Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn Gly Ser Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala Gly Ile Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly Val Ile Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val Arg Asp Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe Ile His Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly Glu Arg Ile Val Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu Gln Lys Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp Ser Arg Asp Pro Val Trp Lys Gly Pro 225 230. 235 240 Ala Lys Leu Leu Trp Lys Gly Glu Gly Ala Val Val Ile Gln Asp Asn Ser Asp Ile Lys Val Val Pro Arg Arg Lys Ala Lys Ile Ile Arg Asp Tyr Gly Lys Gln Met Ala Gly Asp Asp Cys Val Ala Ser Arg Gln Asp Glu Asp
Further, it is not obvious that integrase expressed as such will interact with artificial DNA substrates (see DIPR below). One aspect of the present invention is dissecting the preintegration complex to obtain a simple integrase-linear DNA interaction.
One embodiment of the present invention is a method to detect and utilize the enzymatic activity of a retroviral, in particular a lentiviral enzyme, in particular integrase by itself in a eukaryotic cell.
Initially eukaryotic expression vectors encoding HIV-1 IN and IN-RRE were constructed based on the reasoning that co-expression of Rev in cells transfected with IN-RRE would increase expression levels of IN. However, in human cells transiently transfected with these expression vectors, little or no expression of IN was detected by either immunofluorescence microscopy or western blotting (Fig. 1). An alternative approach consisted of introducing the constitutive transport element (CTE) of simian retrovirus type 1 behind the integrase gene. Again, expression was barely detectable upon prolonged exposure of the blot and amounted to merely 40 ng per 10 x 106 2o transfected cells. The construction of a C-terminal fusion to green fluorescent protein (GFP) (GFP-IN) resulted in a more pronounced expression of wild-type HIV-1 integrase expressed in mammalian cells (Pluymers et al. (1999), Virology 258:
332). Rev co-expression was not required, in accord with Kukolj et al. (1997), J. Virol.
71: 843-847), who expressed integrase as a C-terminal fusion protein with (3-galactosidase in the absence of Rev. The impact of the INS in the IN gene on protein expression levels was illustrated by a 5-fold decrease in expression levels of the GFP-IN construct compared to the parental GFP (Pluymers et al. (1999), Virology 258: 327-332). The present invention is based on a synthetic gene for HIV-1 integrase with an increased intrinsic mRNA stability. The use of such a synthetic gene resulted in high expression levels and concurrent enzymatic integrase activity as could be demonstrated via complementation tests.
In accordance with the present invention, an integrase gene was synthesised with an increased GC content resulting in high level expression of HIV-1 IN in various mammalian cell lines. The enzyme was shown to complement defective integrase carried by HIV-1-derived vector particles and to act in trans on linear DNA
substrates that are flanked by LTR fragments and encode a reporter gene.
DESIGN AND CONSTRUCTION OF THE SYNTHETIC INTEGRASE GENE
Synthetic genes have been constructed in the past to optimize expression of eukaryotic genes in bacteria based on the knowledge that codon usage in prokaryotes is quite different from that in eukaryotes. HIV (lentiviral) genes are not optimal for high level expression in eukaryotic cells. This is related to the mechanism HIV
uses to to circumvent the mRNA instability, namely Rev. During the replication cycle early mRNA transcripts will be spliced which results in expression of regulatory proteins such as Tat and Rev. Only late in the cycle, does Rev accumulation and Rev-RRE
interaction block splicing and suppress AT-rich instability sequences resulting in unspliced transcripts encoding structural and enzymatic proteins. Whereas the synthetic gene in accordance with the present invention clearly augments protein expression in mammalian cells, which is a prerequisite to detect the functionality of the enzyme in the cell, in the context of replicating HIV the presence of a gene with an increased GC
content may well interfere with the mechanism of regulation of gene expression and be detrimental for viral replication.
2o In accordance with an embodiment of the present invention a synthetic viral gene was designed for better optimized and more efficient expression in mammalian cells. The HIV-1 integrase gene has a GC content of 40% whereas highly expressed human genes on average have a GC content of 55-61%. Hence, the GC content is one aspect of the preferred nucleotide relationship or nucleotide pair frequency in accordance with the present invention. By employing the degenerative nature of the genetic code and selecting for the preferred codon usage in the synthetic gene, the GC
code content of a synthetic gene encoding HIV integrase would be increased up to 66%
without altering the amino acid sequence. However, this is not preferred in accordance with the present invention. First of all, in accordance with the present invention, the 3o choice among the alternative codons was biased in favour of preferred triplets (codons) found in a small group of genes of the total human genome which express well/strongly, e.g. human (3-globin, a-, y-actin and EF2 genes (method of determining the preferred codon usage). In addition the bias was such as to approximate the preferred nucleotide relationship or nucleotide pair frequency, i.e. within the range 53 to 63%, more preferably 55-61% for the GC content. In fact a GC content of 59%
rather than 66% was achieved. The other rules for redesigning retroviral genes for eukaryotic expression are: (i) removal of potential splice sites, (ii) reduction of the number of CpG methylation sites, (iii) introduction of S' and 3'-untranslated regions (UTR) of a mammalian mRNA (in our case from human (3-globin), (iv) addition of an extra N-terminal peptide (Met-Gly for the examples given below) for efficient initiation of translation. As a result expression levels from the synthetic gene in various mammalian cell lines were at least 25-fold higher than from the natural integrase gene.
to Efficient expression was also obtained in yeast (Pichia pastoris) (data not shown).
In accordance with one embodiment of the present invention a gene is provided to achieve high level expression of HIV-1 integrase in human cell lines by maintaining the amino acid sequence of IN from the pNL4-3 clone of HIV-1 while adapting the nucleotide codon usage to the codon usage of constitutively and highly expressed human genes ("preferred codon usage"). A first version of an artificial IN
reading frame was based on random choice between alternative codons at each position using a random number generator, biasing in favour of preferred triplets as found in the human (3-globin, a-, y-actin and EF2 genes. Next, the DNA sequence was substantially edited to remove potential splice sites and to reduce the number of CpG methylation sites, but keeping the overall GC content and codon usage close to optimal ("preferred nucleotide relationship" or "nucleotide pair frequency"). The final version of the synthetic gene (Fig. 2 or SEQ ID NO: l ) contains fragments of the 5'- and 3'-untranslated regions from the (3-globin mRNA. This gene encodes for wild type HIV-1 integrase with addition of the N-terminal Met-Gly dipeptide. The extra glycine codon completes the Kozak's consensus sequence (ANNATGG) required for efficient initiation of translation.
In the synthetic gene the overall GC content is 59% compared to 40% in the wild type.
The gene was constructed from six synthetic DNA fragments, each approximately 1 SO
by long, by stepwise cloning. It should be understood that various homologs of the gene shown in Fig. 2 or SEQ ID NO: l are included within the scope of the present invention.
Reapplication of the random number biasing procedure in accordance with the present invention would generate alternative sequences all of them coding for the same protein and all having a similar preferred nucleotide relationship or nucleotide pair frequency.
All such synthetic gene homologs are included within the scope of the present invention.
The synthetic gene includes modification to those described above, the following modifications and improvements of the synthetic gene are included within the scope of the present invention. For example, the leader peptide can be replaced affecting the efficiency of translation and potential myristoylation (e.g. for example, a Met-Ala variant has been constructed). The 5' and 3'-UTRs may be replaced by UTRs from other mammalian mRNAs to optimize the stability of the transcript.
Mutations in the open reading frame are also included within the scope of the present invention whereby the canonical integrase sequences (e.g. HHCC and DD35E) are preferably left to unchanged. A more soluble version can be made by introducing for example the F 185K/ F 185H mutations. Other mutations may induce increased or altered catalytic activity of the enzyme in the eukaryotic cell. For example, the present invention includes a variant synthetic gene with the D64V mutation, known to reduce drastically the enzymatic activity of integrase. Synthetic genes of integrase are included within the scope of the present invention in which the genetic information of domains of other proteins are added. These domains preferably add additional properties to the enzyme such as sequence specificity in DNA binding. Examples of methods of providing specificity to a gene encoding integrase are described in WO 96/37626, US
5,811,270 without describing the specific innovative aspects of the present invention.
The synthetic gene for HIV-1 integrase was designed to circumvent inhibition of gene expression induced by instability sequences (INS) in the wild type integrase gene. This approach can be applied to retroviral integrases in general. In particular the aforementioned design method may be used to redesign any retroviral viral gene encoding a protein with enzymatic activity for efficient expression in eukaryotes. In particular, the design method of synthetic genes in accordance with the present invention will boost eukaryotic expression for retroviral genes encoding a protein with enzymatic activity, especially lentiviral integrases and pol proteins in general. The particular approach of the present invention could also be applied to redesign gag genes, in which mRNA instability due to the presence of INS elements and not poor translatability, like for env, would be the problem. Although the role of Rev in suppressing the effect of 1NS is only well studied in the case of HIV-l, all other lentiviruses are known to encode proteins analogous to Rev. Likewise the human T-lymphotropic and bovine lymphotropic viruses (HTLVs and BLV) encode Rev.
Simple retroviruses such as Mason-Pfizer monkey virus and simian retrovirus-1 (SRV-1) contain a constitutive transport element (CTE) that promotes nuclear export of unspliced mRNA. It has been shown that CTE can functionally substitute for Rev interacting with RRE. In fact, a low level transient expression from a wild type 5 integrase gene with a downstream CTE of SRV-1 has been obtained by us using the methods of the present invention. Since the design of a synthetic gene in accordance with the present invention abolishes any need for co-expression of Rev and presence of RRE or CTE in the construct, this approach can improve expression of retroviral enzymes in general and integrases in particular.
10 In creating mammalian expression vectors, various eukaryotic expression plasmids can be used. Expression can be under control of a constitutive promoter (for example hCMV and RSV) or an inducible promoter. Examples of (commercially available) inducible expression systems are the ecdysone-inducible and the tetracyclin-inducible (Tet-Off and Tet-On) expression systems. Tissue-specific promoters that 15 limit expression in specific tissues may also be envisaged. Examples are the established neuron-specific promoters Thy-1 and enolase. Inducible promoters may limit cellular toxicity, although a cell line that stably expresses integrase was obtained.
In transgenic non-human animals harbouring the synthetic gene, expression may be induced at a desired moment using an inducible promoter or in desired tissues using a tissue-specific 2o promoter.
Transient and stable expression of HIV-1 integrase in 293T and HeLa cells The synthetic gene for integrase (INS) was cloned into the expression vectors pCEP4 and pBK-RSV under control of the human cytomegalovirus (hCMV) and Rous sarcoma virus (RSV) promoters, respectively. Transient and stable expression of IN
was obtained in both 293T and HeLa cell lines, as verified by immunoblotting (Fig. l, 3) and indirect immunofluorescence (data not shown). In transfected 293T cells the expression levels from the hCMV promoter amounted to 10-20 ~g of IN per 10 x cells which is at least 25-fold higher than obtained with expression vectors that contain 3o the unfused wild type HIV-1 integrase gene.
Transfection of 293T cells with the episomal expression vector pCEP-INs followed by selection with hygromycinB, resulted in a stable cell line, referred to as 293T-lNs. Indirect immunofluorescence staining revealed that 80-90% of selected cells produce integrase at detectable levels. The expression level, as estimated by quantitative immunoblotting, was about 0.5 ~g of integrase per 10 x 106 cells.
The reduced cell growth kinetics of 293T-INs (30-50% as compared to the parental cell line) is suggestive of cellular toxicity of integrase in mammalian cells.
In HeLa cells integrase was found exclusively in the nuclei. In 293T cells transient transfections typically gave rise to an irregular, granular cytoplasmatic distribution of IN, probably due to precipitation of the protein. In the 293T cell line selected to stably express IN, nuclear localization of IN was evident. During the metaphase and anaphase steps of mitosis, IN remained stably associated with the chromosomes.
to Solid phase N-terminal sequencing of integrase purified from transiently transfected 293T cells, revealed the following amino terminus: Gly-Phe-Leu-Asp-Gly-Ile-Asp-Lys. This is the sequence predicted by the synthetic gene, the starting methionine being removed post-translationally.
Functionality of INs Complementation of IN-defective vector particles To verify whether the integrase expressed from the synthetic gene in mammalian cells is enzymatically active, the ability of IN to complement integrase-2o defective HIV-derived lentiviral vectors was tested. HIV-1-derived lentiviral vectors have been developed by Naldini et al. and Zufferey et al. (Naldini et al.
(1996), Science 272: 263-267; Zufferey et al. (1997), Nature Biotechnol. 15: 871-875).
Pseudotyped lentiviral vector particles are produced by transfecting 293T cells with a packaging plasmid encoding viral gag and pol proteins, a plasmid encoding the envelope of vesicular stomatitis virus and a plasmid encoding a reporter gene flanked by two long terminal repeats (LTRs). The first generation packaging plasmid pCMVOR8.2, containing all HIV genes except for env and the transfer vector pHR'-CMVLacZ
were used to produce wild type vector (WT vector). Integrase-defective virus particles (D64V vector) were produced using pCMVOR8.2IN(D64V) (Naldini et al. (1996), 3o Science 272: 263-267). The D64V mutation in the integrase gene is known to abolish integrase activity, without affecting any other step of the infection (Leavitt et al.
(1993), J. Biol. Chem. 268: 2113-2119; Leavitt et al. (1996), J. Virol. 70:
721-728).
The transducing titer of the D64V vector in 293T cells was 20-fold lower than the titer of WT vector (Table 1). This is in good agreement with previously reported results (Naldini et al. (1996), Science 272: 263-267). The observed "background"
expression after D64V transduction, is mostly due to transcription from non-integrated circularized viral DNA since (3-galactosidase expression after D64V transduction is reduced drastically upon passaging the cells (Table 1). Nevertheless, in some of the transduction experiments 1 or 2 galactosidase-positive colonies were observed. A residual transducing activity of D64V virus was observed before (Gaur and Leavitt ( 1998), J.
Virol. 72: 4678-4685). It is possible that this integration is independent of the viral integrase.
to Complemented vectors (C IN) were produced after quadruple transient transfection of producer cells, including pCEP-INS, the expression vector containing the synthetic gene. The transducing activity was restored up to 30% with C IN
(Table 1 ). Complementation was due to stable integration, since an equal proportion of galactosidase-positive colonies was counted after multiple passages of the transduced cells. The principle of trans-complementation of IN-defective virus was shown previously, using VPR-IN fusion expression constructs (Fletcher et al. (1997), EMBO
J. 16: 5123-5138). The transducing activity of catalytic domain mutants of IN
was restored up to 20% by transcomplementation with VPR-IN. However, in the absence of VPR, the expression construct for wild type integrase, only achieved 0.04%
2o complementation efficiency (Fletcher et al. (1997), EMBO J. 16: 5123-5138).
The synthetic gene in accordance with the present invention, in the absence of VPR, results in a complementation activity that is 750-fold more pronounced.
Moreover, evidence for trans-complementing activity of integrase expressed from the synthetic gene in target cells was also obtained (Table 1).
Transduction of IN
expressing 293T cells with IN-defective virus particles, resulted in a higher transduction efficiency as compared with the parental 293T cells. After passaging the transduced cells, the difference became even more pronounced. This points to a catalytic interaction of integrase present in the receptor cell with the pre-integration complex of the incoming vector. For the wild type and the complemented vectors increased transduction efficiencies were obtained as well. This may suggest that the amount of active integrase present in the viral particle is dose-limiting or that integrase present in the target cell neutralizes inhibitory host factors.
Detection of integrase activity using a promoterless reporter gene (DIPR).
Integration of HIV in the chromosome does not show strict sequence-specificity, although a weak consensus was found for the integration sites (Carteau et al.(1998), J. Virol. 72, 4005-4014). It is commonly accepted although not formally proven, that retroviral integration is favored in open chromatin near or within active transcription units (Rohdewohld et al. (1987), J. Virol. 61: 336-343; Scherdin et al.
(1990), J. Virol. 64, 907-912; Vijaya et al. (1986), J. Virol. 60: 683-692;
Carteau et al.
(1998), J. Virol. 72, 4005-4014). The design of a promoterless reporter substrate for measuring integrase activity in cell culture, is based on this finding (Figs.
4A - C). In to accordance with an embodiment of the present invention a method is proposed in which read-through transcription of the integrated promoterless reporter gene will occur when inserted within an actively transcribed region of the chromosome. The construct designed is a linear DNA fragment, flanked by the 200 by terminal fragments of the HIV LTRs that provide the integrase recognition sites. The marker gene may encode luciferase, for instance. The presence of an IRES (internal ribosome entry site) in front of the open reading frame of luciferase, directs cap-independent translation of mRNA
transcripts (Fig. 4A).
After transfection with this DIPR substrate (Fig. 4B, C), luciferase activity measured in 293T-INs cells was always 4 to 10 times higher than in the parental 293T
cells (Table 2). In the DIPR assay activity of the D64V mutant integrase was reduced compared to the wild type integrase (data not shown). These results point to an activity of the intracellularly expressed integrase (expressed by the synthetic gene in accordance with the present invention) (Fig. 4C). Sequencing of integrated linear DNA
molecules in 293T cells transiently expressing integrase from the synthetic gene using Alu-PCR, revealed the characteristic removal of the 3' GT dinucleotide in 10%
of integrants. In control cells not expressing integrase none of the DNA
insertions showed this hallmark.
Applications 3o An embodiment of the present invention includes the construction of an efficient eukaryotic expression vector for a retroviral enzyme, e.g. HIV-1 integrase, based on the creation of a synthetic gene. Expression from the eukaryotic expression vector can be transient after transfection of the plasmid in a eukaryotic cell by any of established transfection procedures. Expression may also be permanent in a cell line stably harbouring the expression vector. An important aspect of the present invention and its applications is the functionality of an expressed retroviral enzymatically active protein, as opposed to mere the high level expression of an enzymatically inactive retroviral protein.
Intracellular integrase test for the evaluation of integrase inhibitors An embodiment of the present invention includes assays for evaluating integrase activity in cells transfected with a DNA substrate that is flanked by fragments l0 of HIV LTR, a so-called mini-HIV. In both assays data point to enzymatic activity of IN.
In DIAS (detection of integrase activity through antibiotic selection) test, a resistance gene to a cytotoxic drug is present in the mini-HIV DNA. The presence of IN in the transfected cell augments stable insertion of the resistance gene in the chromosome. Scoring is performed by comparing the residual number of colonies resistant to the cytotoxic agent in comparison with cells transfected with heterologous DNA.
In DIPR (detection of integrase activity using a promoterless reporter gene), a reporter gene (luciferase) without promoter is present downstream of an internal ribosome entry site (IRES) in mini-HIV (Fig. 4A). The presence of IN in the transfected cell (Fig. 4B) augments stable insertion of the reporter construct in the host chromosome in close proximity to a cellular promoter (Fig. 4C). Scoring is performed by measuring enzyme activity expressed from the promoterless marker gene, e.g.
luciferase. The latter assay is highly amenable to evaluation of integrase inhibitors in cell culture in a microtiter plate format, adaptable for high throughput screening.
Potential integrase inhibitors would result in the absence of or a significant decrease in the level of detectable signal from the promoterless marker gene.
Such an assay in accordance with the present invention involves screening test inhibitory compounds from large libraries of synthetic or natural compounds.
Synthetic compound libraries are commercially available from, for example, Maybridge Chemical Co. (Trevillet, Cornwall, UK), Comgenex (Princeton, NJ), Brandon Associates (Merrimack, NH) and Microsource (New Milford, CT). A rare chemical library is available from Aldrich Chemical Company, Inc. (Milwaukee, WI).
Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant and animal extracts are available from, for example, New Chemical Entities, Pan Laboratories, Bothell, WA or MycoSearch (NC), Chiron, or are readily producible.
Plant extracts may also be obtained form the University of Ghent, Belgium.
5 Additionally, natural and synthetically produced libraries and compounds are readily modified through conventional chemical, physical, and biochemical means.
Tool for non-viral cellular gene delivery.
Cell lines that express integrase from a synthetic gene do have greater 1o propensity to integrate foreign DNA, flanked by LTR fragments. These cell lines are thus more transducible. An embodiment of this invention is the creation of eukaryotic cell lines (or cell culture systems) that are highly transducible (at least 200% compared to the parent cell). Embodiments of the present invention also include applications in transgene technology to increase the efficiency of (non)homologous recombination in 15 ES cells be it by transient expression from a plasmid or after induced expression of the retroviral integrase in ES cells transgenic for the synthetic gene. The synthetic gene in accordance with the present invention may be brought into cells by any transfection agent or method (e.g. electroporation or lipofection) and may result in the stable integration of DNA in the chromosome.
Retroviral (lentiviral) vector packaging construct From the complementation experiment it is clear that integrase expressed from the synthetic gene in the producer cell can complement integrase-defective lentiviral virus particles encoded by a packaging plasmid and thus can substitute for the protein expressed by the packaging construct. It follows that in an expression vector based on one or more synthetic genes) for a lentiviral gag pol gene, the synthetic genes) can substitute for the natural genes) in the packaging constructs resulting in Rev-independent high level protein expression. The present invention includes a packaging construct based on non-lentiviral complex retroviruses in which protein expression is 3o dependent on a Rev homologue such as Rex in the case of HTLV-I. Lentiviral vectors per se, capable of transducing a non-dividing cell, are known in the art (see Naldini et al. (1996), Science 272: 263-267, Zufferey et al. (1997), Nature Biotechnol.
15: 871-875). Generally the vectors are plasmid-based or virus-based, and are configured to carry the essential sequences for incorporating nucleic acid, for selection and for transfer of the nucleic acid in the host cell. Gag, pol and env genes of interest are known in the art. Briefly, a first vector can provide a nucleic acid encoding a viral gag and a viral pol, and a second vector can provide a nucleic acid encoding a viral env gene product to produce a packaging cell. Packaging cells or cell lines supply in trans the proteins necessary for producing infectious virions, themselves being incapable of packaging endogenous viral genomic nucleic acids (Watanabe & Temin (1983), Molec.
Cell Biol. 3(12): 2241-2249; Mann et al. (1983), Cell 33:153-159; Embretson &
Temin (1987), J. Virol. 61(9): 2675-2683). Introducing a vector providing a heterologous to gene, the transfer vector, into such packaging cells yields producer cells which release infectious particles carrying the foreign gene of interest. Methods for transfection or infection are well known by those skilled in the art. After cotransfection of the packaging vectors and the transfer vector to the packaging cell or cell line, the recombinant vector is recovered from the culture media and titered by standard methods used by those of skill in the art.
The foreign or heterologous gene carried by the transfer vector can be any nucleic acid of interest which can be transcribed, but preferably is a nucleic acid encoding for a polypeptide of therapeutic benefit or of interest for gene therapy. The env gene in the (second) packaging vector can be derived from any virus, including 2o retroviruses, and is preferably amphotropic, allowing transduction of cells of human and other species, and is preferably under control of non-endogenous regulatory sequences. Vectors can be made target-specific through linkage of the env protein with an antibody or a ligand for a particular receptor of a particular cell-type (cell-targeting).
Design of the gag-pol synthetic gene is based on a method to circumvent mRNA
instability associated with these wild-type genes. Preferentially, the method used by us to create an expression construct for high level and Rev independent eukaryotic expression of active HIV-1 integrase is employed. Further construction of the vectors of the present invention, whereby natural gag pol genes are replaced respectively by the synthetic genes of the present invention, employ standard ligation and restriction 3o techniques which are well understood in the art (see Maniatis et al, in Molecular Cloning: a Laboratory Manual, Cold Spring Harbor Laboratory, N.Y., 1982).
Biosafety requires measurements to reduce the risk of generating recombinant replication competent retroviruses (RCR) as much as possible. Dividing the packaging functions into two genomes, one which express the gag and pol gene and another expressing the env gene product help to minimize the likelihood of generating RCR.
That approach minimizes the ability for co-packaging and subsequent transfer of the two-genomes, as well as significantly decreases the frequency of recombination due to the presence of three retroviral genomes in the packaging cell to produce RCR.
To render any possible recombinations non-functional, mutations (Danos & Mulligan (1988), Proc Natl. Acad. Sci 85: 6460-6464) or deletions (Bosselman et al.
(1987), Molec. Cell Biol. 7(5):1797-1806; Markowitz et al. (1988), 62(4):1120-1124) can be configured within the undesired gene products. Deletion of the 3'LTR of both 1o packaging constructs will further reduce the likelihood to form functional recombinants. US5,994,136 describes the production of lentiviral vectors with an even more remote possibility of generating replication competent lentiviruses by functionally deleting the tat gene, which is encoding for a regulating protein that promotes viral expression through a transcriptional mechanism. The likelihood of recombination between the transfer vector, that still contains natural genetic information of the lentivirus like part of the gag gene, and synthetic packaging genes will be considerably reduced, further improving the biosafety of the lentiviral vectors. DNA
sequence mismatching, as induced by the replacement of nucleotides in the third position compared to the natural gene, seems to present a considerable barrier to homologous 2o recombination in a wide variety of species. It is therefore also unlikely that contaminating or endogenous HIV virus particles would exchange the natural integrase gene for the synthetic one through recombination.
Experimental Procedures DNA constructs Construction of integrase expression plasmids The open reading frame of IN from the HIV-1 clone HXB2 was PCR
amplified using Pfu DNA polymerase (Stratagene, Cambridge, UK) with the primers 5'-CCCCCAAGCTTGCCAGCCATGTTTTTAGATGGAATAGATAAGG and 5' CCCGCTCGAGGTTTCCTTGAAATATACATATGGTG and subcloned in pCEP4 (Invitrogen, Leek, The Netherlands), resulting in pCEP-IN. The absence of mutations was verified by DNA sequencing. The RRE sequence of HIV-1, clone HXB2, was PCR
amplified using the primers 5'-TTCCGCTCGAGTAGCACCCACCAAGGCAAAGAG
and 5'-TCGCGGATCCAAGGCACAGCAGTGGTGCAAATG. The PCR fragment was subcloned in the sense orientation downstream of the integrase gene in pCEP-IN to produce pCEP-IN-RRE. The CTE sequence (obtained from plasmid pS 12; Taberno et al., 1996, J. Virol. 70: 5998-6011) was cloned in pCEP4 in the correct orientation, followed by the insertion of the integrase gene upstream of the CTE. This resulted in the plasmid pCEP-IN-CTE. The construction of pGFP-IN is explained in Pluymers et al. (1999), (Virology 258: 327-332). The Rev expression plasmid, pEF321-cREV, was provided by Sandoz Forschungs Institut, Vienna, Austria. PCR amplification and 1o plasmid construction employed standard techniques like standard ligation and restriction techniques and conditions which are well understood in the art (see Maniatis et al, in Molecular Cloning: a Laboratory Manual, Cold Spring Harbor Laboratroy, N.Y., 1982).
Assembly of the synthetic gene The restriction sites NheI, PstI, BamHI, NaeI, NarI (indicated in Fig.2 ) divide the sequence of the synthetic gene into 6 fragments each approximately 150 by long that correspond to the sequences 1-149, 144-306, 301-456, 451-623, 618-776, (Fig. 2). Each of the fragments was constructed separately by annealing and extending 2o two partially complementary oligonucleotides (85-95 nt long, PAGE-purified and 5'-phosphorylated, synthesized by Gibco BRL Life Technologies, Merelbeke, Belgium) using Sequenase (Amersham-Pharmacia, Buckinghamshire, UK) . Each fragment was cloned into the EcoRV site of the vector pBluescript KS(+) (Stratagene, La Jolla, CA).
The sequence errors found in the resulting clones were repaired using either the Stratagene Quick Change procedure with Pyrococcus furiosus (Pfu) polymerise (for a base substitution in the fragment 451-623) or PCR (for deletions in the terminal regions of the fragment 1-149). The full 930 by sequence was built by stepwise assembly of the fragments similar to the method described in W098/12207. Choice of the cloning vector (pBluescript KS or SK) at each step was dictated by toxicity of the IN
coding 3o DNA. Finally, the two halves of the IN gene (1-451 and 452-930) were ligated together and cloned into pBluescript KS(+) resulting in plNs.
Construction of mammalian expression vectors for INs The plasmid pINs was digested by EcoRI and treated with T4 DNA polymerase followed by restriction with XhoI. The 1 kb fragment carrying the INs gene was cloned between the PvuII and XhoI sites of pCEP4 (Invitrogen, Leek, The Netherlands) resulting in pCMV-Ins. pCEP4 is an episomal mammalian expression vector containing the human cytomegalovirus (hCMV) immediate early enhancer/promoter. The Epstein Barr virus replication origin (oriP) and nuclear antigen (encoded by the EBNA-1 gene) permit extrachromosomal replication in human, primate and canine cells. A
hygromycin resistance gene is present, permitting selection of stably transduced clones by hygromycinB (GIBCO BRL). The same 1 kb fragment was also cloned between l0 NheI and XhoI sites of the pBK-RSV expression vector (Stratagene) (the NheI
cohesive end of the vector DNA was filled in using T4 DNA polymerase) resulting in pRSV-INs. In this vector expression of INs gene is driven by the promoter of Rous sarcoma virus (RSV). The presence of the neomycin resistance gene allows selection of stably transduced clones by geneticin (G418) (GIBCO BRL).
Construction of the substrate for the DIPR assay The DNA substrate for the DIPR assay was obtained by linearization of pLTR-IRES-Luc with ScaI. This plasmid was constructed in the following way. First, the 350 by KpnI/EcoRI fragment of pU3U5 (Cherepanov et al.(1999), Nucleic Acids Research 27: 2202-2210) containing the terminal U3 and U5 regions of the HXB2 HIV-1 LTRs was cloned between the Kpnl and EcoRI sites of pUC 19 resulting in pUC-LTR.
Then the ScaI site occurnng in the ampicilline resistance gene of pUC 19 was destroyed by partial digestion of pUC-LTR with ScaI and insertion of a fragment containing the kanamicine resistance gene from the Tn5 transposon yielding pUC-LTR-kan.
Finally, 7.5 kb pLTR-IRES-Luc was obtained by cloning the BamHI/PstI fragment of pBIR
(Martinez-Salas et al. (1993), J. Virol. 67, 3748-3755) carrying the IRES-luciferase gene cassette, made blunt with T4 DNA polymerase (Gibco BRL), into the SmaI
site of pUC-LTR-kan.
3o Cell culture HeLa and 293 cells were obtained from American Type Culture Collection.
HeLa and 293 cells were grown in Dulbecco's modified Eagle's medium (DMEM) (GibcoBRL) supplemented with 10 % FCS, 0.12 % (v/w) sodium bicarbonate (GibcoBRL), 2 mM glutamine (GibcoBRL) and 20 ~.g/ml gentamycin (GibcoBRL) at 37oC in 5 % C02 humidified atmosphere. 293T cells (obtained from Dr. O. Danos, Evry, France) express SV40 large T antigen and were grown in DMEM (GibcoBRL) with glutamax supplemented with 10% fetal calf serum, 45 U/ml penicillin G
(Serva, 5 Heidelberg, Germany) and 45 p.g/ml streptomycin sulphate (Sigma-Aldrich, Bornem, Belgium).
293 and 293T cells were transfected using polyethylenimine (PEI) (Abdallah et al. (1997), Hum. Gene Therapy 7:1947-1954). Polyethylenimine Mw ~ 25.000 was from Sigma-Aldrich (Bornem, Belgium). Cells were grown to 50-70 % confluency in 1o DMEM with glucose, glutamax and 10 % fetal calf serum (FCS) (Gibco BRL).
Medium was replaced by medium containing 1 % FCS 3 hours before transfection.
Mixture of DNA and PEI was added to cells in a minimal volume of medium. Next day the medium was changed to DMEM containing 25 mM HEPES. Transformation efficiency obtained in this way was 50-80 %. HeLa cells were routinely transfected by 15 electroporation. The cells were first trypsinized at 80 % confluency and pelleted by low speed centrifugation. The cells were then resuspended at a density of 2 x 106 cells/ml in growth medium; 0.5 ml of this solution was aliquoted into 4 mm cuvettes (Eurogentec, Seraing, Belgium) and 20 qg DNA was added to the cell suspension. After the electric pulse ( 10 qF, 250 V), cells were allowed to rest for 10 min at room temperature before 20 dilution into growth medium..
To establish stable cell lines expressing the INs gene, cells transfected with pRSV-INs or pCMV-1Ns were cultured in the presence of 500 ~g/ml geneticin (G418) or 200 ug/ml hygromicin B (both from GIBCO BRL), respectively. Expression of IN
was assessed by western blotting and/or indirect immunofluorescence.
Western blotting and immunofluorescence For western blotting and indirect immunofluorescence rabbit polyclonal antibodies directed against recombinant His-tagged HIV-1 IN were produced in house and were purified using a 1 HiTrap rProteinA column (Pharmacia Biotech, Uppsala, 3o Sweden) according to established procedures (Ausubel et al. (1987), Current protocols in molecular biology, John Wiley & Sons, New York). Western blotting was performed using PVDF membranes (Bio-Rad), the ECL+ chemiluminescent detection system (Amersham-Pharmacia) and HRP-conjugated goat anti-rabbit antibodies (Bio-rad).
Dilutions used were 1:30000 for the primary antibody and 1:20000 for the secondary antibody. Detection limit was 0.1-0.5 ng of recombinant integrase. Total protein concentration was determined on cells lysed with 1%SDS/1 mM PMSF (Sigma), using the BCA protein assay (Pierce, Illinois USA). For western blot analysis 10 ~g of total protein was evaluated.
For detection of IN expression in situ by indirect immunofluorescence microscopy, cells were grown on glass slides (HeLa cells) or in permanox chamber slides (GIBCO BRL) (293T cells). After 24-48 hrs, cells were washed with phosphate buffered saline (PBS) supplemented with 1 mM Mg2+ and 0.5 mM Ca2+(PBS+), fixed in 100% methanol and blocked with 10% foetal calf serum (FCS) in PBS+.
Incubations with antibodies were carried out at 37~C in blocking solution. The primary antibody (rabbit anti-IN) was diluted 1:20 to 1:80; the secondary FITC-conjugated swine anti-rabbit antibody from Dako (Glostrup, Denmark) was diluted 1:40. Nuclear staining was performed with 1 pg/ml 4', 6-diamidino-2-phenylindole (DAPI) (Sigma) in methanol.
Fluorescence microscopy was performed with a Leitz microscope (Wetzlar, Germany) using filter blocks I2 (FITC) or A (DAPI).
Detection of integrase activity using a promoterless reporter gene (DIPR) 293T and 293T-INs cells were seeded in six-well plates at a density of 106 cells/well 24 hr before transfection. Five pg of DNA was transfected per well using PEI. 48 hr post-transfection, 5 x 105 cells were lysed to determine the luciferase activity using the Luciferase Assay SystemTM (Promega Benelux, Leiden, The Netherlands) and the LumicountTM (Packard, Meriden, CT). The protein concentration of the lysate was determined using the Bradford method (Bio-Rad protein assay, Bio-Rad, Hercules, CA). The relative luciferase activity was calculated by dividing the luminescence values by the protein concentration.
Lentiviral vectors Lentiviral vector production 3o HIV-1-derived vector particles, pseudotyped with the envelope of vesicular stomatitis virus (VSV), were produced by transfecting 293T cells with a packaging plasmid encoding viral gag and pol proteins (pCMVOR8.2), a plasmid encoding the envelope of vesicular stomatitis virus (pMDG) and a plasmid encoding a reporter gene flanked by two long terminal repeats (LTRs) (pHR'-CMVLacZ). The first generation packaging plasmid, containing all HIV genes except. for env, and the transfer vector were a kind gift from Dr. O. Danos (Genethon, France). For transfection of a 10 cm dish of 293T cells, a 700 p.l mixture of three plasmids was made in 150 mM
NaCI: 20 qg of vector plasmid, 10 ~g of packaging construct and 5 pg of envelop plasmid. To this DNA solution 700 ql of a PEI solution (110 p.l of a 10 mM stock solution in 150 mM NaCI) was added slowly. After 15 min at room temperature, the DNA-PEI
complex was added dropwise to the 293T cells in DMEM medium with 1% FCS. After overnight incubation, medium was replaced with medium containing 10% FCS.
to Supernatants were collected from day two to five post-transfection. The vector particles were sedimented by ultracentrifugation in a swinging-bucket rotor (SW27 Beckman, Palo alto, CA) at 25, 000 rpm for 2 hr at 4°C. Pellets were redissolved in PBS resulting in a 100-fold concentration. Different viral stocks were normalized based on p24 antigen content (HIV-1 p24 Core Profile ELISA, DuPont, Dreieich, Germany) for use in complementation assays.
Complementation experiments Integrase-defective virus particles were produced using pCMVOR8.2IN(D64V), obtained from Dr. D. Trono, (Geneva, Switzerland) as packaging plasmid (Naldini et al. (1996), Science 272: 263-267). Complemented vectors were produced by expressing integrase from pCEP-INs in 293T cells after quadruple transient transfection.
Vector preparations were normalized for p24 antigen count Vector was added to target cells in the presence of 2 ~,g/ml polybrene and left overnight. After removal of vector, cells were incubated for an additional 36 hrs. Cells were washed with PBS, fixed with 0.75%
formaldehyde/0.05% glutaraldehyde in PBS, and stained with freshly prepared X-gal substrate (5 mM potassium ferrocyanide, 5 mM potassium ferncyanide, 2 mM MgClz and 100 pg/ml 5-bromo-4-chloro-3-indolyl-(3-D-galactopyranoside (x-gal) (Biotech Trade & Service Gmbh, St. Leon-Rot, Germany) in PBS) at 37°C
overnight. Each transduction experiment was done in duplicate in a 96-well plate. Transduction 3o efficiency was determined by counting the number of blue cells 48 hrs after infection in one of the wells, whereas the cells in the duplicate well were splitted 1:2.
Half of the sample remained in the well and was stained at confluency (passage 1) whereas the other half was cultured in a 48-well plate. At confluency, these cells were again splitted 1:2. Finally, cells were brought in a 24-well plate and grown to confluency (passage 3, dilution 1:8). After staining, the efficiency of stable transduction was measured by counting blue colonies.
Tables Table 1. Complementation of inte~rase-defective lentiviral vector particles Relative transduction efficiency' Cells Passage WT vector D64V C IN
vector # 0 1.00 0.048 0.303 # 3 1.00 0.007 0.320 293T INs # 0 1.565 0.09 0.510 # 3 1.88 0.045 0.75 lTransduction efficiency is determined by counting galactosidase-positive cells (# 0) or colonies of galactosidase-positive cells (# 3) relative to transduction efficiency obtained by WT vector in 293T cells. Results of transduction by WT vector, D64V IN-defective vector and D64V vectors complemented with IN in the producer cells, are shown.
Cells were infected with normalized amounts of vector. Transduction was done both in cells and in 293T cells that are stably expressing IN. Average numbers for two separate experiments are shown.
Table 2. Detection of inte~rase activity usin , a promoterless reporter gene (DIPR) Luciferase activity (Relative units) Experiment Cell line Blanks LTR-IRES- LTR-IRES-Luc Lucb + pCMV-INs 293T-INs 1 487 119 -293T-INs 1 499 38 990 183 aRelative background luciferase activity in cell lines b 293T and 293T-INs were transfected with equal amounts of linearized pLTR-IRES
Luc. under experimental conditions A. In experiment B total DNA concentration was equalized with parental vector pCEP4.
°293T and 293T-INs were transfected with linearized pLTR-IRES-Luc and 2 ~.g of pCMV-INs.
SEQUENCE LISTING
<110> K .U. Leuven Research & Development Debyser, Zeger De Clercq, Erik Cherepanov, Peter Pluymers, Wim <120> A synthetic gene for expression of a retroviral protein with wild type activity in eukaryotic cells <130> K1291-PCT
<140>
<141>
<150> EP99201306.0 <151> 1999-04-26 <150> EP00200171.7 <151> 2000-O1-18 <160> 2 <170> PatentIn Ver. 2.1 <210> 1 <211> 930 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic gene expressing HIV integrase <220>
<221> CDS
<222> (27)..(899) <220>
<221> misc_signal <222> (24)..(30) <223> Kozak sequence <400> 1 atcactagca acctcaaaca gacacc atg gga ttc ctg gac ggc att gac aag 53 Met Gly Phe Leu Asp Gly Ile Asp Lys get cag gag gag cac gag aag tac cac tcg aat tgg cgg gcc atg gcc 101 Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala Met Ala tcc gac ttc aac ctg cca ccc gtc gtc get aag gag atc gtt get agc 149 Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val Ala Ser tgc gac aag tgc cag ctg aaa ggc gag get atg cac ggg cag gtt gat 197 Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln Val Asp tgc tct ccc ggc atc tgg cag ctc gac tgt act cac ctg gag ggc aag 245 Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu Gly Lys gtc atc ctg gtc gcc gtg cac gtg gcc tct ggt tac atc gag get gag 293 Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu Ala Glu gtc atc cct gca gag act ggc cag gag act gcc tat ttc ctg ctg aaa 341 Val Ile Pro Ala Glu Thr Gly Gln Glu Thr Ala Tyr Phe Leu Leu Lys g0 95 100 105 ctg gcc ggc cgg tgg cct gtg aag aca gtg cac aca gat aac ggc tcc 389 Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn Gly Ser aac ttc acc tcc acc act gtg aag get gcc tgc tgg tgg get ggg atc 437 Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala Gly Ile aag cag gag ttc ggg atc ccc tat aac cca cag tct cag ggc gtg atc 485 Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly Val Ile gaa tcc atg aac aag gag ctg aag aag atc atc ggc cag gtt cgg gac 533 Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val Arg Asp cag gca gag cac ctg aag act gca gtg cag atg gcc gtg ttc atc cac 581 Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe Ile His aac ttc aag cga aag ggc ggc atc ggt ggc tac tca gcc ggc gag cgg 629 Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly Glu Arg atc gtg gac atc atc gcc act gac atc cag acc aaa gag ctg cag aag 677 Ile Val Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu Gln Lys cag atc acc aag atc cag aac ttc cgt gtg tac tac cgg gac tcc cgg 725 Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp Ser Arg gac cct gtg tgg aag ggc cct gcc aag ctg ctg tgg aag ggc gag ggc 773 Asp Pro Val Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly Glu Gly gcc gtg gtc att cag gac aac tct gac atc aag gtt gtg ccc agg cgc 821 Ala Val Val Ile Gln Asp Asn Ser Asp Ile Lys Val Val Pro Arg Arg aag gcc aag att atc cgg gac tac ggc aag cag atg get ggc gac gac 869 Lys A1a Lys Ile Ile Arg Rsp Tyr Gly Lys G1n Met Ala Gly Asp Asp tgt gtg gcc tct cgt caa gat gag gac taa gtccaactac taaactgggg 919 Cys Val Ala Ser Arg Gln Asp Glu Asp gatattatga t 930 <210> 2 <211> 290 <212> PRT
<213> Artificial Sequence <223> Description of Artificial Sequence: synthetic gene expressing HIV integrase <400> 2 Met Gly Phe Leu Asp Gly Ile Asp Lys Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val Ala Ser Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln Val Asp Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu Gly Lys Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu Ala Glu Val Ile Pro Ala Glu Thr Gly g5 90 95 Gln Glu Thr Ala Tyr Phe Leu Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn Gly Ser Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala Gly Ile Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly Val Ile Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val Arg Asp Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe Ile His Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly Glu Arg Ile Val Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu Gln Lys Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp Ser Arg Asp Pro Val Trp Lys Gly Pro 225 230. 235 240 Ala Lys Leu Leu Trp Lys Gly Glu Gly Ala Val Val Ile Gln Asp Asn Ser Asp Ile Lys Val Val Pro Arg Arg Lys Ala Lys Ile Ile Arg Asp Tyr Gly Lys Gln Met Ala Gly Asp Asp Cys Val Ala Ser Arg Gln Asp Glu Asp
Claims (61)
1. A detection method for intracellular integrase activity using a promoteriess reporter.
gene.
gene.
2. The method according to claim 1, wherein the integrase activity is present in cell culture.
3. The method according to claim 1 or 2, wherein the integrase activity is present after transfection of an integrase gene.
4. The method according to any of claims 1 to 3, where the integrase activity is performed by an integrase protein.
5. The method according to claim 4, wherein the integrase protein is a wild type integrase enzyme.
6. The method according to any previous claim, wherein the integrase protein is mutated.
7. The method according to any of claims 1 to 6, wherein the integrase gene is mutated in order to obtain an optimised codon usage.
8. The method according to any previous claim, wherein the integrase gene is a synthetic gene having a portion of the wildtype codons relaced by other codons.
9. The method according to any previous claim, wherein the integrase protein is retroviral.
10. The method according to any of claims 1 to 9, where the integrase protein is lentiviral.
11. The method according to claim 10, wherein the integrase protein is an HIV
integrase.
integrase.
12. The method according to any previous claim, wherein the reporter gene is one of a luciferase, GFP and an antibiotic selection marker.
13. The method according to any of the claims 1 to 11, wherein the reporter gene is a cytotoxic drug resistance gene.
14. The method according to any previous claim, wherein a reporter gene construct is generated from the reporter gene and the construct is used as the substrate of an enzymatically active retroviral protein expressed from a synthetic retroviral pol or gag gene, the synthetic gene having modified codon usage compared with a wildtype gene, the synthetic gene being for expression of a retroviral pol or gag gene or a region of a retroviral pol or gag gene in a eukaryotic cell, the expressed retroviral protein being at a level to provide detectable activity of a wild type function of the expressed retroviral protein in the eukaryotic cell.
15. The method according to claim 14, wherein the synthetic gene is for the expression of a lentiviral pol or gag gene or a region of a lentiviral pol or gag gene in a eukaryotic cell, the expressed lentiviral protein being at a level to provide detectable activity of a wild type function of the expressed lentiviral protein in the eukaryotic cell.
16. The method according to claim 14 or 15, wherein the synthetic gene is for the expression of a retroviral or lentiviral gag or pole gene or a region of a retroviral or lentiviral gag or pol gene where the gene or region thereof, after codon optimization for a eukaryotic host in which it is expressed, contains a GC nucleotide pair content between 53 and 63 %, more preferably between 55 and 61 %, and the expressed gene is expressed at a level to provide detectable enzymatic activity of the expressed retroviral or lentiviral protein in the eukaryotic cell.
17. The method according to any of claims 14 to 18, wherein the expression of the gag or pol gene or the region thereof is independent of retroviral regulatory proteins.
18. The method according to any of claims 14 to 17 wherein the retroviral protein is a lentiviral gag or pol protein or a fragment thereof.
19. The method according to claim 18 wherein the retroviral protein is an HIV
gag or pol protein or a fragment thereof.
gag or pol protein or a fragment thereof.
20. The method according to any of claims 14 to 19, wherein the detectable activity of the enzymatic function includes at least promotion or stimulation of the integration of DNA fragments into the host cell DNA, preferably the chromosome if the host cell.
21.The method according to any of the claims 14 to 20, wherein the eukaryotc cell for expression of genes is a mammalian cell.
22. The method according to any of the claims 14 to 21 wherein the expressed protein has an expression level of at least 200% compared to the expressed wild type gene in a eukaryotic cell.
23. The method according to any of the claims 14 to 21 containing a synthetic gene comprising the sequence of Fig 2A or homologs thereof which have a GC content between 53 and 63% preferably between 55 and 61 percent.
24. The method according to any of claims 1 to 23, wherein a reporter gene construct is generated from the reporter gene and the construct contains an internal IRES.
25. The method according to any previous claim, wherein the reporter gene codes for an enzyme.
26. Use of a method according any of the claims 1 to 25 for screening for integrase inhibitors.
27. An integrase inhibitor obtained by the method of claim 26.
28. Packaging construct for a lentiviral or complex retroviral vector based on a synthetic retroviral pol or gag gene, the synthetic gene having modified codon usage compared with a wildtype gene, the synthetic gene being for expression of a retroviral pol or gag gene or a region of a retroviral pol or gag gene in a eukaryotic cell, the expressed retroviral protein being at a level to provide detectable activity of a wild type function of the expressed retroviral protein in the eukaryotic cell.
29. Packaging construct according to claim 28 wherein the synthetic gene is for the expression of a lentiviral pol or gag gene or a region of a lentiviral pol or gag gene in a eukaryotic cell, the expressed lentiviral protein being at a level to provide detectable activity of a wild type function of the expressed lentiviral protein in the eukaryotic cell.
30. Packaging construct according to claim 28 or 29, for the expression of a retroviral.
gag or pole gene or a region of a retroviral gag or pol gene where the gene or region thereof, after codon optimization for a eukaryotic host in which it is expressed, contains a GC nucleotide pair content between 53 and 63 %, more preferably between 55 and 61 %, and the expressed gene is expressed at a level to provide detectable enzymatic activity of the expressed retroviral protein in the eukaryotic cell.
gag or pole gene or a region of a retroviral gag or pol gene where the gene or region thereof, after codon optimization for a eukaryotic host in which it is expressed, contains a GC nucleotide pair content between 53 and 63 %, more preferably between 55 and 61 %, and the expressed gene is expressed at a level to provide detectable enzymatic activity of the expressed retroviral protein in the eukaryotic cell.
31. Packaging construct according to any of claims 28 to 30, wherein the expression of the gag or pol gene or the region thereof is independent of retroviral regulatory proteins.
32. Packaging construct according to any of claims 28 to 31, wherein the retroviral protein is a lentiviral gag or pol protein or a fragment thereof.
33. Packaging construct according to claim 32 wherein the retroviral protein is an HIV
gag or pol protein or a fragment thereof.
gag or pol protein or a fragment thereof.
34. Packaging construct according to any of claims 28 to 33, wherein the detectable activity of the enzymatic function includes at least promotion or stimulation of the integration of DNA fragments into the host cell DNA, preferably the chromosome if the host cell.
35. Packaging construct according to any of the claims 28 to 34, wherein the retroviral protein is a protease, reverse transcriptase, integrase or a polyprotein gag-pol precursor thereof.
36. Packaging construct according to any of the claims 28 to 35, wherein the eukaryotic cell for expression of genes is a mammalian cell.
37. Packaging construct according to any of the claims 28 to 36, wherein the expressed protein has an expression level of at least 200 % compared to the expressed wild type gene in a eukaryotic cell.
38. Packaging construct according to any of the claims 28 to 37 containing a synthetic gene comprising the sequence of Fig 2A or homologs thereof which have a GC
content between 53 and 63 % preferably between 55 and 61 percent.
content between 53 and 63 % preferably between 55 and 61 percent.
39. A method of transfecting a eukaryotic cell using the expression vector in accordance with any of claims 59 to 61.
40. A eukaryotic cell line harboring the synthetic gene or region of a gene in accordance with any of the claims 50 to 58.
41.The eukaryotic cell line according to claim 40, wherein the retroviral enzymatically active protein is expressed using a constitutive, inducable or tissue specific promoter.
42. The eukaryotic cell line according to claim 40 or 41, wherein the expression is stable.
43. A transgenic animal harboring the synthetic gene or part of a gene in accordance with any of the claims 50 to 58.
44. The transgenic animal according to claim 43, wherein the expression of the synthetic gene or part of a gene is induced by an inducable promoter or by a tissue-specific promoter.
45. The transgenic animal according to claim 43 or 44, comprising a mammal.
46. A method for preparing a synthetic gene or part of a gene encoding a retroviral protein or part of such a protein which is enzymatically active in a target eukaryotic cell, comprising the steps of:
1) identifying a group of genes from the total set of genes of the target eukaryotic cell which encode proteins which are naturally expressed easily and/or in high concentrations in the target cell;
2) determining the codon sequences of these identified genes and from these sequences a preferred codon usage and a preferred nucleotide pair frequency;
3) using the preferred codon usage, identify the non-preferred codons in the natural gene encoding the enzymatically active protein;
4) replacing one or more of the non-preferred codons with one or more preferred codons encoding the same amino acids as the replaced codons while biasing the replacement to obtain the preferred nucleotide pair frequency, the preferred nucleotide pair frequency being a GC nucleotide pair content of between 53 and 63%, more preferably between 55 and 61%.
1) identifying a group of genes from the total set of genes of the target eukaryotic cell which encode proteins which are naturally expressed easily and/or in high concentrations in the target cell;
2) determining the codon sequences of these identified genes and from these sequences a preferred codon usage and a preferred nucleotide pair frequency;
3) using the preferred codon usage, identify the non-preferred codons in the natural gene encoding the enzymatically active protein;
4) replacing one or more of the non-preferred codons with one or more preferred codons encoding the same amino acids as the replaced codons while biasing the replacement to obtain the preferred nucleotide pair frequency, the preferred nucleotide pair frequency being a GC nucleotide pair content of between 53 and 63%, more preferably between 55 and 61%.
47.The method according to claim 46, wherein the replacement step is carried out based on a random choice between alternative codons encoding the same amino acid at each position using a random number generator and biasing the choice of alternative codons based on the preferred codon usage to obtain the preferred nucleotide frequency.
48. A method for gene transfer in a eukaryotic cell expressing the synthetic gene or region of the gene in accordance with any of the claims 50 to 58.
49. A method according to claim 48, wherein the synthetic gene is transiently expressed or is stably integrated in said cell.
50. A synthetic retroviral gag or pol gene or a region of a retroviral gag or pol gene for the expression of a retroviral gag or pol protein in a eukaryotic cell, the retroviral gene having non-preferred codons when referred to the eukaryotic cell, the number of non-preferred codons being such that replacement of all the non-preferred codons by preferred codons for the eukaryotic cell results in a GC nucleotide pair content of 65% or higher, the synthetic gene having a GC nucleotide pair content of between 53 and 63%, more preferably between 55 and 61% and the expressed retroviral protein is expressed at a level to provide detectable enzymatic activity of the expressed retorviral protein in the eukaryotic cell.
51. The synthetic gene according to claim 50, wherein the expression of the gag or pol gene proteins is independent of retroviral regulatory proteins.
52. The synthetic gene according to claim 50 or 51, wherein the retroviral protein is a lentiviral gag or pol protein.
53. The synthetic gene according to claim 52, wherein the lentiviral protein is an HIV gag or pol protein.
54. The synthetic gene according to any of claims 50 to 53, wherein the detectable activity of the enzymatic function includes at least promotion or stimulation of integration of DNA fragments into the host cell DNA, preferably the chromosome of the host cell.
55. The synthetic gene according to claim 54, wherein the retroviral protein is a protease, a reverse transcriptase, an integrase protein or a polyportein gag-pol precursor thereof.
56. The synthetic gene according to any of the claims 50 to 55, wherein the eukaryotic cell is a mammalian cell.
57. The synthetic gene according to any of the claims 50 to 56, wherein the expression of the protein is at a level at least 200% of that expressed by the wild type gene in the eukaryotic cell.
58. The synthetic gene according to any of the claims 50 to 57 comprising the sequence of Fig. 2A or homologs thereof which have a GC content between 53 and 63%, preferably between 55 and 61%.
59. A eukaryotic expression vector comprising the synthetic gene or region of a gene in accordance with any of the claims 50 to 58.
60. The expression vector according to claim 59, further comprising a constitutive or an inducible or a tissue-specific promoter.
61. The expression vector according to claim 59 or 60, comprising a plasmid, a mammalian or an insect virus.
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP99201306.0 | 1999-04-26 | ||
| EP99201306 | 1999-04-26 | ||
| EP00200171.7 | 2000-01-18 | ||
| EP00200171 | 2000-01-18 | ||
| PCT/EP2000/003765 WO2000065076A2 (en) | 1999-04-26 | 2000-04-26 | Synthetic gene for expressing active retroviral protein in eukaryotes |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CA2369058A1 true CA2369058A1 (en) | 2000-11-02 |
Family
ID=26071747
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CA002369058A Abandoned CA2369058A1 (en) | 1999-04-26 | 2000-04-26 | Synthetic gene for expressing active retroviral protein in eukaryotes |
Country Status (11)
| Country | Link |
|---|---|
| EP (1) | EP1183381A2 (en) |
| JP (1) | JP2002542791A (en) |
| AU (1) | AU778106B2 (en) |
| BR (1) | BR0010077A (en) |
| CA (1) | CA2369058A1 (en) |
| HU (1) | HUP0201019A2 (en) |
| IL (1) | IL146090A0 (en) |
| NO (1) | NO20015168L (en) |
| PL (1) | PL351988A1 (en) |
| RU (1) | RU2001131728A (en) |
| WO (1) | WO2000065076A2 (en) |
Families Citing this family (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB9803351D0 (en) | 1998-02-17 | 1998-04-15 | Oxford Biomedica Ltd | Anti-viral vectors |
| WO2000039304A2 (en) | 1998-12-31 | 2000-07-06 | Chiron Corporation | Polynucleotides encoding antigenic hiv type c polypeptides, polypeptides and uses thereof |
| EP1980617A1 (en) | 1998-12-31 | 2008-10-15 | Novartis Vaccines and Diagnostics, Inc. | Improved expression of HIV polypeptides and production of virus-like particles |
| US6656706B2 (en) | 1999-12-23 | 2003-12-02 | The United States Of America As Represented By The Department Of Health And Human Services | Molecular clones with mutated HIV gag/pol, SIV gag and SIV env genes |
| ES2329648T3 (en) * | 1999-12-23 | 2009-11-30 | The Government Of The Usa, As Represented By The Secretary, Department Of Health And Human Services | MOLECULAR CLONES WITH MUTED GENES HIV GAG / POL, VIS GAG AND VIS VIS. |
| AUPQ580300A0 (en) * | 2000-02-23 | 2000-03-16 | National Cancer Centre Of Singapore Pte Ltd | Genetic analysis |
| GB0009760D0 (en) * | 2000-04-19 | 2000-06-07 | Oxford Biomedica Ltd | Method |
| AU2002320314A1 (en) | 2001-07-05 | 2003-01-21 | Chiron, Corporation | Polynucleotides encoding antigenic hiv type c polypeptides, polypeptides and uses thereof |
| US20030170614A1 (en) | 2001-08-31 | 2003-09-11 | Megede Jan Zur | Polynucleotides encoding antigenic HIV type B polypeptides, polypeptides and uses thereof |
| US6835568B2 (en) | 2001-10-30 | 2004-12-28 | Virxsys Corporation | Regulated nucleic acid expression system |
| CN1784496B (en) | 2003-05-06 | 2010-08-25 | 独立行政法人产业技术综合研究所 | Multi-gene transcription activity assay system |
| EP1934246B8 (en) | 2005-10-07 | 2012-02-08 | Istituto di Ricerche di Biologia Molecolare P. Angeletti S.R.L. | Matrix metalloproteinase 11 vaccine |
| JP6824594B2 (en) | 2014-09-11 | 2021-02-03 | Jnc株式会社 | How to design synthetic genes |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5468629A (en) * | 1993-04-13 | 1995-11-21 | Calhoun; Cornelia | Method of promoting in vitro homologous recombination transfection in mammalian cells using the RecA protein |
| US5434065A (en) * | 1993-05-06 | 1995-07-18 | President And Fellows Of Harvard College | In vivo selection of microbial virulence genes |
| US5811270A (en) * | 1994-05-20 | 1998-09-22 | Grandgenett; Duane P. | In vitro method for concerted integration of donor DNA molecules using retroviral integrase proteins |
| US6200811B1 (en) * | 1996-04-02 | 2001-03-13 | The Regents Of The University Of California | Cell transformation vector comprising an HIV-2 packaging site nucleic acid and an HIV-1 GAG protein |
| US6114148C1 (en) * | 1996-09-20 | 2012-05-01 | Gen Hospital Corp | High level expression of proteins |
| JP2001512308A (en) * | 1997-02-07 | 2001-08-21 | メルク エンド カンパニー インコーポレーテッド | Synthetic HIV GAG gene |
| EP1980617A1 (en) * | 1998-12-31 | 2008-10-15 | Novartis Vaccines and Diagnostics, Inc. | Improved expression of HIV polypeptides and production of virus-like particles |
| WO2000039304A2 (en) * | 1998-12-31 | 2000-07-06 | Chiron Corporation | Polynucleotides encoding antigenic hiv type c polypeptides, polypeptides and uses thereof |
-
2000
- 2000-04-26 IL IL14609000A patent/IL146090A0/en unknown
- 2000-04-26 HU HU0201019A patent/HUP0201019A2/en unknown
- 2000-04-26 CA CA002369058A patent/CA2369058A1/en not_active Abandoned
- 2000-04-26 RU RU2001131728/15A patent/RU2001131728A/en unknown
- 2000-04-26 WO PCT/EP2000/003765 patent/WO2000065076A2/en not_active Ceased
- 2000-04-26 EP EP00927069A patent/EP1183381A2/en not_active Withdrawn
- 2000-04-26 AU AU45579/00A patent/AU778106B2/en not_active Ceased
- 2000-04-26 BR BR0010077-3A patent/BR0010077A/en not_active Application Discontinuation
- 2000-04-26 PL PL00351988A patent/PL351988A1/en unknown
- 2000-04-26 JP JP2000614410A patent/JP2002542791A/en not_active Withdrawn
-
2001
- 2001-10-23 NO NO20015168A patent/NO20015168L/en not_active Application Discontinuation
Also Published As
| Publication number | Publication date |
|---|---|
| JP2002542791A (en) | 2002-12-17 |
| WO2000065076A3 (en) | 2001-12-13 |
| RU2001131728A (en) | 2004-02-20 |
| EP1183381A2 (en) | 2002-03-06 |
| NO20015168L (en) | 2001-12-14 |
| AU778106B2 (en) | 2004-11-18 |
| PL351988A1 (en) | 2003-07-14 |
| IL146090A0 (en) | 2002-07-25 |
| HUP0201019A2 (en) | 2002-08-28 |
| BR0010077A (en) | 2002-01-15 |
| NO20015168D0 (en) | 2001-10-23 |
| AU4557900A (en) | 2000-11-10 |
| WO2000065076A2 (en) | 2000-11-02 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US5994136A (en) | Method and means for producing high titer, safe, recombinant lentivirus vectors | |
| CA2328404C (en) | Novel lentiviral packaging cells | |
| Cherepanov et al. | High‐level expression of active HIV‐1 integrase from a synthetic gene in human cells | |
| AU2018336042A1 (en) | Retroviral vectors | |
| AU778106B2 (en) | Synthetic gene for expressing active retroviral protein in eukaryotes | |
| EP1244797B1 (en) | Translentiviral vector system | |
| EP4652288A2 (en) | Viral particle producer cells with landing pad-integrated viral vectors | |
| CA2408786C (en) | Retroviral vectors comprising an enhanced 3' transcription termination structure | |
| AU773015B2 (en) | Lentiviral vectors | |
| WO2001004360A9 (en) | Retroviral recombination assays and uses thereof | |
| US20230151388A1 (en) | Modified vectors for production of retrovirus | |
| WO2024194651A1 (en) | Expression construct | |
| US20120034693A1 (en) | Recombinant vector and use in gene therapy |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| FZDE | Discontinued |