[go: up one dir, main page]

CA2369058A1 - Synthetic gene for expressing active retroviral protein in eukaryotes - Google Patents

Synthetic gene for expressing active retroviral protein in eukaryotes Download PDF

Info

Publication number
CA2369058A1
CA2369058A1 CA002369058A CA2369058A CA2369058A1 CA 2369058 A1 CA2369058 A1 CA 2369058A1 CA 002369058 A CA002369058 A CA 002369058A CA 2369058 A CA2369058 A CA 2369058A CA 2369058 A1 CA2369058 A1 CA 2369058A1
Authority
CA
Canada
Prior art keywords
gene
protein
retroviral
integrase
gag
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA002369058A
Other languages
French (fr)
Inventor
Zeger Debyser
Erik De Clercq
Peter Cherepanov
Wim Pluymers
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Katholieke Universiteit Leuven
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Publication of CA2369058A1 publication Critical patent/CA2369058A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N7/00Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/005Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/86Viral vectors
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2740/00Reverse transcribing RNA viruses
    • C12N2740/00011Details
    • C12N2740/10011Retroviridae
    • C12N2740/16011Human Immunodeficiency Virus, HIV
    • C12N2740/16041Use of virus, viral particle or viral elements as a vector
    • C12N2740/16043Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2740/00Reverse transcribing RNA viruses
    • C12N2740/00011Details
    • C12N2740/10011Retroviridae
    • C12N2740/16011Human Immunodeficiency Virus, HIV
    • C12N2740/16051Methods of production or purification of viral material
    • C12N2740/16052Methods of production or purification of viral material relating to complementing cells and packaging systems for producing virus or viral particles
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2740/00Reverse transcribing RNA viruses
    • C12N2740/00011Details
    • C12N2740/10011Retroviridae
    • C12N2740/16011Human Immunodeficiency Virus, HIV
    • C12N2740/16111Human Immunodeficiency Virus, HIV concerning HIV env
    • C12N2740/16122New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/10Plasmid DNA
    • C12N2800/108Plasmid DNA episomal vectors
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2840/00Vectors comprising a special translation-regulating system
    • C12N2840/20Vectors comprising a special translation-regulating system translation of more than one cistron
    • C12N2840/203Vectors comprising a special translation-regulating system translation of more than one cistron having an IRES

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Virology (AREA)
  • Biophysics (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Medicinal Chemistry (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Immunology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)

Abstract

The present invention features a synthetic gene or region of a gene which ha s an amended codon usage compared with the wild-type gene and which is for the high level expression of a retroviral protein in eukaryotic cells, the expressed retroviral protein having enzymatic activity in the eukaryotic cel l. In addition, the invention features a synthetic gene or region of a gene encoding a retroviral enzyme or part of a retroviral enzyme normally express ed in a mammalian or other eukaryotic cell wherein at least one non-preferred codon in the wild-type gene encoding the enzyme has been replaced by a preferred codon encoding the same amino acid. The retroviral protein may be a protease, reverse transcriptase, integrase protein or a polyprotein gag-pol precursor thereof. In one embodiment the retroviral protein with enzymatic activity is a lentiviral protein. In other embodiments the enzymatically active protein is a pol enzyme. In more preferred embodiments, the enzymatically active protein is a lentiviral integrase. In an even more preferred embodiment the enzyme is an HIV enzyme. In more preferred embodiments the enzymatically active protein is HIV integrase. The present invention also includes a detection method for intracellular integrase using a promoterless reporter gene.

Description

I
SYNTHETIC GENE FOR EXPRESSING ACTIVE RETROVIRAL PROTEIN IN EUKARYOTES
The present invention relates to the design of a synthetic gene for expressing retroviral proteins in eukaryotic cells especially mammalian cells as well as a synthetic gene, an expression vector containing the gene, eukaryotic cells stably harboring the gene, as well as methods of detection.
TECHNICAL BACKGROUND
Retroviruses are diploid positive strand RNA viruses that replicate through an integrated DNA intermediate. Typically, retroviruses comprise a protein-containing lipid envelope surrounding a protein-encapsulated core carrying the viral genome.
Within the infected cell the retroviral genome is reverse-transcribed into double stranded DNA by a virally encoded reverse transcriptase enzyme that is part of the retroviral particle. The particle also includes other enzymes such as integrase. Integrase is the virus-encoded enzyme that is responsible for inserting the viral DNA
copy into the chromosome of the host cell, a process referred to as retroviral integration. (For a review see Brown (1997), in Retroviruses, Cold Spring Harbor Laboratory Press USA, pp. 161-203). Integration is an essential step in the replication cycle of the human immunodeficiency virus type 1 (HIV-1), the causative agent of AIDS (LaFemina et al.
(1992), J. Virol. 66: 7414-7419). Since no human counterpart is known to exist, integration has attracted a lot of attention as a potential new antiviral target. However, integrase inhibitor development has suffered from the lack of a relevant cellular integration assay; integrase activity is typically evaluated using artificial oligonucleotide-based test tube reactions. There is therefore a need to provide an intracellular integration assay.
Wild-type retroviral genomes contain at least three genes known as the gag, pol and env genes. The gag gene encodes internal core structural proteins, the pol gene encodes for certain enzymes such as protease, reverse transcriptase and integrase, and the env gene encodes the retroviral envelope glycoproteins. Integrases from different retroviruses vary in size from 30 to 46 kDa, are encoded by the 3'-end of the pol gene and are released from a gag pol polyprotein precursor by proteolytic processing. The aminoterminal domain of integrase is characterized by a zinc finger (HHCC), is CONFIRM~I'ION COPT
universally conserved among all retroviruses, and is essential for in vivo integration.
The central domain is the most conserved region with an essential DD35E motif involved in catalysis. This portion can catalyze the disintegration reaction in vitro. The carboxyterminal domain is referred to as DNA binding domain and shows the least sequence conservation. This fragment is required for 3'-end processing and integration.
The active enzyme is thought to exist as a multimer wherein active domains can transcomplement inactive domains.
Transient expression of avian sarcoma-leukosis virus (ASLV) integrase in COS
cells has been obtained previously (Morns-Vasios et al. (1988), J. Virol. 62:
349-353).
1o A mouse cell line stably expressing the integrase of Rous sarcoma virus (RSV) has also been reported (Mumm et al. (1992), Virology 189: 500-510). Expression levels were not specified but appeared rather low. The integrase (IN) of HIV-1 has been expressed in Escherichia coli (E. coli) (Sherman and Fyfe (1990), Proc. Natl. Acad. Sci.
USA 87, 5119-5123), insect cells using baculovirus (Bushman et al. (1991), Science 249: 1555-1558), and Saccharomyces cerevisiae (Caumont et al. ( 1996), Curr. Genet. 29:

510). In yeast integrase expression proved to be toxic in cells defective in DNA repair.
High level expression of HIV-1 integrase in mammalian cells has remained elusive, in large part because expression of HIV-1 gag and pol proteins in general is Rev-dependent (Cullen (1992), Microb. Rev. 56: 375-395). In mammalian cells Rev-dependent expression of HIV-IN or HIV-IN fused to (3-galactosidase or GFP has been reported previously (Faust et al. (1995), Biochem. Mol. Biol. Int. 36: 745-758; Kukolj et al. (1997), J. Virol. 71: 843-847; Pluymers et al., (1999), Virology 258:
327-332).
However, expression levels, even after transient transfection, were always low. In the absence of Rev, multiple inhibitory or instability sequences (INS), also referred to as cis-acting repressor elements (CRS), in the mRNA interfere with protein expression.
Potential mechanisms include: nuclear retention or mRNA instability. It was observed that mRNA containing CRS is trapped in the nuclei and that the inhibition of expression is at least partly due to the poor translocation of mRNA to the cytosol (Mikaelian et al. (1996), J. Mol. Biol. 257: 246-264; Borg et al. (1997), Virology 236:
95-103). Elements of the RNA processing machinery could be involved in nuclear trapping of mRNA that contains CRS. There is also evidence that several regions of the HIV-1 genome that contribute to the instability of the mRNA, have high AU
contents.
They may represent binding sites for cellular factors which contribute to mRNA
instability (Schneider et al. (1997), J. Virol. 71: 4892-4903). According to another hypothesis, mRNA containing inhibitory sequences fails to be translated efficiently without Rev. Whatever the mechanism of the observed inhibition, it is clear that inhibition occurs at the level of the mRNA and is due to some AU-rich regions.
During the HIV replication cycle Rev interaction with the Rev responsive element (RRE) relieves the inhibition in a regulated manner (Schwartz et al. (1992), J.
Virol. 66: 150-159). In this perspective, it is not surprising that by mutating some INS
while preserving the coding function for gag pol transcripts, efficient Rev-independent expression of viral particles has been obtained (Schneider et al. (1997), J.
Virol. 71:
l0 4892-4903). There is evidence that in the case of HIV gp120 mRNA poor translatability due to inefficient codon usage rather than mRNA instability is responsible for low level protein expression (Haas et al., 1996; Schneider et al. ( 1997), J. Virol. 71: 4892-4903).
US 5,811,270 (Grandgenett) describes a test tube method of analysis of concerted integration in which a viral integrase enzyme is first incubated with donor DNA molecules followed by incubation with target DNA molecules. The donor DNA
has at least one unique restriction site for analysis of the concerted integration product.
The described method is said to be useful for studying integrase such as screening of HIV-1 or HIV-2 integrase inhibitors as well as production of transgenic non-human animals and gene transfer. The integrase used is purified from virus particles and the activity is analyzed in the test tube, not intracellularly.
US x,795,737, WO 96/09378, WO 97/11086 and WO 98/12207 all describe methods of producing a synthetic gene encoding a protein normally expressed in a mammalian cell whereby the synthetic gene is reported to overexpress the encoded proteins in mammalian cells. The known synthetic genes are constructed by replacing non-preferred codons or less preferred codons with preferred codons which encode the same amino acid by utilising the redundancy of the genetic code. Examples are given of synthetic env genes which encode envelope glycoproteins but there is no discussion of expressing a protein with enzymatic activity in the host cell. A method of designing a 3o synthetic gene for the overexpression of a protein while maintaining its enzymatic activity is not derivable from the known teaching. There are a significant number of factors which may allow expression of a (retroviral) protein which fails to show intracellular enzymatic activity. The expressed enzyme may be defective for many reasons of which intracellular inhibition of the enzyme and the need for the presence of another viral protein at the same time are but a few. Further, it is not obvious that an enzyme can be overexpressed, for example there may be some limiting factor such as poor solubility or cellular toxicity. On the one hand high level expression of a retroviral enzyme will be required to detect the enzymatic activity, on the other hand levels which are too high may cause protein precipitation or cellular toxicity. For any retroviral enzyme to be active in the cell an optimal intracellular concentration will be required.
In case of failure the suggestion is to replace non-preferred codons to a certain percentage, e.g. 90%, 80%, 70% ..., but there is no precise teaching of how to select 1o which codons are to be replaced. In particular, there is no indication that a specific nucleotide pair frequency is of relevance to high level gene expression. - It is not conclusive that the mechanism of RRE-instability (in env) is the same as, or even related to the mRNA instability problem in gag and pol. In fact there is evidence that the mechanisms are different. Hence, it is not predictable that Rev-independent expression of an env gene may be extrapolated to cure the instability problem of gag and pol genes.
It is an object of the present invention to develop an efficient expression system for an enzymatically active retroviral protein, in particular HIV-1 integrase, in eukaryotic cells, especially mammalian cells.
2o It s a further object to provide a more efficient detection method for retroviral enzyme inhibitors.
It is a further object of the present invention to provide a design method for the construction of a gene encoding a retroviral protein with enzymatic activity.
A further object of the present invention is to provide an expression vector capable of delivering a gene to a target cell, in which cell the enzymatically active protein encoded by the gene is expressed.
SUMMARY OF THE INVENTION
The present invention features a synthetic gene or region of a gene which has an 3o amended codon usage compared with the wild-type gene and which is for the high level expression of a retroviral protein in eukaryotic cells, the expressed retroviral protein having enzymatic activity. in the eukaryotic cell. In addition, the invention features a synthetic gene or region of a gene encoding a retroviral enzyme or part of a retroviral enzyme normally expressed in a mammalian or other eukaryotic cell wherein at least one non-preferred codon in the wild-type gene encoding the enzyme has been replaced by a preferred codon encoding the same amino acid. By "region of a gene with amended codon usage" is meant that it can be sufficient to change codons only in those parts of a gene that normally produce instability sequences (INS) or cis-acting repressor elements (CRS) in the transcribed mRNA of the gene.
By "retroviral protein or enzyme normally expressed in a mammalian or eukaryotic cell" is meant a protein or enzyme which is expressed in a mammalian or eukaryotic cell under disease conditions. These are genes which are encoded by a 1o retrovirus (including a lentivirus) which are expressed in mammalian or eukaryotic cells post-infection.
In preferred embodiments, the synthetic gene is capable of expressing the retroviral enzyme at a level at least 200% of that expressed by the "natural"
(or "native") gene in a mammalian or eukaryotic cell culture system.
The retroviral protein may be a protease, reverse transcriptase, integrase protein or a polyprotein gag-pol precursor thereof. In one embodiment the retroviral protein with enzymatic activity is a lentiviral protein. In other embodiments the enzymatically active protein is a pol enzyme. In more preferred embodiments, the enzymatically active protein is a lentiviral integrase. In an even more preferred embodiment the 2o enzyme is an HIV enzyme. In more preferred embodiments the enzymatically active protein is HIV integrase. The enzymatic activity includes at least an integrase function, namely of promotion or stimulation of the integration of DNA fragments into host cell DNA, preferably the chromosome of the host cell. The integrase hereby is expressed on its own id est as a single component, independent of any retroviral components..
By "retroviral components" is meant the retroviral, specifically the lentiviral, and more specificially the HIV-1 regulatory and accessory proteins like Tat, Rev, Nef, Vpu, Vif, Vpr.
The invention also features a eukaryotic expression vector comprising the synthetic gene or region of a gene. The expression vector preferably includes a 3o constitutive or an inducible or a tissue-specific promoter. Expression from the eukaryotic expression vector can be transient after transfection of the vector in a eukaryotic cell by any of suitable, e.g. established, transfection procedures.
The vector may be any suitable vector such as a plasmid, a mammalian or insect virus.
Expression may also be permanent in a eukaryotic cell line stably harbouring the expression vector.
The expression vector may be comprised in a packaging construct for producing retroviral particles for gene transfer. The retroviral particle may be a lentiviral particle.
Another aspect of the present invention features a eukaryotic cell line that harbours the synthetic gene or region of a gene. The cell line preferably expresses the retroviral enzymatically active protein using a constitutive, inducible or tissue specific promoter. The expressed retroviral protein shows enzymatic activity that can be measured for example by complementation of enzyme-defective viruses or in the case of an integrase by stimulation or the promotion of the insertion of DNA
molecules into to another DNA molecule, preferably the chromosome of the cell.
The present invention also includes a transgenic non-human animal harboring the synthetic gene or region of a gene. The expression of the gene or region of a gene may be induced at any moment using an inducible promoter or, alternatively, in desired tissues using a tissue-specific promoter.
The present invention also features a method for preparing a synthetic gene or region of a gene encoding an enzymatically active retroviral protein or part of such a protein. The method not only identifies and uses preferred codon usage but also, and moreover mainly, seeks to increase mRNA stability during expression. The method includes identifying a small group of genes from the total set of genes of a target 2o eukaryotic cell which encode proteins which are naturally expressed easily and/or in high concentrations in the target cell. The small group may include 10 or less genes, more typically 5 or less genes. From the codon sequences of these identified genes, a prefeiTed codon usage and a preferred nucleotide relationship or nucleotide pair frequency is identified. By preferred codon usage is meant that for a specific amino acid a specific codon is chosen as the preferred codon to encode the amino acid based on the high use of the preferred codon within the select group of genes. By a preferred codon relationship is meant the ratios of the various nucleotides and combinations of nucleotides to each other which commonly appear in genes of the target eukaryotic cell.
One particular nucleotide relationship is the GC content or the GC nucleotide pair 3o frequency. Using the preferred codon usage, non-preferred codons are identified in the natural gene encoding the enzyme and one or more of the non-preferred codons is/are replaced with a preferred codon encoding the same amino acid as the replaced codon.
The replacement is biased to obtain the preferred nucleotide relationship or nucleotide pair frequency, resulting in even better optimized conditions for expression in eukaryotes compared to the use of preferred codon usage only. The replacement may be made based on a random choice between alternative codons encoding the same amino acid at each position using a random number generator and biasing the choice of alternative codons based on the preferred codon usage to obtain the preferred nucleotide relationship or nucleotide pair frequency. In addition, the synthetic gene sequence may be edited by removing potential splice sites and to reduce the number of CpG methylation sites while keeping the overall nucleotide relationship or the nucleotide pair frequency close to the preferred one, e.g. keeping the GC
content and codon usage close to the preferred one. GC content should be kept close to the preferred usage in the target cell, e.g. about 60% in mammalian cells. A
preferred range for the GC content is 53 to 63%, more preferably 55 to 61% for expression of the gene in human cells. To provide efficient initiation of translation the Kozak consensus sequence (ANNATGG) may be added.
It is not necessary to replace all non-preferred codons with preferred codons.
Increased expression may be accomplished even with partial non-preferred codon replacement with preferred codons. Under some circumstances it may be desirable to only partially replace non-preferred codons with preferred codons in order to obtain an intermediate level of expression.
By "synthetic gene" is meant a nucleotide sequence encoding a naturally occurring protein in which a portion of the naturally occurnng codons has been replaced by other codons. For example, a non-preferred codon is replaced with a preferred codon encoding the same amino acid. However, by replacing codons to create a synthetic gene the expression in eukaryotic, e.g. mammalian cells (especially human cells) of a wide variety of genes (of eukaryotic, mammalian, prokaryotic or viral origin) can be increased compared to the expression of the naturally occurring gene.
Thus, the invention includes improving the eukaryotic, especially a mammalian cell expression of a gene from any source by the codon replacement methods described herein.
By "vector" is meant a DNA molecule, derived, e.g., from a plasmid, or 3o mammalian or insect virus, into which fragments of DNA may be inserted or cloned. A
vector will contain one or more unique restriction sites and may be capable of autonomous replication in a defined host or vehicle organism such that the cloned sequence is reproducible. Thus, by "expression vector" is meant any autonomous element capable of directing the synthesis of a protein. Such DNA expression vectors include mammalian plasmids and viruses.
By retroviral "packaging construct" or "packaging vector" is meant a plasmid-based or virus-based vector or construct, configured to encode for the proteins necessary for producing virus particles that are devoid of genomic RNA. In general, this implies providing the gag, pol and env gene products. Lentiviral packaging constructs of interest contain changes to the coding sequences of gag or pol proteins (i.e. synthetic genes) to enhance lentiviral protein expression and to enhance safety. For biosafety reasons, the packaging functions are often divided into two genomes, one which expresses the gag and pol gene and another expressing the env gene product. In packaging constructs in accordance with the present invention, regulators of gene expression such as the Rev gene product would no longer be required. Increased biosafety of these packaging constructs is based on a reduced risk for (homologous) recombination of these synthetic genes with their wild-type counterparts.
The invention also features synthetic portion of a gene which encodes a desired portion of the protein. Such synthetic gene fragments are similar to the synthetic genes of the invention except that they encode only a portion of the protein. The portion of the gene encodes a portion of the enzyme which has some enzymatic activity, e.g. it may have catalytic activity, for example, the synthetic gene may encode a catalytic core of an enzyme, e.g. it may be a part of reverse transcriptase.
The present invention also includes a detection method for intracellular integrase using a promoterless reporter gene. The reporter gene may be luciferase, GFP
or an antibiotic selection marker (e.g. neomycin resistance). The reporter gene construct may be used as the substrate of the retroviral enzyme, e.g.
integrase expressed from the synthetic gene be it in a stable cell line or in a transient mode after transfection of the expression vector, the retroviral enzyme, e.g. integrase being in accordance with the present invention.
The present invention may provide a synthetic gene and a method of designing and constructing the same to obtain efficient expression of a retroviral, in particular lentiviral enzyme such as integrase of the human immunodeficiency virus type 1 (HIV-1), or part of a retroviral enzyme in mammalian cells. The synthetic gene circumvents mRNA instability by increasing the GC content of the wild type integrase gene from 40% to 59%. The synthetic gene, cloned in a eukaryotic expression vector, provides efficient expression of HIV-1 integrase in various mammalian cell lines. The amino terminus of the protein was as predicted by the sequence after removal of the first methionyl residue. Nuclear localization of the recombinant protein was evidenced by fluorescence microscopy. A 293T cell line stably expressing HIV-1 integrase was obtained. The functionality of integrase was proven by trans-complementation experiments. Lentiviral vector particles carrying the inactivating D64V
mutation in the integrase gene, were obtained capable of stably transducing 293T cells when complemented in the producer cell line with integrase expressed from the synthetic gene. When the cell line that stably expresses integrase was infected with the defective to virus particles, complementation of integrase function was observed.
Transfection with a linear promoterless DNA substrate that contains a reporter gene behind an IRES and is flanked by HIV LTR ends, resulted in a reproducibly higher reporter signal in cells that express integrase. Since the increase in reporter gene activity was stable upon passaging of the transfected cells, it can be concluded that the integrase promotes insertion of the linear DNA substrate in the cellular chromosome. The fold increase of reporter signal with integrase expressed from a mutant synthetic gene, containing the D64V mutation, was considerably lower, indicating that the enzymatic activity of the enzyme was required. The established cellular integration system in accordance with the present invention facilitates the study of the interplay between host and viral factors during integration, the development of specific HIV integration inhibitors as well as the design of gene transfer systems.
The present invention, its advantages and embodiments will now be described with reference to the following figures and drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1. Western blot analysis of transient expression of HIV-1 IN in 293T
cells using different expression strategies. 293T cells were transiently transfected with the various expression vectors. At 48 hrs post transfection cell extracts were made using 1 % SDS, 1 mM PMSF. Cell extracts representing 10 p.g of total protein were 3o separated by PAGE and blotted onto PVDF membranes. Detection was performed using polyclonal antibodies against HIV-1 integrase and the ECL+ detection system.
Lane 1 contains 2.5 ng of recombinant and purified His-tagged HIV-1 integrase (HT-IN). The other lanes contain extracts after transfections with equal amounts of the following plasmids: Lane 2, pCEP4; Lane 3, pCEP-IN; Lane 4, pCEP-IN-CTE; Lane 5, pCEP-IN-RRE + pEF-cREV; Lane 6 pCMV-INS.
Figure 2. Sequence and structure of the synthetic gene.
(A) Sequence of the synthetic DNA coding for pNL4-3 HIV-1 integrase. The amino 5 acid sequence is shown in the single letter code. The restriction sites used in construction are boxed. The translation initiation site is underlined.
(B) A schematic representation of the structure of the synthetic gene. The following regions are indicated : the 5'- and 3'- untranslated regions (UTR) derived from (3-globin mRNA, the Met-Gly dipeptide and the integrase open reading frame (ORF).
The 1o three domains of the integrase protein are shown: the Zinc finger motif (HHCC), the catalytic core and the DNA binding domain.
Figure 3. Western blot analysis of the 293T-derived cell line that stably expresses HIV-1 IN from the synthetic gene. 293T cells were transfected with pCMV-INs and a stable cell line was selected with HygromycinB. Cell extracts (10 pg of total protein) were separated by PAGE and blotted onto PVDF membrane.
Detection was performed using polyclonal antibodies against HIV-1 integrase and the ECL+
detection system. Lane 1, 2.5 ng recombinant His-tagged HIV-1 integrase; Lane 2, extract of 293T cells; Lane 3, extract of 293T cells stably expressing IN
(293T-INs).
Figure 4. Detection of integrase activity using a promoterless reporter construct (DIPR) Figs. 4A-C are schematic representations of the method of detection of integrase activity using a promoterless reporter gene.
DESCRIPTION OF THE ILLUSTRATED EMBODIMENTS
The present invention will mainly be described with reference to a synthetic gene for overexpressing HIV integrase in mammalian cells but the invention is not limited thereto but only by the claims.
It has long been known that expression of eukaryotic genes in prokaryotes can be optimised by designing synthetic genes with modified codon usage. Less established, although demonstrated, is the concept of increasing expression of 3o eukaryotic genes in eukaryotic cells by modified codon usage. From bacteria it is known that few general rules apply (Makrides F. (1996), Microb. Rev 60: 512-538).
A retroviral enzyme such as integrase does not normally, during the infectious cycle, work as a soluble protein in the cytoplasm of a host cell.
Integrase is part of a large ill-defined nucleoprotein complex called the preintegration complex of which also reverse transcriptase, nucleocapsid, matrix protein, the viral DNA and other factors are part. It is not obvious that integrase on its own in the cytoplasm of a target cell is enzymatically active, for example, there may be cellular factors which inhibit activity or viral factors which are missing in this environment.
Further, it is not obvious that integrase expressed as such will interact with artificial DNA substrates (see DIPR below). One aspect of the present invention is dissecting the preintegration complex to obtain a simple integrase-linear DNA interaction.
One embodiment of the present invention is a method to detect and utilize the enzymatic activity of a retroviral, in particular a lentiviral enzyme, in particular integrase by itself in a eukaryotic cell.
Initially eukaryotic expression vectors encoding HIV-1 IN and IN-RRE were constructed based on the reasoning that co-expression of Rev in cells transfected with IN-RRE would increase expression levels of IN. However, in human cells transiently transfected with these expression vectors, little or no expression of IN was detected by either immunofluorescence microscopy or western blotting (Fig. 1). An alternative approach consisted of introducing the constitutive transport element (CTE) of simian retrovirus type 1 behind the integrase gene. Again, expression was barely detectable upon prolonged exposure of the blot and amounted to merely 40 ng per 10 x 106 2o transfected cells. The construction of a C-terminal fusion to green fluorescent protein (GFP) (GFP-IN) resulted in a more pronounced expression of wild-type HIV-1 integrase expressed in mammalian cells (Pluymers et al. (1999), Virology 258:

332). Rev co-expression was not required, in accord with Kukolj et al. (1997), J. Virol.
71: 843-847), who expressed integrase as a C-terminal fusion protein with (3-galactosidase in the absence of Rev. The impact of the INS in the IN gene on protein expression levels was illustrated by a 5-fold decrease in expression levels of the GFP-IN construct compared to the parental GFP (Pluymers et al. (1999), Virology 258: 327-332). The present invention is based on a synthetic gene for HIV-1 integrase with an increased intrinsic mRNA stability. The use of such a synthetic gene resulted in high expression levels and concurrent enzymatic integrase activity as could be demonstrated via complementation tests.
In accordance with the present invention, an integrase gene was synthesised with an increased GC content resulting in high level expression of HIV-1 IN in various mammalian cell lines. The enzyme was shown to complement defective integrase carried by HIV-1-derived vector particles and to act in trans on linear DNA
substrates that are flanked by LTR fragments and encode a reporter gene.
DESIGN AND CONSTRUCTION OF THE SYNTHETIC INTEGRASE GENE
Synthetic genes have been constructed in the past to optimize expression of eukaryotic genes in bacteria based on the knowledge that codon usage in prokaryotes is quite different from that in eukaryotes. HIV (lentiviral) genes are not optimal for high level expression in eukaryotic cells. This is related to the mechanism HIV
uses to to circumvent the mRNA instability, namely Rev. During the replication cycle early mRNA transcripts will be spliced which results in expression of regulatory proteins such as Tat and Rev. Only late in the cycle, does Rev accumulation and Rev-RRE
interaction block splicing and suppress AT-rich instability sequences resulting in unspliced transcripts encoding structural and enzymatic proteins. Whereas the synthetic gene in accordance with the present invention clearly augments protein expression in mammalian cells, which is a prerequisite to detect the functionality of the enzyme in the cell, in the context of replicating HIV the presence of a gene with an increased GC
content may well interfere with the mechanism of regulation of gene expression and be detrimental for viral replication.
2o In accordance with an embodiment of the present invention a synthetic viral gene was designed for better optimized and more efficient expression in mammalian cells. The HIV-1 integrase gene has a GC content of 40% whereas highly expressed human genes on average have a GC content of 55-61%. Hence, the GC content is one aspect of the preferred nucleotide relationship or nucleotide pair frequency in accordance with the present invention. By employing the degenerative nature of the genetic code and selecting for the preferred codon usage in the synthetic gene, the GC
code content of a synthetic gene encoding HIV integrase would be increased up to 66%
without altering the amino acid sequence. However, this is not preferred in accordance with the present invention. First of all, in accordance with the present invention, the 3o choice among the alternative codons was biased in favour of preferred triplets (codons) found in a small group of genes of the total human genome which express well/strongly, e.g. human (3-globin, a-, y-actin and EF2 genes (method of determining the preferred codon usage). In addition the bias was such as to approximate the preferred nucleotide relationship or nucleotide pair frequency, i.e. within the range 53 to 63%, more preferably 55-61% for the GC content. In fact a GC content of 59%
rather than 66% was achieved. The other rules for redesigning retroviral genes for eukaryotic expression are: (i) removal of potential splice sites, (ii) reduction of the number of CpG methylation sites, (iii) introduction of S' and 3'-untranslated regions (UTR) of a mammalian mRNA (in our case from human (3-globin), (iv) addition of an extra N-terminal peptide (Met-Gly for the examples given below) for efficient initiation of translation. As a result expression levels from the synthetic gene in various mammalian cell lines were at least 25-fold higher than from the natural integrase gene.
to Efficient expression was also obtained in yeast (Pichia pastoris) (data not shown).
In accordance with one embodiment of the present invention a gene is provided to achieve high level expression of HIV-1 integrase in human cell lines by maintaining the amino acid sequence of IN from the pNL4-3 clone of HIV-1 while adapting the nucleotide codon usage to the codon usage of constitutively and highly expressed human genes ("preferred codon usage"). A first version of an artificial IN
reading frame was based on random choice between alternative codons at each position using a random number generator, biasing in favour of preferred triplets as found in the human (3-globin, a-, y-actin and EF2 genes. Next, the DNA sequence was substantially edited to remove potential splice sites and to reduce the number of CpG methylation sites, but keeping the overall GC content and codon usage close to optimal ("preferred nucleotide relationship" or "nucleotide pair frequency"). The final version of the synthetic gene (Fig. 2 or SEQ ID NO: l ) contains fragments of the 5'- and 3'-untranslated regions from the (3-globin mRNA. This gene encodes for wild type HIV-1 integrase with addition of the N-terminal Met-Gly dipeptide. The extra glycine codon completes the Kozak's consensus sequence (ANNATGG) required for efficient initiation of translation.
In the synthetic gene the overall GC content is 59% compared to 40% in the wild type.
The gene was constructed from six synthetic DNA fragments, each approximately 1 SO
by long, by stepwise cloning. It should be understood that various homologs of the gene shown in Fig. 2 or SEQ ID NO: l are included within the scope of the present invention.
Reapplication of the random number biasing procedure in accordance with the present invention would generate alternative sequences all of them coding for the same protein and all having a similar preferred nucleotide relationship or nucleotide pair frequency.
All such synthetic gene homologs are included within the scope of the present invention.
The synthetic gene includes modification to those described above, the following modifications and improvements of the synthetic gene are included within the scope of the present invention. For example, the leader peptide can be replaced affecting the efficiency of translation and potential myristoylation (e.g. for example, a Met-Ala variant has been constructed). The 5' and 3'-UTRs may be replaced by UTRs from other mammalian mRNAs to optimize the stability of the transcript.
Mutations in the open reading frame are also included within the scope of the present invention whereby the canonical integrase sequences (e.g. HHCC and DD35E) are preferably left to unchanged. A more soluble version can be made by introducing for example the F 185K/ F 185H mutations. Other mutations may induce increased or altered catalytic activity of the enzyme in the eukaryotic cell. For example, the present invention includes a variant synthetic gene with the D64V mutation, known to reduce drastically the enzymatic activity of integrase. Synthetic genes of integrase are included within the scope of the present invention in which the genetic information of domains of other proteins are added. These domains preferably add additional properties to the enzyme such as sequence specificity in DNA binding. Examples of methods of providing specificity to a gene encoding integrase are described in WO 96/37626, US
5,811,270 without describing the specific innovative aspects of the present invention.
The synthetic gene for HIV-1 integrase was designed to circumvent inhibition of gene expression induced by instability sequences (INS) in the wild type integrase gene. This approach can be applied to retroviral integrases in general. In particular the aforementioned design method may be used to redesign any retroviral viral gene encoding a protein with enzymatic activity for efficient expression in eukaryotes. In particular, the design method of synthetic genes in accordance with the present invention will boost eukaryotic expression for retroviral genes encoding a protein with enzymatic activity, especially lentiviral integrases and pol proteins in general. The particular approach of the present invention could also be applied to redesign gag genes, in which mRNA instability due to the presence of INS elements and not poor translatability, like for env, would be the problem. Although the role of Rev in suppressing the effect of 1NS is only well studied in the case of HIV-l, all other lentiviruses are known to encode proteins analogous to Rev. Likewise the human T-lymphotropic and bovine lymphotropic viruses (HTLVs and BLV) encode Rev.
Simple retroviruses such as Mason-Pfizer monkey virus and simian retrovirus-1 (SRV-1) contain a constitutive transport element (CTE) that promotes nuclear export of unspliced mRNA. It has been shown that CTE can functionally substitute for Rev interacting with RRE. In fact, a low level transient expression from a wild type 5 integrase gene with a downstream CTE of SRV-1 has been obtained by us using the methods of the present invention. Since the design of a synthetic gene in accordance with the present invention abolishes any need for co-expression of Rev and presence of RRE or CTE in the construct, this approach can improve expression of retroviral enzymes in general and integrases in particular.
10 In creating mammalian expression vectors, various eukaryotic expression plasmids can be used. Expression can be under control of a constitutive promoter (for example hCMV and RSV) or an inducible promoter. Examples of (commercially available) inducible expression systems are the ecdysone-inducible and the tetracyclin-inducible (Tet-Off and Tet-On) expression systems. Tissue-specific promoters that 15 limit expression in specific tissues may also be envisaged. Examples are the established neuron-specific promoters Thy-1 and enolase. Inducible promoters may limit cellular toxicity, although a cell line that stably expresses integrase was obtained.
In transgenic non-human animals harbouring the synthetic gene, expression may be induced at a desired moment using an inducible promoter or in desired tissues using a tissue-specific 2o promoter.
Transient and stable expression of HIV-1 integrase in 293T and HeLa cells The synthetic gene for integrase (INS) was cloned into the expression vectors pCEP4 and pBK-RSV under control of the human cytomegalovirus (hCMV) and Rous sarcoma virus (RSV) promoters, respectively. Transient and stable expression of IN
was obtained in both 293T and HeLa cell lines, as verified by immunoblotting (Fig. l, 3) and indirect immunofluorescence (data not shown). In transfected 293T cells the expression levels from the hCMV promoter amounted to 10-20 ~g of IN per 10 x cells which is at least 25-fold higher than obtained with expression vectors that contain 3o the unfused wild type HIV-1 integrase gene.
Transfection of 293T cells with the episomal expression vector pCEP-INs followed by selection with hygromycinB, resulted in a stable cell line, referred to as 293T-lNs. Indirect immunofluorescence staining revealed that 80-90% of selected cells produce integrase at detectable levels. The expression level, as estimated by quantitative immunoblotting, was about 0.5 ~g of integrase per 10 x 106 cells.
The reduced cell growth kinetics of 293T-INs (30-50% as compared to the parental cell line) is suggestive of cellular toxicity of integrase in mammalian cells.
In HeLa cells integrase was found exclusively in the nuclei. In 293T cells transient transfections typically gave rise to an irregular, granular cytoplasmatic distribution of IN, probably due to precipitation of the protein. In the 293T cell line selected to stably express IN, nuclear localization of IN was evident. During the metaphase and anaphase steps of mitosis, IN remained stably associated with the chromosomes.
to Solid phase N-terminal sequencing of integrase purified from transiently transfected 293T cells, revealed the following amino terminus: Gly-Phe-Leu-Asp-Gly-Ile-Asp-Lys. This is the sequence predicted by the synthetic gene, the starting methionine being removed post-translationally.
Functionality of INs Complementation of IN-defective vector particles To verify whether the integrase expressed from the synthetic gene in mammalian cells is enzymatically active, the ability of IN to complement integrase-2o defective HIV-derived lentiviral vectors was tested. HIV-1-derived lentiviral vectors have been developed by Naldini et al. and Zufferey et al. (Naldini et al.
(1996), Science 272: 263-267; Zufferey et al. (1997), Nature Biotechnol. 15: 871-875).
Pseudotyped lentiviral vector particles are produced by transfecting 293T cells with a packaging plasmid encoding viral gag and pol proteins, a plasmid encoding the envelope of vesicular stomatitis virus and a plasmid encoding a reporter gene flanked by two long terminal repeats (LTRs). The first generation packaging plasmid pCMVOR8.2, containing all HIV genes except for env and the transfer vector pHR'-CMVLacZ
were used to produce wild type vector (WT vector). Integrase-defective virus particles (D64V vector) were produced using pCMVOR8.2IN(D64V) (Naldini et al. (1996), 3o Science 272: 263-267). The D64V mutation in the integrase gene is known to abolish integrase activity, without affecting any other step of the infection (Leavitt et al.
(1993), J. Biol. Chem. 268: 2113-2119; Leavitt et al. (1996), J. Virol. 70:
721-728).
The transducing titer of the D64V vector in 293T cells was 20-fold lower than the titer of WT vector (Table 1). This is in good agreement with previously reported results (Naldini et al. (1996), Science 272: 263-267). The observed "background"
expression after D64V transduction, is mostly due to transcription from non-integrated circularized viral DNA since (3-galactosidase expression after D64V transduction is reduced drastically upon passaging the cells (Table 1). Nevertheless, in some of the transduction experiments 1 or 2 galactosidase-positive colonies were observed. A residual transducing activity of D64V virus was observed before (Gaur and Leavitt ( 1998), J.
Virol. 72: 4678-4685). It is possible that this integration is independent of the viral integrase.
to Complemented vectors (C IN) were produced after quadruple transient transfection of producer cells, including pCEP-INS, the expression vector containing the synthetic gene. The transducing activity was restored up to 30% with C IN
(Table 1 ). Complementation was due to stable integration, since an equal proportion of galactosidase-positive colonies was counted after multiple passages of the transduced cells. The principle of trans-complementation of IN-defective virus was shown previously, using VPR-IN fusion expression constructs (Fletcher et al. (1997), EMBO
J. 16: 5123-5138). The transducing activity of catalytic domain mutants of IN
was restored up to 20% by transcomplementation with VPR-IN. However, in the absence of VPR, the expression construct for wild type integrase, only achieved 0.04%
2o complementation efficiency (Fletcher et al. (1997), EMBO J. 16: 5123-5138).
The synthetic gene in accordance with the present invention, in the absence of VPR, results in a complementation activity that is 750-fold more pronounced.
Moreover, evidence for trans-complementing activity of integrase expressed from the synthetic gene in target cells was also obtained (Table 1).
Transduction of IN
expressing 293T cells with IN-defective virus particles, resulted in a higher transduction efficiency as compared with the parental 293T cells. After passaging the transduced cells, the difference became even more pronounced. This points to a catalytic interaction of integrase present in the receptor cell with the pre-integration complex of the incoming vector. For the wild type and the complemented vectors increased transduction efficiencies were obtained as well. This may suggest that the amount of active integrase present in the viral particle is dose-limiting or that integrase present in the target cell neutralizes inhibitory host factors.

Detection of integrase activity using a promoterless reporter gene (DIPR).
Integration of HIV in the chromosome does not show strict sequence-specificity, although a weak consensus was found for the integration sites (Carteau et al.(1998), J. Virol. 72, 4005-4014). It is commonly accepted although not formally proven, that retroviral integration is favored in open chromatin near or within active transcription units (Rohdewohld et al. (1987), J. Virol. 61: 336-343; Scherdin et al.
(1990), J. Virol. 64, 907-912; Vijaya et al. (1986), J. Virol. 60: 683-692;
Carteau et al.
(1998), J. Virol. 72, 4005-4014). The design of a promoterless reporter substrate for measuring integrase activity in cell culture, is based on this finding (Figs.
4A - C). In to accordance with an embodiment of the present invention a method is proposed in which read-through transcription of the integrated promoterless reporter gene will occur when inserted within an actively transcribed region of the chromosome. The construct designed is a linear DNA fragment, flanked by the 200 by terminal fragments of the HIV LTRs that provide the integrase recognition sites. The marker gene may encode luciferase, for instance. The presence of an IRES (internal ribosome entry site) in front of the open reading frame of luciferase, directs cap-independent translation of mRNA
transcripts (Fig. 4A).
After transfection with this DIPR substrate (Fig. 4B, C), luciferase activity measured in 293T-INs cells was always 4 to 10 times higher than in the parental 293T
cells (Table 2). In the DIPR assay activity of the D64V mutant integrase was reduced compared to the wild type integrase (data not shown). These results point to an activity of the intracellularly expressed integrase (expressed by the synthetic gene in accordance with the present invention) (Fig. 4C). Sequencing of integrated linear DNA
molecules in 293T cells transiently expressing integrase from the synthetic gene using Alu-PCR, revealed the characteristic removal of the 3' GT dinucleotide in 10%
of integrants. In control cells not expressing integrase none of the DNA
insertions showed this hallmark.
Applications 3o An embodiment of the present invention includes the construction of an efficient eukaryotic expression vector for a retroviral enzyme, e.g. HIV-1 integrase, based on the creation of a synthetic gene. Expression from the eukaryotic expression vector can be transient after transfection of the plasmid in a eukaryotic cell by any of established transfection procedures. Expression may also be permanent in a cell line stably harbouring the expression vector. An important aspect of the present invention and its applications is the functionality of an expressed retroviral enzymatically active protein, as opposed to mere the high level expression of an enzymatically inactive retroviral protein.
Intracellular integrase test for the evaluation of integrase inhibitors An embodiment of the present invention includes assays for evaluating integrase activity in cells transfected with a DNA substrate that is flanked by fragments l0 of HIV LTR, a so-called mini-HIV. In both assays data point to enzymatic activity of IN.
In DIAS (detection of integrase activity through antibiotic selection) test, a resistance gene to a cytotoxic drug is present in the mini-HIV DNA. The presence of IN in the transfected cell augments stable insertion of the resistance gene in the chromosome. Scoring is performed by comparing the residual number of colonies resistant to the cytotoxic agent in comparison with cells transfected with heterologous DNA.
In DIPR (detection of integrase activity using a promoterless reporter gene), a reporter gene (luciferase) without promoter is present downstream of an internal ribosome entry site (IRES) in mini-HIV (Fig. 4A). The presence of IN in the transfected cell (Fig. 4B) augments stable insertion of the reporter construct in the host chromosome in close proximity to a cellular promoter (Fig. 4C). Scoring is performed by measuring enzyme activity expressed from the promoterless marker gene, e.g.
luciferase. The latter assay is highly amenable to evaluation of integrase inhibitors in cell culture in a microtiter plate format, adaptable for high throughput screening.
Potential integrase inhibitors would result in the absence of or a significant decrease in the level of detectable signal from the promoterless marker gene.
Such an assay in accordance with the present invention involves screening test inhibitory compounds from large libraries of synthetic or natural compounds.
Synthetic compound libraries are commercially available from, for example, Maybridge Chemical Co. (Trevillet, Cornwall, UK), Comgenex (Princeton, NJ), Brandon Associates (Merrimack, NH) and Microsource (New Milford, CT). A rare chemical library is available from Aldrich Chemical Company, Inc. (Milwaukee, WI).

Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant and animal extracts are available from, for example, New Chemical Entities, Pan Laboratories, Bothell, WA or MycoSearch (NC), Chiron, or are readily producible.
Plant extracts may also be obtained form the University of Ghent, Belgium.
5 Additionally, natural and synthetically produced libraries and compounds are readily modified through conventional chemical, physical, and biochemical means.
Tool for non-viral cellular gene delivery.
Cell lines that express integrase from a synthetic gene do have greater 1o propensity to integrate foreign DNA, flanked by LTR fragments. These cell lines are thus more transducible. An embodiment of this invention is the creation of eukaryotic cell lines (or cell culture systems) that are highly transducible (at least 200% compared to the parent cell). Embodiments of the present invention also include applications in transgene technology to increase the efficiency of (non)homologous recombination in 15 ES cells be it by transient expression from a plasmid or after induced expression of the retroviral integrase in ES cells transgenic for the synthetic gene. The synthetic gene in accordance with the present invention may be brought into cells by any transfection agent or method (e.g. electroporation or lipofection) and may result in the stable integration of DNA in the chromosome.
Retroviral (lentiviral) vector packaging construct From the complementation experiment it is clear that integrase expressed from the synthetic gene in the producer cell can complement integrase-defective lentiviral virus particles encoded by a packaging plasmid and thus can substitute for the protein expressed by the packaging construct. It follows that in an expression vector based on one or more synthetic genes) for a lentiviral gag pol gene, the synthetic genes) can substitute for the natural genes) in the packaging constructs resulting in Rev-independent high level protein expression. The present invention includes a packaging construct based on non-lentiviral complex retroviruses in which protein expression is 3o dependent on a Rev homologue such as Rex in the case of HTLV-I. Lentiviral vectors per se, capable of transducing a non-dividing cell, are known in the art (see Naldini et al. (1996), Science 272: 263-267, Zufferey et al. (1997), Nature Biotechnol.
15: 871-875). Generally the vectors are plasmid-based or virus-based, and are configured to carry the essential sequences for incorporating nucleic acid, for selection and for transfer of the nucleic acid in the host cell. Gag, pol and env genes of interest are known in the art. Briefly, a first vector can provide a nucleic acid encoding a viral gag and a viral pol, and a second vector can provide a nucleic acid encoding a viral env gene product to produce a packaging cell. Packaging cells or cell lines supply in trans the proteins necessary for producing infectious virions, themselves being incapable of packaging endogenous viral genomic nucleic acids (Watanabe & Temin (1983), Molec.
Cell Biol. 3(12): 2241-2249; Mann et al. (1983), Cell 33:153-159; Embretson &
Temin (1987), J. Virol. 61(9): 2675-2683). Introducing a vector providing a heterologous to gene, the transfer vector, into such packaging cells yields producer cells which release infectious particles carrying the foreign gene of interest. Methods for transfection or infection are well known by those skilled in the art. After cotransfection of the packaging vectors and the transfer vector to the packaging cell or cell line, the recombinant vector is recovered from the culture media and titered by standard methods used by those of skill in the art.
The foreign or heterologous gene carried by the transfer vector can be any nucleic acid of interest which can be transcribed, but preferably is a nucleic acid encoding for a polypeptide of therapeutic benefit or of interest for gene therapy. The env gene in the (second) packaging vector can be derived from any virus, including 2o retroviruses, and is preferably amphotropic, allowing transduction of cells of human and other species, and is preferably under control of non-endogenous regulatory sequences. Vectors can be made target-specific through linkage of the env protein with an antibody or a ligand for a particular receptor of a particular cell-type (cell-targeting).
Design of the gag-pol synthetic gene is based on a method to circumvent mRNA
instability associated with these wild-type genes. Preferentially, the method used by us to create an expression construct for high level and Rev independent eukaryotic expression of active HIV-1 integrase is employed. Further construction of the vectors of the present invention, whereby natural gag pol genes are replaced respectively by the synthetic genes of the present invention, employ standard ligation and restriction 3o techniques which are well understood in the art (see Maniatis et al, in Molecular Cloning: a Laboratory Manual, Cold Spring Harbor Laboratory, N.Y., 1982).
Biosafety requires measurements to reduce the risk of generating recombinant replication competent retroviruses (RCR) as much as possible. Dividing the packaging functions into two genomes, one which express the gag and pol gene and another expressing the env gene product help to minimize the likelihood of generating RCR.
That approach minimizes the ability for co-packaging and subsequent transfer of the two-genomes, as well as significantly decreases the frequency of recombination due to the presence of three retroviral genomes in the packaging cell to produce RCR.
To render any possible recombinations non-functional, mutations (Danos & Mulligan (1988), Proc Natl. Acad. Sci 85: 6460-6464) or deletions (Bosselman et al.
(1987), Molec. Cell Biol. 7(5):1797-1806; Markowitz et al. (1988), 62(4):1120-1124) can be configured within the undesired gene products. Deletion of the 3'LTR of both 1o packaging constructs will further reduce the likelihood to form functional recombinants. US5,994,136 describes the production of lentiviral vectors with an even more remote possibility of generating replication competent lentiviruses by functionally deleting the tat gene, which is encoding for a regulating protein that promotes viral expression through a transcriptional mechanism. The likelihood of recombination between the transfer vector, that still contains natural genetic information of the lentivirus like part of the gag gene, and synthetic packaging genes will be considerably reduced, further improving the biosafety of the lentiviral vectors. DNA
sequence mismatching, as induced by the replacement of nucleotides in the third position compared to the natural gene, seems to present a considerable barrier to homologous 2o recombination in a wide variety of species. It is therefore also unlikely that contaminating or endogenous HIV virus particles would exchange the natural integrase gene for the synthetic one through recombination.
Experimental Procedures DNA constructs Construction of integrase expression plasmids The open reading frame of IN from the HIV-1 clone HXB2 was PCR
amplified using Pfu DNA polymerase (Stratagene, Cambridge, UK) with the primers 5'-CCCCCAAGCTTGCCAGCCATGTTTTTAGATGGAATAGATAAGG and 5' CCCGCTCGAGGTTTCCTTGAAATATACATATGGTG and subcloned in pCEP4 (Invitrogen, Leek, The Netherlands), resulting in pCEP-IN. The absence of mutations was verified by DNA sequencing. The RRE sequence of HIV-1, clone HXB2, was PCR

amplified using the primers 5'-TTCCGCTCGAGTAGCACCCACCAAGGCAAAGAG
and 5'-TCGCGGATCCAAGGCACAGCAGTGGTGCAAATG. The PCR fragment was subcloned in the sense orientation downstream of the integrase gene in pCEP-IN to produce pCEP-IN-RRE. The CTE sequence (obtained from plasmid pS 12; Taberno et al., 1996, J. Virol. 70: 5998-6011) was cloned in pCEP4 in the correct orientation, followed by the insertion of the integrase gene upstream of the CTE. This resulted in the plasmid pCEP-IN-CTE. The construction of pGFP-IN is explained in Pluymers et al. (1999), (Virology 258: 327-332). The Rev expression plasmid, pEF321-cREV, was provided by Sandoz Forschungs Institut, Vienna, Austria. PCR amplification and 1o plasmid construction employed standard techniques like standard ligation and restriction techniques and conditions which are well understood in the art (see Maniatis et al, in Molecular Cloning: a Laboratory Manual, Cold Spring Harbor Laboratroy, N.Y., 1982).
Assembly of the synthetic gene The restriction sites NheI, PstI, BamHI, NaeI, NarI (indicated in Fig.2 ) divide the sequence of the synthetic gene into 6 fragments each approximately 150 by long that correspond to the sequences 1-149, 144-306, 301-456, 451-623, 618-776, (Fig. 2). Each of the fragments was constructed separately by annealing and extending 2o two partially complementary oligonucleotides (85-95 nt long, PAGE-purified and 5'-phosphorylated, synthesized by Gibco BRL Life Technologies, Merelbeke, Belgium) using Sequenase (Amersham-Pharmacia, Buckinghamshire, UK) . Each fragment was cloned into the EcoRV site of the vector pBluescript KS(+) (Stratagene, La Jolla, CA).
The sequence errors found in the resulting clones were repaired using either the Stratagene Quick Change procedure with Pyrococcus furiosus (Pfu) polymerise (for a base substitution in the fragment 451-623) or PCR (for deletions in the terminal regions of the fragment 1-149). The full 930 by sequence was built by stepwise assembly of the fragments similar to the method described in W098/12207. Choice of the cloning vector (pBluescript KS or SK) at each step was dictated by toxicity of the IN
coding 3o DNA. Finally, the two halves of the IN gene (1-451 and 452-930) were ligated together and cloned into pBluescript KS(+) resulting in plNs.
Construction of mammalian expression vectors for INs The plasmid pINs was digested by EcoRI and treated with T4 DNA polymerase followed by restriction with XhoI. The 1 kb fragment carrying the INs gene was cloned between the PvuII and XhoI sites of pCEP4 (Invitrogen, Leek, The Netherlands) resulting in pCMV-Ins. pCEP4 is an episomal mammalian expression vector containing the human cytomegalovirus (hCMV) immediate early enhancer/promoter. The Epstein Barr virus replication origin (oriP) and nuclear antigen (encoded by the EBNA-1 gene) permit extrachromosomal replication in human, primate and canine cells. A
hygromycin resistance gene is present, permitting selection of stably transduced clones by hygromycinB (GIBCO BRL). The same 1 kb fragment was also cloned between l0 NheI and XhoI sites of the pBK-RSV expression vector (Stratagene) (the NheI
cohesive end of the vector DNA was filled in using T4 DNA polymerase) resulting in pRSV-INs. In this vector expression of INs gene is driven by the promoter of Rous sarcoma virus (RSV). The presence of the neomycin resistance gene allows selection of stably transduced clones by geneticin (G418) (GIBCO BRL).
Construction of the substrate for the DIPR assay The DNA substrate for the DIPR assay was obtained by linearization of pLTR-IRES-Luc with ScaI. This plasmid was constructed in the following way. First, the 350 by KpnI/EcoRI fragment of pU3U5 (Cherepanov et al.(1999), Nucleic Acids Research 27: 2202-2210) containing the terminal U3 and U5 regions of the HXB2 HIV-1 LTRs was cloned between the Kpnl and EcoRI sites of pUC 19 resulting in pUC-LTR.
Then the ScaI site occurnng in the ampicilline resistance gene of pUC 19 was destroyed by partial digestion of pUC-LTR with ScaI and insertion of a fragment containing the kanamicine resistance gene from the Tn5 transposon yielding pUC-LTR-kan.
Finally, 7.5 kb pLTR-IRES-Luc was obtained by cloning the BamHI/PstI fragment of pBIR
(Martinez-Salas et al. (1993), J. Virol. 67, 3748-3755) carrying the IRES-luciferase gene cassette, made blunt with T4 DNA polymerase (Gibco BRL), into the SmaI
site of pUC-LTR-kan.
3o Cell culture HeLa and 293 cells were obtained from American Type Culture Collection.
HeLa and 293 cells were grown in Dulbecco's modified Eagle's medium (DMEM) (GibcoBRL) supplemented with 10 % FCS, 0.12 % (v/w) sodium bicarbonate (GibcoBRL), 2 mM glutamine (GibcoBRL) and 20 ~.g/ml gentamycin (GibcoBRL) at 37oC in 5 % C02 humidified atmosphere. 293T cells (obtained from Dr. O. Danos, Evry, France) express SV40 large T antigen and were grown in DMEM (GibcoBRL) with glutamax supplemented with 10% fetal calf serum, 45 U/ml penicillin G
(Serva, 5 Heidelberg, Germany) and 45 p.g/ml streptomycin sulphate (Sigma-Aldrich, Bornem, Belgium).
293 and 293T cells were transfected using polyethylenimine (PEI) (Abdallah et al. (1997), Hum. Gene Therapy 7:1947-1954). Polyethylenimine Mw ~ 25.000 was from Sigma-Aldrich (Bornem, Belgium). Cells were grown to 50-70 % confluency in 1o DMEM with glucose, glutamax and 10 % fetal calf serum (FCS) (Gibco BRL).
Medium was replaced by medium containing 1 % FCS 3 hours before transfection.
Mixture of DNA and PEI was added to cells in a minimal volume of medium. Next day the medium was changed to DMEM containing 25 mM HEPES. Transformation efficiency obtained in this way was 50-80 %. HeLa cells were routinely transfected by 15 electroporation. The cells were first trypsinized at 80 % confluency and pelleted by low speed centrifugation. The cells were then resuspended at a density of 2 x 106 cells/ml in growth medium; 0.5 ml of this solution was aliquoted into 4 mm cuvettes (Eurogentec, Seraing, Belgium) and 20 qg DNA was added to the cell suspension. After the electric pulse ( 10 qF, 250 V), cells were allowed to rest for 10 min at room temperature before 20 dilution into growth medium..
To establish stable cell lines expressing the INs gene, cells transfected with pRSV-INs or pCMV-1Ns were cultured in the presence of 500 ~g/ml geneticin (G418) or 200 ug/ml hygromicin B (both from GIBCO BRL), respectively. Expression of IN
was assessed by western blotting and/or indirect immunofluorescence.
Western blotting and immunofluorescence For western blotting and indirect immunofluorescence rabbit polyclonal antibodies directed against recombinant His-tagged HIV-1 IN were produced in house and were purified using a 1 HiTrap rProteinA column (Pharmacia Biotech, Uppsala, 3o Sweden) according to established procedures (Ausubel et al. (1987), Current protocols in molecular biology, John Wiley & Sons, New York). Western blotting was performed using PVDF membranes (Bio-Rad), the ECL+ chemiluminescent detection system (Amersham-Pharmacia) and HRP-conjugated goat anti-rabbit antibodies (Bio-rad).

Dilutions used were 1:30000 for the primary antibody and 1:20000 for the secondary antibody. Detection limit was 0.1-0.5 ng of recombinant integrase. Total protein concentration was determined on cells lysed with 1%SDS/1 mM PMSF (Sigma), using the BCA protein assay (Pierce, Illinois USA). For western blot analysis 10 ~g of total protein was evaluated.
For detection of IN expression in situ by indirect immunofluorescence microscopy, cells were grown on glass slides (HeLa cells) or in permanox chamber slides (GIBCO BRL) (293T cells). After 24-48 hrs, cells were washed with phosphate buffered saline (PBS) supplemented with 1 mM Mg2+ and 0.5 mM Ca2+(PBS+), fixed in 100% methanol and blocked with 10% foetal calf serum (FCS) in PBS+.
Incubations with antibodies were carried out at 37~C in blocking solution. The primary antibody (rabbit anti-IN) was diluted 1:20 to 1:80; the secondary FITC-conjugated swine anti-rabbit antibody from Dako (Glostrup, Denmark) was diluted 1:40. Nuclear staining was performed with 1 pg/ml 4', 6-diamidino-2-phenylindole (DAPI) (Sigma) in methanol.
Fluorescence microscopy was performed with a Leitz microscope (Wetzlar, Germany) using filter blocks I2 (FITC) or A (DAPI).
Detection of integrase activity using a promoterless reporter gene (DIPR) 293T and 293T-INs cells were seeded in six-well plates at a density of 106 cells/well 24 hr before transfection. Five pg of DNA was transfected per well using PEI. 48 hr post-transfection, 5 x 105 cells were lysed to determine the luciferase activity using the Luciferase Assay SystemTM (Promega Benelux, Leiden, The Netherlands) and the LumicountTM (Packard, Meriden, CT). The protein concentration of the lysate was determined using the Bradford method (Bio-Rad protein assay, Bio-Rad, Hercules, CA). The relative luciferase activity was calculated by dividing the luminescence values by the protein concentration.
Lentiviral vectors Lentiviral vector production 3o HIV-1-derived vector particles, pseudotyped with the envelope of vesicular stomatitis virus (VSV), were produced by transfecting 293T cells with a packaging plasmid encoding viral gag and pol proteins (pCMVOR8.2), a plasmid encoding the envelope of vesicular stomatitis virus (pMDG) and a plasmid encoding a reporter gene flanked by two long terminal repeats (LTRs) (pHR'-CMVLacZ). The first generation packaging plasmid, containing all HIV genes except. for env, and the transfer vector were a kind gift from Dr. O. Danos (Genethon, France). For transfection of a 10 cm dish of 293T cells, a 700 p.l mixture of three plasmids was made in 150 mM
NaCI: 20 qg of vector plasmid, 10 ~g of packaging construct and 5 pg of envelop plasmid. To this DNA solution 700 ql of a PEI solution (110 p.l of a 10 mM stock solution in 150 mM NaCI) was added slowly. After 15 min at room temperature, the DNA-PEI
complex was added dropwise to the 293T cells in DMEM medium with 1% FCS. After overnight incubation, medium was replaced with medium containing 10% FCS.
to Supernatants were collected from day two to five post-transfection. The vector particles were sedimented by ultracentrifugation in a swinging-bucket rotor (SW27 Beckman, Palo alto, CA) at 25, 000 rpm for 2 hr at 4°C. Pellets were redissolved in PBS resulting in a 100-fold concentration. Different viral stocks were normalized based on p24 antigen content (HIV-1 p24 Core Profile ELISA, DuPont, Dreieich, Germany) for use in complementation assays.
Complementation experiments Integrase-defective virus particles were produced using pCMVOR8.2IN(D64V), obtained from Dr. D. Trono, (Geneva, Switzerland) as packaging plasmid (Naldini et al. (1996), Science 272: 263-267). Complemented vectors were produced by expressing integrase from pCEP-INs in 293T cells after quadruple transient transfection.
Vector preparations were normalized for p24 antigen count Vector was added to target cells in the presence of 2 ~,g/ml polybrene and left overnight. After removal of vector, cells were incubated for an additional 36 hrs. Cells were washed with PBS, fixed with 0.75%
formaldehyde/0.05% glutaraldehyde in PBS, and stained with freshly prepared X-gal substrate (5 mM potassium ferrocyanide, 5 mM potassium ferncyanide, 2 mM MgClz and 100 pg/ml 5-bromo-4-chloro-3-indolyl-(3-D-galactopyranoside (x-gal) (Biotech Trade & Service Gmbh, St. Leon-Rot, Germany) in PBS) at 37°C
overnight. Each transduction experiment was done in duplicate in a 96-well plate. Transduction 3o efficiency was determined by counting the number of blue cells 48 hrs after infection in one of the wells, whereas the cells in the duplicate well were splitted 1:2.
Half of the sample remained in the well and was stained at confluency (passage 1) whereas the other half was cultured in a 48-well plate. At confluency, these cells were again splitted 1:2. Finally, cells were brought in a 24-well plate and grown to confluency (passage 3, dilution 1:8). After staining, the efficiency of stable transduction was measured by counting blue colonies.
Tables Table 1. Complementation of inte~rase-defective lentiviral vector particles Relative transduction efficiency' Cells Passage WT vector D64V C IN
vector # 0 1.00 0.048 0.303 # 3 1.00 0.007 0.320 293T INs # 0 1.565 0.09 0.510 # 3 1.88 0.045 0.75 lTransduction efficiency is determined by counting galactosidase-positive cells (# 0) or colonies of galactosidase-positive cells (# 3) relative to transduction efficiency obtained by WT vector in 293T cells. Results of transduction by WT vector, D64V IN-defective vector and D64V vectors complemented with IN in the producer cells, are shown.
Cells were infected with normalized amounts of vector. Transduction was done both in cells and in 293T cells that are stably expressing IN. Average numbers for two separate experiments are shown.

Table 2. Detection of inte~rase activity usin , a promoterless reporter gene (DIPR) Luciferase activity (Relative units) Experiment Cell line Blanks LTR-IRES- LTR-IRES-Luc Lucb + pCMV-INs 293T-INs 1 487 119 -293T-INs 1 499 38 990 183 aRelative background luciferase activity in cell lines b 293T and 293T-INs were transfected with equal amounts of linearized pLTR-IRES
Luc. under experimental conditions A. In experiment B total DNA concentration was equalized with parental vector pCEP4.
°293T and 293T-INs were transfected with linearized pLTR-IRES-Luc and 2 ~.g of pCMV-INs.

SEQUENCE LISTING
<110> K .U. Leuven Research & Development Debyser, Zeger De Clercq, Erik Cherepanov, Peter Pluymers, Wim <120> A synthetic gene for expression of a retroviral protein with wild type activity in eukaryotic cells <130> K1291-PCT
<140>
<141>
<150> EP99201306.0 <151> 1999-04-26 <150> EP00200171.7 <151> 2000-O1-18 <160> 2 <170> PatentIn Ver. 2.1 <210> 1 <211> 930 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthetic gene expressing HIV integrase <220>
<221> CDS
<222> (27)..(899) <220>
<221> misc_signal <222> (24)..(30) <223> Kozak sequence <400> 1 atcactagca acctcaaaca gacacc atg gga ttc ctg gac ggc att gac aag 53 Met Gly Phe Leu Asp Gly Ile Asp Lys get cag gag gag cac gag aag tac cac tcg aat tgg cgg gcc atg gcc 101 Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala Met Ala tcc gac ttc aac ctg cca ccc gtc gtc get aag gag atc gtt get agc 149 Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val Ala Ser tgc gac aag tgc cag ctg aaa ggc gag get atg cac ggg cag gtt gat 197 Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln Val Asp tgc tct ccc ggc atc tgg cag ctc gac tgt act cac ctg gag ggc aag 245 Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu Gly Lys gtc atc ctg gtc gcc gtg cac gtg gcc tct ggt tac atc gag get gag 293 Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu Ala Glu gtc atc cct gca gag act ggc cag gag act gcc tat ttc ctg ctg aaa 341 Val Ile Pro Ala Glu Thr Gly Gln Glu Thr Ala Tyr Phe Leu Leu Lys g0 95 100 105 ctg gcc ggc cgg tgg cct gtg aag aca gtg cac aca gat aac ggc tcc 389 Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn Gly Ser aac ttc acc tcc acc act gtg aag get gcc tgc tgg tgg get ggg atc 437 Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala Gly Ile aag cag gag ttc ggg atc ccc tat aac cca cag tct cag ggc gtg atc 485 Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly Val Ile gaa tcc atg aac aag gag ctg aag aag atc atc ggc cag gtt cgg gac 533 Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val Arg Asp cag gca gag cac ctg aag act gca gtg cag atg gcc gtg ttc atc cac 581 Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe Ile His aac ttc aag cga aag ggc ggc atc ggt ggc tac tca gcc ggc gag cgg 629 Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly Glu Arg atc gtg gac atc atc gcc act gac atc cag acc aaa gag ctg cag aag 677 Ile Val Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu Gln Lys cag atc acc aag atc cag aac ttc cgt gtg tac tac cgg gac tcc cgg 725 Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp Ser Arg gac cct gtg tgg aag ggc cct gcc aag ctg ctg tgg aag ggc gag ggc 773 Asp Pro Val Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly Glu Gly gcc gtg gtc att cag gac aac tct gac atc aag gtt gtg ccc agg cgc 821 Ala Val Val Ile Gln Asp Asn Ser Asp Ile Lys Val Val Pro Arg Arg aag gcc aag att atc cgg gac tac ggc aag cag atg get ggc gac gac 869 Lys A1a Lys Ile Ile Arg Rsp Tyr Gly Lys G1n Met Ala Gly Asp Asp tgt gtg gcc tct cgt caa gat gag gac taa gtccaactac taaactgggg 919 Cys Val Ala Ser Arg Gln Asp Glu Asp gatattatga t 930 <210> 2 <211> 290 <212> PRT
<213> Artificial Sequence <223> Description of Artificial Sequence: synthetic gene expressing HIV integrase <400> 2 Met Gly Phe Leu Asp Gly Ile Asp Lys Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val Ala Ser Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln Val Asp Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu Gly Lys Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu Ala Glu Val Ile Pro Ala Glu Thr Gly g5 90 95 Gln Glu Thr Ala Tyr Phe Leu Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn Gly Ser Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala Gly Ile Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly Val Ile Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val Arg Asp Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe Ile His Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly Glu Arg Ile Val Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu Gln Lys Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp Ser Arg Asp Pro Val Trp Lys Gly Pro 225 230. 235 240 Ala Lys Leu Leu Trp Lys Gly Glu Gly Ala Val Val Ile Gln Asp Asn Ser Asp Ile Lys Val Val Pro Arg Arg Lys Ala Lys Ile Ile Arg Asp Tyr Gly Lys Gln Met Ala Gly Asp Asp Cys Val Ala Ser Arg Gln Asp Glu Asp

Claims (61)

Claims
1. A detection method for intracellular integrase activity using a promoteriess reporter.
gene.
2. The method according to claim 1, wherein the integrase activity is present in cell culture.
3. The method according to claim 1 or 2, wherein the integrase activity is present after transfection of an integrase gene.
4. The method according to any of claims 1 to 3, where the integrase activity is performed by an integrase protein.
5. The method according to claim 4, wherein the integrase protein is a wild type integrase enzyme.
6. The method according to any previous claim, wherein the integrase protein is mutated.
7. The method according to any of claims 1 to 6, wherein the integrase gene is mutated in order to obtain an optimised codon usage.
8. The method according to any previous claim, wherein the integrase gene is a synthetic gene having a portion of the wildtype codons relaced by other codons.
9. The method according to any previous claim, wherein the integrase protein is retroviral.
10. The method according to any of claims 1 to 9, where the integrase protein is lentiviral.
11. The method according to claim 10, wherein the integrase protein is an HIV
integrase.
12. The method according to any previous claim, wherein the reporter gene is one of a luciferase, GFP and an antibiotic selection marker.
13. The method according to any of the claims 1 to 11, wherein the reporter gene is a cytotoxic drug resistance gene.
14. The method according to any previous claim, wherein a reporter gene construct is generated from the reporter gene and the construct is used as the substrate of an enzymatically active retroviral protein expressed from a synthetic retroviral pol or gag gene, the synthetic gene having modified codon usage compared with a wildtype gene, the synthetic gene being for expression of a retroviral pol or gag gene or a region of a retroviral pol or gag gene in a eukaryotic cell, the expressed retroviral protein being at a level to provide detectable activity of a wild type function of the expressed retroviral protein in the eukaryotic cell.
15. The method according to claim 14, wherein the synthetic gene is for the expression of a lentiviral pol or gag gene or a region of a lentiviral pol or gag gene in a eukaryotic cell, the expressed lentiviral protein being at a level to provide detectable activity of a wild type function of the expressed lentiviral protein in the eukaryotic cell.
16. The method according to claim 14 or 15, wherein the synthetic gene is for the expression of a retroviral or lentiviral gag or pole gene or a region of a retroviral or lentiviral gag or pol gene where the gene or region thereof, after codon optimization for a eukaryotic host in which it is expressed, contains a GC nucleotide pair content between 53 and 63 %, more preferably between 55 and 61 %, and the expressed gene is expressed at a level to provide detectable enzymatic activity of the expressed retroviral or lentiviral protein in the eukaryotic cell.
17. The method according to any of claims 14 to 18, wherein the expression of the gag or pol gene or the region thereof is independent of retroviral regulatory proteins.
18. The method according to any of claims 14 to 17 wherein the retroviral protein is a lentiviral gag or pol protein or a fragment thereof.
19. The method according to claim 18 wherein the retroviral protein is an HIV
gag or pol protein or a fragment thereof.
20. The method according to any of claims 14 to 19, wherein the detectable activity of the enzymatic function includes at least promotion or stimulation of the integration of DNA fragments into the host cell DNA, preferably the chromosome if the host cell.
21.The method according to any of the claims 14 to 20, wherein the eukaryotc cell for expression of genes is a mammalian cell.
22. The method according to any of the claims 14 to 21 wherein the expressed protein has an expression level of at least 200% compared to the expressed wild type gene in a eukaryotic cell.
23. The method according to any of the claims 14 to 21 containing a synthetic gene comprising the sequence of Fig 2A or homologs thereof which have a GC content between 53 and 63% preferably between 55 and 61 percent.
24. The method according to any of claims 1 to 23, wherein a reporter gene construct is generated from the reporter gene and the construct contains an internal IRES.
25. The method according to any previous claim, wherein the reporter gene codes for an enzyme.
26. Use of a method according any of the claims 1 to 25 for screening for integrase inhibitors.
27. An integrase inhibitor obtained by the method of claim 26.
28. Packaging construct for a lentiviral or complex retroviral vector based on a synthetic retroviral pol or gag gene, the synthetic gene having modified codon usage compared with a wildtype gene, the synthetic gene being for expression of a retroviral pol or gag gene or a region of a retroviral pol or gag gene in a eukaryotic cell, the expressed retroviral protein being at a level to provide detectable activity of a wild type function of the expressed retroviral protein in the eukaryotic cell.
29. Packaging construct according to claim 28 wherein the synthetic gene is for the expression of a lentiviral pol or gag gene or a region of a lentiviral pol or gag gene in a eukaryotic cell, the expressed lentiviral protein being at a level to provide detectable activity of a wild type function of the expressed lentiviral protein in the eukaryotic cell.
30. Packaging construct according to claim 28 or 29, for the expression of a retroviral.
gag or pole gene or a region of a retroviral gag or pol gene where the gene or region thereof, after codon optimization for a eukaryotic host in which it is expressed, contains a GC nucleotide pair content between 53 and 63 %, more preferably between 55 and 61 %, and the expressed gene is expressed at a level to provide detectable enzymatic activity of the expressed retroviral protein in the eukaryotic cell.
31. Packaging construct according to any of claims 28 to 30, wherein the expression of the gag or pol gene or the region thereof is independent of retroviral regulatory proteins.
32. Packaging construct according to any of claims 28 to 31, wherein the retroviral protein is a lentiviral gag or pol protein or a fragment thereof.
33. Packaging construct according to claim 32 wherein the retroviral protein is an HIV
gag or pol protein or a fragment thereof.
34. Packaging construct according to any of claims 28 to 33, wherein the detectable activity of the enzymatic function includes at least promotion or stimulation of the integration of DNA fragments into the host cell DNA, preferably the chromosome if the host cell.
35. Packaging construct according to any of the claims 28 to 34, wherein the retroviral protein is a protease, reverse transcriptase, integrase or a polyprotein gag-pol precursor thereof.
36. Packaging construct according to any of the claims 28 to 35, wherein the eukaryotic cell for expression of genes is a mammalian cell.
37. Packaging construct according to any of the claims 28 to 36, wherein the expressed protein has an expression level of at least 200 % compared to the expressed wild type gene in a eukaryotic cell.
38. Packaging construct according to any of the claims 28 to 37 containing a synthetic gene comprising the sequence of Fig 2A or homologs thereof which have a GC
content between 53 and 63 % preferably between 55 and 61 percent.
39. A method of transfecting a eukaryotic cell using the expression vector in accordance with any of claims 59 to 61.
40. A eukaryotic cell line harboring the synthetic gene or region of a gene in accordance with any of the claims 50 to 58.
41.The eukaryotic cell line according to claim 40, wherein the retroviral enzymatically active protein is expressed using a constitutive, inducable or tissue specific promoter.
42. The eukaryotic cell line according to claim 40 or 41, wherein the expression is stable.
43. A transgenic animal harboring the synthetic gene or part of a gene in accordance with any of the claims 50 to 58.
44. The transgenic animal according to claim 43, wherein the expression of the synthetic gene or part of a gene is induced by an inducable promoter or by a tissue-specific promoter.
45. The transgenic animal according to claim 43 or 44, comprising a mammal.
46. A method for preparing a synthetic gene or part of a gene encoding a retroviral protein or part of such a protein which is enzymatically active in a target eukaryotic cell, comprising the steps of:
1) identifying a group of genes from the total set of genes of the target eukaryotic cell which encode proteins which are naturally expressed easily and/or in high concentrations in the target cell;
2) determining the codon sequences of these identified genes and from these sequences a preferred codon usage and a preferred nucleotide pair frequency;
3) using the preferred codon usage, identify the non-preferred codons in the natural gene encoding the enzymatically active protein;
4) replacing one or more of the non-preferred codons with one or more preferred codons encoding the same amino acids as the replaced codons while biasing the replacement to obtain the preferred nucleotide pair frequency, the preferred nucleotide pair frequency being a GC nucleotide pair content of between 53 and 63%, more preferably between 55 and 61%.
47.The method according to claim 46, wherein the replacement step is carried out based on a random choice between alternative codons encoding the same amino acid at each position using a random number generator and biasing the choice of alternative codons based on the preferred codon usage to obtain the preferred nucleotide frequency.
48. A method for gene transfer in a eukaryotic cell expressing the synthetic gene or region of the gene in accordance with any of the claims 50 to 58.
49. A method according to claim 48, wherein the synthetic gene is transiently expressed or is stably integrated in said cell.
50. A synthetic retroviral gag or pol gene or a region of a retroviral gag or pol gene for the expression of a retroviral gag or pol protein in a eukaryotic cell, the retroviral gene having non-preferred codons when referred to the eukaryotic cell, the number of non-preferred codons being such that replacement of all the non-preferred codons by preferred codons for the eukaryotic cell results in a GC nucleotide pair content of 65% or higher, the synthetic gene having a GC nucleotide pair content of between 53 and 63%, more preferably between 55 and 61% and the expressed retroviral protein is expressed at a level to provide detectable enzymatic activity of the expressed retorviral protein in the eukaryotic cell.
51. The synthetic gene according to claim 50, wherein the expression of the gag or pol gene proteins is independent of retroviral regulatory proteins.
52. The synthetic gene according to claim 50 or 51, wherein the retroviral protein is a lentiviral gag or pol protein.
53. The synthetic gene according to claim 52, wherein the lentiviral protein is an HIV gag or pol protein.
54. The synthetic gene according to any of claims 50 to 53, wherein the detectable activity of the enzymatic function includes at least promotion or stimulation of integration of DNA fragments into the host cell DNA, preferably the chromosome of the host cell.
55. The synthetic gene according to claim 54, wherein the retroviral protein is a protease, a reverse transcriptase, an integrase protein or a polyportein gag-pol precursor thereof.
56. The synthetic gene according to any of the claims 50 to 55, wherein the eukaryotic cell is a mammalian cell.
57. The synthetic gene according to any of the claims 50 to 56, wherein the expression of the protein is at a level at least 200% of that expressed by the wild type gene in the eukaryotic cell.
58. The synthetic gene according to any of the claims 50 to 57 comprising the sequence of Fig. 2A or homologs thereof which have a GC content between 53 and 63%, preferably between 55 and 61%.
59. A eukaryotic expression vector comprising the synthetic gene or region of a gene in accordance with any of the claims 50 to 58.
60. The expression vector according to claim 59, further comprising a constitutive or an inducible or a tissue-specific promoter.
61. The expression vector according to claim 59 or 60, comprising a plasmid, a mammalian or an insect virus.
CA002369058A 1999-04-26 2000-04-26 Synthetic gene for expressing active retroviral protein in eukaryotes Abandoned CA2369058A1 (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
EP99201306.0 1999-04-26
EP99201306 1999-04-26
EP00200171.7 2000-01-18
EP00200171 2000-01-18
PCT/EP2000/003765 WO2000065076A2 (en) 1999-04-26 2000-04-26 Synthetic gene for expressing active retroviral protein in eukaryotes

Publications (1)

Publication Number Publication Date
CA2369058A1 true CA2369058A1 (en) 2000-11-02

Family

ID=26071747

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002369058A Abandoned CA2369058A1 (en) 1999-04-26 2000-04-26 Synthetic gene for expressing active retroviral protein in eukaryotes

Country Status (11)

Country Link
EP (1) EP1183381A2 (en)
JP (1) JP2002542791A (en)
AU (1) AU778106B2 (en)
BR (1) BR0010077A (en)
CA (1) CA2369058A1 (en)
HU (1) HUP0201019A2 (en)
IL (1) IL146090A0 (en)
NO (1) NO20015168L (en)
PL (1) PL351988A1 (en)
RU (1) RU2001131728A (en)
WO (1) WO2000065076A2 (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB9803351D0 (en) 1998-02-17 1998-04-15 Oxford Biomedica Ltd Anti-viral vectors
WO2000039304A2 (en) 1998-12-31 2000-07-06 Chiron Corporation Polynucleotides encoding antigenic hiv type c polypeptides, polypeptides and uses thereof
EP1980617A1 (en) 1998-12-31 2008-10-15 Novartis Vaccines and Diagnostics, Inc. Improved expression of HIV polypeptides and production of virus-like particles
US6656706B2 (en) 1999-12-23 2003-12-02 The United States Of America As Represented By The Department Of Health And Human Services Molecular clones with mutated HIV gag/pol, SIV gag and SIV env genes
ES2329648T3 (en) * 1999-12-23 2009-11-30 The Government Of The Usa, As Represented By The Secretary, Department Of Health And Human Services MOLECULAR CLONES WITH MUTED GENES HIV GAG / POL, VIS GAG AND VIS VIS.
AUPQ580300A0 (en) * 2000-02-23 2000-03-16 National Cancer Centre Of Singapore Pte Ltd Genetic analysis
GB0009760D0 (en) * 2000-04-19 2000-06-07 Oxford Biomedica Ltd Method
AU2002320314A1 (en) 2001-07-05 2003-01-21 Chiron, Corporation Polynucleotides encoding antigenic hiv type c polypeptides, polypeptides and uses thereof
US20030170614A1 (en) 2001-08-31 2003-09-11 Megede Jan Zur Polynucleotides encoding antigenic HIV type B polypeptides, polypeptides and uses thereof
US6835568B2 (en) 2001-10-30 2004-12-28 Virxsys Corporation Regulated nucleic acid expression system
CN1784496B (en) 2003-05-06 2010-08-25 独立行政法人产业技术综合研究所 Multi-gene transcription activity assay system
EP1934246B8 (en) 2005-10-07 2012-02-08 Istituto di Ricerche di Biologia Molecolare P. Angeletti S.R.L. Matrix metalloproteinase 11 vaccine
JP6824594B2 (en) 2014-09-11 2021-02-03 Jnc株式会社 How to design synthetic genes

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5468629A (en) * 1993-04-13 1995-11-21 Calhoun; Cornelia Method of promoting in vitro homologous recombination transfection in mammalian cells using the RecA protein
US5434065A (en) * 1993-05-06 1995-07-18 President And Fellows Of Harvard College In vivo selection of microbial virulence genes
US5811270A (en) * 1994-05-20 1998-09-22 Grandgenett; Duane P. In vitro method for concerted integration of donor DNA molecules using retroviral integrase proteins
US6200811B1 (en) * 1996-04-02 2001-03-13 The Regents Of The University Of California Cell transformation vector comprising an HIV-2 packaging site nucleic acid and an HIV-1 GAG protein
US6114148C1 (en) * 1996-09-20 2012-05-01 Gen Hospital Corp High level expression of proteins
JP2001512308A (en) * 1997-02-07 2001-08-21 メルク エンド カンパニー インコーポレーテッド Synthetic HIV GAG gene
EP1980617A1 (en) * 1998-12-31 2008-10-15 Novartis Vaccines and Diagnostics, Inc. Improved expression of HIV polypeptides and production of virus-like particles
WO2000039304A2 (en) * 1998-12-31 2000-07-06 Chiron Corporation Polynucleotides encoding antigenic hiv type c polypeptides, polypeptides and uses thereof

Also Published As

Publication number Publication date
JP2002542791A (en) 2002-12-17
WO2000065076A3 (en) 2001-12-13
RU2001131728A (en) 2004-02-20
EP1183381A2 (en) 2002-03-06
NO20015168L (en) 2001-12-14
AU778106B2 (en) 2004-11-18
PL351988A1 (en) 2003-07-14
IL146090A0 (en) 2002-07-25
HUP0201019A2 (en) 2002-08-28
BR0010077A (en) 2002-01-15
NO20015168D0 (en) 2001-10-23
AU4557900A (en) 2000-11-10
WO2000065076A2 (en) 2000-11-02

Similar Documents

Publication Publication Date Title
US5994136A (en) Method and means for producing high titer, safe, recombinant lentivirus vectors
CA2328404C (en) Novel lentiviral packaging cells
Cherepanov et al. High‐level expression of active HIV‐1 integrase from a synthetic gene in human cells
AU2018336042A1 (en) Retroviral vectors
AU778106B2 (en) Synthetic gene for expressing active retroviral protein in eukaryotes
EP1244797B1 (en) Translentiviral vector system
EP4652288A2 (en) Viral particle producer cells with landing pad-integrated viral vectors
CA2408786C (en) Retroviral vectors comprising an enhanced 3&#39; transcription termination structure
AU773015B2 (en) Lentiviral vectors
WO2001004360A9 (en) Retroviral recombination assays and uses thereof
US20230151388A1 (en) Modified vectors for production of retrovirus
WO2024194651A1 (en) Expression construct
US20120034693A1 (en) Recombinant vector and use in gene therapy

Legal Events

Date Code Title Description
FZDE Discontinued