[go: up one dir, main page]

CN1748034B - Improved expression element - Google Patents

Improved expression element Download PDF

Info

Publication number
CN1748034B
CN1748034B CN200480003321.1A CN200480003321A CN1748034B CN 1748034 B CN1748034 B CN 1748034B CN 200480003321 A CN200480003321 A CN 200480003321A CN 1748034 B CN1748034 B CN 1748034B
Authority
CN
China
Prior art keywords
polynucleotide
gene
carrier
expression
cells
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
CN200480003321.1A
Other languages
Chinese (zh)
Other versions
CN1748034A (en
Inventor
史蒂文·杰兰特·威廉姆斯
罗伯特·拉克兰·克龙比
凯·史蒂文·利平斯基
阿利斯泰尔·辛普森·欧文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
EMD Millipore Corp
Original Assignee
Innovation Public Ltd
Millipore Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from GB0308503A external-priority patent/GB0308503D0/en
Priority claimed from GB0320824A external-priority patent/GB0320824D0/en
Application filed by Innovation Public Ltd, Millipore Corp filed Critical Innovation Public Ltd
Priority claimed from PCT/GB2004/000387 external-priority patent/WO2004067701A2/en
Publication of CN1748034A publication Critical patent/CN1748034A/en
Application granted granted Critical
Publication of CN1748034B publication Critical patent/CN1748034B/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Landscapes

  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)

Abstract

The present invention relates to improved genetic elements that provide high levels of expression of operably linked genes in various tissues. In particular, unmethylated fragments, CpG islands of less than 2kb, are shown to provide enhanced transgene expression and have advantages in vector construction and cloning capacity.

Description

改进的表达元件 Improved expression element

发明领域field of invention

本发明涉及含改进的,较小的遍在染色质开放元件(UCOE)的多核苷酸。当可操作连接到可表达的核酸序列时,所述元件产生高的和可重复的基因表达水平。本发明还涉及含所述多核苷酸序列的载体,含该载体的宿主细胞和所述多核苷酸、载体或宿主细胞在治疗中的应用,或用于在细胞培养中表达蛋白的应用。The present invention relates to polynucleotides comprising improved, smaller ubiquitous chromatin opening elements (UCOEs). When operably linked to an expressible nucleic acid sequence, the elements produce high and reproducible levels of gene expression. The present invention also relates to vectors containing said polynucleotide sequences, host cells containing said vectors and the use of said polynucleotides, vectors or host cells in therapy, or for expressing proteins in cell culture.

发明背景Background of the invention

当前的高等真核生物染色质结构模型假定基因组织在“结构域形式”(Dillon,N.& Grosveld,F.染色质结构域作为潜在的真核的基因功能单位,Curr.Opin.Genet.Dev.4,260-264(1994);Higgs,D.R.LCRs开放染色质结构域?Cell 95,299-302(1998))。染色质结构域想象是以浓缩、“闭合的”的,转录沉默状态,或去浓缩的、“开放的”和转录活性的构型存在。特征为DNasel敏感性增加,DNA超甲基化和组蛋白超乙酰化的开放染色质结构的建立,被认为是起始基因表达必须的。Current models of chromatin structure in higher eukaryotes assume gene organization in a "domain format" (Dillon, N. & Grosveld, F. Chromatin Domains as Potential Eukaryotic Gene Functional Units, Curr. Opin. Genet. Dev .4, 260-264 (1994); Higgs, D.R. LCRs open chromatin domains? Cell 95, 299-302 (1998)). Chromatin domains are imagined to exist in a condensed, 'closed', transcriptionally silent state, or a decondensed, 'open', and transcriptionally active configuration. The establishment of an open chromatin structure, characterized by increased DNasel sensitivity, DNA hypermethylation and histone hyperacetylation, is thought to be necessary for initiating gene expression.

染色质区域开放和关闭的性质反映在随机整合到宿主细胞基因组中转基因的行为中。当整合在小鼠基因组不同的位置时,相同的构造产生不同的组织特异性和发育阶段特异性表达模式(Palmiter,R.D.&Brinster,R.L.Ann.Ref.Genet.20,465-499(1986);Allen,N.D.et al.Nature 333,852-855(1988);Bonnerot,C.,Grimber,G.,Briand,P.&Nicolas,J.F.Proc.Natl.Acad.Sci.USA 87:6331-6335(1990))。The nature of the opening and closing of chromatin regions is reflected in the behavior of transgenes randomly integrated into the host cell genome. The same construct produces different tissue-specific and developmental stage-specific expression patterns when integrated at different locations in the mouse genome (Palmiter, R.D. & Brinster, R.L. Ann. Ref. Genet. 20, 465-499 (1986); Allen , N.D.et al.Nature 333,852-855(1988); Bonnerot, C., Grimber, G., Briand, P. & Nicolas, J.F.Proc.Natl.Acad.Sci.USA 87:6331-6335(1990 )).

在给定的转基因的小鼠组织内还经常地观察到花斑表达模式,通称是位置效应花斑(PEV)(Kioussis,D.&Festenstein,R.Curr.Opin.Genet.Dev.7,614-619(1997))。当外源的基因被整合到体外培养的哺乳动物细胞染色体中时,许多整合事件引起转基因迅速沉默,其余的引起表达水平的较大改变(Pikaart,M.J.,Recillas-Targa,F.& Felsenfield,G.Genes Dev.12,2852-2862(1998);Fussenegger M.,Bailey,J.E.,Hauser,H.&Mueller,P.P Trends Biotech.17,35-42(1999))。这些位置效应使转基因表达效率降低,显示其基础研究和生物技术的应用潜力。A variegated expression pattern is also frequently observed in a given transgenic mouse tissue, commonly referred to as position-effect variegated (PEV) (Kioussis, D. & Festenstein, R.Curr.Opin.Genet.Dev.7, 614- 619 (1997)). When exogenous genes are integrated into chromosomes of mammalian cells cultured in vitro, many integration events cause rapid silencing of transgenes, while others cause large changes in expression levels (Pikaart, M.J., Recillas-Targa, F. & Felsenfield, G . Genes Dev. 12, 2852-2862 (1998); Fussenegger M., Bailey, J. E., Hauser, H. & Mueller, P.P Trends Biotech. 17, 35-42 (1999)). These position effects reduce the efficiency of transgene expression, showing its application potential in basic research and biotechnology.

基因组织的染色质结构域模型表明能建立和维持能转录活性的开放染色质结构的遗传控制元件应该与基因组的活性区域有关。Chromatin domain models of gene organization suggest that genetic control elements that establish and maintain an open chromatin structure capable of transcriptional activity should be associated with active regions of the genome.

基因座控制区域(LCRs)是一类具有远程染色质改造能力的转录调控元件。在转基因小鼠LCRs通过它们在顺式连接的基因上特别是单拷贝转基因上,整合位点不依赖的,转基因拷贝数依赖的、生理水平的表达的能力而以功能方式定义的,Fraser,P.& Grosveld,F.Curr.Opin.Cell Biol.10,361-365(1998);Li,Q.,Harju,S.& amp;Peterson,K.R.Trends Genet.15:403-408(1999)。重要的是,这种表达是组织特异的。LCRs能够阻碍异染色质的散布,防止PEV(Kioussis,D.&Festenstein,R.Curr.Opin.Genet.Dev.7,614-619(1997)),并由一系列位于它们调节的基因的5′或3’的DNase I超敏(HS)位点组成(Li,Q.,Harju,S.&Peterson,K.R.Trends Genet.15:403-408(1999))。Locus control regions (LCRs) are a class of transcriptional regulatory elements capable of long-range chromatin remodeling. LCRs in transgenic mice are functionally defined by their ability to express expression at physiological levels, integration site-independent, transgene copy number-dependent, at cis-linked genes, especially single-copy transgenes, Fraser, P . & Grosveld, F. Curr. Opin. Cell Biol. 10, 361-365 (1998); Li, Q., Harju, S. & Peterson, K. R. Trends Genet. 15: 403-408 (1999). Importantly, this expression is tissue-specific. LCRs can hinder the dissemination of heterochromatin, prevent PEV (Kioussis, D. & Festenstein, R. Curr. Opin. Genet. Dev. 7, 614-619 (1997)), and by a series of genes located 5' to the genes they regulate or 3' DNase I hypersensitive (HS) site composition (Li, Q., Harju, S. & Peterson, K.R. Trends Genet. 15:403-408 (1999)).

LCRs表现为由两个单独的,尽管不一定独立的组分组成。首先,建立“开放染色质结构域”,其次,显著的翻译活化能力产生转基因拷贝数依赖性的表达(Fraser,P.&Grosveld,F.Curr.Opin.Cell Biol.10,361-365(1998)。LCRs发挥其功能的分子机制仍然是争论的问题(Higgs,D.R.Cell 95,299-302(1998);Bμlger,M.& Groudine,M.GenesDev.13,2465-2477(1999);Grosveld F.Curr.Opin.Genet Dev.9152-157(1999);Bender,M.A.,Bμlger,M.,Close,J.&Groudine,M..Mol.Cell 5,387-393(2000)。LCRs appear to consist of two separate, though not necessarily independent, components. First, the establishment of an "open chromatin domain", and second, a pronounced translational activation capacity that results in transgene copy number-dependent expression (Fraser, P. & Grosveld, F. Curr. Opin. Cell Biol. 10, 361-365 (1998) The molecular mechanism by which LCRs exert their functions is still a matter of debate (Higgs, D.R. Cell 95, 299-302 (1998); Bulger, M. & Groudine, M. Genes Dev. 13, 2465-2477 (1999); Grosveld F. Curr. Opin. Genet Dev. 9152-157 (1999); Bender, M.A., Bulger, M., Close, J. & Groudine, M.. Mol. Cell 5, 387-393 (2000).

生产高水平治疗蛋白产品的哺乳动物细胞系的增殖是主要的发展中行业。染色质位置效应使之成为困难的、既费时又费钱的过程。最通常使用的生产这种哺乳动物“细胞工厂”的方法依赖于通过抗药性基因的组合诱导的基因扩增(例如,DHFR、谷氨酰胺合成酶(Kaufman RJ.酶学方法185,537-566(1990)),并维持严格的选择压力。通过利用含来自高表达基因结构域的LCRs的载体,使用来自合适组织的细胞,大大地简化所述方法,产生大比例的显示高水平表达的克隆细胞系(Needham M,Gooding C,Hudson K,Antoniou M,Grosfeld F和Hollis M.Nucleic Acids Res 20,997-1003(1992);Needham M,Egerton M,MillestA,Evans S,Popplewell M,Cerillo G,McPheat J,Monk A,Jack A,Johnstone D和Hollis M.Protein Expr Purif6,124-131(1995)。Proliferation of mammalian cell lines producing high levels of therapeutic protein products is a major developing industry. Chromatin position effects make this a difficult, time-consuming and costly process. The most commonly used method for producing such mammalian "cell factories" relies on gene amplification induced by combinations of drug resistance genes (e.g., DHFR, glutamine synthetase (Kaufman RJ. Methods in Enzymology 185, 537-566 (1990)), and maintain stringent selection pressure. By utilizing vectors containing LCRs from highly expressed gene domains, using cells from suitable tissues, the method is greatly simplified, producing a large proportion of clones showing high levels of expression Cell lines (Needham M, Gooding C, Hudson K, Antoniou M, Grosfeld F and Hollis M. Nucleic Acids Res 20, 997-1003 (1992); Needham M, Egerton M, Millest A, Evans S, Popplewell M, Cerillo G, McPheat J, Monk A, Jack A, Johnstone D, and Hollis M. Protein Expr Purif 6, 124-131 (1995).

然而,LCRs的组织特异性,尽管在一些环境下有用,仍然是很多应用的控制因素,例如需要进行表达的组织中已知没有LCR,或要求在许多或全部组织中表达。However, the tissue specificity of LCRs, while useful in some circumstances, remains a controlling factor for many applications where expression is desired in tissues where no LCRs are known, or where expression is required in many or all tissues.

此处引入作为参考的我们的在审专利申请PCT/GB99/02357(WO00/05393),US 09/358082,GB 0022995.5和US 60/252,048,描述了在它们的天然染色体环境中负责建立开放染色质结构域的元件,所述开放染色质结构域跨越仅由遍在表达的管家基因组成的位点。这些元件不是来自LCR,并包括延伸的无甲基化CpG岛。我们使用术语遍在的染色质开放元件(UCOE)以描述这种元件。Our pending patent applications PCT/GB99/02357 (WO00/05393), US 09/358082, GB 0022995.5 and US 60/252,048, incorporated herein by reference, describe the processes responsible for the establishment of open chromatin in their natural chromosomal environment An element of a domain that spans a locus consisting only of ubiquitously expressed housekeeping genes. These elements are not from the LCR and include extended unmethylated CpG islands. We use the term ubiquitous chromatin opening element (UCOE) to describe this element.

在哺乳动物DNA中,二核苷酸CpG通过将胞嘧啶甲基化为5-甲基胞嘧啶的DNA甲基转移酶而进行鉴定。然而,5-甲基胞嘧啶是不稳定的,并被转化为胸腺嘧啶。结果,CpG二核苷酸的产生频率比随机出现的机率少得多。然而基因组DNA的一些区域具有接近于期待频率的CpG,并且这些序列被称为“CpG岛”。此处使用的“CpG岛”被定义为至少200bp的DNA序列,其具有至少50%的GC含量和观察的/期望的CpG含量比例至少为0.6(即CpG二核苷酸含量为随机期望值的至少60%)(Gardiner-Green M和Frommer M.J Mol Biol 196,261-282(1987);Rice P,Longden I和Bleasby A Trends Genet 16,276-277(2000)。In mammalian DNA, the dinucleotide CpG is identified by a DNA methyltransferase that methylates cytosine to 5-methylcytosine. However, 5-methylcytosine is unstable and is converted to thymine. As a result, CpG dinucleotides are generated much less frequently than by chance. However, some regions of genomic DNA have CpGs at a frequency close to that expected, and these sequences are called "CpG islands". As used herein, a "CpG island" is defined as a DNA sequence of at least 200 bp with a GC content of at least 50% and an observed/expected CpG content ratio of at least 0.6 (i.e., a CpG dinucleotide content of at least 60%) (Gardiner-Green M and Frommer M.J Mol Biol 196, 261-282 (1987); Rice P, Longden I and Bleasby A Trends Genet 16, 276-277 (2000).

无甲基化CpG岛是现有技术公知的(Bird等(1985)Cell 40:91-99,Tazi和Bird(1990)Cell 60:909-920),并且当显著比例的胞嘧啶残基没有甲基化时,可以定义为CpG岛,并且其通常延伸超过两个紧密间距的(0.1-3kb)背驰转录的基因的5′末端。DNA的这些区域被报道在所有组织中在整个发育阶段保持超甲基化(Wise和Pravtcheva(1999)Genomics 60:258-271)。它们经常与遍在表达基因的5’末端有关,而且估计40%的基因显示组织限制性表达谱(Antequera,F.&Bird,A.Proc.Natl.Acad.Sci.USA 90,1195-11999(1993);Cross,S.H.&Bird,A.P.Curr.Opin,Genet.Dev.5,309-314(1995),并已知是活性染色质的定位区域(Tazi,J.&Bird,A.Cell,60,909-920(1990)。Unmethylated CpG islands are well known in the art (Bird et al. (1985) Cell 40:91-99, Tazi and Bird (1990) Cell 60:909-920), and when a significant proportion of cytosine residues are unmethylated When ylated, they can be defined as CpG islands, and they usually extend beyond the 5' ends of two closely spaced (0.1-3 kb) divergently transcribed genes. These regions of DNA have been reported to remain hypermethylated throughout developmental stages in all tissues (Wise and Pravtcheva (1999) Genomics 60:258-271). They are often associated with the 5' ends of ubiquitously expressed genes, and an estimated 40% of genes show tissue-restricted expression profiles (Antequera, F. & Bird, A.Proc.Natl.Acad.Sci.USA 90, 1195-11999 (1993 ); Cross, S.H.&Bird, A.P.Curr.Opin, Genet.Dev.5, 309-314 (1995), and known to be the localization region of active chromatin (Tazi, J.&Bird, A.Cell, 60, 909- 920 (1990).

‘延伸′的无甲基化CpG岛是跨越包含超过一个转录起始位点的区域和/或跨越超过300bp并优选超过500bp的无甲基化CpG岛。延伸的无甲基化CpG岛的边界通过在该区域使用PCR,协同使用具有在识别序列消化(切割)DNA的能力对出现的任何CpG残基的甲基化状态敏感的限制性内切酶而进行功能限定。一个这样的酶是Hpa ll,其识别并消化通常在CpG岛内存在的CCGG位点,但只有在中央的CG残基没有甲基化的情况下进行。因此,用Hpall消化的DNA以及包含Hpall位点的区域进行的PCR,如果DNA是未甲基化的则由于Hpall消化不产生扩增产物。PCR将只在DNA是甲基化的情况下产生扩增产物。因此,在Hpall不消化DNA的无甲基化区域外可以观察到PCR扩增产物,从而定义了″延伸的无甲基化CpG岛″的边界。An 'extended' unmethylated CpG island is an unmethylated CpG island that spans a region comprising more than one transcription start site and/or spans more than 300 bp and preferably more than 500 bp. The boundaries of extended unmethylated CpG islands are determined by using PCR in this region in conjunction with the use of a restriction enzyme that has the ability to digest (cut) DNA at the recognition sequence and is sensitive to the methylation status of any CpG residues present. Limit the function. One such enzyme is Hpall, which recognizes and digests CCGG sites normally found within CpG islands, but only if the central CG residue is not methylated. Therefore, PCR performed with Hpall-digested DNA and the region containing the Hpall site, if the DNA is unmethylated, does not yield amplification products due to Hpall digestion. PCR will only produce amplification products if the DNA is methylated. Thus, PCR amplification products can be observed outside the unmethylated regions of Hpall-undigested DNA, thereby defining the boundaries of "extended unmethylated CpG islands".

我们已经证实(WO 00/05393)横跨包含来自人TATA结合蛋白(TBP)/蛋白体组分-B1(PSMBI)和核不均-核糖核蛋白A2/B1(hnRNPA2)/异染色质蛋白1Hsy(HP1Hsγ)基因座的异源二联背驰转录启动子的无甲基化CpG岛的区域,产生可重复的、生理水平的基因表达,而且能够阻止通常发生在中央异染色质内转基因整合作用的花斑表达模式和沉默。We have demonstrated (WO 00/05393) that spanning DNA from human TATA-binding protein (TBP)/protein body component-B1 (PSMBI) and nuclear heterogeneous-ribonucleoprotein A2/B1 (hnRNPA2)/heterochromatin protein 1Hsy (HP1 Hsγ ) locus without methylated CpG island regions of the heteroduplex diverging transcriptional promoter, resulting in reproducible, physiological gene expression and the ability to prevent transgene integration that normally occurs within central heterochromatin Role of variegated expression patterns and silencing.

此处使用的术语″可重复的表达″指本发明的多核苷酸不管其染色质环境直接在基本上地相同的表达水平表达可表达的基因,优选不管本发明的多核苷酸的细胞类型或组织类型。本领域技术人员将识别获得基本上地相同水平的可操作连接的可表达基因的表达,而不管权利要求的多核苷酸染色质环境,和优选不管细胞类型,假定所述细胞能够主动表达基因。The term "reproducible expression" as used herein means that a polynucleotide of the invention expresses an expressible gene directly at substantially the same expression level regardless of its chromatin environment, preferably regardless of the cell type or organization type. Those skilled in the art will recognize that substantially the same level of expression of an operably linked expressible gene is obtained regardless of the claimed polynucleotide chromatin environment, and preferably regardless of the cell type, assuming that the cell is capable of actively expressing the gene.

我们已经显示(WO 00/05393)与主动转录的启动子有关的无甲基化的CpG岛具有改造染色质的能力,并从而认为是建立和维持管家基因座的开放结构域的根本的决定因素。We have shown (WO 00/05393) that unmethylated CpG islands associated with actively transcribed promoters have the ability to remodel chromatin and are thus considered to be fundamental determinants for the establishment and maintenance of open domains of housekeeping loci .

UCOEs通过改善转基因表达的水平和稳定性使多产基因递送事件的比例增加。这具有重要的研究和生物技术应用,包括制备转基因动物和在培养的细胞中生产重组蛋白产物。我们已经显示(WO 00/05393)UCOEs对CMV-EGFP报道基因构建体和分泌的有药用价值的促红细胞生成素蛋白表达的有利效果。UCOEs的性质还表明在基因治疗中的实用性,其有效性由于低频率的多产基因递送和表达水平和持续时间不充分而经常受到限制(Verma,I.M.& Somia,N.Nature389:239-242(1997)。UCOEs increase the proportion of prolific gene delivery events by improving the level and stability of transgene expression. This has important research and biotechnology applications, including making transgenic animals and producing recombinant protein products in cultured cells. We have shown (WO 00/05393) the beneficial effect of UCOEs on the expression of a CMV-EGFP reporter gene construct and secreted pharmaceutically valuable erythropoietin protein. The nature of UCOEs also suggests utility in gene therapy, whose effectiveness is often limited by low frequency of prolific gene delivery and insufficient expression levels and duration (Verma, I.M. & Somia, N. Nature 389:239-242 (1997).

我们上述的申请PCT/GB99/02357(WO 00/05393)公开了大约4.0kb的功能性UCOE片段,尤其是图21的4102到8286的核苷酸定义的‘5.5RNP′片段(公开于p11,第6和7行)。同一申请公开了‘1.5kbRNP’片段(图22和29,出处描述于p51,第1到5行)。然而,该片段实际上是上述‘5.5RNP’片段的2165bp的BamH1-Tth111I片段,由该申请的4102到6267位核苷酸组成。Our above-mentioned application PCT/GB99/02357 (WO 00/05393) discloses a functional UCOE fragment of about 4.0 kb, especially the '5.5RNP' fragment defined by nucleotides 4102 to 8286 of Figure 21 (disclosed in p11, lines 6 and 7). The same application discloses a '1.5kbRNP' fragment (Figures 22 and 29, source described in p51, lines 1 to 5). However, this fragment is actually a 2165bp BamH1-Tth111I fragment of the above '5.5RNP' fragment, consisting of nucleotides 4102 to 6267 of the application.

在另外的申请(WO 02/24930)中,我们公开了人工构建的包括天然存在的CpG岛片段的UCOEs。公开的片段比当前的申请中要求的片段大,并且在当时认为不可能单独使用小的片段,而不仅仅是作为合成或‘混合的’UCOE构建物的组分。In a further application (WO 02/24930) we disclosed artificially constructed UCOEs comprising fragments of naturally occurring CpG islands. The disclosed fragments were larger than those claimed in the current application, and at the time it was not considered possible to use the small fragments alone, not just as components of synthetic or 'mixed' UCOE constructs.

即使存在这些显著的暗示和范围较宽的应用,还是有进一步优化转基因表达水平的需要。有需要进一步的优选转基因表达的水平,特别在体内基因治疗领域和体外生产重组蛋白领域。Even with these striking implications and broad range of applications, there is still a need to further optimize transgene expression levels. There is a need to further optimize the level of transgene expression, especially in the field of gene therapy in vivo and in the field of recombinant protein production in vitro.

一个特别的需要是降低用于增强基因表达的元件的大小。这样一来,可以生产更小的载体,或能稳定地容纳和表达较大的插入物的载体。A particular need is to reduce the size of elements used to enhance gene expression. In this way, smaller vectors can be produced, or vectors that can stably accommodate and express larger inserts.

发明简述Brief description of the invention

本发明的一个目的是提供小的染色质开放元件以增强可操作连接的转基因的水平和可重复性。It is an object of the present invention to provide small chromatin opening elements to enhance the level and reproducibility of operably linked transgenes.

根据本发明,提供了包括小的功能UCOEs片段的多核苷酸。这种多核苷酸包括不超过大约2kb无甲基化CpG岛,或大于这种岛不超过大约2kb的片段。According to the present invention, polynucleotides comprising small fragments of functional UCOEs are provided. Such polynucleotides include no more than about 2 kb unmethylated CpG islands, or fragments no more than about 2 kb larger than such islands.

尽管包括延长的无甲基化CpG岛的大多核苷酸是现有技术已知的(参见申请人自己以前的申请WO 00/05393),以前没有确定是否有可能使用明显小的片段并仍然维持由大UCOEs/无甲基化CpG岛获得的表达水平和一致性的增强。Although large nucleotides comprising extended unmethylated CpG islands are known in the prior art (see applicant's own previous application WO 00/05393), it was not previously determined whether it was possible to use significantly smaller fragments and still maintain Enhancement of expression levels and consistency obtained from large UCOEs/unmethylated CpG islands.

此处使用的术语″可操作连接″指本发明的多核苷酸元件之间可操作性的关系。″可操作连接″是本领域技术人员公知的术语,描述了顺式作用DNA序列之间的功能关系。准确的结构关系可以是或不是有关联的,并且不同的元件类型不同。至于启动子,其暗示位于与其驱动的开放阅读框5′基本上相邻的(通常在小于100bp之内)的位置。就延伸的无甲基化CpG岛来说,看来对染色质结构的局部作用负责提高基因表达的水平和一致性。举例来说,包括延伸的无甲基化CpG岛的元件被置于与控制可表达基因转录的启动子的紧邻5′端。然而,“可操作连接”包含其被置于其他地方的可能性,只要可以显示明确的功能效果。The term "operably linked" as used herein refers to an operable relationship between polynucleotide elements of the invention. "Operably linked" is a term well known to those skilled in the art and describes a functional relationship between DNA sequences acting in cis. The exact structural relationships may or may not be associative, and differ for different element types. With respect to a promoter, it is implied to be located substantially adjacent (usually within less than 100 bp) to the 5' open reading frame it drives. In the case of extended unmethylated CpG islands, local effects on chromatin structure appear to be responsible for increasing the level and consistency of gene expression. For example, an element comprising an extended unmethylated CpG island is placed immediately 5' to a promoter that controls transcription of an expressible gene. However, "operably linked" embraces the possibility of it being placed elsewhere as long as a definite functional effect can be demonstrated.

本发明提供了分离的多核苷酸,其包括The invention provides isolated polynucleotides comprising

a)延伸的无甲基化CpG岛;a) Extended unmethylated CpG islands;

b)可表达的开放阅读框,可操作连接到所述延伸的无甲基化CpG岛;b) an expressible open reading frame operably linked to said extended unmethylated CpG island;

c)启动子,可操作连接到所述开放阅读框,其中所述的启动子不天然地连接到所述的CpG岛;c) a promoter, operably linked to said open reading frame, wherein said promoter is not naturally linked to said CpG island;

其特征在于所述的CpG岛大小不超过2kb并且其中在两个组织类型中获得所述开放阅读框的可重复表达。It is characterized in that the size of said CpG island does not exceed 2 kb and wherein reproducible expression of said open reading frame is obtained in two tissue types.

或者,多核苷酸包括人hnRNP A2基因不超过2kb的片段,优选不超过1.6kb,包括1546bp的Esp31限制性片断,更优选图2的序列,977到2522的核苷酸(SEQ ID NO:1),或其功能类似物。优选地,所述的片段定向为正向。Alternatively, the polynucleotide includes a fragment of no more than 2kb of the human hnRNP A2 gene, preferably no more than 1.6kb, including a 1546bp Esp31 restriction fragment, more preferably the sequence of Figure 2, 977 to 2522 nucleotides (SEQ ID NO: 1 ), or its functional analogues. Preferably, said fragments are oriented forward.

‘功能同系物’指能够在严格条件下杂交到所述公开的序列的多核苷酸序列,并且其具有类似的实现可操作连接的可表达的开放阅读框在两个或更多组织中可重复表达的性质。严格的杂交/洗涤条件是现有技术已知的。例如,在0.1x SSC,0.1%SDS在60℃洗涤后稳定的核酸杂交物。如果核酸序列已知,计算最佳杂交条件是本领域公知的。例如,杂交条件可以根据待杂交的核酸GC含量来决定。参见Sambrook等(1989),分子克隆;实验室方法。通用的计算在指定相似性的核酸分子之间获得杂交需要的严格条件的公式是:'Functional homologue' refers to a polynucleotide sequence capable of hybridizing to the disclosed sequence under stringent conditions and having a similar expressible open reading frame that is operably linked and repeatable in two or more tissues nature of expression. Stringent hybridization/wash conditions are known in the art. For example, stable nucleic acid hybrids after washing in 0.1x SSC, 0.1% SDS at 60°C. If the nucleic acid sequence is known, calculation of optimal hybridization conditions is well known in the art. For example, hybridization conditions can be determined according to the GC content of the nucleic acid to be hybridized. See Sambrook et al. (1989), Molecular Cloning; Laboratory Methods. A general formula for calculating the stringency conditions required to achieve hybridization between nucleic acid molecules of a given similarity is:

Tm=81.5℃+16.6Log[Na+]+0.41[%G+C]-0.63(%甲酰胺)T m =81.5°C+16.6Log[Na+]+0.41[%G+C]-0.63(%formamide)

优选本发明的多核苷酸促进可操作连接的基因非组织特异性地可重复表达。Preferably, the polynucleotides of the invention promote non-tissue-specific, reproducible expression of operably linked genes.

在另外的优选方案中,多核苷酸包括人hnRNP A2基因不超过大约1kb的片段,优选包括987bp的BspE1-Esp31限制性片断,更优选包括图2的序列,1536到2522的核苷酸(SEQ ID NO:2)。In another preferred scheme, the polynucleotide includes a fragment of no more than about 1 kb of the human hnRNP A2 gene, preferably includes a 987bp BspE1-Esp31 restriction fragment, more preferably includes the sequence of Figure 2, nucleotides from 1536 to 2522 (SEQ ID NO: 2).

优选地,本发明的多核苷酸包括一个或多个天然存在的启动子。Preferably, the polynucleotides of the invention include one or more naturally occurring promoters.

优选地,本发明的多核苷酸包括二联或双向的背驰转录的启动子。在其它实施方案中,本发明的多核苷酸包括β-肌动蛋白CpG岛/启动子区域的片段,优选来自于人。更优选本发明的多核苷酸包括100bp到2kb范围内跨越人β-肌动蛋白CpG岛/启动子区域的DNA片段。Preferably, the polynucleotides of the present invention include a dual or bidirectional promoter of divergent transcription. In other embodiments, the polynucleotides of the invention comprise fragments of the β-actin CpG island/promoter region, preferably of human origin. More preferably, the polynucleotide of the present invention comprises a DNA fragment spanning the CpG island/promoter region of human β-actin in the range of 100 bp to 2 kb.

在另外任选的实施方案中,本发明的多核苷酸包括PDCD2CpG岛/启动子区域的片段,优选来自于人。更优选地,本发明的多核苷酸包括100bp到2kb范围内跨越人PDCD2CpG岛/启动子区域的DNA片段。In further optional embodiments, the polynucleotides of the invention comprise fragments of the PDCD2 CpG island/promoter region, preferably of human origin. More preferably, the polynucleotide of the present invention comprises a DNA fragment spanning the human PDCD2 CpG island/promoter region in the range of 100 bp to 2 kb.

优选地,本发明的多核苷酸包括100bp到1.9kb范围内跨越人β-肌动蛋白CpG岛/启动子区域的DNA片段和100bp到2kb范围内跨越人PDCD2CpG岛/启动子区域的DNA片段。优选地,所述的片段与它们的背驰方向的启动子直接相邻。Preferably, the polynucleotide of the present invention includes a DNA fragment spanning the CpG island/promoter region of human β-actin in the range of 100bp to 1.9kb and a DNA fragment spanning the CpG island/promoter region of human PDCD2 in the range of 100bp to 2kb. Preferably, said fragments are directly adjacent to their dc-oriented promoters.

另一个方面,本发明提供提供了包括上述公开的本发明的多核苷酸的载体。优选地,所述的载体是经调整用于真核基因表达的表达载体。In another aspect, the present invention provides a vector comprising the polynucleotide of the present invention disclosed above. Preferably, the vector is an expression vector adapted for eukaryotic gene expression.

一般地所述的调整包括,例如但不限于,供给介导细胞/组织特异性表达的转录控制序列(启动子序列)。Typically said adjustments include, for example but not limited to, provision of transcriptional control sequences (promoter sequences) that mediate cell/tissue specific expression.

启动子和增强子是本领域公知的术语,包括下列提供举例而不是作为限制的特征。启动子是5′,顺式作用的调控序列,直接连接到转录起始。启动子元件包括所谓的TATA盒和用于选择转录起始位置的RNA聚合酶起始选择(RIS)序列。这些序列还结合用于RNA聚合酶促进转录起始选择的多肽。Promoter and enhancer are art-recognized terms that include the following features, which are provided by way of example and not by way of limitation. A promoter is a 5', cis-acting regulatory sequence linked directly to the initiation of transcription. The promoter element includes the so-called TATA box and an RNA polymerase initiation selection (RIS) sequence for selecting where transcription starts. These sequences also bind polypeptides for RNA polymerase to facilitate transcription initiation selection.

优选地,启动子选自CMV、EF-1α、劳氏肉瘤病毒(RSV)LTR、或HIV2LTR或从其衍生的组合序列。更优选地,启动子是CMV立即/早期启动子。最优选其是小鼠CMV立即/早期启动子。Preferably, the promoter is selected from CMV, EF-la, Rous Sarcoma Virus (RSV) LTR, or HIV2 LTR or a combination sequence derived therefrom. More preferably, the promoter is the CMV immediate/early promoter. Most preferably it is the mouse CMV immediate/early promoter.

在最优选载体的实施方案中,本发明的CpG岛位于与控制可表达的开放阅读框的表达的操作启动子相邻及5′端。In the most preferred vector embodiments, the CpG islands of the invention are located adjacent to and 5' to an operable promoter controlling expression of an expressible open reading frame.

增强子元件是顺式作用核酸序列,经常被发现在基因转录起始位点5′(增强子还可以被发现在基因序列的3′乃至位于内含子序列和因此是位置独立的)。增强子的作用是提高连接到增强子的基因的转录比率。增强子活性对应于反式作用的转录因子(多肽),其已经显示特异性结合到增强子元件上。转录因子的结合/活性对应于许多环境的因素,其包括例如但不限于,中间代谢产物(例如葡萄糖)、环境效应器(例如加热)。(参见真核转录因子,David S Latchman,Academic Press Ltd,SanDiego)。Enhancer elements are cis-acting nucleic acid sequences that are often found 5' to the transcription initiation site of a gene (enhancers can also be found 3' to a gene sequence or even in intronic sequences and are thus position independent). The role of an enhancer is to increase the rate of transcription of the gene linked to the enhancer. Enhancer activity corresponds to trans-acting transcription factors (polypeptides) that have been shown to bind specifically to enhancer elements. The binding/activity of transcription factors corresponds to a number of environmental factors including, for example, but not limited to, intermediate metabolites (eg, glucose), environmental effectors (eg, heat). (See Eukaryotic Transcription Factors, David S Latchman, Academic Press Ltd, SanDiego).

调整还包括供给可选标记物和自主复制序列,两者都促进所述的载体在真核细胞或者原核宿主中的维持。自主维持的载体称为游离型载体。游离型载体由于它们自我复制并且不需要整合而能持续所以是合乎需要的。这类游离型载体描述于W098/07876。Modifications also include the provision of selectable markers and autonomously replicating sequences, both of which facilitate maintenance of the vector in eukaryotic or prokaryotic hosts. Self-sustaining vectors are called episomal vectors. Episomal vectors are desirable because they are self-replicating and persist without the need for integration. Such episomal vectors are described in WO98/07876.

促进编码基因的载体表达的调整包括供给转录终止/多腺苷酸序列。这还包括供给内部核糖体进入位点(IRES),其作用于使在双顺式或多顺式表达盒中排列的载体编码的基因表达最大化。Modifications of the vector to facilitate expression of the encoding gene include the provision of transcription termination/polyadenylation sequences. This also includes the provision of internal ribosome entry sites (IRES), which act to maximize expression of vector-encoded genes arranged in bi-cis or multi-cis expression cassettes.

这些调整是现有技术已知的。通常有大量关于表达载体构建和重组DNA方法的发表的文献。请参见,Sambrook等(1989)分子克隆:实验指南,Cold Spring Harbour,NY和其中的参考文献;Marston,F(1987)DNA克隆技术:实用方法Vol III IRL Press,Oxford UK;DNA Cloning:FM Ausubel等,分子生物学最新手册,John Wiley & Sons,Inc,(1994)。These adjustments are known in the art. There is generally a large body of published literature on expression vector construction and recombinant DNA methods. See, Sambrook et al (1989) Molecular Cloning: A Laboratory Guide, Cold Spring Harbor, NY and references therein; Marston, F (1987) DNA Cloning: A Practical Approach Vol III IRL Press, Oxford UK; DNA Cloning: FM Ausubel et al., Current Handbook of Molecular Biology, John Wiley & Sons, Inc, (1994).

所述载体可以是游离型载体或整合型载体。优选地,该载体是质粒。或者,载体可以是病毒,例如腺病毒,腺病毒相关病毒,疱疹病毒,痘苗病毒,慢病毒或其他的逆转录病毒。The vector may be an episomal vector or an integrative vector. Preferably, the vector is a plasmid. Alternatively, the vector may be a virus, such as an adenovirus, adeno-associated virus, herpes virus, vaccinia virus, lentivirus or other retrovirus.

优选所述载体包括可操作连接的基因,其是治疗性核酸。这种治疗性核酸可以通过替换或补充导致疾病的缺陷基因的功能而起作用,所述疾病例如囊性纤维化、地中海贫血、镰刀贫血症、Fanconi氏贫血症、血友病、严重联合免疫缺陷(SCID)、苯丙酮酸尿症(PKU)、α-1抗胰蛋白酶缺乏、Duchenne肌肉萎缩、鸟氨酸转氨甲酰酶不足或成骨不全。或者,它可以编码选择性地在靶细胞例如恶性癌细胞中表达细胞毒性试剂或前体药物转化酶,以杀死所述细胞。这种应用,和许多其它应用,是本领域技术人员公知的,并且所述本领域技术人员清楚本发明增强治疗性核酸表达的关联性。Preferably the vector comprises an operably linked gene, which is a therapeutic nucleic acid. Such therapeutic nucleic acids may work by replacing or supplementing the function of defective genes that cause diseases such as cystic fibrosis, thalassemia, sickle cell anemia, Fanconi's anemia, hemophilia, severe combined immunodeficiency (SCID), phenylketonuria (PKU), alpha-1 antitrypsin deficiency, Duchenne muscle atrophy, ornithine transcarbamylase deficiency, or osteogenesis imperfecta. Alternatively, it may encode the selective expression of a cytotoxic agent or prodrug converting enzyme in target cells, such as malignant cancer cells, to kill said cells. This use, and many others, are well known to those skilled in the art, and the relevance of the invention to enhance expression of therapeutic nucleic acids is clear to said skilled artisan.

最优选所述载体包括SEQ ID Nos 1或2,CMV启动子,多克隆位点,多腺苷酸序列和编码适合的控制元件控制的选择性标记物的基因中的任意一种。Most preferably, said vector comprises any one of SEQ ID Nos 1 or 2, CMV promoter, multiple cloning site, polyadenylation sequence and a selectable marker encoding a suitable control element.

还提供转染有载体的宿主细胞。优选是真核细胞,更优选哺乳动物细胞,最优选人或啮齿动物细胞。Host cells transfected with the vector are also provided. Preferably they are eukaryotic cells, more preferably mammalian cells, most preferably human or rodent cells.

或者,可能是植物细胞。Or, maybe plant cells.

本发明的另一个方面是任何分离的多核苷酸、无甲基化延伸的CpG岛(特别地SEQ ID Nos 1或2公开的)、上述治疗公开的载体或宿主细胞在治疗尤其是在基因治疗中的应用。A further aspect of the invention is any isolated polynucleotide, CpG island without methylation extension (in particular disclosed in SEQ ID Nos 1 or 2), vector or host cell disclosed in the above-mentioned treatment in particular in gene therapy in the application.

本发明还提供了多核苷酸、载体或宿主细胞在制备用于基因治疗的药物或组合物中的应用。The present invention also provides the application of polynucleotide, vector or host cell in the preparation of medicine or composition for gene therapy.

本发明还提供了治疗方法,包括对需要这种治疗的患者施用有效剂量的所述本发明的多核苷酸、载体或宿主细胞。优选患者患可以通过基因治疗的疾病。The present invention also provides methods of treatment comprising administering an effective dose of said polynucleotides, vectors or host cells of the present invention to a patient in need of such treatment. Preferably the patient suffers from a disease amenable to gene therapy.

本发明还提供了药物组合物,其包括多核苷酸和/或载体和/或宿主细胞,任选地与药用载体或稀释剂混合用于治疗疾病或提供具有有益的蛋白或功能的特定组织的细胞。The present invention also provides pharmaceutical compositions, which include polynucleotides and/or vectors and/or host cells, optionally mixed with pharmaceutically acceptable carriers or diluents for treating diseases or providing specific tissues with beneficial proteins or functions Cell.

本发明的多核苷酸、载体或宿主细胞或药物组合物可以以如下途径给药,包括全身肌内、静脉内、气雾剂、口的(固体或液体)、局部、眼部、直肠、腹膜内和/或鞘内和局部的直接注射。The polynucleotides, vectors or host cells or pharmaceutical compositions of the invention may be administered by routes including systemic intramuscular, intravenous, aerosol, oral (solid or liquid), topical, ocular, rectal, peritoneal Intra and/or intrathecal and local direct injection.

准确的剂量方式将,当然,需要由具体患者的具体临床医生决定,而这,反过来根据所需的基因表达的蛋白和作为治疗靶的组织类型的确切性质而控制。The exact dosage regimen will, of course, need to be determined by the individual clinician for the individual patient, which, in turn, will be governed by the desired gene expressed protein and the exact nature of the tissue type being targeted for treatment.

剂量还取决于疾病指症和给药途径。剂量的数目取决于疾病,和来自临床试验的功效数据。Dosage also depends on disease indications and route of administration. The number of doses depends on the disease, and efficacy data from clinical trials.

根据本发明递送用于有效的基因治疗的多核苷酸或载体DNA的量优选在50ng-1000μg载体DNA/kg体重的范围之内;更优选在大约1-100μg载体DNA/kg的范围内。The amount of polynucleotide or vector DNA delivered according to the present invention for effective gene therapy is preferably in the range of 50 ng-1000 μg vector DNA/kg body weight; more preferably in the range of about 1-100 μg vector DNA/kg.

尽管根据本发明优选施用多核苷酸、载体或宿主细胞到哺乳动物用于体内细胞吸收,可以利用离体方法,借此将细胞从动物中取出,用多核苷酸或载体转导,然后重植入到所述动物。肝脏,例如可以采用离体方法通过从动物取出肝细胞,体外转导肝细胞和重植入转导的肝细胞到动物中(如Chowdhury等,Science 254:1802-1805,1991描述用于兔子,或Wilson,Hμm.Gene Ther.3:179-222,1992描述用于人)进行评价。这种方法还可以有效在循环系统或淋巴系统中递送不同的细胞群体,例如红血球、T细胞、B细胞和造血干细胞。Although administration of polynucleotides, vectors or host cells to mammals for in vivo cellular uptake is preferred according to the invention, ex vivo methods can be utilized whereby cells are removed from the animal, transduced with the polynucleotide or vector, and then replanted into the animal. The liver, for example, can be used ex vivo by removing the hepatocytes from the animal, transducing the hepatocytes in vitro and reimplanting the transduced hepatocytes into the animal (as described in Chowdhury et al., Science 254:1802-1805, 1991 for rabbits, or Wilson, Hμm. Gene Ther. 3: 179-222, 1992 described for humans) for evaluation. This approach can also efficiently deliver different cell populations in the circulatory or lymphatic system, such as red blood cells, T cells, B cells, and hematopoietic stem cells.

在另外的方面,本发明提供了本发明的多核苷酸载体或宿主细胞在细胞培养系统中的应用以获得所需的基因产物。优选地,可表达的核酸编码用于在体外细胞培养系统中表达的重组蛋白。In a further aspect, the present invention provides the use of the polynucleotide vector or host cell of the present invention in a cell culture system to obtain a desired gene product. Preferably, the expressible nucleic acid encodes a recombinant protein for expression in an in vitro cell culture system.

适合的细胞培养系统是现有技术已知的,并在本领域技术人员知道的文献中有完整的描述。本发明提供了生产本发明的多肽的方法,包括:Suitable cell culture systems are known in the art and are fully described in the literature known to those skilled in the art. The invention provides a method for producing a polypeptide of the invention, comprising:

i)提供转化/转染有本发明的多核苷酸的细胞;i) providing a cell transformed/transfected with a polynucleotide of the present invention;

ii)在有助于生产所述多肽的情况下培养所述细胞;以及ii) culturing said cell under conditions conducive to production of said polypeptide; and

iii)从所述的细胞或其生长环境纯化所述多肽。iii) purifying said polypeptide from said cell or its growth environment.

或者,可表达的基因编码非-多肽产物,例如RNA。这种RNA可以是能够在翻译后水平抑制特定基因表达的反义RNA,或可以是酶促(核糖酶)或其他的功能,例如核蛋白体RNA的功能。Alternatively, the expressible gene encodes a non-polypeptide product, such as RNA. Such RNA may be antisense RNA capable of inhibiting expression of a specific gene at the post-translational level, or may be enzymatic (ribozyme) or otherwise functional, such as that of ribosomal RNA.

还提供了根据SEQ ID NOs:1或2之一的延伸的无甲基化CpG岛多核苷酸用于提高内源基因表达的应用,包括将所述多核苷酸插入到细胞的基因组中与内源基因可操作相连的位置从而提高基因的表达水平。Also provided according to SEQ ID NOs: 1 or 2 extended non-methylated CpG island polynucleotides for improving the application of endogenous gene expression, comprising the polynucleotide inserted into the genome of the cell and endogenous The position of the source gene can be operably linked to increase the expression level of the gene.

在本发明的另一个实施方案中,提供了包含任何本发明的多核苷酸或载体或任何本发明的延伸的无甲基化CpG岛的非人转基因的动物,其中所述的CpG岛已经被人工地引入。制备转基因小鼠(Gordon等,Proc.Natl.Acad.Sci.USA 77:7380(1980);Harbers等,Nature293:540(1981);Wagner等.,Proc.Natl.Acad.Sci.USA 78:5016(1981);和Wagner等,Proc.Natl.Acad.Sci.USA 78:6376(1981),绵羊,猪,鸡(参见Hammer等.,Nature 315:680(1985))等的方法是现有技术已知的并且构思用于本发明。In another embodiment of the invention there is provided a non-human transgenic animal comprising any polynucleotide or vector of the invention or any extended unmethylated CpG island of the invention, wherein said CpG island has been introduced manually. Production of transgenic mice (Gordon et al., Proc.Natl.Acad.Sci.USA 77:7380 (1980); Harbers et al., Nature 293:540 (1981); Wagner et al., Proc.Natl.Acad.Sci.USA 78:5016 (1981); and Wagner et al., Proc.Natl.Acad.Sci.USA 78:6376 (1981), sheep, pig, chicken (seeing Hammer et al., Nature 315:680 (1985)) etc. are prior art known and contemplated for use in the present invention.

这种包含本发明的多核苷酸转基因动物还可以被用来长期生产所需的蛋白。Such transgenic animals containing the polynucleotide of the present invention can also be used for long-term production of desired proteins.

本发明还提供了使用本发明的多核苷酸、载体或宿主细胞确定基因治疗的功效的哺乳动物模型。哺乳动物模型包括其细胞含有本发明的载体的转基因动物。这种动物可供在进行人体临床试验前进行测试。The invention also provides mammalian models for determining the efficacy of gene therapy using the polynucleotides, vectors or host cells of the invention. Mammalian models include transgenic animals whose cells contain a vector of the invention. The animals are available for testing before human clinical trials.

本发明还提供了本发明的多核苷酸和延伸的无甲基化CpG岛在生产转基因植物中的应用。The present invention also provides the use of polynucleotides and extended unmethylated CpG islands of the present invention in the production of transgenic plants.

产量增加、或对疾病、害虫、干旱或盐的抗性提高的转基因植物的生产是本领域技术人员已知的。本发明还提供了包括含有本发明的多核苷酸的细胞的转基因植物。一些或所有包括本发明的多核苷酸细胞可以来自于植物。The production of transgenic plants with increased yield, or increased resistance to disease, pests, drought or salt is known to those skilled in the art. The invention also provides a transgenic plant comprising a cell comprising a polynucleotide of the invention. Some or all of the cells comprising a polynucleotide of the invention may be derived from plants.

本发明还涉及本发明的多核苷酸、载体或延伸的无甲基化CpG岛在功能基因组学中的应用。功能基因组学主要地涉及特异性地在特定的细胞类型或疾病状态中表达的基因的鉴定,并且目前提供了具有潜在的药物发现或基因治疗目的价值的数以千计的新基因序列。使用这种信息开发新疗法的主要问题在于如何确定这些基因的功能。本发明的多肽可被用于多种功能基因组应用中以确定基因序列的功能。本发明的功能基因组应用包括,但不限制于:The present invention also relates to the use of polynucleotides, vectors or extended unmethylated CpG islands of the present invention in functional genomics. Functional genomics is primarily concerned with the identification of genes specifically expressed in specific cell types or disease states, and currently provides thousands of new gene sequences that are of potential value for drug discovery or gene therapy purposes. A major problem in using this information to develop new treatments is how to determine the function of these genes. The polypeptides of the invention can be used in a variety of functional genomics applications to determine the function of a gene sequence. Functional genomics applications of the present invention include, but are not limited to:

(1)使用本发明的多核苷酸获得基因序列的反义序列或核酶knockdown文库的持续表达,从而确定基因失活对细胞表型的效果。(1) Using the polynucleotide of the present invention to obtain the continuous expression of the antisense sequence of the gene sequence or ribozyme knockdown library, so as to determine the effect of gene inactivation on the cell phenotype.

(2)使用本发明的多核苷酸制备基因序列的表达文库,使递送到细胞内产生所述基因序列的可靠的、可重复的、持续表达。得到的表达所述基因序列的细胞可被用于多种功能鉴定和药物发现的方法。例如,制备基因产物的中和抗体;快速纯化基因本身的蛋白产物用于结构、功能或药物筛选的研究;或用于基于细胞的药物筛选。(2) Using the polynucleotide of the present invention to prepare an expression library of gene sequences, so that the gene sequences can be delivered into cells to produce reliable, reproducible and sustained expression of the gene sequences. The resulting cells expressing the gene sequence can be used in various functional identification and drug discovery methods. For example, to prepare neutralizing antibodies to gene products; to rapidly purify the protein product of the gene itself for structural, functional or drug screening studies; or for cell-based drug screening.

(3)使用本发明的多核苷酸用于小鼠胚胎干细胞(ES)和转基因小鼠的方法中。最强大的功能基因组方法之一包括向小鼠ES细胞基因内随机插入构建体,所述构建体只在插入到表达的基因内以后才允许药物选择,并且其可以容易地克隆出来用于测序(G.Hicks等,Nature Genetics,16,338-334)。具有基因敲除突变和新序列的转基因小鼠能因此容易地用于研究其功能。目前该技术对小鼠ES细胞中良好表达的10%小鼠基因是有用的。掺入本发明的多核苷酸到整合的构建体中将使这些方法扩展到鉴定小鼠表达的全部基因。(3) In a method of using the polynucleotide of the present invention for mouse embryonic stem (ES) cells and transgenic mice. One of the most powerful functional genomic approaches involves the random insertion into the genes of mouse ES cells of constructs that allow drug selection only after insertion into the expressed gene and which can be easily cloned for sequencing (G. Hicks et al., Nature Genetics, 16, 338-334). Transgenic mice with knockout mutations and new sequences can thus be readily used to study their function. Currently the technique is useful for the 10% of mouse genes that are well expressed in mouse ES cells. Incorporation of polynucleotides of the invention into integrated constructs will extend these methods to identify all genes expressed in mice.

发明详述Detailed description of the invention

本发明目前只参考下面的附图举例进行描述,其中;The present invention is only described by way of example with reference to the following drawings, wherein;

图1显示表明BamH1、HindIII、Esp31、Tth1111和BspEI限制性位点定义不同的4、2、1.5和1kb CpG岛片段的HP-1/hnRNPA2基因座的图谱。Figure 1 shows a map of the HP-1/hnRNPA2 locus showing 4, 2, 1.5 and 1 kb CpG island fragments that define differently for BamH1, HindIII, Esp31, Tth1111 and BspEI restriction sites.

图2显示表明BamH1、Esp31、BspE1和Tth111I限制性位点的HP-1/hnRNPA2基因座的核苷酸序列。Figure 2 shows the nucleotide sequence of the HP-1/hnRNPA2 locus indicating BamH1, Esp31, BspE1 and Tth111I restriction sites.

图3显示可操作连接到4kb和1.5kb(在两个方向)CpG岛片段的EGFP报告基因的表达。FACScan数据表示为经过71天的荧光中值。Figure 3 shows the expression of an EGFP reporter gene operably linked to 4 kb and 1.5 kb (in both orientations) CpG island fragments. FACScan data are expressed as median fluorescence over 71 days.

图4显示可操作连接到4kb和1.5kb(在两个方向)CpG岛片段的EGFP报告基因的表达。FACScan数据表示为经过71天的阳性细胞%。Figure 4 shows the expression of an EGFP reporter gene operably linked to 4 kb and 1.5 kb (in both orientations) CpG island fragments. FACScan data are expressed as % positive cells over 71 days.

图5显示可操作连接到4kb、1.5kb和1kb CpG岛片段的EGFP报告基因的表达。FACScan数据表示为经过68天的荧光中值。Figure 5 shows the expression of an EGFP reporter gene operably linked to 4 kb, 1.5 kb and 1 kb CpG island fragments. FACScan data are expressed as median fluorescence over 68 days.

图6显示可操作连接到4kb、1.5kb和1kb CpG岛片段的EGFP报告基因的表达。FACScan数据表示为经过68天的阳性细胞%。Figure 6 shows the expression of an EGFP reporter gene operably linked to 4 kb, 1.5 kb and 1 kb CpG island fragments. FACScan data are expressed as % positive cells over 68 days.

图7显示两个腺病毒构建体Ad.CMV-Luc-SV40p(A)和Ad.1.5kb(F)UCOE-CMV-Luc-SV40p(A)的结构,其基于腺病毒血清型5。荧光素酶表达盒被以从左到右的方向插入E1区中。E1和E3区被从病毒上删除。由于E1的删除,病毒是复制缺陷性的。CMV(巨细胞病毒)是人CMV增强子/启动子。SV40p(A)是来自pGL3basic(Promega)的SV40病毒晚期多腺苷酸信号。Figure 7 shows the structure of two adenovirus constructs, Ad.CMV-Luc-SV40p (A) and Ad.1.5kb(F)UCOE-CMV-Luc-SV40p (A), which are based on adenovirus serotype 5. The luciferase expression cassette was inserted in the El region in a left-to-right orientation. The E1 and E3 regions are deleted from the virus. Due to the deletion of El, the virus is replication deficient. CMV (cytomegalovirus) is the human CMV enhancer/promoter. SV40p (A) is the late polyadenylation signal of SV40 virus from pGL3basic (Promega).

图8显示了1.5kb UCOE当通过腺病毒载体递送后在HeLa细胞中提高基因表达。HeLa细胞以MOI50在200μl感染介质中用Ad.CMV-Luc-SV40p(A)和Ad.1.5kb(F)UCOE-CMV-Luc-SV40p(A)病毒感染2-3小时(正常HeLa介质只包含1%FCS)。在孵育后添加2ml完全培养基,并将细胞接种在6孔板中。在感染2天后分析荧光素酶活性。显示三个独立实验的结果。实验进行三次重复。显示为平均值+/-标准差。Figure 8 shows that 1.5kb UCOE increases gene expression in HeLa cells when delivered by an adenoviral vector. HeLa cells were infected with Ad.CMV-Luc-SV40p (A) and Ad.1.5kb (F) UCOE-CMV-Luc-SV40p (A) virus in 200μl infection medium at MOI50 for 2-3 hours (normal HeLa medium only contained 1% FCS). 2 ml of complete medium was added after incubation, and cells were seeded in 6-well plates. Luciferase activity was analyzed 2 days after infection. Results of three independent experiments are shown. Experiments were performed in triplicate. Displayed as mean +/- standard deviation.

图9显示了1.5kb UCOE效果与病毒的制备无关。HeLa细胞以MOI50在200μl的感染介质(正常Hela介质只包含1%FCS)中用来自两个独立的病毒制剂的病毒Ad.CMV-Luc-SV40p(A)和Ad.1.5kb(F)UCOE-CMV-Luc-SV40p(A)分别感染2-3小时。在孵育之后添加2ml完全培养基,然后将细胞接种在6孔板中。在感染2天后分析荧光素酶活性。显示一个典型的实验的结果。实验进行三次重复。Figure 9 shows that the 1.5kb UCOE effect is independent of virus preparation. HeLa cells were treated with viruses Ad.CMV-Luc-SV40p (A) and Ad.1.5kb (F) UCOE- CMV-Luc-SV40p (A) were infected for 2-3 hours respectively. After incubation 2ml of complete medium was added and cells were seeded in 6-well plates. Luciferase activity was analyzed 2 days after infection. Shows the results of a typical experiment. Experiments were performed in triplicate.

图10显示1.5kb UCOE提高逆转录病毒转导的CHO-K1集合体中转基因表达水平和稳定性。(A)FACS分选的CHO-K1集合体(最初FACS分选的36,000细胞)在逆转录病毒转导CMV和1.5UCOE-CMV构建体后的平均GFP值与时间的关系。第一天指测量的第一天(FACS分选后8天)。(B)FACS分选的CHO-K1集合体各测量时间点的柱状图。Figure 10 shows that 1.5kb UCOE increases transgene expression levels and stability in retrovirally transduced CHO-K1 aggregates. (A) Mean GFP values versus time for FACS-sorted CHO-K1 pools (36,000 cells initially FACS-sorted) following retroviral transduction of CMV and 1.5 UCOE-CMV constructs. Day 1 refers to the first day of measurement (8 days after FACS sorting). (B) Histogram of FACS-sorted CHO-K1 aggregates at each measurement time point.

图11显示在低MOI感染之后转导的CHO-K1(集合体I)的GFP表达。(A)GFP阳性(M1)CHO-K1群在逆转录病毒转导后各时间点的平均GFP值。CMV群从3.19%GFP阳性细胞开始而1.5UCOE-CMV群从4.99%GFP阳性细胞开始。(B)细胞群在各时间点的柱状图。Figure 11 shows GFP expression of transduced CHO-K1 (pool I) after low MOI infection. (A) Mean GFP values of the GFP-positive (M1) CHO-K1 population at various time points after retroviral transduction. The CMV population started with 3.19% GFP positive cells and the 1.5UCOE-CMV population started with 4.99% GFP positive cells. (B) Histogram of cell populations at various time points.

图12显示在中等MOI感染后转导的CHO-K11(集合体II)的GFP表达。(A)GFP阳性(M1)CHO-K1群在逆转录病毒转导后各时间点的平均GFP值。CMV群从25.2%GFP阳性细胞开始而1.5UCOE-CMV群从14.35%GFP阳性细胞开始。(B)细胞群在各时间点的柱状图。Figure 12 shows GFP expression of transduced CHO-K11 (pool II) after medium MOI infection. (A) Mean GFP values of the GFP-positive (M1) CHO-K1 population at various time points after retroviral transduction. The CMV population started with 25.2% GFP positive cells and the 1.5UCOE-CMV population started with 14.35% GFP positive cells. (B) Histogram of cell populations at various time points.

图13显示1.5kb UCOE提高逆转录病毒转导的HeLa集合体中转基因表达水平和稳定性。(A)FACS分选的HeLa集合体(最初FACS分选的10,000细胞)在逆转录病毒转导CMV和1.5UCOE-CMV构建体后的平均GFP值与时间的关系。第一天指测量的第一天(FACS分选后5天)。(B)FACS分选的HeLa集合体在各测量时间点的柱状图。Figure 13 shows that 1.5 kb UCOE increases transgene expression levels and stability in retrovirally transduced HeLa aggregates. (A) Mean GFP values versus time for FACS-sorted HeLa pools (10,000 cells initially FACS-sorted) after retroviral transduction of CMV and 1.5 UCOE-CMV constructs. Day 1 refers to the first day of measurement (5 days after FACS sorting). (B) Histogram of FACS-sorted HeLa aggregates at each measurement time point.

图14显示1.5kbUCOE提高在逆转录病毒转导的HeLa细胞克隆中和之间GFP表达的一致性。FACS分选的HeLa克隆被扩大为6孔大小并分析GFP表达。(A-C)所有CMV克隆的柱状图。(D-E)所有1.5UCOE-CMV克隆的柱状图。Figure 14 shows that 1.5 kb UCOE increases the consistency of GFP expression in and among retrovirally transduced HeLa cell clones. FACS sorted HeLa clones were expanded to a 6-well size and analyzed for GFP expression. (A-C) Histograms of all CMV clones. (D-E) Histograms of all 1.5 UCOE-CMV clones.

图15显示1.5kb UCOE降低HeLa细胞克隆中GFP表达的变异系数(CV)。显示图5的全部的逆转录病毒转导的和FACS分选的HeLa克隆的平均值的平均CV值(+/-SD)。1.5UCOE-CMV克隆的CV和它的标准差(SD)与CMV克隆相比显著地降低。Figure 15 shows that 1.5 kb UCOE reduces the coefficient of variation (CV) of GFP expression in HeLa cell clones. The mean CV values (+/- SD) of the mean of all retrovirally transduced and FACS sorted HeLa clones of Figure 5 are shown. The CV and its standard deviation (SD) of the 1.5 UCOE-CMV clone were significantly lower compared to the CMV clone.

实施例Example

实施例1A 1.5kb HP-1/hnRNP A2UCOE增强表达Example 1A 1.5kb HP-1/hnRNP A2UCOE enhanced expression

材料和方法Materials and methods

载体的构建Vector construction

如我们的在先申请WO 00/05393中所描述,载体CET20通过将人HP-1/hnRNP A2基因座的8.3kb的HindIII片段(其包含HP1/RNP启动子和延伸的CpG岛)克隆到到pBluescript中(Stratagene)而产生。As described in our prior application WO 00/05393, vector CET20 was obtained by cloning the 8.3 kb HindIII fragment of the human HP-1/hnRNP A2 locus (which contains the HP1/RNP promoter and extended CpG island) into generated in pBluescript (Stratagene).

插入的4186bp(称为4kb)片段然后通过BamH1和HindIII消化除去。这些片段使用T4DNA聚合酶补平末端,并连接到已经用Asel消化的pEGFPN-1中(Clontech)并再次使用T4DNA聚合酶补平末端。然后分离具有两个方向的片段的克隆。The inserted 4186bp (referred to as 4kb) fragment was then removed by BamHI and HindIII digestion. These fragments were blunt-ended using T4 DNA polymerase and ligated into pEGFPN-1 (Clontech) which had been digested with Asel and blunt-ended again using T4 DNA polymerase. Clones with fragments in both orientations were then isolated.

1546bp的Esp31(BsmBI的同裂酶)片段(称为1.5kb片段)被再次通过Esp31(BsmBI)消化从CET20中分离,随后补平末端,然后将这些连接到如上所述的pEGFPN-1的Asel位点,鉴定两个方向片段的克隆(“正”向与RNPA2启动子具有与转录EGFP转基因的相邻下游操作CMV启动子相同的5′到3′方向。)A 1546bp Esp31 (isoschizase of BsmBI) fragment (referred to as a 1.5kb fragment) was again isolated from CET20 by Esp31(BsmBI) digestion followed by blunt-end filling, and these were then ligated into Asel of pEGFPN-1 as described above locus, clones that identify fragments in both orientations ("forward" orientation with the RNPA2 promoter having the same 5' to 3' orientation as the adjacent downstream operating CMV promoter that transcribes the EGFP transgene.)

转染transfection

根据标准方法转染CHO-K1细胞被在G418中选择,如此处引入参考的未决申请的描述。Transfection of CHO-K1 cells according to standard methods was selected in G418, as described in the co-pending application incorporated herein by reference.

GFP表达分析GFP expression analysis

转染细胞培养在600μg/ml G418选择培养基中。细胞用标准磷酸盐缓冲盐水洗涤并根据标准方法用胰蛋白酶/EDTA与基质剥离。添加过量的营养混合物F12(HAM)培养基(Gibco),并将细胞转入5ml圆底聚苯乙烯试管用于Becton Dickinson FACScan分析。检测GFP荧光并与亲本细胞群的自身荧光比较。细胞群体中的表达既可以表示为相对于对照(根据标准方法)在图中线性标度的任意单位组的荧光中值,也可以表示为鉴定为阳性表达子的细胞的%。Transfected cells were cultured in 600 μg/ml G418 selection medium. Cells were washed with standard phosphate buffered saline and stripped from the matrix with trypsin/EDTA according to standard protocols. Excess nutrient mixture F12 (HAM) medium (Gibco) was added and cells were transferred to 5 ml round bottom polystyrene tubes for Becton Dickinson FACScan analysis. GFP fluorescence was detected and compared to the autofluorescence of the parental cell population. Expression in cell populations can be expressed either as median fluorescence of an arbitrary group of units scaled linearly in the graph relative to controls (according to standard methods), or as % of cells identified as positive expressors.

结果result

如图3所示,从荧光中值角度,与对照相比当1.5kb片段正向插入时表达显著增强(大约10倍,持续至少70天),4kb片段时观察到增强大约60%。在图5显示的实验中,由正向1.5kb片段的表达可与用4kb片段获得的表达相当。然而,相反方向表现为从荧光中值角度较少有效,如图3所示。As shown in Figure 3, from the perspective of fluorescence median, the expression was significantly enhanced (about 10 times, lasting at least 70 days) when the 1.5 kb fragment was inserted in the forward direction compared with the control, and an increase of about 60% was observed for the 4 kb fragment. In the experiments shown in Figure 5, the expression from the forward 1.5 kb fragment was comparable to that obtained with the 4 kb fragment. However, the opposite direction appears to be less effective from the perspective of the median fluorescence, as shown in Figure 3.

图4显示两个方向从阳性细胞%角度是相当的。然而,这些数据显示片段也许具有一定程度的方向性而且,对大多数情况而言,正向是优选的。Figure 4 shows that both directions are comparable in terms of % positive cells. However, these data show that fragments may have some degree of directionality and that, for most cases, forward orientation is preferred.

实施例2A 1kb HP-1/hnRNP A2UCOE增强表达Embodiment 2A 1kb HP-1/hnRNP A2UCOE enhanced expression

材料和方法Materials and methods

载体的构建Vector construction

含1kb UCOE的载体通过用PciI和BspEI消化正向具有1.5kbEsp31片段的pEGFPN-1载体除去5′500bp,之以补平和重连接。这产生只在一个方向具有987bp BspE1-Esp31片段的载体。The vector containing 1kb UCOE was digested with PciI and BspEI to remove the 5'500bp of the pEGFPN-1 vector with the 1.5kbEsp31 fragment in the forward direction, which was filled in and reconnected. This produced a vector with the 987bp BspE1-Esp31 fragment in only one orientation.

结果result

所述正向1kb(987bp)片段表现为可与正向1.5kb片段的荧光中值(图5)和阳性细胞%(图6)相当The forward 1 kb (987 bp) fragment showed comparable fluorescence median (Figure 5) and positive cell % (Figure 6) to the forward 1.5 kb fragment

实施例3A 1.5kb HP-1/hnRNP A2UCOE增强腺病毒编码的构建体中的Example 3A 1.5kb HP-1/hnRNP A2UCOE enhances adenovirus-encoded constructs 表达Express

材料和方法Materials and methods

细胞培养cell culture

HeLa获自ATCC(Manassas,弗吉尼亚)。PER.C6获自Crucell,(Leiden,荷兰)。全部购买的细胞系如厂商推荐的方法培养。911细胞由L.S.Young教授(Cancer Research UK Institute for Cancer Studies,伯明翰大学,伯明翰,英国)惠赠,并在含抗生素的DMEM/10%FCS中培养。HeLa was obtained from ATCC (Manassas, Virginia). PER.C6 was obtained from Crucell, (Leiden, The Netherlands). All purchased cell lines were cultured as recommended by the manufacturer. 911 cells were kindly provided by Prof. L.S. Young (Cancer Research UK Institute for Cancer Studies, University of Birmingham, Birmingham, UK) and cultured in DMEM/10% FCS containing antibiotics.

质粒构建Plasmid construction

PGL3basic获自Promega(Madison,WI,美国)并在多克隆位点下游包含荧光素酶-SV40p(A)盒。人CMV增强子/启动子(0.9kb)被克隆到Smal消化的pGL3basic中制备pGL3/CMV-Luc-SV40p(A)。为制备pGL3/1.5kb(F)UCOE-CMV-SV40p(A),将1.5kb UCOE Esp3I片段(参见图1和2)用T4DNA聚合酶进行平末端化(NEB,Beverly,MA,USA)然后连接到NheI消化的和T4平末端化的pGL3basic/CMV-Luc中。PGL3basic was obtained from Promega (Madison, WI, USA) and contained the luciferase-SV40p (A) cassette downstream of the multiple cloning site. The human CMV enhancer/promoter (0.9 kb) was cloned into Smal digested pGL3basic to make pGL3/CMV-Luc-SV40p (A). To make pGL3/1.5kb(F)UCOE-CMV-SV40p(A), the 1.5kb UCOE Esp3I fragment (see Figures 1 and 2) was blunt-ended with T4 DNA polymerase (NEB, Beverly, MA, USA) and ligated into Nhel digested and T4 blunt-ended pGL3basic/CMV-Luc.

病毒载体构建Viral vector construction

为构建pPS1128/CMV-Luc-SV40p(A)和pPS1128/1.5kb(F)UCOE-CMV-Luc-SV40p(A),表达盒通过PvuI/NheI/BamHI的消化从pGL3/CMV-Luc-SV40p(A)中切除并通过KpnI/BamHI的消化从pGL3/1.5kb(F)UCOE-CMV-Luc-SV40p(A)中切除,两个表达盒然后被平末端化和克隆到SpeI消化的和平末端化的pPS1128中(Djeha等,Cancer Gene Therapy 2000:721-731)。质粒通过限制性酶消化进行分析。To construct pPS1128/CMV-Luc-SV40p(A) and pPS1128/1.5kb(F)UCOE-CMV-Luc-SV40p(A), the expression cassette was digested from pGL3/CMV-Luc-SV40p( Excised in A) and digested by KpnI/BamHI from pGL3/1.5kb(F)UCOE-CMV-Luc-SV40p (A), both expression cassettes were then blunt-ended and cloned into SpeI-digested blunt-ended in pPS1128 (Djeha et al., Cancer Gene Therapy 2000:721-731). Plasmids were analyzed by restriction enzyme digestion.

病毒Ad.CMV-Luc-SV40p(A)和Ad.1.5kb(F)UCOE-CMV-Luc-SV40p(A)通过在PER.C6中分别使用pPS1128/CMV-Luc-SV40p(A)和pPS1128/1.5kb(F)UCOE-CMV-Luc-SV40p(A)和重叠腺病毒骨架载体pPS1160同源重组而构建(Djeha等,癌症基因治疗2000:721-731),放大和CsCI纯化和滴度测定如别处的描述(Lipinski等,Gene Therapy,8,2001:274-281)。Viruses Ad.CMV-Luc-SV40p (A) and Ad.1.5kb (F) UCOE-CMV-Luc-SV40p (A) were obtained by using pPS1128/CMV-Luc-SV40p (A) and pPS1128/ 1.5kb (F) UCOE-CMV-Luc-SV40p (A) and overlapping adenovirus backbone vector pPS1160 were constructed by homologous recombination (Djeha et al., Cancer Gene Therapy 2000: 721-731), amplification and CsCI purification and titer determination as Described elsewhere (Lipinski et al., Gene Therapy, 8, 2001:274-281).

病毒感染和荧光素酶活性分析Virus infection and luciferase activity assay

HeLa细胞(每6孔5.0×104)用含有相应的病毒的200μl感染介质悬液(含抗生素的EMEM/1%FCS)感染2-3小时。然后添加1ml完全培养基,细胞接种至6孔板中。通过在200μl裂解缓冲液(10mM磷酸钠pH 7.8,8mM MgCl2、1mM EDTA、1%Triton-X-100、15%甘油)中裂解细胞制备全细胞抽提物并通过离心(1分钟,13,000×g,RT)得到澄清的裂解液。在线性范围使用光度计(Lumat LB 9501,Berthold,Wildbad,德国)在指明的时间点分析各份上清液的荧光素酶活性。HeLa cells (5.0×10 4 per 6 wells) were infected with 200 μl infection medium suspension (EMEM/1% FCS with antibiotics) containing the corresponding virus for 2-3 hours. Then 1 ml of complete medium was added, and the cells were seeded into 6-well plates. Whole cell extracts were prepared by lysing cells in 200 μl lysis buffer (10 mM sodium phosphate pH 7.8, 8 mM MgCl 2 , 1 mM EDTA, 1% Triton-X-100, 15% glycerol) and centrifuged (1 min, 13,000× g, RT) to obtain a clear lysate. The luciferase activity of the individual supernatants was analyzed at the indicated time points using a luminometer (Lumat LB 9501, Berthold, Wildbad, Germany) in the linear range.

结果result

为分析UCOE是否可以在腺病毒载体中提高基因表达,将1.5kbUCOE以正向克隆到驱动荧光素酶报告基因的人CMV增强子/启动子上游(RNP启动子具有与CMV启动子相同的方向)。图7示意地显示两个比较它们的荧光素酶活性的病毒的结构。HeLa细胞以MOI 50用相应的病毒感染,并分析感染2天后的荧光素酶活性。图8清楚显示1.5kbUCOE片段可以极大增加HeLa细胞中报告基因的表达水平。为排除与病毒制剂相关的特异性作用,我们制备两个病毒的新制剂并重复该实验。图9显示了UCOE效果不取决于病毒制剂并且利用新的独立制剂完全可重复。To analyze whether UCOE can enhance gene expression in adenoviral vectors, 1.5kb UCOE was cloned in forward orientation upstream of the human CMV enhancer/promoter driving the luciferase reporter gene (RNP promoter has the same orientation as the CMV promoter) . Figure 7 schematically shows the structures of two viruses comparing their luciferase activities. HeLa cells were infected with the corresponding viruses at MOI 50, and the luciferase activity was analyzed 2 days after infection. Figure 8 clearly shows that the 1.5 kb UCOE fragment can greatly increase the expression level of the reporter gene in HeLa cells. To rule out specific effects related to virus preparations, we prepared new preparations of two viruses and repeated the experiment. Figure 9 shows that the UCOE effect is independent of the virus formulation and is fully reproducible with the new independent formulation.

实施例4 A1.5kb HP-1/hnRNP A2UCOE增强逆转录病毒编码的构建体Embodiment 4 A1.5kb HP-1/hnRNP A2UCOE enhances the construct encoded by retrovirus 中的表达expression in

材料和方法Materials and methods

细胞培养cell culture

HeLa,CHO-K1和293获自ATCC(Manassas,弗吉尼亚)。HeLa和293两个都培养在添加1%NEAA和含抗生素的10%FCS的DMEM中。CHO-K1在含10%FCS和抗生素的营养的混合F12(HAM)营养培养基中培养。HeLa, CHO-K1 and 293 were obtained from ATCC (Manassas, Virginia). Both HeLa and 293 were cultured in DMEM supplemented with 1% NEAA and 10% FCS with antibiotics. CHO-K1 was cultured in nutrient mixed F12 (HAM) nutrient medium containing 10% FCS and antibiotics.

质粒构建Plasmid construction

质粒pVPack-GP和pVPack-VSV-G获自Stratagene(LaJolla,CA,美国)并分别地表达逆转录病毒gag-pol和被膜VSV-G蛋白。逆转录病毒载体pQCXIX获自BD Bioscience,它包含用于转基因表达的人CMV启动子(Palo Alto,CA,美国)。质粒phrGFP-1获自Stratagene并被用作修饰的GFP cDNA的来源。为克隆pQCXIX-CMV-hrGFP,将hrGFP cDNA从phr-GFP-1通过BamHI/EcoRV消化而切除并克隆到BamHI/EcoRV消化的pQCXIX中。为产生pQCXIX-1.5UCOE-CMV-hrGFP,将1.5kbUCOE片段(Esp3I/BsmBI片段)作为平末端化片段克隆到XbaI消化的并平末端化的pQCXIX-CMV-hrGFP中。Plasmids pVPack-GP and pVPack-VSV-G were obtained from Stratagene (LaJolla, CA, USA) and express retroviral gag-pol and envelope VSV-G proteins, respectively. The retroviral vector pQCXIX was obtained from BD Bioscience, which contains the human CMV promoter for transgene expression (Palo Alto, CA, USA). Plasmid phrGFP-1 was obtained from Stratagene and used as the source of modified GFP cDNA. To clone pQCXIX-CMV-hrGFP, the hrGFP cDNA was excised from phr-GFP-1 by BamHI/EcoRV digestion and cloned into BamHI/EcoRV digested pQCXIX. To generate pQCXIX-1.5UCOE-CMV-hrGFP, the 1.5 kb UCOE fragment (Esp3I/BsmBI fragment) was cloned as a blunt-ended fragment into XbaI digested and blunt-ended pQCXIX-CMV-hrGFP.

双嗜性VSV-G-被膜逆转录病毒粒子的制备Preparation of amphotropic VSV-G-enveloped retroviral particles

所有的病毒用能产生高效和宽宿主范围转导的双嗜性VSV-G(水泡性口膜炎病毒糖蛋白)制成假病毒。使用的逆转录病毒载体产生自我钝化的病毒,这是由于在反转录后拷贝到5′LTR的3′LTR U3区删除因此抑制从5′LTR的任何进一步转录。All viruses were pseudovirused with the amphotropic VSV-G (vesicular stomatitis virus glycoprotein), which produces efficient and broad host range transduction. The retroviral vector used produces a self-inactivating virus due to deletion of the U3 region of the 3'LTR copied to the 5'LTR after reverse transcription thus inhibiting any further transcription from the 5'LTR.

4.5×106293细胞(第二天细胞应该80-90%铺满)被接种在瞬时转染前天用I型胶原酶包被的8.4厘米直径(55.6cm2)平皿中。次日114μl Lipofectamine 2000(Invitrogen,Groenningen,荷兰)与286μl OptiMEM(Invitrogen)混合并在室温(RT)下保温5分钟。平行地,将1.5μg的各质粒pVPack-GP、pVPack-VSV-G和逆转录病毒载体与OptiMEM混合以达到终体积400μl。DNA混合物添加到Lipofectamine 2000溶液中并在RT保温20-30分钟。在温育期间293细胞用PBS洗涤一次并向细胞添加6ml的OptiMEM/10%FCS。最后DNA/Lipofectamine混合物被滴加到细胞,细胞在5%CO2培养箱中37℃培养5小时。然后用8ml新鲜的293培养基替代培养基,细胞被继续培养24-36小时。最后,从细胞收获含病毒粒子的上清液,在1000RPM离心5分钟以沉淀细胞和细胞碎片,上清液使用Millex-HV PVDF低蛋白结合的0.45μm滤器过滤除菌(Millipore,Molsheim,法国)。上清液被分成小份,液氮迅速冷冻并储存在-80℃。4.5 x 10 6 293 cells (cells should be 80-90% confluent the next day) were seeded in 8.4 cm diameter (55.6 cm 2 ) dishes coated with type I collagenase the day before transient transfection. The next day 114 μl Lipofectamine 2000 (Invitrogen, Groenningen, The Netherlands) was mixed with 286 μl OptiMEM (Invitrogen) and incubated at room temperature (RT) for 5 minutes. In parallel, 1.5 μg of each of the plasmids pVPack-GP, pVPack-VSV-G and retroviral vectors were mixed with OptiMEM to achieve a final volume of 400 μl. The DNA mixture was added to the Lipofectamine 2000 solution and incubated at RT for 20-30 minutes. During the incubation period 293 cells were washed once with PBS and 6 ml of OptiMEM/10% FCS was added to the cells. Finally, the DNA/Lipofectamine mixture was added dropwise to the cells, and the cells were incubated at 37°C for 5 hours in a 5% CO 2 incubator. The medium was then replaced with 8 ml of fresh 293 medium, and the cells were further cultured for 24-36 hours. Finally, the virion-containing supernatant was harvested from the cells, centrifuged at 1000 RPM for 5 minutes to pellet cells and cell debris, and the supernatant was filter-sterilized using a Millex-HV PVDF low protein binding 0.45 μm filter (Millipore, Molsheim, France) . The supernatant was aliquoted, snap frozen in liquid nitrogen and stored at -80°C.

逆转录病毒转导靶细胞系Retroviral transduction of target cell lines

靶细胞(HeLa,CHO-K1)在感染前天被以1×105细胞/孔接种到6孔板中。为病毒转导,在存在8.0μg/ml的1,5-二甲基1,5-二氮十一亚甲基聚甲溴化物(Sigma,St.Louis,密苏里州,美国)下添加不同量的含病毒的上清液。细胞与病毒培养24小时,然后用新鲜的培养基替换培养基。基因表达可以在转导后两天观察到。为防止每细胞的多拷贝效应,所有分析中用产生远少于100%转导效率(通常给与1-20%GFP阳性细胞)的病毒量感染靶细胞。在这转导效率范围内,被用来感染的上清液的体积与阳性细胞的数目线性相关而平均表达水平显示低得多的增加。Target cells (HeLa, CHO-K1) were seeded into 6-well plates at 1×10 5 cells/well the day before infection. For viral transduction, various amounts of 1,5-dimethyl-1,5-diazaundecylidene polymethyl bromide (Sigma, St. Louis, MO, USA) were added in the presence of 8.0 μg/ml virus-containing supernatant. Cells were incubated with virus for 24 hours, and then the medium was replaced with fresh medium. Gene expression can be observed two days after transduction. To prevent multiple copy effects per cell, target cells were infected in all assays with an amount of virus that yielded much less than 100% transduction efficiency (typically given 1-20% GFP positive cells). Within this range of transduction efficiencies, the volume of supernatant used for infection correlated linearly with the number of positive cells while mean expression levels showed a much lower increase.

FACS分析FACS analysis

为进行FACS分析,靶细胞用PBS洗涤,进行胰蛋白酶作用并再悬浮在完全培养基中,用Becton Dickinson FACScan(BD,Franklin Lakes,NJ,美国)使用用于各分析的恒定的仪器设定值分析hrGFP(绿色荧光蛋白)表达。用Apple Mcintosh的CeIIQuest软件分析数据。For FACS analysis, target cells were washed with PBS, trypsinized and resuspended in complete medium with a Becton Dickinson FACScan (BD, Franklin Lakes, NJ, USA) using constant instrument settings for each analysis Analysis of hrGFP (green fluorescent protein) expression. Data were analyzed with CeIIQuest software for Apple Mcintosh.

FACS分选FACS sorting

为分选表达GFP的HeLa和CHO-K1细胞,细胞被制备用于FACS分析然后用BD Bioscience FACS分选仪分选(由The Institute for CancerStudies惠赠,伯明翰大学,英国)。分选的单个的克隆和集合体随后被扩增,随后经历一段时间表达GFP。To sort GFP-expressing HeLa and CHO-K1 cells, cells were prepared for FACS analysis and then sorted with a BD Bioscience FACS sorter (gift from The Institute for Cancer Studies, University of Birmingham, UK). Sorted individual clones and pools are then expanded and then express GFP over a period of time.

结果result

在第一个研究中,CHO-K1细胞用含逆转录病毒CMV-hrGFP(CMV)和1.5UCOE CMV-hrGFP(1.5UCOE-CMV)细胞的上清液转导。CHO-K1细胞如材料和方法的描述进行转导,感染三天后GFP阳性细胞被FACS分选为群集合体(各细胞构建体36,000细胞)然后扩增。经一段时间进行GFP表达和柱状图作图并分别地图解在图10A和B中。在分析的第17天,1.5UCOE-CMV集合体的平均值比CMV集合体的平均值高2.6倍。另外,如图10B的柱状图所示,1.5UCOE-CMV集合体比CMV集合体产生更密集的GFP表达峰。GFP阳性细胞的平均CV值(变异系数;这是GFP表达均一性的标记)在Marker 1中对CMV集合体为146.0而对1.5UCOE-CMV集合体为83.0。In the first study, CHO-K1 cells were transduced with supernatants containing retrovirus CMV-hrGFP (CMV) and 1.5UCOE CMV-hrGFP (1.5UCOE-CMV) cells. CHO-K1 cells were transduced as described in Materials and Methods, and three days after infection GFP-positive cells were FACS sorted into population pools (36,000 cells per cell construct) and expanded. GFP expression and histogram plots were performed over time and are illustrated in Figures 10A and B, respectively. On day 17 of the analysis, the mean of 1.5 UCOE-CMV aggregates was 2.6-fold higher than the mean of CMV aggregates. In addition, as shown in the histogram of Figure 10B, the 1.5 UCOE-CMV aggregate produced a denser GFP expression peak than the CMV aggregate. The average CV value (coefficient of variation; this is a marker for the uniformity of GFP expression) of GFP-positive cells was 146.0 for the CMV pool in Marker 1 and 83.0 for the 1.5 UCOE-CMV pool.

与此研究相比,转导CHO-K1,经一段时间研究细胞群但不进行FACS分选。选择产生低百分比阳性细胞的MOI以排除每细胞的多拷贝效应。如图11A所示(有大约5%GFP阳性细胞的集合体I),1.5UCOE-CMV群在最后测量时间点的阳性细胞的均值比CMV群的均值高2.8倍。对于集合体II(大约15-25%GFP阳性细胞;图12A),1.5UCOE-CMV群在最迟测量时间点的阳性细胞的平均值比CMV群的均值高2.3倍。In contrast to this study, where CHO-K1 was transduced, the cell population was studied over time without FACS sorting. An MOI yielding a low percentage of positive cells was chosen to rule out multi-copy effects per cell. As shown in FIG. 11A (pool I with approximately 5% GFP positive cells), the mean value of positive cells in the 1.5 UCOE-CMV population at the last measurement time point was 2.8 times higher than the mean value of the CMV population. For pool II (approximately 15-25% GFP positive cells; FIG. 12A ), the mean value of positive cells for the 1.5 UCOE-CMV population at the latest measurement time point was 2.3-fold higher than the mean value of the CMV population.

与FACS分选的群相比,1.5UCOE-CMV群与CMV群相比在所有分析时间点产生明显更密集的表达峰(图11B和图12B)。Compared to the FACS sorted population, the 1.5UCOE-CMV population produced significantly denser expression peaks compared to the CMV population at all time points analyzed (Figure 1 IB and Figure 12B).

至于CHO-K1,稳定地转导有逆转录病毒构建体的10,000个HeLa细胞的集合体进行FACS分选。在图13A,平均GFP值被显示为包括FACS分选当天(第五天)的平均值。在最后一次测量中,1.5UCOE-CMV集合体的平均值比CMV集合体的平均值高1.6倍。如图13B的柱状图所示,许多CMV细胞经一段时间后失去其高GFP表达水平;相反大部分1.5UCOE-CMV细胞保持其高GFP表达。As for CHO-K1, pools of 10,000 HeLa cells stably transduced with the retroviral construct were subjected to FACS sorting. In Figure 13A, mean GFP values are shown as mean including the day of FACS sorting (day 5). At the last measurement, the average value of 1.5 UCOE-CMV aggregates was 1.6 times higher than the average value of CMV aggregates. As shown in the histogram of Figure 13B, many CMV cells lost their high GFP expression levels over time; in contrast, most of the 1.5UCOE-CMV cells maintained their high GFP expression.

HeLa单细胞克隆同样进行FACS-挑选并扩增到6孔板中。CMV克隆趋向于产生比1.5UCOE-CMV克隆稍高的表达峰,然而,无论在克隆内部还是在克隆之间,如全部分析的克隆的柱状图(图14A-E)所示,由于存在UCOE,表达水平的一致性被显著地增加。图15显示GFP表达差异的平均系数(CV)和其标准差异(SD)。CMV克隆的CV+/-SD值为135.0+/-165.3,并且1.5UCOE-CMV克隆为49.4+/-10.9。HeLa single cell clones were also FACS-picked and expanded into 6-well plates. CMV clones tended to produce slightly higher expression peaks than 1.5 UCOE-CMV clones, however, both within and between clones, as shown in the histograms of all clones analyzed (Figure 14A-E), due to the presence of UCOE, Consistency of expression levels was significantly increased. Figure 15 shows the mean coefficient (CV) of GFP expression difference and its standard difference (SD). The CV+/-SD value of the CMV clone was 135.0+/-165.3 and that of the 1.5UCOE-CMV clone was 49.4+/-10.9.

序列表sequence listing

<110>ML实验室公开有限公司(ML Laboratories PLC)<110>ML Laboratories PLC

<120>改进的表达元件(Improved expression ements)<120> Improved expression elements

<130>P101405<130>P101405

<160>2<160>2

<170>PatentIn version 3.1<170>PatentIn version 3.1

<210>1<210>1

<211>1546<211>1546

<212>DNA<212>DNA

<213>Homo sapiens<213>Homo sapiens

<400>1<400>1

 the

ggccctccgc gcctacagct caagccacat ccgaaggggg agggagccgg gagctgcgcg     60ggccctccgc gcctacagct caagccacat ccgaaggggg agggagccgg gagctgcgcg 60

cggggccgcc ggggggaggg gtggcaccgc ccacgccggg cggccacgaa gggcggggca    120cggggccgcc ggggggaggg gtggcaccgc ccacgccggg cggccacgaa gggcggggca 120

gcgggcgcgc gcgcggcggg gggaggggcc ggcgccgcgc ccgctgggaa ttggggccct    180gcgggcgcgc gcgcggcggg gggaggggcc ggcgccgcgc ccgctgggaa ttggggccct 180

agggggaggg cggaggcgcc gacgaccgcg gcacttaccg ttcgcggcgt ggcgcccggt    240aggggggaggg cggaggcgcc gacgaccgcg gcacttaccg ttcgcggcgt ggcgcccggt 240

ggtccccaag gggagggaag ggggaggcgg ggcgaggaca gtgaccggag tctcctcagc    300ggtccccaag gggagggaag ggggaggcgg ggcgaggaca gtgaccggag tctcctcagc 300

ggtggctttt ctgcttggca gcctcagcgg ctggcgccaa aaccggactc cgcccacttc    360ggtggctttt ctgcttggca gcctcagcgg ctggcgccaa aaccggactc cgcccacttc 360

ctcgcccgcc ggtgcgaggg tgtggaatcc tccagacgct gggggagggg gagttgggag    420ctcgcccgcc ggtgcgaggg tgtggaatcc tccagacgct gggggagggg gagttggggag 420

cttaaaaact agtacccctt tgggaccact ttcagcagcg aactctcctg tacaccaggg    480cttaaaaact agtacccctt tgggaccact ttcagcagcg aactctcctg tacaccaggg 480

gtcagttcca cagacgcggg ccaggggtgg gtcattgcgg cgtgaacaat aatttgacta    540gtcagttcca cagacgcggg ccagggggtgg gtcattgcgg cgtgaacaat aatttgacta 540

gaagttgatt cgggtgtttc cggaaggggc cgagtcaatc cgccgagttg gggcacggaa    600gaagttgatt cgggtgtttc cggaaggggc cgagtcaatc cgccgagttg gggcacggaa 600

aacaaaaagg gaaggctact aagatttttc tggcgggggt tatcattggc gtaactgcag    660aacaaaaagg gaaggctact aagatttttc tggcgggggt tatcattggc gtaactgcag 660

ggaccacctc ccgggttgag ggggctggat ctccaggctg cggattaagc ccctcccgtc    720ggaccacctc ccgggttgag ggggctggat ctccaggctg cggattaagc ccctcccgtc 720

ggcgttaatt tcaaactgcg cgacgtttct cacctgcctt cgccaaggca ggggccggga    780ggcgttaatt tcaaactgcg cgacgtttct cacctgcctt cgccaaggca ggggccggga 780

ccctattcca agaggtagta actagcagga ctctagcctt ccgcaattca ttgagcgcat    840ccctattcca agaggtagta actagcagga ctctagcctt ccgcaattca ttgagcgcat 840

ttacggaagt aacgtcgggt actgtctctg gccgcaaggg tgggaggagt acgcatttgg    900ttacggaagt aacgtcgggt actgtctctg gccgcaaggg tgggaggagt acgcatttgg 900

cgtaaggtgg ggcgtagagc cttcccgcca ttggcggcgg atagggcgtt tacgcgacgg    960cgtaaggtgg ggcgtagagc cttcccgcca ttggcggcgg atagggcgtt tacgcgacgg 960

cctgacgtag cggaagacgc gttagtgggg gggaaggttc tagaaaagcg gcggcagcgg   1020cctgacgtag cggaagacgc gttagtgggg gggaaggttc tagaaaagcg gcggcagcgg 1020

ctctagcggc agtagcagca gcgccgggtc ccgtgcggag gtgctcctcg cagagttgtt   1080ctctagcggc agtagcagca gcgccgggtc ccgtgcggag gtgctcctcg cagagttgtt 1080

tctcgagcag cggcagttct cactacagcg ccaggacgag tccggttcgt gttcgtccgc   1140tctcgagcag cggcagttct cactacagcg ccaggacgag tccggttcgt gttcgtccgc 1140

ggagatctct ctcatctcgc tcggctgcgg gaaatcgggc tgaagcgact gagtccgcga   1200ggagatctct ctcatctcgc tcggctgcgg gaaatcgggc tgaagcgact gagtccgcga 1200

tggaggtaac gggtttgaaa tcaatgagtt attgaaaagg gcatggcgag gccgttggcg   1260tggaggtaac gggtttgaaa tcaatgagtt attgaaaagg gcatggcgag gccgttggcg 1260

cctcagtgga agtcggccag ccgcctccgt gggagagagg caggaaatcg gaccaattca   1320cctcagtgga agtcggccag ccgcctccgt gggagagagg caggaaatcg gaccaattca 1320

gtagcagtgg ggcttaaggt ttatgaacgg ggtcttgagc ggaggcctga gcgtacaaac   1380gtagcagtgg ggcttaaggt ttatgaacgg ggtcttgagc ggaggcctga gcgtacaaac 1380

agcttcccca ccctcagcct cccggcgcca tttcccttca ctgggggtgg gggatgggga   1440agcttcccca ccctcagcct cccggcgcca tttcccttca ctgggggtgg gggatgggga 1440

gctttcacat ggcggacgct gccccgctgg ggtgaaagtg gggcgcggag gcgggaattc   1500gctttcacat ggcggacgct gccccgctgg ggtgaaagtg gggcgcggag gcgggaattc 1500

ttattccctt tctaaagcac gctgcttcgg gggccacggc gtctcc                  1546ttatccctt tctaaagcac gctgcttcgg gggccacggc gtctcc 1546

 the

<210>2<210>2

<211>987<211>987

<212>DNA<212>DNA

<213>Homo sapiens<213>Homo sapiens

<400>2<400>2

 the

ccggaagggg ccgagtcaat ccgccgagtt ggggcacgga aaacaaaaag ggaaggctac     60ccggaagggg ccgagtcaat ccgccgagtt ggggcacgga aaacaaaaag ggaaggctac 60

taagattttt ctggcggggg ttatcattgg cgtaactgca gggaccacct cccgggttga    120taagattttt ctggcggggg ttatcattgg cgtaactgca gggaccacct cccgggttga 120

gggggctgga tctccaggct gcggattaag cccctcccgt cggcgttaat ttcaaactgc    180gggggctgga tctccaggct gcggattaag cccctcccgt cggcgttaat ttcaaactgc 180

gcgacgtttc tcacctgcct tcgccaaggc aggggccggg accctattcc aagaggtagt    240gcgacgtttc tcacctgcct tcgccaaggc aggggccggg accctattcc aagaggtagt 240

aactagcagg actctagcct tccgcaattc attgagcgca tttacggaag taacgtcggg    300aactagcagg actctagcct tccgcaattc attgagcgca tttacggaag taacgtcggg 300

tactgtctct ggccgcaagg gtgggaggag tacgcatttg gcgtaaggtg gggcgtagag    360tactgtctct ggccgcaagg gtgggaggag tacgcatttg gcgtaaggtg gggcgtagag 360

ccttcccgcc attggcggcg gatagggcgt ttacgcgacg gcctgacgta gcggaagacg    420ccttcccgcc attggcggcg gatagggcgt ttacgcgacg gcctgacgta gcggaagacg 420

cgttagtggg ggggaaggtt ctagaaaagc ggcggcagcg gctctagcgg cagtagcagc    480cgttagtggg ggggaaggtt ctagaaaagc ggcggcagcg gctctagcgg cagtagcagc 480

agcgccgggt cccgtgcgga ggtgctcctc gcagagttgt ttctcgagca gcggcagttc    540agcgccgggt cccgtgcgga ggtgctcctc gcagagttgt ttctcgagca gcggcagttc 540

tcactacagc gccaggacga gtccggttcg tgttcgtccg cggagatctc tctcatctcg    600tcactacagc gccaggacga gtccggttcg tgttcgtccg cggagatctc tctcatctcg 600

ctcggctgcg ggaaatcggg ctgaagcgac tgagtccgcg atggaggtaa cgggtttgaa    660ctcggctgcg ggaaatcggg ctgaagcgac tgagtccgcg atggaggtaa cgggtttgaa 660

atcaatgagt tattgaaaag ggcatggcga ggccgttggc gcctcagtgg aagtcggcca    720atcaatgagt tattgaaaag ggcatggcga ggccgttggc gcctcagtgg aagtcggcca 720

gccgcctccg tgggagagag gcaggaaatc ggaccaattc agtagcagtg gggcttaagg    780gccgcctccg tgggagagag gcaggaaatc ggaccaattc agtagcagtg gggcttaagg 780

tttatgaacg gggtcttgag cggaggcctg agcgtacaaa cagcttcccc accctcagcc    840tttatgaacg gggtcttgag cggaggcctg agcgtacaaa cagcttcccc accctcagcc 840

tcccggcgcc atttcccttc actgggggtg ggggatgggg agctttcaca tggcggacgc    900tcccggcgcc atttcccttc actgggggtg ggggatgggg agctttcaca tggcggacgc 900

tgccccgctg gggtgaaagt ggggcgcgga ggcgggaatt cttattccct ttctaaagca    960tgccccgctg gggtgaaagt ggggcgcgga ggcgggaatt cttattccct ttctaaagca 960

cgctgcttcg ggggccacgg cgtctcc                                        987cgctgcttcg ggggccacgg cgtctcc 987

Claims (20)

1. isolating polynucleotide comprise
A) the no methylated CpG island of Yan Shening;
B) effable open reading frame can be operationally connected to the no methylated CpG island of described extension;
C) promotor can be operationally connected to described open reading frame, and wherein said promotor is not to be connected to described CpG island natively;
It is characterized in that described CpG island comprises the fragment of the people HP-1/hnRNPA2 locus that is no more than 1.6kb, and the repeating to be expressed in the two or more types of organizations and obtain of wherein said open reading frame.
2. the polynucleotide of claim 1 comprise the Esp31 restriction fragment.
3. the polynucleotide of claim 2 comprise the sequence of Nucleotide 977 to 2522 among the Fig. 2 shown in the SEQ ID NO:1.
4. each polynucleotide of claim 1 to 3 comprise the fragment of the HP-1/hnRNPA2 locus that is no more than 1kb.
5. the polynucleotide of claim 4 comprise the BspE1-Esp31 restriction fragment.
6. the polynucleotide of claim 5 comprise the sequence of Nucleotide 1536 to 2522 among the Fig. 2 shown in the SEQ ID NO:2.
7. carrier contains right and requires 1 to 6 each polynucleotide.
8. the carrier of claim 7, carrier wherein is an episomal vector.
9. the carrier of claim 7, carrier wherein is an integrating vector.
10. each carrier of claim 7-9, carrier wherein is a plasmid.
11. each carrier of claim 7 to 10, wherein the gene that can be operatively connected is a therapeutic nucleic acids.
12. each carrier of claim 7 to 11 comprises following any: SEQ IDNOs 1 or 2, CMV promotor, multiple clone site, polyadenylic acid sequence and the gene of coding selected marker thing under suitable controlling elements.
13. host cell comprises each polynucleotide of claim 1 to 6, or each carrier of claim 7 to 12.
14. each each carrier or the application that is used for gene therapy medicine in preparation of the host cell of claim 13 of polynucleotide, claim 7 to 12 of claim 1 to 6.
15. pharmaceutical composition, with the claim 1 to 6 of pharmaceutical excipient combination each polynucleotide, claim 7 to 12 each carrier or the host cell of claim 13.
16. claim 1 to 6 each polynucleotide, claim 7 to 12 each carrier or the host cell of claim 13 in cell culture system for obtaining the application of required gene product.
17.SEQ the application of the polynucleotide of ID NOs 1 or 2 in improving endogenous gene expression is included in the position that can be operatively connected endogenous gene polynucleotide is inserted in the genome of cell, thereby improves the expression of gene level.
18. claim 1 to 6 each polynucleotide, claim 7 to 12 each carrier or the host cell of claim 13 in the expression that obtains the inverted defined gene sequence with the application in the gene order of inactivation correspondence.
19. each each the application of polynucleotide in the preparation expression library of carrier of polynucleotide, claim 7 to 12 of claim 1 to 6.
20.SEQ the application of the polynucleotide of ID NOs 1 or 2 in the method for identifying the expressible gene in the non-human animal, comprise that the construct that will contain polynucleotide is inserted in non-human animal's the embryonic stem cell, wherein said construct can allow to carry out medicament selection after inserting the gene of expressing.
CN200480003321.1A 2003-02-01 2004-02-02 Improved expression element Expired - Lifetime CN1748034B (en)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
GB0302330.6 2003-02-01
GB0302330A GB0302330D0 (en) 2003-02-01 2003-02-01 Improved expression elements
GB0308503A GB0308503D0 (en) 2003-04-12 2003-04-12 Improved expression elements
GB0308503.2 2003-04-12
GB0320824.6 2003-09-05
GB0320824A GB0320824D0 (en) 2003-09-05 2003-09-05 Improved expression elements
PCT/GB2004/000387 WO2004067701A2 (en) 2003-02-01 2004-02-02 Improved genetic elements providing high levels of expression

Publications (2)

Publication Number Publication Date
CN1748034A CN1748034A (en) 2006-03-15
CN1748034B true CN1748034B (en) 2011-03-02

Family

ID=9952257

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200480003321.1A Expired - Lifetime CN1748034B (en) 2003-02-01 2004-02-02 Improved expression element

Country Status (2)

Country Link
CN (1) CN1748034B (en)
GB (1) GB0302330D0 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100368549C (en) * 2006-03-29 2008-02-13 北京未名凯拓作物设计中心有限公司 Bacterial plasmid, its derivative plasmid and application
US11504435B2 (en) * 2016-06-13 2022-11-22 The University Of North Carolina At Chapel Hill Optimized CLN1 genes and expression cassettes and their use
CN112575031B (en) * 2019-09-29 2023-04-07 新乡医学院 Ubiquitous chromatin open expression element, recombinant expression vector, expression system, preparation method and application thereof

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
KOZU T 等.STRUCTURE AND EXPRESSION OF THE GENE(HNRPA2B1) ENCODING THE HUMAN HNRNP PROTEINA2/B1.GENOMICS25 2.1995,25(2),365-371.
KOZU T等.STRUCTURE AND EXPRESSION OF THE GENE(HNRPA2B1) ENCODING THE HUMAN HNRNP PROTEINA2/B1.GENOMICS25 2.1995,25(2),365-371. *

Also Published As

Publication number Publication date
GB0302330D0 (en) 2003-03-05
CN1748034A (en) 2006-03-15

Similar Documents

Publication Publication Date Title
CN1461343B (en) polynucleotide
AU2001290067A1 (en) Artificial ubiquitous chromatin opening elements (UCOE)
JP2011193881A (en) Improved gene expression
AU2002242872A1 (en) Improved gene expression
CN1748034B (en) Improved expression element
US7812148B2 (en) Vectors comprising CpG islands without position effect varigation and having increased expression
KR20100102235A (en) How to reduce directional bias in expression
JP4709743B2 (en) Improved expression elements
JP4642865B2 (en) Vector containing the new control element
KR20040016848A (en) Improved Gene Expression

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: SERUM INVESTMENT CO.,LTD.

Free format text: FORMER OWNER: INNOVATA PLC

Effective date: 20060630

C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20060630

Address after: Delaware

Applicant after: Serum investment Co.

Address before: British Nottingham

Applicant before: Innovation Public Ltd.

ASS Succession or assignment of patent right

Owner name: SERUM ROYAL CO.,LTD.

Free format text: FORMER OWNER: SERUM INVESTMENT CO.,LTD.

Effective date: 20070720

Owner name: SERUM FINANCE CO.,LTD.

Free format text: FORMER OWNER: SERUM ROYAL CO.,LTD.

Effective date: 20070720

Owner name: BLOOD SERUM CO.,LTD.

Free format text: FORMER OWNER: SERUM FINANCE CO.,LTD.

Effective date: 20070720

Owner name: MIANYANG FUTE RELAY CO., LTD.

Free format text: FORMER OWNER: BLOOD SERUM CO.,LTD.

Effective date: 20070720

C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20070720

Address after: American Georgia

Applicant after: Serum company

Address before: Bahamian archipelago bus Island

Applicant before: Serum finance Co.

Effective date of registration: 20070720

Address after: Bahamian archipelago bus Island

Applicant after: Serum finance Co.

Address before: Bahamian archipelago bus Island

Applicant before: Serum Company RoyaL

Effective date of registration: 20070720

Address after: Bahamian archipelago bus Island

Applicant after: Serum Company RoyaL

Address before: Delaware

Applicant before: Serum investment Co.

Effective date of registration: 20070720

Address after: Massachusetts, USA

Applicant after: Millipore Corp.

Address before: American Georgia

Applicant before: Serum company

C14 Grant of patent or utility model
GR01 Patent grant
C56 Change in the name or address of the patentee

Owner name: EMD MILLIPORE CORPORATION

Free format text: FORMER NAME: MILLIPORE CORP.

CP01 Change in the name or title of a patent holder

Address after: Massachusetts, America

Patentee after: EMD MILLIPORE Corp.

Address before: Massachusetts, America

Patentee before: Millipore Corp.

CX01 Expiry of patent term
CX01 Expiry of patent term

Granted publication date: 20110302