[go: up one dir, main page]

WO2005005661A2 - Genes regulated in ovarian cancer as prognostic and therapeutic targets - Google Patents

Genes regulated in ovarian cancer as prognostic and therapeutic targets Download PDF

Info

Publication number
WO2005005661A2
WO2005005661A2 PCT/EP2004/007167 EP2004007167W WO2005005661A2 WO 2005005661 A2 WO2005005661 A2 WO 2005005661A2 EP 2004007167 W EP2004007167 W EP 2004007167W WO 2005005661 A2 WO2005005661 A2 WO 2005005661A2
Authority
WO
WIPO (PCT)
Prior art keywords
genes
expression
ovarian cancer
patient
see
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/EP2004/007167
Other languages
French (fr)
Other versions
WO2005005661A3 (en
Inventor
Christian Nicolas Lavedan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Novartis Pharma GmbH Austria
Novartis AG
Original Assignee
Novartis Pharma GmbH Austria
Novartis AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Novartis Pharma GmbH Austria, Novartis AG filed Critical Novartis Pharma GmbH Austria
Priority to EP04740533A priority Critical patent/EP1644522A2/en
Priority to MXPA05014220A priority patent/MXPA05014220A/en
Priority to BRPI0412110-4A priority patent/BRPI0412110A/en
Priority to CA002531091A priority patent/CA2531091A1/en
Priority to US10/562,406 priority patent/US20060281091A1/en
Priority to AU2004256182A priority patent/AU2004256182A1/en
Priority to JP2006518084A priority patent/JP2007526749A/en
Publication of WO2005005661A2 publication Critical patent/WO2005005661A2/en
Publication of WO2005005661A3 publication Critical patent/WO2005005661A3/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P35/00Antineoplastic agents
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/136Screening for pharmacological compounds
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers

Definitions

  • the present invention belongs to the fields of medicine and relates to the use of genomic analysis to evaluate and treat ovarian cancer.
  • this invention relates to the measurement of patterns of gene expression to determine the presence of ovarian cancer in a patients tissues.
  • Ovarian cancer is one of the most common types of cancer that affects women in the United States, with a lifetime risk of approximately 1/70. See Whittemore, Gynecol. Oncol., Vol. 55, No. 3, Part 2, pp. S15-S19 (1994). It is a rapidly fatal disease usually detected late, with still no good method of prevention. The greatest risk factor for ovarian cancer is a family history of the disease, suggesting the strong influence of genetics. See Schildkraut and Thompson, Am. J. Epidemiol., Vol. 128, No. 3, pp. 456-466 (1988). Other factors such as demographic, lifestyle and reproductive factors have also been shown to contribute to the risk of ovarian cancer.
  • genes apparently expressed at high levels, or with the biggest change in expression, may not always be the most relevant; it is conceivable that a small disruption of the very tight regulation of genes may have dramatic consequences, even when the level of expression is low.
  • this invention provides a method to determine if a patient is afflicted with ovarian cancer comprising: a) obtaining a sample from the said patient; b) determining the levels of gene expression of two or more of the genes listed in Table 9 in the sample from the patient; c) comparing the levels of gene expression of the two or more genes determined in (b) to the levels of the same genes listed in Table 1 ; d) determining the degree of similarity (DOS) between the levels of gene expression of the two or more genes determined in (c); and e) determining from the DOS between the level of gene expression of the two or more genes the probability that the sample shows evidence of the presence of ovarian cancer in the patient.
  • this invention provides a method wherein the levels of gene expression are determined for a subset of the genes listed in Table 9 comprising genes
  • the invention employs a sample comprising cells obtained from the patient. These may be cells removed from a solid tumor in the said patient or, in a preferred embodiment, the sample comprises blood cells and serum drawn from the said patient. In a most preferred embodiment, the sample comprises a body fluid drawn from the patient.
  • this invention employs a method of determining the level of gene expression comprising measuring the levels of protein expression product in the sample from the patient. This may be done in a variety of ways including, but not limited to, detecting the presence and level of the protein expression products using a reagent which specifically binds with the proteins, wherein the reagent may be selected from the group consisting of an antibody, an antibody derivative and an antibody fragment.
  • this invention provides a method wherein the levels of expression in the sample are assessed by measuring the levels in the sample of the transcribed polynucleotides of the two or more gene in Table 9.
  • These transcribed polynucleotide may be mRNA or complementary DNA (cDNA).
  • this method would further include the step of amplifying the transcribed polynucleotide.
  • this invention includes a method of treating a subject afflicted with ovarian cancer, the method comprising providing to cells of the subject an antisense oligonuceotide complimentary to one or more of the genes whose expression is up-regulated in ovarian cancer as shown in Table 8.
  • this invention provides a method of inhibiting ovarian cancer in a subject at risk for developing ovarian cancer, the method comprising inhibiting expression of one or more of the genes shown in Table 8 to be up-regulated in ovarian cancer.
  • kits for use in determining treatment strategy for a patient with suspected ovarian cancer comprising: a) a number (for example, two or more) of antibodies able to recognize and bind to the polypeptide expression product of the two or more of the genes in Table 9; b) a container suitable for containing the said antibodies and a sample of body fluid from the said individual wherein the antibody can contact the polypeptide expressed by the two or more genes shown in Table 9 if they are present; c) means to detect the combination of the said antibodies with the polypeptides expressed by the two or more genes shown in Table 9; and d) instructions for use and interpretation of the kit results.
  • this invention provides a kit for use in determining the presence or absence of ovarian cancer in a patient comprising: a) a number (for example, two or more) of polynucleotides able to recognize and bind to the mRNA expression product of the two or more genes shown in Table 9; b) a container suitable for containing the said polynucleotides and a sample of body fluid from the said individual wherein the said polynucleotide can contact the mRNA, if it is present; c) means to detect the levels of combination of the said polynucleotide with the mRNA from the two or more genes shown in Table 9; and d) instructions for use and interpretation of the kit results.
  • a number for example, two or more
  • FIG. 1(A) Re-Classification of Samples Using Increasing Number of Probe Sets.
  • FIG. 1 (B). Plot of Errors as a Function of Number of Probe Sets for Determination of
  • FIG. 2 Determination of a Threshold CC Value for Classification of Ovarian Status.
  • FIG. 3 Correlation of Test and Validation Biopsy Profiles with Mean Normal Profile for Different Size Probe Sets.
  • N or T represent Normal or Tumor status, respectively,
  • r is the PCC value of the probe set profile of the corresponding biopsy sample with the mean
  • FIG. 4 Correlation of Biopsy Profiles with Mean of AH Normal Profiles for Different Size Probe Sets. N or T represent Normal or Tumor status, respectively, "r" is the PCC value of the probe set profile of the corresponding biopsy sample with the mean profile of all Normal samples. Samples are ordered from highest CC to lowest.
  • the present invention provides methods to determine whether or not a sample from a patient including, but not limited to, biopsy tissue or blood, serum or some other body fluid from a patient, contains evidence of the presence of ovarian cancer in the patient.
  • This invention is based, in part, on the discovery of approximately 900 genes which are differentially expressed in tissue from ovarian cancer as compared to normal tissue.
  • This methods of this invention comprise measuring the activities of the approximately 900 or fewer genes that are shown to be differently-expressed in ovarian cancer as compared to normal tissue.
  • This DOS could be determined by any procedure that produces a result whose value is a known function of the DOS between the two groups of numbers, i.e., the measured gene expression values of the two or more genes in tissue from an individual whose ovarian cancer status is unknown and to be determined and the measured gene expression values for the same two or more genes from individuals whose tissue is known to contain ovarian cancer and from individuals whose tissue is known not to contain ovarian cancer.
  • DOS shall mean the extent to which the pattern of gene expression values are alike or numerically similar, as measured by a comparison of the values of gene expression determined by direct or indirect methods.
  • the DOS would be determined by a mathematical calculation resulting in a correlation coefficient (CC).
  • CC correlation coefficient
  • PCC Pearson Correlation Coefficient
  • the value of the DOS (PCC), so calculated, can then be directly related to the probability that the tissue sample is from a patient who does or does not have ovarian cancer. That is to say, the higher the patients' DOS (CC or PCC) as compared to the gene expression values from a patient who does not have ovarian cancer or the higher the DOS (CC or PCC) as compared to the gene expression values from a patient who does have ovarian cancer then the greater the probability that the patient does not or does have ovarian cancer, respectively.
  • the value of the DOS can be used to determine probabilities for the presence of ovarian cancer.
  • PCC DOS
  • this would work as shown in FIG. 3, using the 28 predictor probe set (as described below) if the gene expression profile correlates with the mean normal (No Ovarian Cancer) profile with a CC ⁇ 0.920 the tissue sample is 63 times more likely to contain ovarian cancer then if the CC >0.920 [odds ratio (OR) 63 with 95% confidence interval (Cl): 3.3-1194.7].
  • the PCC can be set to produce optional sensitivity. That is, to make the smallest possible number of false negatives (Ovarian Cancer misclassified as No Ovarian Cancer).
  • the threshold is determined by setting the PCC to >0.955.
  • the threshold is determined by setting the PCC to >0.955.
  • 100% of patients with a CC of >0.955 as compared to the No Ovarian Cancer group did not have ovarian cancer and 100% of the patients whose CC were ⁇ 0.870, as compared to the No Ovarian Cancer group, did have ovarian cancer.
  • one of skill in the art can choose a PCC that will either maximize sensitivity or maximize specificity or produce any desired ratio of false positives or false negatives.
  • One of skill in the art can easily adjust their choice of PCC to the clinical situation to provide maximum benefit and safety to the patient.
  • Another aspect of the of the invention are methods to treat ovarian cancer. These methods consist of various efforts to suppress the excess gene expression of the genes that have been found to be up-regulated in ovarian cancer. These genes are shown in Table 8. Methods to decrease the excess expression of these gene would include, but not be limited to, use of antisense DNA, siRNA and methods to complex and deactivate the protein expression products of these over-expressed genes. Methods of Measurement
  • the gene expression of a selected group of the 900 genes is determined by measuring mRNA levels from tissue samples as described below.
  • the gene expression can be measured more indirectly by measuring polypeptide gene expression products in tissues including, but not limited to, tumor and blood tissue.
  • gene expression is measured by identifying the presence or amount of one or more proteins encoded by one of the genes listed in Table 9.
  • the present invention also provides systems for detecting two or more markers of interest, e.g., two or more markers from Table 2. For example, where it is determined that a finite set of particular markers provides relevant information, a detection system is provided that detects the finite set of markers. For example, as opposed to detecting all genes expressed in a tissue with a generic microarray, a defined microarray or other detection technology is employed to detect the plurality, e.g., 28, 42, etc., of markers that define a biological condition, e.g., the presence or absence of ovarian cancer, etc.
  • the present invention is not limited by the method in which gene expression biomarkers are detected or measured.
  • mRNA, cDNA or protein is detected in tissue samples, e.g., biopsy samples.
  • mRNA, cDNA or protein is detected in bodily fluids, e.g., serum, plasma, urine or saliva.
  • a preferred embodiment of the invention provides that the method of the invention is performed ex vivo.
  • the present invention further provides kits for the detection of these relevant gene expression biomarkers.
  • protein or the polypeptide expression product is detected. Protein expression may be detected by any suitable method. In some embodiments, proteins are detected by binding of an antibody specific for the protein. For example, in some embodiments, antibody binding is detected using a suitable technique including, but not limited to, radioimmunoassay, enzyme-linked immunosorbant assay (ELISA), "sandwich” immunoassays, immunoradiometric assays, gel diffusion precipitation reactions, immunodiffusion assays, in situ immunoassays, e.g., using colloidal gold, enzyme or radioisotope labels, e.g., Western blots, precipitation reactions, agglutination assays, e.g., gel agglutination assays, hemagglutination assays, etc., complement fixation assays, immunofluorescence assays, protein A assays, immunoelectrophoresis assays and proteomic assays, such as the use of
  • antibody binding is detected by detecting a label on the primary antibody.
  • the primary antibody is detected by detecting binding of a secondary antibody or reagent to the primary antibody.
  • the secondary antibody is labeled.
  • an automated detection assay is utilized. Methods for the automation of immunoassays include, but are not limited to, those described in U.S. Patent Nos. 5,885,530; 4,981,785; 6,159,750; and 5,358,691, each of which is herein incorporated by reference.
  • the analysis and presentation of results is also automated.
  • software that generates a diagnosis and/or prognosis based on the presence or absence of a series of proteins corresponding to markers is utilized.
  • the immunoassay described in U.S. Patent Nos. 5,599,677 and 5,672,480, each of which is herein incorporated by reference, is utilized.
  • proteins are detected by immunohistochemistry.
  • markers are detected at the level of cDNA or RNA.
  • gene expression biomarkers shall mean any biologic marker which can indicate the rate or degree of gene expression of a specific gene including, but not limited to, mRNA, cDNA or the polypeptide expression product of the specific gene.
  • gene expression biomarkers are detected using a PCR-based assay.
  • reverse-transcriptase PCR RT-PCR
  • RNA is enzymatically converted to cDNA using a reverse-transcriptase enzyme.
  • the cDNA is then used as a template for a PCR reaction.
  • PCR products can be detected by any suitable method including, but not limited to, gel electrophoresis and staining with a DNA-specific stain or hybridization to a labeled probe.
  • the quantitative RT-PCR with standardized mixtures of competitive templates method described in U.S. Patent Nos. 5,639,606; 5,643, 765; and 5,876,978, each of which is herein incorporated by reference, is utilized.
  • gene expression biomarkers are detected using a hybridization assay.
  • a hybridization assay the presence or absence of a marker is determined based on the ability of the nucleic acid from the sample to hybridize to a complementary nucleic acid molecule, e.g., an oligonucleotide probe.
  • a complementary nucleic acid molecule e.g., an oligonucleotide probe.
  • a variety of hybridization assays are available.
  • hybridization of a probe to the sequence of interest is detected directly by visualizing a bound probe, e.g., a Northern or Southern assay.
  • a Northern or Southern assay See, e.g., Ausabel et al., eds., Current Protocols in Molecular Biology, John Wiley & Sons, NY (1991).
  • DNA Southern
  • RNA Northern
  • the DNA or RNA is then cleaved with a series of restriction enzymes that cleave infrequently in the genome and not near any of the markers being assayed.
  • the DNA or RNA is then separated, e.g., on an agarose gel, and transferred to a membrane.
  • a labeled probe or probes e.g., by incorporating a radionucleotide, is allowed to contact the membrane under low-, medium- or high-stringency conditions. Unbound probe is removed and the presence of binding is detected by visualizing the labeled probe.
  • the DNA chip assay is a GeneChip (Affymetrix, Santa Clara, CA). See, e.g., U.S. Patent Nos. 6,045,996; 5,925,525; and 5,858,659, each of which is herein incorporated by reference.
  • the GeneChip technology uses miniaturized, high-density arrays of oligonucleotide probes affixed to a "chip". Probe arrays are manufactured by Affymetrix's light-directed chemical synthesis process, which combines solid-phase chemical synthesis with photolithographic fabrication techniques employed in the semiconductor industry.
  • the process constructs high-density arrays of oligonucleotides, with each probe in a predefined position in the array.
  • Multiple probe arrays are synthesized simultaneously on a large glass wafer. The wafers are then diced, and individual probe arrays are packaged in injection-molded plastic cartridges, which protect them from the environment and serve as chambers for hybridization.
  • the nucleic acid to be analyzed is isolated, amplified by PCR and labeled with a fluorescent reporter group.
  • the labeled DNA is then incubated with the array using a fluidics station.
  • the array is then inserted into the scanner, where patterns of hybridization are detected.
  • the hybridization data are collected as light emitted from the fluorescent reporter groups already incorporated into the target, which is bound to the probe array. Probes that perfectly match the target generally produce stronger signals than those that have mismatches. Since the sequence and position of each probe on the array are known, by complementary, the identity of the target nucleic acid applied to the probe array can be determined.
  • a DNA microchip containing electronically captured probes (Nanogen, San Diego, CA) is utilized. See, e.g., U.S. Patent Nos. 6,017,696; 6,068,818; and 6,051,380, each of which are herein incorporated by reference.
  • Nanogen's technology enables the active movement and concentration of charged molecules to and from designated test sites on its semiconductor microchip.
  • DNA capture probes unique to a given gene expression biomarkers are electronically placed at, or "addressed" to, specific sites on the microchip. Since nucleic acid molecules have a strong negative charge, they can be electronically moved to an area of positive charge.
  • an array technology based upon the segregation of fluids on a flat surface (chip) by differences in surface tension (ProtoGene, Palo Alto, CA) is utilized. See, e.g., U.S. Patent Nos. 6,001,311; 5,985,551; and 5,474,796, each of which is herein incorporated by reference.
  • Protogene's technology is based on the fact that fluids can be segregated on a flat surface by differences in surface tension that have been imparted by chemical coatings. Once so segregated, oligonucleotide probes are synthesized directly on the chip by ink-jet printing of reagents.
  • a "bead array” is used for the detection of gene expression biomarkers (lllumina, San Diego, CA). See, e.g., PCT Publications WO 99/67641 and WO 00/39587, each of which is herein incorporated by reference, lllumina uses a BEAD ARRAY technology that combines fiber optic bundles and beads that self-assemble into an array. Each fiber optic bundle contains thousands to millions of individual fibers depending on the diameter of the bundle. The beads are coated with an oligonucleotide specific for the detection of a given marker. Batches of beads are combined to form a pool specific to the array. To perform an assay, the BEAD ARRAY is contacted with a prepared sample. Hybridization is detected using any suitable method.
  • hybridization is detected by enzymatic cleavage of specific structures, e.g., INVADERTM assay, Third Wave Technologies. See, e.g., U.S. Patent Nos. 5,846,717, 6,090, 543; 6,001,567; 5,985,557; and 5,994,069, each of which is herein incorporated by reference.
  • hybridization of a bound probe is detected using a TaqMan assay (PE Biosystems, Foster City, CA). See, e.g., U.S. Patent Nos. 5,962,233 and 5,538,848, each of which is herein incorporated by reference. The assay is performed during a PCR reaction.
  • the TaqMan assay exploits the 5'-3' exonuclease activity of DNA polymerases, such as AMPLITAQ DNA polymerase.
  • a probe specific for a given marker, is included in the PCR reaction.
  • the probe consists of an oligonucleotide with a 5'-reporter dye, e.g., a fluorescent dye and a 3'-quencher dye.
  • a 5'-reporter dye e.g., a fluorescent dye and a 3'-quencher dye.
  • the 5'-3' nucleolytic activity of the AMPLITAQ polymerase cleaves the probe between the reporter and the quencher dye.
  • the separation of the reporter dye from the quencher dye results in an increase of fluorescence.
  • the signal accumulates with each cycle of PCR and can be monitored with a fluorimeter.
  • Additional detection assays that are produced and utilized using the systems and methods of the present invention include, but are not limited to, enzyme mismatch cleavage methods, e.g., Variagenics (see U.S. Patent Nos. 6,110,684; 5,958,692; and 5,851,770, herein incorporated by reference in their entireties); branched hybridization methods, e.g., Chiron (see U.S. Patent Nos. 5,849,481; 5,710,264; 5,124,246; and 5,624,802, herein incorporated by reference in their entireties); rolling circle replication (see, e.g., U.S. Patent Nos.
  • mass spectroscopy is used to detect gene expression biomarkers.
  • a MASSARRAYTM system (Sequenom, San Diego, CA) is used to detect gene expression biomarkers. See, e.g., U.S. Patent Nos. 6,043,031; 5,777,324; and 5,605,798, each of which is herein incorporated by reference.
  • kits for the identification, characterization and quantitation of gene expression biomarkers contain antibodies specific for gene expression biomarkers, in addition to detection reagents and buffers.
  • the kits contain reagents specific for the detection of nucleic acid, e.g., oligonucleotide probes or primers.
  • the kits contain all of the components necessary to perform a detection assay, including all controls, directions for performing assays and any necessary software for analysis and presentation of results.
  • the kits contain instructions including a statement of intended use as required by the Environmental Protection Agency or U.S. Food and Drug Administration (FDA) for the labeling of in vitro diagnostic assays and/or of pharmaceutical or food products.
  • FDA Environmental Protection Agency
  • the present invention is a method of screening a test compound for the ability to inhibit, retard, reverse or mimic the gene expression changes characteristic of ovarian cancer.
  • a test mammal known to have ovarian cancer with a test compound and then analyze a representative tissue of the mammal for the level of expression of the genes or sequences which change in expression in response to ovarian cancer.
  • the tissue is biopsy material from the tumor or, in a preferred embodiment, an easily obtainable tissue, such as blood or serum.
  • Methods of increasing and decreasing expression would be known to one of skill in the art. Examples for supplementation of expression would include supplying the organism with additional copies of the gene. A preferred example for decreasing expression would include RNA antisense technologies or pharmaceutical intervention.
  • the genes disclosed in Table 2 would be appropriate drug development targets. One would use the information presented in the present application for drug development by using currently existing, or by developing, pharmaceutical compounds that either mimic or inhibit the activity of the genes listed in Table 2, or the proteins encoded by these genes. Therefore, the gene expression biomarkers or genes disclosed herein represent targets for pharmaceutical development and gene therapy or RNA antisense therapy with the goal of suppressing the changes characteristic of ovarian cancer at the molecular level. These gene expression alterations may also play a role in understanding the various mechanisms that underlie ovarian cancer. Additionally, these genes represent biomarkers of ovarian cancer that can be used for diagnostic purposes.
  • the present invention is not limited by the form of the expression profile.
  • the expression profile is maintained in computer software.
  • the expression profile is written material.
  • the present invention is not limited by the number of markers provided or displayed in an expression profile.
  • the expression profile may comprise two or more markers found in Table 2, indicating a biological status of a sample.
  • the present invention further provides databases comprising expression information, e.g., expression profiles comprising one or more markers from Table 2 from one or more samples.
  • the databases find use in data analysis including, but not limited to, comparison of markers to one or more public or private information databases, e.g., OMIM, GenBank, BLAST, Molecular Modeling Databases, Medline, genome databases, etc.
  • an automated process is carried out to automatically associate information obtained from data obtained using the methods of the present invention to information in one or more of public or private databases. Associations find use, e.g., in making expression correlations to phenotypes, e.g., disease states.
  • EXAMPLE 1 Preferred Methods To identify genes involved in the development and progression of ovarian tumors, we compared the gene expression profiles of a series of Normal and Tumor ovarian biopsies. Gene expression data for more than 12,000 genes were generated from each sample. Of the 900 probe sets that we observed to be most differentially-expressed between the Normal and cancerous ovarian biopsies, 98% were down-regulated in the Tumor biopsies. Using 8 Normal and 10 Tumor samples, we identified a minimum number of probe sets (28) that could be used to classify biopsies as Normal or Tumor. This finding was validated on a second set of biopsies (4 Normal and 14 Tumor) previously profiled by another laboratory. A mean Normal ovarian profile was established that could be used as a reference to compare other ovarian biopsies. The identification of the most differentially-expressed genes between
  • Normal and Tumor ovarian biopsies may provide new insight into the molecular mechanisms of ovarian tumor development and progression.
  • Some of the genes identified in this study are known to be involved in ovarian cancer, but a large proportion represents novel candidates for drug targets and molecular biomarkers to diagnose or monitor disease and treatment.
  • Flash-frozen ovarian biopsies were obtained from Asterand (Detroit, Ml), and consisted of 10 Tumor samples and 10 adjacent Normal tissues. Total RNA was also purchased for 4 additional samples from Ambion (Austin, TX) and Stratagene (La Jolla, CA).
  • tumors analyzed were malignant surface epithelial serous tumors, e.g., papillary cystcarcinoma, papillary cystadenocarcinoma or papillary cystcarcinoma; others included a mucinous cyst carcinoma, an endometrioid adenocarcinoma and a mature teratoma.
  • Mote Paired samples (Normal and Tumor adjacent tissue) obtained from the same patient are boxed together. Stages of ovarian cancers are indicated using the FIGO staging system.
  • Genome U95Av2 http://www.affvmetrix.com/products/arravs/specific/hau95.affx).
  • a "false Neg” is defined as a Tumor incorrectly classified as a Normal ovary tissue
  • a "false Pos” is defined as a Normal tissue incorrectly classified as a Tumor.
  • the OR indicates that an ovarian biopsy of the test set is nearly 63 times more likely to be from an ovarian tumor if its expression profile of the 28 predictor probe sets correlates with the mean Normal profile with a CC ⁇ 0.920 (see Table 4). In our test set, 100% of profiles with a CC >0.955 correspond to Normal biopsies and 100% of profiles with a CC ⁇ 0.870 correspond to Tumor biopsies (see Fig. 3). Validation of the mean Normal profile
  • Fig. 3 summarizes the classification of all ovarian biopsies based on the correlation of 28 probe sets. Remarkably, the profiles of the Normal and Tumor samples of the validation set were clearly separated from each other (see Fig. 3). As in the test set, 100% of profiles with a CC ⁇ 0.870 correspond to Tumor biopsies.
  • Probe sets were ranked from highest absolute PCC to lowest, first using the 18 samples from the test set, and then with all 36 samples from both the test set and the validation set. From the 900 probe sets selected, 694 and 473 had an absolute CC higher than 0.5 with the 18 and 36 samples, respectively; 412 probe sets had a coefficient higher than 0.5 in both cases. Interestingly, from the 28 probe sets originally selected for the biopsy classification, 19 ranked in the top 100; the other 9 probe sets had correlation values ranging from 0.359-0.703.
  • Claudin 4 a component of tight junctions, has been shown to be up-regulated in ovarian tumors together with another member of this family of transmembrane receptors, Claudin 3.
  • Claudin 3 Another member of this family of transmembrane receptors, Claudin 3.
  • Costa and colleagues have reported that levels of topoisomerase II alpha correlate with poor prognosis of ovarian surface epithelial neoplasms.
  • Kallikrein 8 has been detected by immunohistochemistry in carcinoma but not Normal ovarian tissue and was suggested as a prognostic marker of ovarian cancer. See Underwood et al., Cancer Res., Vol. 59, No. 17, pp.
  • TGF ⁇ R3 transforming growth factor beta receptor III
  • PDGFRL platelet-derived growth factor receptor-like gene
  • ST13 tumorigenicity gene
  • RECK reversion-inducing- cysteine-rich protein with kazal motif
  • PEG3 paternally expressed 3
  • biopsies and, in particular, Tumor biopsies have a substantial level of heterogeneity: tumor type, grade, percentage of tumor cells, presence of connective and fat tissues, etc.
  • We studied different types of ovarian tumors of various grades see Table 1) to search for genes involved in common pathways of tumor development and progression, rather than genes involved more specifically in certain types of tumors as previously reported. See Ono et al. (2000), supra; and Welsh et al. (2001), supra.
  • RNA abundances and activities are currently fall within three classes: ribozymes, antisense species and RNA aptamers. See Good et al., Gene Ther., Vol. 4, No. 1 , pp. 45-54 (1997). Controllable application or exposure of a cell to these entities permits controllable perturbation of RNA abundances. Ribozymes
  • Ribozymes are RNAs which are capable of catalyzing RNA cleavage reactions. See Cech, Science, Vol. 236, pp. 1532-1539 (1987); PCT International Publication WO 90/11364 (1990); Sarver et al., Science, Vol. 247, pp. 1222-1225 (1990). "Hairpin” and "hammerhead” RNA ribozymes can be designed to specifically cleave a particular target mRNA. Rules have been established for the design of short RNA molecules with ribozyme activity, which are capable of cleaving other RNA molecules in a highly sequence specific way and can be targeted to virtually all kinds of RNA. See Haseloff et al., Nature, Vol.
  • Ribozyme methods involve exposing a cell to, inducing expression in a cell, etc. of such small RNA ribozyme molecules. See Grassi and Marini, Annals of Med., Vol. 28, No. 6, pp.499-510 (1996); and Gibson, Cancer Meta. Rev., Vol. 15, pp. 287-299 (1996).
  • Ribozymes can be routinely expressed in vivo in sufficient number to be catalytically effective in cleaving mRNA, and thereby modifying mRNA abundances in a cell. See Cotton et al., EMBO J., Vol. 8, pp. 3861-3866 (1989).
  • a ribozyme coding DNA sequence designed according to the previous rules and synthesized, e.g., by standard phosphoramidite chemistry, can be ligated into a restriction enzyme site in the anticodon stem and loop of a gene encoding a tRNA, which can then be transformed into and expressed in a cell of interest by methods routine in the art.
  • an inducible promoter e.g., a glucocorticoid or a tetracycline esponse element
  • an inducible promoter e.g., a glucocorticoid or a tetracycline esponse element
  • a highly and constituently active promoter can be used.
  • tDNA genes i.e., genes encoding tRNAs, are useful in this application because of their small size, high rate of transcription and ubiquitous expression in different kinds of tissues.
  • ribozymes can be routinely designed to cleave virtually any mRNA sequence, and a cell can be routinely transformed with DNA coding for such ribozyme sequences such that a controllable and catalytically effective amount of the ribozyme is expressed. Accordingly, the abundance of virtually any RNA species in a cell can be modified or perturbed.
  • Antisense molecules can be routinely designed to cleave virtually any mRNA sequence, and a cell can be routinely transformed with DNA coding for such ribozyme sequences such that a controllable and catalytically effective amount of the ribozyme is expressed. Accordingly, the abundance of virtually any RNA species in a cell can be modified or perturbed.
  • activity of a target RNA (preferably mRNA) species can be controllably inhibited by the controllable application of antisense nucleic acids.
  • Application at high levels results in a saturating inhibition.
  • An "antisense" nucleic acid as used herein refers to a nucleic acid capable of hybridizing to a sequence-specific, e.g., non-poly A, portion of the target RNA, e.g., its translation initiation region, by virtue of some sequence complementary to a coding and/or non-coding region.
  • the antisense nucleic acids of the invention can be oligonucleotides that are double- stranded or single-stranded, RNA or DNA or a modification or derivative thereof, which can be directly administered in a controllable manner to a cell or which can be produced intracellularly by transcription of exogenous, introduced sequences in controllable quantities sufficient to perturb translation of the target RNA.
  • antisense nucleic acids are of at least six nucleotides and are preferably oligonucleotides, ranging from 6 oligonucleotides to about 200 oligonucleotides.
  • the oligonucleotide is at least 10 nucleotides, at least 15 nucleotides, at least 100 nucleotides or at least 200 nucleotides.
  • the oligonucleotides can be DNA or RNA or chimeric mixtures or derivatives or modified versions thereof, single-stranded or double- stranded.
  • the oligonucleotide can be modified at the base moiety, sugar moiety or phosphate backbone.
  • the oligonucleotide may include other appending groups, such as peptides, or agents facilitating transport across the cell membrane [see, e.g., Letsinger et al., Proc. Natl. Acad. Sci. USA, Vol. 86, pp. 6553-6556 (1989); Lemaitre et al., Proc. Natl. Acad. Sci. USA, Vol. 84, pp. 648-652 (1987); and PCT Publication No. WO 88/09810 (1988)], hybridization-triggered cleavage agents [see, e.g., Krol et al., BioTechniques, Vol. 6, pp. 958- 976 (1988)] or intercalating agents [see, e.g., Zon, Pharm. Res., Vol. 5, No. 9, pp. 539-549 (1988)].
  • an antisense oligonucleotide is provided, preferably as single-stranded DNA.
  • the oligonucleotide may be modified at any position on its structure with constituents generally known in the art.
  • the antisense oligonucleotides may comprise at least one modified base moiety which is selected from the group including, but not limited to, 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, ⁇ -D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, ⁇ -D-
  • the oligonucleotide comprises at least one modified sugar moiety selected from the group including, but not limited to, arabinose, 2-fluoroarabinose, xylulose and hexose.
  • the oligonucleotide comprises at least one modified phosphate backbone selected from the group consisting of a phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyl phosphotriester and a formacetal or analog thereof.
  • the oligonucleotide is a 2-a-anomeric oligonucleotide.
  • oligonucleotide forms specific double-stranded hybrids with complementary RNA in which, contrary to the usual B-units, the strands run parallel to each other. See Gautier et al., Nucl. Acids Res., Vol. 15, pp. 6625-6641 (1987).
  • the oligonucleotide may be conjugated to another molecule, e.g., a peptide, hybridization triggered cross-linking agent, transport agent, hybridization-triggered cleavage agent, etc.
  • the antisense nucleic acids of the invention comprise a sequence complementary to at least a portion of a target RNA species.
  • absolute complementary although preferred, is not required.
  • the ability to hybridize will depend on both the degree of complementary and the length of the antisense nucleic acid.
  • Oligonucleotides of the invention may be synthesized by standard methods known in the art, e.g., by use of an automated DNA synthesizer, such as are commercially-available from Biosearch, Applied Biosystems, etc.
  • phosphorothioate oligonucleotides may be synthesized by the method of Stein et al., Nucl. Acids Res., Vol. 16, p. 3209 (1988), methylphosphonate oligonucleotides can be prepared by use of controlled pore glass polymer supports, etc. See Sarin et al., Proc. Natl. Acad. Sci. USA, Vol. 85, pp. 7448-7451 (1988).
  • the oligonucleotide is a 2'-0-methylribonucleotide [see Inoue et al., Nucl. Acids Res., Vol. 15, pp. 6131-6148 (1987)] or a chimeric RNA-DNA analog [see Inoue et al., FEBS Lett., Vol. 215, pp. 327-330 (1987)].
  • the synthesized antisense oligonucleotides can then be administered to a cell in a controlled or saturating manner.
  • the antisense oligonucleotides can be placed in the growth environment of the cell at controlled levels where they may be taken up by the cell.
  • the uptake of the antisense oligonucleotides can be assisted by use of methods well- known in the art.
  • the antisense nucleic acids of the invention are controllably expressed intracellularly by transcription from an exogenous sequence. If the expression is controlled to be at a high level, a saturating perturbation or modification results.
  • a vector can be introduced in vivo such that it is taken up by a cell, within which cell the vector or a portion thereof is transcribed, producing an antisense nucleic acid (RNA) of the invention.
  • RNA antisense nucleic acid
  • Such a vector would contain a sequence encoding the antisense nucleic acid.
  • Such a vector can remain episomal or become chromosomally integrated, as long as it can be transcribed to produce the desired antisense RNA.
  • Vectors can be constructed by recombinant DNA technology methods standard in the art.
  • Vectors can be plasmid, viral or others known in the art, used for replication and expression in mammalian cells.
  • Expression of the sequences encoding the antisense RNAs can be by any promoter known in the art to act in a cell of interest.
  • promoters can be inducible or constitutive.
  • promoters are controllable or inducible by the administration of an exogenous moiety in order to achieve controlled expression of the antisense oligonucleotide.
  • controllable promoters include the Tet promoter.
  • promoters for mammalian cells include, but are not limited to, the SV40 early promoter region [see Bernoist and Chambon, Nature, Vol. 290, pp. 304-310 (1981)], the promoter contained in the 3' long terminal repeat of Rous sarcoma virus [see Yamamoto et al., Ceil, Vol. 22, pp. 787-797 (1980)], the herpes thymidine kinase promoter [see Wagner et al., Proc. Natl. Acad. Sci. USA, Vol. 78, pp. 1441-1445 (1981)], the regulatory sequences of the metallothionein gene, etc. [see Brinster et al., Nature, Vol. 296, pp. 39-42 (1982)].
  • antisense nucleic acids can be routinely designed to target virtually any mRNA sequence, and a cell can be routinely transformed with or exposed to nucleic acids coding for such antisense sequences such that an effective and controllable or saturating amount of the antisense nucleic acid is expressed. Accordingly the translation of virtually any RNA species in a cell can be modified or perturbed.
  • RNA Aptamers
  • RNA aptamers can be introduced into or expressed in a cell.
  • RNA aptamers are specific RNA ligands for proteins, such as for Tat and Rev RNA [see Good et al. (1997), supra] that can specifically inhibit their translation.
  • Methods of modifying protein abundances include, inter alia, those altering protein degradation rates and those using antibodies, which bind to proteins affecting abundances of activities of native target protein species. Increasing (or decreasing) the degradation rates of a protein species decreases (or increases) the abundance of that species. Methods for increasing the degradation rate of a target protein in response to elevated temperature and/or exposure to a particular drug, which are known in the art, can be employed in this invention.
  • one such method employs a heat-inducible or drug-inducible /V-terminal degron, which is an ⁇ /-terminal protein fragment that exposes a degradation signal promoting rapid protein degradation at a higher temperature, e.g., 37°C, and which is hidden to prevent rapid degradation at a lower temperature, e.g., 23°C.
  • a heat-inducible or drug-inducible /V-terminal degron which is an ⁇ /-terminal protein fragment that exposes a degradation signal promoting rapid protein degradation at a higher temperature, e.g., 37°C, and which is hidden to prevent rapid degradation at a lower temperature, e.g., 23°C.
  • a degron is Arg-DHFR ts , a variant of murine dihydrofolate reductase in which the /V-terminal Val is replaced by Arg and the Pro at position 66 is replaced with Leu.
  • a gene for a target protein, P is replaced by standard gene targeting methods known in the art [see Lodish et al., Molecular Biology of the Cell, W.H. Freeman and Co., NY, especially Chapter 8 (1995)] with a gene coding for the fusion protein Ub-Arg-DHFR te -P ("Ub” stands for ubiquitin).
  • Ub stands for ubiquitin
  • the /V-terminal ubiquitin is rapidly cleaved after translation exposing the /V-terminal degron. At lower temperatures, lysines internal to Arg-DHFR ts are not exposed, ubiquitination of the fusion protein does not occur, degradation is slow and active target protein levels are high.
  • Target protein activities can also be decreased by (neutralizing) antibodies.
  • protein abundances/activities can be modified or perturbed in a controlled or saturating manner.
  • antibodies to suitable epitopes on protein surfaces may decrease the abundance, and thereby indirectly decrease the activity, of the wild-type active form of a target protein by aggregating active forms into complexes with less or minimal activity as compared to the wild-type unaggregated wild-type form.
  • antibodies may directly decrease protein activity by, e.g., interacting directly with active sites or by blocking access of substrates to active sites.
  • (activating) antibodies may also interact with proteins and their active sites to increase resulting activity.
  • antibodies of the various types to be described
  • the effects of the antibodies can be assayed and suitable antibodies selected that raise or lower the target protein species concentration and/or activity.
  • Such assays involve introducing antibodies into a cell (see below) and assaying the concentration of the wild-type amount or activities of the target protein by standard means, such as immunoassays, known in the art.
  • the net activity of the wild-type form can be assayed by assay means appropriate to the known activity of the target protein.
  • Antibodies can be introduced into cells in numerous fashions, including, e.g., microinjection of antibodies into a cell [see Morgan et al., Immunol.
  • recombinant antibodies can be engineering and ectopically expressed in a wide variety of non-lymphoid cell types to bind to target proteins, as well as to block target protein activities. See Biocca et al., Trends Cell Biol., Vol. 5, pp. 248-252 (1995). Expression of the antibody is preferably under control of a controllable promoter, such as the Tet promoter, or a constitutively active promoter (for production of saturating perturbations).
  • a controllable promoter such as the Tet promoter, or a constitutively active promoter (for production of saturating perturbations).
  • a first step is the selection of a particular monoclonal antibody with appropriate specificity to the target protein (see below). Then sequences encoding the variable regions of the selected antibody can be cloned into various engineered antibody formats, including, e.g., whole antibody, Fab fragments, Fv fragments, single-chain Fv (ScFv) fragments (V H and V L regions united by a peptide linker), diabodies (two associated ScFv fragments with different specificities) and so forth. See Hayden et al., Curr. Opin. Immunol., Vol. 9, pp. 210-212 (1997).
  • Intracellularly-expressed antibodies of the various formats can be targeted into cellular compartments, e.g., the cytoplasm, the nucleus, the mitochondria, etc., by expressing them as fusions with the various known intracellular leader sequences. See Bradbury et al., Antibody Engineerinq, Borrebaeck, Editor, Vol. 2, pp. 295-361, IRL Press (1995).
  • the ScFv format appears to be particularly suitable for cytoplasmic targeting.
  • Antibody types include, but are not limited to, polyclonal, monoclonal, chimeric, single-chain, Fab fragments and an Fab expression library.
  • Various procedures known in the art may be used for the production of polyclonal antibodies to a target protein.
  • various host animals can be immunized by injection with the target protein, such host animals include, but are not limited to, rabbits, mice, rats, etc.
  • adjuvants can be used to increase the immunological response, depending on the host species and include, but are not limited to, Freund's (complete and incomplete); mineral gels, such as aluminum hydroxide; surface active substances, such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol; and potentially useful human adjuvants, such as Bacillus Calmette-Guerin (BCG) and corynebacterium parvum.
  • BCG Bacillus Calmette-Guerin
  • any technique that provides for the production of antibody molecules by continuous cell lines in culture may be used. Such techniques include, but are not restricted to, the hybridoma technique originally developed by Kohler and Milstein, Nature, Vol.
  • monoclonal antibodies can be produced in germ-free animals utilizing recent technology. See PCT/US90/02545.
  • human antibodies may be used and can be obtained by using human hybridomas [see Cote et al., Proc. Natl.
  • monoclonal antibodies can be alternatively selected from large antibody libraries using the techniques of phage display. See Marks et al., J. Biol. Chem., Vol. 267, pp. 16007-16010 (1992). Using this technique, libraries of up to 10 12 different antibodies have been expressed on the surface of fd filamentous phage, creating a "single pot" in vitro immune system of antibodies available for the selection of monoclonal antibodies. See Griffiths et al., EMBO J., Vol. 13, pp. 3245-3260 (1994).
  • Selection of antibodies from such libraries can be done by techniques known in the art, including contacting the phage to immobilized target protein, selecting and cloning phage bound to the target and subcloning the sequences encoding the antibody variable regions into an appropriate vector expressing a desired antibody format.
  • Antibody fragments that contain the idiotypes of the target protein can be generated by techniques known in the art.
  • such fragments include, but are not limited to, the F(ab') 2 fragment which can be produced by pepsin digestion of the antibody molecule; the Fab' fragments that can be generated by reducing the disulfide bridges of the F(ab') 2 fragment, the Fab fragments that can be generated by treating the antibody molecule with papain and a reducing agent and Fv fragments.
  • screening for the desired antibody can be accomplished by techniques known in the art, e.g., ELISA.
  • ELISA ELISA-Linked Immunosorbent Assays
  • Methods of directly modifying protein activities include, inter alia, dominant negative mutations, specific drugs or chemical moieties and also the use of antibodies, as previously discussed.
  • Dominant negative mutations are mutations to endogenous genes or mutant exogenous genes that when expressed in a cell disrupt the activity of a targeted protein species. Depending on the structure and activity of the targeted protein, general rules exist that guide the selection of an appropriate strategy for constructing dominant negative mutations that disrupt activity of that target. See Hershkowitz, Nature, Vol. 329, pp. 219-222 (1987). In the case of active monomeric forms, over expression of an inactive form can cause competition for natural substrates or ligands sufficient to significantly reduce net activity of the target protein.
  • over expression can be achieved by, e.g., associating a promoter, preferably a controllable or inducible promoter, or also a constitutively expressed promoter, of increased activity with the mutant gene.
  • a promoter preferably a controllable or inducible promoter, or also a constitutively expressed promoter
  • changes to active site residues can be made so that a virtually irreversible association occurs with the target ligand.
  • Such can be achieved with certain tyrosine kinases by careful replacement of active site serine residues. See Perlmutter et al., Curr. Opin. Immunol., Vol. 8, pp. 285-290 (1996).
  • several strategies can guide selection of a dominant negative mutant.
  • Multimeric activity can be decreased in a controlled or saturating manner by expression of genes coding exogenous protein fragments that bind to multimeric association domains and prevent multimer formation.
  • controllable or saturating over-expression of an inactive protein unit of a particular type can tie up wild-type active units in inactive multimers, and thereby decrease multimeric activity.
  • the DNA binding domain can be deleted from the DNA binding unit, or the activation domain deleted from the activation unit.
  • the DNA binding domain unit can be expressed without the domain causing association with the activation unit.
  • DNA binding sites are tied up without any possible activation of expression.
  • a particular type of unit normally undergoes a conformational change during activity
  • expression of a rigid unit can inactivate resultant complexes.
  • proteins involved in cellular mechanisms such as cellular motility, the mitotic process, cellular architecture and so forth, are typically composed of associations of many subunits of a few types. These structures are often highly sensitive to disruption by inclusion of a few monomeric units with structural defects. Such mutant monomers disrupt the relevant protein activities and can be expressed in a cell in a controlled or saturating manner.
  • mutant target proteins that are sensitive to temperature (or other exogenous factors) can be found by mutagenesis and screening procedures that are well-known in the art.
  • activities of certain target proteins can be modified or perturbed in a controlled or a saturating manner by exposure to exogenous drugs or ligands. Since the methods of this invention are often applied to testing or confirming the usefulness of various drugs to treat cancer, drug exposure is an important method of modifying/perturbing cellular constituents, both mRNAs and expressed proteins.
  • input cellular constituents are perturbed either by drug exposure or genetic manipulation, such as gene deletion or knockout; and system responses are measured by gene expression technologies, such as hybridization to gene transcript arrays (described in the following).
  • a drug that interacts with only one target protein in the cell and alters the activity of only that one target protein, either increasing or decreasing the activity.
  • Graded exposure of a cell to varying amounts of that drug thereby causes graded perturbations of network models having that target protein as an input. Saturating exposure causes saturating modification/perturbation.
  • Cyclosporin A is a very specific regulator of the calcineurin protein, acting via a complex with cyclophilin.
  • a titration series of Cyclosporin A therefore can be used to generate any desired amount of inhibition of the calcineurin protein.
  • saturating exposure to Cyclosporin A will maximally inhibit the calcineurin protein.
  • the experimental methods of this invention depend on measurements of cellular constituents.
  • the cellular constituents measured can be from any aspect of the biological state of a cell. They can be from the transcriptional state, in which RNA abundances are measured, the translation state, in which protein abundances are measured, the activity state, in which protein activities are measured.
  • the cellular characteristics can also be from mixed aspects, e.g., in which the activities of one or more proteins are measured along with the RNA abundances (gene expressions) of other cellular constituents.
  • This section describes exemplary methods for measuring the cellular constituents in drug or pathway responses. This invention is adaptable to other methods of such measurement.
  • the transcriptional state of the other cellular constituents are measured.
  • the transcriptional state can be measured by techniques of hybridization to arrays of nucleic acid or nucleic acid mimic probes, described in the next subsection, or by other gene expression technologies, described in the subsequent subsection.
  • the result is data including values representing mRNA abundance and/or ratios, which usually reflect DNA expression ratios (in the absence of differences in RNA degradation rates).
  • aspects of the biological state other than the transcriptional state such as the translational state, the activity state or mixed aspects can be measured.
  • measurements of the cellular constituents should be made in a manner that is relatively independent of when the measurement are made.
  • measurement of the transcriptional state is made by hybridization to transcript arrays, which are described in this subsection. Certain other methods of transcriptional state measurement are described later in this subsection.
  • transcript arrays also called herein “microarrays”.
  • Transcript arrays can be employed for analyzing the transcriptional state in a cell, and especially for measuring the transcriptional states of cancer cells.
  • transcript arrays are produced by hybridizing detectably-labeled polynucleotides representing the mRNA transcripts present in a cell, e.g., fluorescently- labeled cDNA synthesized from total cell mRNA, to a microarray.
  • a microarray is a surface with an ordered array of binding, e.g., hybridization, sites for products of many of the genes in the genome of a cell or organism, preferably most or almost all of the genes.
  • Microarrays can be made in a number of ways, of which several are described below. However produced, microarrays share certain characteristics. The arrays are reproducible, allowing multiple copies of a given array to be produced and easily compared with each other.
  • the microarrays are small, usually smaller than 5 cm 2 and they are made from materials that are stable under binding, e.g. nucleic acid hybridization, conditions.
  • a given binding site or unique set of binding sites in the microarray will specifically bind the product of a single gene in the cell.
  • site physical binding site
  • positionally-addressable arrays containing affixed nucleic acids of known sequence at each location are used.
  • cDNA complementary to the total cellular mRNA is hybridized to a microarray
  • the site on the array corresponding to a gene i.e., capable of specifically binding the product of the gene, that is not transcribed in the cell will have little or no signal, e.g., fluorescent signal, and a gene for which the encoded mRNA is prevalent will have a relatively strong signal.
  • Microarrays are known in the art and consist of a surface to which probes that correspond in sequence to gene products, e.g., cDNAs, mRNAs, cRNAs, polypeptides and fragments thereof, can be specifically hybridized or bound at a known position.
  • the microarray is an array, i.e., a matrix, in which each position represents a discrete binding site for a product encoded by a gene, e.g., a protein or RNA, and in which binding sites are present for products of most or almost all of the genes in the organism's genome.
  • the "binding site”, hereinafter “site”, is a nucleic acid or nucleic acid analogue to which a particular cognate cDNA can specifically hybridize.
  • the nucleic acid or analogue of the binding site can be, e.g., a synthetic oligomer, a full-length cDNA, a less than full-length cDNA or a gene fragment.
  • the microarray contains binding sites for products of all or almost all genes in the target organism's genome, such comprehensiveness is not necessarily required.
  • the microarray will have binding sites corresponding to at least about 50% of the genes in the genome, often at least about 75%, more often at least about 85%, even more often more than about 90%, and most often at least about 99%.
  • the microarray has binding sites for genes relevant to testing and confirming a biological network model of interest.
  • a "gene” is identified as an open reading frame (ORF) of preferably at least 50, 75 or 99 amino acids from which a mRNA is transcribed in the organism, e.g., if a single cell, or in some cell in a multicellular organism.
  • ORF open reading frame
  • the number of genes in a genome can be estimated from the number of mRNAs expressed by the organism, or by extrapolation from a well-characterized portion of the genome.
  • the number of ORFs can be determined and mRNA coding regions identified by analysis of the DNA sequence.
  • Saccharomyces cerevisiae genome has been completely sequenced and is reported to have approximately 6,275 ORFs longer than 99 amino acids. Analysis of these ORFs indicates that there are 5,885 ORFs that are likely to specify protein products. See Goffeau et al., Science, Vol. 274, pp. 546-567 (1996), which is incorporated by reference in its entirety for all purposes.
  • the human genome is estimated to contain approximately 10 5 genes.
  • the "binding site" to which a particular cognate cDNA specifically hybridizes is usually a nucleic acid or nucleic acid analogue attached at that binding site.
  • the binding sites of the microarray are DNA polynucleotides corresponding to at least a portion of each gene in an organism's genome. These DNAs can be obtained by, e.g., PCR amplification of gene segments from genomic DNA, cDNA, e.g., by RT-PCR, or cloned sequences.
  • PCR primers are chosen, based on the known sequence of the genes or cDNA, that result in amplification of unique fragments, i.e., fragments that do not share more than 10 bases of contiguous identical sequence with any other fragment on the microarray.
  • Computer programs are useful in the design of primers with the required specificity and optimal amplification properties. See, e.g., Oligo pi version 5.0, National Biosciences.
  • Oligo pi version 5.0 National Biosciences.
  • it will sometimes be desirable to amplify segments near the 3' end of the gene so that when oligo-dT primed cDNA probes are hybridized to the microarray, less-than-full length probes will bind efficiently.
  • each gene fragment on the microarray will be between about 50 bp and about 2000 bp, more typically between about 100 bp and about 1000 bp, and usually between about 300 bp and about 800 bp in length.
  • PCR methods are well-known and are described, e.g., in Innis et al., eds., PCR Protocols: A Guide to Methods and Applications, Academic Press Inc., San Diego, CA (1990), which is incorporated by reference in its entirety for all purposes. It will be apparent that computer-controlled robotic systems are useful for isolating and amplifying nucleic acids.
  • nucleic acid for the microarray is by synthesis of synthetic polynucleotides or oligonucleotides, e.g., using ⁇ /-phosphonate or phosphoramidite chemistries. See Froehler et al., Nucleic Acid Res., Vol. 14, pp. 5399-5407 (1986); and McBride et al., Tetrahedron Lett., Vol. 24, pp. 245-248 (1983). Synthetic sequences are between about 15 bases and about 500 bases in length, more typically between about 20 bases and about 50 bases.
  • synthetic nucleic acids include non-natural bases, e.g., inosine.
  • nucleic acid analogues may be used as binding sites for hybridization.
  • An example of a suitable nucleic acid analogue is peptide nucleic acid. See, e.g., Egholm et al., Nature, Vol. 365, pp. 566-568 (1993); and also U.S. Patent No. 5,539,083.
  • the binding (hybridization) sites are made from plasmid or phage clones of genes, cDNAs, e.g., expressed sequence tags, or inserts therefrom. See Nguyen et al., Genomics, Vol. 29, pp. 207-209 (1995).
  • the polynucleotide of the binding sites is RNA. Attaching Nucleic Acids to the Solid Surface
  • the nucleic acid or analogue are attached to a solid support, which may be made from glass, plastic, e.g., polypropylene and nylon, polyacrylamide, nitrocellulose or other materials.
  • a preferred method for attaching the nucleic acids to a surface is by printing on glass plates, as is described generally by Schena et al., Science, Vol. 270, pp. 467-470 (1995). This method is especially useful for preparing microarrays of cDNA. See, also, DeRisi et al., Nat. Genet, Vol. 14, pp. 457-460 (1996); Shalon et al., Genome Res., Vol. 6, pp.
  • a second preferred method for making microarrays is by making high-density oligonucleotide arrays. Techniques are known for producing arrays containing thousands of oligonucleotides complementary to defined sequences, at defined locations on a surface using photolithographic techniques for synthesis in situ [see Fodor et al., Science, Vol. 251 , pp. 767-773 (1991); Pease et al., Proc. Natl. Acad. Sci.
  • oligonucleotides e.g., 20 mers, of known sequence are synthesized directly on a surface such as a derivatized glass slide. Usually, the array produced is redundant, with several oligonucleotide molecules per RNA. Oligonucleotide probes can be chosen to detect alternatively spliced mRNAs.
  • microarrays may also be used. See Maskos and Southern, Nucleic Acids Res., Vol. 20, pp. 1679-1684 (1992).
  • any type of array e.g., dot blots on a nylon hybridization membrane [see Sambrook et al., Molecular Cloning-A Laboratory Manual, 2 nd Edition, Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989), which is incorporated in its entirety for all purposes], could be used, although, as will be recognized by those of skill in the art, very small arrays will be preferred because hybridization volumes will be smaller.
  • RNA is extracted from cells of the various types of interest in this invention using guanidinium thiocyanate lysis followed by CsCI centrifugation. See Chirgwin et al., Biochemistry, Vol. 18, pp. 5294-5299 (1979).
  • Poly(A) + RNA is selected by selection with oligo-dT cellulose. See Sambrook et al. (1989), supra.
  • Cells of interest include wild-type cells, drug-exposed wild-type cells, cells with modified/perturbed cellular constituent(s), and drug-exposed cells with modified/perturbed cellular constituent(s).
  • Labeled cDNA is prepared from mRNA by oligo dT-primed or random-primed reverse transcription, both of which are well-known in the art. See, e.g., Klug and Berger, Methods Enzymol., Vol. 52, pp. 316-325 (1987). Reverse transcription may be carried out in the presence of a dNTP conjugated to a detectable label, most preferably a fluorescently- labeled dNTP. Alternatively, isolated mRNA can be converted to labeled antisense RNA synthesized by in vitro transcription of double-stranded cDNA in the presence of labeled dNTPs. See Lockhart et al.
  • the cDNA or RNA probe can be synthesized in the absence of detectable label and may be labeled subsequently, e.g., by incorporating biotinylated dNTPs or rNTP, or some similar means, e.g., photo-cross-linking a psoralen derivative of biotin to RNAs, followed by addition of labeled streptavidin, e.g., phycoerythrin- conjugated streptavidin or the equivalent.
  • biotinylated dNTPs or rNTP or some similar means, e.g., photo-cross-linking a psoralen derivative of biotin to RNAs, followed by addition of labeled streptavidin, e.g., phycoerythrin- conjugated streptavidin or the equivalent.
  • fluorophores include fluorescein, lissamine, phycoerythrin, rhodamine (Perkin Elmer Cetus), Cy2, Cy3, Cy3.5, Cy5, Cy5.5, Cy7, FluorX (Amersham) and others. See, e.g., Kricka, Nonisotopic DNA Probe Techniques, Academic Press, San Diego, CA (1992). It will be appreciated that pairs of fluorophores are chosen that have distinct emission spectra so that they can be easily distinguished.
  • a label other than a fluorescent label is used.
  • a radioactive label or a pair of radioactive labels with distinct emission spectra, can be used. See Zhao et al., Gene, Vol. 156, p. 207 (1995); and Pietu et al., Genome Res., Vol. 6, p. 492 (1996).
  • use of radioisotopes is a less-preferred embodiment.
  • labeled cDNA is synthesized by incubating a mixture containing 0.5 mM dGTP, dATP and dCTP plus 0.1 mM dTTP plus fluorescent deoxyribonucleotides, e.g., 0.1 mM Rhodamine 110 UTP (Perken Elmer Cetus) or 0.1 mM Cy3 dUTP (Amersham), with reverse transcriptase, e.g., SuperScript.TM. II, LTI Inc., at 42°C for 60 minutes.
  • nucleic acid hybridization and wash conditions are chosen so that the probe "specifically binds" or “specifically hybridizes” to a specific array site, i.e., the probe hybridizes, duplexes or binds to a sequence array site with a complementary nucleic acid sequence but does not hybridize to a site with a non-complementary nucleic acid sequence.
  • one polynucleotide sequence is considered complementary to another when, if the shorter of the polynucleotides is ⁇ 25 bases, there are no mismatches using standard base-pairing rules or, if the shorter of the polynucleotides is longer than 25 bases, there is no more than a 5% mismatch.
  • the polynucleotides are perfectly complementary (no mismatches). It can easily be demonstrated that specific hybridization conditions result in specific hybridization by carrying out a hybridization assay including negative controls. See, e.g., Shalon et al. (1996), supra; and Chee et al., supra.
  • Optimal hybridization conditions will depend on the length, e.g., oligomer vs. polynucleotide >200 bases; and type, e.g., RNA, DNA and PNA, of labeled probe and immobilized polynucleotide or oligonucleotide.
  • General parameters for specific, i.e., stringent, hybridization conditions for nucleic acids are described in Sambrook et al. (1996), supra; and Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing and Wiley-lnterscience, NY (1987), which is incorporated in its entirety for all purposes.
  • the fluorescence emissions at each site of a transcript array can be, preferably, detected by scanning confocal laser microscopy.
  • a separate scan, using the appropriate excitation line, is carried out for each of the two fluorophores used.
  • a laser can be used that allows simultaneous specimen illumination at wavelengths specific to the two fluorophores and emissions from the two fluorophores can be analyzed simultaneously. See Shalon el al. (1996), supra, which is incorporated by reference in its entirety for all purposes.
  • the arrays are scanned with a laser fluorescent scanner with a computer-controlled X-Y stage and a microscope objective.
  • Sequential excitation of the two fluorophores is achieved with a multi-line, mixed gas laser and the emitted light is split by wavelength and detected with two photomultiplier tubes.
  • Fluorescence laser scanning devices are described in Schena et al. (1996), supra and in other references cited herein.
  • the fiber-optic bundle described by Ferguson et al., Nat. Biotechnol., Vol. 14, pp. 1681-1684 (1996) may be used to monitor mRNA abundance levels at a large number of sites simultaneously.
  • Signals are recorded and, in a preferred embodiment, analyzed by computer, e.g., using a 12-bit analog to digital board.
  • the scanned image is de-speckled using a graphics program, e.g., Hijaak Graphics Suite, and then analyzed using an image gridding program that creates a spreadsheet of the average hybridization at each wavelength at each site. If necessary, an experimentally determined correction for "cross talk" (or overlap) between the channels for the two fluorophores may be made.
  • a ratio of the emission of the two fluorophores is preferably calculated. The ratio is independent of the absolute expression level of the cognate gene, but is useful for genes whose expression is significantly modulated by drug administration, gene deletion or any other tested event.
  • the transcriptional state of a cell may be measured by other gene expression technologies known in the art.
  • Several such technologies produce pools of restriction fragments of limited complexity for electrophoretic analysis, such as methods combining double restriction enzyme digestion with phasing primers [see, e.g., EP 0534858 A1 (1992), Zabeau et al.], or methods selecting restriction fragments with sites closest to a defined mRNA end [see, e.g., Prashar et al., Proc. Natl. Acad. Sci. USA, Vol. 93, pp. 659-663 (1996)].
  • cDNA pools such as by sequencing sufficient bases, e.g., 20-50 bases, in each of multiple cDNAs to identify each cDNA, or by sequencing short tags, e.g., 9-10 bases, which are generated at known positions relative to a defined mRNA end pathway pattern. See, e.g., Velculescu, Science, Vol. 270, pp.484-487 (1995). Measurement of Other Aspects
  • aspects of the biological state other than the transcriptional state such as the translational state, the activity state or mixed aspects can be measured in order to obtain drug and pathway responses. Details of these embodiments are described in this section. Translational State Measurements
  • Measurement of the translational state may be performed according to several methods.
  • whole genome monitoring of protein i.e., the "proteome” [see Goffeau et al. (1996), supra]
  • antibodies are present for a substantial fraction of the encoded proteins, or at least for those proteins relevant to testing or confirming a biological network model of interest.
  • Methods for making monoclonal antibodies are well-known. See, e.g., Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor, NY (1988), which is incorporated in its entirety for all purposes.
  • monoclonal antibodies are raised against synthetic peptide fragments designed based on genomic sequence of the cell. With such an antibody array, proteins from the cell are contacted to the array and their binding is assayed with assays known in the art.
  • proteins can be separated by two-dimensional gel electrophoresis systems.
  • Two-dimensional gel electrophoresis is well-known in the art and typically involves iso-electric focusing along a first dimension followed by SDS-PAGE electrophoresis along a second dimension. See, e.g., Hames et al., Gel Electrophoresis of Proteins: A Practical Approach, IRL Press, NY (1990); Shevchenko et al., Proc. Natl. Acad. Sci. USA, Vol. 93, pp. 1440-1445 (1996); Sagliocco et al., Yeast, Vol. 12, pp. 1519-1533 (1996); Lander, Science, Vol. 274, pp.
  • the resulting electropherograms can be analyzed by numerous techniques, including mass spectrometric techniques, western blotting and immunoblot analysis using polyclonal and monoclonal antibodies, and internal and /V-terminal micro-sequencing. Using these techniques, it is possible to identify a substantial fraction of all the proteins produced under given physiological conditions, including in cells, e.g., in yeast; exposed to a drug or in cells modified by, e.g., deletion or over-expression of a specific gene.
  • response data may be formed of mixed aspects of the biological state of a cell.
  • Response data can be constructed from, e.g., changes in certain mRNA abundances, changes in certain protein abundances, and changes in certain protein activities.
  • the computation steps of the previous methods are implemented on a computer system or on one or more networked computer systems in order to provide a powerful and convenient facility for forming and testing models of biological systems.
  • the computer system may be a single hardware platform comprising internal components and being linked to external components.
  • the internal components of this computer system include processor element interconnected with a main memory.
  • computer system can be an Intel Pentium based processor of 200 Mhz or greater clock rate and with 32 MB or more of main memory.
  • the external components include mass data storage.
  • This mass storage can be one or more hard disks, which are typically packaged together with the processor and memory. Typically, such hard disks provide for at least 1 GB of storage.
  • Other external components include user interface device, which can be a monitor and keyboards, together with pointing device, which can be a "mouse" or other graphic input devices.
  • the computer system is also linked to other local computer systems, remote computer systems, or wide area communication networks, such as the Internet. This network link allows the computer system to share data and processing tasks with other computer systems.
  • Loaded into memory during operation of this system are several software components, which are both standard in the art and special to the instant invention. These software components collectively cause the computer system to function according to the methods of this invention.
  • the software component represents the operating system, which is responsible for managing the computer system and its network interconnections.
  • This operating system can be, e.g., of the Microsoft Windows family, such as Windows 95, Windows 98 or Windows NT; or a Unix operating system, such as Sun Solaris.
  • Software include common languages and functions conveniently present on this system to assist programs implementing the methods specific to this invention. Languages that can be used to program the analytic methods of this invention include C, C++ or, less preferably, JAVA.
  • the methods of this invention are programmed in mathematical software packages which allow symbolic entry of equations and high-level specification of processing, including algorithms to be used, thereby freeing a user of the need to procedurally program individual equations or algorithms.
  • Such packages include, e.g., Matlab from Mathworks (Natick, Mass.), Mathematica from Wolfram Research (Champaign, IL) and MathCAD from Mathsoft (Cambridge, MA).
  • the analytic software component actually comprises separate software components which interact with each other.
  • Analytic software represents a database containing all data necessary for the operation of the system.
  • Analytic software includes a data reduction and computation component comprising one or more programs which execute the analytic methods of the invention.
  • Analytic software also includes a user interface which provides a user of the computer system with control and input of test network models, and, optionally, experimental data.
  • the user interface may comprise a drag-and-drop interface for specifying hypotheses to the system.
  • the user interface may also comprise means for loading experimental data from the mass storage component, e.g., the hard drive; from removable media, e.g., floppy disks or CD-ROM; or from a different computer system communicating with the instant system over a network, e.g., a local area network, or a wide area communication network, such as the Internet.
  • a network e.g., a local area network, or a wide area communication network, such as the Internet.
  • Absolute CC values are shown for expression levels analyzed in the 18 test samples only (R1) or in all 36 samples (R2).
  • Probe Set Probe set
  • Probe set Probe set Name R1 R2 name R1 R2 name R1 R2 37628 at 0.865 0.808 37529 at 0.669 0.494 201 s at 0.577 0.571 41859 at 0.865 0.877 32175 at 0.669 0.486 774_g_at 0.576 0.465 38120 at 0.852 0.768 35753 at 0.667 0.535 40998 at 0.576 0.539 32664 at 0.848 0.749 38875 r at 0.667 0.495 41772 at 0.573 0.485 35717 at 0.847 0.783 32779 s at 0.665 0.599 40522 at 0.572 0.544 34257 at 0.846 0.786 41385 at 0.665 0.594 41768 at 0.571 0.513 38220 at 0.844 0.820 1319 at 0.664 0.580 37828 at 0.569 0.677 40423 at

Landscapes

  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Pathology (AREA)
  • Immunology (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Veterinary Medicine (AREA)
  • Animal Behavior & Ethology (AREA)
  • Oncology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Public Health (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Hospice & Palliative Care (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Medicinal Chemistry (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)

Abstract

This invention relates to the use of genomic analysis to detect the presence of ovarian cancer in a patient from a sample of tissue or blood and to kits for carrying out this determination. In addition this invention relates to methods to treat a patient with ovarian cancer.

Description

GENES REGULATED IN OVARIAN CANCER AS PROGNOSTIC AND THERAPEUTIC TARGETS
TECHNICAL FIELD
[01] The present invention belongs to the fields of medicine and relates to the use of genomic analysis to evaluate and treat ovarian cancer. In particular, this invention relates to the measurement of patterns of gene expression to determine the presence of ovarian cancer in a patients tissues.
BACKGROUND ART
[02] Ovarian cancer is one of the most common types of cancer that affects women in the United States, with a lifetime risk of approximately 1/70. See Whittemore, Gynecol. Oncol., Vol. 55, No. 3, Part 2, pp. S15-S19 (1994). It is a rapidly fatal disease usually detected late, with still no good method of prevention. The greatest risk factor for ovarian cancer is a family history of the disease, suggesting the strong influence of genetics. See Schildkraut and Thompson, Am. J. Epidemiol., Vol. 128, No. 3, pp. 456-466 (1988). Other factors such as demographic, lifestyle and reproductive factors have also been shown to contribute to the risk of ovarian cancer.
[03] Several microarray expression analyses of ovarian biopsies and cell lines have been conducted to identify genes specifically over-expressed in ovarian cancers. See Schummer et al., Gene, Vol. 238, No. 2, pp. 375-385 (1999). Other studies have tried to correlate gene expression levels with specific tumor types. See Bayani et al., Cancer Res., Vol. 62, No. 12, pp. 3466-3476 (2002); Welsh et al., Proc. Natl. Acad. Sci. USA, Vol. 98, No. 3, pp. 1176-1181 (2001); and Ono et al., Cancer Res., Vol. 60, No. 18, pp. 5007-5011 (2000).
[04] These kinds of studies, aimed at increasing our understanding of the molecular mechanism of tumor development and in some cases at better classifying tumors, has provided a list of genes with few overlaps between analyses. Some technical differences may explain in part the apparent lack of consistency or low reproducibility between studies: quality of samples, amplification of the messenger ribonucleic acid (mRNA) and different microarray platforms. However, it is likely that the heterogeneity of the tumors is a key factor that contributes to the differences observed between studies, in particular, those where few tumors are analyzed. Furthermore, comparison of gene expression levels on microarray experiments have historically been done using ratios of signal intensity (fold change), with limited use of statistical methods and a lack of validation with additional samples.
[05] However, genes apparently expressed at high levels, or with the biggest change in expression, may not always be the most relevant; it is conceivable that a small disruption of the very tight regulation of genes may have dramatic consequences, even when the level of expression is low.
[06] Thus there is a need for the identification of genes whose expression rates are consistently and reliably altered in ovarian cancer. Such a list could provide new insight into ovarian tumor development and progression, and suggest potential new drug targets, and biomarkers for diagnosis, monitoring and treatment of the disease.
DISCLOSURE OF INVENTION
[07] In the present invention, the application of a combination of statistical tests and the recently described leave-one-out method [see van't Veer et al., Nature, Vol. 415, No. 6871, pp. 530-536 (2002)], allows the analyze of expression profiles of tumors and normal ovarian tissues and for these patterns to also be determined in gene expression products in various body fluids including, but not limited to, blood and serum. See van't Veer et al., supra. [08] The study of two independent sets of samples, a test set and a validation set, confirms the involvement of several known genes with ovarian tumor development, but also identify novel genes. These findings provide new insight into ovarian tumor development and progression, and suggest potential new drug targets, and biomarkers for diagnosis and monitoring of the disease.
[09] In one embodiment, this invention provides a method to determine if a patient is afflicted with ovarian cancer comprising: a) obtaining a sample from the said patient; b) determining the levels of gene expression of two or more of the genes listed in Table 9 in the sample from the patient; c) comparing the levels of gene expression of the two or more genes determined in (b) to the levels of the same genes listed in Table 1 ; d) determining the degree of similarity (DOS) between the levels of gene expression of the two or more genes determined in (c); and e) determining from the DOS between the level of gene expression of the two or more genes the probability that the sample shows evidence of the presence of ovarian cancer in the patient. [10] In a preferred embodiment, this invention provides a method wherein the levels of gene expression are determined for a subset of the genes listed in Table 9 comprising genes
Nos. 1-28 in Table 9.
[11] In another embodiment, the invention employs a sample comprising cells obtained from the patient. These may be cells removed from a solid tumor in the said patient or, in a preferred embodiment, the sample comprises blood cells and serum drawn from the said patient. In a most preferred embodiment, the sample comprises a body fluid drawn from the patient.
[12] In a preferred embodiment, this invention employs a method of determining the level of gene expression comprising measuring the levels of protein expression product in the sample from the patient. This may be done in a variety of ways including, but not limited to, detecting the presence and level of the protein expression products using a reagent which specifically binds with the proteins, wherein the reagent may be selected from the group consisting of an antibody, an antibody derivative and an antibody fragment.
[13] In another embodiment, this invention provides a method wherein the levels of expression in the sample are assessed by measuring the levels in the sample of the transcribed polynucleotides of the two or more gene in Table 9. These transcribed polynucleotide may be mRNA or complementary DNA (cDNA).
[14] In a preferred embodiment, this method would further include the step of amplifying the transcribed polynucleotide.
[15] In another embodiment, this invention includes a method of treating a subject afflicted with ovarian cancer, the method comprising providing to cells of the subject an antisense oligonuceotide complimentary to one or more of the genes whose expression is up-regulated in ovarian cancer as shown in Table 8.
[16] In addition, this invention provides a method of inhibiting ovarian cancer in a subject at risk for developing ovarian cancer, the method comprising inhibiting expression of one or more of the genes shown in Table 8 to be up-regulated in ovarian cancer. [17] This invention also provides kits for use in determining treatment strategy for a patient with suspected ovarian cancer comprising: a) a number (for example, two or more) of antibodies able to recognize and bind to the polypeptide expression product of the two or more of the genes in Table 9; b) a container suitable for containing the said antibodies and a sample of body fluid from the said individual wherein the antibody can contact the polypeptide expressed by the two or more genes shown in Table 9 if they are present; c) means to detect the combination of the said antibodies with the polypeptides expressed by the two or more genes shown in Table 9; and d) instructions for use and interpretation of the kit results.
[18] In another embodiment, this invention provides a kit for use in determining the presence or absence of ovarian cancer in a patient comprising: a) a number (for example, two or more) of polynucleotides able to recognize and bind to the mRNA expression product of the two or more genes shown in Table 9; b) a container suitable for containing the said polynucleotides and a sample of body fluid from the said individual wherein the said polynucleotide can contact the mRNA, if it is present; c) means to detect the levels of combination of the said polynucleotide with the mRNA from the two or more genes shown in Table 9; and d) instructions for use and interpretation of the kit results.
BRIEF DESCRIPTION OF DRAWINGS
[19] FIG. 1(A). Re-Classification of Samples Using Increasing Number of Probe Sets.
[20] FIG. 1 (B). Plot of Errors as a Function of Number of Probe Sets for Determination of
Optimum Number of Classification Genes. Calculated values by increasing number of individual probe sets from the top 6 to the top 55 (A) or to all 900 (B). Arrow indicates the minimum number of probe sets (N=28) that minimizes misclassification.
[21] FIG. 2. Determination of a Threshold CC Value for Classification of Ovarian Status.
[22] FIG. 3. Correlation of Test and Validation Biopsy Profiles with Mean Normal Profile for Different Size Probe Sets. N or T represent Normal or Tumor status, respectively, "r" is the PCC value of the probe set profile of the corresponding biopsy sample with the mean
Normal profile (Group 1). Samples are ordered from highest CC to lowest. [23] FIG. 4. Correlation of Biopsy Profiles with Mean of AH Normal Profiles for Different Size Probe Sets. N or T represent Normal or Tumor status, respectively, "r" is the PCC value of the probe set profile of the corresponding biopsy sample with the mean profile of all Normal samples. Samples are ordered from highest CC to lowest.
MODES FOR CARRYING OUT THE INVENTION
[24] The present invention provides methods to determine whether or not a sample from a patient including, but not limited to, biopsy tissue or blood, serum or some other body fluid from a patient, contains evidence of the presence of ovarian cancer in the patient.
[25] This invention is based, in part, on the discovery of approximately 900 genes which are differentially expressed in tissue from ovarian cancer as compared to normal tissue. This methods of this invention comprise measuring the activities of the approximately 900 or fewer genes that are shown to be differently-expressed in ovarian cancer as compared to normal tissue.
[26] In a preferred embodiment, only a small fraction of the 900 genes would be measured. These measurements, could, in various embodiments, be in the tissue itself from biopsies, etc., or in preferred embodiments could be performed as more indirect measurement of gene expression including, but not limited to, cRNA or polypeptide expression products in various tissues including blood or other body fluids.
[27] The measurements, direct or indirect, of the rates of expression of two or more of these 900 genes from an individual whose tissues status was unknown could then be compared to the expression values for the same two or more genes measured in ovarian cancer tissue or normal tissue.
[28] The "degree of similarity" (DOS) of the unknown two or more gene expression values to the cancer tissue versus normal tissue would then be determined.
[29] This DOS could be determined by any procedure that produces a result whose value is a known function of the DOS between the two groups of numbers, i.e., the measured gene expression values of the two or more genes in tissue from an individual whose ovarian cancer status is unknown and to be determined and the measured gene expression values for the same two or more genes from individuals whose tissue is known to contain ovarian cancer and from individuals whose tissue is known not to contain ovarian cancer. [30] As used herein the term "DOS" shall mean the extent to which the pattern of gene expression values are alike or numerically similar, as measured by a comparison of the values of gene expression determined by direct or indirect methods. [31] In a preferred embodiment, the DOS would be determined by a mathematical calculation resulting in a correlation coefficient (CC). In a particularly preferred embodiment, the Pearson Correlation Coefficient (PCC) would be determined but any other mathematical procedure that produces a result whose value is a known function of the DOS between the two groups of numbers could be used.
[32] The value of the DOS (PCC), so calculated, can then be directly related to the probability that the tissue sample is from a patient who does or does not have ovarian cancer. That is to say, the higher the patients' DOS (CC or PCC) as compared to the gene expression values from a patient who does not have ovarian cancer or the higher the DOS (CC or PCC) as compared to the gene expression values from a patient who does have ovarian cancer then the greater the probability that the patient does not or does have ovarian cancer, respectively.
[33] Thus, in a given case, the value of the DOS can be used to determine probabilities for the presence of ovarian cancer. Those of skill in the art will understand that the clinical circumstance for each patient will dictate the value of the DOS (PCC) to be used as a cutoff or to help make clinical decisions with regard to a specific patient. For example, in one embodiment, it is desirable to determine with optimal accuracy the number of a group of patients who have ovarian cancer. This means to minimize both false positives (No Ovarian Cancer misclassified as Ovarian Cancer) and at the same time to minimize false negatives (Ovarian Cancer misclassified as No Ovarian Cancer ).
[34] In one preferred embodiment of the present invention, this would work as shown in FIG. 3, using the 28 predictor probe set (as described below) if the gene expression profile correlates with the mean normal (No Ovarian Cancer) profile with a CC ≤0.920 the tissue sample is 63 times more likely to contain ovarian cancer then if the CC >0.920 [odds ratio (OR) = 63 with 95% confidence interval (Cl): 3.3-1194.7].
[35] To use this threshold in one embodiment of this invention, a patient whose gene expression profile when compared with the mean No Ovarian Cancer expression profile achieves a PCC of >0.920 would be classified in the No Ovarian Cancer group and would be presumed not to have ovarian cancer, while a patient whose expression profile was had a PCC of ≤0.920 would be classified in the Ovarian Cancer group and would be assumed to have ovarian cancer with a high probability. [36] In a further preferred embodiment, the PCC can be set to produce optional sensitivity. That is, to make the smallest possible number of false negatives (Ovarian Cancer misclassified as No Ovarian Cancer). Such an optimal sensitivity setting would be indicated in situations where the occurrence of ovarian cancer must be ruled out with the greatest certainty obtainable. In this embodiment, the threshold is determined by setting the PCC to >0.955. In this case, in the example given, using the 28 predictor probes shown in Table 9 (probe sets 1-28 shown in Table 9), 100% of patients with a CC of >0.955 as compared to the No Ovarian Cancer group did not have ovarian cancer and 100% of the patients whose CC were <0.870, as compared to the No Ovarian Cancer group, did have ovarian cancer. [37] As is shown in the example, one of skill in the art can choose a PCC that will either maximize sensitivity or maximize specificity or produce any desired ratio of false positives or false negatives. One of skill in the art can easily adjust their choice of PCC to the clinical situation to provide maximum benefit and safety to the patient.
[38] Another aspect of the of the invention are methods to treat ovarian cancer. These methods consist of various efforts to suppress the excess gene expression of the genes that have been found to be up-regulated in ovarian cancer. These genes are shown in Table 8. Methods to decrease the excess expression of these gene would include, but not be limited to, use of antisense DNA, siRNA and methods to complex and deactivate the protein expression products of these over-expressed genes. Methods of Measurement
[39] In some embodiments of this invention, the gene expression of a selected group of the 900 genes is determined by measuring mRNA levels from tissue samples as described below.
[40] In some embodiments, the gene expression can be measured more indirectly by measuring polypeptide gene expression products in tissues including, but not limited to, tumor and blood tissue.
[41] In some embodiments, gene expression is measured by identifying the presence or amount of one or more proteins encoded by one of the genes listed in Table 9. [42] The present invention also provides systems for detecting two or more markers of interest, e.g., two or more markers from Table 2. For example, where it is determined that a finite set of particular markers provides relevant information, a detection system is provided that detects the finite set of markers. For example, as opposed to detecting all genes expressed in a tissue with a generic microarray, a defined microarray or other detection technology is employed to detect the plurality, e.g., 28, 42, etc., of markers that define a biological condition, e.g., the presence or absence of ovarian cancer, etc. [43] The present invention is not limited by the method in which gene expression biomarkers are detected or measured. In some embodiments, mRNA, cDNA or protein is detected in tissue samples, e.g., biopsy samples. In other embodiments, mRNA, cDNA or protein is detected in bodily fluids, e.g., serum, plasma, urine or saliva. A preferred embodiment of the invention provides that the method of the invention is performed ex vivo. The present invention further provides kits for the detection of these relevant gene expression biomarkers.
[44] In some preferred embodiments, protein or the polypeptide expression product is detected. Protein expression may be detected by any suitable method. In some embodiments, proteins are detected by binding of an antibody specific for the protein. For example, in some embodiments, antibody binding is detected using a suitable technique including, but not limited to, radioimmunoassay, enzyme-linked immunosorbant assay (ELISA), "sandwich" immunoassays, immunoradiometric assays, gel diffusion precipitation reactions, immunodiffusion assays, in situ immunoassays, e.g., using colloidal gold, enzyme or radioisotope labels, e.g., Western blots, precipitation reactions, agglutination assays, e.g., gel agglutination assays, hemagglutination assays, etc., complement fixation assays, immunofluorescence assays, protein A assays, immunoelectrophoresis assays and proteomic assays, such as the use of gel electrophoresis coupled to mass spectroscopy to identify multiple proteins in a sample.
[45] In one embodiment, antibody binding is detected by detecting a label on the primary antibody. In another embodiment, the primary antibody is detected by detecting binding of a secondary antibody or reagent to the primary antibody. In a further embodiment, the secondary antibody is labeled. Many methods are known in the art for detecting binding in an immunoassay and are within the scope of the present invention. [46] In some embodiments, an automated detection assay is utilized. Methods for the automation of immunoassays include, but are not limited to, those described in U.S. Patent Nos. 5,885,530; 4,981,785; 6,159,750; and 5,358,691, each of which is herein incorporated by reference. In some embodiments, the analysis and presentation of results is also automated. For example, in some embodiments, software that generates a diagnosis and/or prognosis based on the presence or absence of a series of proteins corresponding to markers is utilized. [47] In other embodiments, the immunoassay described in U.S. Patent Nos. 5,599,677 and 5,672,480, each of which is herein incorporated by reference, is utilized. In other embodiments, proteins are detected by immunohistochemistry. In still other embodiments, markers are detected at the level of cDNA or RNA.
[48] As used herein, the term "gene expression biomarkers" shall mean any biologic marker which can indicate the rate or degree of gene expression of a specific gene including, but not limited to, mRNA, cDNA or the polypeptide expression product of the specific gene. [49] In some embodiments of the present invention, gene expression biomarkers are detected using a PCR-based assay. In yet other embodiments, reverse-transcriptase PCR (RT-PCR) is used to detect the expression of RNA. In RT-PCR, RNA is enzymatically converted to cDNA using a reverse-transcriptase enzyme. The cDNA is then used as a template for a PCR reaction. PCR products can be detected by any suitable method including, but not limited to, gel electrophoresis and staining with a DNA-specific stain or hybridization to a labeled probe.
[50] In some embodiments, the quantitative RT-PCR with standardized mixtures of competitive templates method described in U.S. Patent Nos. 5,639,606; 5,643, 765; and 5,876,978, each of which is herein incorporated by reference, is utilized. [51] In preferred embodiments of the present invention, gene expression biomarkers are detected using a hybridization assay. In a hybridization assay, the presence or absence of a marker is determined based on the ability of the nucleic acid from the sample to hybridize to a complementary nucleic acid molecule, e.g., an oligonucleotide probe. A variety of hybridization assays are available.
[52] In some embodiments, hybridization of a probe to the sequence of interest is detected directly by visualizing a bound probe, e.g., a Northern or Southern assay. See, e.g., Ausabel et al., eds., Current Protocols in Molecular Biology, John Wiley & Sons, NY (1991). In these assays, DNA (Southern) or RNA (Northern) is isolated. The DNA or RNA is then cleaved with a series of restriction enzymes that cleave infrequently in the genome and not near any of the markers being assayed. The DNA or RNA is then separated, e.g., on an agarose gel, and transferred to a membrane. A labeled probe or probes, e.g., by incorporating a radionucleotide, is allowed to contact the membrane under low-, medium- or high-stringency conditions. Unbound probe is removed and the presence of binding is detected by visualizing the labeled probe.
[53] In some embodiments, the DNA chip assay is a GeneChip (Affymetrix, Santa Clara, CA). See, e.g., U.S. Patent Nos. 6,045,996; 5,925,525; and 5,858,659, each of which is herein incorporated by reference. The GeneChip technology uses miniaturized, high-density arrays of oligonucleotide probes affixed to a "chip". Probe arrays are manufactured by Affymetrix's light-directed chemical synthesis process, which combines solid-phase chemical synthesis with photolithographic fabrication techniques employed in the semiconductor industry. Using a series of photolithographic masks to define chip exposure sites, followed by specific chemical synthesis steps, the process constructs high-density arrays of oligonucleotides, with each probe in a predefined position in the array. Multiple probe arrays are synthesized simultaneously on a large glass wafer. The wafers are then diced, and individual probe arrays are packaged in injection-molded plastic cartridges, which protect them from the environment and serve as chambers for hybridization. [54] The nucleic acid to be analyzed is isolated, amplified by PCR and labeled with a fluorescent reporter group. The labeled DNA is then incubated with the array using a fluidics station. The array is then inserted into the scanner, where patterns of hybridization are detected. The hybridization data are collected as light emitted from the fluorescent reporter groups already incorporated into the target, which is bound to the probe array. Probes that perfectly match the target generally produce stronger signals than those that have mismatches. Since the sequence and position of each probe on the array are known, by complementary, the identity of the target nucleic acid applied to the probe array can be determined.
[55] In other embodiments, a DNA microchip containing electronically captured probes (Nanogen, San Diego, CA) is utilized. See, e.g., U.S. Patent Nos. 6,017,696; 6,068,818; and 6,051,380, each of which are herein incorporated by reference. Through the use of microelectronics, Nanogen's technology enables the active movement and concentration of charged molecules to and from designated test sites on its semiconductor microchip. DNA capture probes unique to a given gene expression biomarkers are electronically placed at, or "addressed" to, specific sites on the microchip. Since nucleic acid molecules have a strong negative charge, they can be electronically moved to an area of positive charge. [56] In still further embodiments, an array technology based upon the segregation of fluids on a flat surface (chip) by differences in surface tension (ProtoGene, Palo Alto, CA) is utilized. See, e.g., U.S. Patent Nos. 6,001,311; 5,985,551; and 5,474,796, each of which is herein incorporated by reference. Protogene's technology is based on the fact that fluids can be segregated on a flat surface by differences in surface tension that have been imparted by chemical coatings. Once so segregated, oligonucleotide probes are synthesized directly on the chip by ink-jet printing of reagents. [57] In yet other embodiments, a "bead array" is used for the detection of gene expression biomarkers (lllumina, San Diego, CA). See, e.g., PCT Publications WO 99/67641 and WO 00/39587, each of which is herein incorporated by reference, lllumina uses a BEAD ARRAY technology that combines fiber optic bundles and beads that self-assemble into an array. Each fiber optic bundle contains thousands to millions of individual fibers depending on the diameter of the bundle. The beads are coated with an oligonucleotide specific for the detection of a given marker. Batches of beads are combined to form a pool specific to the array. To perform an assay, the BEAD ARRAY is contacted with a prepared sample. Hybridization is detected using any suitable method.
[58] In some preferred embodiments of the present invention, hybridization is detected by enzymatic cleavage of specific structures, e.g., INVADER™ assay, Third Wave Technologies. See, e.g., U.S. Patent Nos. 5,846,717, 6,090, 543; 6,001,567; 5,985,557; and 5,994,069, each of which is herein incorporated by reference. In some embodiments, hybridization of a bound probe is detected using a TaqMan assay (PE Biosystems, Foster City, CA). See, e.g., U.S. Patent Nos. 5,962,233 and 5,538,848, each of which is herein incorporated by reference. The assay is performed during a PCR reaction. The TaqMan assay exploits the 5'-3' exonuclease activity of DNA polymerases, such as AMPLITAQ DNA polymerase. A probe, specific for a given marker, is included in the PCR reaction. The probe consists of an oligonucleotide with a 5'-reporter dye, e.g., a fluorescent dye and a 3'-quencher dye. During PCR, if the probe is bound to its target, the 5'-3' nucleolytic activity of the AMPLITAQ polymerase cleaves the probe between the reporter and the quencher dye. The separation of the reporter dye from the quencher dye results in an increase of fluorescence. The signal accumulates with each cycle of PCR and can be monitored with a fluorimeter.
[59] Additional detection assays that are produced and utilized using the systems and methods of the present invention include, but are not limited to, enzyme mismatch cleavage methods, e.g., Variagenics (see U.S. Patent Nos. 6,110,684; 5,958,692; and 5,851,770, herein incorporated by reference in their entireties); branched hybridization methods, e.g., Chiron (see U.S. Patent Nos. 5,849,481; 5,710,264; 5,124,246; and 5,624,802, herein incorporated by reference in their entireties); rolling circle replication (see, e.g., U.S. Patent Nos. 6,210,884 and 6,183,960, herein incorporated by reference in their entireties); NASBA (see, e.g., U.S. Patent No. 5,409,818, herein incorporated by reference in its entirety); molecular beacon technology (see, e.g., U.S. Patent No. 6,150,097, herein incorporated by reference in its entirety); E-sensor technology (see Motorola, U.S. Patent Nos. 6,248,229; 6,221,583; 6,013,170; and 6,063,573, herein incorporated by reference in their entireties); cycling probe technology (see, e.g., U.S. Patent Nos. 5,403,711; 5,011,769; and 5,660,988, herein incorporated by reference in their entireties); ligase chain reaction [see Barnay, Proc. Natl. Acad. Sci. USA, Vol. 88, pp. 189-93 (1991)]; and sandwich hybridization methods (see, e.g., U.S. Patent No. 5,288,609, herein incorporated by reference in its entirety). [60] In some embodiments, mass spectroscopy is used to detect gene expression biomarkers. For example, in some embodiments, a MASSARRAY™ system (Sequenom, San Diego, CA) is used to detect gene expression biomarkers. See, e.g., U.S. Patent Nos. 6,043,031; 5,777,324; and 5,605,798, each of which is herein incorporated by reference.
[61] In some embodiments, the present invention provides kits for the identification, characterization and quantitation of gene expression biomarkers. In some embodiments, the kits contain antibodies specific for gene expression biomarkers, in addition to detection reagents and buffers. In other embodiments, the kits contain reagents specific for the detection of nucleic acid, e.g., oligonucleotide probes or primers. In preferred embodiments, the kits contain all of the components necessary to perform a detection assay, including all controls, directions for performing assays and any necessary software for analysis and presentation of results. In some embodiments, the kits contain instructions including a statement of intended use as required by the Environmental Protection Agency or U.S. Food and Drug Administration (FDA) for the labeling of in vitro diagnostic assays and/or of pharmaceutical or food products.
[62] Comparison of the organism's gene expression pattern, with the result expressed in Table 9, would indicate whether the organism has a gene expression profile which may indicate that the organism does or does not contain ovarian cancer. [63] In another embodiment, the present invention is a method of screening a test compound for the ability to inhibit, retard, reverse or mimic the gene expression changes characteristic of ovarian cancer. In a typical example of this embodiment, one would first treat a test mammal known to have ovarian cancer with a test compound and then analyze a representative tissue of the mammal for the level of expression of the genes or sequences which change in expression in response to ovarian cancer. Preferably, the tissue is biopsy material from the tumor or, in a preferred embodiment, an easily obtainable tissue, such as blood or serum.
[64] One then compares the analysis of the tissue with a control mammal known to have ovarian cancer but not given the test compound and thereby identifies test compounds that are capable of modifying the expression of the gene expression biomarkers sequences in the mammalian samples such that the expression is altered toward the No Ovarian Cancer pattern.
[65] In another embodiment of the present invention, one would use the sequences of the genes disclosed in Table 2 for a therapy for mimicking the No Ovarian Cancer state. In general, one would try to amplify gene expression for the genes identified herein as under- expressed in ovarian cancer and decrease the expression of genes identified herein as over- expressed in ovarian cancer. For example, one might try to decrease the expression of genes or sequences identified in Table 2 as increased or increase the expression of genes found to be decreased in ovarian cancer.
[66] Methods of increasing and decreasing expression would be known to one of skill in the art. Examples for supplementation of expression would include supplying the organism with additional copies of the gene. A preferred example for decreasing expression would include RNA antisense technologies or pharmaceutical intervention. The genes disclosed in Table 2 would be appropriate drug development targets. One would use the information presented in the present application for drug development by using currently existing, or by developing, pharmaceutical compounds that either mimic or inhibit the activity of the genes listed in Table 2, or the proteins encoded by these genes. Therefore, the gene expression biomarkers or genes disclosed herein represent targets for pharmaceutical development and gene therapy or RNA antisense therapy with the goal of suppressing the changes characteristic of ovarian cancer at the molecular level. These gene expression alterations may also play a role in understanding the various mechanisms that underlie ovarian cancer. Additionally, these genes represent biomarkers of ovarian cancer that can be used for diagnostic purposes.
[67] The present invention is not limited by the form of the expression profile. In some embodiments, the expression profile is maintained in computer software. In some embodiments, the expression profile is written material. The present invention is not limited by the number of markers provided or displayed in an expression profile. For example, the expression profile may comprise two or more markers found in Table 2, indicating a biological status of a sample.
[68] The present invention further provides databases comprising expression information, e.g., expression profiles comprising one or more markers from Table 2 from one or more samples. In some embodiments, the databases find use in data analysis including, but not limited to, comparison of markers to one or more public or private information databases, e.g., OMIM, GenBank, BLAST, Molecular Modeling Databases, Medline, genome databases, etc. In some such embodiments, an automated process is carried out to automatically associate information obtained from data obtained using the methods of the present invention to information in one or more of public or private databases. Associations find use, e.g., in making expression correlations to phenotypes, e.g., disease states. [69] We also understand the present invention to be extended to mammalian homologues of the mouse genes listed in Table 9. One of skill in the art could easily investigate homologues in other mammalian species by identifying particular genes with sufficiently high homology to the genes listed in Table 9. By "high homology" we mean that the homology is at least 50% overall (within the entire gene or protein) either at the nucleotide or amino acid level. List of Abbreviations
A Absent NCBI National Center for Biotechnology
AvgDiff Average Difference (overall intensity Information of probe set on Affymetrix array) Neg Negative
CHTN Cooperative Human Tissue Network nM Nanometer
Cl Confidence Interval OMIM Online Mendelian Inheritance in Man
FIGO Federation of Gynecology and OR Odds Ratio Obstetrics ORF Open Reading Frame
GAPDH Glyceraldehyde 3-phosphate P Present dehydrogenase PG Pharmacogenetics
GNF Genomics Institute of the Novartis Pos Positive Research Foundation QC Quality Control mg Milligram RNA Ribonucleic Acid
EXAMPLE 1 Preferred Methods [70] To identify genes involved in the development and progression of ovarian tumors, we compared the gene expression profiles of a series of Normal and Tumor ovarian biopsies. Gene expression data for more than 12,000 genes were generated from each sample. Of the 900 probe sets that we observed to be most differentially-expressed between the Normal and cancerous ovarian biopsies, 98% were down-regulated in the Tumor biopsies. Using 8 Normal and 10 Tumor samples, we identified a minimum number of probe sets (28) that could be used to classify biopsies as Normal or Tumor. This finding was validated on a second set of biopsies (4 Normal and 14 Tumor) previously profiled by another laboratory. A mean Normal ovarian profile was established that could be used as a reference to compare other ovarian biopsies. The identification of the most differentially-expressed genes between
Normal and Tumor ovarian biopsies may provide new insight into the molecular mechanisms of ovarian tumor development and progression. Some of the genes identified in this study are known to be involved in ovarian cancer, but a large proportion represents novel candidates for drug targets and molecular biomarkers to diagnose or monitor disease and treatment.
Materials and Methods
Samples
[71] Flash-frozen ovarian biopsies were obtained from Asterand (Detroit, Ml), and consisted of 10 Tumor samples and 10 adjacent Normal tissues. Total RNA was also purchased for 4 additional samples from Ambion (Austin, TX) and Stratagene (La Jolla, CA).
Gene expression profiles from samples used in the validation step had been previously generated at GNF and reported. See Welsh et al. (2001), supra.
[72] Most of the tumors analyzed were malignant surface epithelial serous tumors, e.g., papillary cystcarcinoma, papillary cystadenocarcinoma or papillary cystcarcinoma; others included a mucinous cyst carcinoma, an endometrioid adenocarcinoma and a mature teratoma.
[73] A summary of sample information is shown in Table 1 below.
Figure imgf000016_0001
Figure imgf000017_0001
Mote: Paired samples (Normal and Tumor adjacent tissue) obtained from the same patient are boxed together. Stages of ovarian cancers are indicated using the FIGO staging system.
RNA Expression Profiling
[74] Total RNA was extracted from each biopsy and processed as previously described. RNA extraction techniques are well-known to those of skill in the art. All samples profiled were processed using the Affymetrix GENECHIP™ system as recommended by Affymetrix (GeneChip Expression Analysis Technical Manual, rev. 1 , July 2001 ). Concentration and total amount of RNA and cRNA were estimated by measuring the samples at 260 nM and
280 nM wavelengths using a Beckman-Coulter DU 650 spectrophotometer after a 1:50 dilution of the samples (see Table 2). The type of array used for this study was the Human
Genome U95Av2 (http://www.affvmetrix.com/products/arravs/specific/hau95.affx).
Analytical Strategy
[75] Analysis of the expression profiles was performed in several steps described below.
Selection of microarray data of highest quality
[76] We used for our analysis only microarrays for which the scaling factor was lower than 6, and where more than 30% of the probe sets were called "Present" by the Affymetrix
MAS 4.0 algorithm.
Selection of a subset of probe sets
[77] Expression data were directly imported into the GENE SPRING® program (Silicon
Genetics, Redwood City, CA) from the database. Genes expressed in only a few samples were eliminated; out of the 12,627 probe sets on the microarray, only those with an AvgDiff of at least 100 in 10% of the samples or more were used for further analysis. A clustering experiment was performed to visualize the different gene expression profiles of Normal and
Tumor biopsies.
[78] Further filtering was accomplished by eliminating probe sets of low quality or very low intensity signals in both groups of samples (Group 1 : Normal biopsies; Group 2: Tumor biopsies). Probe sets not called "Present" (P) in at least 75% of the samples in one of the two groups were not used for further analysis. In addition, AvgDiff values lower than 20 were all converted to a value of 20.
Focus on the most differentially-expressed genes
[79] Selection of genes differentially-expressed between the two groups of samples was done in 2 steps: 1. The AvgDiff of each probe set was compared between the 2 groups of samples by a non-parametric one-way ANOVA test, using SAS 8.2. 2. The AvgDiff of each probe sets with p<0.05 was then correlated with the group of samples (Normal or Tumor). Probe sets were ranked from highest absolute PCC to lowest (calculated in Microsoft Excel). Re-classification of samples
[80] We used the "leave-one-out" analytical strategy previously described to determine the optimal number of probe sets that distinguished an ovarian tumor from a normal ovarian tissue. See van't Veer et al. (2002), supra.
[81] For every sample left-out, we determined the average AvgDiff of each probe set in each group of samples (Groups 1 and 2). PCCs between the expression profile of the left- out sample and the average profile of each group were calculated for each probe set. The effectiveness of each probe set in distinguishing a tumor from a normal ovarian tissue was evaluated by re-classifying each sample as Normal or Tumor based on the higher of the two CCs.
[82] We determined the number of misclassified samples when using increasingly larger sets of genes (starting with 5). As used herein, a "false Neg" is defined as a Tumor incorrectly classified as a Normal ovary tissue, and inversely, a "false Pos" is defined as a Normal tissue incorrectly classified as a Tumor.
[83] The probe sets that most-effectively distinguished tumor from normal ovarian tissue were then tested in their ability to classify gene expression profiles of a different set of ovarian tissues (Normal and Tumor) generated at GNF (see Table 1). Statistical determination of OR
[84] Using the desired threshold correlation value, a 2 x 2 table was constructed indicating the number of biopsies correctly and incorrectly identified as Normal or Tumor. ORs, along with 95% Cls, were calculated using SAS version 8.2. Statistical significance was determined using a Fisher's exact test with p-value cut-off of 0.05. Genes
[85] The link between a probe set name and a GenBank Accession Number was provided by Affymetrix, together with a short gene description. We complemented and updated this description by a search of the NCBI databases, mainly LocusLink (http://www.ncbi.nlm.nih.gov/LocusLink/index.html). OMIM (http://www.ncbi.nlm.nih.gov/Omim/searchomim.html) and PubMed (http://www.ncbi.nlm.nih.gov/entrez/guery.fcgi).
Results
RNA Expression Profiling
[86] Eighteen out of the 20 biopsies yielded more than the 5 mg of purified total RNA necessary to process the samples further (see Table 2). Sample p6175, from which less than 1 mg of purified total RNA was obtained, was the smallest sample (38 mg). The quality of the RNA was assessed by electrophoresis on a 1 % agarose gel. The absence of both 28S and 18S ribosomal RNA bands was observed for samples p6169 and p6180, indicating some RNA degradation. For microarray hybridization, a maximum of 15 mg of cRNA was used when available, but no less than 12 mg. Enough cRNA was available for 21 samples (see Table 2). Quality Assessment
[87] The data from the 21 arrays hybridized in PG (this study) and from the 18 hybridized previously at GNF were checked for quality (see Table 3). See Welsh et al. (2001 ), supra. All but 3 (p6169, p6180 and p6185) passed our criteria of a scaling factor lower than 6, with more than 30 % of probe sets called "P" (see Table 3). The 36 remaining expression profiles were separated into 2 sets: a test set of 18 profiles generated in PG consisting of data from 8 Normal and 10 Tumor biopsies, and a validation set of 18 profiles previously generated at GNF from 4 Normal and 14 Tumor biopsies. See Welsh et al. (2001), supra.
Analysis
Clustering Analysis
[88] Expression data of the 18 samples of the test set were imported into the GENESPRING® software. Out of the 12,627 probe sets on the Affymetrix U95A microarray, 2, 174 had an AvgDiff of at least 100 in 2 or more of these 18 samples and were used for clustering analysis. The resulting clustering tree of samples and probe sets is shown in Fig. 1. Interestingly, the dendogram of experiments contains two main branches corresponding to the two groups of samples, Normal (top) and Tumor (bottom) biopsies. The vast majority (>90%) of genes examined have an overall higher expression in the Normal ovarian tissues. The dendogram of probe sets shows only a small cluster of genes with higher expression in the Tumor tissues (left part). Selection of the most differentially-expressed genes
[89] Out of the 2,174 probe sets, 217 were excluded from further analysis because they provided large number of "A" or "marginal" calls (>75% in both groups). [90] Data for the remaining 1 ,957 probe sets were exported into SAS version 8.2 for non- parametric one-way ANOVA testing between the Normal and the Tumor groups. A total of 900 probe sets had AvgDiff values significantly different between the two groups (p<0.05). These genes are listed in Table 9. [91] The AvgDiff of these 900 probe sets was then correlated with the two groups of samples (Group 1 : Normal; Group 2: Tumor). The absolute PCC (R)-values ranged from 0.042-0.877, with 694 probe sets (77%) with a R-value higher than 0.5. The AvgDiff data of the 900 probe sets ranked from highest absolute PCC to lowest are available in Appendix 1. Leave-one-out method and re-classification of samples
[92] The "leave-one-out" analytical strategy previously described was applied to the 18 ovarian samples for the expression of the 900 selected probe sets. See van't Veer et al. (2002), supra.
[93] The number of misclassified samples when using the first 5 probe sets was 6 (2 false Pos and 4 false Neg). Increasingly large sets of genes were used. The number of misclassifications varied between 2 and 7, with the minimum achieved when using the first 28 probe sets (Fig. 1). These first 28 probe sets displayed only one false Pos and one false Neg. Interestingly, perfect classification of Normal biopsies (0 false Pos) was achieved with the first 32 probe sets (which also detected 2 false Neg), while perfect classification of Tumor biopsies (0 false Neg) was never seen. Optimal classification set and correlation threshold values
[94] We determined the mean Normal (No Tumor) biopsy profile for the classification probe sets, to be used as a reference for analysis of biopsies of unknown or questionable status; we expected that tumor heterogeneity may not allow the determination of a reference Tumor profile. We examined the classification value of the first 28 probe sets, by comparing their expression for each of the 18 samples to the mean Normal profile calculated using all 8 Normal biopsy profiles. Samples were then ranked by correlation values from highest to lowest and error rates were determined as a function of where the threshold correlation was drawn. The results are displayed in Fig. 2. The minimum number of incorrectly assigned samples was 2 [1 false Pos (p6177) and 1 false Neg (p6168)]. The corresponding CC value was between 0.920 and 0.921. The OR and Fisher's exact test were performed based on the number of samples correctly and incorrectly predicted to be Normal or Tumor. The difference between the observed and expected biopsy status was significant: OR = 63; 95% Cl: 3.3-1194.7, p=0.0029. The OR indicates that an ovarian biopsy of the test set is nearly 63 times more likely to be from an ovarian tumor if its expression profile of the 28 predictor probe sets correlates with the mean Normal profile with a CC ≤0.920 (see Table 4). In our test set, 100% of profiles with a CC >0.955 correspond to Normal biopsies and 100% of profiles with a CC <0.870 correspond to Tumor biopsies (see Fig. 3). Validation of the mean Normal profile
[95] The 28 probe sets selected by the leave-one-out method allowed us to distinguish Normal from Tumor ovarian biopsies in our series of 18 ovarian samples. We then tested if independent ovarian biopsies could be correctly classified by comparing their expression profile to the same mean Normal profile of the 28 classification probe sets. [96] Fig. 3 summarizes the classification of all ovarian biopsies based on the correlation of 28 probe sets. Remarkably, the profiles of the Normal and Tumor samples of the validation set were clearly separated from each other (see Fig. 3). As in the test set, 100% of profiles with a CC <0.870 correspond to Tumor biopsies. Interestingly, Normal profiles had lower CCs than in the test set, and the threshold correlation value that best separate Normal and Tumor biopsies in the validation set lies between 0.762 and 0.876. [97] We performed a non-parametric t-test between the average Normal profile of the test set and the average Normal profile of the validation set. Similarly, we compared the average Tumor profiles of both sets. Since no statistical difference was observed (p=0.373 and p=0.110, respectively), we combined both sets to increase the classification value of the 28 probe sets. We compared the expression for all the samples to the mean Normal profile calculated using all 12 Normal biopsy profiles (8 from the test set and 4 from the validation test). Results confirm that correlation values provide highly-significant separation of the Normal biopsy from the Tumor biopsy profiles (see Fig. 4 and Table 5). As seen previously, the profile of Tumor sample (p6168) had a high correlation with the average Normal profile. Correlation between individual gene expression and biopsy status [98] The selection of probe sets for the classification of ovarian biopsies was originally done based on the profile of the 18 test samples. The good separation of all 36 Normal and Tumor samples (see Fig.4) with the same probe sets suggested that the genes selected by our method are differentially-expressed in many other ovarian tumors. However, because of tumor heterogeneity, the difference in individual gene expression is likely to vary with the samples analyzed. We evaluated to what extent the 900 probe sets differentially-expressed in the 18 test samples, were also differentially-expressed when all 36 biopsies were analyzed.
[99] Probe sets were ranked from highest absolute PCC to lowest, first using the 18 samples from the test set, and then with all 36 samples from both the test set and the validation set. From the 900 probe sets selected, 694 and 473 had an absolute CC higher than 0.5 with the 18 and 36 samples, respectively; 412 probe sets had a coefficient higher than 0.5 in both cases. Interestingly, from the 28 probe sets originally selected for the biopsy classification, 19 ranked in the top 100; the other 9 probe sets had correlation values ranging from 0.359-0.703.
Genes differentially-expressed between Normal and Tumor ovarian biopsies Genes up-regulated in ovarian tumors
[100] Among the genes differentially expressed between Normal and Tumor ovarian biopsies, we detected a few genes already known to be up-regulated in ovarian tumors, such as the genes coding for Claudin 4, topoisomerase II alpha, Kallikrein 8, osteopontin, as well as potential new markers of ovarian cancers (see Table 6).
[101] Claudin 4, a component of tight junctions, has been shown to be up-regulated in ovarian tumors together with another member of this family of transmembrane receptors, Claudin 3. See Hough et al., Cancer Res., Vol. 60, No. 22, pp. 6281-6287 (2000). Costa and colleagues have reported that levels of topoisomerase II alpha correlate with poor prognosis of ovarian surface epithelial neoplasms. Kallikrein 8 has been detected by immunohistochemistry in carcinoma but not Normal ovarian tissue and was suggested as a prognostic marker of ovarian cancer. See Underwood et al., Cancer Res., Vol. 59, No. 17, pp. 4435-4439 (1999); and Magklara et al., din. Cancer Res., Vol. 7, No. 4, pp. 806-811 (2001 ). Osteopontin has also been previously proposed as a diagnostic biomarker for ovarian cancer. See Kim et al., JAMA, Vol. 287, No. 13, pp. 1671-1679 (2002). Another gene, C20ORF1, has been shown to be expressed in lung carcinoma cell lines but not in normal lung tissues. See Manda et al., Genomics, Vol. 61, No. 1, pp. 5-14 (1999). Other genes that may have be over-expressed in only some of the biopsies due to the tumor type, the disease stage or other tumor specificity, were not detected by our analytical method. Genes down-regulated in ovarian tumors
[102] We further examined a large number of genes down-regulated in the ovarian tumor biopsies profiled. For analysis purpose, we classified the 28 probe sets and the top 100 down-regulated genes in 8 categories based on the known or suspected function of their product (see Tables 7 and 8). Interestingly, the function of nearly 30% of these 100 genes is still unknown. Most of the other genes play a role in, or are already suspected to be involved in transcription regulation (16 genes), in cell cycle regulation, growth differentiation, cell death or tumor suppression (12 genes) and signal transduction (6 genes). This list includes several potential tumor suppressors: the gene coding for the transforming growth factor beta receptor III (TGFβR3), a platelet-derived growth factor receptor-like gene (PDGFRL), the suppression of tumorigenicity (ST13) gene, a gene coding for a reversion-inducing- cysteine-rich protein with kazal motif (RECK) and the paternally expressed 3 (PEG3) gene. [103] This observation suggests that the genes with still unidentified function are likely to be involved in cell cycle regulation, growth differentiation, signal transduction or transcription regulation. Some of them may act as tumor suppressors. Down-regulation in ovarian tumor or cell lines had been reported for just one of these genes, IGFBP5 which in our study was detected with 2 separate probe sets (see Table 8). See Welsh et al. (2001 ), supra. [104] Only 6 genes coding for proteins of the extracellular matrix were noticed including laminin alpha 2 (LAMα2). Yang and colleagues have reported that transient loss of LAMoώ in the basement membrane of the pre-malignant epithelium and subsequent inactivation of Dab2 are common early event associated with tumorigenicity of the ovarian surface epithelium. See Yang et al., Cancer, Vol. 94, No. 9, pp. 2380-2392 (2002). Interestingly, down-regulation of Dab2 (probe set 479_at) was also observed in our study with a CC value of 0.49.
[105] Taken together, these results indicated that most of the genes with a statistically significant decreased expression in the ovarian biopsies, are indeed involved in the development or progression of the tumors rather than detected because of a change in cell population or tissue organization, e.g., loss of connective tissue and fat cells.
Discussion
[106] The filtering and analytical methods that we used here, provided a list of genes differentially expressed between Normal and Tumor ovarian samples. We showed that a small subset (28-42 probe sets) is sufficient to accurately classify ovarian biopsies as Normal or Tumor based on their expression profiles. Validation of this expression signature was done on different biopsies profiled in an independent laboratory, and confirms that the difference in expression observed between the Normal or Tumor samples reflects a biological process rather than of a laboratory or analytical error. [107] Several factors not examined here that may affect the detection of differentially expressed genes include the number of samples in the test set, and the heterogeneity of the samples studied. Indeed, it is expected that biopsies and, in particular, Tumor biopsies, have a substantial level of heterogeneity: tumor type, grade, percentage of tumor cells, presence of connective and fat tissues, etc. We studied different types of ovarian tumors of various grades (see Table 1) to search for genes involved in common pathways of tumor development and progression, rather than genes involved more specifically in certain types of tumors as previously reported. See Ono et al. (2000), supra; and Welsh et al. (2001), supra.
[108] Our clustering analysis of the biopsy expression profiles, revealed that the vast majority of genes that differentiate Normal and Tumor samples were down-regulated in the tumors. Indeed, when the 900 most differentially-expressed probes were ranked based on the CC between their expression in all 36 biopsies and the Normal and Tumor status, the top 220 probes (R from 0.865-0.644) were down-regulated in the tumors. We examined more closely the function of the top 100 genes (R from 0.865-0.72), and the top 10 genes over- expressed in the tumors (R from 0.643-0.443). Among the most differentially-expressed genes, we detected several genes already known to be up-regulated in ovarian tumors, as well as potential new markers of ovarian cancers. However, most of the genes were down- regulated, most likely because we studied various types of late stage tumors of different origins, different grades and different tumor cell content. The involvement of many of these genes in transcription regulation, in cell cycle regulation, growth differentiation, signal transduction, cell death or tumor suppression underscores the need to further evaluate their role in ovarian cancer. The list of other genes of still unknown function points to novel potential players in tumor development and progression. Methods of Modifying RNA Abundances or Activities
[109] Methods of modifying RNA abundances and activities currently fall within three classes: ribozymes, antisense species and RNA aptamers. See Good et al., Gene Ther., Vol. 4, No. 1 , pp. 45-54 (1997). Controllable application or exposure of a cell to these entities permits controllable perturbation of RNA abundances. Ribozymes
[110] Ribozymes are RNAs which are capable of catalyzing RNA cleavage reactions. See Cech, Science, Vol. 236, pp. 1532-1539 (1987); PCT International Publication WO 90/11364 (1990); Sarver et al., Science, Vol. 247, pp. 1222-1225 (1990). "Hairpin" and "hammerhead" RNA ribozymes can be designed to specifically cleave a particular target mRNA. Rules have been established for the design of short RNA molecules with ribozyme activity, which are capable of cleaving other RNA molecules in a highly sequence specific way and can be targeted to virtually all kinds of RNA. See Haseloff et al., Nature, Vol. 334, pp. 585-591 (1988); Koizumi et al., FEBS Lett., Vol. 228, pp. 228-230 (1988); and Koizumi et al., FEBS Lett., Vol. 239, pp. 285-288 (1988). Ribozyme methods involve exposing a cell to, inducing expression in a cell, etc. of such small RNA ribozyme molecules. See Grassi and Marini, Annals of Med., Vol. 28, No. 6, pp.499-510 (1996); and Gibson, Cancer Meta. Rev., Vol. 15, pp. 287-299 (1996).
[111] Ribozymes can be routinely expressed in vivo in sufficient number to be catalytically effective in cleaving mRNA, and thereby modifying mRNA abundances in a cell. See Cotton et al., EMBO J., Vol. 8, pp. 3861-3866 (1989). In particular, a ribozyme coding DNA sequence, designed according to the previous rules and synthesized, e.g., by standard phosphoramidite chemistry, can be ligated into a restriction enzyme site in the anticodon stem and loop of a gene encoding a tRNA, which can then be transformed into and expressed in a cell of interest by methods routine in the art. Preferably, an inducible promoter, e.g., a glucocorticoid or a tetracycline esponse element, is also introduced into this construct so that ribozyme expression can be selectively controlled. For saturating use, a highly and constituently active promoter can be used. tDNA genes, i.e., genes encoding tRNAs, are useful in this application because of their small size, high rate of transcription and ubiquitous expression in different kinds of tissues. Therefore, ribozymes can be routinely designed to cleave virtually any mRNA sequence, and a cell can be routinely transformed with DNA coding for such ribozyme sequences such that a controllable and catalytically effective amount of the ribozyme is expressed. Accordingly, the abundance of virtually any RNA species in a cell can be modified or perturbed. Antisense molecules
[112] In another embodiment, activity of a target RNA (preferably mRNA) species, specifically its rate of translation, can be controllably inhibited by the controllable application of antisense nucleic acids. Application at high levels results in a saturating inhibition. An "antisense" nucleic acid as used herein refers to a nucleic acid capable of hybridizing to a sequence-specific, e.g., non-poly A, portion of the target RNA, e.g., its translation initiation region, by virtue of some sequence complementary to a coding and/or non-coding region. The antisense nucleic acids of the invention can be oligonucleotides that are double- stranded or single-stranded, RNA or DNA or a modification or derivative thereof, which can be directly administered in a controllable manner to a cell or which can be produced intracellularly by transcription of exogenous, introduced sequences in controllable quantities sufficient to perturb translation of the target RNA.
[113] Preferably, antisense nucleic acids are of at least six nucleotides and are preferably oligonucleotides, ranging from 6 oligonucleotides to about 200 oligonucleotides. In specific aspects, the oligonucleotide is at least 10 nucleotides, at least 15 nucleotides, at least 100 nucleotides or at least 200 nucleotides. The oligonucleotides can be DNA or RNA or chimeric mixtures or derivatives or modified versions thereof, single-stranded or double- stranded. The oligonucleotide can be modified at the base moiety, sugar moiety or phosphate backbone. The oligonucleotide may include other appending groups, such as peptides, or agents facilitating transport across the cell membrane [see, e.g., Letsinger et al., Proc. Natl. Acad. Sci. USA, Vol. 86, pp. 6553-6556 (1989); Lemaitre et al., Proc. Natl. Acad. Sci. USA, Vol. 84, pp. 648-652 (1987); and PCT Publication No. WO 88/09810 (1988)], hybridization-triggered cleavage agents [see, e.g., Krol et al., BioTechniques, Vol. 6, pp. 958- 976 (1988)] or intercalating agents [see, e.g., Zon, Pharm. Res., Vol. 5, No. 9, pp. 539-549 (1988)].
[114] In a preferred aspect of the invention, an antisense oligonucleotide is provided, preferably as single-stranded DNA. The oligonucleotide may be modified at any position on its structure with constituents generally known in the art.
[115] The antisense oligonucleotides may comprise at least one modified base moiety which is selected from the group including, but not limited to, 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, β-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, β-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- isopentenyladenine, uraciI-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5- oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-/V- 2-carboxypropyl) uracil, (acp3)w and 2,6-diaminopurine.
[116] In another embodiment, the oligonucleotide comprises at least one modified sugar moiety selected from the group including, but not limited to, arabinose, 2-fluoroarabinose, xylulose and hexose.
[117] In yet another embodiment, the oligonucleotide comprises at least one modified phosphate backbone selected from the group consisting of a phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyl phosphotriester and a formacetal or analog thereof. [118] In yet another embodiment, the oligonucleotide is a 2-a-anomeric oligonucleotide. An a-anomeric oligonucleotide forms specific double-stranded hybrids with complementary RNA in which, contrary to the usual B-units, the strands run parallel to each other. See Gautier et al., Nucl. Acids Res., Vol. 15, pp. 6625-6641 (1987). [119] The oligonucleotide may be conjugated to another molecule, e.g., a peptide, hybridization triggered cross-linking agent, transport agent, hybridization-triggered cleavage agent, etc.
[120] The antisense nucleic acids of the invention comprise a sequence complementary to at least a portion of a target RNA species. However, absolute complementary, although preferred, is not required. A sequence "complementary to at least a portion of an RNA", as referred to herein, means a sequence having sufficient complementary to be able to hybridize with the RNA, forming a stable duplex; in the case of double-stranded antisense nucleic acids, a single strand of the duplex DNA may thus be tested or triplex formation may be assayed. The ability to hybridize will depend on both the degree of complementary and the length of the antisense nucleic acid. Generally, the longer the hybridizing nucleic acid, the more base mismatches with a target RNA it may contain and still form a stable duplex (or triplex, as the case may be). One skilled in the art can ascertain a tolerable degree of mismatch by use of standard procedures to determine the melting point of the hybridized complex. The amount of antisense nucleic acid that will be effective in the inhibiting translation of the target RNA can be determined by standard assay techniques. [121] Oligonucleotides of the invention may be synthesized by standard methods known in the art, e.g., by use of an automated DNA synthesizer, such as are commercially-available from Biosearch, Applied Biosystems, etc. As examples, phosphorothioate oligonucleotides may be synthesized by the method of Stein et al., Nucl. Acids Res., Vol. 16, p. 3209 (1988), methylphosphonate oligonucleotides can be prepared by use of controlled pore glass polymer supports, etc. See Sarin et al., Proc. Natl. Acad. Sci. USA, Vol. 85, pp. 7448-7451 (1988). In another embodiment, the oligonucleotide is a 2'-0-methylribonucleotide [see Inoue et al., Nucl. Acids Res., Vol. 15, pp. 6131-6148 (1987)] or a chimeric RNA-DNA analog [see Inoue et al., FEBS Lett., Vol. 215, pp. 327-330 (1987)].
[122] The synthesized antisense oligonucleotides can then be administered to a cell in a controlled or saturating manner. For example, the antisense oligonucleotides can be placed in the growth environment of the cell at controlled levels where they may be taken up by the cell. The uptake of the antisense oligonucleotides can be assisted by use of methods well- known in the art. Antisense Molecules Expressed Intracellularlv
[123] In an alternative embodiment, the antisense nucleic acids of the invention are controllably expressed intracellularly by transcription from an exogenous sequence. If the expression is controlled to be at a high level, a saturating perturbation or modification results. For example, a vector can be introduced in vivo such that it is taken up by a cell, within which cell the vector or a portion thereof is transcribed, producing an antisense nucleic acid (RNA) of the invention. Such a vector would contain a sequence encoding the antisense nucleic acid. Such a vector can remain episomal or become chromosomally integrated, as long as it can be transcribed to produce the desired antisense RNA. Such vectors can be constructed by recombinant DNA technology methods standard in the art. Vectors can be plasmid, viral or others known in the art, used for replication and expression in mammalian cells. Expression of the sequences encoding the antisense RNAs can be by any promoter known in the art to act in a cell of interest. Such promoters can be inducible or constitutive. Most preferably, promoters are controllable or inducible by the administration of an exogenous moiety in order to achieve controlled expression of the antisense oligonucleotide. Such controllable promoters include the Tet promoter. Other usable promoters for mammalian cells include, but are not limited to, the SV40 early promoter region [see Bernoist and Chambon, Nature, Vol. 290, pp. 304-310 (1981)], the promoter contained in the 3' long terminal repeat of Rous sarcoma virus [see Yamamoto et al., Ceil, Vol. 22, pp. 787-797 (1980)], the herpes thymidine kinase promoter [see Wagner et al., Proc. Natl. Acad. Sci. USA, Vol. 78, pp. 1441-1445 (1981)], the regulatory sequences of the metallothionein gene, etc. [see Brinster et al., Nature, Vol. 296, pp. 39-42 (1982)].
[124] Therefore, antisense nucleic acids can be routinely designed to target virtually any mRNA sequence, and a cell can be routinely transformed with or exposed to nucleic acids coding for such antisense sequences such that an effective and controllable or saturating amount of the antisense nucleic acid is expressed. Accordingly the translation of virtually any RNA species in a cell can be modified or perturbed. RNA Aptamers
[125] Finally, in a further embodiment, RNA aptamers can be introduced into or expressed in a cell. RNA aptamers are specific RNA ligands for proteins, such as for Tat and Rev RNA [see Good et al. (1997), supra] that can specifically inhibit their translation. Methods of Modifying Protein Abundances
[126] Methods of modifying protein abundances include, inter alia, those altering protein degradation rates and those using antibodies, which bind to proteins affecting abundances of activities of native target protein species. Increasing (or decreasing) the degradation rates of a protein species decreases (or increases) the abundance of that species. Methods for increasing the degradation rate of a target protein in response to elevated temperature and/or exposure to a particular drug, which are known in the art, can be employed in this invention. For example, one such method employs a heat-inducible or drug-inducible /V-terminal degron, which is an Λ/-terminal protein fragment that exposes a degradation signal promoting rapid protein degradation at a higher temperature, e.g., 37°C, and which is hidden to prevent rapid degradation at a lower temperature, e.g., 23°C. See Dohmen et al., Science, Vol. 263, pp. 1273-1276 (1994). Such an exemplary degron is Arg-DHFRts, a variant of murine dihydrofolate reductase in which the /V-terminal Val is replaced by Arg and the Pro at position 66 is replaced with Leu. According to this method, e.g., a gene for a target protein, P, is replaced by standard gene targeting methods known in the art [see Lodish et al., Molecular Biology of the Cell, W.H. Freeman and Co., NY, especially Chapter 8 (1995)] with a gene coding for the fusion protein Ub-Arg-DHFRte-P ("Ub" stands for ubiquitin). The /V-terminal ubiquitin is rapidly cleaved after translation exposing the /V-terminal degron. At lower temperatures, lysines internal to Arg-DHFRts are not exposed, ubiquitination of the fusion protein does not occur, degradation is slow and active target protein levels are high. At higher temperatures (in the absence of methotrexate), lysines internal to Arg-DHFRts are exposed, ubiquitination of the fusion protein occurs, degradation is rapid and active target protein levels are low. This technique also permits controllable modification of degradation rates since heat activation of degradation is controllably blocked by exposure methotrexate. This method is adaptable to other /V-terminal degrons which are responsive to other inducing factors, such as drugs and temperature changes. Modifying Protein Activity With Antibodies
[127] Target protein activities can also be decreased by (neutralizing) antibodies. By providing for controlled or saturating exposure to such antibodies, protein abundances/activities can be modified or perturbed in a controlled or saturating manner. For example, antibodies to suitable epitopes on protein surfaces may decrease the abundance, and thereby indirectly decrease the activity, of the wild-type active form of a target protein by aggregating active forms into complexes with less or minimal activity as compared to the wild-type unaggregated wild-type form. Alternately, antibodies may directly decrease protein activity by, e.g., interacting directly with active sites or by blocking access of substrates to active sites. Conversely, in certain cases, (activating) antibodies may also interact with proteins and their active sites to increase resulting activity. In either case, antibodies (of the various types to be described) can be raised against specific protein species (by the methods to be described) and their effects screened. The effects of the antibodies can be assayed and suitable antibodies selected that raise or lower the target protein species concentration and/or activity. Such assays involve introducing antibodies into a cell (see below) and assaying the concentration of the wild-type amount or activities of the target protein by standard means, such as immunoassays, known in the art. The net activity of the wild-type form can be assayed by assay means appropriate to the known activity of the target protein. [128] Antibodies can be introduced into cells in numerous fashions, including, e.g., microinjection of antibodies into a cell [see Morgan et al., Immunol. Today, Vol. 9, pp. 84-86 (1988)] or transforming hybridoma mRNA encoding a desired antibody into a cell [see Burke et al., Cell, Vol. 36, pp. 847-858 (1984)]. In a further technique, recombinant antibodies can be engineering and ectopically expressed in a wide variety of non-lymphoid cell types to bind to target proteins, as well as to block target protein activities. See Biocca et al., Trends Cell Biol., Vol. 5, pp. 248-252 (1995). Expression of the antibody is preferably under control of a controllable promoter, such as the Tet promoter, or a constitutively active promoter (for production of saturating perturbations). A first step is the selection of a particular monoclonal antibody with appropriate specificity to the target protein (see below). Then sequences encoding the variable regions of the selected antibody can be cloned into various engineered antibody formats, including, e.g., whole antibody, Fab fragments, Fv fragments, single-chain Fv (ScFv) fragments (VH and VL regions united by a peptide linker), diabodies (two associated ScFv fragments with different specificities) and so forth. See Hayden et al., Curr. Opin. Immunol., Vol. 9, pp. 210-212 (1997). Intracellularly-expressed antibodies of the various formats can be targeted into cellular compartments, e.g., the cytoplasm, the nucleus, the mitochondria, etc., by expressing them as fusions with the various known intracellular leader sequences. See Bradbury et al., Antibody Engineerinq, Borrebaeck, Editor, Vol. 2, pp. 295-361, IRL Press (1995). In particular, the ScFv format appears to be particularly suitable for cytoplasmic targeting.
[129] Antibody types include, but are not limited to, polyclonal, monoclonal, chimeric, single-chain, Fab fragments and an Fab expression library. Various procedures known in the art may be used for the production of polyclonal antibodies to a target protein. For production of the antibody, various host animals can be immunized by injection with the target protein, such host animals include, but are not limited to, rabbits, mice, rats, etc. Various adjuvants can be used to increase the immunological response, depending on the host species and include, but are not limited to, Freund's (complete and incomplete); mineral gels, such as aluminum hydroxide; surface active substances, such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol; and potentially useful human adjuvants, such as Bacillus Calmette-Guerin (BCG) and corynebacterium parvum. [130] For preparation of monoclonal antibodies directed towards a target protein, any technique that provides for the production of antibody molecules by continuous cell lines in culture may be used. Such techniques include, but are not restricted to, the hybridoma technique originally developed by Kohler and Milstein, Nature, Vol. 256, pp. 495-497 (1975), the trioma technique, the human B-cell hybridoma technique [see Kozbor et al., Immunol. Today, Vol. 4, p. 72 (1983)] and the EBV hybridoma technique to produce human monoclonal antibodies [see Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 (1985)]. In an additional embodiment of the invention, monoclonal antibodies can be produced in germ-free animals utilizing recent technology. See PCT/US90/02545. According to the invention, human antibodies may be used and can be obtained by using human hybridomas [see Cote et al., Proc. Natl. Acad. Sci. USA, Vol. 80, pp. 2026-2030 (1983)], or by transforming human B cells with EBV virus in vitro [see Cole et al. (1985), supra]. In fact, according to the invention, techniques developed for the production of "chimeric antibodies" [see Morrison et al., Proc. Natl. Acad. Sci. USA, Vol. 81, pp. 6851-6855 (1984); Neuberger et al., Nature, Vol. 312, pp. 604-608 (1984); Takeda et al., Nature, Vol. 314, pp. 452-454 (1985)] by splicing the genes from a mouse antibody molecule specific for the target protein together with genes from a human antibody molecule of appropriate biological activity can be used; such antibodies are within the scope of this invention.
[131] Additionally, where monoclonal antibodies are advantageous, they can be alternatively selected from large antibody libraries using the techniques of phage display. See Marks et al., J. Biol. Chem., Vol. 267, pp. 16007-16010 (1992). Using this technique, libraries of up to 1012 different antibodies have been expressed on the surface of fd filamentous phage, creating a "single pot" in vitro immune system of antibodies available for the selection of monoclonal antibodies. See Griffiths et al., EMBO J., Vol. 13, pp. 3245-3260 (1994). Selection of antibodies from such libraries can be done by techniques known in the art, including contacting the phage to immobilized target protein, selecting and cloning phage bound to the target and subcloning the sequences encoding the antibody variable regions into an appropriate vector expressing a desired antibody format.
[132] According to the invention, techniques described for the production of single-chain antibodies (see U.S. Patent No. 4,946,778) can be adapted to produce single-chain antibodies specific to the target protein. An additional embodiment of the invention utilizes the techniques described for the construction of Fab expression libraries [see Huse et al., Science, Vol. 246, pp. 1275-1281 (1989)] to allow rapid and easy identification of monoclonal Fab fragments with the desired specificity for the target protein.
[133] Antibody fragments that contain the idiotypes of the target protein can be generated by techniques known in the art. For example, such fragments include, but are not limited to, the F(ab')2 fragment which can be produced by pepsin digestion of the antibody molecule; the Fab' fragments that can be generated by reducing the disulfide bridges of the F(ab')2 fragment, the Fab fragments that can be generated by treating the antibody molecule with papain and a reducing agent and Fv fragments.
[134] In the production of antibodies, screening for the desired antibody can be accomplished by techniques known in the art, e.g., ELISA. To select antibodies specific to a target protein, one may assay generated hybridomas or a phage display antibody library for an antibody that binds to the target protein. Methods of Modifying Protein Activities
[135] Methods of directly modifying protein activities include, inter alia, dominant negative mutations, specific drugs or chemical moieties and also the use of antibodies, as previously discussed.
[136] Dominant negative mutations are mutations to endogenous genes or mutant exogenous genes that when expressed in a cell disrupt the activity of a targeted protein species. Depending on the structure and activity of the targeted protein, general rules exist that guide the selection of an appropriate strategy for constructing dominant negative mutations that disrupt activity of that target. See Hershkowitz, Nature, Vol. 329, pp. 219-222 (1987). In the case of active monomeric forms, over expression of an inactive form can cause competition for natural substrates or ligands sufficient to significantly reduce net activity of the target protein. Such over expression can be achieved by, e.g., associating a promoter, preferably a controllable or inducible promoter, or also a constitutively expressed promoter, of increased activity with the mutant gene. Alternatively, changes to active site residues can be made so that a virtually irreversible association occurs with the target ligand. Such can be achieved with certain tyrosine kinases by careful replacement of active site serine residues. See Perlmutter et al., Curr. Opin. Immunol., Vol. 8, pp. 285-290 (1996). [137] In the case of active multimeric forms, several strategies can guide selection of a dominant negative mutant. Multimeric activity can be decreased in a controlled or saturating manner by expression of genes coding exogenous protein fragments that bind to multimeric association domains and prevent multimer formation. Alternatively, controllable or saturating over-expression of an inactive protein unit of a particular type can tie up wild-type active units in inactive multimers, and thereby decrease multimeric activity. See Nocka et al., EMBO J., Vol. 9, pp. 1805-1813 (1990). For example, in the case of dimeric DNA binding proteins, the DNA binding domain can be deleted from the DNA binding unit, or the activation domain deleted from the activation unit. Also, in this case, the DNA binding domain unit can be expressed without the domain causing association with the activation unit. Thereby, DNA binding sites are tied up without any possible activation of expression. In the case where a particular type of unit normally undergoes a conformational change during activity, expression of a rigid unit can inactivate resultant complexes. For a further example, proteins involved in cellular mechanisms, such as cellular motility, the mitotic process, cellular architecture and so forth, are typically composed of associations of many subunits of a few types. These structures are often highly sensitive to disruption by inclusion of a few monomeric units with structural defects. Such mutant monomers disrupt the relevant protein activities and can be expressed in a cell in a controlled or saturating manner. [138] In addition to dominant negative mutations, mutant target proteins that are sensitive to temperature (or other exogenous factors) can be found by mutagenesis and screening procedures that are well-known in the art.
[139] Also, one of skill in the art will appreciate that expression of antibodies binding and inhibiting a target protein can be employed as another dominant negative strategy. Modifying Proteins With Small Molecule Drugs
[140] Finally, activities of certain target proteins can be modified or perturbed in a controlled or a saturating manner by exposure to exogenous drugs or ligands. Since the methods of this invention are often applied to testing or confirming the usefulness of various drugs to treat cancer, drug exposure is an important method of modifying/perturbing cellular constituents, both mRNAs and expressed proteins. In a preferred embodiment, input cellular constituents are perturbed either by drug exposure or genetic manipulation, such as gene deletion or knockout; and system responses are measured by gene expression technologies, such as hybridization to gene transcript arrays (described in the following). [141] In a preferable case, a drug is known that interacts with only one target protein in the cell and alters the activity of only that one target protein, either increasing or decreasing the activity. Graded exposure of a cell to varying amounts of that drug thereby causes graded perturbations of network models having that target protein as an input. Saturating exposure causes saturating modification/perturbation. For example, Cyclosporin A is a very specific regulator of the calcineurin protein, acting via a complex with cyclophilin. A titration series of Cyclosporin A therefore can be used to generate any desired amount of inhibition of the calcineurin protein. Alternately, saturating exposure to Cyclosporin A will maximally inhibit the calcineurin protein.
Measurement Methods
[142] The experimental methods of this invention depend on measurements of cellular constituents. The cellular constituents measured can be from any aspect of the biological state of a cell. They can be from the transcriptional state, in which RNA abundances are measured, the translation state, in which protein abundances are measured, the activity state, in which protein activities are measured. The cellular characteristics can also be from mixed aspects, e.g., in which the activities of one or more proteins are measured along with the RNA abundances (gene expressions) of other cellular constituents. This section describes exemplary methods for measuring the cellular constituents in drug or pathway responses. This invention is adaptable to other methods of such measurement.
[143] Preferably, in this invention the transcriptional state of the other cellular constituents are measured. The transcriptional state can be measured by techniques of hybridization to arrays of nucleic acid or nucleic acid mimic probes, described in the next subsection, or by other gene expression technologies, described in the subsequent subsection. However measured, the result is data including values representing mRNA abundance and/or ratios, which usually reflect DNA expression ratios (in the absence of differences in RNA degradation rates).
[144] In various alternative embodiments of the present invention, aspects of the biological state other than the transcriptional state, such as the translational state, the activity state or mixed aspects can be measured.
[145] In all embodiments, measurements of the cellular constituents should be made in a manner that is relatively independent of when the measurement are made.
Transcriptional State Measurement
[146] Preferably, measurement of the transcriptional state is made by hybridization to transcript arrays, which are described in this subsection. Certain other methods of transcriptional state measurement are described later in this subsection.
Transcript Arrays Generally
[147] In a preferred embodiment the present invention makes use of "transcript arrays", also called herein "microarrays". Transcript arrays can be employed for analyzing the transcriptional state in a cell, and especially for measuring the transcriptional states of cancer cells.
[148] In one embodiment, transcript arrays are produced by hybridizing detectably-labeled polynucleotides representing the mRNA transcripts present in a cell, e.g., fluorescently- labeled cDNA synthesized from total cell mRNA, to a microarray. A microarray is a surface with an ordered array of binding, e.g., hybridization, sites for products of many of the genes in the genome of a cell or organism, preferably most or almost all of the genes. Microarrays can be made in a number of ways, of which several are described below. However produced, microarrays share certain characteristics. The arrays are reproducible, allowing multiple copies of a given array to be produced and easily compared with each other. Preferably the microarrays are small, usually smaller than 5 cm2 and they are made from materials that are stable under binding, e.g. nucleic acid hybridization, conditions. A given binding site or unique set of binding sites in the microarray will specifically bind the product of a single gene in the cell. Although there may be more than one physical binding site (hereinafter "site") per specific mRNA, for the sake of clarity the discussion below will assume that there is a single site. In a specific embodiment, positionally-addressable arrays containing affixed nucleic acids of known sequence at each location are used. [149] It will be appreciated that when cDNA complementary to the RNA of a cell is made and hybridized to a microarray under suitable hybridization conditions, the level of hybridization to the site in the array corresponding to any particular gene will reflect the prevalence in the cell of mRNA transcribed from that gene. For example, when detectably labeled, e.g., with a fluorophore, cDNA complementary to the total cellular mRNA is hybridized to a microarray, the site on the array corresponding to a gene, i.e., capable of specifically binding the product of the gene, that is not transcribed in the cell will have little or no signal, e.g., fluorescent signal, and a gene for which the encoded mRNA is prevalent will have a relatively strong signal. Preparation of Microarrays
[150] Microarrays are known in the art and consist of a surface to which probes that correspond in sequence to gene products, e.g., cDNAs, mRNAs, cRNAs, polypeptides and fragments thereof, can be specifically hybridized or bound at a known position. In one embodiment, the microarray is an array, i.e., a matrix, in which each position represents a discrete binding site for a product encoded by a gene, e.g., a protein or RNA, and in which binding sites are present for products of most or almost all of the genes in the organism's genome. In a preferred embodiment, the "binding site", hereinafter "site", is a nucleic acid or nucleic acid analogue to which a particular cognate cDNA can specifically hybridize. The nucleic acid or analogue of the binding site can be, e.g., a synthetic oligomer, a full-length cDNA, a less than full-length cDNA or a gene fragment.
[151] Although in a preferred embodiment the microarray contains binding sites for products of all or almost all genes in the target organism's genome, such comprehensiveness is not necessarily required. Usually the microarray will have binding sites corresponding to at least about 50% of the genes in the genome, often at least about 75%, more often at least about 85%, even more often more than about 90%, and most often at least about 99%. Preferably, the microarray has binding sites for genes relevant to testing and confirming a biological network model of interest. A "gene" is identified as an open reading frame (ORF) of preferably at least 50, 75 or 99 amino acids from which a mRNA is transcribed in the organism, e.g., if a single cell, or in some cell in a multicellular organism. The number of genes in a genome can be estimated from the number of mRNAs expressed by the organism, or by extrapolation from a well-characterized portion of the genome. When the genome of the organism of interest has been sequenced, the number of ORFs can be determined and mRNA coding regions identified by analysis of the DNA sequence. For example, the Saccharomyces cerevisiae genome has been completely sequenced and is reported to have approximately 6,275 ORFs longer than 99 amino acids. Analysis of these ORFs indicates that there are 5,885 ORFs that are likely to specify protein products. See Goffeau et al., Science, Vol. 274, pp. 546-567 (1996), which is incorporated by reference in its entirety for all purposes. In contrast, the human genome is estimated to contain approximately 105 genes. Preparing Nucleic Acids for Microarrays
[152] As noted above, the "binding site" to which a particular cognate cDNA specifically hybridizes is usually a nucleic acid or nucleic acid analogue attached at that binding site. In one embodiment, the binding sites of the microarray are DNA polynucleotides corresponding to at least a portion of each gene in an organism's genome. These DNAs can be obtained by, e.g., PCR amplification of gene segments from genomic DNA, cDNA, e.g., by RT-PCR, or cloned sequences. PCR primers are chosen, based on the known sequence of the genes or cDNA, that result in amplification of unique fragments, i.e., fragments that do not share more than 10 bases of contiguous identical sequence with any other fragment on the microarray. Computer programs are useful in the design of primers with the required specificity and optimal amplification properties. See, e.g., Oligo pi version 5.0, National Biosciences. In the case of binding sites corresponding to very long genes, it will sometimes be desirable to amplify segments near the 3' end of the gene so that when oligo-dT primed cDNA probes are hybridized to the microarray, less-than-full length probes will bind efficiently. Typically each gene fragment on the microarray will be between about 50 bp and about 2000 bp, more typically between about 100 bp and about 1000 bp, and usually between about 300 bp and about 800 bp in length. PCR methods are well-known and are described, e.g., in Innis et al., eds., PCR Protocols: A Guide to Methods and Applications, Academic Press Inc., San Diego, CA (1990), which is incorporated by reference in its entirety for all purposes. It will be apparent that computer-controlled robotic systems are useful for isolating and amplifying nucleic acids.
[153] An alternative means for generating the nucleic acid for the microarray is by synthesis of synthetic polynucleotides or oligonucleotides, e.g., using Λ/-phosphonate or phosphoramidite chemistries. See Froehler et al., Nucleic Acid Res., Vol. 14, pp. 5399-5407 (1986); and McBride et al., Tetrahedron Lett., Vol. 24, pp. 245-248 (1983). Synthetic sequences are between about 15 bases and about 500 bases in length, more typically between about 20 bases and about 50 bases. In some embodiments, synthetic nucleic acids include non-natural bases, e.g., inosine. As noted above, nucleic acid analogues may be used as binding sites for hybridization. An example of a suitable nucleic acid analogue is peptide nucleic acid. See, e.g., Egholm et al., Nature, Vol. 365, pp. 566-568 (1993); and also U.S. Patent No. 5,539,083.
[154] In an alternative embodiment, the binding (hybridization) sites are made from plasmid or phage clones of genes, cDNAs, e.g., expressed sequence tags, or inserts therefrom. See Nguyen et al., Genomics, Vol. 29, pp. 207-209 (1995). In yet another embodiment, the polynucleotide of the binding sites is RNA. Attaching Nucleic Acids to the Solid Surface
[155] The nucleic acid or analogue are attached to a solid support, which may be made from glass, plastic, e.g., polypropylene and nylon, polyacrylamide, nitrocellulose or other materials. A preferred method for attaching the nucleic acids to a surface is by printing on glass plates, as is described generally by Schena et al., Science, Vol. 270, pp. 467-470 (1995). This method is especially useful for preparing microarrays of cDNA. See, also, DeRisi et al., Nat. Genet, Vol. 14, pp. 457-460 (1996); Shalon et al., Genome Res., Vol. 6, pp. 639-645 (1996); and Schena et al., Proc. Natl. Acad. Sci. USA, Vol. 93, pp. 10539-11286 (1995). Each of the aforementioned articles is incorporated by reference in its entirety for all purposes. [156] A second preferred method for making microarrays is by making high-density oligonucleotide arrays. Techniques are known for producing arrays containing thousands of oligonucleotides complementary to defined sequences, at defined locations on a surface using photolithographic techniques for synthesis in situ [see Fodor et al., Science, Vol. 251 , pp. 767-773 (1991); Pease et al., Proc. Natl. Acad. Sci. USA, Vol. 91, No. 11, pp. 5022-5026 (1994); Lockhart et al., Nat. Biotechnol., Vol. 14, p. 1675 (1996); and U.S. Patent Nos. 5,578,832; 5,556,752; and 5,510,270, each of which is incorporated by reference in its entirety for all purposes] or other methods for rapid synthesis and deposition of defined oligonucleotides [see Blanchard et al., Biosens. Bioelectron., Vol. 11, pp. 687-690 (1996)]. When these methods are used, oligonucleotides, e.g., 20 mers, of known sequence are synthesized directly on a surface such as a derivatized glass slide. Usually, the array produced is redundant, with several oligonucleotide molecules per RNA. Oligonucleotide probes can be chosen to detect alternatively spliced mRNAs.
[157] Other methods for making microarrays, e.g., by masking, may also be used. See Maskos and Southern, Nucleic Acids Res., Vol. 20, pp. 1679-1684 (1992). In principal, any type of array, e.g., dot blots on a nylon hybridization membrane [see Sambrook et al., Molecular Cloning-A Laboratory Manual, 2nd Edition, Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989), which is incorporated in its entirety for all purposes], could be used, although, as will be recognized by those of skill in the art, very small arrays will be preferred because hybridization volumes will be smaller. Generating Labeled Probes
[158] Methods for preparing total and poly(A)+ RNA are well-known and are described generally in Sambrook et al. (1989), supra. In one embodiment, RNA is extracted from cells of the various types of interest in this invention using guanidinium thiocyanate lysis followed by CsCI centrifugation. See Chirgwin et al., Biochemistry, Vol. 18, pp. 5294-5299 (1979). Poly(A)+ RNA is selected by selection with oligo-dT cellulose. See Sambrook et al. (1989), supra. Cells of interest include wild-type cells, drug-exposed wild-type cells, cells with modified/perturbed cellular constituent(s), and drug-exposed cells with modified/perturbed cellular constituent(s).
[159] Labeled cDNA is prepared from mRNA by oligo dT-primed or random-primed reverse transcription, both of which are well-known in the art. See, e.g., Klug and Berger, Methods Enzymol., Vol. 52, pp. 316-325 (1987). Reverse transcription may be carried out in the presence of a dNTP conjugated to a detectable label, most preferably a fluorescently- labeled dNTP. Alternatively, isolated mRNA can be converted to labeled antisense RNA synthesized by in vitro transcription of double-stranded cDNA in the presence of labeled dNTPs. See Lockhart et al. (1996), supra, which is incorporated by reference in its entirety for all purposes. In alternative embodiments, the cDNA or RNA probe can be synthesized in the absence of detectable label and may be labeled subsequently, e.g., by incorporating biotinylated dNTPs or rNTP, or some similar means, e.g., photo-cross-linking a psoralen derivative of biotin to RNAs, followed by addition of labeled streptavidin, e.g., phycoerythrin- conjugated streptavidin or the equivalent.
[160] When fluorescently-labeled probes are used, many suitable fluorophores are known, including fluorescein, lissamine, phycoerythrin, rhodamine (Perkin Elmer Cetus), Cy2, Cy3, Cy3.5, Cy5, Cy5.5, Cy7, FluorX (Amersham) and others. See, e.g., Kricka, Nonisotopic DNA Probe Techniques, Academic Press, San Diego, CA (1992). It will be appreciated that pairs of fluorophores are chosen that have distinct emission spectra so that they can be easily distinguished.
[161] In another embodiment, a label other than a fluorescent label is used. For example, a radioactive label, or a pair of radioactive labels with distinct emission spectra, can be used. See Zhao et al., Gene, Vol. 156, p. 207 (1995); and Pietu et al., Genome Res., Vol. 6, p. 492 (1996). However, because of scattering of radioactive particles, and the consequent requirement for widely-spaced binding sites, use of radioisotopes is a less-preferred embodiment.
[162] In one embodiment, labeled cDNA is synthesized by incubating a mixture containing 0.5 mM dGTP, dATP and dCTP plus 0.1 mM dTTP plus fluorescent deoxyribonucleotides, e.g., 0.1 mM Rhodamine 110 UTP (Perken Elmer Cetus) or 0.1 mM Cy3 dUTP (Amersham), with reverse transcriptase, e.g., SuperScript.TM. II, LTI Inc., at 42°C for 60 minutes. Hybridization to Microarrays
[163] Nucleic acid hybridization and wash conditions are chosen so that the probe "specifically binds" or "specifically hybridizes" to a specific array site, i.e., the probe hybridizes, duplexes or binds to a sequence array site with a complementary nucleic acid sequence but does not hybridize to a site with a non-complementary nucleic acid sequence. As used herein, one polynucleotide sequence is considered complementary to another when, if the shorter of the polynucleotides is ≤25 bases, there are no mismatches using standard base-pairing rules or, if the shorter of the polynucleotides is longer than 25 bases, there is no more than a 5% mismatch. Preferably, the polynucleotides are perfectly complementary (no mismatches). It can easily be demonstrated that specific hybridization conditions result in specific hybridization by carrying out a hybridization assay including negative controls. See, e.g., Shalon et al. (1996), supra; and Chee et al., supra.
[164] Optimal hybridization conditions will depend on the length, e.g., oligomer vs. polynucleotide >200 bases; and type, e.g., RNA, DNA and PNA, of labeled probe and immobilized polynucleotide or oligonucleotide. General parameters for specific, i.e., stringent, hybridization conditions for nucleic acids are described in Sambrook et al. (1996), supra; and Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing and Wiley-lnterscience, NY (1987), which is incorporated in its entirety for all purposes. When the cDNA microarrays of Schena et al. are used, typical hybridization conditions are hybridization in 5 x SSC plus 0.2% SDS at 65°C for 4 hours followed by washes at 25°C in low-stringency wash buffer (1 x SSC plus 0.2% SDS) followed by 10 minutes at 25°C in high- stringency wash buffer (0.1 x SSC plus 0.2% SDS). See Shena et al., Proc. Natl. Acad. Sci. USA, Vol. 93, p. 10614 (1996). Useful hybridization conditions are also provided. See, e.g., Tijessen, Hybridization With Nucleic Acid Probes, Elsevier Science Publishers B.V. (1993); and Kricka (1992), supra. Signal Detection and Data Analysis
[165] When fluorescently-labeled probes are used, the fluorescence emissions at each site of a transcript array can be, preferably, detected by scanning confocal laser microscopy. In one embodiment, a separate scan, using the appropriate excitation line, is carried out for each of the two fluorophores used. Alternatively, a laser can be used that allows simultaneous specimen illumination at wavelengths specific to the two fluorophores and emissions from the two fluorophores can be analyzed simultaneously. See Shalon el al. (1996), supra, which is incorporated by reference in its entirety for all purposes. In a preferred embodiment, the arrays are scanned with a laser fluorescent scanner with a computer-controlled X-Y stage and a microscope objective. Sequential excitation of the two fluorophores is achieved with a multi-line, mixed gas laser and the emitted light is split by wavelength and detected with two photomultiplier tubes. Fluorescence laser scanning devices are described in Schena et al. (1996), supra and in other references cited herein. Alternatively, the fiber-optic bundle described by Ferguson et al., Nat. Biotechnol., Vol. 14, pp. 1681-1684 (1996), may be used to monitor mRNA abundance levels at a large number of sites simultaneously.
[166] Signals are recorded and, in a preferred embodiment, analyzed by computer, e.g., using a 12-bit analog to digital board. In one embodiment the scanned image is de-speckled using a graphics program, e.g., Hijaak Graphics Suite, and then analyzed using an image gridding program that creates a spreadsheet of the average hybridization at each wavelength at each site. If necessary, an experimentally determined correction for "cross talk" (or overlap) between the channels for the two fluorophores may be made. For any particular hybridization site on the transcript array, a ratio of the emission of the two fluorophores is preferably calculated. The ratio is independent of the absolute expression level of the cognate gene, but is useful for genes whose expression is significantly modulated by drug administration, gene deletion or any other tested event.
[167] Preferably, in addition to identifying a perturbation as positive or negative, it is advantageous to determine the magnitude of the perturbation. This can be carried out by methods that will be readily apparent to those of skill in the art. Other Methods of Transcriptional State Measurement
[168] The transcriptional state of a cell may be measured by other gene expression technologies known in the art. Several such technologies produce pools of restriction fragments of limited complexity for electrophoretic analysis, such as methods combining double restriction enzyme digestion with phasing primers [see, e.g., EP 0534858 A1 (1992), Zabeau et al.], or methods selecting restriction fragments with sites closest to a defined mRNA end [see, e.g., Prashar et al., Proc. Natl. Acad. Sci. USA, Vol. 93, pp. 659-663 (1996)]. Other methods statistically sample cDNA pools, such as by sequencing sufficient bases, e.g., 20-50 bases, in each of multiple cDNAs to identify each cDNA, or by sequencing short tags, e.g., 9-10 bases, which are generated at known positions relative to a defined mRNA end pathway pattern. See, e.g., Velculescu, Science, Vol. 270, pp.484-487 (1995). Measurement of Other Aspects
[169] In various embodiments of the present invention, aspects of the biological state other than the transcriptional state, such as the translational state, the activity state or mixed aspects can be measured in order to obtain drug and pathway responses. Details of these embodiments are described in this section. Translational State Measurements
[170] Measurement of the translational state may be performed according to several methods. For example, whole genome monitoring of protein, i.e., the "proteome" [see Goffeau et al. (1996), supra], can be carried out by constructing a microarray in which binding sites comprise immobilized, preferably monoclonal, antibodies specific to a plurality of protein species encoded by the cell genome. Preferably, antibodies are present for a substantial fraction of the encoded proteins, or at least for those proteins relevant to testing or confirming a biological network model of interest. Methods for making monoclonal antibodies are well-known. See, e.g., Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor, NY (1988), which is incorporated in its entirety for all purposes. In a preferred embodiment, monoclonal antibodies are raised against synthetic peptide fragments designed based on genomic sequence of the cell. With such an antibody array, proteins from the cell are contacted to the array and their binding is assayed with assays known in the art.
[171] Alternatively, proteins can be separated by two-dimensional gel electrophoresis systems. Two-dimensional gel electrophoresis is well-known in the art and typically involves iso-electric focusing along a first dimension followed by SDS-PAGE electrophoresis along a second dimension. See, e.g., Hames et al., Gel Electrophoresis of Proteins: A Practical Approach, IRL Press, NY (1990); Shevchenko et al., Proc. Natl. Acad. Sci. USA, Vol. 93, pp. 1440-1445 (1996); Sagliocco et al., Yeast, Vol. 12, pp. 1519-1533 (1996); Lander, Science, Vol. 274, pp. 536-539 (1996). The resulting electropherograms can be analyzed by numerous techniques, including mass spectrometric techniques, western blotting and immunoblot analysis using polyclonal and monoclonal antibodies, and internal and /V-terminal micro-sequencing. Using these techniques, it is possible to identify a substantial fraction of all the proteins produced under given physiological conditions, including in cells, e.g., in yeast; exposed to a drug or in cells modified by, e.g., deletion or over-expression of a specific gene.
Embodiments Based on Other Aspects of the Biological State
[172] Although monitoring cellular constituents other than mRNA abundances currently presents certain technical difficulties not encountered in monitoring mRNAs, it will be apparent to those of skill in the art that the use of methods of this invention that the activities of proteins relevant to the characterization of cell function can be measured, embodiments of this invention can be based on such measurements. Activity measurements can be performed by any functional, biochemical or physical means appropriate to the particular activity being characterized. Where the activity involves a chemical transformation, the cellular protein can be contacted with the natural substrates and the rate of transformation measured. Where the activity involves association in multimeric units, e.g., association of an activated DNA-binding complex with DNA, the amount of associated protein or secondary consequences of the association, such as amounts of mRNA transcribed, can be measured. Also, where only a functional activity is known, e.g., as in cell cycle control, performance of the function can be observed. However known and measured, the changes in protein activities form the response data analyzed by the foregoing methods of this invention. [173] In alternative and non-limiting embodiments, response data may be formed of mixed aspects of the biological state of a cell. Response data can be constructed from, e.g., changes in certain mRNA abundances, changes in certain protein abundances, and changes in certain protein activities. Computer Implementations
[174] In a preferred embodiment, the computation steps of the previous methods are implemented on a computer system or on one or more networked computer systems in order to provide a powerful and convenient facility for forming and testing models of biological systems. The computer system may be a single hardware platform comprising internal components and being linked to external components. The internal components of this computer system include processor element interconnected with a main memory. For example computer system can be an Intel Pentium based processor of 200 Mhz or greater clock rate and with 32 MB or more of main memory.
[175] The external components include mass data storage. This mass storage can be one or more hard disks, which are typically packaged together with the processor and memory. Typically, such hard disks provide for at least 1 GB of storage. Other external components include user interface device, which can be a monitor and keyboards, together with pointing device, which can be a "mouse" or other graphic input devices. Typically, the computer system is also linked to other local computer systems, remote computer systems, or wide area communication networks, such as the Internet. This network link allows the computer system to share data and processing tasks with other computer systems. [176] Loaded into memory during operation of this system are several software components, which are both standard in the art and special to the instant invention. These software components collectively cause the computer system to function according to the methods of this invention. These software components are typically stored on mass storage. Alternatively, the software components may be stored on removable media such as floppy disks or CD-ROM (not illustrated). The software component represents the operating system, which is responsible for managing the computer system and its network interconnections. This operating system can be, e.g., of the Microsoft Windows family, such as Windows 95, Windows 98 or Windows NT; or a Unix operating system, such as Sun Solaris. Software include common languages and functions conveniently present on this system to assist programs implementing the methods specific to this invention. Languages that can be used to program the analytic methods of this invention include C, C++ or, less preferably, JAVA. Most preferably, the methods of this invention are programmed in mathematical software packages which allow symbolic entry of equations and high-level specification of processing, including algorithms to be used, thereby freeing a user of the need to procedurally program individual equations or algorithms. Such packages include, e.g., Matlab from Mathworks (Natick, Mass.), Mathematica from Wolfram Research (Champaign, IL) and MathCAD from Mathsoft (Cambridge, MA). [177] In preferred embodiments, the analytic software component actually comprises separate software components which interact with each other. Analytic software represents a database containing all data necessary for the operation of the system. Such data will generally include, but is not necessarily limited to, results of prior experiments, genome data, experimental procedures and cost and other information which will be apparent to those skilled in the art. Analytic software includes a data reduction and computation component comprising one or more programs which execute the analytic methods of the invention. [178] Analytic software also includes a user interface which provides a user of the computer system with control and input of test network models, and, optionally, experimental data. The user interface may comprise a drag-and-drop interface for specifying hypotheses to the system. The user interface may also comprise means for loading experimental data from the mass storage component, e.g., the hard drive; from removable media, e.g., floppy disks or CD-ROM; or from a different computer system communicating with the instant system over a network, e.g., a local area network, or a wide area communication network, such as the Internet.
[179] Alternative systems and methods for implementing the analytic methods of this invention will be apparent to one of skill in the art and are intended to be comprehended within the accompanying claims. In particular, the accompanying claims are intended to include the alternative program structures for implementing the methods of this invention that will be readily apparent to one of skill in the art.
Figure imgf000046_0001
Figure imgf000047_0001
Figure imgf000048_0001
Table 4. Comparison of Predicted Vs. Observed Status of 18 Ovarian Test Samples With a 28 Probe Sets Expression Profile
Figure imgf000048_0002
Note: The number of observations is shown for each group of samples, with the value expected under random association in parentheses.
"r" = the PCC value of the 28 probe set profile of a biopsy sample with the mean Normal profile.
OR = 63 (95% Cl: 3.3-1194.7), p=0.0029
Table 5. Comparison of Predicted Vs. Observed Status of 36 Ovarian Samples With 28, 32 or 42 Probe Sets Expression Profile
Figure imgf000048_0003
Table 6. List of Genes Up-Regulated in Ovarian Tumors
Absolute CC values are shown for expression levels analyzed in the 18 test samples only (R1) or in all 36 samples (R2).
Figure imgf000049_0001
Table 7. Functional Categories of the Most Differentially-Expressed Genes in Ovarian Cancer
Figure imgf000049_0002
*For 5 genes (GPRK5, IGFBP5, IRS1, ITPR1 and RBPMS) similar results were obtained with 2 different probe sets. Table 8 List of Genes Down-Regulated in Ovarian Tumors Functional Category
Figure imgf000050_0001
Figure imgf000051_0001
Figure imgf000052_0001
Figure imgf000053_0001
Figure imgf000054_0001
Note: Gene symbols in bold indicated genes detected with 2 separate probe sets. Absolute CC values are shown for expression levels analyzed in all 36 samples (R1) and in the 18 test samples only (R2). In each functional category, probe sets are listed by descending R1 values. indicates genes from the 28 classification set not ranked within the 100 highest R1 values.
Figure imgf000055_0001
Figure imgf000056_0001
Figure imgf000057_0001
Figure imgf000058_0001
Figure imgf000059_0001
Figure imgf000060_0001
Figure imgf000061_0001
Figure imgf000062_0001
Figure imgf000063_0001
Figure imgf000064_0001
Figure imgf000065_0001
Figure imgf000066_0001
Figure imgf000067_0001
Figure imgf000068_0001
Figure imgf000069_0001
Figure imgf000070_0001
Figure imgf000071_0001
Figure imgf000072_0001
Figure imgf000073_0001
Figure imgf000074_0001
Figure imgf000075_0001
Figure imgf000076_0001
Table 10. Ranking of the Top 100 Probe Sets Based on PCC Values Probe Set Probe set Probe set Name R1 R2 name R1 R2 name R1 R2 37628 at 0.865 0.808 37529 at 0.669 0.494 201 s at 0.577 0.571 41859 at 0.865 0.877 32175 at 0.669 0.486 774_g_at 0.576 0.465 38120 at 0.852 0.768 35753 at 0.667 0.535 40998 at 0.576 0.539 32664 at 0.848 0.749 38875 r at 0.667 0.495 41772 at 0.573 0.485 35717 at 0.847 0.783 32779 s at 0.665 0.599 40522 at 0.572 0.544 34257 at 0.846 0.786 41385 at 0.665 0.594 41768 at 0.571 0.513 38220 at 0.844 0.820 1319 at 0.664 0.580 37828 at 0.569 0.677 40423 at 0.844 0.785 32593 at 0.664 0.692 280_g_at 0.569 0.377 38650 at 0.840 0.748 38101 at 0.663 0.615 33431 at 0.568 0.536 38439 at 0.829 0.736 39864 at 0.663 0.698 1278 at 0.566 0.448 39673 i at 0.827 0.698 39037 at 0.663 0.726 35736 at 0.566 0.508 38047 at 0.822 0.756 32057 at 0.663 0.667 37985 at 0.565 0.608 1396 at 0.821 0.756 38518 at 0.663 0.517 38326 at 0.565 0.417 37015 at 0.820 0.751 39556 at 0.662 0.725 37197 s at 0.562 0.647 40775 at 0.818 0.666 40841 at 0.661 0.618 41529_g_at 0.560 0.620 32145 at 0.817 0.786 39117 at 0.660 0.557 39838 at 0.560 0.582 35742 at 0.815 0.708 36695 at 0.659 0.602 2092 s at 0.560 0.679 1290_g_at 0.814 0.776 39681 at 0.659 0.690 31672_g_at 0.559 0.438 39674 r at 0.811 0.699 1850 at 0.659 0.529 37701 at 0.559 0.391 36917 at 0.810 0.752 38013 at 0.657 0.672 40260_g_at 0.558 0.516 1897 at 0.809 0.770 1377 al 0.656 0.605 33249 at KJ OOO 0.618 755 at 0.809 0.822 41739 s at 0.655 0.552 33198 at 0.557 0.705 38176 al 0.805 0.729 36095 at 0.654 0.513 1708 at 0.557 0.475 36073 at 0.804 0.774 35169 at 0.653 0.481 38116 at 0.556 0.595 32764 at 0.803 0.691 36533 at 0.653 0.517 40832 s at 0.555 0.505 39750 at 0.796 0.718 40698 at 0.65 0.563 37958 at 0.555 0.617 35645 at 0.795 0.714 33936 at 0.651 0.702 1787 at 0.555 0.374 38717 at 0.793 0.789 1507 s at 0.648 0.596 333 s at 0.554 0.562 37394 at 0.79 0.656 38211 at 0.64S 0.681 36690 at 0.554 0.654 36160 s at 0.788 0.846 38351 at 0.648 0.527 40839 at 0.553 0.611 36867 at 0.785 0.739 32239 at 0.647 0.548 32569 at 0.553 0.534 39852 at 0.783 0.738 32582 at 0.646 0.578 33302 at 0.553 0.619 37643 at 0.779 0.716 31880 at 0.646 0.489 1058 at 0.552 0.668 40767 at 0.779 0.640 38342 at 0.646 0.632 38990 at 0.552 0.526 41449 at 0.778 0.762 33240 at 0.645 0.511 506 s at 0.551 0.492 40488 at 0.778 0.698 33136 at 0.645 0.628 871 s at 0.550 0.630 Probe Set Probe set Probe set Name R1 R2 name R1 R2 name R1 R2 40063 at 0.777 0.729 39438 at 0.645 0.690 33113 at 0.549 0.554 41685 at 0.777 0.871 38035 at 0.644 0.528 507 s at 0.548 0.523
34163_g_at 0.776 0.709 33278 at 0.644 0.560 40876 at 0.548 0.754 40570 at 0.775 0.668 33140 at 0.644 0.607 35754 at 0.548 0.632 37446 at 0.771 0.613 40145 at 0.643 0.636 34287 at 0.546 0.553 38669 at 0.770 0.712 34215 at 0.642 0.611 39775 at 0.545 0.436 36627 at 0.770 0.675 538 at 0.642 0.622 41174 at 0.545 0.497 1640 at 0.769 0.734 35846 at 0.641 0.655 34417 at 0.545 0.525
35681 r at 0.769 0.644 39545 at 0.641 0.546 34259 at 0.544 0.541 36894 at 0.768 0.634 32521 at 0.641 0.563 38724 at 0.544 0.512 41137 at 0.768 0.631 34320 at 0.640 0.557 32769 at 0.543 0.426 39397 at 0.768 0.700 39939 at 0.640 0.593 33916 at 0.541 0.501 41273 at 0.766 0.758 40419 at 0.639 0.599 38470 i at 0.541 0.565 38122 at 0.761 0.707 36791_g_at 0.635 0.556 38438 at 0.540 0.612 40861 at 0.760 0.689 40674 s at 0.638 0.697 31573 at 0.540 0.488 35164 at 0.758 0.621 32143 at 0.638 0.532 31536 at %j* Ov-sy 0.479 39400 at 0.758 0.634 41770 at 0.636 0.539 39376 at 0.539 0.715 872 i at 0.756 0.770 1909 at 0.636 0.736 34859 at 0.538 0.669 41738 at 0.755 0.707 38581 at 0.636 0.573 39733 at 0.538 0.431 38113 at 0.754 0.681 4.227 at 0.333 0.466 1211 s at 0.538 0.614 40202 at 0.754 0.606 39790 at 0.635 0.579 38059_g_at 0.537 0.524 34760 at 0.749 j. ot?y 36091 at 0.635 0.652 31510 s at 0.537 0.668 32526 al 0.749 0.631 41191 at 0.633 0.477 1596 g_at 0.537 0.393 40994 at 0.749 0.627 33817 al 0.631 0.606 38711 at 0.537 0.578 37908 at 0.748 0.557 36790 al 0.631 0.552 32676 at 0.536 0.642 38119 at 0.747 0.669 34162 at 0.631 0.566 38768 at 0.536 0.671 33690 at 0.747 0.565 39691 at 0.630 0.683 34819 at 0.535 0.614 41478 at 0.745 0.653 38049_g_at 0.630 0.473 36629 at 0.535 0.550 38634 at 0.745 0.677 38754 at 0.628 0.560 36578 at 0.535 0.575 32109 at 0.744 0.670 31605 at 0.628 0.539 35335 at 0.534 0.616 41049 at 0.744 0.694 37230 at 0.628 0.590 39687 at 0.534 0.525 32778 at 0.743 0.689 41300 s at 0.628 0.519 37532 at 0.534 0.710 37599 at 0.743 0.652 31897 at 0.627 0.493 41634 at 0.534 0.629 32076 at 0.741 0.713 34789 at 0.627 0.606 34821 at 0.534 0.638 35234 at 0.741 0.634 40508 at 0.626 0.610 38627 at 0.533 0.512 40853 at 0.740 0.647 33303 at 0.626 0.540 34445 at 0.533 0.526 1731 at 0.739 0.619 35221 at 0.626 0.520 32755 at 0.533 0.481 39714 at 0.738 0.743 37638 at 0.626 0.607 1578_g_at 0.532 0.661
41505 r at 0.738 0.638 41744 at 0.625 0.723 34349 at 0.530 0.603 1761 at 0.736 0.789 41405 at 0.625 0.592 33426 at 0.529 0.484 Probe Set Probe set Probe set Name R1 R2 name R1 R2 name R1 R2 36577 at 0.736 0.549 41594 at 0.625 0.644 39441 at 0.529 0.633 32805 at 0.736 0.586 35782 at 0.625 0.638 39066 at 0.528 0.290 1577 at 0.736 0.767 37221 at 0.625 0.501 38318 at 0.527 0.747 38643 at 0.735 0.714 39147_g_at 0.624 0.592 40438 at 0.527 0.577 40496 at 0.734 0.617 38727 at 0.624 0.597 1090 f at 0.525 0.677 1135 at 0.732 0.652 39109 at 0.623 0.564 40077 at 0.524 0.650 41138 at 0.731 0.565 40399 r at 0.623 0.630 38079 at 0.523 0.490 38968 at 0.729 0.683 39436 at 0.623 0.596 35811 at 0.522 0.566 1327 s at 0.729 0.626 37710 at 0.622 0.600 1530_g_at 0.522 0.560 34772 at 0.729 0.748 36727 at 0.627 0.558 38690 at 0.520 0.591 32535 at 0.729 0.582 33911 at 0.627 0.633 41147 at 0.520 0.699 37743 at 0.729 0.617 36634 at 0.627 0.412 39693 at 0.577 0.544 34355 at 0.728 0.662 36118 at 0.620 0.542 1147 at 0.517 0.637 32259 at 0.728 0.611 39019 at 0.620 0.486 34808 at 0.577 0.614 32251 at 0.728 0.615 33244 at 0.620 0.572 39055 at 0.577 0.610 41000 at 0.727 0.677 39557 at 0.679 0.549 39986 at 0.516 0.591 40786 at 0.725 0.705 40962 s at 0.679 0.521 33380 at 0.575 0.501 39701 at 0.724 0.611 29 at 0.678 0.656 34793 s at 0.572 0.387 34740 at 0.723 0.520 31852 at 0.677 0.687 37842 at 0.512 0.336 35354 at 0.723 0.667 818 s at 0.677 0.554 39685 at 0.5 7 0.538
34169 s at 0.722 0.661 37407 s at 0.677 0.506 38279 at 0.57 0.587 36873 at 0.722 0.633 36829 at 0.677 0.518 41420 at 0.570 0.593 3243 s at 0.72f 0.702 35173 al 0.677 0.774 547 s at 0.506 0.370 34877 at 0.720 0.671 39846 at 0.677 0.564 1252 at 0.506 0.548 36119 at 0.720 0.622 0.676 0.497 32597 at 0.506 0.648 38364 at 0.7- 9 (J- ot?y 40607 at 0.676 0.708 32847 at 0.505 0.363 39025 at 0.778 0.746 32087 at 0.676 0.672 1678_g_at 0.504 0.638 32747 at 0.717 0.706 36488 at 0.675 0.636 35999 r at 0.504 0.531 36975 at 0.777 0.696 487_g_at 0.675 0.512 36964 at 0.504 0.542 33443 at 0.716 0.635 40617 at 0.675 0.645 753 at 0.504 0.412 32542 at 0.716 0.662 853 at 0.674 0.674 32153 s at 0.504 0.575
32765 f at 0.776 0.528 1101 at 0.672 0.531 32668 at 0.503 0.809 41013 at 0.775 0.578 35359 at 0.672 0.653 38394 at 0.500 0.790
37707 i at 0.775 0.528 32851 at 0.672 0.614 37623 at 0.500 0.384 35785 at 0.775 0.691 41195 at 0.670 0.502 41759 at 0.500 0.403 35783 at 0.773 0.753 40825 at 0.608 0.556 37027 at 0.500 0.576 36515 at 0.772 0.583 33235 at 0.607 0.587 33756 at 0.500 0.390 924 s at 0.777 0.640 40155 at 0.606 0.547 1120 at 0.498 0.500 33857 at 0.770 0.630 37617 at 0.606 0.636 40971 at 0.497 0.581 35704 at 0.709 0.656 40213 at 0.606 0.615 1307 at 0.497 0.582 Probe Set Probe set Probe set Name R1 R2 name R1 R2 name R1 R2
41747 s at 0.709 0.587 39260 at 0.606 0.462 35955 at 0.496 0.640 35316 at 0.709 0.743 39294 at 0.606 0.595 38459_g_at 0.494 0.429
38508 s at 0.708 0.542 2062 at 0.605 0.528 38466 at 0.493 0.344 35644 at 0.707 0.535 31886 at 0.605 0.467 32708_g_at 0.493 0.627 35366 at 0.707 0.560 38695 at 0.605 0.775 36521 at 0.493 0.233 37005 at 0.706 0.541 767 at 0.604 0.478 37315 f at 0.492 0.602 40961 at 0.705 0.745 33878 at 0.604 0.657 33440 at 0.492 0.273 36948 at 0.703 0.766 32119 at 0.603 0.575 35843 at 0.491 0.567 39743 at 0.702 0.775 38228_g_at 0.602 0.323 34235 at 0.491 0.554 39369 at 0.702 0.635 39979 at 0.602 0.588 479 at 0.490 0.370
36596 r at 0.707 0.575 35740 at 0.607 0.397 40140 at 0.490 0.666 39038 at 0.700 0.576 32768 at 0.607 0.648 41379 at 0.490 0.524 851 s at 0.699 0.725 34637 f at 0.600 0.678 39623 at 0.489 0.457 41655 at 0.699 0.664 35794 at 0.600 0.465 41277 at 0.489 0.632 35246 at 0.697 0.652 36792 at 0.599 0.527 36458 at 0.488 0.372 38317 at 0.696 0.593 39366 at 0.599 0.529 35166 al 0.488 0.544 37294 at 0.696 0.649 40770 f at 0.599 0.655 654 at 0.488 0.643
35168 f at 0.696 0.540 38673 s at 0.598 0.497 38826 at 0.488 0.505
36159 s at 0.696 0.589 1675 at 0.598 0.535 33915 at 0.487 0.599
35752 s at 0.695 0.642 33399 at 0.598 0.544 38626 at 0.487 0.509 35325 at 0.694 0.577 32700 at 0.598 0.592 38074 at 0.486 0.634 718 at 0.693 0.540 32337 at 0.597 (-/ dO^? 35730 at 0.486 0.515
37708 r at 0.692 0.545 1872 at 0.597 0.538 31932 f at 0.485 0.586 34363 at 0.697 0.665 1836 at 0.597 0.543 34803 at 0.485 0.633
2086 s at 0.697 0.637 176 at 0.596 0.654 32066_g_at 0.484 0.407 37406 at 0.690 0.665 37598 at j. oyo 0.557 37199 at 0.484 0.488 41796 at 0.689 0.779 41490 at 0.593 0.525 33123 at 0.483 0.605 35331 at 0.689 0.565 37266 at 0.593 0.548 36120 at 0.483 0.437 1736 at 0.688 0.688 39054 at 0.592 0.482 35294 at 0.483 0.476 32107 at 0.688 0.598 31936 s at 0.597 0.565 39856 at 0.482 0.620 34853 at 0.688 0.538 1629 s at 0.597 0.692 1323 at 0.482 0.604 36396 at 0.688 0.785 40461 at 0.588 0.427 33830 at 0.481 0.679 32780 at 0.687 0.594 39031 at 0.588 0.475 38743 f at 0.481 0.524 32254 at 0.687 0.464 1529 at 0.588 0.586 41656 at 0.480 0.661 37248 at 0.687 0.536 39555 at 0.588 0.517 38106 at 0.479 0.497 33800 at 0.687 0.579 1005 at 0.588 0.329 39420 at 0.479 0.452 38837 at 0.686 0.584 38745 at 0.587 0.702 39110 at 0.478 0.566 39360 at 0.685 0.640 35784 at 0.587 0.622 31463 s at 0.478 0.500 41638 at 0.683 0.633 706 at 0.587 0.473 41686 s at 0.476 0.545 34198 at 0.683 0.638 41620 at 0.587 0.678 41289 at 0.476 0.430
Figure imgf000081_0002
Figure imgf000081_0001
Figure imgf000082_0001
Figure imgf000083_0001
Figure imgf000084_0001
Figure imgf000085_0001
Note: Absolute CC values are shown for expression levels analyzed in all 36 samples (R1 ) and in the 18 test samples only (R2).
CCs ≥0.5 are italicized and underlined. References Cited
[180] All references cited herein are incorporated herein by reference in their entirety and for all purposes to the same extent as if each individual publication or patent or patent application was specifically and individually indicated to be incorporated by reference in its entirety for all purposes. The discussion of references herein is intended merely to summarize the assertions made by their authors and no admission is made that any reference constitutes prior art. Applicants reserve the right to challenge the accuracy and pertinence of the cited references.
[181] In addition, all GenBank accession numbers, Unigene Cluster numbers and protein accession numbers cited herein are incorporated herein by reference in their entirety and for all purposes to the same extent as if each such number was specifically and individually indicated to be incorporated by reference in its entirety for all purposes. [182] The present invention is not to be limited in terms of the particular embodiments described in this application, which are intended as single illustrations of individual aspects of the invention. Many modifications and variations of this invention can be made without departing from its spirit and scope, as will be apparent to those skilled in the art. Functionally equivalent methods and apparatus within the scope of the invention, in addition to those enumerated herein, will be apparent to those skilled in the art from the foregoing description and accompanying drawings. Such modifications and variations are intended to fall within the scope of the appended claims. The present invention is to be limited only by the terms of the appended claims, along with the full scope of equivalents to which such claims are entitled.

Claims

CLAIMSWhat is claimed:
1. A method to determine if a patient is afflicted with ovarian cancer comprising: a) obtaining a sample from the said patient; b) determining the levels of gene expression of two or more of the genes listed in Table 9 in the sample from the patient; c) comparing the levels of gene expression of the two or more genes determined in (b) to the levels of the same genes listed in Table 1 ; d) determining the degree of similarity (DOS) between the levels of gene expression of the two or more genes determined in (c); and e) determining from the DOS between the level of gene expression of the two or more genes the probability that the sample shows evidence of the presence of ovarian cancer in the patient.
2. The method of Claim 1 , wherein the levels of gene expression are determined for a subset of the genes listed in table comprising genes Nos. 1-28 in Table 9.
3. The method of Claim 1 or 2, wherein the sample comprises cells obtained from the patient.
4. The method of any one of Claims 1 to 3, wherein the sample comprises cells removed from a solid tumor in the said patient.
5. The method of any one of Claims 1 to 4, wherein the sample comprises blood cells and serum drawn from the said patient.
6. The method of any one of Claims 1 to 5, wherein the sample comprises a body fluid drawn from the patient.
7. The method of any one of Claims 1 to 6, wherein the method of determining the level of gene expression comprises measuring the levels of protein expression product in the sample from the patient.
8. The method of Claim 7, wherein the presence and level of the protein expression products are detected using a reagent which specifically binds with the proteins.
9. The method of Claim 7 or 8, wherein the reagent is selected from the group consisting of an antibody, an antibody derivative and an antibody fragment.
10. The method of any one of Claims 1 to 6, wherein the levels of expression in the sample are assessed by measuring the levels in the sample of the transcribed polynucleotides of the two or more gene in Table 9.
11. The method of Claim 10, wherein the transcribed polynucleotide is an mRNA.
12. The method of Claim 10 or 11, wherein the transcribed polynucleotide is a cDNA.
13. The method of any one of Claims 10 to 12, wherein the step of detecting further comprises amplifying the transcribed polynucleotide.
14. The method of any one of Claims 1 to 13, wherein the method is performed ex vivo.
15. A method of treating a subject afflicted with ovarian cancer, the method comprising providing to cells of the subject an antisense oligonuceotide complimentary to one or more of the genes whose expression is up-regulated in ovarian cancer as shown in Table 6.
16. A method of inhibiting ovarian cancer in a subject at risk for developing ovarian cancer, the method comprising inhibiting expression of one or more of the genes shown in Table 6 to be up-regulated in ovarian cancer.
17. A kit for use in determining treatment strategy for a patient with suspected ovarian cancer comprising: a) two or more antibodies able to recognize and bind to the polypeptide expression product of the two or more of the genes in Table 9; b) a container suitable for containing the said antibodies and a sample of body fluid from the said individual wherein the antibody can contact the polypeptide expressed by the two or more genes shown in Table 9 if they are present; c) means to detect the combination of the said antibodies with the polypeptides expressed by the two or more genes shown in Table 9; and d) instructions for use and interpretation of the kit results.
18. A kit for use in determining the presence or absence of ovarian cancer in a patient comprising: a) two or more polypeptides able to recognize and bind to the mRNA expression product of the two or more genes shown in Table 9; b) a container suitable for containing the said polynucleotides and a sample of body fluid from the said individual wherein the said polynucleotide can contact the mRNA, if it is present; c) means to detect the levels of combination of the said polynucleotide with the mRNA from the two or more genes shown in Table 9; and d) instructions for use and interpretation of the kit results.
PCT/EP2004/007167 2003-07-02 2004-07-01 Genes regulated in ovarian cancer as prognostic and therapeutic targets Ceased WO2005005661A2 (en)

Priority Applications (7)

Application Number Priority Date Filing Date Title
EP04740533A EP1644522A2 (en) 2003-07-02 2004-07-01 Genes regulated in ovarian cancer as prognostic and therapeutic targets
MXPA05014220A MXPA05014220A (en) 2003-07-02 2004-07-01 Genes regulated in ovarian cancer as prognostic and therapeutic targets.
BRPI0412110-4A BRPI0412110A (en) 2003-07-02 2004-07-01 genes regulated in ovarian cancer as prognostic and therapeutic targets
CA002531091A CA2531091A1 (en) 2003-07-02 2004-07-01 Genes regulated in ovarian cancer as prognostic and therapeutic targets
US10/562,406 US20060281091A1 (en) 2003-07-07 2004-07-01 Genes regulated in ovarian cancer a s prognostic and therapeutic targets
AU2004256182A AU2004256182A1 (en) 2003-07-02 2004-07-01 Genes regulated in ovarian cancer as prognostic and therapeutic targets
JP2006518084A JP2007526749A (en) 2003-07-02 2004-07-01 Genes regulated in ovarian cancer as a prognostic and therapeutic target

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US48446503P 2003-07-02 2003-07-02
US60/484,465 2003-07-02

Publications (2)

Publication Number Publication Date
WO2005005661A2 true WO2005005661A2 (en) 2005-01-20
WO2005005661A3 WO2005005661A3 (en) 2005-07-14

Family

ID=34062046

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2004/007167 Ceased WO2005005661A2 (en) 2003-07-02 2004-07-01 Genes regulated in ovarian cancer as prognostic and therapeutic targets

Country Status (8)

Country Link
EP (1) EP1644522A2 (en)
JP (1) JP2007526749A (en)
CN (1) CN1845999A (en)
AU (1) AU2004256182A1 (en)
BR (1) BRPI0412110A (en)
CA (1) CA2531091A1 (en)
MX (1) MXPA05014220A (en)
WO (1) WO2005005661A2 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006110587A3 (en) * 2005-04-07 2007-03-08 Novartis Vaccines & Diagnostic Ddr2 in cancer diagnosis, detection and treatment
EP1806413A1 (en) * 2006-01-06 2007-07-11 Oligene GmbH An in-vitro-method and means for determination of different tumor types and predicting success of surgical procedures in ovarian cancer
WO2010023458A1 (en) * 2008-09-01 2010-03-04 Immunovia Ab Method of diagnosing or prognosing epithelial ovarian cancer
EP2092081A4 (en) * 2006-11-13 2010-06-09 Helsingin Yliopiston Rahastot METHODS AND USES INVOLVING GENETIC ANOMALIES ON CHROMOSOME 12
CN103920150A (en) * 2014-04-17 2014-07-16 北京大学第一医院 Application of SATB1 to treatment of cutaneous T cell lymphomata

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA3256492A1 (en) 2010-01-11 2025-11-29 Genomic Health Inc Method to use gene expression to determine likelihood of clinical outcome of renal cancer
AU2011295013B2 (en) * 2010-08-27 2017-03-30 Universitaetsklinikum Muenster Means and methods for the detection of a predisposition of a female subject to recurrent pregnancy loss (RPL), preeclampsia (PE) and/or fetal growth restriction (FGR)
ES2829415T3 (en) 2013-05-30 2021-05-31 Genomic Health Inc Gene expression profiling algorithm to calculate a recurrence score for a kidney cancer patient
JP6436477B2 (en) * 2014-05-13 2018-12-12 国立大学法人大阪大学 Pharmaceutical composition for cancer treatment
JP6168625B2 (en) * 2016-01-12 2017-07-26 国立研究開発法人産業技術総合研究所 Epithelial ovarian cancer differentiation marker
CN106841622B (en) * 2017-04-07 2019-01-18 南通大学附属医院 Application of the SCML2 in diagnosis gastro-entero-pancreatic tumor
CN108085392A (en) * 2018-01-09 2018-05-29 山大生殖研发中心有限公司 Biomarkers for epithelial ovarian cancer and uses thereof
CN110241198A (en) * 2019-05-30 2019-09-17 成都吉诺迈尔生物科技有限公司 A kind of genome recombination fingerprint and its identification method characterizing hHRD HR defective
CN112229999B (en) * 2020-09-01 2022-04-22 浙江省肿瘤医院 Prognostic diagnosis marker Claudin21 for ovarian cancer and application thereof
CN113684277B (en) * 2021-09-06 2022-05-17 南方医科大学南方医院 Method for predicting ovarian cancer homologous recombination defect based on biomarker of genome copy number variation and application
CN118393036B (en) * 2024-04-28 2024-11-29 中国医学科学院北京协和医院 Protein markers for risk stratification of ovarian yolk sac tumor and their application

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001018542A2 (en) * 1999-09-03 2001-03-15 Millennium Pharmaceuticals, Inc. Identification, assessment, prevention, and therapy of ovarian cancer
AU4592601A (en) * 2000-03-21 2001-10-03 Millennium Predictive Medicine Novel genes, compositions, kits, and method for identification, assessment, prevention, and therapy of ovarian cancer
US20030003479A1 (en) * 2001-04-19 2003-01-02 Millennium Pharmaceutical, Inc. Compositions, kits, and methods for identification, assessment, prevention, and therapy of ovarian cancer
WO2003021229A2 (en) * 2001-09-05 2003-03-13 The Brigham And Women's Hospital, Inc. Diagnostic and prognostic tests

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006110587A3 (en) * 2005-04-07 2007-03-08 Novartis Vaccines & Diagnostic Ddr2 in cancer diagnosis, detection and treatment
JP2008535855A (en) * 2005-04-07 2008-09-04 ノバルティス ヴァクシンズ アンド ダイアグノスティクス インコーポレイテッド DDR2 in cancer diagnosis, detection and treatment
JP4942219B2 (en) * 2005-04-07 2012-05-30 ノバルティス ヴァクシンズ アンド ダイアグノスティクス インコーポレイテッド DDR2 in cancer diagnosis, detection and treatment
EP1806413A1 (en) * 2006-01-06 2007-07-11 Oligene GmbH An in-vitro-method and means for determination of different tumor types and predicting success of surgical procedures in ovarian cancer
EP2092081A4 (en) * 2006-11-13 2010-06-09 Helsingin Yliopiston Rahastot METHODS AND USES INVOLVING GENETIC ANOMALIES ON CHROMOSOME 12
WO2010023458A1 (en) * 2008-09-01 2010-03-04 Immunovia Ab Method of diagnosing or prognosing epithelial ovarian cancer
CN103920150A (en) * 2014-04-17 2014-07-16 北京大学第一医院 Application of SATB1 to treatment of cutaneous T cell lymphomata

Also Published As

Publication number Publication date
JP2007526749A (en) 2007-09-20
EP1644522A2 (en) 2006-04-12
CA2531091A1 (en) 2005-01-20
CN1845999A (en) 2006-10-11
BRPI0412110A (en) 2006-11-21
MXPA05014220A (en) 2006-03-09
WO2005005661A3 (en) 2005-07-14
AU2004256182A1 (en) 2005-01-20

Similar Documents

Publication Publication Date Title
JP4619350B2 (en) Diagnosis and prognosis of breast cancer patients
US8019552B2 (en) Classification of breast cancer patients using a combination of clinical criteria and informative genesets
US9909185B2 (en) Diagnosis and prognosis of breast cancer patients
US20030064385A1 (en) Genes expressed in breast cancer as prognostic and therapeutic targets
WO2005005661A2 (en) Genes regulated in ovarian cancer as prognostic and therapeutic targets
EP2195451A1 (en) Expression profiles of biomarker genes in notch mediated cancers
JP2007506442A (en) Gene expression markers for response to EGFR inhibitors
AU2004211955A1 (en) Gene expression markers for response to EGFR inhibitor drugs
JP2008524986A (en) Genetic changes useful for predicting malignant tumor response to taxane-based drug therapy
KR20080063343A (en) Gene Expression Profiling for Identification of Prognostic Subclasses in Nasopharyngeal Carcinoma
AU2004283225A1 (en) Methods and kits for investigating cancer
JP2015061528A (en) Prognosis prediction of melanoma cancer
EP2307570B1 (en) Molecular signature of liver tumor grade and use to evaluate prognosis and therapeutic regimen
JP2010166915A (en) Method for predicting edema as side effect of drug treatment
WO2003087404A1 (en) Methods to predict patient responsiveness to tyrosine kinase inhibitors
US20070160528A1 (en) Method to evaluate the likelehood of development of bone metastasis based on the determination of calcium binding proteins
US20060281091A1 (en) Genes regulated in ovarian cancer a s prognostic and therapeutic targets
US20070141618A1 (en) Genes expressed in breast cancer as prognostic and therapeutic targets

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200480025033.6

Country of ref document: CN

AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2004256182

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 2004740533

Country of ref document: EP

Ref document number: PA/a/2005/014220

Country of ref document: MX

ENP Entry into the national phase

Ref document number: 2531091

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 12/CHENP/2006

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2006518084

Country of ref document: JP

ENP Entry into the national phase

Ref document number: 2004256182

Country of ref document: AU

Date of ref document: 20040701

Kind code of ref document: A

WWP Wipo information: published in national office

Ref document number: 2004256182

Country of ref document: AU

WWP Wipo information: published in national office

Ref document number: 2004740533

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2006281091

Country of ref document: US

Ref document number: 10562406

Country of ref document: US

ENP Entry into the national phase

Ref document number: PI0412110

Country of ref document: BR

WWP Wipo information: published in national office

Ref document number: 10562406

Country of ref document: US

WWW Wipo information: withdrawn in national office

Ref document number: 2004740533

Country of ref document: EP