WO2003100030A2 - Kidney toxicity predictive genes - Google Patents
Kidney toxicity predictive genes Download PDFInfo
- Publication number
- WO2003100030A2 WO2003100030A2 PCT/US2003/006196 US0306196W WO03100030A2 WO 2003100030 A2 WO2003100030 A2 WO 2003100030A2 US 0306196 W US0306196 W US 0306196W WO 03100030 A2 WO03100030 A2 WO 03100030A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- genes
- gene sequences
- members
- combo
- predictive
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/158—Expression markers
Definitions
- This application contains a gene sequence listing and four tables submitted on a compact disc whose file name is 'Tables for Burning", created on February 27, 2003, containing 5 files and is herein incorporated by reference in its entirety.
- the five files are (a) a gene sequencing Table 32 (403 KB), in Microsoft® Word®, (b) Table 38 (785 KB) in Microsoft Excel®, (c) Table 39 (957KB) in Excel, (d) Table 40 (992 KB) in Excel, and (e) Table 45 (57KB) in Excel.
- This invention is the field of toxicology. More specifically, it relates to kidney toxicity predictive genes and the methods of using such genes to predict kidney toxicity.
- Molecular biology and genomics technologies have potential to create dramatic advances and improvements for the science of toxicology as for other biological sciences. See, for example, MacGregor, et al. Fund. Appl. Tox. 26:156- 173, 1995; Rodi et al., Tox. Pathology 27:107-110, 1999; Cunningham et al., Ann. N.Y. Acad. Sci. 919: 52-67, 2000; Pritchard et al., Proc. Natl. Acad. Sci. USA 98:13266-13271 , 2001 ; and Fielden and Zacharewski, Tox.
- the invention provides kidney toxicity predictive genes and predictive models which are useful to predict toxic responses to one or more agents.
- the invention provides methods of predicting kidney toxicity in an individual exposed to an agent which include the steps of: (a) obtaining a biological sample from an individual treated with the agent or treating a biological sample obtained from an individual with the agent or treating in vitro cultured cells or explants with the agent; (b) obtaining a gene expression profile from the biological sample or in vitro cultured cells or explants; and (c) using the gene expression profile from the biological sample or cells treated with the agent as a test set and a database of gene expression profiles and toxicity classifications as a training set and using kidney toxicity predictive genes and a Predictive Model to determine whether the agent will induce kidney toxicity in the individual or would be predicted to produce kidney toxicity following in vivo exposure.
- the predictive model utilizes expression profiles from sets of kidney toxicity predictive gene(s) selected from Combination 6, infra, wherein the set is one or more kidney toxicity predictive gene(s). In other embodiments, the predictive model utilizes expression profiles from sets of one or more kidney toxicity predictive gene(s) selected from Combination 5, 4, 3, 2, or 1 , wherein the set is one or more kidney toxicity predictive gene(s).
- the invention provides methods for determining the presence or absence of a no-observable effect level (NOEL) of an agent by the steps of: (a) obtaining biological samples from individuals treated with the agent at different dose levels or treating a biological sample obtained from an individual with different dose levels of the agent or treating in vitro cultured cells or explants with different dose levels of the agent; (b) obtaining gene expression profiles of the samples; and (d) using the gene expression profile from the biological samples as a test set and a database of gene expression profiles and toxicity classifications as a training set and using kidney toxicity predictive genes and a Predictive Model to determine or predict whether and at which dose levels the agent will induce kidney toxicity.
- NOEL no-observable effect level
- the predictive model utilizes expression profiles from sets of kidney toxicity predictive gene(s) selected from Combination 6, infra, wherein the set is one or more kidney toxicity predictive gene(s). In other embodiments, the predictive model utilizes expression profiles from sets of one or more kidney toxicity predictive gene(s) selected from Combination 5, 4, 3, 2, or 1 , wherein the set is one or more kidney toxicity predictive gene(s).
- the predictive genes and models may be used with an in vitro system to identify in vitro systems that can be used to accurately predict in vivo toxicity and to use the identified in vitro systems to accurately predict in vivo toxicity.
- the invention provides methods of identifying a kidney toxicity predictive gene in an individual including the steps of: (a) providing a set of candidate toxicity predictive genes; (b) evaluating said genes for their predictive performance with at least one training and test set of data in a predictive model to identify genes which are predictive of kidney toxicity; and (c) testing the performance of predictive genes for their ability to predict kidney toxicity for different training and test sets of data, for prediction of accurate compared to random classification and prediction of test data external to the data used to derive the predictive genes, in one embodiment, the candidate toxicity predictive genes are rat toxicity genes.
- the invention provides methods for determining the presence or absence of a no-observable effect level (NOEL) of an agent by the steps of: (a) obtaining biological samples from individuals treated with the agent at different dose levels or treating a biological sample obtained from an individual with different dose levels of the agent or treating in vitro cultured cells or explants with different dose levels of the agent; (b) obtaining gene expression profiles of the samples; and (d) using the gene expression profile from the biological samples as a test set and a database of gene expression profiles and toxicity classifications as a training set and using kidney toxicity predictive genes and a Predictive Model to determine or predict whether and at which dose levels the agent will induce kidney toxicity.
- NOEL no-observable effect level
- the predictive model utilizes expression profiles from sets of kidney toxicity predictive gene(s) selected from Combination 6, infra, wherein the set is one or more kidney toxicity predictive gene(s). In other embodiments, the predictive model utilizes expression profiles from sets of one or more kidney toxicity predictive gene(s) selected from Combination 5, 4, 3, 2, or 1 , wherein the set is one or more kidney toxicity predictive gene(s).
- the invention provides a computer program product which includes a set of kidney toxicity predictive genes derived from mining a database having a plurality of gene expression profiles indicative of toxicity.
- the set of kidney toxicity predictive genes includes at least one toxicity predictive gene from combination 6, 5, 4, 3, 2, or 1 list.
- the invention provides a library of information about kidney toxicity predictive genes produced by the methods disclosed herein.
- the invention provides an integrated system for predicting kidney toxicity comprising: an array reader modified to read gene expression profiles from biological samples exposed to a test agent, operably linked to a computer comprising a database file having a plurality of kidney toxicity predictive genes.
- FIG. 1 is a flow diagram illustrating the identification of kidney toxicity predictive genes.
- the pathway is given for discovery of kidney toxicity predictive genes using the database of expression array data (Rat CT array) and toxicity data for kidney samples from rats treated with various compounds (see Table 1).
- Gene with expressions correlating with pathology were determined using a variety of correlation statistics (see for example Tables 2 and 3).
- Predictive model used was the GeneSpring Predict Parameter Value model that employs a K-nearest neighbor model.
- Figure 2 is a graph which shows the percent of overall correct calls as a function of the number of predictivity genes using histopathology correlating genes (Pearson measure) as the input gene list with Training and Test Set A. The percent of overall correct calls is presented as a function of the number of kidney toxicity predictivity. genes.
- the input genes list consisted of 66 genes that showed a statistically significant correlation with the histopathology scores using Pearson's correlation measure (r-value >0.4). Training and Test Set A was used with other model values of 10 nearest neighbors and a p-value ratio cutoff of 0.5. An optimum gene number of 49 was observed (lowest number of genes giving the highest percent overall calls) for this case.
- FIG. 3 is a flow diagram illustrating how kidney toxicity predictive genes are evaluated for performance. Performance of predictive model is evaluated using 6 sets of training and test data (Rat CT expression array data). The training and test sets have accurate classification assignments (histopathology "yes” or “no” for each sample) or random classifications assignments ("yes” and "no” randomly assigned to samples). The K-nearest neighbor model is used with input being lists of predictive genes, as indicated, and the training and test set data. Four different measures of prediction are considered as indicated.
- Figure 4 is a graph that shows the cumulative predictive performance of
- Combo 6 genes The mean, minimum and maximum percent accuracy for 6 training and test sets are presented for Combo 6 genes that were used cumulatively in the order given in Table 14.
- Figure 5 is a graph that shows the cumulative predictive performance of
- Combo 5 genes The mean, minimum and maximum percent accuracy for 6 training and test sets are presented for Combo 5 genes that were used cumulatively in the order given in Table 14.
- Figure 6 is a graph that shows the cumulative predictive performance of
- Combo 4 genes The mean, minimum and maximum percent accuracy for 6 training and test sets are presented for Combo 4 genes that were used cumulatively in the order given in Table 14.
- Figure 7 shows the k-means and tree cluster analysis of Combo 6 genes.
- Figure 8 shows the Wards cluster analysis of Combo 6 gene set.
- Figure 9 shows a scanned autoradiogram of a Western blot of serum samples from 8 animals probed with antibodies to clusterin and insulin-like growth factor binding protein 1. Sample information is indicated in the figure. The figure also presents transcriptional differential expression levels of the insulin-like growth factor binding protein 1 gene observed in kidney samples from these animals.
- Table 1 lists the compounds, dose levels, kidney pathology and abbreviations in the database.
- Table 2 lists genes whose expression at 24h directly correlates with kidney tubular necrosis at 72h, ranked by Pearson correlation coefficient.
- Table 3 lists genes whose expression at 24h inversely correlates with kidney tubular necrosis at 72h, ranked by Spearman correlation coefficient.
- Table 4 lists the distribution of compounds in individual training and test sets for 24 hour kidney data.
- Table 5 lists the predictive genes for 24 hour expression data.
- Table 7 lists the randomly selected gene subsets from 24 h combo 6 gene set (28 genes).
- Table 8 lists the randomly selected gene subsets from 24 h combo 5 gene set (25 genes).
- Table 9 lists the randomly selected gene subsets from 24 h combo 4 gene set (23 genes).
- Table 10 lists the randomly selected gene subsets from array genes excluding combo all set.
- Table 11 lists the kidney toxicity individual sample prediction values for 24 hour data predictive genes (combined list and subsets).
- Table 12 lists the kidney toxicity compound-dose prediction values for 24 hour data predictive genes (combined list and subsets).
- Table 13 lists the kidney toxicity compound prediction values for 24 hour data predictive genes (combined list and subsets).
- Table 14 lists the order of genes used for cumulative analysis of predictive performance of predictive combo gene sets.
- Table 15 lists the individual gene predictions for combo 6.
- Table 16 lists the individual gene predictions for combo 5.
- Table 17 lists kidney toxicity individual sample prediction values for 24 hour data with random gene subsets.
- Table 18 lists the comparison of predictivity for true kidney toxicity classification and random classification using combo gene sets and random subsets and 24 hour data.
- Table 19 lists the distribution of compounds in individual training and test sets for 6 hour kidney data.
- Table 20 lists the genes whose expression at 6 hours directly correlates with kidney tubular necrosis at 72 hours, ranked by Pearson correlation coefficient.
- Table 21 lists the genes whose expression at 6 hours inversely correlates with kidney tubular necrosis at 72 hours, ranked by Spearman correlation coefficient.
- Table 22 lists the genes whose expression at 6 hours is predictive of kidney toxicity at 72 hours.
- Table 23 lists the kidney toxicity compound-dose prediction values for 6 hour data predictive genes (combined list and subsets).
- Table 24 lists the distribution of compounds in individual training and test sets for the 72 hour kidney data.
- Table 25 lists the genes whose expression at 72 hours directly correlates with kidney tubular necrosis at 72 hours, ranked by Pearson correlation coefficient.
- Table 26 lists the genes whose expression at 72 hours inversely correlates with kidney tubular necrosis at 72 hours, ranked by Spearman correlation coefficient.
- Table 27 lists the genes whose expression at 72 hours is predictive of kidney toxicity at 72 hours.
- Table 28 lists the kidney toxicity compound-dose prediction values for 72 hour data predictive genes (combined list and subsets).
- Table 29 lists the predictive performance of various models.
- Table 30 lists the logistic discrimination coefficients.
- Table 31 lists the prediction of kidney toxicity for samples external to database.
- Table 32 lists the genes predictive for kidney tubular necrosis, sequences, and accession numbers.
- Table 33 lists the kidney predictive genes (376 genes) organized by time point and combo category.
- Table 34 lists the RCT genes (ESTs) predictive for kidney tubular necrosis: best homology matches.
- Table 35 lists the genes that are predictive at all three time points.
- Table 36 lists the genes that are the most predictive across the time points.
- Table 37 lists the kidney toxicity predictive genes whose protein products are known to be secreted. The genes are from the table listing all the kidney predictive genes at the three time points 6, 24 and 72 hours. The protein products are easier to access since they are secreted into body fluids and are thus more amenable to be quantified. Therefore these proteins could be monitored in body fluids of subjects such as humans and toxicity predictions could be made.
- Table 38 lists the expression data for the 6 hour timepoint.
- Table 39 lists the expression data for the 24 hour timepoint.
- Table 40 lists the expression data for the 72 hour timepoint.
- Table 41 lists the predictive performance of predictive genes organized by occurrence on training/test set lists (combo number) and time point.
- Table 42 lists the summary output of the predictive computer software product.
- Table 43 lists the detailed output of the predictive computer software product.
- Table 44 lists protein marker candidate identification information that includes the gene name, % correct calls, average fold induction for negative histopathology samples, and average fold induction for positive histopathology samples. [69] Table 45 lists input data used for the predictive computer program product.
- This invention relates to methods of predicting whether an agent or other stimulus is capable of inducing kidney toxicity in a recipient organism using predictive molecular toxicology analysis.
- the invention provides methods of predicting kidney toxicity that comprise analyzing gene and/or protein expression across a number of kidney toxicity biomarkers disclosed herein for patterns of expression that correlate with and are predictive of kidney tubule necrosis in the recipient organism. This endpoint is significant because mortality in patients is high for acute renal failure and tubular necrosis is associated with many causes such as ischemia, endotoxemia or exposure to nephrotoxins (Ueda et al., Am. J. Med. 108: 403-415, 2000).
- the invention is based, in part, upon the discovery that modulated transcriptional regulation of relatively small sets of certain genes in response to a test agent can accurately predict the occurrence of kidney toxicity observed at later time points.
- kidney toxicity biomarkers which are useful in the practice of the kidney toxicity prediction methods of the invention.
- applicants have identified 376 kidney toxicity biomarkers that demonstrate utility in predicting kidney toxicity outcomes. These biomarkers have been thoroughly characterized for their predictive performance, individually as well as in various combinations or subsets thereof.
- various optimized subsets of the kidney toxicity biomarkers of the invention are disclosed, which sets have also been thoroughly characterized for predictive performance using the methods of the invention.
- subsets of kidney toxicity genes provided herein are several which demonstrate prediction accuracies in the vicinity of 95%.
- the methods of the invention are capable of distinguishing between agent dose levels which induce toxicity (typically higher doses) and those doses that are non-toxic. This latter feature is an essential component of meaningful toxicological evaluation.
- Toxic or "toxicity” refers to the result of an agent causing adverse effects, usually by a xenobiotic agent administered at a sufficiently high dose level to cause the adverse effects.
- kidney toxicity biomarker and “kidney toxicity predictive gene” are used interchangeably and refer to a gene whose expression, measured at the RNA or protein level can predict the likelihood of a kidney toxicity response with accuracy significantly better than would occur by chance.
- the kidney toxicity response is tubular necrosis.
- the kidney toxicity response can be other toxicity manifestations that elicit similar detectable gene expression changes. These could include other forms of tubular injury, glomerular toxicity and papillary injury.
- a "toxicological response” refers to a cellular, tissue, organ or system level response to exposure to an agent. At the molecular level, this can include, but is not limited to, the differential expression of genes encompassing both the up- and down- regulation of expression of such genes at the RNA and/or protein level; the up- or down-regulation of expression of genes which encode proteins associated with response to and mitigation of damage, the repair or regulation of cell damage; or changes in gene expression due to changes in populations of cells in the tissue or organ affected in response to toxic damage.
- An "agent” or “compound” is any element to which an individual can be exposed and can include, without limitation, drugs, pharmaceutical compounds, household chemicals, industrial chemicals, environmental chemicals, other chemicals, and physical elements such as electromagnetic radiation.
- biological sample refers to substances obtained from an individual.
- the samples may comprise cells, tissue, parts of tissues, organs, parts of organs, or fluids (e.g., blood, urine or serum).
- Biological samples include, but are not limited to, those of eukaryotic, mammalian or human origin.
- sample is defined for the purposes of prediction as a biological sample and the gene expression data for that sample. Each sample comes from an individual animal. A toxicity classification may also be associated with the sample.
- Gene expression refers to the relative levels of expression and/or pattern of expression of a gene. In some embodiments, the expression refers to a toxicity gene or toxic response gene. In other embodiments, the expression is of a toxicity predictive gene.
- Gene expression profile refers to the relative levels of expression of multiple different genes measured for the same sample. Gene expression profiles may be measured in a sample, such as samples comprising a variety of cell types, different tissues, different organs, or fluids (e.g., blood, urine, spinal fluid, sweat, saliva or serum) by various methods including but not limited to microarray technologies and quantitative and semi-quantitative RT-PCR (e.g., TaqmanTM) techniques, as well as techniques for measuring expression of proteins.
- a sample such as samples comprising a variety of cell types, different tissues, different organs, or fluids (e.g., blood, urine, spinal fluid, sweat, saliva or serum) by various methods including but not limited to microarray technologies and quantitative and semi-quantitative RT-PCR (e.g., TaqmanTM) techniques, as well as techniques for measuring expression of proteins.
- RT-PCR e.g., TaqmanTM
- “Individual” refers to a vertebrate, including, but not limited to, a human, non- human primate, mouse, hamster, guinea pig, rabbit, cattle sheep, pig, chicken, and dog.
- the terms “hybridize”, “hybridizing”, “hybridizes” and the like, used in the context of polynucleotides are meant to refer to conventional hybridization conditions, such as hybridization in 50% formamide/6X SSC/0.1% SDS/100 ⁇ g/ml ssDNA, in which temperatures for hybridization are above 37 degrees Celsius and temperatures for washing in 0.1 X SSC/0.1% SDS are above 55 degrees Celsius, and preferably to stringent hybridization conditions.
- Nucleic acids will hybridize will depend upon factors such as their degree of complementarity as well as the stringency of the hybridization reaction conditions. Stringent conditions can be used to identify nucleic acid duplexes with a high degree of complementarity. Means for adjusting the stringency of a hybridization reaction are well-known to those of skill in the art. See, for example, Sambrook, et al., "Molecular Cloning: A Laboratory Manual,” Second Edition, Cold Spring Harbor Laboratory Press, 1989; Ausubel, et al., “Current Protocols In Molecular Biology,” John Wiley & Sons, 1996 and periodic updates; and Hames et al., "Nucleic Acid Hybridization: A Practical Approach,” IRL Press, Ltd., 1985.
- conditions that increase stringency include higher temperature, lower ionic strength and presence or absence of solvents; lower stringency is favored by lower temperature, higher ionic strength, and lower or higher concentrations of solvents.
- identity is used to express the percentage of amino acid residues at the same relative position which are the same.
- homology is used to express the percentage of amino acid residues at the same relative positions which are either identical or are similar, using the conserved amino acid criteria of BLAST analysis, as is generally understood in the art. Further details regarding amino acid substitutions, which are considered conservative under such criteria, are provided.
- A. Generation of Toxicology Gene Expression Biomarkers The kidney toxicity biomarkers described herein were initially identified utilizing a database generated from large numbers of in vivo experiments, wherein the differential expression of approximately 700 rat genes, measured at various time points, in response to multiple toxic compounds inducing various specific toxic responses, as visualized through microscopic histopathological analysis, was quantified, as described in pending United States Patent Application filed January 29, 2002 (serial number not yet assigned). This quantitative gene expression data, as well as corresponding histopathological information, was then subjected to an analytical approach specifically designed to identify genes which not only correlated with the observed histopathology, but also demonstrated an ability to be used in a model capable of accurately predicting the occurrence of the toxic response associated with the observed histopathology. A complete description of this identification process is presented in the Examples. A flow diagram illustrating how the kidney toxicity biomarkers of the invention were identified is presented in Figure 1.
- kidney toxicity gene expression databases may be generated using techniques well known in the art, and used to identify additional kidney toxicity biomarkers, which may also be employed in the practice of the kidney toxicity prediction methods of the invention.
- Such databases may be generated with test compounds capable of inducing various pathologies indicative of a toxic response in the kidney and/or other organs or systems, over different time periods and under different administration and/or dosing conditions, including without limitation kidney tubule necrosis, glomerular necrosis, glomerular sclerosis and papillary injury.
- An example of compounds, dose levels, kidney toxicity classifications and histopathology scores used in the Examples which follow is provided in Table 1.
- Such databases may be generated using organisms other than the rat, including without limitation, animals of canine, murine, or non-human primate species. In addition, such databases may incorporate data derived from human clinical trials and post-approval human clinical experiences.
- Various methods for detecting and quantitating the expression of genes and/or proteins in response to toxic stimuli may be employed in the generation of such databases, as are generally known in the art. For example, microarrays comprising multiple cDNAs or oligonucleotide probes capable of hybridizing to corresponding transcripts of genes of interest may be used to generate gene expression profiles. Additionally, a number of other methods for detecting and quantitating the expression of gene transcripts are known in the art and may be employed, including without limitation, RT-PCR techniques such as TaqMan®, RNAse protection, branched chain, etc.
- Databases comprising quantitative gene expression information preferably include qualitative and quantitative and/or semi-quantitative information respecting the observed toxicological responses and other conventional toxicology endpoints, such as for example, body and organ weights, serum chemistry and histopathology observations, histopathology scores and/or similar parameters.
- the database preferably includes histopathology scores for each animal which has been exposed to one or more agent(s). These scores can be assigned based on actual histopathology observations for the tissue and animal or on the basis of effects observed for other animals treated with the same agent and dose level.
- the scores are numerical scores that reflect the occurrence and severity of histopathological changes. These scores can be adjusted to have similar range to gene expression changes. For example, a score of 1 could be assigned to samples with no changes and scores of 28 assigned to increasingly severe changes. Because the scores are numerical, they are suitable for use with a variety of statistical correlation and similarity measures.
- Example 1 An example of a histopathology scoring system is provided in Example 1.
- histopathology scores may be utilized to identify genes which correlate with the observed toxicological response, using any number of statistical correlation and similarity analysis techniques, including without limitation those techniques described or employed in Example 1 (e.g., Pearson, Spearman, change, smooth, distance etc.). Such correlating genes may be used as predictive gene candidates. Examples of genes whose expression at 24 hours after treatment correlates with histopathology observed at 72h are detailed in Tables 3 and 4. In one embodiment, the correlating gene lists as well as the entire array gene list are used as input gene lists in the GeneSpringTM Predictive Model (otherwise known hereafter as "Predictive Model").
- (C) Class Prediction and Classification Statistical analysis of the database of gene expression profiles can be effected by utilizing commercially available software programs.
- GeneSpringTM Very 4.1 , Silicon Genetics, Redwood City, CA
- Other software programs which can be used for statistical analysis include, without limitation, SAS software packages (SAS Institute Inc., Cary, NC) and S-PLUS® software (Insightful Corporation, Seattle, WA)
- class predictions can be made from the genes in the database, as detailed in Example 1 , using one or more training and test sets.
- six training sets and six test sets are obtained, as shown in Example 1 (Table 4).
- Kidney toxicological classifications are entered for the samples in each training and test set.
- Toxicological classifications can be defined by various pathologies.
- the toxicity is defined as kidney tubular necrosis observed 72 hours after treatment with an agent. However, toxicity can manifest in other nephropathologies such as glomerular necrosis or papillary injury.
- predicted classifications of the test set samples are obtained by using k-nearest neighbor (or knri) voting procedure.
- the class of each of the knn is determined and the test sample is assigned to the class with the largest representation after adjusting for the proportion of classifications in the training set. In one embodiment, adjustments are made to account for different proportions of classes in the training set.
- Toxicity can also be observed at various time points after exposure to an agent and is not limited to only 72 hour after treatment.
- a skilled toxicologist can determine the optimal time after exposure to an agent to observe pathology by either what has been disclosed in the art or a stepwise experimentation with time increments, for example 2, 4, 6, 12, 18, 24, 36, 48 hours post-exposure or even longer time increments, for example, days, weeks, or months after exposure to the agent.
- Figure 1 describes the overall process used to identify kidney toxicity predictive genes. In one embodiment, this process was run independently for each time point.
- the number of genes that are to be used in the Predictive Model can be varied, for example 50, 40, 30, 20, 10, 5, 2, or 1 gene(s) can be used. In a preferred embodiment, at least 50 genes are used.
- optimum gene lists for all input gene lists are combined for each training and test set and then these combined lists for all six training and test sets are merged to create an aggregate list of predictive genes.
- the aggregate list can then be subdivided to smaller lists of genes based on the number of times that the genes occurred on the predictive gene lists for each individual training or test set. These are designated herein as Combo 6, 5, 4, 3, 2, or 1 lists.
- the genes that were predictive in all 6 training and test sets are designated as Combo 6 and the genes that were predictive in 5 of 6 training and test sets are designated as Combo 5 and so forth.
- Table 32 presents gene names, accession numbers and sequence information for the kidney toxicity predictive genes found by analysis of the database in the manner described above.
- Table 33 lists the kidney toxicity predictive genes organized by time point and Combo Class.
- Table 34 lists homologous genes for the RCT sequences that were identified by BLAST search using the GenBank NR database as the target database. [102] The predictive genes can also be categorized by their occurrence as predictive at different time points.
- Table 35 lists 53 genes that are on the combined predictive lists of all three time points tested. This list is derived from the list of all the predictive genes measured at 6, 24 and 72 hours that predicted kidney tubular necrosis at 72 hours. Genes that are predictive at multiple time points can be further grouped by their Combo ranking.
- Table 36 lists 23 genes that are the most predictive across the three time points tested.
- This list is a subset of the list of 53 genes that are predictive across all three time points 6, 24 and 72 hours.
- the criteria for inclusion in this table were that the gene be a member of the highest combinations, viz., combinations 6, 5 or 4 in at least 2 out of three time points.
- the gene expression data of the genes in Table 36 could be expected to be very highly predictive of kidney tubular necrosis. Further, since the predictive strength of these genes is very high across the 3 time points tested, it could be expected that gene expression data derived from these genes even at time points not tested such as any time points falling between 6 and 72 hours or any other time point would be very highly predictive of tubular necrosis.
- These specific genes could be useful in cases where the dose route or pharmacokinetic properties of a compound may alter the kinetics of predictive gene expression changes.
- Example 1 list or subsets thereof was used as input into the Predictive Model.
- Example 2 describes the evaluation of the predictive performance of the kidney toxicity predictive genes.
- Predictive performance may also be assessed using data from different time points after exposure to the agent.
- 24 hour expression data is used.
- 6 hour expression data is used, as described in Examples 3 and 4.
- 72 hour expression data is used, as described in Example 5 and 6.
- Table 41 predictive capability for 24 hour expression data has a high accuracy rate (i.e., 90% accuracy) when the entire predictive gene list is used.
- Predictive performance may also be assessed using subsets of genes from the different Combo lists. As indicated in Examples 2, 4 and 6 randomly selected subsets of the Combo gene lists had very good predictive performance (accuracy better than 80% and approaching 90%) and even individual genes had mean predictive accuracies that were significant (for example, greater than 80%). Cumulative performance of subsets of 24 h data is presented in Figures 4-6. In one embodiment, using 3 genes from Combo list 6 yields about 90% accuracy. However, using different Combo lists may require more genes to reach the same accuracy level, e.g., 8 genes from Combo 5 list, 13 genes from Combo 4 list.
- kidney toxicity predictive genes The kidney toxicity predictive genes disclosed herein and kidney toxicity predictive genes identified by using methods disclosed herein are useful for predicting kidney toxicity in response to exposure to one or more agents.
- the use of larger numbers of predictive genes provides for redundancy and consequent greater accuracy and precision. Applications using larger numbers of predictive genes might be tests of candidates at later stages of commercial development. An example would be later stages of preclinical development of a therapeutic candidate where in vivo samples can be obtained and more comprehensive methods such as microarray measurement of gene expression are appropriate.
- the larger gene sets can also include different subsets of genes which may offer more insight into potential mechanisms of toxicity and the ability to have refined predictions of long term toxic consequences such as chronic, irreversible toxicity or carcinogenicity.
- kidney toxicity predictive genes may also be suitable for prediction of toxicity in other organs or may be preferable for predicting toxicity for wider ranges of timepoints or treatment routes or regimens. As an example of the latter, some of the predictive genes are observed at three different timepoints after treatment. These genes may be useful for prediction in cases where the samples come from treatment protocols that have different measurement timepoints or routes of administration than those employed for the database or where the toxicokinetics for a particular agent are known or suspected to be different from those in the database.
- the agent is an agent for which no expression profile has been assessed or stored in the database or library.
- An animal e.g., rat
- the gene expression profile(s) is the test set for the Predictive Model.
- the training set which is used in the Predictive Model in this case can be the entire database of sample array data because the test set data is not present in the database.
- the prediction can be made with accuracy without requiring the use of histopathology scores for the test set as part of the input into the Predictive Model.
- the agent is an agent present in the database but is used at a different dose level or with a different treatment protocol than used in the database.
- the training set which is used in the Predictive Model in this case can be the entire database of sample array data because the test set data is not present in the database.
- the prediction can be made with accuracy without requiring the use of histopathology scores for the test set as part of the input into the Predictive Model.
- the exposure time of the agent is not 6, 24, or 72 hours or repeat dosing protocols are used.
- the skilled artisan can use the toxicity predictive genes from surrounding time points to extrapolate the predicted toxicity without undue experimentation. For example, if the individual has been exposed to the agent for 12 hours, then predictive genes from 6 and 24 hours timepoints are used as guidelines for extrapolating possible predicted toxicity.
- the kidney predictive genes and predictive model can be used to determine the presence or absence of a no-observable toxicity effect level (NOEL).
- NOEL no-observable toxicity effect level
- An agent can be used at different treatment levels and expression profiles obtained for each treatment level.
- the predictive genes and predictive model can be used to determine which dose levels elicit a response that is predicted to be toxic and which dose levels are not toxic.
- the use of expression data, predictive genes and predictive models applies a number of quantitative endpoints and criteria instead of subjective endpoints and criteria. This permits more rigorous and precisely defined determination of no effect levels.
- kidney toxicity predictive genes can be used to detect toxic effects that may be manifested as long lasting or chronic consequences such as irreversible toxicity or carcinogenesis.
- the predictive genes and model can be applied to databases where classifications of training and test set samples are made with respect to actual or putative endpoints such as irreversible toxicity or carcinogenicity.
- the predictive genes can be used in a variety of alternative models to predict kidney toxicity. Some of these models do not require the direct use of data in a database but use functions or coefficients derived from the database.
- the predictive genes and models may be used to evaluate in vitro systems for their ability to reflect in vivo toxic events and to use such in vitro systems for predicting in vivo toxicity. Expression profiles for predictive genes can be created from candidate in vitro assays using treatments with agents of known in vivo toxicity and for which in vivo data on gene expression are available. The expression data and predictive models of this invention can be used to determine whether the in vitro assay system has predictive gene expression responses that accurately reflect the in vivo situation. Large sets of predictive genes as described in this invention can be tested in such models for their suitability and performance with the candidate in vitro systems. This is a superior and novel tool for evaluating and optimizing in vitro systems for their ability to reflect and accurately predict in vitro responses.
- measurement of the expression levels of the proteins coded for by the predictive genes can be used in conjunction with predictive models to predict kidney toxicity.
- kidney toxicity predictive genes are various genes known to encode cell surface, secreted and/or shed proteins. This enables the development of methods for predicting toxicity using protein biomarkers.
- Example 11 presents a process by which candidate protein biomarker genes may be selected from biomarker genes identified from transcription expression. For example, as disclosed in Table 37, there are 23 genes in the master predictive set which are known to encode secreted proteins. As disclosed in Table 43, predictive protein marker candidates may also be selected by categorizing a number of other parameters related to the predictive performance and potential use as protein markers.
- Example 11 the utility of this concept has been demonstrated by testing for serum protein levels of one of the identified biomarkers, insulin-like growth factor binding protein 1.
- the serum protein levels of this biomarker parallel the kidney transcription levels and distinguish kidney toxic from non-toxic treatments.
- kidney toxicity predictive assays which detect the expression of one or more of said predictive proteins may be developed. Such assays may have several advantages, such as:
- the identified predictive genes can be considered as potential therapeutic targets when the genes are involved in toxic damage or repair responses whose expression or functional modification may attenuate, ameliorate or eliminate disease conditions or adverse symptoms of disease conditions.
- the predictive genes can be organized into clusters of genes that exhibit similar patterns of expression by a variety of statistical procedures commonly used to identify such coordinately expression patterns.
- Common functional properties of these clustered genes can be used to provide insight into the functional relationship of the response of these genes to toxic effects.
- Common genetic properties of these genes e.g., common regulatory sequences
- the presence of common known or novel signal transduction systems that regulate expression of the genes can also lead to insight as to the functional properties of the genes.
- the presence of common known or novel regulatory sequences in the identified predictive genes can also be used to identify toxicity predictive genes that are not present in the current Rat CT array. This can be accomplished by someone skilled in the art who can analyze sequence databases for common regulatory sequences.
- the kidney toxicity predictive genes can be used to predict toxicity responses in other species, for example, human, non-human primate, mouse, hamster, guinea pig, rabbit, cattle, sheep, pig, chicken, and dog. Some members of the kidney toxicity predictive genes may also be more suitable for prediction of toxicity in species other than the species used to derive the database (rat in the case of the examples provided).
- One method for identification of such genes is that would be available to someone skilled in the art would be to examine DNA sequence databases to determine whether orthologous sequences to the predictive genes exist in the target species and how close the orthologous sequences are to the predictive gene sequences.
- One of skill in the art can examine the orthologous sequences for similarity in amino acid coding regions and motifs as well as for similarities in regulatory regions and motifs of the gene.
- kidney toxicity predictive genes or gene sequences are used for screening other potential toxicity predictive genes or gene sequences in other species or even within the same species using methods known in the art. See, for example, Sambrook supra. Gene sequences which hybridize under stringent conditions to the kidney toxicity predictive gene sequences disclosed herein are selected as potential toxicity predictive genes. Gene sequences which hybridize to the kidney toxicity predictivity gene of this invention can show homology to the kidney toxicity predictivity genes, preferably at least about 50%, 60%, 70%, 80%, or 90% identical to the kidney toxicity predictivity genes disclosed herein. It is understood that conservative substitutions of amino acids are possible for gene sequences which have some percentage homology with the kidney toxicity predictive gene sequences of this invention.
- a conservative substitution in a protein is a substitution of one amino acid with an amino acid with similar size and charge.
- Groups of amino acids known normally to be equivalent are: (a) Ala, Ser, Thr, Pro, and Gly; (b) Asn, Asp, Glu, and Gin; (c) His, Arg, and Lys; (d) Met, Glu, lie, and Val; and (e) Phe, Tyr, and Trp.
- the toxicity predictive genes can be used as guides to predicting toxicity for agents that have been administered via different routes (, intravenous, oral, dermal, inhalation, I, etc.) from the routes that were used to generate the database or to identify the toxicity predictive genes.
- the invention is not intended to be limiting to agents that have been administered at different dosages than the agents that were used to generate the database or to identify the toxicity predictive genes.
- mice Charles River, Raleigh, NC were divided into treated rats that receive a specific concentration of the compound (see Table 1 ) and the control rats that only received the vehicle in which the compound is mixed (e.g., saline).
- a specific concentration of the compound see Table 1
- the control rats that only received the vehicle in which the compound is mixed (e.g., saline).
- kidney tissue was weighed out and placed in a sterile container. To preserve integrity of the RNA, all tissues were kept on dry ice when other samples were being weighed. A RLT (Qiagen®) buffer buffer was added to the sample to aid in the homogenization process.
- the tissue was homogenized using commercially available homogenizer ( IKA Ultra Turrax T25 homogenizer) with the 7 mm microfine sawtooth shaft and generator (195 mm long with a processing range of 0.25 ml to 20 ml, item # 372718). After homogenization, samples were stored on ice until all samples were homogenized. The homogenized tissue sample was spun to remove nuclei thus reducing DNA contamination.
- Rat 700 CT chip Gene expression data was generated from a microarray chip that has a set of toxicologically relevant rat genes which are used to predict toxicological responses.
- the rat 700 CT gene array is disclosed in U.S. applications 60/264,933; 60/308,161 ; and pending application filed on January 29, 2002 that claims priority to 60/264,933 and 60/308,161 [Attorney docket 40074-2000600].
- Microarray RT reaction Fluorescence-labeled first strand cDNA probe was made from the total RNA or mRNA isolated from kidneys of control and treated rats. This probe was hybridized to microarray slides spotted with DNA specific for toxicologically relevant genes. The materials needed are: total or messenger RNA, primer, Superscript II buffer, dithiothreitol (DTT), nucleotide mix, Cy3 or Cy5 dye, Superscript II (RT), ammonium acetate, 70% EtOH, PCR machine, and ice.
- each sample that would contain 20 yg of total RNA (or 2 ⁇ g of mRNA) was calculated.
- the amount of DEPC water needed to bring the total volume of each RNA sample to 14 ⁇ l was also calculated. If RNA was too dilute, the samples were concentrated to a volume of less than 14 ⁇ in a speedvac without heat. The speedvac must be capable of generating a vacuum of 0 Milli-Torr so that samples can freeze dry under these conditions. Sufficient volume of DEPC water was added to bring the total volume of each RNA sample to 14 ⁇ l.
- Each PCR tube was labeled with the name of the sample or control reaction. The appropriate volume of DEPC water and 8 ⁇ of anchored oligo dT mix (stored at -20°C) was added to each tube.
- PCR tube The samples were mixed by pipeting. The tubes were kept on ice until all samples are ready for the next step. It is preferable for the tubes to kept on ice until the next step is ready to proceed. The samples were incubated in a PCR machine for 10 minutes at 70°C followed by 4°C incubation period until the sample tubes were ready to be retrieved. The sample tubes were left at 4°C for at least 2 minutes.
- Cy dyes are light sensitive, so any solutions or samples containing Cy- dyes should be kept out of light as much as possible (e.g., cover with foil) after this point in the process. Sufficient amounts of Cy3 and Cy5 reverse transcription mix were prepared for one to two more reactions than would actually be run by scaling up the following:
- the completed RT reaction contained impurities that must be removed. These impurities included excess primers, nucleotides, and dyes.
- the primary method of removing the impurities was by following the instructions in the QIAquick PCR purification kit (Qiagen cat#120016).
- the completed RT reactions were cleaned of impurities by ethanol precipitation and resin bead binding.
- the samples from DNA engine were transferred to Eppendorf tubes containing 600 ⁇ l of ethanol precipitation mixture and placed in -80°C freezer for at least 20-30 minutes. These samples were centrifuged for 15 minutes at 20800 x g (14000 rpm in Eppendorf model 5417C) and carefully the supernatant was decanted. A visible pellet was seen (pink/red for Cy3, blue for Cy5). Ice cold 70% EtOH (about 1 ml per tube) was used to wash the tubes and the tubes were subsequently inverted to clean tube and pellet.
- the tubes were centrifuged for 10 minutes at 20800 x g (14000 rpm in Eppendorf model 5417C), then the supernatant was carefully decanted. The tubes were air dried for about 5 to 10 minutes, protected from light. When the pellets were dried, they were resuspended in 80 ul nanopure water. The cDNA/mRNA hybrid was denatured by heating for 5 minutes at 95°C in a heat block and flash spun. Then the lid of a "Millipore MAHV N45" 96 well plate was labeled with the appropriate sample numbers. A blue gasket and waste plate (v-bottom 96 well) was attached.
- the filter plate was placed on a clean collection plate (v-bottom 96 well) and 80 ⁇ l of Nanopure water, pH 8.0-8.5 was added. The pH was adjusted with NaOH. The filter plate was secured to the collection plate and after 5 minutes was centrifuged for 7 minutes at 2500 rpm.
- Probes were added to the appropriate wells (80 ⁇ l cDNA samples) containing the Binding Resin.
- the reaction is mixed by pipeting up and down -10 times. It is preferable to use regular, unfiltered pipette tips for this step.
- the plates were centrifuged at 2500 rpm for 5 minutes (Beckman GS-6 or equivalent) and then the filtrate was decanted. About 200 ⁇ l of 80% isopropanol was added, the plates were spun for 5 minutes at 2500 rpm, and the filtrate was discarded. Then the 80% isopropanol wash and spin step was repeated.
- the filter plate was placed on a clean collection plate (v-bottom 96 well) and 80 ⁇ l of Nanopure water, pH 8.0-8.5 was added.
- the pH was adjusted with NaOH.
- the filter plate was secured to the collection plate with tape to ensure that the plate did not slide during the final spin.
- the plate sat for 5 minutes and was centrifuged for 7 minutes at 2500 rpm. Replicates of samples should be pooled.
- (G) Dry-down Process Concentration of the cDNA probes is preferable so that they can be resuspended in hybridization buffer at the appropriate volume.
- the volume of the control cDNA (Cy-5) was measured and divided by the number of samples to determine the appropriate amount to add to each test cDNA (Cy-3).
- Eppendorf tubes were labeled for each test sample and the appropriate amount of control cDNA was allocated into each tube.
- the test samples (Cy-3) were added to the appropriate tubes. These tubes were placed in a speed-vac to dry down, with foil covering any windows on the speed vac. At this point, heat (45°C) may be used to expedite the drying process. Samples may be saved in dried form at -20°C for up to 14 days.
- Hybridization Buffer for 100 ⁇ l:
- the hybridization buffer was made up as:
- Hybridization Buffer for 101 ⁇ l:
- the slides were then moved to 2X SSC, 0.1% SDS and soaked for 5 minutes.
- the slides were transferred into 0.1X SSC and 0.1% SDS for 5 minutes.
- the slides are transferred to 0.1 X SSC for 5 minutes.
- the slides, still in the slide carrier were transferred into nanopure water (18 megaohms) for 1 second.
- the stainless steel slide carriers were placed on micro-carrier plates and spun in a centrifuge (Beckman GS-6 or equivalent) for 5 minutes at 1000 rpm.
- GenePix files as above. Initially, set A training set compounds (see Table 4) data from one microarray was used per animal. Next, set A test set compounds (see Table 4) replicate arrays for each animal were combined into one GenePix file. Specific data loaded into GeneSpringTM software included gene name, GenBank ID control channel mean fluorescence and signal channel mean fluorescence. Expression ratio data (ratio of signal to control fluorescence) were normalized using the 50 th percentile of the distribution of all genes and control channel. Ratio data were excluded from analysis if the control channel value was ⁇ 0. For analysis of correlations and predictive values gene expression ratios were transformed as the log of the ratio.
- Histopathology scores for each animal were entered with gene expression data by using the GeneSpringTM 'Drawn Gene' function.
- the first step is variable selection of genes to be used for prediction. This entails taking a single gene and a single class (e.g., kidney toxicity) and creating a contingency table.
- columns 1 through N of the table each represent one possible cutoff point based on the gene expression level (ratio of signal/control) for that class.
- the number of possible cutoffs is less than or equal to the total number of samples for the class (e.g., A). It is possibly less than the total number, since there may be ties in gene expression level.
- N, M, and X may or may not be distinct.
- n-class problem is illustrated, where and /entries are the class counts at that gene expression cutoff level, for that specific gene and class, either above (“a") or below (“b") the cutoff.
- Classl is the set of all samples (above or below) the cutoff for Classl
- !Class1 are all those not in Classl (above or below) the cutoff, and similarly for the other classes.
- the class totals in the training set are the total class marginals used to compute Fisher's exact test.
- the genes per class are rank ordered by the most discriminating (highest) score.
- the predictivity list is composed of the most discriminating genes per class. Namely, genes are combined that best discriminate class 1 with those that best discriminate class 2 and so on. The genes are selected in rotation of the highest score per class. Duplicate genes are ignored in the rotation and not added to the list, the gene with the next highest score is taken.
- each sample is a vector of 50 normalized expression ratios. Since the selection of genes is done in rotation, the list contains 25 genes for one class, and 25 for the other class.
- the matrix below illustrates the basic features of this gene selection process.
- the test set is classified based on the / -nearest neighbor (knn) voting procedure. Using just those genes in the gene list, for each sample in the test set of samples, the k nearest neighbors in the training set are found with the Euclidean distance. The class in which each of the k nearest neighbors is determined, and the test set sample is assigned to the class with the largest representation in the k nearest neighbors after adjusting for the proportion of classes in the training set.
- knn / -nearest neighbor
- the decision threshold is a mechanism to help clearly define the class into which the sample will fall, and can be set to reject classification if the voting is very close or tied. (Thus, k can be even for two-class problems without worrying about the tie problem.) A p-value is calculated for the proportion of neighbors in each class against the proportions found in the training set, again using Fisher's exact test, but now a one-sided test.
- a p-value ratio is set as a way of setting the level of confidence in individual sample predictions based on the ratio of p-values for the best class (lowest p-value) versus the second best class (second lowest p-value). For example, if the P-value is set at 0.5 and the ratio of p-values for a particular sample is 0.6, then the predictive model will not make a call for that sample.
- Training and Test Data Sets Data were each separated into 6 training and test sets. The first training and test set was created by allocating one set of data as a training set (Set A training set) and another set of data as a test set (Set A test set). Other training and test sets were created by randomly distributing the compounds into the sets. This was accomplished by assigning random numbers to lists of compounds that are negative and positive for histopathology, sorting by random number, and then dividing the sorted lists into a specific number of training and test sets. The training and test set assignments are presented in Table 4.
- Kidney Toxicology Classification Kidney toxicity classifications were entered for training and test set as a parameter column. Toxicity, as defined by observation of kidney tubular necrosis in the kidney at 72 hours after treatment, was entered as a "yes” or "no" for each animal in a compound-dose group. Additionally, a parameter column for random histopathology classification was designated. This was done by randomly assigning the same number of "yes” and "no" calls to the individual animals.
- Kidney toxicology classifications used are described in Table 1. In this analysis randomized classifications (same number of "yes” and “no" classifications distributed randomly among the samples) were used.
- False positive rate is the proportion of negative cases that are incorrectly classified as positive is calculated as: b/a+b.
- False negative rate is the proportion of positive cases that are incorrectly classified as negative is calculated as: c/c+d.
- One noteworthy feature of the predictive ability is the ability to distinguish between effects of a compound at different dose levels.
- Combo 6 and Combo 5 (The top combo subsets with the highest levels, 92.1% and 89.6%, respectively, of predictive accuracy on an individual sample basis) for 24 hour kidney data.
- Example 1 Materials and Methods: Compounds and treatments list used to construct the kidney database are given in Example 1. This table also provides the evaluation of the kidney toxicity observed as kidney tubular necrosis in samples collected 72 hours after treatment. The database is described in detail in Example 1. This Example analyzes expression data from samples collected 6 hours after treatment. Array data, normalization and transformation procedures used were as described in Example 1. Procedures and methods for obtaining gene lists correlating with histopathology scores were as described in Example 1 with scores as in Example 1. The Predict Parameter Values tool in GeneSpringTM software used for kidney toxicity class prediction is described in detail in Material and Methods of Example 1. [202] (B) Training and Test Data Sets:Data were each separated into 6 training and test sets.
- the first training and test set was created by allocating one set of data as a training set (Set A training set) and another set of data as a test set (Set A test set). Other training and test sets were created by randomly distributing the compounds into the sets. This was accomplished by assigning random numbers to lists of compounds that are negative and positive for histopathology, sorting by random number, and then dividing the sorted lists into a specific number of training and test sets.
- the training and test set assignments are presented in Table 19.
- Kidney toxicity classifications were entered for training and test set as a parameter column. Toxicity, as defined by observation of kidney tubular necrosis in the kidney at 72 hours after treatment, was entered as a "yes” or "no" for each animal in a compound-dose group. Additionally, a parameter column for random histopathology classification was designated. This was done by randomly assigning "yes” and "no” calls to the individual animals. The total number of "yes” and “no” calls was maintained the same as in the correct classification, so that the proportion of "yes” and no calls was the same in all the training and test sets.
- Input genes for the Predict Parameter Value feature included all 700 genes in the GenePix file (the rat CT Array) as well as smaller lists of genes whose expressions correlated with histopathology by the correlation measures described previously.
- the number of genes used to predict are varied with standard numbers of 50, 40, 30, 20, 10, 5, 2 and 1 genes used.
- the specified number of predictive genes was varied to obtain an optimum number of predictive genes.
- Hour Expression Data (A) Materials and Methods: The database used was as described in Example 1. Array data, normalization procedures and transformations used in these analyses are as described in Example 1. Table 38 presents 6 hour gene expression data for the predictive genes. These data can be used with a k- means nearest neighbor prediction model (as available in GeneSpring or other statistical software packages) to make predictions as described in this example. The Predict Parameter Values tool in GeneSpringTM software was used for kidney toxicity class prediction. A description of this tool and the statistical procedures used is provided in Example 1.
- Kidney Toxicology Classification Kidney toxicology classifications used are described in Example 1. In this analysis randomized classifications (same number of "yes” and “no" classifications distributed randomly among the samples) were used.
- (E) Prediction Measures Measures of prediction used for these analyses are generally accepted prediction measures for information about actual and predicted classifications done by a classification system (Venables and Ripley, ibid and Kubat and Matwin, ibid). Results from predictions of a two class case can be described as a two-class matrix as described above.
- Example 1 Materials and Methods: Compounds and treatments list used to construct the kidney database are given in Example 1. This table also provides the evaluation of the kidney toxicity observed as kidney tubular necrosis in samples collected 72 hours after treatment. The Database is described in detail in Example 1. This Example analyzes expression data from samples collected 6 hours after treatment. Array data, normalization and transformation procedures used were as described in Example 1. Procedures and methods for obtaining gene lists correlating with histopathology scores were as described in Example 1 with scores as in Example 1. The Predict Parameter Values tool in GeneSpringTM software used for kidney toxicity class prediction is described in detail in Material and Methods of Example 1.
- the first training and test set was created by allocating one set of data as a training set (Set A training set) and another set of data as a test set (Set A test set).
- Other training and test sets were created by randomly distributing the compounds into the sets. This was accomplished by assigning random numbers to lists of compounds that are negative and positive for histopathology, sorting by random number, and then dividing the sorted lists into a specific number of training and test sets.
- Kidney Toxicology Classification Kidney toxicity classifications were entered for training and test set as a parameter column. Toxicity, as defined by observation of kidney tubular necrosis in the kidney at 72 hours after treatment, was entered as a "yes” or "no" for each animal in a compound-dose group. Additionally, a parameter column for random histopathology classification was designated. This was done by randomly assigning "yes” and "no" calls to the individual animals. The total number of "yes” and “no” calls was maintained the same as in the correct classification, so that the proportion of "yes” and no calls was the same in all the training and test sets.
- Non-calls are cases where no prediction was made because the P-value ratio exceeded the specified P-value ratio cutoff Calculations were made for overall percent correct calls (number of correct classifications/number or samples), percent correct calls of called samples (number of correct classifications/number of samples with calls) and percent of called samples (samples with calls/number of samples).
- the correlating gene lists as well as the entire array gene list were provided as input lists to the GeneSpring Predict Parameter value tool (described in Materials and Methods) that employs a K-means nearest neighbor (knn) predictive model. These lists as well as the entire array gene list were used for each of the six training and test sets defined in Materials and Methods o generate predictions of histopathology classifications of the test sets.
- Input genes for the Predict Parameter Value feature included all 700 genes in the GenePix file (the Rat CT Array) as well as smaller lists of genes whose expressions correlated with histopathology by the correlation measures described previously.
- the number of genes used to predict are varied with standard numbers of 50, 40, 30, 20, 10, 5, 2 and 1 genes used.
- the specified number of predictive genes was varied to obtain an optimum number of predictive genes.
- each gene on this aggregate list has predictive value for at least one of the training and test sets because it was observed to contribute to an optimum predictivity for a specific training/test set.
- the aggregate list was subdivided into smaller lists of genes based on the number of times a gene was predictive for an individual training or test set. For example, if 6 training and test sets were used, genes that were predictive in all 6 training and test sets were designated as Combo (combination) 6. Genes that were predictive in only 5 of 6 training and test sets were designated as Combo 5, etc.
- Hour Expression Data (A) Materials and Methods: The Database used was as described in Example 1. Array data, normalization procedures and transformations used in these analyses are as described in Example 1. Table 40 presents 72 hour gene expression data for the predictive genes. These data can be used with a k- means nearest neighbor prediction model (as available in GeneSpring or other statistical software packages) to make predictions as described in this example. The Predict Parameter Values tool in GeneSpringTM software was used for kidney toxicity class prediction. A description of this tool and the statistical procedures used is provided in Example 1. The training and test data sets used are those described in Example 1.
- Kidney Toxicology Classification Kidney toxicology classifications used are described in Example 1. In this analysis randomized classifications (same number of "yes” and “no" classifications distributed randomly among the samples) were used.
- (D) Prediction Measures Measures of prediction used for these analyses are generally accepted prediction measures for information about actual and predicted classifications done by a classification system (Venables and Ripley, ibid and Kubat and Matwin, ibid). Results from predictions of a two class case can be described above.
- the database used for evaluation of these models was the 24 hour expression data for kidney samples described above. Expression data was for the Combo 6 set of predictive genes as described herein. Due to heteroscedasticity (i.e., the variance increases proportionately more than the mean increases) of the gene expression ratio data, a log transformation of the data is often considered. In general untransformed data was used but for some models log transformed data was used for comparison. Six training and testing sets were used that are the same as described in Example 1.
- a discrimination function is used to classify a training set. This function is cross-validated with a testing set, often repeatedly to quantify the mean and variation of the classification error. There are numerous common discrimination functions, and a comparative study of the performance of these functions is useful in determining the best classifier. Additional measures are then used to compare the performance of the classifiers. Since the classes are of significantly uneven sizes, use a geometric mean measure (GMM) was used to compare models, namely, the square root of the product of the true positives and the true negatives.
- GMM geometric mean measure
- (C) Classifier Models A variety of common classification techniques were evaluated. As an extension of the k-means nearest neighbor (knn) model a simple hybrid classifier was designed and tested, using the knn results, to transform the knn model into a database independent model. This model is termed a centroid model. The centroid model uses the correctly identified test data results from knn and locates a centroid of the subset of k samples that are of the same class for each correctly identified test sample. The centroid is assigned the correct class, and with new test data, a sample is assigned the class of its nearest centroid.
- knn k-means nearest neighbor
- Trees were pruned via ten-fold internal cross-validation, (i.e., using subsets of the training set) for each training set, and then the tree was used to predict the testing set.
- a GMM was thus calculated for each testing set.
- Trees perform the gene selection via pruning, and anywhere from one to five genes were selected for each tree.
- the centroid model is five-fold cross-validated using random subsets of the testing set. The mean of the GMM of each of the validation runs is used as the performance measure.
- the top five discriminating genes are used in the centroid models.
- the logistic discrimination uses a stepwise backwards selection process to determine the gene set during the training phase. Three to six genes are typically selected via this process. A single performance is then obtained using the corresponding testing set.
- a neural network is trained on each training set and then validated on the corresponding testing set. All 28 genes in the data set are used with the neural network model.
- Table 30 presents logistic discrimination coefficients derived from this analysis. These coefficients may be used in a logistic discriminant model to obtain predictions of kidney toxicity when expression ⁇ values for the indicated genes are determined using appropriate samples and an appropriate microarray expression detection system such as the Rat CT array used to develop the Database.
- the classification model for all of the data using a classification tree in S-Plus software provided the following rule for predicting toxicity: if Gadd45 ⁇ 1.474 AND Tissue inhibitor of metalloproteinases 1 ⁇ 1.786, then "No” (not toxic), otherwise "Yes” Toxic.
- (A)(1 ) Animal Treatment and Tissue Harvest Male Sprague-Dawley rats in groups of 3 were treated by intraperitoneal injection with test compounds (cephalosporidine, 1500 mg/kg and cisplatin, 20 mg/kg) or only with the vehicle in which the compound was mixed. At specified timepoints (6h and 24h) the rats were euthanized and tissues collected. Kidney tissues were immediately placed into liquid nitrogen and frozen within 3 minutes of the death of the animal to ensure that mRNA did not degrade. The tissues were sent blinded to be evaluated. The organs/tissues are then packaged into well-labeled plastic freezer quality bags and stored at -80 degrees until needed for isolation of the mRNA from a portion of the organ/tissue sample.
- (C) RESULTS Table 31 presents predictions for samples that were external to the database used to derive the predictive genes.
- the samples were kidney samples from replicate animals treated with cephaloridine and cisplatin.
- One of these compounds (cisplatin) is also represented in the database (at a different dose level) and the other compound, cephaloridine, is not in the database. Histopathology conducted on the kidney samples verified that these treatments induced kidney tubular necrosis.
- FIG. 7 presents combined results of K-means and gene-tree hierarchical clustering analysis.
- Combo 6 28 genes was clustered using K-means (number of cluster 10, maximum iteration 100, similarity measure Pearson) and Gene tree (separation ratio 0.5, minimum distance 0.001 , similarity measure Pearson).
- the k-means clusters are colored according to the corresponding set 1 to set 10.
- the gene names on the display from top to bottom correspond to left to right cluster bars.
- a computer program product produces a prediction of the occurrence of a kidney toxicity using input gene expression data from test samples.
- the model and data for the computer program have been primarily validated using Phase-1 Rat CT arrays and Phase-
- Rat CT expression data in the Phase-1 TOXBank database as described in previous examples may also be used in the computer program product.
- expression platforms such as TaqMan using Syber Green technology
- Those skilled in the art are capable of developing and validating scaling factors to adjust for differences in differential gene expression sensitivity and responsiveness among different platforms used in the computer program product.
- the computer program product uses the Predictive Model as described in the previous examples.
- the computer program product contains an encrypted training data set that includes differential gene expression values and an endpoint classification for each sample in the training set.
- the computer program product samples are from the same timepoint (e.g., gene expression measured at 24 hours after dosing) and the classification is binary for the specific endpoint (e.g., kidney tubular necrosis or no kidney tubular necrosis).
- the computer program product also contains encrypted lists of the Combo sets of predictive genes (also called Predictagen sets).
- Inputs to the Predictive Model of the computer program product are the c value for number of nearest neighbors and the type of distance measure to be used in the model.
- Data inputs for the Predictive Model include the Combo list(s) of predictive genes and training set as encrypted "plug-in" files and specification of a test data file(s) that has expression data.
- the initial prediction is made after calculating the probability that the tabulated votes are different from the proportion of votes in the training set for each classification.
- a statistical test (hypergeometric mean distribution) is run for each classification and p-values are calculated.
- the classification prediction would be that class that has the highest p-value.
- a classification cutoff procedure is used that uses the p-value ratio (1 - po/pi where po is the p-value for the not predicted class and pi is the p-value for the predicted class). If the p-value ratio does not exceed a specified cutoff value (input to the computer program product by the user) then a prediction is not made.
- the Prediction Machine can be used with multiple Predictagen sets with the classifications, p-values and p-value ratios calculated as above. In this case an overall prediction is made by combining the predictions of the individual Predictagen sets. Each Predictagen set is weighted by a performance number. The overall certainty for this combined prediction is calculated by a paired value Mest using the p-value ratio and (1 -p-value ratio) for each Predictagen set as a pair of values. The certitude is 1-p where p is the value for the paired value Mest.
- Encrypted training data is included as a plug-in module for the software.
- User input includes specification of encrypted Predictagen gene lists and samples for prediction (files with gene expression data). Additional specifications are distance measure to be used in the knn model (currently Euclidean), number of neighbors and a certitude cutoff (p-value ratio cutoff).
- the 'Load Predictagens' button is clicked on to load the desired predictagen(s).
- the 24 hour kidney Predictagen is loaded.
- a predictagen in the Predictagen sets list box is highlighted and the 'Make Predictor' button is clicked on (in this example, 24 hour kidney). If necessary, the predictor is highlighted and the 'Configure' button is clicked on to set parameter values.
- the 'Load Samples' button is clicked on. Sample data is loaded as text files in the format shown in Table 44. Samples from the Samples list box using the left mouse button are then selected, and the CTRL key is simultaneously selected to make multiple selections.
- 3 kidney samples from rats treated with 25 mg/kg paraquat and 3 kidney samples from rats treated with 80 mg/kg phenobarbital are selected.
- the samples were treated and processed for gene expression analysis as described in the previous examples.
- the 'Add to predictor' button is then clicked on, and the 'Predict' button is then clicked on to generate the program's output.
- the 'Summary', 'Detail', or 'Full' radio buttons are selected to control the amount of information displayed about the prediction.
- the 'Tabular Report' checkbox is checked to put the output in a format that can be loaded into Excel as tab-delimited text.
- the 'Save', 'Copy', 'Print', and 'Clear' buttons are selected to save the output, copy the output to the clipboard, print the output, or clear the output window prior to another prediction.
- the summary view displays sample information, the call (kidney tubular necrosis or negative), and the overall certitude.
- the detail view presents the individual calls and 1 -p-value ratio for each Predictagen, in addition to summary view information.
- the full view presents, for each sample and Predictagen gene list, the specific nearest neighbors and their classification (votes) along with the hypergeometric mean p values for each classification. At the end of this information detail view information is presented.
- Table 43 displays the test set of gene expression data used to generate predictions. The table shows the correct classification of kidney samples that have histopathology (kidney tubular necrosis) or no histopathology.
- Table 42 displays the summary output of the computer program after loading. Two out of three of the paraquat samples (sample #s 16477 and 16479) were correctly predicted for rat kidney tubular necrosis (with certitudes of 0.472 and 0.796). Three out of three of the phenobarbital samples were correctly predicted as negative for kidney tubular necrosis.
- Table 43 displays the detailed output of the computer program, which shows the individual performances of the 24 hour kidney Combo sets and the overall certitude score.
- Protein marker candidates can be selected from biomarker genes using a number of parameters. Table 44 presents biomarker genes sorted in order of their mean individual gene predictive performance (percent correct calls) for all genes exhibiting ⁇ 60% percent correct calls. Each gene was then evaluated for evidence whether it codes for a protein. This is clearly a key criterion for a protein marker. The next parameters evaluated were the relative transcriptional response in toxic versus non- toxic samples. If protein levels are proportional to RNA levels then these columns indicate the relative potential magnitude of the protein marker in toxic and non-toxic samples. The better marker candidates should be those genes exhibiting the larger differences in RNA expression. A number of additional criteria can be considered included protein MW, occurrence of the protein in tissues other than the target tissue and availability of antibodies which will recognize the protein.
- One important criterion may also be whether the protein is secreted.
- the last column in Table 44 indicates that 3 of the proteins are known to be secreted.
- Table 37 lists proteins known to be secreted derived from the total list of predictive genes. The property of secretion may be useful in identification of proteins which could be biomarkers in serum or possibly other matrices such as urine or saliva.
- Protein markers can be rapidly evaluated by testing for levels of the identified marker candidates using any of a number of analytical techniques for measuring specific protein levels such as Western blots or ELISA assays.
- Samples for analysis may be selected from a tissue bank such as that described in Example 1. Selection for analysis would include samples from toxic treatments and samples from non-toxic treatments.
- Quantitative protein marker data can be analyzed using the same approaches described in Example 2 for evaluation and validation of predictive performance of the protein markers.
- Combination category is the number of training/test set gene list occurrences Table 6 Randomly Selected Gene Subsets from 24 H Combo All (216 Genes)*
- Genes were randomly selected from the entire array list of genes excluding the Combo All 216 predictive genes by assigning a random number to each gene, sorting by the random number and selecting the appropriate number of sorted genes.
- Prediction measures are given as means and range of values (in parentheses) for six training/test sets using 24 hour array data and gene lists. Unit of prediction was the animal and the predictive classification was for kidney tubular necrosis observed at 72 hours after treatment.
- False positive rate Proportion of negative cases that are incorrectly classified as positive
- Geometric mean Performance measure that takes into account proportion of positive and negative cases
- Prediction measures are given as means and range of values (in parentheses) for six training/test sets using 24 hour array data and gene lists. Unit of prediction was compound-dose level and the predictive classification was for kidney tubular necrosis observed at 72 hours after treatment. Prediction for compound-dose was based on a majority of individual animal calls. In cases where there were an equal number of opposing calls or no calls a no-call was assigned to the compound-dose level.
- Prediction measures are given as means and range of values (in parentheses) for six training/test sets using 24 hour array data and gene lists. Unit of prediction was the compound and the predictive classification was for kidney tubular necrosis observed at 72 hours after treatment. Compounds were considered toxic if any compound-dose level for that compound was predicted as toxic.
- Beta-actin sequence 2
- Pancreatic secretory trypsin inhibitor type II PSTI-II
- Preproalbumin sequence 2 (alternate clone 1)
- Prediction measures are given as means and range of values (in parentheses) for six training/test sets using 24 hour array data and random subsets of genes. Unit of prediction was the animal and the predictive classification was for kidney tubular necrosis observed at 72 hours after treatment.
- Accuracy proportion of the total number of predictions that are correct. Non-calls are counted as incorrect predictions. Accuracy was calculated for correct classifications of kidney toxicity assigned to the samples and for randomized classifications in the same proportions as the correct classifications. Values presented are the mean accuracy values for 6 training/test sets with minimum and maximum accuracy values.
- Combination category is the number of training/test set gene list occurrences.
- Combination category is the number of training/test set gene list occurrences.
- Prediction measures are given as means and range of values (in parentheses) for six training/test sets using 72 hour array data and gene lists. Unit of prediction was the animal and the predictive classification was for kidney tubular necrosis observed at 72 hours after treatment.
- a Combo entry number indicates that the gene was on the predictive list for that time point and the number of occurrences of that gene on optimal combined training/test set lists. "Not Found” indicates that the gene was not on the optimal combined list for that time point.
- PC3 NGF-inducible anti-proliferative putative secreted protein
- Pancreatic secretory trypsin inhibitor type II PSTI-II
- Alpha prothymosin 1 104 118 106 -108 1
- Beta-actin sequence 2 -117 -115 -102 102 105 108
- Bile salt export pump (sister of p-gtycoprotein) 115 12 115 108 127 105
- Carbonic anhydrase III sequence 2 -114 -107 -121 123 105 123
- NCLK Cdc2 related protein kinase
- CNBP Cellular nucleic acid binding protein
- Complement factor I CFI
- Co ⁇ trapsin-iike protease inhibitor (CP ⁇ -21) -105 103 101 -102 -102 -106
- Disulfide isomerase related protein ERp72 117 102 111 11 101 112
- Enoyl CoA hydratase (mitochondnal) 122 117 117 127 114 118
- Epithelial sodium channel alpha subunit (alpha-ENaC) 1 1 -104 -104 101 109
- Fetuin-like protein (IRL685) 104 101 104 -127 -104 -117
- hypoxia-inducible factor 1 alpha 116 106 112 111 108 116
- Insulin-like growth factor I exon 6 -106 108 ⁇ 115 -104 -127 •111
- Interferon inducible protein 10 104 -107 -109 122 108 -108
- Interferon related developmental regulator IFRD1 (PC4) 133 109 117 -109 103 •107
- Peroxisome proliferator activated receptor alpha 114 114 108 109 125 128
- Peroxisome proliferator activated receptor gamma 1 103 102 111 -105 -11
- Phase-1 RCT 110 1 -117 -128 -106 106 113
- Phase-1 RCT-141 138 134 121 -102 103 104
- Phase-1 RCT-H9 177 131 115 -109 -109 -107 D hase-1 RCT 15 106 102 103 111 106 Pnase-1 RCT-150 -114 -104 ' -117 105 125 107- Phas ⁇ -1 RCT-151 -12 -11 ⁇ 121 107 102 108 Phase-1 RCT-152 -104 -111 12 -104 -106 -104 Phase-1 RCT 153 -144 129 131 -106 -102 -109 Phase-1 RCT-154 -107 -109 -102 103 -109 108 Phase-1 RCT-155 -11 -104 •115 -104 -103 -107 Phase-1 RCT-156 104 -103 •1 -107 -125 -108 Phase-1 RCT-158 196 135 256 -104 165 106 Phase-1 RCT-160 103 -111 -106 1 104 106 Phase-1 RCT-161 -13 -139 -129 -
- Phase- 1 RCT-282 " -115 -118 -117 -107 -104 -104 Phase-! RCT-283 -104 -103 -108 -103 -107 -103 Phase-1 RCT 284 103 -101 11 -111 105 -103 Phase-1 RCT-285 -103 -I 03 -107 -101 105 108 Phase-1 RCT-286 104 -105 -102 105 -1 ⁇ 101 Phase-1 RCT 287 -101 -101 -102 -106 103 -108 Phase-1 RCT-288 101 104 -108 103 102 •101 Phase-1 RCT-289 -103 101 -1 -101 105 114 Phase-1 RCT-29 -108 -116 -I 18 -1 -104 -103 Phase-1 RCT-290 124 115 103 106 119 105 Phase-1 RCT-291 -102 -106 -1 108 123 108 Phase-1 RCT-292 •11 -108 -106 104
- D hase-1 RCT-74 102 -111 -114 -102 -101 -106
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Analytical Chemistry (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Pathology (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Apparatus Associated With Microorganisms And Enzymes (AREA)
Abstract
Description
Claims
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP03741753A EP1506396A2 (en) | 2002-02-27 | 2003-02-27 | Kidney toxicity predictive genes |
| AU2003273154A AU2003273154A1 (en) | 2002-02-27 | 2003-02-27 | Kidney toxicity predictive genes |
| CA002477688A CA2477688A1 (en) | 2002-02-27 | 2003-02-27 | Kidney toxicity predictive genes |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US36112802P | 2002-02-27 | 2002-02-27 | |
| US60/361,128 | 2002-02-27 |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| WO2003100030A2 true WO2003100030A2 (en) | 2003-12-04 |
| WO2003100030A3 WO2003100030A3 (en) | 2004-12-23 |
| WO2003100030B1 WO2003100030B1 (en) | 2005-05-06 |
Family
ID=29584275
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2003/006196 Ceased WO2003100030A2 (en) | 2002-02-27 | 2003-02-27 | Kidney toxicity predictive genes |
Country Status (4)
| Country | Link |
|---|---|
| EP (1) | EP1506396A2 (en) |
| AU (1) | AU2003273154A1 (en) |
| CA (1) | CA2477688A1 (en) |
| WO (1) | WO2003100030A2 (en) |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7415358B2 (en) | 2001-05-22 | 2008-08-19 | Ocimum Biosolutions, Inc. | Molecular toxicology modeling |
| US7469185B2 (en) | 2002-02-04 | 2008-12-23 | Ocimum Biosolutions, Inc. | Primary rat hepatocyte toxicity modeling |
| US8603752B2 (en) | 2005-01-27 | 2013-12-10 | Institute For Systems Biology | Methods for identifying and monitoring drug side effects |
| WO2022064506A1 (en) * | 2020-09-24 | 2022-03-31 | Quris Technologies Ltd | Ai-chip-on-chip, clinical prediction engine |
-
2003
- 2003-02-27 EP EP03741753A patent/EP1506396A2/en not_active Withdrawn
- 2003-02-27 CA CA002477688A patent/CA2477688A1/en not_active Abandoned
- 2003-02-27 AU AU2003273154A patent/AU2003273154A1/en not_active Abandoned
- 2003-02-27 WO PCT/US2003/006196 patent/WO2003100030A2/en not_active Ceased
Non-Patent Citations (5)
| Title |
|---|
| DEBOUCK ET AL: 'DNA microarrays in drug discovery and development' NATURE GENETICS SUPPLEMENT vol. 21, January 1999, pages 48 - 50, XP002928673 * |
| DOOLEY ET AL: 'A method to improve selection of molecular targets by circumventing the ADME pharmacokinetic system utilizing PharmArray DNA microarrays' BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS vol. 303, 11 April 2003, pages 828 - 841, XP002982396 * |
| GERHOLD ET AL: 'Better therapeutics through microarrays' NATURE GENETICS SUPPLEMENT vol. 32, December 2002, pages 547 - 552, XP002982395 * |
| HUANG ET AL: 'Assessment of Cisplatin-induced nephrotoxicity by microarray technology' TOXICOLOGICAL SCIENCES vol. 63, 2001, pages 196 - 207, XP001096461 * |
| KRAMER ET AL: 'Overview of the application of transcription profiling using selected nephrotoxicants for toxicology assessment' ENVIRONMENTAL HEALTH PERSPECTIVES vol. 112, no. 4, March 2004, pages 460 - 464, XP002982394 * |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7415358B2 (en) | 2001-05-22 | 2008-08-19 | Ocimum Biosolutions, Inc. | Molecular toxicology modeling |
| US7426441B2 (en) | 2001-05-22 | 2008-09-16 | Ocimum Biosolutions, Inc. | Methods for determining renal toxins |
| US7469185B2 (en) | 2002-02-04 | 2008-12-23 | Ocimum Biosolutions, Inc. | Primary rat hepatocyte toxicity modeling |
| US8603752B2 (en) | 2005-01-27 | 2013-12-10 | Institute For Systems Biology | Methods for identifying and monitoring drug side effects |
| US9103834B2 (en) | 2005-01-27 | 2015-08-11 | Institute For Systems Biology | Methods for identifying and monitoring drug side effects |
| WO2022064506A1 (en) * | 2020-09-24 | 2022-03-31 | Quris Technologies Ltd | Ai-chip-on-chip, clinical prediction engine |
Also Published As
| Publication number | Publication date |
|---|---|
| AU2003273154A1 (en) | 2003-12-12 |
| AU2003273154A8 (en) | 2003-12-12 |
| WO2003100030A3 (en) | 2004-12-23 |
| EP1506396A2 (en) | 2005-02-16 |
| WO2003100030B1 (en) | 2005-05-06 |
| CA2477688A1 (en) | 2003-12-04 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CA2897828C (en) | Methods for identifying, diagnosing, and predicting survival of lymphomas | |
| US8014957B2 (en) | Genes associated with progression and response in chronic myeloid leukemia and uses thereof | |
| US20060063156A1 (en) | Outcome prediction and risk classification in childhood leukemia | |
| US7908090B2 (en) | Signatures for human aging | |
| US20070092888A1 (en) | Diagnostic markers of hypertension and methods of use thereof | |
| US11591655B2 (en) | Diagnostic transcriptomic biomarkers in inflammatory cardiomyopathies | |
| US20090258002A1 (en) | Biomarkers for Tissue Status | |
| AU2007244868A1 (en) | Methods and compositions for detecting autoimmune disorders | |
| JP2015154774A (en) | Methods and compositions for assessing graft survival in solid organ transplant recipients | |
| AU2006236588A1 (en) | Diagnosis of sepsis | |
| WO2003083140A2 (en) | Classification and prognosis prediction of acute lymphoblasstic leukemia by gene expression profiling | |
| KR20110057188A (en) | Biomarker Profile Measurement System and Method | |
| US20060204968A1 (en) | Tools for diagnostics, molecular definition and therapy development for chronic inflammatory joint diseases | |
| CA2674211A1 (en) | Methods and kits for diagnosis and/or prognosis of the tolerant state in liver transplantation | |
| AU2003233743A1 (en) | Identification of specific marker genes and use thereof | |
| WO2006135904A2 (en) | Method for producing improved results for applications which directly or indirectly utilize gene expression assay results | |
| CN101180407A (en) | Leukemia disease genes and uses thereof | |
| US20040067507A1 (en) | Liver inflammation predictive genes | |
| US20080096770A1 (en) | Evaluation of the Toxicity of Pharmaceutical Agents | |
| US20110130303A1 (en) | In vitro diagnosis/prognosis method and kit for assessment of tolerance in liver transplantation | |
| US20040076974A1 (en) | Liver necrosis predictive genes | |
| WO2003100030A2 (en) | Kidney toxicity predictive genes | |
| WO2005074540A2 (en) | Novel predictors of transplant rejection determined by peripheral blood gene-expression profiling | |
| EP1368499A2 (en) | Rat toxicologically relevant genes and uses thereof | |
| US7060444B1 (en) | Zone 3 necrosis associated markers and method of use thereof |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM ZW |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| WWE | Wipo information: entry into national phase |
Ref document number: 2477688 Country of ref document: CA |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2003741753 Country of ref document: EP |
|
| WWP | Wipo information: published in national office |
Ref document number: 2003741753 Country of ref document: EP |
|
| B | Later publication of amended claims |
Effective date: 20041217 |
|
| NENP | Non-entry into the national phase in: |
Ref country code: JP |
|
| WWW | Wipo information: withdrawn in national office |
Country of ref document: JP |
|
| WWW | Wipo information: withdrawn in national office |
Ref document number: 2003741753 Country of ref document: EP |