[go: up one dir, main page]

EP1604319A1 - Test d'elaboration informatique de proteines pour identifier des sites de liaison - Google Patents

Test d'elaboration informatique de proteines pour identifier des sites de liaison

Info

Publication number
EP1604319A1
EP1604319A1 EP03719372A EP03719372A EP1604319A1 EP 1604319 A1 EP1604319 A1 EP 1604319A1 EP 03719372 A EP03719372 A EP 03719372A EP 03719372 A EP03719372 A EP 03719372A EP 1604319 A1 EP1604319 A1 EP 1604319A1
Authority
EP
European Patent Office
Prior art keywords
macromolecule
binding
organic fragment
instances
identifying
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP03719372A
Other languages
German (de)
English (en)
Other versions
EP1604319A4 (fr
Inventor
Frank Guarnieri
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sarnoff Corp
Original Assignee
Sarnoff Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sarnoff Corp filed Critical Sarnoff Corp
Priority to EP07075671A priority Critical patent/EP1912145A3/fr
Priority claimed from PCT/US2003/007366 external-priority patent/WO2004081841A1/fr
Publication of EP1604319A1 publication Critical patent/EP1604319A1/fr
Publication of EP1604319A4 publication Critical patent/EP1604319A4/fr
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
    • G16B15/30Drug targeting using structural data; Docking or binding prediction

Definitions

  • the present invention relates to methods of identifying binding sites on proteins, methods for identifying classes of compounds suitable for binding a protein, and methods of conducting experiments to identify compounds that interact with a protein to affect a biological process.
  • Determinations of protein structures have to date been conducted by isolating crystals of the protein of interest, and analyzing structure by X-ray crystallography.
  • the protein has been co-crystallized with a heavy metal component, or subjected to multiple co-crystallizations, with the heavy metal providing a reference for solving the crystallographic data.
  • the present inventor has found a method of identifying, from a three- dimensional structural solution of a macromolecule, the binding sites for molecules.
  • the structural solution used as the basis for the method can be derived from crystallography, spectroscopic analyses such as NMR, computational derivations, or any other method of determining the structure of a macromolecule.
  • the method does not require or typically use information on the function of the macromolecule, as the method avoids subjective biases and instead depends purely on physical parameters. Further, the method can be refined further to narrow the possible choices of binding sites and identify the functionalities, i.e., organic fragments or "ORFs," that effectively interact with the binding site(s).
  • the data obtained for ORFs further identifies the orientations of the functionalities useful in a candidate binding agent, thereby providing a tool for searching chemical databases to identify candidate binding agents.
  • the methods described herein identify more than one potential binding site, the data generated through these methods can be used to energetically rank the binding sites, and thereby quantitatively determine which site has the potential to more strongly bind molecules.
  • the computational method described here generates maps of binding site preferences that are nearly identical with maps produced by compiling data generated by traditional methods, but with one important difference — the experimentally produced data took many years to produce while the data produced as described herein can be produced in no more than a few weeks.
  • the invention provides an important development in unbiased simulation methods for predicting the character of agents that bind to biological macromolecules to affect the function of the macromolecules.
  • a method of identifying binding sites on a macromolecule comprising: (a) for at least one organic fragment (ORF), conducting, at separate values of parameter B, two or more simulated annealing of chemical potential calculations using the ORF as the inserted solvent; and (b) comparing converged solutions from step (a) to identify first locations at which the relevant ORF is strongly bound, thereby identifying candidate sites for binding ligand molecules.
  • the method further comprises: (c) identifying clusters of sites that strongly bind an ORF.
  • the method further comprises:
  • step (d) conducting steps (a) and (b) for each of two or more ORFs and identifying clusters where two or more distinct ORFs bind.
  • a cluster that binds three or more distinct ORFs is identified.
  • the method can identify further functionalities that contribute to the binding of bioactive agents by reducing the binding stringency in the vicinity of a cluster to further identify elements that would contribute to the binding of a bioactive agent.
  • the method further comprises:
  • step (e) conducting, at separate values a measure of chemical potential, two or more simulated annealing of chemical potential calculations using water as the inserted solvent; (f) comparing converged solutions from step (c) to identify locations at which water is strongly bound, thereby identifying locations on the protein which are not candidate sites for binding ligand molecules; and (g) identifying first locations that are not water locations.
  • the simulated annealing of chemical potential calculations comprise multiple steps of sampling, and wherein in a number of steps of the sampling the ORFs position is changed by a small amount and the resulting new position is accepted or rejected based on the change in energy as a result of the change attempted.
  • Also provided is a method of conducting combinatorial chemistry to identify compounds that interact with a macromolecule comprising: (a) identifying classes of reactants that are modeled by the functionalities of the ORFs found in a cluster pursuant to the binding site identifying method of macromolecule; (b) designing a combinatorial synthetic protocol that calls for two or more synthetic procedures that react reagents of at least two of the classes identified in step (a); and (c) conducting the combinatorial synthetic protocol to create candidate binding molecules.
  • a method of conducting a bioactive agent discovery process comprising: (a) from a group of established combinatorial synthetic protocols or collections of chemical compounds or pools of chemical compounds, identifying those members of the group that provide a high density of compounds that meet for a macromolecule selection criteria identified from the binding site identifying method; and (b) conducting binding or functional assays to identify compounds obtained from the identified collections or protocols which bind or affect the function of the macromolecule.
  • Figure 1A illustrates a solved crystal structure
  • Figure IB displays the structure with a grid imposed.
  • Figures 2A-2D display the method of the invention applied to the crystallographic solution of elastase; the method can be exemplified using methanol as the ORF.
  • Figures 3A and 3B show the combined results for several ORFs bound to elastase after simulations at relatively low B values, with the results in Figure 3B filtered to identify clusters of these bound ORFs.
  • Figure 3C shows the two clusters of Figure 3B which remain after excluding strong water binding sites, and
  • Figure 3D shows the one cluster that remains after extending the analysis to another ORF;
  • Figure 3E shows the analysis extended to still a further ORF.
  • the panels of Figure 3F compare the simulation results to a co- crystallography result.
  • Illustrated in Figure 4A are the amide binding sites extracted from the data of six co-crystallization experiments with elastase and known ligands; and illustrated in Figure 4B is a cluster of the highest affinity amide binding sites determined by the simulation method of the invention.
  • Illustrated in Figure 4C are the amide ORFs of Figure 4B plus amides which are in the vicinity of the cluster but which appear in the simulation at second highest affinity binding values.
  • Figures 6A and 6B show the surfaces of elastase involved in binding ligands as indicated by the crystallographic data, Figure 6A, and as indicated by the solutions obtained using the method described herein, Figure 6B.
  • Figure 7 shows a schematic illustration of the type of titrations for water binding to a macromolecule that can be used to help identify a level of relatively strong water binding.
  • Bioactive agent refers to a substance such as a chemical that can act on a cell, virus, organ or organism, including but not limited to drugs (i.e., pharmaceuticals) to create a change in the functioning of the cell, virus, organ or organism.
  • drugs i.e., pharmaceuticals
  • the method of identifying bioactive agents of the invention is applied to organic molecules having molecular weight of about 600 or less or to polymeric species such as peptides, proteins, nucleic acids, proteoglycans and the like.
  • a bioactive agent can be a medicament, i.e. a substance used in therapy of an animal, preferably a human.
  • Cluster of free grid points refers to free grid points that are within a “cluster” in that, relative to a given ORF, there is a sufficient number of nearby or adjacent free grid points to allow a reasonable probability that the ORF could be inserted at the cluster.
  • the cluster of free grid points for H 2 O must be defined to identify all volumes at the surface or interior of a macromolecule that could accommodate H 2 O — though the selection criteria should err to identifying some volumes that do not accommodate H 2 O, as needed to assure that all appropriate volumes are sampled in the simulation process.
  • a cluster of free grid points is defined differently depending on the size of the ORFs (e.g., compare H O and benzene) and the spacing of the grid.
  • a “cluster of ORF binding sites” typically refers to a pattern of closely located or superimposed sites that bind ORFs with sufficient affinity to merit further consideration.
  • Collection of chemical compounds refers to any collection of compounds collected or organized with the intention that they can be examined to identify bioactive agents (e.g., having a biological activity measured directly or through a surrogate for biological activity such as binding to a macromolecule or interfering with a function of a macromolecule).
  • the collection can be prepared from a collection of simpler molecules (which can be bound to a support) by a chemical scheme designed to generate a diversity of chemicals. Collections of this latter type are often referred to as "combinatorial libraries.”
  • Free grid points refers to grid points (which are discussed below) which are, for a given accepted definition of atomic radius, “free” in that they do not fall within the atomic radii of the mapped atoms of the relevant macromolecule.
  • “Macromolecule” refers to a molecule or collection of molecules which has a time-averaged tertiary structure. Thus, while the term typically refers to proteins, ribonucleic acids, structures formed of both nucleic acid and protein, carbohydrates, structures formed of two or more of the aforementioned, and the like, it can also refer to structures formed with other molecules including lipids. Macromolecules are used in the method described herein with reference to maps of their tertiary structure.
  • maps are typically generated by X-ray diffraction studies, which have generated maps for thousands of macromolecules.
  • maps can be produced by other methods such as computational methods or computational methods supplemented by other data such as NMR data. While computational methods have been difficult to apply, recent studies appear to have achieved some successes.
  • Organic fragments or “ORFs” are molecules or molecular fragments that can be used to model one or more modes of interaction with a macromolecule, such as the interactions of carbonyls, hydroxyls, amides, hydrocarbons, and the like.
  • Water locations are locations at which water is strongly bound, meaning, in one embodiment, for example locations where the simulation indicates water remains bound when the simulation is run at values of B that are equal to or less than the B value for the transition point indicating those water molecules that are strongly influenced by the macromolecule. Illustrated in Figure 7 is a conceptualization of the titration of simulated bound water molecules with decreasing values of B, a parameter described further below.
  • a transition point indicates water molecules that are strongly influenced by the macromolecule.
  • a B value less than or equal to that at the transition point can be designated as defining water binding of sufficient strength to render competitive binding by another molecule unlikely, as illustrated by point SB in the illustration.
  • this point SB is selected so that about 100 to about 50 water molecules remain bound for a 50 kd protein.
  • the simulation process of the present invention works by artificially inserting a given ORF at an unbiased sampling of all the sites on or within a macromolecule structure where such ORF can, as a practical matter, reside. These sites can be termed the "sampling sites.”
  • a schedule of simulations for each of a number of ORFs are run, with each simulation run at a separate value of a parameter B, which is related to the excess chemical potential.
  • the schedule provides for simulations conducted at each of a number of B values, typically ranging from 10 to about -15.
  • the simulation assesses at each step of the simulation whether the insertion of the ORF at a given site shall be accepted or rejected, with the assessment based on a grand canonical ensemble probability density function.
  • the algorithm models the insertion of the ORF at the site.
  • a forced bias canonical probability density function is used to translate and rotate the ORF in small steps (e.g., ⁇ 0.2A, ⁇ 30°) to identify an energy minimized insertion given the simulation parameters in place at the time of the simulation step.
  • the probability of the insertion is then determined from the grand canonical ensemble probability density function, and the ORF can be represented as resident at the site by a random number generating protocol weighed to the probability value.
  • an occupation probability of the ORF residing at that sample site at the given value of B can be assessed.
  • the occupancy as an overall result of the method can then be determined based on this probability, for example with a random number protocol making the representation based on its probability.
  • the degree to which the ORF is translated or rotated can also be represented based on the probability of such translations and rotations.
  • FIG. 1 illustrates a solved crystal structure (Figure 1A) on which a grid is imposed ( Figure IB).
  • the grid can have about 1 A to about 1A spacing, with the grid intersection points defining the candidates for sampling sites.
  • the spacing of the grid is preferably selected to be less than the smallest cross-section of the ORF.
  • the spacing is typically selected to be small enough in relation to the size of the ORF so that the probability that free volumes that could define free grid point clusters have sufficient free grid points to allow useful sampling as described below. Such relatively small spacing minimizes the chance that the selection of how to orient the grid will bias the algorithm against identifying certain ORF binding preferences.
  • the sampling sites are selected from sites that are unoccupied by the macromolecule (Figure IB). A final elimination of "grid bias" is achieved by varying the test insertion points away from strict initial insertion at grid points, as described below.
  • the sampling sites are limited to those sites having enough adjacent volume free of the macromolecule to allow the ORF to be inserted.
  • the grid points can be selected for those free grid points that are within a cluster of free grid points, such as, for example, a cluster of 3, 4, 5, 6, 7, 8 or more free grid points, depending on the size of the ORF and the spacings of the grid.
  • the ORF is not necessarily initially inserted exactly at the grid points, but instead at a random sampling of insertion points within a short distance of the grid points, such as points within a sphere shape centered at the grid point and having a diameter of about some percentage, such as 10%, of the grid spacing, or within a box shape centered at the grid point having width, length and height of about such a percentage of the grid distance.
  • this "wobble" in the initial insertion point helps eliminate grid bias where the placement of the grid happens to reduce the chance that a given open volume will be efficiently sampled.
  • Figure 2C shows the final solution using a lower intermediate value, e.g., B - 0 or -2 or -4.
  • the next step of the process is to conduct simulations with additional ORFs and identify clusters of relatively high affinity ORF binding sites.
  • simulations can be conducted to determine binding for ORFs for ammonia, methanol, ketone and amide.
  • Combined results at relatively low B values are illustrated in Figure 3 A.
  • Clusters of ORF binding sites are identified in Figure 3B.
  • the method of the present invention seeks to identify clusters of ORF binding sites, where the clusters can be made up solely of one type of ORF. Preferably, however, the cluster will include binding sites for 2, 3, 4, 5, 6, 7 or more distinct ORFs. Examples of useful ORFs include:
  • the ORFs selected are representative of chemical features that have proven useful in the design of pharmaceuticals or other bioactive chemicals.
  • an important part of the process is to run the simulations with several ORFs, identifying clusters of sites that bind multiple ORFs with relatively high affinity. These clusters are strong candidate sites for ligand binding sites. Moreover, the relative positioning of the ORFs is instructive of the features of good binding agents. For example, at the binding site identified on elastase by the methods described below, a cluster having two benzene rings with an amide interposed between them models some of the strongest elastase inhibitors derived from an extensive research program, which inhibitors have a sulfonamide in place of the carbon- based amide of the simulation. See, Tables XXIII and XXV of Edwards et al, Med. Res. Rev. .74:127-194 (1994).
  • clusters of ORF binding sites alone will identify, or substantially narrow the range of choices for, the sites at which ligands interact with a given protein.
  • the sites that bind water strongly are identified, and the clusters that intersect with strong water binding sites are discounted.
  • the candidate ligand binding sites of Figure 3B are narrowed by excluding water binding sites, as illustrated in Figure 3C. If the analysis is extended to five ORFs as illustrated in Figure 3D, a single candidate site remains.
  • Figure 3E shows a slightly different perspective of the same site illustrated in Figure 3D, with the analysis extended to six ORFs.
  • Figure 3F shows how well the candidate site (left panel) matches up with the structure of a co-crystal containing the ligand trifluoroacetyl-lysyl-prolyl-j!?- isopropylanilide.
  • an optional step in the process is to narrow the choices for ligand binding sites by excluding ORF clusters that intersect with relatively strong water binding sites.
  • clusters of ORFs are typically identified at relatively low B values, thereby helping to identify prospective binding sites for ligands.
  • further information about prospective binding sites can be gleaned by looking, in the vicinity of a prospective binding site, at more weakly binding ORFs. This information value flows from the prospect of more weakly binding ORFs modeling a ligand interaction which, while weak in isolation, models a real contribution to ligand binding affinity of a bioactive agent as a whole.
  • Illustrated in Figure 4A are the amide binding sites extracted from the data of six co-crystallization experiments with elastase and known ligands.
  • Illustrated in Figure 4B is a cluster of the highest affinity amide binding sites determined by simulation.
  • Illustrated in Figure 4C are the amide ORFs of Figure 4B plus amides which are in the vicinity of the cluster but which appear in the simulation at the second highest affinity values. As illustrated, this last step of expanding the results by looking at neighboring lower affinity ORF binding sites helps to better model the results seen in co- crystallography.
  • the cluster results identify the site at which the majority of amide, binding sites are seen in crystallography, but the expansion extends the results to another cleft in elastase where amides have been experimentally located. Additionally, the expansion identifies part of another cleft at which ligand interactions are seen (as will be illustrated in ' other Figures).
  • the features of ligand binding sites indicated by other modes of analysis are expanded upon by looking to less stringent simulation results in the vicinity of ORF clusters.
  • the above illustration focused on a cluster of one type of ORF, but is applicable with clusters of many types of ORFs, where the expansions can be limited to one type of ORF or multiple types of ORFs.
  • inhibitors are trifluoroacetyl-1-lysyl-l-prolyl- p-isopropylanilide (crystal solution: Mattos et al, as submitted April 30, 1994), trifluoroacetyl-1-lysyl-l-leucyl-p-isopropylanilide (crystal solution: Mattos et al, as submitted June 22, 1994), trifluoroacetyl-1-phenylalanyl- p-isopropylanilide (crystal solution: Mattos et al, as submitted April 30, 1994), trifluoroacetyl-1-phenylalanyl-l-alanyl-p-trifluoromethylanilide (crystal solution: Mattos et al, as submitted February 14, 1995), trifluoroacetyl-1- valyl-1-alanyl-p-trifluoromethylanilide (crystal solution: Mattos et al., as submitted February 14,
  • Figures 6A and 6B show the regions of elastase involved in binding ligands as indicated by the crystallographic data, Figure 6A, and as indicated by the solutions obtained from the computational method described herein, Figure 6B.
  • the simulations of the invention utilize a Monte Carlo algorithm.
  • the form of Monte Carlo simulation useful in the present invention is described in Frenkel and Smit, Understanding Molecular Simulation: From Algorithms to Applications, Academic Press, New York (1996).
  • the simulation method can comprise: Locate a numeric representation of the macromolecule in a periodic cell.
  • the distinguishing feature of simulations in the grand-canonical ensemble is the change in the number of molecules (ORFs) in the system during the simulation.
  • ORFs number of molecules
  • the sampling is not restricted to the configuration space of a given dimension but it has to be extended to a set of configuration spaces.
  • Applicant has found, unexpectedly, that the complexity of allowing for these changing numbers of molecules and the resulting changing mass nonetheless makes the simulation computationally extremely more efficient.
  • the change in the number of molecules corresponds to the fact that the grand-canonical partition function ⁇ is the linear combination of the corresponding canonical partition functions of a different number, N, of molecules, Q:
  • T is the absolute temperature
  • is the chemical potential
  • k is the
  • This method has been found useful in simulating atomic fluids at moderate densities but runs into difficulties when room-temperature liquids are simulated.
  • the difficulty stems from the fact that most insertion attempts will be at positions where there already is a molecule (e.g., from the solved protein structure) resulting in a large ⁇ E, and the resulting probable rejection of the attempt.
  • PTM V V represents the volume of the regions of the system that contain cavities of suitable size.
  • the efficiency of the cavity- biased method follows from the fact that the algorithm searching for cavities also yields P ⁇ av without extra steps.
  • Calculations on a variety of fluids (water, benzene), which define ORFs, have confirmed that the cavity biased method significantly increases the efficiency of insertion attempts and allows modeling of densities that proved to be impractical without this improvement.
  • This text illustrates how the method of simulated annealing of chemical potential allows bulk waters to be distinguished from bound waters, and how differentially bound waters may be distinguished from each other based on their relative chemical potentials. This is illustrated by showing that it takes more free energy to desolvate the minor groove than the major groove of a charged DNA dodecamer.
  • Grand canonical ensemble simulations are generally performed by placing a molecule in a periodic simulation cell, setting a parameter B, which is representative of free energy, in such a way as to achieve an experimentally determined density, sampling potential hydration positions around the molecule by inserting and deleting water molecules from the simulation cell using a technique such as cavity-bias (Mezei, Mol Phys. 61:565-582 (1994); Resat and Mezei, J Am. Chem. Soc. 116:7451-7452 (1994)), and accepting or rejecting the attempt based on a Metropolis Monte Carlo (Metropolis et al, J. Chem. Phys.
  • the simulation is started with a large initial _ ⁇ -value so that a higher percentage of water insertion attempts are accepted. This causes the simulation cell to be flooded with water molecules.
  • a simulated annealing of chemical potential on a d(CGCGAATTCGCG) 2 was performed, starting with 5 - 1.0 down to -26 in 37 increments performing 2,000,000 cavity-biased grand canonical ensemble Monte Carlo steps at each 5-value.
  • the final configuration of the simulation with B — -6 has 1120 bound water molecules.
  • the final configuration of the simulation with B -8, has bound 533 water molecules.
  • the final configuration of the simulation with B — ll has 215 bound water molecules.
  • the most salient feature of this progression is the differential hydration of the major and minor groove of the DNA.
  • the first hydration shell (defined by the position of the first minimum of the radial distribution -function) of the major and minor groove has a comparable density (0.012 and 0.013, respectively), while the second hydration shell of the minor groove has twice the density of the major groove.
  • B — -8 the hydration difference becomes quite pronounced with the minor groove first and second shell hydration density being 2.5 fold and 5 fold higher than the major groove, respectively.
  • B -11 the major and minor groove hydration density again becomes equal because at this value of the excess chemical potential both grooves are essentially stripped bare.
  • Illustrating the differential hydration propensities of the major and minor grooves of DNA is computationally undemanding (3 days of CPU time to run one annealing schedule and 3 days of CPU time to run one proximity analysis (Calculations of volume elements can be CPU intensive. The effects of volume element calculations on proximity analysis are described in Mezei and Beveridge in Methods in Enzymology, Packer, ed., Academic Press, New York (1986), pp. 21-47) on an SGI Power Challenge) using simulated annealing of chemical potential because only a coarse "cooling" schedule of the chemical potential is required. Since the chemical potential is a free energy, a very fine cooling schedule may be used to estimate quantitatively the hydration free energy difference of two different functional groups or even two different atoms of the DNA. Two atoms that desolvate at the same
  • Antagonists and Agonists Assays and Molecules
  • Candidate bioactive agents identified by the methods of the invention can be tested to assess their binding to the macromolecule in question. Where the macromolecules are responsible for many biological functions, including disease states, it is therefore desirable to devise screening methods to identify compounds which stimulate or which inhibit the function of the macromolecule. Accordingly, in a further aspect, the present invention provides for a method of screening compounds to identify those which stimulate or which inhibit the function of such a macromolecule.
  • agonists or antagonists can be employed for therapeutic and prophylactic purposes for diseases.
  • Compounds can be identified from a variety of sources, for example, cells, cell-free preparations, chemical libraries, and natural product mixtures.
  • the screening methods can simply measure the binding of a candidate compound to the macromolecule, or to cells or membranes bearing the macromolecule.
  • the macromolecule can be a variant of the macromolecule used in the simulation method, such as a fragment retaining the binding site identified in the simulation or a fusion protein used to make recombinant synthetic methods more practical.
  • the screening method can involve competition with a labeled competitor. Further, these screening methods can test whether the candidate compound results in a signal generated by activation or inhibition of the macromolecule, using detection systems appropriate to the cells comprising the macromolecule. Inhibitors of activation are generally assayed in the presence of a known agonist and the effect on activation by the agonist by the presence of the candidate compound is observed. Further, the screening methods can simply comprise the steps of mixing a candidate compound with a solution containing a macromolecule, measuring macromolecule activity in the mixture, and comparing the activity of the mixture to a standard.
  • the invention also provides a method of screening compounds to identify those which enhance (agonist) or block (antagonist) the action of macromolecules, including association of the macromolecule with itself or another macromolecule.
  • the method of screening can involve high- throughput techniques.
  • a synthetic reaction mix for agonists or antagonists, a synthetic reaction mix, a cellular compartment, such as a membrane, cell envelope or cell wall, or a preparation of any thereof, comprising macromolecule and a labeled substrate or ligand of such polypeptide is incubated in the absence or the presence of a candidate molecule that can be an agonist or antagonist.
  • the ability of the candidate molecule to agonize or antagonize the macromolecule is reflected in decreased binding of the labeled ligand or decreased production of product from a substrate.
  • Molecules that bind gratuitously, i.e., without inducing the effects of macromolecule are most likely to be good antagonists.
  • Molecules that bind well and, as the case can be, increase for example the rate of product production from substrate, increase signal transduction, or increase chemical channel activity are agonists.
  • Detection of the rate or level of, as the case can be, production of product from substrate, signal transduction, or chemical chamiel activity can be enhanced by using a reporter system.
  • Reporter systems that can be useful in this regard include but are not limited to colorimetric, labeled substrate converted into product, a reporter gene that is responsive to changes in macromolecule activity, and binding assays known in the art.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Physics & Mathematics (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Biophysics (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Medicinal Chemistry (AREA)
  • Peptides Or Proteins (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

L'invention concerne un procédé informatique permettant d'analyser une macromolécule à la recherche de sites de liaison potentiels, qui consiste à placer une instance d'une représentation informatique d'un fragment organique à une pluralité de sites de liaison potentiels de la macromolécule ; à choisir une valeur de B, B étant = ν'/kT + ln<N> ν représentant le potentiel chimique excédentaire, k la constante de Boltzmann, T la température absolue et <N> le nombre moyen de molécules du fragment organique ; à replacer les instances du fragment organique jusqu'à l'obtention d'un état d'énergie minimisé ; à évaluer, pour chaque instance du fragment organique replacé, si ledit fragment se lie à la macromolécule au niveau du site de liaison potentiel associé à la valeur choisie de B ; à supprimer les instances du fragment organique qui ne se lient pas au niveau du site de liaison potentiel associé à la valeur choisie de B ; à répéter ces étapes avec une valeur de B moindre ; et à sortir une liste des instances non supprimées du fragment organique ; pour autant que le fragment organique ne soit pas de l'eau.
EP03719372A 2003-03-11 2003-03-11 Test d'elaboration informatique de proteines pour identifier des sites de liaison Withdrawn EP1604319A4 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP07075671A EP1912145A3 (fr) 2003-03-11 2003-03-11 Sondage informatique d'une protéine afin d'identifier les sites de liaison

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2003/007366 WO2004081841A1 (fr) 1998-09-23 2003-03-11 Test d'elaboration informatique de proteines pour identifier des sites de liaison

Related Child Applications (1)

Application Number Title Priority Date Filing Date
EP07075671A Division EP1912145A3 (fr) 2003-03-11 2003-03-11 Sondage informatique d'une protéine afin d'identifier les sites de liaison

Publications (2)

Publication Number Publication Date
EP1604319A1 true EP1604319A1 (fr) 2005-12-14
EP1604319A4 EP1604319A4 (fr) 2006-08-02

Family

ID=34392751

Family Applications (1)

Application Number Title Priority Date Filing Date
EP03719372A Withdrawn EP1604319A4 (fr) 2003-03-11 2003-03-11 Test d'elaboration informatique de proteines pour identifier des sites de liaison

Country Status (4)

Country Link
EP (1) EP1604319A4 (fr)
JP (1) JP2006519976A (fr)
AU (1) AU2003223242A1 (fr)
CA (1) CA2517233A1 (fr)

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6735530B1 (en) * 1998-09-23 2004-05-11 Sarnoff Corporation Computational protein probing to identify binding sites

Also Published As

Publication number Publication date
CA2517233A1 (fr) 2004-09-23
EP1604319A4 (fr) 2006-08-02
JP2006519976A (ja) 2006-08-31
AU2003223242A1 (en) 2004-09-30

Similar Documents

Publication Publication Date Title
US6735530B1 (en) Computational protein probing to identify binding sites
Kihara et al. The PDB is a covering set of small protein structures
Zhang Protein structure prediction: when is it useful?
Rapp et al. Prediction of loop geometries using a generalized born model of solvation effects
Pettit et al. Hotpatch: a statistical a pproach to finding biologically relevant features on protein surfaces
Dickson et al. Kinetics of ligand binding through advanced computational approaches: a review
US20040083060A1 (en) Method of operating a computer system to perform a discrete substructural analysis
Melquiond et al. Next challenges in protein–protein docking: from proteome to interactome and beyond
US9218460B2 (en) Defining and mining a joint pharmacophoric space through geometric features
Liu et al. A comparison of topologically associating domain callers based on Hi-C data
Ezzell Beyond the human genome
JP2003524831A (ja) 組み合わせ空間を探索するためのシステムおよび方法
WO2002065119A9 (fr) Procede de prediction d&#39;interactions moleculaires dans des reseaux
Lauria et al. Drugs polypharmacology by in silico methods: new opportunities in drug discovery
Lobanov et al. Stochastic similarity selections from large combinatorial libraries
US20040267456A1 (en) Method and computer program product for drug discovery using weighted grand canonical metropolis Monte Carlo sampling
WO2005114458A1 (fr) Sondage informatique de proteines pour identifier des sites de liaison
EP1604319A1 (fr) Test d&#39;elaboration informatique de proteines pour identifier des sites de liaison
EP1912145A2 (fr) Sondage informatique d&#39;une protéine afin d&#39;identifier les sites de liaison
Marti-Renom et al. Structure comparison and alignment
Mann et al. Classifying proteinlike sequences in arbitrary lattice protein models using LatPack
Raman et al. Prediction report
Lepp et al. Finding key members in compound libraries by analyzing networks of molecules assembled by structural similarity
US20040267509A1 (en) Method and computer program product for drug discovery using weighted Grand Canonical Metropolis Monte Carlo sampling
Kuziemko et al. Using structure to explore the sequence alignment space of remote homologs

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20051006

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL LT LV MK

DAX Request for extension of the european patent (deleted)
A4 Supplementary search report drawn up and despatched

Effective date: 20060630

17Q First examination report despatched

Effective date: 20080915

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20090127