WO2001049721A2 - Bacterial genes and proteins that are essential for cell viability and their uses - Google Patents
Bacterial genes and proteins that are essential for cell viability and their uses Download PDFInfo
- Publication number
- WO2001049721A2 WO2001049721A2 PCT/US2000/035604 US0035604W WO0149721A2 WO 2001049721 A2 WO2001049721 A2 WO 2001049721A2 US 0035604 W US0035604 W US 0035604W WO 0149721 A2 WO0149721 A2 WO 0149721A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- polypeptide
- nucleic acid
- ceg
- ligand
- acid molecule
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- SZAYJIYHBBUQLV-YLGXPZSGSA-N CC[n]1ncc2c1ncc1c2c(NCC/C=C\C(\O)=C(/C)\Cl)nnc1NCCCN1CCN(C)CC1 Chemical compound CC[n]1ncc2c1ncc1c2c(NCC/C=C\C(\O)=C(/C)\Cl)nnc1NCCCN1CCN(C)CC1 SZAYJIYHBBUQLV-YLGXPZSGSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07D—HETEROCYCLIC COMPOUNDS
- C07D231/00—Heterocyclic compounds containing 1,2-diazole or hydrogenated 1,2-diazole rings
- C07D231/54—Heterocyclic compounds containing 1,2-diazole or hydrogenated 1,2-diazole rings condensed with carbocyclic rings or ring systems
- C07D231/56—Benzopyrazoles; Hydrogenated benzopyrazoles
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07C—ACYCLIC OR CARBOCYCLIC COMPOUNDS
- C07C279/00—Derivatives of guanidine, i.e. compounds containing the group, the singly-bound nitrogen atoms not being part of nitro or nitroso groups
- C07C279/20—Derivatives of guanidine, i.e. compounds containing the group, the singly-bound nitrogen atoms not being part of nitro or nitroso groups containing any of the groups, X being a hetero atom, Y being any atom, e.g. acylguanidines
- C07C279/24—Y being a hetero atom
- C07C279/26—X and Y being nitrogen atoms, i.e. biguanides
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07C—ACYCLIC OR CARBOCYCLIC COMPOUNDS
- C07C311/00—Amides of sulfonic acids, i.e. compounds having singly-bound oxygen atoms of sulfo groups replaced by nitrogen atoms, not being part of nitro or nitroso groups
- C07C311/15—Sulfonamides having sulfur atoms of sulfonamide groups bound to carbon atoms of six-membered aromatic rings
- C07C311/16—Sulfonamides having sulfur atoms of sulfonamide groups bound to carbon atoms of six-membered aromatic rings having the nitrogen atom of at least one of the sulfonamide groups bound to hydrogen atoms or to an acyclic carbon atom
- C07C311/19—Sulfonamides having sulfur atoms of sulfonamide groups bound to carbon atoms of six-membered aromatic rings having the nitrogen atom of at least one of the sulfonamide groups bound to hydrogen atoms or to an acyclic carbon atom to an acyclic carbon atom of a hydrocarbon radical substituted by carboxyl groups
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07D—HETEROCYCLIC COMPOUNDS
- C07D209/00—Heterocyclic compounds containing five-membered rings, condensed with other rings, with one nitrogen atom as the only ring hetero atom
- C07D209/02—Heterocyclic compounds containing five-membered rings, condensed with other rings, with one nitrogen atom as the only ring hetero atom condensed with one carbocyclic ring
- C07D209/04—Indoles; Hydrogenated indoles
- C07D209/08—Indoles; Hydrogenated indoles with only hydrogen atoms or radicals containing only hydrogen and carbon atoms, directly attached to carbon atoms of the hetero ring
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07D—HETEROCYCLIC COMPOUNDS
- C07D211/00—Heterocyclic compounds containing hydrogenated pyridine rings, not condensed with other rings
- C07D211/04—Heterocyclic compounds containing hydrogenated pyridine rings, not condensed with other rings with only hydrogen or carbon atoms directly attached to the ring nitrogen atom
- C07D211/06—Heterocyclic compounds containing hydrogenated pyridine rings, not condensed with other rings with only hydrogen or carbon atoms directly attached to the ring nitrogen atom having no double bonds between ring members or between ring members and non-ring members
- C07D211/36—Heterocyclic compounds containing hydrogenated pyridine rings, not condensed with other rings with only hydrogen or carbon atoms directly attached to the ring nitrogen atom having no double bonds between ring members or between ring members and non-ring members with hetero atoms or with carbon atoms having three bonds to hetero atoms with at the most one bond to halogen, e.g. ester or nitrile radicals, directly attached to ring carbon atoms
- C07D211/56—Nitrogen atoms
- C07D211/58—Nitrogen atoms attached in position 4
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07D—HETEROCYCLIC COMPOUNDS
- C07D239/00—Heterocyclic compounds containing 1,3-diazine or hydrogenated 1,3-diazine rings
- C07D239/02—Heterocyclic compounds containing 1,3-diazine or hydrogenated 1,3-diazine rings not condensed with other rings
- C07D239/24—Heterocyclic compounds containing 1,3-diazine or hydrogenated 1,3-diazine rings not condensed with other rings having three or more double bonds between ring members or between ring members and non-ring members
- C07D239/28—Heterocyclic compounds containing 1,3-diazine or hydrogenated 1,3-diazine rings not condensed with other rings having three or more double bonds between ring members or between ring members and non-ring members with hetero atoms or with carbon atoms having three bonds to hetero atoms with at the most one bond to halogen, directly attached to ring carbon atoms
- C07D239/46—Two or more oxygen, sulphur or nitrogen atoms
- C07D239/48—Two nitrogen atoms
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07D—HETEROCYCLIC COMPOUNDS
- C07D471/00—Heterocyclic compounds containing nitrogen atoms as the only ring hetero atoms in the condensed system, at least one ring being a six-membered ring with one nitrogen atom, not provided for by groups C07D451/00 - C07D463/00
- C07D471/12—Heterocyclic compounds containing nitrogen atoms as the only ring hetero atoms in the condensed system, at least one ring being a six-membered ring with one nitrogen atom, not provided for by groups C07D451/00 - C07D463/00 in which the condensed system contains three hetero rings
- C07D471/14—Ortho-condensed systems
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/315—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Streptococcus (G), e.g. Enterococci
- C07K14/3156—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Streptococcus (G), e.g. Enterococci from Streptococcus pneumoniae (Pneumococcus)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/02—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving viable microorganisms
- C12Q1/18—Testing for antimicrobial activity of a material
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
Definitions
- the present invention relates generally to nucleotide sequences, and polypeptides encoded by the sequences, that are essential for bacterial viability, and to methods of using the nucleotide and polypeptide sequences.
- Bacterial genera such as Streptococcus, Staphylococcus, Pseudomonas, Yersinia, Salmonella, and Enterobacter, are the cause of numerous afflictions in humans and animals. Bacterial infection can lead to serious health conditions, including pneumonia, osteomyelitis, meningitis, sinusitis, otitis, cystitis, and even food poisoning. Typically, these infections can be treated with standard antimicrobial agents such as antibiotics. However, the emergence of pathogenic bacterial strains that are resistant to antibiotics has risen alarmingly in the past two decades. This situation has created an urgent need for the development of new antimicrobial agents.
- One strategy for developing new antimicrobial agents is to identify bacterial gene sequences that encode gene products that are essential for bacterial cell viability and develop and/or identify agents which inhibit the function of the gene product.
- DNA sequencing technology has advanced from sequencing one gene at a time to sequencing entire genomes, the sum of all genes in an organism. With the recent arrival of bacterial genomic information, it is now possible to compare multiple bacterial genomes in an attempt to identify genes that encode conserved gene products. In this manner, one skilled in the art may identify a set of conserved bacterial genes, including a subset of genes that are essential for bacterial cell viability. The essential gene is then used as a starting point to develop therapeutic agents that inhibit or inactivate the product of the essential gene.
- genes that encode gene products that are essential for cell viability such as cell replication, growth, and survival.
- These genes and their encoded gene products can be used as a starting point towards identifying agents that inhibit functions essential for cell viability, thereby causing bacterial cell stasis or death (e.g., antibacterial agents).
- the present invention provides experimental identification of novel, conserved essential genes (ceg) from bacteria and their encoded protein products.
- ceg genes are considered essential to cell viability because disruption of an endogenous ceg gene results in lethality of a bacterial cell (e.g., as determined by failure to recover viable chloramphenicol-resistant colonies, as described herein).
- the gene products encoded by these genes are potentially valuable targets for chemotherapeutic intervention of bacterial infections .
- ceg nucleotide sequences of the invention were obtained by large-scale computational comparisons of multiple genome sequences to identify conserved protein coding regions, followed by gene disruption to identify cegs.
- the conservation of protein sequences in many cases is believed to reflect the higher level conservation of common biochemical pathways essential for bacterial function and viability.
- CEG CEG and CEG stand for Conserved Essential Gene.
- ceg refers herein to ceg nucleotide sequences.
- CEG refers herein to CEG polypeptide sequences.
- Embodiments of the ceg nucleotide sequences and the CEG polypeptide sequences are designated CFEs which stands for CEG For Expression.
- the CFEs are polypeptides resulting from expression of the ceg nucleotide sequence.
- the _present invention provides isolated nucleotide sequences of conserved essential genes from bacteria, designated ceg.
- the invention also provides recombinant nucleic acid molecules including the ceg sequences of the invention, and methods of uses thereof.
- nucleic acid molecules having ceg sequences are described in SEQ ID NOS.: 1-113.
- the invention further provides isolated polypeptides and recombinant polypeptides having the CEG sequences of the invention, and methods of uses thereof.
- polypeptides having CEG sequences are described in SEQ ID NOS. : 114- 226.
- the ceg sequences of the present invention are DNA or RNA. Further, the invention includes nucleic acid molecules that are identical or nearly identical (e.g., similar) with the ceg sequences of the invention. The invention additionally provides polynucleotide sequences that hybridize under stringent conditions to the ceg sequences of the invention. A forther embodiment provides polynucleotide sequences which are complementary to the ceg sequences of the invention. Yet another embodiment provides ceg nucleic acid molecules that are labeled with a detectable marker. Another embodiment provides recombinant nucleic acid molecules, such as a vector or a fusion molecule, including the ceg sequences of the invention.
- the present invention provides various ceg sequences, fragments thereof having essential gene activity, and related molecules such as antisense molecules, oligonucleotides, peptide nucleic acids (PNA), fragments, and portions thereof.
- antisense molecules oligonucleotides
- PNA peptide nucleic acids
- the present invention relates to the inclusion of the polynucleotides encoding CEG gene products, such as CEG polypeptides, in an expression vector which can be used to transform host cells or organisms.
- CEG gene products such as CEG polypeptides
- Such transgenic hosts are useful for the production of CEG gene products for the development of antibacterial agents such as antibiotics.
- the invention further provides substantially purified CEG gene products, and uses thereof.
- the invention also relates to pharmaceutical compositions comprising antisense molecules capable of disrupting expression of ceg sequences, agonists, antagonists or inhibitors of CEG gene products, and antibodies reactive against the CEG polypeptides. These compositions are useful for preventing the growth or survival of bacteria, for example, in the treatment of conditions associated with bacterial infections.
- Figure 1 A schematic representation of the gene disruption assay, as described in Example 3, infra.
- A) A recombinant vector undergoing homologous recombination with the host genome.
- B) The result of homologous recombination.
- Figure 2 A schematic representation of the polarity test for operons, as described in Examples 2 and 3, infra.
- FIG. 3 Purification of 2CFE 75, as described in Example 6, infra.
- Figure 4 Fractionation profile of 2CFE 3 eluted from a hydroxyapatite column, as described in Example 7, infra.
- Figure 5 The biosynthesis pathway of Coenzyme A which starts with phosphorylation of pantothenate.
- Figure 6 Circular dichroism spectra of 2CFE 101 and 103, as described in Example 10, infra.
- Figure 7 Circular dichroism spectra of aggregate and monomer pools of 2CFE 101 and 103, as described in Example 10, infra.
- Figure 8 Absorbance spectra of pantothenate-dependent production of ADP, as described in Example 10, infra. -
- Figure 9 The results of size exclusion chromatography and gel electrophoresis showing the oligomeric forms of 2CFE 21 and 39, as described in Example 11, infra. Lanes 1-6 contain 2CFE 21, lane 7 is a molecular weight marker, lanes 8-10 contain 2CFE 39.
- Figure 10 Gel electrophoresis of a helicase reaction using 2CFE 21 and 39 and radiolabeled synthetic HoUiday Junction template, as described in Example 11, infra.
- Lane 1 contains the synthetic HoUiday Junction template
- lane 2 contains the synthetic duplex
- lane 3 contains a single-stranded template
- lane 4 contains the helicase reaction using 2CFE 39
- lane 5 contains the helicase reaction using 2CFE 21
- lanes 6-8 contain the helicase reaction using 2CFE 39 and 21 at varying concentrations (e.g., 1, 2, and 3 ⁇ M each)
- lane 9 contains the helicase reaction using 2 ⁇ M each 2CFE 39 and 21 in the presence of ethidium bromide.
- Figure 11 A graph depicting the results of the helicase reaction which were monitored by measuring the unquenching of the HoUiday Junction templates with time, as described in Example 1.1, infra.
- Figure 12 Capillary electrophoresis results of 2CFE 8 with and without ssDNA, as described in Example 12, infra.
- Figure 13 Gel mobility shift assay of 2CFE 8, and 2CFE 8 in the presence of a single- stranded 32-mer, as described in Example 12, infra.
- Figure 14 The N-acetyl glucosamine pathway putatiyely mediated by 2CFE 3 and 2CFE 86, as described in Example 13, infra.
- Figure 15 Capillary electrophoresis results of 2CFE 3 with and without putative substrates, as described in Example 13, infra..
- Figure 16 Capillary electrophoresis results of FITC-derivitized 2CFE 3 polypeptide with and without D-glucosamine-6-phosphate (substrate) to produce the product D-glucosamine- 1 -phosphate, using laser-induced fluorescence, as described in Example 13, infra. Electropherogram of D-glucosamine-6-phosphate (putative substrate), 2CFE 3 reacted with D-glucosamine-6-phosphate, and the product glucosamine- 1-phhosphate.
- Figure 17 Gel electrophoresis of 2CFE 86 eluted from an Ni-NTA column, as described in Example 13, infra.
- Figure 18 HPLC analysis of a coupled reaction including 2CFE 3, 2CFE 86, and D- glucosamine-6-phosphate to produce the product, UDP-N-acetylglucosamine-1 -phosphate (UDPAG), as described in Example 13, infra.
- 2CFE 3 2CFE 86
- D- glucosamine-6-phosphate D- glucosamine-6-phosphate
- Figure 19 A fatty acid biosynthesis pathway.
- Figure 20 Size exclusion chromatography to determine the molecular weight and oligomeric form of 2CFE 34, as described in Example 14, infra.. Selected eluted samples were sized by gel electrophoresis.
- Figure 21 Gel electrophoresis of 2CFE 41 eluted from a Ni-NTA column, as described in Example 15, infra.
- Figure 22 Capillary electrophoresis results of 2CFE 40, 41, and 46, as described in Example 15, infra.
- Figure 23 Depicts a schematic diagram of a ligand which binds 2CFE 34.
- the ligand is 2- phenyl-N-(3 corboxyl-4hydroxyphenyl) azabicyclo [4.3.0] riona-2, 8-diene.
- Figure 24 Depicts a schematic diagram of a ligand which binds 2CFE 43.
- the ligand is N- (3, 5-dinitrobenzyl)-7-trifluoromethyl benza diaza furanolactone.
- Figure 25 Depicts a schematic diagram of a ligand which binds 2CFE 43.
- the ligand is 2- amino (N-para-methylphenyl sulfonamide)-3-phenylpropianic acid.
- Figure 26 A nucleic acid sequence of 2CFE1 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 27 A nucleic acid sequence of 2CFE2 deposited with the American Type Culture
- Figure 28 A nucleic acid sequence of 2CFE3 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 29 A nucleic acid sequence of 2CFE4 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 30 A nucleic acid sequence of 2CFE5 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 31 A nucleic acid sequence of 2CFE6 deposited with the American Type Culture Collection as ATCC designation ' on December 20, 2000.
- Figure 32 A nucleic acid sequence of 2CFE7 deposited with the American Type Culture
- Figure 33 A nucleic acid sequence of 2CFE8 deposited with the American Type Culture Collection as ATCC designation . on December 20, 2000.
- Figure 34 A nucleic acid sequence of 2CFE9 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 35 A nucleic acid sequence of 2CFE10 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 36 A nucleic acid sequence of 2CFE11 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 37 A nucleic acid sequence of 2CFE12 deposited with the American Type Culture
- Figure 38 A nucleic acid sequence of 2CFE13 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 39 A nucleic acid sequence of 2CFE14 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 40 A nucleic acid sequence of 2CFE15 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 41 A nucleic acid sequence of 2CFE16 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 42 A nucleic acid sequence of 2CFE17 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 43 A nucleic acid sequence of 2CFE19 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 44 A nucleic acid sequence of 2CFE21 deposited with the American Type Culture
- Figure 45 A nucleic acid sequence of 2CFE24 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 46 A nucleic acid sequence of 2CFE25 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 47 A nucleic acid sequence of 2CFE26 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 48 A nucleic acid sequence of 2CFE27 deposited with the American Type Culture Collection as ATCC designation ' on December 20, 2000.
- Figure 49 A nucleic acid sequence of 2CFE28 deposited with the American Type Culture
- Figure 50 A nucleic acid sequence of 2CFE29 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 51 A nucleic acid sequence of 2CFE30 deposited with the American Type Culture Collection as ATCC designation * on December 20, 2000.
- Figure 52 A nucleic acid sequence of 2CFE31 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 53 A nucleic acid sequence of 2CFE32 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 54 A nucleic acid sequence of 2CFE33 deposited with the American Type Culture
- Figure 55 A nucleic acid seq ⁇ ence of 2CFE34 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 56 A nucleic acid sequence of 2CFE35 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 57 A nucleic acid sequence of 2CFE36 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 58 A nucleic acid sequence of 2CFE37 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 59 A nucleic acid sequence of 2CFE38 deposited with the American Type Culture
- Figure 60 A nucleic acid sequence of 2CFE39 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 61 A nucleic acid sequence of 2CFE40 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 62 A nucleic acid sequence of 2CFE41 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 63 A nucleic acid sequence of 2CFE42 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 64 A nucleic acid sequence of 2CFE43 deposited with the American Type Culture
- Figure 65 A nucleic acid sequence of 2CFE44 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 66 A nucleic acid sequence of 2CFE45 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 67 A nucleic acid sequence of 2CFE46 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 68 A nucleic acid sequence of 2CFE47 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 69 A nucleic acid sequence of 2CFE48 deposited with the American Type Culture
- Figure 71 A nucleic acid sequence of 2CFE50 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 72 A nucleic acid sequence of 2CFE51 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 73 A nucleic acid sequence of 2CFE52 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 74 A nucleic acid sequence of 2CFE53 deposited with the American Type Culture
- Figure 75 A nucleic acid sequence of 2CFE54 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 76 A nucleic acid sequence of 2CFE55 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 77 A nucleic acid sequence of 2CFE56 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 78 A nucleic acid sequence of 2CFE57 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 79 A nucleic acid sequence of 2CFE58 deposited with the American Type Culture
- Figure 80 A nucleic acid sequence of 2CFE59 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 81 A nucleic acid sequence of 2CFE60 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 82 A nucleic acid sequence of 2CFE61 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 83 A nucleic acid sequence of 2CFE62 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 84 A nucleic acid sequence of 2CFE64 deposited with the American Type Culture
- Figure 85 A nucleic acid sequence of 2CFE65 deposited with the American Type Culture Collection as ATCC designation - on December 20,, 2000.
- Figure 86 A nucleic acid sequence of 2CFE66 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 87 A nucleic acid sequence of 2CFE67 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 88 A nucleic acid sequence of 2CFE68 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 89 A nucleic acid sequence of 2CFE69 deposited with the American Type Culture
- Figure 90 A nucleic acid sequence of 2CFE70 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 91 A nucleic acid sequence of 2CFE71 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 92 A nucleic acid sequence of 2CFE72 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 93 A nucleic acid sequence of 2CFE75 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 94 A nucleic acid sequence of 2CFE76 deposited with the American Type Culture
- Figure 95 A nucleic acid sequence of 2CFE78 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 96 A nucleic acid sequence of 2CFE79 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 97 A nucleic acid sequence of 2CFE80 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 98 A nucleic acid sequence of 2CFE81 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 99 A nucleic acid sequence of 2CFE82 deposited with the American Type Culture
- Figure 101 A nucleic acid sequence of 2CFE84 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 102 A nucleic acid sequence of 2CFE85 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 103 A nucleic acid sequence of 2CFE86 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 104 A nucleic acid sequence of 2CFE87 deposited with the American Type Culture
- Figure 105 A nucleic acid sequence of 2CFE88 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 106 A nucleic acid sequence of 2CFE89 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 107 A nucleic acid sequence of 2CFE90 deposited with He American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 108 A nucleic acid sequence of 2CFE91 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 109 A nucleic acid sequence of 2CFE92 deposited with the American Type Culture
- Figure 110 A nucleic acid sequence of 2CFE94 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 111 A nucleic acid sequence of 2CFE95 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 112 A nucleic acid sequence of 2CFE96 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 113 A nucleic acid sequence of 2CFE97 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 114 A nucleic acid sequence of 2CFE99 deposited with the American Type Culture
- Figure 115 A nucleic acid sequence of 2CFE101 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 116 A nucleic acid sequence of 2CFE102 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 117 A nucleic acid sequence of 2CFE103 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 118 A nucleic acid sequence of 2CFE104 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 119 A nucleic acid sequence of 2CFE105 deposited with the American Type
- Figure 120 A nucleic acid sequence of 2CFE106 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 121 A nucleic acid sequence of 2CFE107 deposited -with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 122 A nucleic acid sequence of 2CFE108 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 123 A nucleic acid sequence of 2CFE109 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 124 A nucleic acid sequence of 2CFE111 deposited with the American Type
- Figure 125 A nucleic acid sequence of 2CFE112 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 126 A nucleic acid sequence of 2CFE113 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 127 A nucleic acid sequence of 2CFE114 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 128 A nucleic acid sequence of 2CFE115 deposited with the American Type Culture Collection as ATCC designation on December 20, 2000.
- Figure 129 A nucleic acid sequence of 2CFE116 deposited with the American Type
- Figure 131 Schematic structures of alkyloids which are ligands, for example, of 2CFE42.
- a ceg nucleic acid molecule is said to be "isolated” when the nucleic acid molecule is substantially separated from contaminant nucleic acid molecules that encode polypeptides other than CEGs.
- isolated nucleic acid molecule refers to any RNA or DNA sequence obtained from a natural source, or constructed by recombinant methods, or synthesized. A skilled artisan can readily employ nucleic acid isolation procedures to obtain an isolated nucleic acid molecule having ceg sequences.
- ceg includes all isolated forms of ceg nucleotide and CEG amino acid sequences disclosed herein.
- the ceg sequences encode gene products that have essential biological functions in bacterial cells, such as, for example, nucleotide biosynthesis, amino acid biosynthesis, DNA replication, RNA transcription, protein translation, DNA recombination, DNA repair, biosynthesis of cofactors (e.g., Coenzyme A), biosynthesis of prosthetic groups, cellular processes (e.g., chaperones, cell division, and polypeptide secretion), energy metabolism (e.g., pentose phosphate pathway, glycolysis, gluconeogenesis), fatty acid biosynthesis, cell wall biosynthesis, and/or biosynthesis of purines, pyrimidines, nucleosides, and nucleotides.
- ceg nucleotide sequences are required for viability of bacterial cells.
- ceg also includes variants having nucleotide sequence similarity to the disclosed ceg sequences, including sequences isolated from various bacterial genera and species, allelic variants, mutant variants, and ceg variants that encode conservative and non-conservative amino acid substitutions.
- the present invention also provides for all ceg sequences generated by recombinant DNA technology, including complementary sequences, ceg sequences that hybridize to the sequences of the invention at high stringency hybridization conditions, fusion genes comprising a ceg sequence, and codon usage variants.
- essential genes refers to a nucleotide sequence that encodes a gene product having a function which is required for cell viability.
- essential protein refers to a polypeptide that is encoded by an essential gene and has a function that is required for cell viability. Accordingly, a mutation that disrupts the function of the essential gene or essential proteins results in a loss of viability of cells harboring the mutation.
- Non-essential genes or “non-essential proteins” refer to genomic information or the protein(s) or RNAs encoded therefrom which, when disrupted by a mutation, do not result in a loss of viability of cells harboring said mutation under defined laboratory conditions.
- nucleotide sequence is said to be “identical” to another reference sequence when both nucleotide sequences are exactly alike.
- nucleotide sequence is said to be "similar" to another reference sequence when a comparison of the two sequences shows that they have a low level of sequence differences.
- two sequences are considered to be similar to each other when the percentage of nucleotides that are shared between the two sequences is between about 70 % to 99.99% over the entire length of the two sequences.
- amino acid sequence is said to be "similar" to another reference sequence when a comparison of the two sequences shows that they have a low level of sequence differences.
- two sequences are considered to be similar to each other when the percentage of amino acids that are shared between the two sequences may be between about 30% to 100% identity over the entire length of the two sequences.
- an "allele” or “allelic sequence” is an alternative form of the naturally- occurring ceg sequence.
- a eles result from a mutation, that changes the nucleotide sequence, and generally produce altered mRNAs or polypeptides whose structure or function may or may not be altered.
- substantially purified as used herein means a specific isolated nucleic acid or protein, or fragment thereof, in which substantially all contaminants (i.e. substances that differ from said specific molecule) have been separated from said nucleic acid or protein.
- an "endogenous" sequence as used herein means a nucleic acid sequence that is naturally-occurring and resides within the host genome.
- an "exogenous" sequence as used herein means an isolated nucleic acid sequence that is introduced into the host cell, using any one of a variety of introduction methods, such as transfection, electroporation, cationic lipid or salt treatment methods.
- knockout mutant or “knockout mutation” as used herein refers to an in vitro engineered disruption of a region of endogenous chromosomal DNA (e.g., disruption of the genome), typically within a protein coding region.
- a knockout mutation can be generated by inserting an exogenous DNA sequence into the homologous endogenous sequence.
- a knockout mutation occurring in a protein coding region is expected to disrupt normal expression of the protein coding region. This usually leads to loss of the function provided by the protein.
- the present invention provides isolated and recombinant ceg nucleic acid molecules and fragments thereof, and related molecules, such as sequences complementary to ceg sequences or a portion thereof, and those that hybridize to the nucleic acid molecules of the invention.
- ceg polynucleotide sequences are preferably in isolated form, including DNA, RNA, DNA/RNA hybrids, and related molecules, and fragments thereof. Specifically contemplated are genomic DNA, ribozymes, and antisense molecules, as well as nucleic acid molecules based on an alternative backbone or including alternative bases, whether derived from natural sources or synthesized.
- Embodiments of particular ceg polynucleotide and amino acid sequences include, but are not limited to, the sequences described in Tables I and II (e.g., SEQ ID NOS: 1-113, 114-226 and SEQ ID NOS: 227-339, 340-452, respectively).
- the ceg polynucleotide and amino acid sequences were designated cfe which stands for CEG For Expression.
- the present invention also provides nucleic acid molecules having a nucleotide sequence substantially identical or similar to the ceg sequences (SEQ ID NOS: 1-113, 227-331) disclosed herein.
- the present invention provides nucleotide sequences which are similar to SEQ ID NOS-.1-113 and/or SEQ ID NOS:227-331.
- the present invention provides nucleotide sequences which vary from SEQ ID NOS: 1-113 or 227-331 by a range of about 1% to about 70%.
- the present invention encompasses variations in polynucleotide sequences resulting from mutations and/or from transfer of genetic material from one cell to another (e.g., horizontal gene transfer or horizontal gene exchange).
- the present invention also provides for variants of the polynucleotide ceg sequences disclosed herein, including variants isolated from naturaUy-occurring sources, those generated by recombinant DNA technology or other in vitro synthesis methodologies (e.g., PCR).
- the variant polynucleotide sequences of the invention encode polypeptides that exhibit the biological activity of naturally-occurring CEG polypeptides, such as activity required for bacterial cell viability.
- a variant of ceg polynucleotide sequences may encode a polypeptide that differs by one or more amino acid substitutions.
- the variant may have conservative changes, wherein a substituted amino acid has similar structural or chemical properties, eg, replacement of leucine with isoleucine.
- a polynucleotide sequence can encode conservative amino acid substitutions without altering either the conformation or the function of the polypeptide. Such changes include substituting any of isoleucine (I), valine (V), and leucine (L) for any other of these hydrophobic amino acids; aspartic acid (D) for glutamic acid (E) and vice versa; glutamine (Q) for asparagine (N) and vice versa; and serine (S) for threonine (T) and vice versa. Other substitutions can also be considered conservative, depending on the environment of the particular amino acid and its role in the three-dimensional structure of the protein.
- glycine (G) and alanine (A) can frequently be interchangeable, as can alanine (A) and valine (V).
- Methionine (M) which is relatively hydrophobic, can frequently be interchanged with leucine and isoleucine, and sometimes with valine.
- Lysine (K) and arginine (R) are frequently interchangeable in locations in which the significant feature of the amino acid residue is its charge and the differing pK's of these two amino acid residues are not significant. Still other changes can be considered "conservative" in particular environments.
- a variant may also have nonconservative changes, eg, replacement of a glycine with a tryptophan.
- Other variations may also include amino acid deletions or insertions, or both.
- Guidance in determining which and how many amino acid residues may be substituted, inserted or deleted without abolishing biological or immunological activity may be found using computer programs well known in the art, for example, DNASTAR software.
- ceg sequence variant includes naturally-occurring allelic variants of ceg which share significant similarity (e.g., between about 30- 99%) to the disclosed CEG polypeptide sequence. Allelic variants of the ceg sequences can encode conservative or non-conservative amino acid substitutions of the CEG polypeptide sequence herein described.
- allelic variants of ceg are mutant alleles of ceg polynucleotide sequences that encode a polypeptide having one or more changes in the polypeptide sequence, such as amino acid substitutions, deletions, insertions, frame shifts, or truncations.
- the mutant alleles of ceg may or may not encode a CEG polypeptide having the same biological functions as wild-type CEG proteins.
- Variations in the bacterial genomic sequences can also arise from transfer of genetic material to another bacterial cell. The transfer of gene sequences can occur intraspecies or interspecies. Gene transfer can occur between bacterial cells which are members of the same or different populations.
- a population includes, but is not limited to, a serotype isolate, a clinical isolate, a naturally-occurring isolate, a strain, and a species.
- the transfer of genetic material can occur between cells within a population; for example transfer between serotype A to serotype A, or between S. pneumoniae and S. pneumoniae.
- the transfer of genetic material can occur between cells of different populations; for example, between serotype A to serotype B or S. pneumoniae and S. mutans.
- Gene transfer can give rise to mutant or polymorphic variant genes sequences. In rare cases, gene transfer introduces new gene sequences that confer a new phenotype, such as antibiotic resistance.
- the transfer of genetic material includes transfer of large regions of genomic sequences which include partial gene sequences, whole single gene sequences, or multiple gene sequences. This mode of transfer can give rise to replacement of native whole gene sequences or introduction of new sequences in the recipient cell. This mode of transfer gives rise to mosaic gene sequences in the recipient cell.
- genomic sequences resulting from gene transfer can be examined using molecular techniques, including: multilocus enzyme electrophoresis (Selander. R. K., et al., 1986 Appl. Environ. Microbiol 51:837-884); and restriction endonuclease cleavage electrophoretic profiling (Coffey, T. J., et al, 1991 Mol. Microbio. 5:2255-2260); pulse- field gel electrophoresis fingerprinting (Bygraves, J. A. and Maiden, M. C. J. 1992 J. Gen. Microbiol 138:523-531); and ribotyping (Stull, T. L., et al., 1988 J. Infect. Dis. 157:280-286).
- the degree of variation can vary greatly, and ranges from little or no variation as exemplified by gene sequences of E. coli (Caugant, d. A., et al, 1981 Genetics 98:467-490; Whittam, T. S., et al., 1983 Mol Biol Evol 1 :67-83; Souza, N., et al, 1992 Proc. Natl. Acad. Sci. USA 89:8389-8393) and Salmonella (Selander, R. K., et al, 1990 Infect. Immun. 58:2262-2275; Selander, R.K. and Smith, ⁇ . H.T990 Rev. Med. Microbiol. 1 :219-228; Smith, J.
- Gene transfer can be examined between various isolates of a particular microbial species which are antibiotic-sensitive or antibiotic-resistent (Coffey, T. J., et al., 1991 Molec. Microbiol. 5:2255-2260).
- Molecular biology techniques can be utilized to study the degree of transfer between populations, such as, for example, the degree of gene transfer between serotypes, isolates, strains, " or species .
- the degree of transfer can be examined by comparing, for example, the penicillin binding proteins and numerous different loci which encode metabolic enzymes or capsular biosynthesis enzymes.
- intra-species, inter-serotype, gene transfer is possible (Coffey, T. J., et al., 1991 supra). Additionally, intraspecies gene transfer in S. pneumoniae (Coffey, T. J., et al., 1998 Mol. Microbiol. 27:73-83), Vibrio cholerae (Bik, E. M., et al, 1995 EMBO J. 14:209-216), and Haemophilus influenzae (KroU, J. S. and Moxon, E. R. 1990 J. Bacteriol. 112: 1374-1379) are possible.
- Variant gene sequences arising from gene transfer can be continually generated in transformable bacteria (e.g., transformation competent), such as S. pneumoniae.
- transformable bacteria e.g., transformation competent
- transformation competent e.g., transformation competent
- the worldwide spread of varying degrees of antibiotic resistance has. been documented and reviewed (Dowson, C. G., et al., 1994 Trends Microbiol. 2:361-366; Spratt, B. G. in Bacterial Cell Wall, eds Ghuysen J-M. and Hakenbeck, R. 1994 pp. 517- 534; and reviewed in Maiden, M. C. J. 1998 Clinic. Infect. Dis. 27 (Supplement 1) S12- S20).
- variant gene sequence arising from gene transfer can be tracked using a marker gene such as the gene which encodes the penicillin binding protein (Barcus, V. A., et al, 1995 FEMS Microbiol. Lett. 126:299-303).
- gene sequences encoding the penicillin binding proteins in susceptible and resistant strains differ by about 14% to 23% (Hakenbeck, R. 1995 Biochem. Pharmacol. 50:1121- 1127; Spratt, B. G. in Bacterial Cell Wall, eds Ghuysen J-M. and Hakenbeck, R. 1994 pp. 517-534; Spratt, B.
- the ceg nucleotide sequences can be isolated from various species of Streptococcus including Streptococcus pneumoniae. Additionally, the ceg sequences can be isolated from other Steptococcal species, including S. mutans, S. pyogenes, and S. thermophila, The ceg polynucleotide sequences can also be isolated from strains of other bacterial genera including, but not limited to, Streptococcus, Escherichia, Bacillus, Pseudomonas, Yersinia, Salmonella, and Haemophilus.
- the present invention additionally provides isolated codon-usage variants that differ from the disclosed ceg nucleotide sequences, yet do not alter the predicted CEG polypeptide sequence or function.
- the codon-usage variants may be generated by recombinant DNA technology. Codons may be selected to optimize the level of production of the ceg transcript or CEG polypeptide in a particular prokaryotic or eukaryotic expression host, in accordance with the frequency of codon utilized by the host cell.
- Alternative reasons for altering the nucleotide sequence encoding a CEG polypeptide include the production of RNA transcripts having more desirable properties, such as an extended half-life or increased stability.
- a multitude of variant ceg nucleotide sequences that encode the respective CEG polypeptide may be isolated, as a result of the degeneracy of the genetic code. Accordingly, the present invention contemplates selecting every possible triplet codon to generate every possible combination of nucleotide sequences that encode the disclosed CEG polypeptides.
- This particular embodiment provides isolated nucleotide sequences that vary from the sequences as described in SEQ ID NOs.: 1-113 or 227-331, such that each variant nucleotide sequence encodes a polypeptide having sequence identity with the amino acid sequences, as described in SEQ ID NOs. :114-226 or 332- 436, respectively.
- the present invention includes polynucleotide sequences that are complementary to the sequences disclosed herein.
- complementary refers to the capacity of purine and/or pyrimidine nucleotides to associate through hydrogen bonding to form double stranded nucleic acid molecules.
- the following base pairs are related by complementarity: guanine and cytosine; adenine and thymine; and adenine and uracil. Complementary applies to all base pairs comprising at least two single-stranded nucleic acid molecules.
- nucleic acid molecules that will hybridize to ceg sequences under hybridization conditions. It is readily apparent to one skilled in the art that the stringency of the hybridization condition selected will depend upon the characteristics of the nucleic acid molecule to be hybridized, such as, the length, the degree of complementarity (e.g., exact or non-exact complementarity), the percent A/T content, and the objective of the hybridization experiment.
- the stringency of the hybridization condition selected will depend upon the characteristics of the nucleic acid molecule to be hybridized, such as, the length, the degree of complementarity (e.g., exact or non-exact complementarity), the percent A/T content, and the objective of the hybridization experiment.
- the hybridization procedure may by performed in low stringency hybridization conditions.
- Low stringency hybridization conditions will permit hybridization between two nucleic acid molecules that differ from exact complementarity by about 25% to 70%.
- Hybridization under standard high stringency conditions will occur between two complementary nucleic acid molecules (e.g., 100% exact complementarity) or two complementary nucleic acid molecules that differ from exact complementarity by about 1% to about 70%.
- high stringency hybridization conditions that disfavor non-homologous base pairing are well known in the art.
- high stringency hybridization conditions includes but is not limited to, hybridizing at 50 °C to 65 °C in 5X SSPE, and washing at 50 °C to 65 °C in 0.5X SSPE.
- low stringency conditions includes but is not limited to, hybridizing at 35 °C to 37 °C in 5X SSPE and 40% to 45% formamide and washing at 42 °C in 1-2X SSPE.
- the invention further provides nucleic acid molecules having fragments of the ceg sequences, such as a portion of the ceg sequence (e.g., SEQ ID NOS: 1-113, 227-331) disclosed herein.
- the size of the fragment will be determined by its intended use. For example, the length of the fragment to be used as a nucleic acid probe or PCR primer is chosen to obtain a relatively small number of false positives during probing or priming.
- a fragment of the ceg sequence may be used to construct a recombinant fusion gene having a ceg sequence fused to a non-ceg sequence.
- the nucleic acid molecules, fragments thereof, and probes and primers of the present invention are useful for a variety of molecular biology techniques including, for example, hybridization screens of libraries, or detection and quantification of mRNA transcripts as a means for analysis of gene transcription and/or expression.
- the probes and primers are DNA.
- a probe or primer length of at least 15 base pairs is suggested by theoretical and practical considerations (Wallace, B. and Miyada, G. 1987 "Oligonucleotide Probes for the Screening of Recombinant DNA Libraries" in: Methods in Enzymology, 152:432-442, Academic Press).
- Other lengths of fragments, probes, or primers are possible and routine to determine.
- probes and primers of this invention can be prepared by methods well known to those skilled in the art (Sambrook, et al. supra). In a preferred embodiment the probes and primers are synthesized by chemical synthesis methods (ed: Gait, M. J. 1984 Oligonucleotide Synthesis, IRL Press, Oxford, England).
- nucleic acid primers that are complementary to ceg sequences, which allow the specific amplification of nucleic acid molecules of the invention or of any specific parts thereof.
- nucleic acid probes that are complementary for selectively or specifically hybridizing to the ceg sequences or to any part thereof.
- the nucleic acid molecules of the invention include peptide nucleic acids (PNAs), or derivative molecules such as phosphorothioate, phosphotriester, phosphoramidate, and methylphosphonate, that specifically bind to single-stranded DNA or RNA in a base pair- dependent manner (Zarnecnik, P. C, et al, 1978 Proc. Natl. Acad. Sci. 75:280284; Goodchild, P. C, et al., 1986 Proc. Natl. Acad. Sci. 83:4143-4146).
- PNAs peptide nucleic acids
- PNA molecules comprise a nucleic acid oligomer to which an amino acid residue, such as lysine, and an amino group have been added. These small molecules, also designated anti-gene agents, stop transcript elongation by binding to their complementary (template) strand of nucleic acid (Nielsen, P. E., et al, 1993 Anticancer Drug Des 8:53-63).
- anti-gene agents stop transcript elongation by binding to their complementary (template) strand of nucleic acid.
- the present invention provides RNA molecules that encode the predicted ceg gene products.
- the RNA molecules of the invention may be isolated foll-length or partial mRNA molecules or RNA oligomers that encode CEG gene products.
- the RNA molecules of the invention include the nucleotide sequences encoding all or portions of CEGs.
- RNA molecules of the invention also include antisense RNA molecules, peptide nucleic acids (PNAs), or non-nucleic acid molecules such as phosphorothioate derivatives, that specifically bind to the sense strand of DNA or RNA in a base pair- dependent manner.
- PNAs peptide nucleic acids
- non-nucleic acid molecules such as phosphorothioate derivatives
- the nucleic acid molecules having ceg sequences can be labeled with a detectable marker.
- a detectable marker include, but are not limited to, a radioisotope, a ' fluorescent compound, a bioluminescent compound, a chemiluminescent compound, a metal chelator or an enzyme. Technologies for generating labeled DNA and RNA probes are well known in the art (See e.g. Sambrook et al., supra).
- recombinant nucleic acid molecules such as recombinant DNA molecules (rDNAs) that comprise ceg sequences or fragments thereof.
- a recombinant DNA molecule is a DNA molecule that has been subjected to molecular manipulation in vitro. Methods for generating rDNA molecules are well known in the art, for example, see Sambrook et al., Molecular Cloning (1989), supra. a) Vectors
- the nucleic acid molecules of the invention may be recombinant molecules each comprising the sequence, or portions thereof, of a ceg sequence linked to a nox-ceg sequence.
- the ceg sequence may be fused operatively to a vector to generate a recombinant molecule.
- vector includes, but is not limited to, plasmids, cosmids, and phagemids.
- a preferred vector includes an autonomously replicating vector comprising a replicon that directs the replication of the rDNA within the appropriate host cell.
- the preferred vectors can also include an expression control element, such as a promoter sequence, which enables transcription of the inserted ceg sequences and can be used for regulating the expression (e.g., transcription and/or translation) of an operably linked ceg sequence in an appropriate host cell such as Escherichia coli.
- expression control elements are known in the art and include, but are not limited to, inducible promoters, constitutive promoters, secretion signals, enhancers, transcription terminators, and other transcriptional regulatory elements.
- Other expression control elements that are involved in translation are known in the art, and include the Shine- Dalgarno sequence, and initiation and termination codons.
- the preferred vector also includes at least one selectable marker gene that encodes a gene product that confers drug resistance such as resistance to ampicillin or tetracyline.
- the vector also comprises multiple endonuclease restriction sites that enable convenient insertion of .exogenous DNA sequences.
- the preferred vectors for generating ceg transcripts and/or the encoded CEG polypeptides are expression vectors which are compatible with prokaryotic host cells.
- Prokaryotic cell expression vectors are well known in the art and are available from several commercial sources.
- a pET vectors e.g., pET-21, Novagen Corp.
- pET-21 e.g., Novagen Corp.
- pET-21 e.g., Novagen Corp.
- the present invention provides recombinant vectors that may be used to integrate exogenously provided sequences into the genome of a host cell.
- the recombinant integration vectors of the present invention include a gene that encodes a selectable marker and ceg sequences, or fragments thereof.
- the integration vectors are used to integrate the ceg sequence into a target gene sequence that resides within the bacterial host genome (e.g., endogenous sequence), thereby disrupting the function of the target gene sequence within the bacterial cells.
- These integration vectors may be used in a gene disruption assay to screen candidate ceg nucleotide sequences, in order to identify the candidate sequences that encode a gene product that is required for bacterial cell viability.
- these recombinant integration vectors include candidate ceg sequences that will be screened to determine if the candidate ceg sequences encode a gene product that is required for cell viability.
- the candidate ceg sequence that is included as part of the recombinant integration vector is the "exogenous” ceg sequence that is employed as the "disrupting" sequence in a gene disruption assay.
- the ceg sequence that resides within the host genome is the "endogenous" or "target” ceg sequence.
- the integration event rarely occurs, for example, by non-homologous recombination in which a recombinant vector, that includes the exogenous ceg sequence, inserts the exogenous ceg sequence into a random location within the host genome.
- the integration event inserts the exogenous ceg sequence into a specific target site within the host genome.
- the targeted integration event can involve homologous recombination in which the integration vector, that includes the exogenous ceg sequence, inserts the exogenous ceg sequence into its homologous target ceg sequence that resides within the host's genome (e.g., the endogenous ceg sequence) ( Figure 1).
- exogenous ceg sequence can be used as a disrupting sequence whereby the homologous recombination event integrates the exogenous ceg sequence into the endogenous target ceg sequence resulting in disruption of the function of the endogenous ceg sequence.
- disrupting the function of the endogenous ceg sequence may result in the loss of bacterial cell viability.
- a recombinant vector that can be used as an integration vector in S. pneumoniae is the pEVP-3 vector (Jean-Pierre Claverys, et al. 1995 Gene 164: 123-128).
- the pENP-3 vector integrates an exogenous sequence by homologous recombination involving a Campbell-type event (S. Adhya and A. Campbell 1970 J. Mol. Biol 50:481- 490).
- the pEVP-3 vector includes a replicon that functions only in gram-negative bacteria, such as E. coli. Therefore, the p ⁇ NP-3 vector cannot replicate in S. pneumoniae.
- This vector also contains multiple cloning sites, and confers resistance to chloramphenicol in both a gram-negative and gram-positive bacteria, such as S. pneumoniae.
- a fusion ceg gene is another example of a recombinant molecule of the invention.
- a fusion gene includes a ceg sequence operatively fused (e.g., linked) to a non-ceg sequence such as, for example, a tag sequence to facilitate isolation and/or purification of the expressed CEG gene product (KroU, D.J., et al, 1993 DNA Cell Biol 12:441-53).
- a recombinant fusion molecule has a ceg sequence of the invention fused to a ceg sequence isolated from a different microbial source.
- the disclosed ceg sequences isolated from S. pneumoniae can be fused to a ceg sequence isolated from a different bacterial species.
- the invention additionally provides CEG proteins and peptide fragments thereof that are isolated or substantially purified.
- Embodiments of particular CEG amino acid sequences are disclosed in Tables I and II (SEQ ID NOS: 114-226 and SEQ ID NOS:332-436, respectively).
- the present invention also includes polypeptides having sequence variations from the predicted CEG polypeptide sequences disclosed herein, including mutant variants, conservative substitution variants, and similar CEG polypeptides from other prokaryotic organisms.
- CEG proteins CEG polypeptides
- CEG polypeptides or “proteins of the invention”.
- CEG protein refers to a polypeptide having amino acid sequence identity or similarity to any one of the predicted amino acid sequences, as provided in SEQ ID NO.: 114-226 or 332-436.
- the variant CEG polypeptides can be allelic forms of CEG, such as mutant forms of CEG polypeptides.
- the present invention also provides conservative substitution-mutants of the CEG proteins that maintain functional activity of wild-type CEG (e.g., the CEG polypeptide is required for bacterial cell viability).
- the CEG protein may be isolated from any source whether natural, synthetic, semi- synthetic, or recombinant.
- "natural” refers to a polypeptide which is found in nature.
- the CEG proteins may be isolated from a prokaryotic organism, such as a bacterial strain including, but not limited to, Streptococcus, Escherichia, Bacillus, Pseudomonas, Yersinia, Salmonella, and Streptomyces.
- the CEG proteins of the invention, and fragments thereof, can also be generated by recombinant methods or chemical synthesis methods.
- the CEG polypeptides of the invention are essential for the viability of a bacterial cell. Further, the CEG polypeptides can exhibit at least any one of the following functions: a pantothenate kinase, a HoUiday Junction branch migration protein, a single stranded DNA binding protein, a phosphoglucosamine mutase, an acetyltransferase, an uridylyltrarisferase, a malonyl CoenzymeA:ACP transcylase, a 3-oxoacyl-ACP synthase II, a 3-oxoacyl-ACP reductase, a phosphomethylpyrimidine (HMP-P) kinase, a GTP binding protein, a ATP binding protein, or a 4-aminoimidazole carboxylase.
- HMP-P phosphomethylpyrimidine
- Putative functions can include, but are not limited to, sugar transferase, techoic acid biosynthesis, ribosome recycling factor, response regulator, nicotinate phosphoribosyltransferase, nitropropane dioxygenase, (3R)-hydroxymyristol acyl carrier protein dehydrase, sugar dehydrogenase, murein biosynthesis, cobalimin biosynthesis, ABC transporter, tRNA modification enzyme, arylsulfatase, 16S processing enzyme, tRNA methyl transferase, elongation factor P, signal recognition particle, protein export, undecaprenol kinase, SRP docking domain, diacyl glycerol kinase, dihydopicilinate reductase, HU-DNA binding protein, thiamine biosynthase, GreA transcription elongation factor, dTDP-L-rhamnose synthase, ATP-binding motif, ribose-5
- phosphopanthetheine adenylyltransferase oligopeptide transport permease subunit, translocation protein, perM permease, DNA pol III gamma and tau subunits, DNA pol III delta subunit, signal peptidase I, acetyl-coA carboxylase biotin carboxyl carrier protein, protein chain release factor- 1, replicative DNA helicase, topoisomerase, pentapeptide-transferase, elongation factor G, spore coat polysaccharide biosynthesis protein C, protein release factor B, DNA polymerase III alpha subunit, phosphoprotein phosphatase, chaparonin, UDP-N-acetylmuramoylalanyl-D-glutamate-2, 6-diaminopimelate ligase, techuronic acid biosynthesis, UDP-glucose lipid carrier transferase, transcription termination factor, chromosome segregation factor, amino acid biosynthesis, HMG-CoA reduc
- the invention provides compounds that modulate (e.g., activate or inhibit) the function of a CEG polypeptide.
- Such compounds can provide lead-compounds for developing drugs for diagnosing and/or treating conditions associated with bacterial infections.
- the modulator is a compound that may alter the function of the CEG polypeptide, such as activating or inhibiting the function of a CEG polypeptide.
- the compound can act as agonist, antagonist, partial agonist, partial antagonist, cytotoxic agents, inhibitors of cell proliferation, and cell proliferation-promoting agents.
- the activity of the compound may be known, unknown or partially known.
- Suitable ligands include, but are not limited to, diazalactones, N-protected amino acid, azabicyclodiene, and alkaloids.
- N-protected amino acid is:
- alkaloids examples include:
- Recombinant methods are preferred if a high yield is desired.
- Recombinant . methods involve expressing the cloned gene in a suitable host cell.
- a host cell is introduced with an expression vector having the CEG sequence, then the host cell is cultured under conditions that permit in vivo production of the CEG protein.
- the recombinant vector can integrate the CEG sequence into the host genome.
- the CEG sequence can be maintained extra-chromosomally, as part of an autonomously replicating vector.
- the invention further provides a host-vector system comprising the vector, plasmid, phagemid, or cosmid comprising a ceg nucleotide sequence, or a fragment thereof, introduced into a suitable host cell.
- the host-vector system can be used to produce the CEG polypeptides encoded by the ceg nucleotide sequences.
- the host cell can be prokaryotic or eukaryotic. Examples of suitable prokaryotic host cells include bacteria strains from genera such as Escherichia, Bacillus, Pseudomonas, Streptococcus, and Streptomyces. Examples of suitable eukaryotic host cells include a yeast cell, a plant cell, or an animal cell, such as a mammalian cell.
- a preferred embodiment provides a host- vector system comprising the pET21 vector having a ceg sequence introduced into an E. coli ⁇ DE3 lysogen which is useful, for example for the production of the CEG protein, herein designated CFE polypeptides and CFE proteins.
- rDNA molecules of the present invention into an appropriate cell host is accomplished by well known methods that typically depend on the type of vector used and host system employed. For example, transformation of prokaryotic host cells by electroporation and salt treatment methods are typically employed, see for example, Cohen et al., 1972 Proc Acad Sci USA 69:2110; Maniatis, T., et al., 1989 Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY.
- Transformation of vertebrate cells with vectors containing rDNAs, electroporation, cationic lipid or salt treatment methods are typically employed, see, for example, Graham et al, 1973 Virol 52:456; Wigler et al., 1979 Proc Natl Acad Sci USA 76:1373-76.
- Successfully transformed cells i.e., cells that contain a rDNA molecule of the present invention
- cells resulting from the introduction of a rDNA of the present invention can be selected and cloned to produce single colonies.
- Cells from those colonies can be harvested, lysed and their DNA content examined for the presence of the rDNA using a method such as that described by Southern, JMol Biol (1975) 98:503, or Berent et al., Biotech (1985) 3:208, or the proteins produced from the cell assayed via a biochemical assay or immunological method.
- Procaryotes are generally used as host cells for cloning and producing the products of exogenous DNA sequences.
- Escherichia coli K12 BL21 ⁇ DE3 (Novagen) is particularly useful for expression of foreign proteins.
- Other strains of E. coli, and bacilli such as Bacillus subtilis, Enterobacteriaceae such as Salmonella typhimurium or Serratia marcescans, various Pseudomonas, Streptococcus, and Streptomyces species may also be employed as host cells in cloning and expressing the recombinant proteins of this invention.
- the production of recombinant CEG proteins may involve using a host/vector system, or other methods may be used.
- the host/vector system may employ the following steps.
- a nucleic acid molecule is obtained that encodes a CEG protein or a fragment thereof, such as any one of the polynucleotides disclosed in SEQ ID NOs.: 1-113 or 227-331.
- the CEG- encoding nucleic acid molecule is preferably inserted into an expression vector in operable linkage with suitable expression control sequences, to generate an expression vector including the CEG-encoding sequence.
- the expression vector is introduced into a suitable host, by standard transformation methods, and the resulting transformed host is cultured under conditions that allow the production of the CEG protein. For example, if expression of the CEG gene is under the control of an inducible promoter, then suitable growth conditions would include the appropriate inducer.
- the CEG protein e.g., designated a
- CFE polypeptide or protein so produced, is isolated from the growth medium or directly from the cells; recovery and purification of the protein may not be necessary in some instances where some impurities may be tolerated.
- a skilled artisan can readily adapt an appropriate host/expression system known in the art for use with CEG-encoding sequences to produce a CEG protein (Cohen, et al. , supra; Maniatis et al., supra).
- a preferred host is E. coli strain BL21( ⁇ DE3) transfected or transformed with a vector comprising a nucleic acid of the present invention.
- the invention also provides a host cell capable of expressing the ceg sequences described herein.
- the preferred host cell is any strain of E. coli that can accommodate high level expression of an exogenously introduced gene.
- the proteins of the present invention can also be made by chemical synthesis. The principles of solid phase chemical synthesis of polypeptides are well known in the art and may be found in general texts relating to this area (Dugas, H. and Penney, C. 1981 Bioorganic Chemistry, pp 54-92, Springer- Verlag, New York).
- CEG polypeptides may be synthesized by solid-phase methodology utilizing an Applied Biosystems 430 A peptide synthesizer (Applied Biosystems, Foster City, Calif.) and synthesis cycles supplied by Applied Biosystems.
- Protected amino acids, such as t-butoxycarbonyl- protected amino acids, and other reagents are commercially available from many chemical supply houses.
- the polypeptides of the invention exhibit properties of a CEG protein, such as, for example, the ability to elicit the generation of antibodies that specifically bind an epitope associated with CEG polypeptides. Accordingly, the CEG polypeptide, or any oligopeptide thereof, is capable of inducing a specific immune response in appropriate animals or cells and binding with specific antibodies.
- the invention further provides antibodies (e.g., polyclonal, monoclonal, chimeric, humanized, and human antibodies) that bind a CEG polypeptide.
- the most preferred antibodies will selectively bind a CEG polypeptide and will not bind (or will bind weakly) a non-CEG polypeptide.
- Antibodies that are particularly contemplated include monoclonal and polyclonal antibodies, as well as fragments thereof (e.g., recombinant proteins) which include the antigen binding domain and/or one or more complement determining regions of these antibodies. These antibodies can be from any source, for example, rabbit, sheep, rat, dog, cat, pig, horse, mouse, and human.
- the invention encompasses antibody fragments that specifically recognize a CEG polypeptide.
- an antibody fragment is defined as at least a portion of the variable region of the immunoglobulin molecule that binds to its target, i.e., the antigen binding region. Some of the constant region of the immunoglobulin may be included.
- the regions or epitopes of a CEG polypeptide to which an antibody is directed may vary with the intended application. For example, antibodies intended for use in an immunoassay for the detection of membrane- bound CEG proteins on viable bacterial cells should be directed to an accessible epitope on membrane-bound CEG proteins. Antibodies that recognize other epitopes may be useful for the identification of CEG protein within damaged or dying cells, for the detection of secreted CEG protein or fragments thereof.
- antibodies may be prepared by immunizing a suitable mammalian host using a CEG protein, peptide, or fragment, in isolated or immunoconjugated form (Harlow, 1989 Antibodies, Cold Spring Harbor Press, NY).
- CEG protein peptide, or fragment
- fusion proteins comprising CEG polypeptides may also be used, such as a CEG protein/GST-fosion protein.
- Cells expressing or overexpressing a CEG polypeptide may also be used for immunizations.
- any cell engineered to express CEG protein may be used. This strategy may result in the production of monoclonal antibodies with enhanced capacities for recognizing endogenous CEG protein.
- the present invention contemplates chimeric antibodies that comprise a human and non- human immunoglobin portion.
- the antigen combining region (variable region) of a chimeric antibody can be derived from a prokaryotic source (e.g., bacteria) and the constant region of the chimeric antibody which confers biological effector function to the immunoglobulin can be derived from a eukaryotic source (e.g., human).
- the chimeric antibody should have the antigen binding specificity of the prokaryotic antibody molecule and the effector function conferred by the eukaryotic antibody molecule.
- the procedure used to produce chimeric antibodies can involve the following steps: a) Identifying and cloning the correct immunoglobin gene segment encoding the antigen binding portion of the antibody molecule.
- This gene segment is known as the VDJ, variable, diversity and joining regions for heavy chains or VJ, variable, joining regions for light chains or simply as the V or variable region.
- This gene regions may be in either the cDNA or genomic form; b) Cloning the gene segments encoding the constant region or desired part thereof; c) Ligating the variable region with the constant region so that the complete chimeric antibody is encoded in a form that can be transcribed and translated; d) Ligating this construct into a vector containing a selectable marker and gene control regions such as promoters, enhancers and poly(A) addition signals; e) Amplifying this construct in bacteria; f) Introducing this DNA into eukaryotic cells (transfection) most often mammalian lymphocytes; g) Selecting for cells expressing the selectable marker; h) Screening for cells expressing the desired chimeric antibody; and k) Testing the antibody for appropriate binding specificity and effector functions.
- a selectable marker and gene control regions such as promoters, enhancers and poly(A) addition signals
- Chimeric antibodies of several distinct antigen binding specificities have been produced by protocols well known in the art, including anti-TNP antibodies (Boulianne et al, 1984 Nature 312:643); and anti-tumor antigen antibodies (Sahagan et al, 1986 J, Immunol. 137:1066).
- anti-TNP antibodies Booulianne et al, 1984 Nature 312:643
- anti-tumor antigen antibodies Sahagan et al, 1986 J, Immunol. 137:1066
- effector functions have been achieved by linking new sequences to those encoding the antigen binding region. Examples of these include enzymes (Neuberger et al., 1984 Nature 312:604); immunoglobulin constant regions from another species and constant regions of another immunoglobulin chain (Sharon et al., 1984 Nature 309:364; Tan et al., 1985 J. Immunol. 135:3565-3567).
- the predicted amino acid sequence of a CEG protein may be used to select specific regions of the CEG protein for generating antibodies.
- hydrophobicity and hydrophilicity analyses of a CEG polypeptide may be used to identify hydrophobic and hydrophilic regions in the CEG protein.
- Regions of the CEG protein that show immunogenic structure, as well as other regions and domains, can readily be identified using various other methods known in the art, such as Chou-Fasman, Garnier-Robson , Kyte- Doolittle, Eisenberg, Karplus-Schult or Jameson- Wolf analysis. Fragments that include the immunogenic regions are particularly suited for generating specific classes of antibodies.
- Methods for preparing a protein for use as an immunogen and for preparing immunogenic conjugates of a protein with a carrier such as BSA, KLH, or other carrier proteins are well known in the art. In some circumstances, direct conjugation using, for example, carbodiimide reagents may be used; in other instances linking reagents such as those supplied by Pierce Chemical Co., Rockford, EL, may be effective.
- Administration of a CEG immunogen is conducted generally by injection over a suitable time period and with use of a suitable adjuvant, as is generally understood in the art. During the immunization schedule, titers of antibodies can be taken to determine adequacy of polyclonal antibody formation.
- .Immortalized cell lines which secrete a desired monoclonal antibody may be prepared using the standard method of Kohler and Milstein (Nature 256: 495-497) or other techniques as described in Monoclonal Antibodies; A Manual of Techniques, CRC press, Inc., Boca Raton, Fla. (1987) ed. Zola.
- the immortalized cell lines secreting the desired antibodies are screened by immunoassay in which the antigen is the CEG polypeptide having binding activity, or a fragment thereof.
- the cells can be cultured either in vitro or by production in ascites fluid.
- the desired monoclonal antibodies are then recovered from the culture supernatant or from the ascites supernatant.
- Fragments of the monoclonal antibodies of the invention or the polyclonal antisera e.g., Fab, F(ab') 2 , Fv fragments, fusion proteins
- the immunologically significant portion i.e., a portion that recognizes and binds a CEG protein
- Humanized antibodies directed against a CEG polypeptide are also useful. The advantage of using humanized antibodies is that they are less immunogenic in humans.
- a humanized antibody is an immunoglobulin molecule which is capable of binding to a CEG polypeptide and which comprises a FR region having substantially the amino acid sequence of a human immunoglobulin and a CDR having substantially the amino acid sequence of non-human immunoglobulin or a sequence engineered to bind a CEG protein.
- immunologically reactive fragments such as the Fab, Fab', or F(ab') 2 fragments is often preferable, especially in a therapeutic context, as these fragments are generally less immunogenic than the whole immunoglobulin.
- bi-specific antibodies specific for two or more epitopes may be generated using methods generally known in the art.
- antibody effector functions may be modified so as to enhance the therapeutic effect of the antibodies of the invention.
- cysteine residues may be engineered into the Fc region, permitting the formation of interchain disulfide bonds and the generation of homodimers which may have enhanced capacities for internalization, ADCC and/or complement-mediated cell killing (Caron et al., 1992 J. Exp. Med.
- Homodimeric antibodies may also be generated by cross-linking techniques known in the art (Wolff et al., Cancer Res. 53: 2560- 2565).
- the invention also provides pharmaceutical compositions having the monoclonal antibodies or anti-idiotypic monoclonal antibodies of the invention.
- the antibodies or fragments may also be produced, using current technology, by recombinant means. Regions that bind specifically to the desired regions of the CEG protein can also be produced in the context of chimeric or CDR grafted antibodies of multiple species origin.
- the invention includes an antibody, e.g., a monoclonal antibody which competitively inhibits the immunospecific binding of any of the monoclonal antibodies of the invention to a CEG protein.
- methods for producing fully human monoclonal antibodies include phage display and transgenic methods, are known and may be used for the generation of human monoclonal antibodies (reviewed in: Naughan et al., 1998 Nature Biotechnology 16: 535- 539).
- folly human monoclonal antibodies may be generated using cloning technologies employing large human Ig gene combinatorial libraries (i.e., phage display) (Griffiths and Hoogenboom, "Building an in vitro immune system: human antibodies from phage display libraries"-, in: Protein Engineering of Antibody Molecules for Prophylactic and Therapeutic Applications in Man, Clark, M. (Ed.), Nottingham Academic, pp 45-64 (1993); Burton and Barbas, "Human Antibodies from combinatorial libraries” Id., pp 65- 82).
- large human Ig gene combinatorial libraries i.e., phage display
- Fully human monoclonal antibodies may also be produced using transgenic mice engineered to contain human immunoglobulin gene loci as described in PCT Patent Application WO98/24893, Jakobovits et al., published December 3, 1997 (see also, Jakobovits, 1998 Exp. Opin. Invest. Drugs 7: 607-614). This method avoids the in vitro manipulation required with phage display technology and efficiently produces high affinity, authentic human antibodies.
- the antibody or fragment thereof of the invention may be labeled with a detectable marker or conjugated to a second molecule, such as a therapeutic agent (e.g., a cytotoxic agent) thereby resulting in an immunoconjugate.
- a therapeutic agent e.g., a cytotoxic agent
- the therapeutic agent includes, but is not limited to, an anti-tumor drug, a toxin, a radioactive agent, a cytokine, a second antibody or an enzyme.
- the invention provides an embodiment wherein the antibody of the invention is linked to an enzyme that converts a prodrug into a cytotoxic drug.
- cytotoxic agents include, but are not limited to ricin, ricin A-chain, doxorubicin, daunorubicin, taxol, ethiduim bromide, mitomycin, etoposide, tenoposide, vincristine, vinblastine, colchicine, dihydroxy anthracin dione, actinomycin D, diphteria toxin, Pseudomonas exotoxin (PE) A, PE40, abrin, arbrin A chain, modeccin A chain, alpha-sarcin, gelonin, mitogellin, retstrictocin, phenomycin, enomycin, curicin, crotin, calicheamicin, sapaonaria officinalis inhibitor, and glucocorticoid and other chemotherapeutic agents, as well as radioisotopes such as 212 Bi, 131 1, 131 In, 90 Y, and 186 Re.
- Suitable detectable markers for diagnostic used include, but are not limited to, a radioisotope, a fluorescent compound, a bioluminescent compound, chemiluminescent compound, a metal chelator or an enzyme.
- Antibodies may also be conjugated to an anti- cancer pro-drug activating enzyme capable of converting the pro-drug to its active form. See, for example, U.S. Patent Nos. 4,952,394 and 5,716,990.
- a recombinant protein of the invention comprising the antigen-binding region of any of the monoclonal antibodies of the invention can be made.
- the antigen-binding region of the recombinant protein is joined to at least a functionally active portion of a second protein having therapeutic activity.
- the second protein can include, but is not limited to, an enzyme, lymphokine, oncostatin or toxin. Suitable toxins include those described above.
- the invention includes pharmaceutical compositions for use in the treatment of microbial infections comprising a pharmaceutically effective amount of an anti-CEG antibody or a CEG polypeptide.
- the pharmaceutical compositions may comprise a CEG antibody, either unmodified, conjugated to a therapeutic agent (e.g., drug, toxin, enzyme or second antibody) or in a recombinant form (e.-g., chimeric or bispecific).
- a therapeutic agent e.g., drug, toxin, enzyme or second antibody
- a recombinant form e.-g., chimeric or bispecific
- the compositions may additionally include other antibodies or conjugates (e.g., an antibody cocktail).
- the pharmaceutical compositions also preferably include suitable carriers and adjuvants which include any material which when combined with the molecule of the invention (e.g., an anti-CEG antibody or a CEG protein) retains the molecule's activity and is non- reactive with the subject's immune systems.
- suitable carriers and adjuvants include, but are not limited to, human serum albumin, ion exchangers, alumina, lecithin, buffer substances such as phosphates, glycine, sorbic acid, potassium sorbate, and salts or electrolytes such as protamine sulfate.
- compositions comprising such carriers are formulated by well known conventional methods. Such compositions may also be formulated within various lipid compositions, such as, for example, liposomes as well as in various polymeric compositions, such as polymer microspheres.
- compositions of the invention can be administered using conventional modes of administration including, but not limited to, intravenous, intraperitoneal, oral, intralymphatic or administration directly into the tumor. Intravenous administration is preferred.
- compositions of the invention may be in a variety of dosage forms which include, but are not limited to, liquid solutions or suspensions, tablets, pills, powders, suppositories, polymeric microcapsules or microvesicles, liposomes, and injectable or infusible solutions.
- dosage forms include, but are not limited to, liquid solutions or suspensions, tablets, pills, powders, suppositories, polymeric microcapsules or microvesicles, liposomes, and injectable or infusible solutions.
- the preferred form depends upon the mode of administration and the therapeutic application.
- the CEG polypeptides and proteins of this invention are found in common pathogenic bacterial species such as Streptococcus pneumoniae. This organism causes upper respiratory tract infections.
- the peptides and proteins of this invention can be used as immunogens in subunit vaccines for vaccination against a pathogenic bacteria such as Streptococcus pneumoniae.
- the ceg sequences of the invention can be used as DNA vaccines (U.S. Patent No. 5,736,524 and U.S. Patent No. 5,989,553).
- polypeptides and proteins of this invention can be formulated , as univalent and multivalent vaccines.
- the protein can be mixed, conjugated or fused with other antigens, including B or T cell epitopes of other antigens.
- a haptenic peptide of the proteins of the invention when used, (i.e., a peptide which reacts with cognate antibodies, but cannot itself elicit an immune response), it can be conjugated to an immunogenic carrier molecule. Conjugation to an immunogenic carrier can render the oligopeptide immunogenic.
- carrier molecules are tetanus toxin or toxoid, diphtheria toxin or toxoid and any mutant forms of these proteins such as CRM.sub.197. Others include exotoxin A of Pseudomonas, the heat labile toxin of E. coli and rotaviral particles (including rotavirus and VP6 particles).
- a fragment or epitope of the carrier protein or other immunogenic protein can be used. For example, the happen can be coupled to a T cell epitope of a bacterial toxin.
- the immunogen is adjusted to an appropriate concentration and formulated with any suitable vaccine adjuvant.
- suitable adjuvants include, but are not limited to: surface active substances, e.g., hexadecylamine, octadecylamine, octadecyl amino acid esters, lysolecithin, dimethyl- dioctadecylammonium bromide), methoxyhexadecylgylcerol, and pluronic polyols; polyamines, e.g., pyran, dextransulfate, poly.
- IC carbopol
- peptides e.g., muramyl dipeptide, dimethylglycine, tuftsin
- oil emulsions e.g., mineral gels, e.g., aluminum hydroxide, aluminum phosphate, etc. and immune stimulating complexes.
- the immunogen may also be incorporated into liposomes, or conjugated to polysaccharides and/or other polymers.
- the vaccines can be administered to a human or animal in a variety of ways. These include intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous, oral and intranasal routes of administration. Further, the vaccines can be live or inactivated vaccines.
- compositions of this invention The most effective mode of administration and dosage regimen for the compositions of this invention depends upon the severity and course of the disease, the patient's health and response to treatment and the judgment of the treating physician. Accordingly, the dosages of the compositions should be titrated to the individual patient.
- nucleic acid molecules of the invention and their encoded proteins may be employed as molecular weight markers.
- molecular weight of each of the nucleic acid molecules having ceg sequences and their predicted polypeptides can be determined and can be used to compare against other gene sequences and proteins whose molecular weights are unknown.
- the nucleic acid molecules of the invention may be employed in diagnostic embodiments.
- the presence of nucleotide sequences which are identical or similar to the ceg sequences of the invention may be detected within a biological sample.
- the biological sample may include blood, serum or a swab from nose, ear or throat, may be determined by means of a nucleic acid detection assay.
- Nucleic acid probes or primers having sequences complementary to ceg sequences may be used in a hybridization assay to detect the presence of the sequences which are identical or similar to the ceg sequences of the invention in the biological samples.
- nucleic acids molecules obtained from a suitable biological sample are hybridized with labeled probes or primers.
- the resulting hybridized molecules are detected and resolved by methods well known in the art , such as Northern or Southern blotting, micro-array technology, or amplifying with PCR technology.
- Other hybridization techniques and systems are known that can be used in connection with the detection aspects of the invention, including diagnostic assays such as those described in Falkow et al., U.S. Pat. No. 4,358,535.
- nucleic acid molecules are obtained from a suitable biological source and contacted with two primers corresponding to the ceg sequences disclosed herein, under conditions which allow for hybridization and polymerization to occur.
- a pair of probes, one corresponding to the 5' flanking region and the other corresponding to the 3' flanking region, would be sufficient to detect the nucleic acid molecules of the invention in a biological sample and may be used to indicate the amount of bacteria present.
- detecting nucleic acid molecules include, for example, in situ hybridization techniques, where a ceg probe is used to detect homologous sequences within one or more cells, such as cells within a clinical sample or even cells grown in tissue culture.
- the cells are prepared for hybridization by fixation, e.g. chemical fixation, and placed in conditions that allow for the hybridization of a detectable probe with nucleic acids located within the fixed cell.
- fixation e.g. chemical fixation
- the amount of ceg sequences present in a biological sample can be quantified and compared to the levels in a normal or "healthy" sample. For example, ceg sequences present in either increased or decreased levels, compared to the levels found in the control sample may indicate the presence of bacteria. This information is useful for ' diagnosis of a bacterial infection that requires treatment with an antibacterial agent.
- the amount of CEG polypeptides present in a biological sample may be determined by means of an immunoassay.
- labeled antibodies reactive against CEG polypeptides may be used in an immuno-reactive assay to detect the presence of CEG polypeptides in the biological samples.
- ceg nucleotide sequences of the invention can be used to identify nucleotide sequences which are identical or similar to the ceg sequences that are required for bacterial cell viability.
- the ceg sequences can be used in a bacterial gene disruption assay to screen candidate nucleotide sequences to identify sequences required for bacterial cell viability.
- the disruption assay can involve: introducing into a host cell a recombinant vector that is capable of integration into the host genome, where the recombinant vector, includes a candidate sequence that putatively encodes a cell-viability gene product (e.g., the exogenous ceg sequence); the vector integrates the candidate sequence into a target sequence within the host's genome (e.g., the endogenous ceg sequence); and the host cell, so introduced, is screened for viability.
- the recombinant vector preferably includes a selectable marker so that the introduced host cell can be screened for viability in the presence of a selectable agent.
- Figure 1 shows a schematic representation of a gene disruption assay, within a bacterial host cell.
- the recombinant vector, pENP3 includes the CAT gene (e.g., the selectable marker chloramphenicol acetyl transferase) and an internal region of the ceg disrupting sequence; the internal region excludes the 5' and 3' ends of the ceg sequence.
- the "X" in Figure 1 indicates the recombinant pEVP3 vector undergoing homologous recombination with the target sequence (e.g., within the host genome).
- Figure IB the resolved pEVP3 vector that is integrated into the host genome, is shown.
- the native promoter of the target gene a 5' partial copy of the target gene
- the body of the integrated pEVP3 vector including the disrupting gene and CAT and, a 3' partial copy of the target gene.
- integration of the pEVP3 vector via homologous recombination results in two partial gene duplications flanking the integrated vector.
- the target gene is not essential for survival, it is possible to recover chloramphenicol-resistant colonies of S. pneumoniae. Failure to recover chloramphenicol resistant colonies, in the presence of the proper controls as described below, indicates that the target gene may be essential for cell viability.
- the gene disruption assay for screening candidate ceg sequences can involve the following steps.
- the recombinant pEVP-3 vector encoding CAT resistance and having a fragment of a candidate ceg sequence can be introduced into transformation-competent S. pneumoniae cells by methods that are well-known in the art (Lee, M.S., et al., 1998 Appl Environ. Microbiol. 64:4796-4802).
- the preferred size of the ceg fragment can be between about 200 to about 500 bp in length. It is advantageous that the candidate ceg sequence does not include the 5' and 3' ends that encode the ⁇ - and C-terminal ends of the CEG polypeptide.
- the transformation-competent cells can be obtained by performing the transformation step in the presence of a heptadecapeptide that induces competence for transformation of S. pneumoniae (Havarstein, L. S., et al., 1995 Proc. Natl Acad. Sci. 92:11140-11144), such as the CSP-1 -peptide.
- the CSP-1 can be naturally-derived or synthetic.
- the transformation step can.be optimized by performing the transformation when the cells have reached a density which is optimal for transformation (e.g., 3 X 10 7 cells per ml.) (Havarstein, L. S. et al. supra).
- the recombinant vector can be introduced into the competent pneumococci and may undergo homologous recombination, whereby the candidate ceg fragment recombines with the corresponding endogenous ceg sequence, resulting in targeted integration of the vector into the pneumococcal genome and disruption of the endogenous ceg.
- the transformed cells can be plated on or cultured in chloramphenicol-containing growth medium.
- the cells can be cultured under standard conditions, such as 37° C in 5% CO 2 for approximately 40 to 48 hours, for the purpose of selecting cells that carry the integrated vector.
- control samples can be run in parallel with the gene disruption assay, in order to determine whether the gene disruption procedure is working properly.
- the control samples can be used to calibrate the gene disruption experiment so that disruption of a known non-essential bacterial gene results in an approximate number of colonies per plate.
- the disruption of a known essential gene can be calibrated to yield only zero or one colony per plate. The appearance of one colony is due to the rare illegitimate recombination into a non-homologous sequence.
- a known non-essential gene such as the lytA gene (Tomasz, A., et al., 1988 J Bacteriol.
- ftsZ gene (Lutkenhaus, J. F., et al., 1980 J. Bacteriol. 143:1281-1288), a known essential gene, can be used to yield zero or, rarely, one colony per plate.
- specific parameters that are involved in any given gene disruption assay can be adjusted to calibrate the desired number of plated cells in the control samples. Experimental parameters that can be adjusted include, but are not limited to, the E.
- the transformed cells carrying the recombinant integration vector that disrupts expression of an endogenous essential gene can be identified, based on a selectable phenotype such as non- viability. For example, the cells that carry a disrupted non-essential gene will be viable and, due to the integration of pEVP3, will grow on chloramphenicol-containing medium.
- cells that carry a disrupted essential gene will not grow (e.g., non-viable) on the chloramphenicol-containing medium.
- the transformed cells that do not grow under these selective conditions carry an endogenous gene sequence that is essential for cell viability which has been disrupted by an exogenous candidate fragment, thereby identifying a ceg sequence. Steps one through three may be repeated in order to confirm that the ceg sequences, so identified, are essential for cell viability.
- the lytA transformation control can be used to confirm that the transformation system is functioning properly. For example, a phenotypic test for autolysin activity (lytA gene product) can be performed to determine that the exogenous lytA fragment is correctly integrated into the lytA site within the host genome. This typically involves flooding the culture plates containing transformants carrying the integrated lytA control vector with a solution of detergent, such as 0.1% deoxycholate, which triggers cell lysis in lytA -intact cells (e.g., the cells that have not undergone homologous recombination). After about 5- 10 minutes the colonies with intact lytA will appear ghost-like due to cell lysis, and the colonies with a disrupted lytA gene will appear intact.
- a solution of detergent such as 0.1% deoxycholate
- the ceg sequences that are confirmed to be essential for cell viability can be examined further by performing a polarity analysis to determine if the corresponding endogenous ceg sequence is organized in an operon.
- Polarity is an effect unique to prokaryotes and is the result of the operon organization of bacterial genomes.
- Many bacterial genes are arranged in operons in which multiple genes are under the control of a single regulatory sequence (e.g., a promoter) and are transcribed into a single mRNA transcript. With respect to the orientation of multiple genes within an operon, the genes that are proximal to the regulatory sequence are said to be "upstream" genes and the genes that are distal are said to be "downstream” genes.
- many operons contain genes encoding different proteins that catalyze discrete steps of a common biochemical pathway. Thus, any of the proteins that catalyze the steps of the pathway may be essential for cell viability.
- operons in a bacterial host genome may influence the interpretations of the gene disruption results. For example, disruption of an upstream gene may be erroneously interpreted as affecting the expression of the disrupted gene but may, in fact, have expression affects on the intact downstream genes. Therefore, it is advantageous to perform a polarity analysis to determine if a ceg sequence is part of an operon.
- a polarity analysis can involve performing an in vivo gene disruption procedure using, as the disrupting sequence, a ceg sequence that includes the entire ceg coding sequence region but lacking expression regulatory sequences. This differs from the gene disruption assay, which involves the central region of the ceg sequence.
- the polarity analysis involves gene duplication via homologous recombination.
- the pEVP-3 vector having the entire coding region of a ceg sequence can be used for the polarity analysis ( Figure 2 A).
- the polarity analysis will yield different results depending on the organization of the endogenous target sequence within the host genome. *
- Figure 2 shows a schematic representation of the polarity test for operons, within a bacterial host cell.
- the recombinant vector, pEVP3 includes the
- FIG. 2 indicates the recombinant pEVP3 vector undergoing homologous recombination with the target sequence.
- FIGs 2 B and C Two of the possible results of homologous recombination are shown in Figures 2 B and C.
- Figure 2 B case 1, if the endogenous target sequence is not organized in an operon, the integration event may yield: a functional target sequence (e.g., it is capable of expression); a duplicate non-functional target sequence that lacks a promoter; and a functional downstream gene (e.g., Gene B) that is controlled by its own promoter.
- the cells carrying this type of integrated target sequence can be recovered as viable cells that grow in the presence of chloramphenicol; this condition is termed "polarity negative".
- case 2 if the target sequence is organized in an operon, then the integration event may yield an integration site that is similar to that described for case 1 , including: a functional target sequence; and a duplicate non-functional target sequence which is not functional. However, this integration event may also yield a non-fonctional downstream gene (e.g., Gene B) because expression of this downstream gene is controlled by a promoter located upstream of the insertion site. The cells that carry this type of integrated target sequence will be non- viable; this condition is termed "polarity positive". Thus, the polarity analysis provides a method to determine whether integration of a recombinant vector into a target ceg sequence effects expression of downstream genes.
- a non-fonctional downstream gene e.g., Gene B
- ceg sequences disclosed herein encode gene products that are essential for viability in S. pneumoniae. Furthermore, many of these ceg sequences have been analyzed for the polarity effect and the results are presented in
- ceg sequences Another subset of ceg sequences is classified as polarity positive (+), since the homologous recombination event did affect the expression of downstream genes.
- the ceg sequences that have not yet been classified as polarity positive or negative are indicated in Table I as a blank.
- the genes downstream of the disrupted endogenous ceg sequences may or may not also be essential.
- the present invention provides screening methods for identifying agents that interact and/or bind to the CEG proteins of the invention, such as a ligand.
- An agent can be, for example, a natural product, a derived or synthetic chemical molecule, a polypeptide, a nucleic acid molecule, or a metal.
- the agents that interact with CEG proteins may cause bacterial cell death by disrupting the functions of CEG proteins, including, but not limited to, nucleotide biosynthesis, DNA replication, RNA transcription, protein translation, and/or cell wall biosynthesis.
- the present invention provides screening methods for identifying agents having antibacterial activity, such as agents that cause bacterial cell death by interacting with the CEG proteins. These antibacterial agents are useful for treating diseases and afflictions associated with bacterial infections.
- Various methods can be used to discover agents having antibacterial activity, as determined by the ability of the binding agent to bind to a CEG protein and disrupt the function of the CEG protein. These screening methods include whole cell in vivo assays as well as in vitro assays with cellular components.
- An in vivo screening method for identifying ligands that bind CEG polypeptides can be performed in a whole cell assay.
- a typical method may be the use of whole bacterial cells to assess the antibacterial properties based on cell growth or viability.
- These methods can include methods for measuring cell growth and/or viability, for example, by optical density or zones of growth (Koch, A. L. et al., 1970 Anal Biochem. 38:252-259; Biemer, J. J. et al., 1973 Ann. Clin. Lab. Sci. 2:135-140; Manual of Clinical Microbiology, 7 th edition, Murray, P. R. (ed), ASM Press), by growth inhibition in an agar assay (Murray, P.
- reporter genes include, but are not limited to, beta- galactosidase, alkaline phosphatase, luciferase, and green fluorescent protein.
- one embodiment provides a reporter system that monitors inhibition of DNA synthesis by fusing a reporter such as beta-galactosidase (lacZ) to genes known to be upregulated by the cessation of DNA synthesis as a result of the binding of ligands to the DNA synthetic apparatus.
- a reporter such as beta-galactosidase (lacZ)
- lacZ beta-galactosidase
- yeast two-hybrid system may be adapted to screen for ligands that bind CEG polypeptides.
- the yeast two-hybrid system is performed in a yeast host cell carrying a reporter gene, and is based on the modular nature of the GAL transcription factor which has a DNA binding domain and a transcriptional activation domain.
- the yeast two-hybrid system relies on the physical interaction between a recombinant polypeptide that comprises the GAL DNA binding domain and another recombinant polypeptide that comprises the GAL transcriptional activation domain.
- the physical interaction between the two recombinant polypeptides reconstitutes the transcriptional activity of the transcription factor, thereby causing expression of the reporter gene.
- Either of the recombinant polypeptides used in the two-hybrid system can be generated to include a CEG polypeptide sequence to screen for binding partners of CEG.
- the in vitro screening method comprises: a) generating the CEG protein of the invention, or membranes enriched in the CEG protein; b) exposing the CEG protein or membranes to a candidate agent; and c) detecting the interaction of the CEG protein with the agent by any suitable means.
- the screening methods may be adapted to automated high-throughput procedures, such as PANDEX.RTM Baxter-Dade Diagnostics, allowing for efficient high-volume screening of candidate agents.
- an alternative method for screening potential ligands involves an in vitro binding procedure.
- the CEG proteins of the invention can be produced using recombinant DNA technology and host-vector systems as described herein.
- a candidate agent is introduced into a reaction vessel containing the CEG protein, o fragment thereof; the candidate agents may be detectable by methods such as, but not limited to, radioisotope or chemical labeling.
- Binding of the CEG protein by a candidate agent can be determined by any suitable means, including, for example, quantifying bound label versus unbound label using any suitable method. Binding of a candidate agent may also be detected by methods similar to an alternative physical method disclosed in U.S. Patent No. 5,585,277.
- binding of a candidate agent to a protein is assessed by monitoring the ratio of folded protein to unfolded protein, for example by monitoring sensitivity of the protein to a protease, or amenability to binding of the protein by a specific antibody against the folded state of the protein, or binding to chaperone protein, or by binding to any suitable surface.
- the invention provides methods of identifying compounds that modulate (e.g., activate or inhibit) the function of a CEG polypeptide.
- any compound can be used in the assays of the invention.
- the preferred compounds are those that are soluble in aqueous or organic solutions. It will be appreciated by those of skill in the art that there are many commercial suppliers of chemical compounds that can be used in the methods of the invention, including Sigma Chemical Co. (St. Louis, Mo.), Aldrich Chemical Co. (St. Louis, Mo.), Sigma-Aldrich (St. Louis, Mo.), Fluka Chemika-Biochemica Analytika (Buchs, Switzerland), and the like.
- the present invention provides methods for detecting compounds which are identified as modulators of CEG function.
- the methods of the invention can be performed using isolated CEG polypeptides, or use whole cells expressing the CEG polypeptide.
- the steps, of the method using isolated CEG polypeptides include: contacting the isolated CEG polypeptide with a candidate compound; and determining whether the function of the CEG polypeptide is altered.
- the steps of the method using whole cells include: contacting the whole cells with a candidate compound; and determining whether the cell dies, indicating the compound inhibited the function of a CEG polypeptide.
- the preferred methods of the invention provide high-throughput screening assays for identifying compounds which modulate the function of a CEG polypeptide. The high throughput methods permit screening of large libraries of compounds.
- the high throughput methods can use automated assay steps.
- the assays can be performed in parallel on a solid support, as microtiter formats on microtiter plates in robotic assays are well known.
- a preferred embodiment of the methods includes adapting the methods to use microtiter plates or pico- nano- or micro-liter arrays. In high throughput assays it is desirable to run positive controls to ensure that the components of the assays are working properly.
- the high throughput screening methods of the invention include providing a combinatorial library containing a large number of compounds (candidate modulator compounds) (Borman, S, C. & E. News, 1999, 70(10), 33-48).
- Such combinatorial chemical libraries can be screened in one or more assays to identify library members (particular chemical species or subclasses) that exhibit the ability to modulate the function of the CEG polypeptide (Borman, S., supra; Dagani, R. C. & E. News, 1999, 70(10), 51-60).
- the compounds, so identified can serve as lead-compounds or can themselves be used as potential or actual therapeutics.
- a combinatorial chemical library is a collection of diverse chemical compounds generated by using either chemical synthesis or biological synthesis, to combine a number of chemical building blocks, such as reagents.
- a linear combinatorial chemical library such as a polypeptide library, is formed by combining a set of chemical building blocks (amino acids) in every possible way for a given compound length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical compounds can be synthesized through such combinatorial mixing of chemical building blocks.
- combinatorial chemical libraries include, but are not limited to, peptide libraries
- chemistries for generating chemical diversity libraries can also be used. Such chemistries include, but are not limited to, peptoids (PCT Publication No. WO 91/19735); encoded peptides (PCT Publication WO 93/20242); random bio-oligomers (PCT Publication No. WO 92/00091); benzodiazepines (U.S. Pat. No.
- each well of a microtiter plate can be used to run a separate assay against a selected potential modulator, or if concentration or incubation time effects are to be observed, every 5-10 wells can test a single modulator.
- a single standard microtiter plate can assay about 100 (96) modulators. If 1536 well plates are used, then a single plate can easily assay from about 100 to about 1500 different compounds. It is possible to assay many different plates per day; assay screens for up to about 6,000-20,000, and even up to about 100,000- 1,000,000 different candidate modulator compounds are possible using the methods of the invention.
- the following provides a general description of how a list of candidate ceg sequences was generated.
- the list was generated by selecting candidate ceg gene sequences from a Concordance web engine using the method described in: Bruccoleri, R.E., Dougherty, T.J., Davison, D.B. (1998) "Concordance analysis of microbial genomes” in: Nucleic Acids Res 26:4482-4486.
- the entire genomic sequence data of various bacteria was acquired from several public and proprietary sequence database sources, including GTC (Genome Therapeutics Corporation), and TIGR (The Institute for Genomic Research). Predicted ORFs from the genomic data were identified, translated, and stored. , The desirable ORFs were at least 90 amino acid residues in length. Concordance analysis was performed among bacteria and various parameters were used to filter out genes with high similarity to eukaryotes.
- the entire genomic sequence of various Eubacteria was acquired from several public and private sources.
- Public data was obtained from GenBank (http ://ncbi.nlm.nih. gov), The Institute for Genomic Research (TIGR), the Yeast Proteome Database, from Proteome, Inc. of Beverly, MA, and the Sanger Center of the Medical Research Council of the United Kingdom (http://www.sanger.ac.uk).
- the non-microbial sequence data used as a basis for comparison and data subtraction was obtained from a proprietary database, including the LifeSeq Database from Incyte Pharmaceuticals, Palo Alto, CA.
- Incyte nucleotide sequences were translated into protein sequences in all six possible reading frames.
- GTC supplied predicted protein sequences with their data.
- CRITICA the program for eubacterial nucleotide sequences.
- the output was processed and stored in a PostGres 95 database fhttp://www.postgresql.org).
- Graphical user interfaces, using web browser technology, were constructed to query the database.
- a Concordance Analysis was performed on the data.
- the question used to generate the dataset was show all Streptococcus pneumoniae open reading frames with a similarity greater than or equal to 30% overall protein sequence identity to both selected gram- positive and/or gram-negative bacteria in the database.
- the data was further required not to match yeast or human sequences at greater than 30% overall protein sequence similarity.
- the resulting dataset included a list of more than 400 conserved amino acid sequences having known or unknown function.
- the amino acid sequences having unknown functions formed the basis of a list designated conserveed Unknown Reading Frames, or CURFs which is a subset of the total list of CEGs (e.g., CURFs includes known and unknown).
- the resulting list of conserved genes (e.g., more than 400 sequences) was used as a basis for selecting and screening bacterial gene sequences that are essential for cell viability.
- the Concordance system was designed to permit high-throughput identification of conserved gene sequences in the database. (Bruccoleri, R, Dougherty, T, and Davison, D. 1998 "Concordance analysis of microbial genomes” Nucleic Acids Res. 26:4482-4486.)
- the resulting list of conserved genes was used as a basis for selecting and screening bacterial gene sequences that are essential for cell viability. This Concordance system was designed to permit high throughput use of the conserved gene sequences contained on the list.
- a set of Knockout PCR primers were generated, based on the list of conserved genes, for the purpose of use in the gene disruption procedure described below.
- the PCR primers were designed to amplify a central 300-500 bp region of the ceg (to prevent generation of a functional copy of the ceg gene following integration), ordered electronically, the primers were placed in a 96-well format, and used in the gene disruption procedure as described below.
- the following provides a description of the procedure to generate recombinant vectors of pEVP-3 having inserts of candidate ceg nucleotide sequences.
- the Knockout primers generated by the method described in Example 1 above were used to generate DNA fragments comprising candidate ceg sequences.
- 96-well plate format were set up (36 ⁇ l H 2 O , 5 ⁇ l 10* VentTM buffer, 1 ⁇ l gene specific, knockout forward primer (0.5 ⁇ g/ ⁇ l), 1 ⁇ l gene specific knockout reverse primer (0.5 ⁇ g/ ⁇ l), 0.5 ⁇ l VentTM DNA polymerase (2000 U/ml New England Biolabs, Beverly, MA), 1.5 ⁇ l each dNTPs (lOmM; 6.0 ⁇ l total), 0.5 ⁇ l S. pneumoniae chromosomal DNA (0.5 ⁇ g/ ⁇ l), 50 ⁇ l total volume/reaction).
- the nucleotide sequences of the forward and reverse knockout primer pairs were generated from the nucleotide sequence information obtained from the Genomic Therapeutics Corporation database for Streptococcus pneumoniae.
- the primer pairs were each used in a PCR reaction to generate a unique internal (e.g., central region) fragment of the candidate gene targeted for knockout.
- the PCR program was set in the PCR machine (Initial 95 °C - 5 minutes: 30 Cycles of: 95 °C - 1 minute, 58 °C - 1 minute, 72 °C - 30 seconds; Final, 72 °C - 10 minutes, 4 °C - hold indefinitely). 5 ⁇ l of each reaction was run on an 0.8% agarose gel after purifying fragment over PCR purification kit (Qiagen) to visualize the fragments then ligation reactions were performed.
- Ligation Reactions proceeded (set up in 96-well plate format (10.0 ⁇ l genomic PCR fragment (generated from step 2 above), 1.0 ⁇ l pEPV-3 Smal-cut vector (1: 10 dilution of vector DNA at 50-100 ng/ ⁇ l), 1.5 ⁇ l 10* ligation buffer (New England BiolabsTM), 1.0 ⁇ l T4 DNA Ligase (New England BiolabsTM 400,000 U/ml), 1.5 ⁇ l ddH 2 O, 15.0 ⁇ l total reaction volume).
- the nucleotide sequences of the forward and reverse primer pairs used for the polarity test were generated in a similar manner, from the nucleotide sequence information obtained from the Genomic Therapeutics Corporation database for Streptococcus pneumoniae.
- the primer pairs were each used in a PCR reaction to generate a unique fragment of the candidate gene targeted for the polarity test.
- the fragment generated for the polarity test included the entire ceg coding sequence region but lacking the expression regulatory sequences.
- the colony PCR involved the following. 96-well plate format was set up (36.5 ⁇ l H 2 O, 0.5 ⁇ l pEPV3 forward primer (0.25 ⁇ g/ ⁇ l), 0.5 ⁇ l pEPV3 reverse primer (0.25 ⁇ g/ ⁇ l), 1.5 ⁇ l each (6.0 ⁇ l total) dNTPs (10 mM), 0.5 ⁇ l VentTM DNA polymerase, 5 ⁇ l 10* VentTM buffer, 1 ⁇ l of a 1:50 cell dilution, 50 ⁇ l total volume).
- pEPV3 forward primer 5' CATCAAGCTTATCGATACCGTCG 3' (SEQ ID NO:437)
- p EP V3 reverse primer 5 ' CACAGTAGTTCACCACCTTTTCCC 3 ' (SEQ ID NO :438)
- Colonies of E. coli LE392 were picked onto a master plate of LB + 13 ⁇ g/ml chloramphenicol (incubate throughout the day at 37° C) and then into 50 ⁇ l H 0 which has been placed into a 96-well plate. 1 ⁇ l of this dilution was used in above PCR reaction (if the 96-well dilution plate is kept you will not need to prepare a master plate). Cultures for minipreps of plasmid candidates may be prepared directly from the cell dilutions.
- the PCR program was run (95 °C - 5 minutes, 30 Cycles of: 95 °C - 1 minute, 58 °C - 1 minute, 72 °C - 30 seconds, 72 °C - 10 minutes, 4 °C - hold).
- a 10 ⁇ l/ reaction was run on a 1.0 % TBE gel.
- a gel designed for 96 well plates and a multichannel pipettor were used to ease loading of the sample rows.
- the gel was run and stained with ethidium bromide.
- the positive clones were identified with appropriate molecular size insert(s), amplified by the flanking pEVP-3 primers.
- the constructs that carried an insert were identified.
- the constructs having an insert were inoculated into a 5 ml LB/Cm culture, and incubated over night at 37 °C with aeration.
- Miniprep plasmid DNA was prepared by a standard procedure.
- the miniprep DNA was digested with appropriate restriction enzymes to confirm the presence of the insert (enzymes flank Smal site in pEVP-3) (10 ⁇ l miniprep DNA, 2 ⁇ l 10 buffer, 1 ⁇ l Xbal, 1 ⁇ l Xhol, 6 ⁇ l ddH20, ' 20 ⁇ l total volume for digest).
- the digest reactions were electrophoresed on an agarose gel and the gel was stained with ethidium bromide. The positive clones were used for the S. pneumoniae KNOCKOUTs procedure.
- the confirmatory PCR reactions, using knock out-specific primers involved 35.5 ⁇ l H 2 O, 5 ⁇ l 10 x VentTM buffer, 1 ⁇ l knockout forward primer (0.5 ⁇ g/ ⁇ l), 1 ⁇ l knockout reverse primer (0.5 ⁇ g/ ⁇ l), 0.5 ⁇ l VentTM (6.0 ⁇ l total) DNA Polymerase (2000 U/ml), 1.5 ⁇ l each dNTPs (lOmM, 6.0 ⁇ l total), 1.0 ⁇ l miniprep DNA from test clone, 50 ⁇ l total reaction volume.
- the PCR program was as follows: 95 °C for 5 minutes, 30 Cycles of: 95 °C for 1 minute, 60 °C for .1 minute, 72 °C for 30 seconds, 72 °C for 10 minutes, hold at 4 °C.
- the presence of the correct-sized insert was confirmed by agarose gel electrophoresis and ethidium bromide staining.
- the confirmed clones were used for the S. pneumoniae gene KNOCKOUT procedure.
- Glycerol stocks were made of all positive E. coli LE392 constructs and frozen at - 80 degrees C.
- the following provides a description of the high throughput gene disruption procedure used in S. pneunomiae strain (e.g., gene knockout procedure).
- the candidate ceg fragments that were generated by the method described in Example 2 were used in the gene disruption procedure in order to identify ceg nucleotide sequences that are required for cell viability.
- the following provides a description of the autolysin procedure used to determine that the non-essential control samples of S pneumoniae contain a disrupted lytA gene.
- the culture plates containing transformants carrying the lytA control vector were flooded with 0.1% deoxycholate in H 2 O. The plates were observed after 5-10 minutes. Plates with "ghosts” indicated intact lytA gene, or plates without “ghosts” indicated a disrupted lytA gene.
- the "ghost” phenomenon is due to detergent triggered autolysis of the cells, causing a gradual fading of the colonies.
- CEG proteins e.g., designated CFE proteins
- custom primers were used to insert N- and C- termini into vectors such that the 5' end (N-terminus of the CEG) is positioned properly for expression behind the T7 promoter and optimally placed with regard to the pET ribosome binding site.
- the pET vectors contain an Ndel site which allows positioning of ATG start site in the vector.
- blunt ligation of the ceg PCR fragment into the vector is accomplished via Klenow fill-in of the Ndel site.
- primers were also designed such that the ceg 3' (C-terminus of the expressed protein) will contain an in-frame extension of 6X-histidine residues, encoded in the vector sequence of pET-21.
- cegs were PCR amplified via custom designed primers as described above. Both ceg PCR and vector DNA were digested with appropriate restriction enzymes. The foil-length ceg were ligated into the pET expression vector. The ligation mixture was transformed into competant E. coli BL21 ⁇ DE3 cells and selected for transformants on LB agar with 50 ⁇ g/ml ampicillin. Positive insert bearing clones were screened via minipreps of the plasmids and size analysis on 0.8% agarose gels, with detection by ethidium bromide staining, as above.
- the protein is overproduced and purified, via the following method.
- a large scale (500- 1000ml) culture of E. coli is grown to early logarithmic phase in broth (e.g., LB broth) and protein expression induced for 2 hours with IPTG (isopropyl-D-thiogalactoside).
- the cells are harvested by centrifugation (8000 X G; 15 minutes) and the cell pellets resuspended in 20 ml. of buffer.
- the cells are lysed by sonication, and the supernatant fluid centrifuged at low speed (5000 X G, 15 min.) to remove unbroken cells.
- the supernatant fluid, containing the over-expressed protein is subjected to Ni- NTA affinity column chromatography (Quiagen, Inc., Chatsworth, CA).
- Ni- NTA affinity column chromatography Quiagen, Inc., Chatsworth, CA.
- the 6X-histidine residues linked at the C-terminal end of the CEG proteins permit rapid protein purification via selective binding to a Ni-NTA resin column.
- the protein-bound Ni-NTA resin was to remove contaminants, and the bound proteins subsequently eluted with imidazole and recovered. It is possible to upscale this procedure to larger volumes for higher yields of proteins.
- the following provides a description of the methods used to purify all 2CEG polypeptides (e.g., 2CFE polypeptides #19-117; SEQ ID NOS:349-436) having a histidine tag at their C-terminal ends.
- the 2CEG polypeptides having the his-tags were produced by the methods described in Example 5, supra. As an example, results of purification of 2CFE 75 polypeptide are presented. Production Of The CFE Polypeptides
- the BL21 ⁇ DE3 cells harboring recombinant pET-21 vectors carrying a 2CFE nucleotide sequence were cultured in LB broth containing ampicillin. When the A 6 oo reached approximately 0.6, protein production was induced by adding 1.0 mM of IPTG, the cells were cultured for an additional 2 hours. The cell pellet was collected by centrifugation, and the collected cell pellet was sonicated in Solution A (50 mM NaPO 4 ; 300 mM NaCl, pH 8.0). The sonicated cells were centrifuged at 10,000 RPM to remove the debris.
- Solution A 50 mM NaPO 4 ; 300 mM NaCl, pH 8.0
- the supernatant was diluted with Solution A, loaded onto a Ni-NTA column (Quiagen) equilibrated with Solution A; the column bed size was 2.5 x 25 cm, and the flow rate was approximately 3.0 ml/minute.
- the 2CFE protein was eluted using a linear gradient of imidazole, using 0-250 mM in 450 ml, flow rate approximately 3.0 ml/minute.
- the eluted samples were collected as 22 ml fractions per tube and the eluted samples were monitored using spectrophotometry. The amount of protein in the eluted fractions was estimated using the Bradford method (Bradford, M. M., 1976 Anal. Biochem.
- the 2CFE 75 polypeptide a precipitate formed and was redissolved upon increasing the sample volume and removing the imidazole by repeated concentration in 50 mM Tris,
- Varying amounts of the 2CFE 75 polypeptide were diluted in either 20 mM Tris, 20 mM KCl, pH 7.5 or 20 mM Tris, 20 mM MgCl 2 , pH 7.5 at
- CEG polypeptides that lack a histidine tag (e.g., 2CFE polypeptides #1-17; SEQ ID NOS:332-348).
- a histidine tag e.g., 2CFE polypeptides #1-17; SEQ ID NOS:332-348.
- the 2CFE 3 polypeptide was produced using the large scale IPTG-induced method described in Example 5, supra.
- the 2CFE 3 (SEQ ID NO:334) polypeptide lacks a C- terminal histidine tag.
- the 2CFE 3 polypeptide was purified using a 2-column procedure.
- the 2CFE 3 polypeptide preparation was eluted from a 26/10 Q Sepharose column (Pharmacia) using a 0-1.0 M NaCl gradient, 2 ml/minute flow rate, and the gradient size was 1 liter.
- the following provides a description of the size exclusion chromatography methods used to estimate the molecular weight and determine whether the CEG polypeptides oligomerize.
- the CFE polypeptide may olimerize to form monomers, dimers, tetramers, hexameric rings, or other oligomeric forms.
- Size exclusion chromatography was performed on all isolated 2CFE polypeptides #s 1- 117 (e.g., SEQ ID NOS:332-436). This method was performed using various types of columns, depending on the particular 2CFE polypepeptide tested.
- the Biosil SEC- 125 HPLC Gel Filtration column (BioRad Laboratories, Inc) was used, for example, to characterize CFE 8.
- the mobile phase was 0.2 M KH 2 PO 4 , 0.9% NaCl pH 6.8.
- the Phenomenex 600 x 7.5 mm Biosep SECS 3000 column was used, for example to characterize 2CFE 21 and 39.
- the mobile phase for size exclusion was 50 mM Na 2 HPO , pH 7.0 and 150 mM NaCl run at 1 ml/minute in a Gilson HPLC system, with protein detection at 280 nm.
- The* putative function of the CFE polypeptides were determined using computer-aided bioinformatic approaches, including distant homologies, motif searching, or predictions based on statistical rules.
- the distant homology approach involved pairwise or multiple sequence alignments, employing tools such as FASTA, and Psi-BLAST.
- the motif searching approach involved using sophisticated hidden Markov models.
- the approach based upon predictions of statistical rules involved prediction of transmembrane regions, coiled-coil, and other structural motifs.
- amino acid sequences of the CFEs were analyzed by performing protein threading analyses using the ProCeryon fold recognition program (Sippl, et al, 1992 Proteins 13:258-271; Sippl, J. 1993 J. Comp. Aided Mol. Design 7:473-501; www.proceryon.com) and Geneformatics.
- a Protein Threading (e.g., fold recognition) method was used to predict similarities in the folded protein structure of CFE polypeptides in the absence of a high level of sequence similarity with proteins in the databases (review by Teichmann, et al., 1999 Current Opinion in Structural Biology 9:390-399).
- the Protein Threading method predicts the compatibility of a query sequence (e.g., CFE polypeptide sequences) with each of the folds in a library of known protein structures.
- the library of known protein structures as developed, maintained, and updated throughout the search process.
- the fold assignments for each query were used to generate pairwise sequence alignments.
- the pairwise sequence alignments were used to generate protein models of the query polypeptide (e.g., CFE polypeptides).
- the pairwise sequence alignments were also used to compare the position of critical residues of the structural template with the query polypeptide.
- the list of critical residues was generated by using multiple sequence alignments derived from a structural classification of proteins to generate a conservation profile which provided sequence- specific positions conserved across a homologous family of protein folds. Comparative modeling was used to search the model of the query polypeptide for the critical residues and determine whether the structural and functional motifs are conserved in the query protein. Conservation of structural and functional motifs permitted assignment of putative structure and function to a query polypeptide sequence.
- the Protein Threading method was used to search for putative folded structure and function for all CFE polypeptides (SEQ ID NOS: 114-226).
- the CFE polypeptides having significant sequence identity (e.g., more than 30%) to known proteins were assigned putative functions with a high level of confidence.
- CFE 101 polypeptide mediates the conversion of pantothenate to 4' phosphophantothenate, and is predicted to be a pantothenate kinase.
- CFE 101 may be a pantothenate kinase, which mediates the conversion of pantothenate to 4' phosphophantothenate ( Figure 5).
- Circular dichroism and circular dichroism melt methods were used to determine the folded structure of the expressed and isolated 2CFE polypeptides. For example, this method was used to characterize the folded structure of isolated 2CFE 101 (SEQ ID NO.421).
- the starting concentration of the 2CFE 101 polypeptide was such that OD 05 was approximately 1.5, and the OD 280 was approximately 0.05 (e.g., 0.05 to 0.1 mg/ml).
- the starting concentration of 2CFE 101 was approximately 344 ⁇ M in 50% glycerol, 50 mM Tris, 100 mM NaCl, 5 mM MgCl 2 , 0.5. mM EDTA, at pH 7.5.
- the polypeptide was diluted to a final concentration of 7 ⁇ M, as determined by absorbance at A 280 , in 20 mM Na-phosphate, 100 mM KCl, at pH 7.0.
- the circular dichroism analysis was performed using quartz cuvettes, the instrumentation was from JASCO (Model J-720), the readings were performed at 25 degrees C ( Figure 6 A). The band width was 1 nm, the sensitivity was 20 mdeg, the response was 0.25 seconds, the scan speed was 50 nm minute, and the step was 0.5.
- the circular dichroism thermal melt analysis was performed at a range of between 0 and 100 degrees C ( Figure 6 B). Additionally, the circular dichroism was performed comparing monomer and aggregate pools of 2CFE 101. Size Exclusion Analyses
- biochemical assays of the 2CFE 101 polypeptide was based on the PK/LDH coupled enzyme assays described by Vallari, D. S., et al. (1987 J. Biol. Chem. 262:2468-2471) and Song, W. -J., et al., (1994 J. Biol Chem. 269:27051-27058).
- the assay was performed as follows. The reaction included: 885 ⁇ l of 0.1 M Tris-HCl (pH 7.6), 25 ⁇ l NADH (14.1 mM), 20 ⁇ l ATP (10.7 mM), 50 ⁇ l phospho-enol- pyruvate (56 mM), 5 ⁇ l LDH/PK (lactose dehydrogenase/PK; Sigma, catalog # P-0294,
- The. monomer form has a specific activity of approximately 1.7 ⁇ M min "1 mg "1 .
- the oligomeric form has a specific activity of 0.26 ⁇ M min "1 mg “1 .
- the 2CFE 101 polypeptide can be tested in an assay that monitors the conversion of pantothenate to 4'-phosphopantothenate.
- the same reaction described above can be used, except C-labeled pantothenate is used.
- the . reaction can be monitored by measuring the amount of 14 C-labeled 4'-phosphopantothanate produced.
- the following provides a description of the methods used to characterize purified, CFE 39 and CFE 21 polypeptides, carrying a C-terminal histidine 6-tag.
- the methods include helicase reactions, in which synthetic HoUiday Junction templates are resolved into duplex structures.
- helicase reaction was monitored using radiolabeled templates.
- the helicase assay was adapted for use in a high throughput assay employing fluorescence labeled templates.
- the HoUiday Junction analysis was performed using radiolabeled, synthetic, asymmetrical, HoUiday Junction templates, as described in Hiom, K. and S. C. West 1995 Cell 80:787-793.
- the HoUiday Junction templates were produced by annealing together four separate, single-stranded, oligonucleotide strands to form four-stranded structures (e.g., the HoUiday Junction template).
- the HoUiday Junction templates were reacted with the 2CFE 39 and 2CFE 21 polypeptides, in a helicase reaction, to test their ability to generate two duplex structures.
- asymmetrical HoUiday Junction templates were produced by annealing the following oligonucleotide sequences:
- Oligonucleotide strand 1 Oligonucleotide strand 1 :
- Oligonucleotide strand 2 Oligonucleotide strand 2:
- Oligonucleotide strand 3 5 '-AACGTCATAGATGAACGGACAGATCATGGTGCTTTTAAAGTCTAGAGAC TATCGAGCATTAGTACCAGTATCGAATCCGTCTTGTCAA-3' (SEQ ID NO:440) Oligonucleotide strand 4:
- Oligonucleotide strand 3 was labeled at the 5' end using approximately 300 ng of oligonucleotide strand 3, 1 ⁇ l lOx Phosphate Buffer, 5 ⁇ l 32 P ATP, 1 ⁇ l T4 polynuclotide kinase (Gibco-BRL)), in a 10 ⁇ l volume, and the reaction was performed at 37 degrees C for 30 minutes. The reaction was loaded onto a G50 column to remove the unincorporated radiolabel. The final concentration of the radiolabeled oligonucleotide strand 3 was approximately 15 ng per ⁇ l.
- oligonucleotide strands were annealed (e.g., hybridized).
- the annealing reaction included: 5 ⁇ l Annealing Buffer (200 mM Tris-Cl pH 8.0, 100 mM MgCl 2 , 1 M NaCl, 10 mM DTT); 450 ng of radiolabeled oligonucleotide strand 3; and 1000 ng each of oligonucleotide strands 1, 2, and 4; in 50 ⁇ l total reaction volume.
- the control annealing reaction included: 5 ⁇ l Annealing Buffer, 60 ng radiolabeled oligonucleotide strand 3; 1000 ng oligonucleotide strand 4; in 50 ⁇ l total reaction volume. Annealing was performed at 95 degrees C for 5 minutes, 65 degrees C for 30 minutes, 42 degrees C for 30 minutes, and room temperature (e.g., between about 23 to 27 degrees C) for 30 minutes to generate the synthetic HoUiday Junction templates.
- the synthetic HoUiday Junction templates were gel or column- purified to remove the duplex and non-annealed products.
- oligonucleotide strands 3 and 4 were annealed to form duplex structures.
- the synthetic HoUiday Junction templates and duplex structures were stored at -20 degrees C.
- the helicase reaction was performed to determine whether 2CFE 39 and 2CFE 21 resolved the synthetic HoUiday Junction templates into duplex structures.
- the helicase reaction was performed as follows. A 50 ⁇ l total reaction volume included: 25 ⁇ l of 2x Reaction Buffer (50 mM Tris-Cl pH8.0, 30 mM MgCl 2 , 2 mM ATP); 1 ⁇ l synthetic HoUiday Junction template (36 ng); 2 ⁇ l 2CFE 39 (1 ⁇ M); and 2 ⁇ l 2CFE 21 (1 ⁇ M). The reaction was incubated at 37 degrees for 30 minutes.
- the reaction was stopped by adding 5 ⁇ l Stop Buffer (100 mM Tris-Cl pH 7.5, 5 mg/ml Proteinase-K, 5% SDS). The stopped reaction was returned to 37 degrees C for 5 minutes. The helicase reaction was loaded onto and run on a non-denaturing, 12% PAGE, Tris-glycine gel.
- Stop Buffer 100 mM Tris-Cl pH 7.5, 5 mg/ml Proteinase-K, 5% SDS.
- E. coli RuvA binds to HoUiday Junction templates (Parsons, C. A., et al., 1992 Proc. Natl, Acad. Sci. USA 89:5452-5456).
- the ability of 5. pneumoniae CFE 39 to bind to a HoUiday Junction template can be tested by employing the helicase assay described herein. The results of the helicase assay can be monitored by performing a gel shift assay and/or capillary electrophoresis. The presence of a HoUiday Junction template bound to 2CFE 39, which migrates more slowly than the HoUiday Junction template alone, would indicate that S. pneumoniae 2CFE 39 binds to HoUiday Junction templates.
- the helicase reaction described herein was performed using HoUiday Junction templates having one oligonucleotide strand labeled with a fluorescent agent and another strand labeled with a quenching agent.
- the 5' fluorescent end and the 3' quenching end of the strands that make up the HoUiday Junction templates are in proximity to each other, resulting in a non-fluorescent template.
- the fluorescent and quench ends are not in proximity to each other, resulting in fluorescence.
- the HoUiday Junction templates used to perform this experiment comprised the following: the 5' end of oligonucleotide strand 1 was labeled with a fluorescein (e.g., the fluorescent agent), and the 3 ' end of oligonucleotide strand 4 was labeled with DABC YL (e.g., the quenching agent).
- a fluorescein e.g., the fluorescent agent
- DABC YL e.g., the quenching agent
- the oligonucleotide strand 1 labeled with fluorescein and the oligonucleotide strand 4 labeled with DABCYL were custom synthesized (Gibco-BRL Life Technologies, Inc.).
- the fluorescein and DABCYL labled oligonucleotides were annealed in a reaction, as described above, to generate synthetic HoUiday Junction templates.
- the helicase reaction was performed as described above.
- the results of the helicase reaction were monitored by measuring the unquenching of the HoUiday Junction templates with time ( Figure 11).
- the helicase assay using HoUiday Junction templates labeled with fluorescent-quenching agents can be adapted for use in high throughput analyses to test 2CFE 39, 2CFE 21, and other polypeptides for their ability to resolve the templates into duplex structures.
- the following provides a description of the methods used to characterize purified, CFE 8 polypeptide, which lacks a histidine tag.
- the CFE 8 is a putative DNA single-stranded binding protein.
- CFE 8 polypeptide SEQ ID NO: 121
- SSB single stand binding protein homologue
- the 2CFE 8 polypeptide (SEQ ID NO:339) was characterized by size exclusion chromatography, using the Biosil SEC- 125 HPLC Gel Filtration column as described in Example 8 supra. The chromatogram showed one peak corresponding to a molecular weight of approximately 89 kDa. Based on the nucleotide sequence, the predicted molecular weight of 2CFE 8 is 17,351 Da. In non-denaturing conditions, 2CFE 8 forms a multimer.
- the 2CFE 8 polypeptide was reacted with a single-stranded oligonucleotide A. Briefly, the binding reaction included: 50 ⁇ M of 2CFE 8 polypeptide, 50 ⁇ M oligo strand A, 20 mM Tris/20 mM KCl pH 7.5. The binding reaction was performed at 37 degrees C, for 2 hours.
- Oligonucleotide strand A 5'-TTAGGGCCCGGGCTATCTTACAATCTCGTT-3' (SEQ ID NO:442)
- Separation was performed using an uncoated capillary tube (360 ⁇ m o.d., 50 ⁇ m i.d., with a 50 cm effective separation length; Watrex International, Inc., Pittsford, NY) and 50 mM borate pH 9.3 as the mobile phase, at 25 kVolts, 20 minutes separation time.
- SPA Scintillation proximity, assay
- the binding reaction of the 2CFE 8 polypeptide and the oligonucleotide strand A can be monitored using SPA beads and a scintillation counter.
- the beads can be coated with avidin, the 2CFE 8 polypeptide can be tagged with biotin, and the oligonucleotide strand A can be radiolabeled.
- the 2CFE 3 polypeptide catalyzes the conversion of D-glucosamine-6-phosphate to D- glucosamine- 1 -phosphate, indicating that 2CFE 3 mediates amino-sugar biosynthesis through the N-acetyl glucosamine pathway ( Figure 14).
- the 2CFE 86 polypeptide catalyzes the conversion of D-glucosamine-1 -phosphate to N- acetylglucosamine-1 -phosphate, and the conversion of N-acetylglucosamine-1 -phosphate to UDP-N-acetylglucosamine-1 -phosphate, which indicates that 2CFE 86 also mediates amino-sugar biosynthesis through the N-acetyl glucosamine pathway ( Figure 14).
- the 2CFE 3 polypeptide was produced using the large scale IPTG-induced method described in Example 5, supra.
- the 2CFE 3 polypeptide lacks a C-terminal histidine tag.
- the 2CFE 3 polypeptide was purified using a 2-column procedure.
- the 2CFE 3 polypeptide preparation was eluted from a 26/10 Q Sepharose column (Pharmacia) using a 0-1.0 M NaCl gradient, 2 ml/minute flow rate, and the gradient size was 1 liter.
- Affinity capillary electrophoresis methods were used to determine whether the 2CFE 3 polypeptide binds to various glucose derivatives. Binding was performed under equilibrium conditions, in which the sugars were dissolved in the running buffer and reacts with 2CFE 3 during separation in the column.
- the affinity capillary electrophoresis method used to analyze 2CFE 3 follows the methods described in "Handbook of Capillary Electrophoresis" 2 nd Edition, 1997, ed. J. Landers.
- 2CFE 3 polypeptide was reacted with increasing amounts of various glucose derivatives (e.g., substrate) at 25, 30 and 37 degrees C.
- the glucose derivatives included UDP-glucose, glucose- 1 -phosphate, glucose-6-phosphate, glucosamine- 1 -phosphate, and glucosamine-6-phosphate.
- the reaction included: 2CFE 3 polypeptide (2.0 mg/ml), separation buffer (25 mM Tris; 192 mM Glycine, pH 8.0; BupH Tris-Glycine Buffer Packs, Pierce). Separation was performed at 25 kVolts, separation time was 15 or 20 minutes.
- CFE 3 Capillary Electrophoresis and Laser-Induced Fluorescence
- capillary electrophoresis was performed with laser-induced fluorescence in order to separate and detect interaction between the substrate (e.g., D-glucosamine-6-phosphate) and the product (e.g., D-glucosamine- 1- phosphate) in a one dose, one time-point procedure.
- substrate e.g., D-glucosamine-6-phosphate
- product e.g., D-glucosamine- 1- phosphate
- the 2CFE 3 polypeptide was derivitized by reacting 10 mM FITC (fluorescein isothiocyanate dissolved in methanol; Calbiochem, San Diego, CA) with D-glucosamine- 6-phosphate, at ambient temperature, in the dark, overnight.
- the FITC-derivatized 2CFE 3 polypeptide 2.0 mg/ml was reacted with the substrate (D-glucosamine-6-phosphate and D-glucosamine- 1 -phosphate) for one hour.
- Separation was performed using an uncoated capillary (360 ⁇ m o.d., 50 ⁇ m i.d., with a 50 cm effective separation length) and 50 mM borate (pH 9.3) as the mobile phase.
- the argon-ion laser had an excitation wavelength of 488 nm and an emission filter of 520 nm (Beckman, Fullerton, CA).
- the results shown in Figure 16 indicate that 2CFE 3 binds and catalyzes the conversion of D-glucosamine-6-phosphate to D-glucosamine- 1- phosphate.
- the CFE 86 polypeptide (SEQ ID NO: 195) is an acetyltransferase, such as GlmU which is a bifunctional enzyme in E. coli. It has been previously shown that, in E coli, GlmU is a bifunctional protein having both the acetyltransferase and uridylyltransferase active sites (Mengin-Lecreulx, D. and J. van Heijennort 1994 J. Bacteriol. 176:5788-5795; Gehring, Al., et al, 1996 Biochemistry 35:579-585).
- the bifunctional enzyme catalyzes the conversion of D-glucosamine- 1 -phosphate to N-acetylglucosamine-1 -phosphate (acetyltransferase), and catalyzes the conversion of N-acetylglucosamine-1-phosphate to UDP-N-acetylglucosmine-1 -phosphate (uridylyltransferase).
- acetyltransferase The Km of the acetyltransferase and uridylyltransferase reactions has been previously calculated (Mengin-Lecreulx, D. and J. van Heijennort 1994 supra ). Additionally, the crystal structure of GlmU from E. coli is known (Brown, K., et al., 1999 EMBO J. 18:4096- 4107).
- the 2CFE 86 polypeptide (SEQ ID NO:409) has a C-terminal histidine tag.
- the 2CFE 86 polypeptide was produced using the large scale IPTG-induced method described in Example 5, supra.
- the 2CFE 86 polypeptide was purified using the Ni-NTA affinity column method described in Example 6, supra.
- the eluted 2CFE 86 polypeptide was dialyzed against 50 mM Tris-Cl, 100 mM NaCl, 25% glycerol, pH 8.0. Samples of the purified 2CFE 86 polypeptide were electrophoresed on a polyacrylamide gel ( Figure 17). Coupling CFE 3 and CFE 86 to Produce UDPAG
- a biochemical assay was performed, to determine whether 2CFE 3 and 2CFE 86 convert D-glucosamine-6-phosphate to UDP-N-acetylglucosamine-1 -phosphate (e.g., UDPAG).
- the 2CFE 3 and 2CFE 86 polypeptides were used in a coupled reaction based on the assays described in Jolly, L. P., et al, 1999 Eur. J. Biochem. 262:202-210.
- a time-dependent and dose-dependent assay were performed. Briefly, the assay was performed in 96-well plates, each well including 100 ⁇ l volume. The assay included: 1 mM D-glucosamine-6-phosphate (Sigma); 0.7 mM D-glucosamine- 1,6-diphosphate (Sigma); 1.2 mM acetyl-Coenzyme A (Sigma); and 5 mM uridine-5' -phosphate (Sigma); 3 mM MgCl (Sigma); 50 mM Tris-Cl, pH 8.0 (Life Technologies). The reaction was started by adding 1 ⁇ g of 2CFE 3; and 10 ⁇ g of 2CFE 86. The reaction was performed at room temperature. The reaction was stopped at 0, 15, 30, and 65 minutes, by filtering out the 2CFE polypeptides.
- 1 mM D-glucosamine-6-phosphate Sigma
- 0.7 mM D-glucosamine- 1,6-diphosphate Sigma
- the results of the assay was monitored by HPLC (high pressure liquid chromatography) using an Optisil lO ⁇ SAX column (250 x 4.6 mm), measuring at 262 nm, the mobile phase was 150 mM KH PO 4 (pH 3.5), and 1.5 ml/minute flow rate.
- HPLC high pressure liquid chromatography
- Optisil lO ⁇ SAX column 250 x 4.6 mm
- the mobile phase was 150 mM KH PO 4 (pH 3.5)
- 1.5 ml/minute flow rate The results shown in Figure 18 show the time-dependent assay and indicate that HPLC detected the presence of UDPAG.
- the 2CFE 86 polypeptide was tested in a uridylyltransferase reaction, in which N-acetyl- D-glucosamine-1-phosphate and UTP produce UDP-N-acetylglucosamine.
- the uridylyltransferase reaction was monitored using a malachite green/inorganic pyrophosphatase assay (e.g., malachite green-IPPAse assay) and/or monitored using HPLC.
- the malachite green-IPPAse assay was used to measure orthophosphate production from digestion of the pyrophosphate liberated in the uridylyltransferase reaction.
- the malachite green reagent was prepared as follows.
- 0.045 % solution of malachite green (Sigma; M9636) was prepared in water.
- a 4.2 % solution of ammonium molybdate (Mallinckrodt) was prepared in 4N HCl.
- the malachite green and ammonium molybdate were mixed in a 3 : 1 ratio, and stirred for about 20 minutes. The mixture was filtered, and stored at 4 degrees C.
- the inorganic pyrophosphatase (Sigma; 1-2267) was diluted to 0.1 U/ ⁇ l in 50 mM Tris/3mM MgCl 2 ph 8.0, and stored at 4 degrees C.
- the uridylyltransferase reaction was performed in 96-well plates, The coupled reaction described herein was performed, in the presence of 2CFE 3 alone or 2CFE 3 and 2CFE 86, and included the addition of 0.5 U/well of the diluted inorganic pyrophosphate. The reaction was mixed for 5 minutes at room temperature. The reaction was stopped by the addition of 240 ⁇ l/well of the malachite green reagent and 30 ⁇ l/well of 34% sodium citrate, and the reaction was mixed. The results of the uridylyltransferase reaction was monitored by spectrophotometry at 660 nm.
- the results of separate uridylyltransferase reactions were monitored by HPLC, using a Phenosphere-NEXT C18 column (250 x 4.6 mm).
- the mobile phases included A and B as follows: A) methanol/10 mM potassium phosphate pH 6.5 (0:100); and B) methanol/10 mM potassium phosphate pH 6.5 (40:60).
- the mobile phases were run under the following conditions: 100% mobile phase A for 5 minutes, to 100% mobile phase B in 3 minutes; and hold 100% mobile phase B for 9 minutes.
- the retention time for the UDPAG product is approximately 5.75 to 6.0 minutes.
- CFE 34 SEQ ID NO: 143
- CFE 35 SEQ ID NO: 144
- 90 SEQ ID NO: 199
- CFE 34 is a malonyl CoA:ACP transcylase, which catalyzes the reaction in which malonyl CoA and acyl carrier protein (ACP) are converted to malonyl- ACP and CoA.
- ACP acyl carrier protein
- the CFE 34 polypeptide may be a homologue of E. coli FabD.
- CFE 90 is a 3-oxoacyl-ACP synthase II (beta ketoacyl- ACP synthase II) which catalyzes the reaction in which malonyl-ACP is converted to beta aceto acetyl-ACP.
- the CFE 90 polypeptide may be a homologue of E. coli FabF.
- CFE 35 is a 3-oxoacyl-ACP reductase (beta aceto acetyl ACP reductase) which catalyzes the reaction in which beta-keto ' -acetyl-ACP is converted to beta-hydroxy-acetyl-ACP.
- the CFE 35 polypeptide may be a homologue of E. coli FabG.
- the estimated molecular' weights of 2CFE 34 (SEQ ID NO:361), 2CFE 35 (SEQ ID NO-.362), and 2CFE 90 (SEQ ID NO:413) were determined using the Biosil SEC-125 HPLC Gel Filtration column as described in Example 8, supra.
- the function of 2CFE 34 was determined by performing various biochemical reactions. To determine whether 2CFE 34 catalyzes the convertion of malonyl-CoA to malonyl and CoA, the following reaction was performed.
- the biochemical reaction was performed in the presence of acyl carrier protein.
- the reaction included the following: 10 ⁇ M 14 C labeled malonyl-CoA, 20 ⁇ M ACP, 30 ⁇ M 2CFE 34 (e.g., FabD) in 20 mM Tris-Cl, pH 8.0 and 5 mM DTT in 300 ⁇ l volume.
- the reaction was performed at room temperature (e.g., approximately 24 degrees C) for 30 minutes.
- the reaction was terminated with the addition of 45 ⁇ l of 0.5% TFA.
- the labeled reaction was injected onto a MonoQ 5/5 column on a Gilson HPLC. Detection was performed by monitoring the radioactivity of the continuous flow-through of the HPLC effluent.
- Buffer A included 20 mM Tris-Cl, pH 8.3.
- Buffer B was the same as Buffer A and included 1 M NaCl. The program was held at 90% A, 10% B for 10 minutes followed by a linear ramp to a final mix of 50% of each Buffer A and B over 10 minutes.
- the substrate e.g., 14 C malonyl-CoA
- the product e.g., 14 C malonyl-ACP
- CFE 34 catalyzes the conversion of malonyl-CoA and acyl carrier protein (ACP) to malonyl-ACP and CoA.
- CFE 40 polypeptide SEQ ID NO: 149
- HMP-P phosphomethylpyrimidine
- CFE 41 polypeptide (SEQ ID NO: 150) has a GTP-binding motif and may be a protease.
- Example 5 The large-scale method described in Example 5 supra (e.g., IPTG-induced protein production) was used to prepare a sample of 2CFE 41 polypeptide (SEQ ID NO:368).
- the sample was affinity purified using the Ni-NTA method described in Example 6, supra.
- the eluted fractions were loaded onto and run on a 12% SDS-PAGE gel (Novex) ( Figure 21).
- Circular dichroism and circular dichroism thermal melt methods were performed using JASCO instrumentation.
- the concentration of the isolated 2CFE 40 (SEQ ID NO:367) was approximately 21 ⁇ M, in a 0.1 cm pathlength cell at 210 nm.
- the circular dichroism spectrum suggests that this preparation of 2CFE 40 had mixed alpha and beta secondary structure.
- the circular dichroism thermal melt spectrum suggests that 2CFE 40 has a T m of approximately 67 degrees C.
- the 2CFE 40 polypeptide precipitates at approximately the T m .
- the concentration of the isolated 2CFE 41 (SEQ ID NO:368) was approximately 70 ⁇ M, in a 0.02 cm pathlength cell.
- the circular dichroism spectrum suggests that this preparation of 2CFE 41 had mixed alpha and beta secondary structure, with a greater percentage of alpha structures.
- the circular dichroism thermal melt spectrum suggests that 2CFE 41 has a T m of approximately 38 degrees C.
- the concentration of the isolated 2CFE 46 was approximately 23 ⁇ M, in a 0.1 cm pathlength cell at 280 nm.
- the circular dichroism spectrum suggests that this preparation of 2CFE 46 had mixed alpha and beta secondary structure.
- the circular dichroism thermal melt spectrum suggests that 2CFE 46 is highly stable at elevated temperatures. At 90 degrees C, the 2CFE 46 polypeptide exhibited only a 27% loss in signal and the polypeptide remained soluble.
- CEG polypeptides e.g., CFE polypeptides
- Computer-aided compilation of bacterial metabolic pathways may be analyzed using
- the function of a CFE polypeptide may be identified by identifying a ligand or substrate which binds with the CFE polypeptide.
- the ligand or substrate may be identified using fractionation and affinity capillary electrophoresis methods. The following method is based upon the assumption that the bacterial cell lysate includes the ligand or substrate.
- a bacterial host cells carrying an endogenous (e.g. native) CFE gene or carrying a recombinant vector which includes a CFE gene may be cultured so that the CFE polypeptide is produced by the cell.
- the cells may be ruptured in order to obtain the cell lysate.
- the cell lysate may be fractionated using HPLC technology.
- the HPLC fractions may be reacted with a CFE polypeptide in a binding reaction, and the binding reaction may be analyzed by affinity capillary electrophoresis methods.
- the ligand or substrate which reacts with the CFE polypeptide may be identified using mass spectrophotometry methods (in "Mass Spectrometry" 1990 eds. McCloskey, J.
- NMR nuclear magnetic resonance
- High resolution NMR spectroscopy was applied to 15 N-labled, 13 C/ 15 N-labeled, 2 H/ 13 C/ 15 N-labeled, and type-specifically isotopically labeled CFE polypeptide samples in the solution state for the following purposes: to assess various aspects of the structural state, e.g., foldedness, structural integrity; to refine a previously determined experimental structure of a close sequence homologue; to refine a homology-modeled structure; to assess the potential for a CFE polypeptide to bind small molecules; and to identify small- molecule pharmacophoric fragments that bind specifically to the CFE polypeptide ('.'Nuclear Magnetic Resonance" 1994 eds. James, T. L. in Methods in Enzymology volume 239).
- the NMR analysis includes screening both a compound deck of approximately 4,500 commercially available, structurally and chemically diverse compounds (the small- molecule pharmacophore deck) and a compound deck of proprietary, known, antimicrobial compounds (anti-microbial deck) against the CFE polypeptides (i.e., target polypeptides) to determine, either based upon perturbations to the chemical shifts of the amide proton and/or nitrogen resonances, as measured from a two-dimensional proton- nitrogen heteronuclear single-quantum correlation spectrum (2D screening method), or based upon increases in the linewidth of the compound's proton resonance(s), as measured by a one-dimensional T lp spin-lock difference spectrum (ID screening method), both whether a compound binds to a CFE polypeptide and, in the case of the 2D screening method, where the compound binds on the CFE polypeptide.
- 2D screening method two-dimensional proton- nitrogen heteronuclear single-quantum correlation spectrum
- BL21-DE3 E. coli bacteria are transformed with the CFE expression vectors. Expression takes place between 20°C and 37°C in minimal media containing [ 15 N] -ammonium sulfate as the sole nitrogen source and either glucose, [ 2 H] 13 -glucose, or [ 13 C] 6 -glucose as the sole carbon source.
- Glucose is used for preparing uniformly 15 N-labeled and 2 H/ 15 N- labeled CFE polypeptides.
- [ 2 H] 13 -glucose is used for preparing type-specifically 'H/ ⁇ C- labeled, uniformly 15 N-labeled CFE polypeptides.
- [ 13 C] 6 -glucose is used for preparing 13 C/ 15 N-labeled CFE polypeptides.
- the minimal media is prepared in 100% H 2 O for expressing both uniformly 15 N-labeled and uniformly 13 C/ 15 N-labeled CFE polypeptides; the minimal media is prepared in 95% D 2 O (deuterium oxide) and 5% H 2 O for expressing both type-specifically ⁇ C-labeled, uniformly 15 N-labeled and just uniformly 2 H/ 15 N- labeled CFE polypeptides.
- Compounds in the anti-microbial deck are pre-dissolved to a target concentration of 16 mM in deuterated DMSO (dimethylsulfoxide) with each deck well containing only one compound.
- Compounds in the small-molecule, pharmacophore deck are pre-dissolved in deuterated dmso to a target concentration of 50 mM in groups of 8, i.e., each deck well contains 8 unique compounds with each compound at a target concentration of 50 mM.
- each compound screening well contains solution from only one deck well. 166.5 ⁇ l of buffer is added to each compound screening well. 170 ⁇ l of a CFE polypeptide solution, initially at a concentration ranging from 200-300 ⁇ M, is added to each compound screening well; the contents of that well are then thoroughly mixed.
- the control screening well contains only 3.5 ⁇ l of deuterated dmso. The screening plate is then centrifoged in a bucket rotor for 15 minutes at 3,500 rpm to insure that all particulate matter is at the bottom of the well.
- the 2D screening method requires a single control screening well in which the compound solution consists only of deuterated DMSO.
- the ID screening method requires a control screening well for each compound screening well.
- the control screening well is prepared identically to the compound screening well except that the 170 ⁇ l of a CFE polypeptide solution is replaced by 170 ⁇ l of buffer.
- the screening plate is covered with aluminum foil and placed onto a rack of a Gilson liquid handler.
- the Gilson liquid handler under computer control by the NMR host/data- acquisition software, is responsible for removing each sample from the screening plate, injecting the sample into a high-resolution, 1H/ 15 N double-resonance NMR flow-probe, removing the sample from the flow-probe, and dispensing it back into the screening plate well from which the sample was originally removed.
- NMR data are collected on the sample while the sample resides in the NMR flow-probe. The type of NMR data collected depends upon whether the 2D or ID screening method is being used.
- NMR nuclear magnetic resonance
- the protein secondary structure was delineated as either helical, turn or extended (e.g., ⁇ -sheet) by measuring ⁇ ( ⁇ c ⁇ - ⁇ cp), ⁇ C, and ⁇ ⁇ where ⁇ refers to the chemical-shift value and ⁇ refers to the difference between chemical-shift values measured in this protein and those measured for the same residue type in a random-coil (unstructured), tetrameric peptide.
- This secondary-structure profile was generated in approximately 2-3 weeks per protein.
- the secondary-structure profile was used to confirm the functional identity of a protein. It was also used to refine the list of possible functional identities of folds, predicted by various computational techniques including fold recognition which is associated with a protein or polypeptide. NMR was used to generate folds of proteins or polypeptides for which both no structure was known of a sequence homologue and no structural homologue was discernible in the PDB by fold recognition techniques.
- the CFE 88 polypeptide was characterized by NMR analysis to establish its secondary structure.
- the NMR data was used to filter the computer-aided threading analysis.
- the NMR-determined secondary structure for CFE 88 suggested that CFE 88 is structurally similar to 4-aminoimidazole carboxylase.
- CFE polypeptides were analyzed by NMR methods.
- a computer-aided threading analysis revealed that the N-terminal domain of the protein EGA, which both binds and hydrolyzes GTP, was both structurally similar and sufficiently similar in sequence to CFE 52 to suggest that CFE 52 had a similar function.
- CFEs 2, 42, 43, 68 and 88 polypeptides were tested for their ability to bind potential inhibitor molecules by screening both the anti-microbial deck and the small-molecule, pharmacophore deck.
- CFE 34 was tested for its ability to bind potential inhibitor molecules by screening the anti-microbial deck. Characterizing Small-Molecule Binding
- NMR-based screening was used to measure binding against both the small-molecule, pharmacophore deck and the anti-microbial deck. Binding data from these screens allowed assessment of the propensity of a protein to bind small molecules. The binding data was also used to identify sites on the protein which are capable of binding small molecules. The binding data was also used to identify common pharmacophores among the compounds which bind.
- Reverse screening refers to a process whereby known anti-microbial compounds, the microbial target of which is unknown, are screened by a general method, e.g., binding as assessed by NMR, to find a physical interaction with polypeptide targets previously determined to be essential to the bacteria (i.e., the CFEs).
- the reverse screening method was used to determine which CFE polypeptides bind to which compounds in the anti- microbial deck.
- the reverse screening method included the following.
- the compounds in a proprietary compound deck were screened for Minimal Inhibitory Concentration (e.g., MIC).
- the compounds exhibiting antimicrobial activity were designated active compounds.
- the CFE polypeptides were screened to determine which polypeptide bind to which active compounds.
- the CFE polypeptides which bound to the active compound(s) were confirmed, where possible, i.e., in cases where an in-vitro assay was possible to construct, as being inhibited in their function as a polypeptide by the active compound(s) by examination of the inhibition profile of the compound(s) against the CFE polypeptides.
- the effect of the compound on the microorganism harboring the CFE polypeptide was monitored (e.g., whole cell assays).
- the structure of the active compound was used as a basis to generate chemically-related compounds by iterative synthesis.
- the chemically-related compounds were tested in a screening assay for binding with CFE polypeptides.
- the active compounds and the chemically-related compounds of interest were the compounds which exhibited an increase in binding affinity for a CFE polypeptide and/or exhibited drug-like properties.
- the results of the reverse screening are as follows. 127 compounds from the proprietary compound deck exhibited anti-microbial activity. 94 of these active compounds were selected based upon both lack of cytotoxicity and lack of excessive hydrophobicity. These 94 compounds were soluble to 16 mM in deuterated DMSO; these compounds were also deemed to be sufficiently soluble in aqueous buffer for both the 2D and ID NMR screening methods.
- This subset of 94 compounds was used in an NMR-based screen to determine which compound binds to which CFE polypeptide.
- the CFE 42 polypeptide bound two different compounds with Kd's in the range of 0.2 to 1 mM; the CFE 43 polypeptide bound one compound with Kd ⁇ 30-50 ⁇ M; the CFE 34 polypeptide bound 13 compounds, one of which inhibited the polypeptide function with IC 50 ⁇ 10 ⁇ M.
- the enzyme assay used to confirm the NMR results which suggested CFE 34 interaction with the compounds included the following: 10 ⁇ M 14 C-labeled malonyl CoA; 20 ⁇ M
- the reaction was performed at room temperature, the reaction was stopped with the addition of TFA. Two hundred ⁇ l of the reaction was injected onto a Mono Q 5/5 column.
- the chromatography conditions included: A) 20 mM Tris-Cl, pH 8.3; B) 20 mM Tris-Cl, pH 8.3, 1 M NaCl. Hold 10% B for 5 minutes, linear gradient from 10% B to 50%B in 10 minutes, back to 10% B in 1 minute, hold for 14 minutes to re-equilibrate.
- reaction substrate 14 C- malonyl CoA
- reaction product 14 C-malonyl ACP
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Engineering & Computer Science (AREA)
- Medicinal Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Gastroenterology & Hepatology (AREA)
- Analytical Chemistry (AREA)
- Biotechnology (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Toxicology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Pulmonology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
Description
Claims
Priority Applications (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AU43006/01A AU4300601A (en) | 1999-12-30 | 2000-12-29 | Novel bacterial genes and proteins that are essential for cell viability and their uses |
| EP00992297A EP1261630A2 (en) | 1999-12-30 | 2000-12-29 | Bacterial genes and proteins that are essential for cell viability and their uses |
| CA002396040A CA2396040A1 (en) | 1999-12-30 | 2000-12-29 | Novel bacterial genes and proteins that are essential for cell viability and their uses |
| IL14947200A IL149472A0 (en) | 1999-12-30 | 2000-12-29 | Nucleotide sequences and polypeptides encoded by the sequences that are essential for bacterial viability and methods for detecting and utilizing the same |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17408999P | 1999-12-30 | 1999-12-30 | |
| US60/174,089 | 1999-12-30 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2001049721A2 true WO2001049721A2 (en) | 2001-07-12 |
| WO2001049721A3 WO2001049721A3 (en) | 2002-09-12 |
Family
ID=22634783
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2000/035604 Ceased WO2001049721A2 (en) | 1999-12-30 | 2000-12-29 | Bacterial genes and proteins that are essential for cell viability and their uses |
Country Status (5)
| Country | Link |
|---|---|
| EP (1) | EP1261630A2 (en) |
| AU (1) | AU4300601A (en) |
| CA (1) | CA2396040A1 (en) |
| IL (1) | IL149472A0 (en) |
| WO (1) | WO2001049721A2 (en) |
Cited By (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2001079476A1 (en) * | 2000-04-18 | 2001-10-25 | Institut National De La Recherche Agronomique (Inra) | Lactic acid bacteria overproducing exopolysaccharides |
| WO2002016601A3 (en) * | 2000-08-24 | 2003-01-23 | Omnigene Bioproducts Inc | Microorganisms and assays for the identification of antibiotics |
| US6720139B1 (en) | 1999-01-27 | 2004-04-13 | Elitra Pharmaceuticals, Inc. | Genes identified as required for proliferation in Escherichia coli |
| US6821746B2 (en) | 2000-10-06 | 2004-11-23 | Affinium Pharmaceuticals, Inc. | Methods of screening for FabK antagonists and agonists |
| US6951729B1 (en) | 1999-10-27 | 2005-10-04 | Affinium Pharmaceuticals, Inc. | High throughput screening method for biological agents affecting fatty acid biosynthesis |
| US7033795B2 (en) | 2000-10-06 | 2006-04-25 | Affinium Pharmaceuticals, Inc. | FabK variant |
| US7048926B2 (en) | 2000-10-06 | 2006-05-23 | Affinium Pharmaceuticals, Inc. | Methods of agonizing and antagonizing FabK |
| US7056697B2 (en) | 2000-10-06 | 2006-06-06 | Affinium Pharmaceuticals, Inc. | FabK variant |
| WO2009023343A3 (en) * | 2007-05-22 | 2009-12-23 | Wisconsin Alumni Research Foundation | Anti-bacterial drug targeting of genome maintenance interfaces |
| US7790412B2 (en) | 1997-07-02 | 2010-09-07 | sanofi pasteur limited/sanofi pasteur limitée | Nucleic acid and amino acid sequences relating to Streptococcus pneumoniae for diagnostics and therapeutics |
| EP2270175A1 (en) * | 2001-03-27 | 2011-01-05 | Novartis Vaccines and Diagnostics S.r.l. | Streptococcus pneumoniae proteins and nucleic acids |
| EP2311991A1 (en) * | 2003-04-15 | 2011-04-20 | Intercell AG | S. pneumoniae antigens |
| US8022180B2 (en) * | 2004-07-13 | 2011-09-20 | Affiris Forschungs-Und Entwicklungs Gmbh | Method for preventing and treating Alzheimer's disease |
| US8349336B2 (en) | 2003-03-04 | 2013-01-08 | Intercell Ag | Streptococcus pyogenes antigens |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE69739981D1 (en) * | 1996-10-31 | 2010-10-14 | Human Genome Sciences Inc | Streptococcus pneumoniae antigens and vaccines |
| AU753971B2 (en) * | 1997-12-31 | 2002-10-31 | Millennium Pharmaceuticals, Inc. | Essential bacterial genes and their use |
| US6268177B1 (en) * | 1998-09-22 | 2001-07-31 | Smithkline Beecham Corporation | Isolated nucleic acid encoding nucleotide pyrophosphorylase |
-
2000
- 2000-12-29 CA CA002396040A patent/CA2396040A1/en not_active Abandoned
- 2000-12-29 AU AU43006/01A patent/AU4300601A/en not_active Abandoned
- 2000-12-29 IL IL14947200A patent/IL149472A0/en unknown
- 2000-12-29 WO PCT/US2000/035604 patent/WO2001049721A2/en not_active Ceased
- 2000-12-29 EP EP00992297A patent/EP1261630A2/en not_active Withdrawn
Cited By (31)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7867501B2 (en) | 1997-07-02 | 2011-01-11 | Sanofi Pasteur Limited/Sanofi Pasteur Limitee | Nucleic acid and amino acid sequences relating to Streptococcus pneumoniae for diagnostics and therapeutics |
| US7834166B2 (en) | 1997-07-02 | 2010-11-16 | Sanofi Pasteur Limited | Nucleic acid and amino acid sequences relating to Streptococcus pneumoniae for diagnostics and therapeutics |
| US7875437B2 (en) | 1997-07-02 | 2011-01-25 | Sanofi Pasteur Limited/Sanofi Pasteur Limitee | Nucleic acid and amino acid sequences relating to streptococcus pneumoniae for diagnostics and therapeutics |
| US8293884B2 (en) | 1997-07-02 | 2012-10-23 | sanofi pasteur limited/sanofi pasteur limitée | Nucleic acid and amino acid sequences relating to Streptococcus pneumoniae for diagnostics and therapeutics |
| US7875439B2 (en) | 1997-07-02 | 2011-01-25 | Sanofi Pasteur Limited | Nucleic acid and amino acid sequences relating to Streptococcus pneumoniae for diagnostics and therapeutics |
| US8003775B2 (en) | 1997-07-02 | 2011-08-23 | Sanofi Pasteur Limited | Nucleic acid and amino acid sequences relating to Streptococcus pneumoniae for diagnostics and therapeutics |
| US8609107B2 (en) | 1997-07-02 | 2013-12-17 | Sanofi Pasteur Limited | Nucleic acid and amino acid sequences relating to Streptococcus pneumoniae for diagnostics and therapeutics |
| US8288519B2 (en) | 1997-07-02 | 2012-10-16 | Sanofi Pasteur Limited | Nucleic acid and amino acid sequences relating to Streptococcus pneumoniae for diagnostics and therapeutics |
| US7790412B2 (en) | 1997-07-02 | 2010-09-07 | sanofi pasteur limited/sanofi pasteur limitée | Nucleic acid and amino acid sequences relating to Streptococcus pneumoniae for diagnostics and therapeutics |
| US8298788B2 (en) | 1997-07-02 | 2012-10-30 | Sanofi Pasteur Limited | Nucleic acid and amino acid sequences relating to Streptococcus pneumoniae for diagnostics and therapeutics |
| US7875438B2 (en) | 1997-07-02 | 2011-01-25 | Sanofi Pasteur Limited/Sanofi Pasteur Limitee | Nucleic acid and amino acid sequences relating to Streptococcus pneumoniae for diagnostics and therapeutics |
| US8293249B2 (en) | 1997-07-02 | 2012-10-23 | Sanofi Pasteur Limited/Sanofi Pasteur Limitee | Nucleic acid and amino acid sequences relating to Streptococcus pneumoniae for diagnostics and therapeutics |
| US6720139B1 (en) | 1999-01-27 | 2004-04-13 | Elitra Pharmaceuticals, Inc. | Genes identified as required for proliferation in Escherichia coli |
| US6951729B1 (en) | 1999-10-27 | 2005-10-04 | Affinium Pharmaceuticals, Inc. | High throughput screening method for biological agents affecting fatty acid biosynthesis |
| US7241610B2 (en) | 2000-04-18 | 2007-07-10 | Institut National De La Recherche Agronomique (Inra) | Lactic acid bacteria overproducing exopolysaccharides |
| WO2001079476A1 (en) * | 2000-04-18 | 2001-10-25 | Institut National De La Recherche Agronomique (Inra) | Lactic acid bacteria overproducing exopolysaccharides |
| US7858335B2 (en) | 2000-08-24 | 2010-12-28 | Omnigene Bioproducts, Inc. | Microorganisms and assays for the identification of antibiotics |
| WO2002016601A3 (en) * | 2000-08-24 | 2003-01-23 | Omnigene Bioproducts Inc | Microorganisms and assays for the identification of antibiotics |
| US6830898B2 (en) | 2000-08-24 | 2004-12-14 | Omnigene Bioproducts, Inc. | Microorganisms and assays for the identification of antibiotics |
| US7033795B2 (en) | 2000-10-06 | 2006-04-25 | Affinium Pharmaceuticals, Inc. | FabK variant |
| US7048926B2 (en) | 2000-10-06 | 2006-05-23 | Affinium Pharmaceuticals, Inc. | Methods of agonizing and antagonizing FabK |
| US7056697B2 (en) | 2000-10-06 | 2006-06-06 | Affinium Pharmaceuticals, Inc. | FabK variant |
| US6821746B2 (en) | 2000-10-06 | 2004-11-23 | Affinium Pharmaceuticals, Inc. | Methods of screening for FabK antagonists and agonists |
| EP2270175A1 (en) * | 2001-03-27 | 2011-01-05 | Novartis Vaccines and Diagnostics S.r.l. | Streptococcus pneumoniae proteins and nucleic acids |
| US8349336B2 (en) | 2003-03-04 | 2013-01-08 | Intercell Ag | Streptococcus pyogenes antigens |
| AU2004230244B2 (en) * | 2003-04-15 | 2011-09-22 | Intercell Ag | S. pneumoniae antigens |
| EP2311991A1 (en) * | 2003-04-15 | 2011-04-20 | Intercell AG | S. pneumoniae antigens |
| US8372411B2 (en) | 2003-04-15 | 2013-02-12 | Intercell Ag | S. pneumoniae antigens |
| US8022180B2 (en) * | 2004-07-13 | 2011-09-20 | Affiris Forschungs-Und Entwicklungs Gmbh | Method for preventing and treating Alzheimer's disease |
| US8415393B2 (en) | 2007-05-22 | 2013-04-09 | Wisconsin Alumni Research Foundation | Anti-bacterial drug targeting of genome maintenance interfaces |
| WO2009023343A3 (en) * | 2007-05-22 | 2009-12-23 | Wisconsin Alumni Research Foundation | Anti-bacterial drug targeting of genome maintenance interfaces |
Also Published As
| Publication number | Publication date |
|---|---|
| EP1261630A2 (en) | 2002-12-04 |
| CA2396040A1 (en) | 2001-07-12 |
| AU4300601A (en) | 2001-07-16 |
| WO2001049721A3 (en) | 2002-09-12 |
| IL149472A0 (en) | 2002-11-10 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Dussurget et al. | Molecular determinants of Listeria monocytogenes virulence | |
| WO2001049721A2 (en) | Bacterial genes and proteins that are essential for cell viability and their uses | |
| JP4852211B2 (en) | Identification of essential genes in prokaryotes | |
| Boibessot et al. | The rational design, synthesis, and antimicrobial properties of thiophene derivatives that inhibit bacterial histidine kinases | |
| Lock et al. | Cell-division inhibitors: new insights for future antibiotics | |
| AU2001270381B2 (en) | Streptococcus antigens | |
| Bliss et al. | Coating the surface: a model for expression of capsular polysialic acid in Escherichia coli K1 | |
| US8241642B2 (en) | Streptococcus pneumoniae open reading frames encoding polypeptide antigens and uses thereof | |
| US20050196758A1 (en) | Novel enoyl reductases and methods of use thereof | |
| JP2008069162A (en) | Dna methyl transferase inhibitor | |
| Ju et al. | Discovery of novel peptidomimetic boronate ClpP inhibitors with noncanonical enzyme mechanism as potent virulence blockers in vitro and in vivo | |
| EA018621B1 (en) | PIPERAZINE DERIVATIVES USED AS Ca2.2 CALCIUM CHANNEL MODULATORS | |
| Swier et al. | Insight into the complete substrate-binding pocket of ThiT by chemical and genetic mutations | |
| TW316903B (en) | ||
| US7855228B2 (en) | Antibiotics targeting MreB | |
| WO1999045136A1 (en) | Methods for assaying type iii secretion inhibitors | |
| JP2002524068A (en) | Novel BAG proteins and nucleic acid molecules encoding them | |
| WO2003104391A2 (en) | Antibacterial targets in alloiococcus otitidis | |
| WO2001085773A2 (en) | Luxo-sigma54 interactions and methods of use | |
| US20030003444A1 (en) | Compositions and methods involving an essential Staphylococcus aureus gene and its encoded protein STAAU_R9 | |
| Hubbard et al. | Pathogenicity and histidine kinases: approaches toward the development of a new generation of antibiotics | |
| CA2365929A1 (en) | Novel method for identifying antibacterial compounds | |
| Jones | Structure, Function, and Inhibition of Peptidoglycan O-Acetyltransferase A from Staphylococcus aureus | |
| Neißner | Structural and dynamical analysis of protein-ligand interactions by NMR-spectroscopy and X-ray crystallography | |
| Do Gunther | Discovery of an amidase-activator complex that controls Staphylococcus aureus cell growth and division |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
| REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 149472 Country of ref document: IL |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 43006/01 Country of ref document: AU |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2000992297 Country of ref document: EP |
|
| ENP | Entry into the national phase |
Ref country code: JP Ref document number: 2001 550261 Kind code of ref document: A Format of ref document f/p: F |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2396040 Country of ref document: CA |
|
| AK | Designated states |
Kind code of ref document: A3 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A3 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
| WWP | Wipo information: published in national office |
Ref document number: 2000992297 Country of ref document: EP |
|
| WWW | Wipo information: withdrawn in national office |
Ref document number: 2000992297 Country of ref document: EP |