AU2003299859A1 - Stress-related polypeptides and uses therefor - Google Patents
Stress-related polypeptides and uses therefor
- Publication number
- AU2003299859A1
- Authority
- AU
- Australia
- Prior art keywords
- nucleic acid
- seq
- acid molecule
- acid sequence
- polypeptide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Cell Biology (AREA)
- Botany (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Description
WO 2004/061080 PCT/US2003/041098

STRESS-RELATED POLYPEPTIDES AND USES THEREFOR

Cross Reference To Related Applications

This application is based on and claims priority to United States Provisional Application Serial Number 60/463,564, filed December 26, 2002, which is herein incorporated by reference in its entirety.

Technical Field

The presently disclosed subject matter relates, in general, to transgenic plants. More particularly, the presently disclosed subject matter relates to stress-related polypeptides, nucleic acid molecules encoding the polypeptides, and uses thereof.

Table of Abbreviations

ABA - abscisic acid
AOS - active oxygen species
FPD - Functional Protein Domain
HR - hypersensitive response
HSPs - high scoring sequence pairs
LR - local resistance
PP2A - type 2A serine/threonine protein phosphatase
SA - salicylic acid
SAR - systemic acquired resistance

Amino Acid Abbreviations and Corresponding mRNA Codons

Amino Acid | 3-Letter | 1-Letter | mRNA Codons
Alanine | Ala | A | GCA GCC GCG GCU
Arginine | Arg | R | AGA AGG CGA CGC CGG CGU
Asparagine | Asn | N | AAC AAU
Aspartic Acid | Asp | D | GAC GAU
Cysteine | Cys | C | UGC UGU
Glutamic Acid | Glu | E | GAA GAG
Glutamine | Gln | Q | CAA CAG
Glycine | Gly | G | GGA GGC GGG GGU
Histidine | His | H | CAC CAU
Isoleucine | Ile | I | AUA AUC AUU
Leucine | Leu | L | UUA UUG CUA CUC CUG CUU
Lysine | Lys | K | AAA AAG
Methionine | Met | M | AUG
Proline | Pro | P | CCA CCC CCG CCU
Phenylalanine | Phe | F | UUC UUU
Serine | Ser | S | AGC AGU UCA UCC UCG UCU
Threonine | Thr | T | ACA ACC ACG ACU
Tryptophan | Trp | W | UGG
Tyrosine | Tyr | Y | UAC UAU
Valine | Val | V | GUA GUC GUG GUU

Background Art

As some of the major human staples, monocot plants such as rice, corn, and wheat have been a target of genetic engineering for resistance to diseases, pests, and environmental stresses of various kinds.
Knowledge of plant-pathogen interactions and the complex networks of proteins that act in concert to respond to environmental stresses has important applications in agriculture, providing new approaches to disease control. Modulation of interactions between proteins that participate in stress responses can be exploited for the development of genetically engineered plants that are resistant to pathogens. The production of pest-resistant crops provides an alternative to environmentally damaging pesticides for improvement of agricultural yield. For example, detailed knowledge of signaling pathways regulating innate immunity can help develop strategies for durable crop protection.

Resistance to disease occurs on several levels that include local and nonspecific systemic responses. The hypersensitive response (HR) in plants is a mechanism of local resistance to pathogenic microbes characterized by a rapid and localized tissue collapse and cell death at the infection site, resulting in immobilization of the intruding pathogen. This process is triggered by pathogen elicitors and orchestrated by an oxidative burst, which occurs rapidly after the attack (Lamb & Dixon, 1997). The accumulation of active oxygen species (AOS) is a central theme during plant responses to both biotic and abiotic stresses. AOS are generated at the onset of the HR and might be instrumental in killing host tissue during the initial stages of infection. AOS also act as signaling molecules that induce expression of PR genes and production of other signaling molecules which participate in the signal cascade that leads to PR gene induction. The triggering of defense genes can extend to the uninfected tissues and the whole plant, leading to local resistance (LR) and systemic acquired resistance (SAR; reviewed in Martinez et al., 2000).
As a result of SAR, other portions of the plant are provided with long-lasting protection against the same and unrelated pathogens.

Hydrogen peroxide from the oxidative burst plays an important role in the localized HR not only by driving the cross-linking of cell wall structural proteins, but also by triggering cell death in challenged cells and as a diffusible signal for the induction in adjacent cells of genes encoding cellular protectants such as glutathione S-transferase and glutathione peroxidase (Levine et al., 1994) and for the production of salicylic acid (SA). SA is thought to act as a signaling molecule in LR and SAR through generation of SA radicals, a likely by-product of the interaction of SA with catalases and peroxidases, as reported by Martinez et al., 2000. These authors showed that recognition of a bacterial pathogen by cotton triggers the oxidative burst that precedes the production of SA in cells undergoing the HR, and that hydrogen peroxide is required for local and systemic accumulation of SA, thus acting as the initiating signal for LR and SAR. The involvement of catalase in SA-mediated induction of SAR in plants was previously demonstrated by Chen et al., 1993 who showed that binding of catalase to SA results in inhibition of catalase activity, and that consequent accumulation of hydrogen peroxide induces expression of defense-related genes associated with SAR.

The cell wall can also play a role in defense against bacterial and fungal pathogens by receiving information from the surface of the pathogen from molecules called elicitors, and by transmitting this information to the plasma membrane of plant cells, resulting in gene-activated processes that lead to resistance.
One type of biochemical reaction induced by elicitors and associated with the hypersensitive response is the synthesis and accumulation of phytoalexins, antimicrobial compounds produced in the plant after fungal or bacterial infection (reviewed in Hammerschmidt, 1999). Other responses can involve the expression of proteases that activate other signaling molecules, and enzymes that allow the plant to respond with morphological changes to physical insult produced by pathogen attack.

Stress responses do not occur in isolation from other cellular processes, but can be intimately linked to other aspects of plant growth and development, such as control of the cell cycle and senescence. Some proteins are known to act both in general pathways of cellular growth and development as well as in response to particular stresses. For example, type 2A serine/threonine protein phosphatases (PP2A) are important regulators of signal transduction, which they affect by dephosphorylation of other proteins (Janssens & Goris, 2001). There are multiple PP2A isoforms in plants and other organisms, and they appear to be differentially expressed in various tissues and at different stages of development (Arino et al., 1993). Harris et al. cite a number of reports describing the association of PP2A subunits with a variety of cellular proteins in addition to regulatory subunits, suggesting that PP2As function as regulators of various signaling pathways associated with protein synthesis, cell cycle and apoptosis (Harris et al., 1999). PP2A enzymes have been implicated as mediators of a number of plant growth and developmental processes.

In addition, PP2A enzymes play a role in pathogen invasion. In animals, a variety of viral proteins target specific PP2A enzymes to deregulate chosen cellular pathways in the host and promote viral progeny (Sontag, 2001; Garcia et al., 2000).
PP2A enzymes interact with many cellular and viral proteins, and these protein-protein interactions are critical to modulation of PP2A signaling (Sontag, supra). The proteins interacting with PP2A (e.g., PP2A) can, for example, target PP2A to different subcellular compartments, or affect PP2A enzyme activity.

To modulate plant responses to biotic and abiotic stresses, there is a need for a more comprehensive understanding of signaling pathways and networks of protein-protein interactions. Further, additional factors involved in these networks must be identified to facilitate the engineering of plants more tolerant to biotic and abiotic stresses.

Summary

This Summary lists several embodiments of the presently disclosed subject matter, and in many cases lists variations and permutations of these embodiments. This Summary is merely exemplary of the numerous and varied embodiments. Mention of one or more representative features of a given embodiment is likewise exemplary. Such an embodiment can typically exist with or without the feature(s) mentioned; likewise, those features can be applied to other embodiments of the presently disclosed subject matter, whether listed in this Summary or not. To avoid excessive repetition, this Summary does not list or suggest all possible combinations of such features.

The presently disclosed subject matter provides proteins and nucleic acid molecules encoding such proteins that are involved in the control and regulation of plant maturation and development, including proliferation, senescence, disease-resistance, stress response including stress resistance, and differentiation. The presently disclosed subject matter provides compositions comprising at least one of the proteins described herein, as well as methods for using the proteins disclosed herein to affect plant maturation, development, and responses to stress.
The presently disclosed subject matter provides an isolated nucleic acid molecule encoding a stress-related polypeptide, wherein the polypeptide binds in a yeast two hybrid assay to a fragment of a protein selected from the group consisting of OsGF14-c (SEQ ID NO: 113), OsDAD1 (SEQ ID NO: 128), Os006819-2510 (SEQ ID NO: 20), OsCRTC (SEQ ID NO: 134), OsSGT1 (SEQ ID NO: 144), OsERP (SEQ ID NO: 146), OsCHIB1 (SEQ ID NO: 152), OsCS (SEQ ID NO: 156), OsPP2A-2 (SEQ ID NO: 164), and OsCAA90866 (SEQ ID NO: 170). In one embodiment, the isolated nucleic acid molecule is derived from rice (Oryza sativa). In another embodiment, the isolated nucleic acid molecule comprises a nucleic acid sequence selected from the group consisting of odd numbered SEQ ID NOs: 1-111.

The presently disclosed subject matter also provides a description of interactions between stress-related proteins and polypeptides encoded by the isolated nucleic acid molecules disclosed herein. In one embodiment, the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 1-15 and the protein comprises an amino acid sequence of SEQ ID NO: 114. In another embodiment, the isolated nucleic acid molecule comprises a nucleic acid sequence of one of SEQ ID NOs: 7 and 17 and the protein comprises an amino acid sequence of SEQ ID NO: 128. In another embodiment, the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 21-25 and the protein comprises an amino acid sequence of SEQ ID NO: 20. In another embodiment, the isolated nucleic acid molecule comprises a nucleic acid sequence of SEQ ID NO: 27 and the protein comprises an amino acid sequence of SEQ ID NO: 134. In another embodiment, the isolated nucleic acid molecule comprises a nucleic acid sequence of SEQ ID NO: 29 and the protein comprises an amino acid sequence of SEQ ID NO: 138.
In another embodiment, the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 31-43 and the protein comprises an amino acid sequence of SEQ ID NO: 144. In another embodiment, the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 45-67 and the protein comprises an amino acid sequence of SEQ ID NO: 146. In another embodiment, the isolated nucleic acid molecule comprises a nucleic acid sequence of SEQ ID NO: 69 and the protein comprises an amino acid sequence of SEQ ID NO: 36. In another embodiment, the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 71-77 and the protein comprises an amino acid sequence of SEQ ID NO: 152. In another embodiment, the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 79-95 and the protein comprises an amino acid sequence of SEQ ID NO: 156. In another embodiment, the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 97-105 and the protein comprises an amino acid sequence of SEQ ID NO: 164. In still another embodiment, the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 97 and 107-111 and the protein comprises an amino acid sequence of SEQ ID NO: 170.
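The pairings recited in the embodiments above can be collected into a small lookup table. The following Python sketch is illustrative only: the names `odd_range`, `INTERACTIONS`, and `proteins_paired_with` are hypothetical and do not appear in the specification, and the table simply transcribes the SEQ ID NOs stated in this Summary, keyed by the SEQ ID NO of each protein's amino acid sequence.

```python
def odd_range(first, last):
    """Return the odd SEQ ID NOs from first to last, inclusive."""
    return [n for n in range(first, last + 1) if n % 2 == 1]


# Key: SEQ ID NO of the protein's amino acid sequence, as recited above.
# Value: SEQ ID NOs of the nucleic acid sequences paired with that protein.
INTERACTIONS = {
    114: odd_range(1, 15),
    128: [7, 17],
    20: odd_range(21, 25),
    134: [27],
    138: [29],
    144: odd_range(31, 43),
    146: odd_range(45, 67),
    36: [69],
    152: odd_range(71, 77),
    156: odd_range(79, 95),
    164: odd_range(97, 105),
    170: [97] + odd_range(107, 111),
}


def proteins_paired_with(nucleic_acid_id):
    """List the protein SEQ ID NOs recited together with a nucleic acid SEQ ID NO."""
    return sorted(
        protein_id
        for protein_id, nucleic_ids in INTERACTIONS.items()
        if nucleic_acid_id in nucleic_ids
    )


if __name__ == "__main__":
    # SEQ ID NOs 7 and 97 are each recited with two different proteins.
    print(proteins_paired_with(7))
    print(proteins_paired_with(97))
```

Laid out this way, it is easy to see that SEQ ID NOs: 7 and 97 are each recited with more than one protein, and that SEQ ID NO: 19 is the only odd number in the range 1-111 not recited in any of these pairings.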
The presently disclosed subject matter also provides an isolated nucleic acid molecule encoding a stress-related polypeptide, wherein the nucleic acid molecule is selected from the group consisting of:
(a) a nucleic acid molecule encoding a polypeptide comprising an amino acid sequence of one of even numbered SEQ ID NOs: 2-112;
(b) a nucleic acid molecule comprising a nucleic acid sequence of one of odd numbered SEQ ID NOs: 1-111;
(c) a nucleic acid molecule that has a nucleic acid sequence at least 90% identical to the nucleic acid sequence of the nucleic acid molecule of (a) or (b);
(d) a nucleic acid molecule that hybridizes to (a) or (b) under conditions of hybridization selected from the group consisting of:
    (i) 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM ethylenediamine tetraacetic acid (EDTA) at 50°C with a final wash in 2X standard saline citrate (SSC), 0.1% SDS at 50°C;
    (ii) 7% SDS, 0.5 M NaPO4, 1 mM EDTA at 50°C with a final wash in 1X SSC, 0.1% SDS at 50°C;
    (iii) 7% SDS, 0.5 M NaPO4, 1 mM EDTA at 50°C with a final wash in 0.5X SSC, 0.1% SDS at 50°C;
    (iv) 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C with a final wash in 0.1X SSC, 0.1% SDS at 50°C; and
    (v) 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C with a final wash in 0.1X SSC, 0.1% SDS at 65°C;
(e) a nucleic acid molecule comprising a nucleic acid sequence fully complementary to (a); and
(f) a nucleic acid molecule comprising a nucleic acid sequence that is the full reverse complement of (a).

The presently disclosed subject matter also provides an isolated stress-related polypeptide encoded by the disclosed isolated nucleic acid molecules, or a functional fragment, domain, or feature thereof.
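Items (c), (e), and (f) above rest on three simple sequence operations: percent identity, complement, and reverse complement. The Python sketch below illustrates them under stated assumptions. The function names are hypothetical, sequences are written as DNA (A, C, G, T), and `percent_identity` assumes the two sequences have already been aligned to equal length, with every mismatched or gapped column counting against identity. This passage does not specify an alignment method for the 90% threshold, so the scoring shown is an assumption, not the method of the disclosure.

```python
# Translation table for Watson-Crick base pairing (DNA alphabet assumed).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def complement(seq):
    """Return the base-by-base complement, read in the same direction."""
    return seq.translate(_COMPLEMENT)


def reverse_complement(seq):
    """Return the complement read in the opposite direction."""
    return complement(seq)[::-1]


def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns holding the same base in both sequences.

    Assumes the inputs are pre-aligned and of equal length; a column
    containing a gap character ('-') never counts as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to equal length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    print(complement("ATGC"))          # TACG
    print(reverse_complement("ATGC"))  # GCAT
    print(percent_identity("ATGCATGCAT", "ATGCATGCAA"))  # 90.0
```

The difference between items (e) and (f) is only the reading direction: the complement pairs base by base with the original, while the reverse complement is that same strand written 5' to 3'.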
The presently disclosed subject matter also provides a method for producing a polypeptide disclosed herein, the method comprising the steps of: (a) growing cells comprising an expression cassette under suitable growth conditions, the expression cassette comprising a nucleic acid molecule as disclosed herein; and (b) isolating the polypeptide from the cells.

The presently disclosed subject matter also provides a transgenic plant cell comprising an isolated nucleic acid molecule disclosed herein. In one embodiment, the plant is selected from the group consisting of corn (Zea mays), Brassica sp., alfalfa (Medicago sativa), rice (Oryza sativa ssp.), rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum vulgare), pearl millet (Pennisetum glaucum), proso millet (Panicum miliaceum), foxtail millet (Setaria italica), finger millet (Eleusine coracana), sunflower (Helianthus annuus), safflower (Carthamus tinctorius), wheat (Triticum aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanut (Arachis hypogaea), cotton, sweet potato (Ipomoea batatas), cassava (Manihot esculenta), coffee (Coffea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa spp.), avocado (Persea americana), fig (Ficus carica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia integrifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), sugarcane (Saccharum spp.), oats, duckweed (Lemna), barley, a vegetable, an ornamental, and a conifer. In another embodiment, the plant is rice (Oryza sativa ssp.). In one embodiment, the duckweed is selected from the group consisting of genus Lemna, genus Spirodela, genus Wolffia, and genus Wolffiella.
In one embodiment, the vegetable is selected from the group consisting of tomatoes, lettuce, guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean, lima bean, fava bean, lentils, chickpea, green bean, lima bean, pea, and members of the genus Cucumis. In one embodiment, the ornamental is selected from the group consisting of impatiens, Begonia, Pelargonium, Viola, Cyclamen, Verbena, Vinca, Tagetes, Primula, Saintpaulia, Ageratum, Amaranthus, Antirrhinum, Aquilegia, Cineraria, Clover, Cosmos, Cowpea, Dahlia, Datura, Delphinium, Gerbera, Gladiolus, Gloxinia, Hippeastrum, Mesembryanthemum, Salpiglossis, and Zinnia, azalea, hydrangea, hibiscus, rose, tulip, daffodil, petunia, carnation, poinsettia, and chrysanthemum. In one embodiment, the conifer is selected from the group consisting of loblolly pine, slash pine, ponderosa pine, lodgepole pine, Monterey pine, Douglas-fir, Western hemlock, Sitka spruce, redwood, silver fir, balsam fir, Western red cedar, and Alaska yellow-cedar.
In another embodiment, the transgenic plant is a plant selected from the group consisting of Acacia, aneth, artichoke, arugula, blackberry, canola, cilantro, clementines, escarole, eucalyptus, fennel, grapefruit, honey dew, jicama, kiwifruit, lemon, lime, mushroom, nut, okra, orange, parsley, persimmon, plantain, pomegranate, poplar, radiata pine, radicchio, Southern pine, sweetgum, tangerine, triticale, vine, yams, apple, pear, quince, cherry, apricot, melon, hemp, buckwheat, grape, raspberry, chenopodium, blueberry, nectarine, peach, plum, strawberry, watermelon, eggplant, pepper, cauliflower, Brassica, broccoli, cabbage, Brussels sprouts, onion, carrot, leek, beet, broad bean, celery, radish, pumpkin, endive, gourd, garlic, snapbean, spinach, squash, turnip, ultilane, and zucchini.

The presently disclosed subject matter also provides an isolated stress-related polypeptide, wherein the polypeptide binds in a yeast two hybrid assay to a fragment of a protein selected from the group consisting of OsGF14-c (SEQ ID NO: 113), OsDAD1 (SEQ ID NO: 128), Os006819-2510 (SEQ ID NO: 20), OsCRTC (SEQ ID NO: 134), OsSGT1 (SEQ ID NO: 144), OsERP (SEQ ID NO: 146), OsCHIB1 (SEQ ID NO: 152), OsCS (SEQ ID NO: 156), OsPP2A-2 (SEQ ID NO: 164), and OsCAA90866 (SEQ ID NO: 170). In one embodiment, the isolated stress-related polypeptide is selected from the group consisting of (a) a polypeptide comprising an amino acid sequence of even numbered SEQ ID NOs: 2-112; and (b) a polypeptide comprising an amino acid sequence at least 80% similar to the polypeptide of (a) using the GCG Wisconsin Package SEQWEB® application of GAP with the default GAP analysis parameters. In another embodiment, the polypeptide comprises an amino acid sequence of one of even numbered SEQ ID NOs: 2-112.

The presently disclosed subject matter also provides an expression cassette comprising a nucleic acid molecule encoding a stress-related polypeptide disclosed herein.
In one embodiment, the nucleic acid molecule encoding a stress-related polypeptide comprises a nucleic acid sequence selected from odd numbered SEQ ID NOs: 1-111. In one embodiment, the expression cassette further comprises a regulatory element operatively linked to the nucleic acid molecule. In one embodiment, the regulatory element comprises a promoter. In one embodiment, the promoter is a plant promoter. In another embodiment, the promoter is a constitutive promoter. In another embodiment, the promoter is a tissue-specific or a cell type specific promoter. In one embodiment, the tissue-specific or cell type specific promoter directs expression of the expression cassette in a location selected from the group consisting of epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, flower, seed, and combinations thereof.

The presently disclosed subject matter also provides a transgenic plant cell comprising a disclosed expression cassette. In one embodiment, the expression cassette comprises an isolated nucleic acid molecule comprising a nucleic acid sequence of one of odd numbered SEQ ID NOs: 1-111.

The presently disclosed subject matter also provides transgenic plants comprising a disclosed expression cassette, as well as transgenic seeds and progeny of the transgenic plants disclosed herein.
The presently disclosed subject matter also provides a method for modulating stress response of a plant cell comprising introducing into the plant cell an expression cassette comprising an isolated nucleic acid molecule encoding a stress-related polypeptide, wherein the polypeptide binds in a yeast two hybrid assay to a fragment of a protein selected from the group consisting of OsGF14-c (SEQ ID NO: 113), OsDAD1 (SEQ ID NO: 128), Os006819-2510 (SEQ ID NO: 20), OsCRTC (SEQ ID NO: 134), OsSGT1 (SEQ ID NO: 144), OsERP (SEQ ID NO: 146), OsCHIB1 (SEQ ID NO: 152), OsCS (SEQ ID NO: 156), OsPP2A-2 (SEQ ID NO: 164), and OsCAA90866 (SEQ ID NO: 170). In one embodiment of the disclosed method, the expression of the polypeptide in the cell results in an enhancement of a rate or extent of proliferation of the cell. In another embodiment, the expression of the polypeptide in the cell results in a decrease in a rate or extent of proliferation of the cell.

In another embodiment of the instant method, the isolated nucleic acid molecule comprises a nucleic acid sequence selected from one of odd numbered SEQ ID NOs: 1-173. In another embodiment, the isolated nucleic acid molecule comprises a nucleic acid sequence selected from one of odd numbered SEQ ID NOs: 1-111.

Accordingly, it is an object of the presently disclosed subject matter to provide methods and compositions that can be used to enhance agriculturally important plants. This object is achieved in whole or in part by the presently disclosed subject matter.

An object of the presently disclosed subject matter having been stated above, other objects and advantages will become apparent to those of ordinary skill in the art after a study of the following description of the presently claimed subject matter and non-limiting Examples.

Brief Description of the Drawings

Figure 1 is a schematic representation of the interactions between various, non-limiting, stress-related proteins of the invention. Arrows indicate interaction direction between DNA binding domain fused proteins (thick lined boxes or ovals) and activation domain fused proteins. Dotted boxes indicate previously published interactions. Ovals rather than boxes indicate that a protein fused to the DNA binding domain did not interact with other proteins. Circular arrows depict self-interactions. Dotted lines indicate amino acid similarity between proteins. The proteins listed in the Figure can be classified as follows: biotic stress (20251); abiotic stress (12464, 19902, 22844, 22874, 23059, and 23426); and chloroplast (19842, 22832, 22840, 22844, 22858, 22874, 23059, 23061, 23426, and 30846).

Figure 2 is a schematic representation of the interactions between various, non-limiting, stress-related proteins of the invention. Arrows indicate interaction direction between DNA binding domain fused proteins (thick lined boxes or ovals) and activation domain fused proteins. Dotted boxes indicate previously published interactions. Ovals rather than boxes indicate that a protein fused to the DNA binding domain did not interact with other proteins. Circular arrows depict self-interactions. Dotted lines indicate amino acid similarity between proteins. The proteins listed in the Figure can be classified as follows: development (glutamyl amino peptidase); biotic stress (19651, 20899, and 22823); abiotic stress (20775, 29077, 29098, 29086, and 29113).

Figure 3 is a schematic representation of the interactions between various, non-limiting, stress-related proteins of the invention. Arrows indicate interaction direction between DNA binding domain fused proteins (thick lined boxes or ovals) and activation domain fused proteins. Dotted boxes indicate previously published interactions.
Ovals rather than boxes indicate that a protein fused to the DNA binding domain did not interact with other proteins. Circular arrows depict self-interactions. Dotted lines indicate amino acid similarity between proteins. The proteins listed in the Figure can be classified as follows: biotic stress (ORF020300-2233.2, 23268, 011994 D16, and OsPP2-A) and abiotic stress (23225, OsCAA90866, and 3209 OS208938).

Brief Description of the Sequence Listing

SEQ ID NOs: 1-174 present nucleic acid and amino acid sequences of the rice (Oryza sativa) polypeptides employed in the two hybrid assays disclosed hereinbelow. For these SEQ ID NOs., the odd numbered sequences are nucleic acid sequences, and the even numbered sequences are the deduced amino acid sequences of the nucleic acid sequence of the immediately preceding SEQ ID NO. For example, SEQ ID NO: 2 is the deduced amino acid sequence of the nucleic acid sequence presented in SEQ ID NO: 1, SEQ ID NO: 4 is the deduced amino acid sequence of the nucleic acid sequence presented in SEQ ID NO: 3, SEQ ID NO: 6 is the deduced amino acid sequence of the nucleic acid sequence presented in SEQ ID NO: 5, etc. Further description of the SEQ ID NOs. is presented in the following Table:

SEQ ID NOs. | PN Number | Description
1, 2 | 22858 | Novel Protein 22858, Fragment, similar to Arabidopsis GTP Cyclohydrolase II (BAB09512.1; e=0)
3, 4 | 22874 | Novel Protein 22874, Fragment, similar to Arabidopsis Putative Phosphatidylinositol-4-phosphate 5-kinase (NP_187603.1; 4e-" 8)
5, 6 | 22866 | Novel Protein PN22866, Fragment, Similar to A. thaliana Vacuolar ATP Synthase Subunit C (V-ATPase C subunit; Vacuolar proton pump C subunit; Q9SDS7; e-152)
7, 8 | 23022 | Novel Protein PN23022, Fragment, similar to H. vulgare Plasma Membrane H+-ATPase (CAC50884; e=0.0)
9, 10 | 23061 | Hypothetical Protein OsContig3864, Similar to H. vulgare Photosystem I Reaction Center Subunit II, Chloroplast Precursor (P36213; 6eB1 7)
11, 12 | 29982 | Novel Protein PN29982
13, 14 | 30846 | Novel Protein PN30846
15, 16 | 30974 | Novel Protein PN30974
17, 18 | 23053 | Novel Protein 23053, Fragment, Similar to Arabidopsis Putative Na+-Dependent Inorganic Phosphate Cotransporter (NP_181341.1; e 105)
19, 20 | 20462 | Hypothetical Protein 006819-2510, Similar to Senescence-Related Protein 5 from Hemerocallis Hybrid Cultivar (AAC34855.1; e-97)
21, 22 | 23226 | Novel Protein PN23226, Callose synthase
23, 24 | 23485 | Novel Protein PN23485, Similar to Hordeum vulgare Coproporphyrinogen III Oxidase, chloroplast precursor (Q42840; e-169)
25, 26 | 29037 | Novel Protein PN29037
27, 28 | 29950 | Novel Protein PN29950
29, 30 | 20551 | Hypothetical Protein 003118-3674 Similar to Lycopersicon esculentum Calmodulin
31, 32 | 24060 | L-aspartase-like protein-like
33, 34 | 23914 | RNA binding domain protein
35, 36 | 23221 | Proline rich protein
37, 38 | 24061 | Auxin induced protein-like
39, 40 | 23949 | HSP70-like
41, 42 | 28982 | Archain delta COP-like
43, 44 | 29042 | Fibrillin-like
45, 46 | 29984 | Novel Protein PN29950
47, 48 | 30844 | Novel protein PN30844
49, 50 | 30868 | NAD(P) binding domain protein
51, 52 | 24292 | Gamma adaptin-like
53, 54 | 29983 | Novel protein PN29983
55, 56 | 30845 | Pectinesterase-like
57, 58 | 31085 | Receptor-like protein kinase-like
59, 60 | 20674 | Pyruvate orthophosphate dikinase-like
61, 62 | 30870 | lsp-4 like
63, 64 | 29997 | Xanthine dehydrogenase-like
65, 66 | 30843 | Ubiquitin specific protease-like
67, 68 | 30857 | Novel protein PN30857
69, 70 | 20115 | Ring zinc finger protein
71, 72 | 22823 | Novel Protein PN22823, Similar to ABC Transporter Proteins (T02187, AB043999.1, NP_171753; e=0)
73, 74 | 22154 | Novel Protein PN22154, Similar to A. thaliana Glutamyl Aminopeptidase (AL035525; e=0)
75, 76 | 29041 | Novel Protein PN29041, Fragment, Similar to A. thaliana Putative ATPase (AAG52137; e 17)
77, 78 | 22020 | Novel Protein PN22020, Fragment, Similar to A. thaliana Putative Protein (NP_197783; 3e 34)
79, 80 | 22825 | Novel Protein PN22825, Fragment
81, 82 | 29076 | Novel Protein PN29076, Fragment
83, 84 | 29077 | Novel Protein PN29077, Fragment, Similar to A. thaliana DNA-Damage Inducible Protein DDI1-Like (BAB02792; 5e-94)
85, 86 | 29084 | Novel Protein PN29084, Fragment, Similar to Soybean (Glycine max) Calcium-Dependent Protein Kinase (A43713, 2e-79)
87, 88 | 29115 | Novel Protein PN29115, Fragment, Similar to A. thaliana 6,7-Dimethyl-8-Ribityllumazine Synthase Precursor (AAK93590, 6e-37)
89, 90 | 29116 | Novel Protein PN29116, Fragment
91, 92 | 29117 | Novel Protein PN29117
93, 94 | 29118 | Novel Protein PN29118, Fragment
95, 96 | 29119 | Novel Protein PN29119, Fragment
97, 98 | 21639 | Hypothetical Protein ORF020300-2233.2, Putative PP2A Regulatory Subunit, Similar to OsCAA90866 (AAD39930; 5 e 2 ; CAA90866; 5e-53
)
WO 2004/061080 PCT/US2003/041098 17 99, 100 23268 Novel Protein 23268, Similar to Phosphoribosylanthranilate Transferase, Chloroplast Precursor, Fragment (AAB02913.1; 5e-95) 101,102 26645 Novel Protein PN26645, Putative Protein Disulfide Isomerase-Related Protein Precursor (BAB09470.1; e-) 103,104 24162 Novel Protein PN24162, Porin-like, Voltage Dependent Anion Channel Protein (NP_201551; 3e- 8 6 ) 105, 106 20618 Hypothetical Protein 011994-D16, Similar to Z. mays DnaJ protein (TO1 643; e=0) 107, 108 23045 Novel Protein PN23045 109, 110 23225 Novel Protein PN23225, Similar to Tritticum aestivum Initiation Factor (iso)4f p82 Subunit (AAA74724; e=0) 111,112 29883 Novel Protein PN29883, Fragment 113, 114 12464 0. sativa 14-3-3 Protein Homolog GF14-c (U65957) 115,116 22844 0. sativa 3-Phosphoshikimate 1 carboxyvinyltransferase (a.k.a. EPSP Synthase ; AB052962; BAB61062.1) 117, 118 22832 0. sativa Fructose-Bisphosphate Aldolase, Chloroplast Precursor (Q40677) 119, 120 23426 0. sativa Chloroplast Ribulose Bisphosphate Carboxylase, Large Chain (D00207; P12089) 121,122 19842 0. sativa Ribulose Bisphosphate Carboxylase/Oxygenase Activase, Large Isoform Al (AB034698, BAA97583) WO 2004/061080 PCT/US2003/041098 18 123, 124 23059 OsContig4331, 0. sativa Putative 33kDa Oxygen-Evolving Protein of Photosystem II (BAB64069) 125, 126 22840 0. sativa Photosystem 1110 kDa Polypeptide (U86018; T04177) 127, 128 20251 0. sativa Defender Against Apoptotic Death 1 (D89727; BAA24104) 129, 130 19902 Beta-Expansin EXPB2 (U95968; AAB61710) 131,132 24059 0. sativa Histone Deacetylase HD1 (AF332875; AAKO1712.1) 133, 134 20544 0. sativa Calreticulin Precursor (AB021259; BAA88900) 135, 136 22883 Oryza sativa Low Temperature-Induced Protein 5 (AB011368; BAA24979.1) 137, 138 23878 Oryza sativa Putative Myosin (AC090120; AAL31066.1) 139, 140 20554 0. 
sativa DEHYDRIN RAB 16B (P22911) 141, 142 19701 Soluble Starch Synthase (AF165890; AAD49850) 143, 144 20285 OsSGTI (gil6581058) 145, 146 20696 Elicitor responsive protein (gil11358958) 147, 148 24063 RAS GTPase (gil730510) 149, 150 20621 Shaggy kinase (gil13677093) 151, 152 19651 0. sativa Chitinase, Class Ill (AF296279; AAG02504) 153,154 20899 0. sativa Catalase A Isozyme (D29966; BAA06232) 155,156 19707 0. sativa Cellulose Synthase Catalytic Subunit, RSWI-Like (AF030052; AAC39333) WO 2004/061080 PCT/US2003/041098 19 157, 158 29086 0. sativa salT Gene Product (AF001 395; AAB53810.1) 159, 160 29098 0. sativa Aquaporin (AF062393) 161,162 29113 0. sativa DNAJ Homologue (BAB70509.1) 163,164 20254 0. sativa Serine/Threonine Protein Phosphatase PP2A-2, Catalytic Subunit (AF134552, AAD22116) 165, 166 23266 0. sativa Putative Proline-Rich Protein AAK63900 (AC084884) 167, 168 24775 0. sativa Glutelin CAA33838 (X15833) 169, 170 20311 0. sativa Chilling-Inducible Protein CAA90866 (Z54153, CAA90866) 171,172 20215 0. sativa Putative 14-3-3 Protein (AAK38492) 173,174 23186 0. sativa Putative Pyrrolidone Carboxyl Peptidase (AAG46136) Detailed Description The presently disclosed subject matter will be now be described more fully hereinafter with reference to the accompanying Examples, in which 5 representative embodiments of the presently disclosed subject matter are shown. The presently disclosed subject matter can, however, be embodied in different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of 10 the presently disclosed subject matter to those skilled in the art. 
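The numbering convention stated in the Brief Description of the Sequence Listing (odd SEQ ID NOs. are nucleic acid sequences, and each even SEQ ID NO. is the deduced amino acid sequence of the immediately preceding one) can be illustrated with a minimal sketch. The function names `sequence_type` and `paired_seq_ids` are hypothetical and are used here for illustration only; the range 1-174 is taken from the text.

```python
# Illustrative sketch only; the function names are hypothetical.
FIRST_SEQ_ID = 1
LAST_SEQ_ID = 174  # SEQ ID NOs: 1-174, as stated in the text


def sequence_type(seq_id: int) -> str:
    """Odd SEQ ID NOs. are nucleic acid sequences; even ones are amino acid sequences."""
    if not FIRST_SEQ_ID <= seq_id <= LAST_SEQ_ID:
        raise ValueError(f"SEQ ID NO: {seq_id} is outside 1-174")
    return "nucleic acid" if seq_id % 2 == 1 else "amino acid"


def paired_seq_ids(seq_id: int) -> tuple:
    """Return (nucleic acid SEQ ID NO, deduced amino acid SEQ ID NO) for either member of a pair."""
    kind = sequence_type(seq_id)  # also validates the range
    if kind == "nucleic acid":
        return seq_id, seq_id + 1
    return seq_id - 1, seq_id


print(paired_seq_ids(6))  # SEQ ID NO: 6 is deduced from SEQ ID NO: 5
```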
All of the patents (including published patent applications) and publications (including GENBANK® sequence references), which are cited herein, are hereby incorporated by reference in their entireties to the same extent as if each were specifically stated to be incorporated by reference. Any inconsistency between these patents and publications and the present disclosure shall be resolved in favor of the present disclosure.
I. General Considerations
A goal of functional genomics is to identify genes controlling expression of organismal phenotypes, and functional genomics employs a variety of methodologies including, but not limited to, bioinformatics, gene expression studies, gene and gene product interactions, genetics, biochemistry, and molecular genetics. For example, bioinformatics can assign function to a given gene by identifying genes in heterologous organisms with a high degree of similarity (homology) at the amino acid or nucleotide level. Studies of the expression of a gene at the mRNA or polypeptide levels can assign function by linking expression of the gene to an environmental response, a developmental process, or a genetic (mutational) or molecular genetic (gene overexpression or underexpression) perturbation. Expression of a gene at the mRNA level can be ascertained either alone (for example, by Northern analysis) or in concert with other genes (for example, by microarray analysis), whereas expression of a gene at the polypeptide level can be ascertained either alone (for example, by native or denatured polypeptide gel or immunoblot analysis) or in concert with other genes (for example, by proteomic analysis). Knowledge of polypeptide/polypeptide and polypeptide/DNA interactions can assign function by identifying polypeptides and nucleic acid sequences acting together in the same biological process.
Genetics can assign function to a gene by demonstrating that DNA lesions (mutations) in the gene have a quantifiable effect on the organism, including, but not limited to, its development; hormone biosynthesis and response; growth and growth habit (plant architecture); mRNA expression profiles; polypeptide expression profiles; ability to resist diseases; tolerance of abiotic stresses (for example, drought conditions); ability to acquire nutrients; photosynthetic efficiency; altered primary and secondary metabolism; and the composition of various plant organs. Biochemistry can assign function by demonstrating that the polypeptide(s) encoded by the gene, typically when expressed in a heterologous organism, possesses a certain enzymatic activity, either alone or in combination with other polypeptides. Molecular genetics can assign function by overexpressing or underexpressing the gene in the native plant or in heterologous organisms, and observing quantifiable effects as disclosed in functional assignment by genetics above. In functional genomics, any or all of these approaches are utilized, often in concert, to assign functions to genes across any of a number of organismal phenotypes.
It is recognized by those skilled in the art that these different methodologies can each provide data as evidence for the function of a particular gene, and that such evidence is stronger with increasing amounts of data used for functional assignment: in one embodiment from a single methodology, in another embodiment from two methodologies, and in still another embodiment from more than two methodologies. In addition, those skilled in the art are aware that different methodologies can differ in the strength of the evidence provided for the assignment of gene function. Typically, but not always, a datum of biochemical, genetic, or molecular genetic evidence is considered stronger than a datum of bioinformatic or gene expression evidence.
Finally, those skilled in the art recognize that, for different genes, a single datum from a single methodology can differ in terms of the strength of the evidence provided by each distinct datum for the assignment of the function of these different genes.
The objective of crop trait functional genomics is to identify crop trait genes of interest, for example, genes capable of conferring useful agronomic traits in crop plants. Such agronomic traits include, but are not limited to, enhanced yield, whether in quantity or quality; enhanced nutrient acquisition and metabolic efficiency; enhanced or altered nutrient composition of plant tissues used for food, feed, fiber, or processing; enhanced utility for agricultural or industrial processing; enhanced resistance to plant diseases; enhanced tolerance of adverse environmental conditions (abiotic stresses) including, but not limited to, drought, excessive cold, excessive heat, or excessive soil salinity or extreme acidity or alkalinity; and alterations in plant architecture or development, including changes in developmental timing.
The deployment of such identified trait genes by either transgenic or non-transgenic means can materially improve crop plants for the benefit of agriculture.
Cereals are the most important crop plants on the planet in terms of both human and animal consumption. Genomic synteny (conservation of gene order within large chromosomal segments) is observed in rice, maize, wheat, barley, rye, oats, and other agriculturally important monocots, which facilitates the mapping and isolation of orthologous genes from diverse cereal species based on the sequence of a single cereal gene. Rice has the smallest (about 420 Mb) genome among the cereal grains, and has recently been a major focus of public and private genomic and EST sequencing efforts. See Goff et al., 2002.
II. Definitions
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the presently disclosed subject matter pertains. For clarity of the present specification, certain definitions are presented hereinbelow.
Following long-standing patent law convention, the terms "a" and "an" mean "one or more" when used in this application, including in the claims.
As used herein, the term "about", when referring to a value or to an amount of mass, weight, time, volume, concentration or percentage is meant to encompass variations of ±20% or ±10%, in another example ±5%, in another example ±1%, and in still another example ±0.1% from the specified amount, as such variations are appropriate to practice the presently disclosed subject matter. Unless otherwise indicated, all numbers expressing quantities of ingredients, reaction conditions, and so forth used in the specification and claims are to be understood as being modified in all instances by the term "about".
Accordingly, unless indicated to the contrary, the numerical parameters set forth in this specification and attached claims are approximations that can vary depending upon the desired properties sought to be obtained by the presently disclosed subject matter.
As used herein, the terms "amino acid" and "amino acid residue" are used interchangeably and refer to any of the twenty naturally occurring amino acids, as well as analogs, derivatives, and congeners thereof; amino acid analogs having variant side chains; and all stereoisomers of any of the foregoing. Thus, the term "amino acid" is intended to embrace all molecules, whether natural or synthetic, which include both an amino functionality and an acid functionality and which are capable of being included in a polymer of naturally occurring amino acids.
An amino acid is formed upon chemical digestion (hydrolysis) of a polypeptide at its peptide linkages. The amino acid residues described herein are in one embodiment in the "L" isomeric form. However, residues in the "D" isomeric form can be substituted for any L-amino acid residue, as long as the desired functional property is retained by the polypeptide. NH2 refers to the free amino group present at the amino terminus of a polypeptide. COOH refers to the free carboxy group present at the carboxy terminus of a polypeptide. In keeping with standard polypeptide nomenclature, abbreviations for amino acid residues are shown in tabular form presented hereinabove.
It is noted that all amino acid residue sequences represented herein by formulae have a left-to-right orientation in the conventional direction of amino terminus to carboxy terminus. In addition, the phrases "amino acid" and "amino acid residue" are broadly defined to include modified and unusual amino acids.
Furthermore, it is noted that a dash at the beginning or end of an amino acid residue sequence indicates a peptide bond to a further sequence of one or more amino acid residues or a covalent bond to an amino-terminal group such as NH2 or acetyl or to a carboxy-terminal group such as COOH.
As used herein, the terms "associated with" and "operatively linked" refer to two nucleic acid sequences that are related physically or functionally.
For example, a promoter or regulatory DNA sequence is said to be "associated with" a DNA sequence that encodes an RNA or a polypeptide if the two sequences are operatively linked, or situated such that the regulatory DNA sequence will affect the expression level of the coding or structural DNA sequence.
As used herein, the term "chimera" refers to a polypeptide that comprises domains or other features that are derived from different polypeptides or are in a position relative to each other that is not naturally occurring.
As used herein, the term "chimeric construct" refers to a recombinant nucleic acid molecule in which a promoter or regulatory nucleic acid sequence is operatively linked to, or associated with, a nucleic acid sequence that codes for an mRNA or which is expressed as a polypeptide, such that the regulatory nucleic acid sequence is able to regulate transcription or expression of the associated nucleic acid sequence. The regulatory nucleic acid sequence of the chimeric construct is not normally operatively linked to the associated nucleic acid sequence as found in nature.
As used herein, the term "co-factor" refers to a natural reactant, such as an organic molecule or a metal ion, required in an enzyme-catalyzed reaction. A co-factor can be, for example, NAD(P), riboflavin (including FAD and FMN), folate, molybdopterin, thiamin, biotin, lipoic acid, pantothenic acid and coenzyme A, S-adenosylmethionine, pyridoxal phosphate, ubiquinone, and menaquinone. In one embodiment, a co-factor can be regenerated and reused.
As used herein, the terms "coding sequence" and "open reading frame" (ORF) are used interchangeably and refer to a nucleic acid sequence that is transcribed into RNA such as mRNA, rRNA, tRNA, snRNA, sense RNA, or antisense RNA. In one embodiment, the RNA is then translated in vivo or in vitro to produce a polypeptide.
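The embodiment of a "coding sequence" that is transcribed into mRNA and then translated into a polypeptide can be sketched as follows. This is a minimal illustration that assumes the standard genetic code, a DNA coding sequence supplied in the sense orientation, and translation from the first base in frame; the names `transcribe` and `translate` are hypothetical and do not appear in the present disclosure.

```python
# Illustrative sketch only. Assumes the standard genetic code; names are hypothetical.
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def transcribe(coding_dna: str) -> str:
    """Return the mRNA with the same sequence as the DNA coding sequence (T replaced by U)."""
    return coding_dna.upper().replace("T", "U")


def translate(mrna: str) -> str:
    """Translate an mRNA in frame from its first base, stopping at the first stop codon."""
    dna = mrna.upper().replace("U", "T")
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


mrna = transcribe("ATGGCCTAA")
print(mrna, translate(mrna))
```

The one-letter output follows the amino-terminus to carboxy-terminus orientation used for amino acid residue sequences herein.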
As used herein, the term "complementary" refers to two nucleotide sequences that comprise antiparallel nucleotide sequences capable of pairing with one another upon formation of hydrogen bonds between the complementary base residues in the antiparallel nucleotide sequences. As is known in the art, the nucleic acid sequences of two complementary strands are the reverse complement of each other when each is viewed in the 5' to 3' direction.
As is also known in the art, two sequences that hybridize to each other under a given set of conditions do not necessarily have to be 100% fully complementary. As used herein, the terms "fully complementary" and "100% complementary" refer to sequences for which the complementary regions are 100% in Watson-Crick base-pairing, i.e., that no mismatches occur within the complementary regions. However, as is often the case with recombinant molecules (for example, cDNAs) that are cloned into cloning vectors, certain of these molecules can have non-complementary overhangs on either the 5' or 3' ends that result from the cloning event. In such a situation, it is understood that the region of 100% or full complementarity excludes any sequences that are added to the recombinant molecule (typically at the ends) solely as a result of, or to facilitate, the cloning event. Such sequences are, for example, polylinker sequences, linkers with restriction enzyme recognition sites, etc.
As used herein, the terms "domain" and "feature", when used in reference to a polypeptide or amino acid sequence, refer to a subsequence of an amino acid sequence that has a particular biological function. Domains and features that have a particular biological function include, but are not limited to, ligand binding, nucleic acid binding, catalytic activity, substrate binding, and polypeptide-polypeptide interacting domains.
Similarly, when used herein in reference to a nucleic acid sequence, a "domain", or "feature" is that subsequence of the nucleic acid sequence that encodes a domain or feature of a polypeptide.
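Returning to the definition of "complementary" given above, the statement that two complementary strands are the reverse complement of each other when each is viewed in the 5' to 3' direction can be sketched as follows. This is an illustration only: it assumes DNA sequences written 5' to 3' using only the bases A, C, G, and T, it ignores cloning-derived overhangs, and the names `reverse_complement` and `fully_complementary` are hypothetical.

```python
# Illustrative sketch only; DNA alphabet (A, C, G, T) assumed, names are hypothetical.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq_5_to_3: str) -> str:
    """Return the complementary strand, also written in the 5' to 3' direction."""
    return seq_5_to_3.upper().translate(_COMPLEMENT)[::-1]


def fully_complementary(first: str, second: str) -> bool:
    """True if the two strands pair by Watson-Crick base-pairing with no mismatches."""
    return reverse_complement(first) == second.upper()


print(reverse_complement("ATGC"))
print(fully_complementary("ATGC", "GCAT"))
```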
As used herein, the term "enzyme activity" refers to the ability of an enzyme to catalyze the conversion of a substrate into a product. A substrate for the enzyme can comprise the natural substrate of the enzyme but also can comprise analogues of the natural substrate, which can also be converted by the enzyme into a product or into an analogue of a product. The activity of the enzyme is measured for example by determining the amount of product in the reaction after a certain period of time, or by determining the amount of substrate remaining in the reaction mixture after a certain period of time. The activity of the enzyme can also be measured by determining the amount of an unused co-factor of the reaction remaining in the reaction mixture after a certain period of time or by determining the amount of used co-factor in the reaction mixture after a certain period of time. The activity of the enzyme can also be measured by determining the amount of a donor of free energy or energy-rich molecule (e.g., ATP, phosphoenolpyruvate, acetyl phosphate, or phosphocreatine) remaining in the reaction mixture after a certain period of time or by determining the amount of a used donor of free energy or energy-rich molecule (e.g., ADP, pyruvate, acetate, or creatine) in the reaction mixture after a certain period of time.
As used herein, the term "expression cassette" refers to a nucleic acid molecule capable of directing expression of a particular nucleotide sequence in an appropriate host cell, comprising a promoter operatively linked to the nucleotide sequence of interest which is operatively linked to termination signals. It also typically comprises sequences required for proper translation of the nucleotide sequence. The coding region usually encodes a polypeptide of interest but can also encode a functional RNA of interest, for example antisense RNA or a non-translated RNA, in the sense or antisense direction.
The expression cassette comprising the nucleotide sequence of interest can be chimeric, meaning that at least one of its components is heterologous with respect to at least one of its other components. The expression cassette can also be one that is naturally occurring but has been obtained in a recombinant form useful for heterologous expression. Typically, however, the expression cassette is heterologous with respect to the host; i.e., the particular DNA sequence of the expression cassette does not occur naturally in the host cell and was introduced into the host cell or an ancestor of the host cell by a transformation event. The expression of the nucleotide sequence in the expression cassette can be under the control of a constitutive promoter or of an inducible promoter that initiates transcription only when the host cell is exposed to some particular external stimulus. In the case of a multicellular organism such as a plant, the promoter can also be specific to a particular tissue, organ, or stage of development.
As used herein, the term "fragment" refers to a sequence that comprises a subset of another sequence. When used in the context of a nucleic acid or amino acid sequence, the terms "fragment" and "subsequence" are used interchangeably. A fragment of a nucleic acid sequence can be any number of nucleotides that is less than that found in another nucleic acid sequence, and thus includes, but is not limited to, the sequences of an exon or intron, a promoter, an enhancer, an origin of replication, a 5' or 3' untranslated region, a coding region, and a polypeptide binding domain. It is understood that a fragment or subsequence can also comprise less than the entirety of a nucleic acid sequence, for example, a portion of an exon or intron, promoter, enhancer, etc.
Similarly, a fragment or subsequence of an amino acid sequence can be any number of residues that is less than that found in a naturally occurring polypeptide, and thus includes, but is not limited to, domains, features, repeats, etc. Also similarly, it is understood that a fragment or subsequence of an amino acid sequence need not comprise the entirety of the amino acid sequence of the domain, feature, repeat, etc. A fragment can also be a "functional fragment", in which the fragment retains a specific biological function of the nucleic acid sequence or amino acid sequence of interest. For example, a functional fragment of a transcription factor can include, but is not limited to, a DNA binding domain, a transactivating domain, or both. Similarly, a functional fragment of a receptor tyrosine kinase includes, but is not limited to, a ligand binding domain, a kinase domain, an ATP binding domain, and combinations thereof.
As used herein, the term "gene" refers to a nucleic acid that encodes an RNA, for example, nucleic acid sequences including, but not limited to, structural genes encoding a polypeptide. The target gene can be a gene derived from a cell, an endogenous gene, a transgene, or exogenous genes such as genes of a pathogen, for example a virus, which is present in the cell after infection thereof. The cell containing the target gene can be derived from or contained in any organism, for example a plant, animal, protozoan, virus, bacterium, or fungus. The term "gene" also refers broadly to any segment of DNA associated with a biological function.
As such, the term "gene" encompasses sequences including but not limited to a coding sequence, a promoter region, a transcriptional regulatory sequence, a non-expressed DNA segment that is a specific recognition sequence for regulatory proteins, a non-expressed DNA segment that contributes to gene expression, a DNA segment designed to have desired parameters, or combinations thereof. A gene can be obtained by a variety of methods, including cloning from a biological sample, synthesis based on known or predicted sequence information, and recombinant derivation from one or more existing sequences.
As is understood in the art, a gene comprises a coding strand and a non-coding strand. As used herein, the terms "coding strand" and "sense strand" are used interchangeably, and refer to a nucleic acid sequence that has the same sequence of nucleotides as an mRNA from which the gene product is translated. As is also understood in the art, when the coding strand and/or sense strand is used to refer to a DNA molecule, the coding/sense strand includes thymidine residues instead of the uridine residues found in the corresponding mRNA. Additionally, when used to refer to a DNA molecule, the coding/sense strand can also include additional elements not found in the mRNA including, but not limited to promoters, enhancers, and introns. Similarly, the terms "template strand" and "antisense strand" are used interchangeably and refer to a nucleic acid sequence that is complementary to the coding/sense strand.
As used herein, the terms "complementarity" and "complementary" refer to a nucleic acid that can form one or more hydrogen bonds with another nucleic acid sequence by either traditional Watson-Crick or other non-traditional types of interactions.
In reference to the nucleic acid molecules of the presently disclosed subject matter, the binding free energy for a nucleic acid molecule with its complementary sequence is sufficient to allow the relevant function of the nucleic acid to proceed, in one embodiment, RNAi activity. For example, the degree of complementarity between the sense and antisense strands of the siRNA construct can be the same or different from the degree of complementarity between the antisense strand of the siRNA and the target nucleic acid sequence. Complementarity to the target sequence of less than 100% in the antisense strand of the siRNA duplex, including point mutations, is not well tolerated when these changes are located between the 3'-end and the middle of the antisense siRNA, whereas mutations near the 5'-end of the antisense siRNA strand can exhibit a small degree of RNAi activity (Elbashir et al., 2001c). Determination of binding free energies for nucleic acid molecules is well known in the art. See e.g., Freier et al., 1986; Turner et al., 1987.
As used herein, the phrase "percent complementarity" refers to the percentage of contiguous residues in a nucleic acid molecule that can form hydrogen bonds (e.g., Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary). The terms "100% complementary", "fully complementary", and "perfectly complementary" indicate that all of the contiguous residues of a nucleic acid sequence can hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence.
The term "gene expression" generally refers to the cellular processes by which a biologically active polypeptide is produced from a DNA sequence and exhibits a biological activity in a cell.
As such, gene expression involves the processes of transcription and translation, but also involves post-transcriptional and post-translational processes that can influence a biological activity of a gene or gene product. These processes include, but are not limited to, RNA synthesis, processing, and transport, as well as polypeptide synthesis, transport, and post-translational modification of polypeptides. Additionally, processes that affect protein-protein interactions within the cell can also affect gene expression as defined herein.
The terms "heterologous", "recombinant", and "exogenous", when used herein to refer to a nucleic acid sequence (e.g., a DNA sequence) or a gene, refer to a sequence that originates from a source foreign to the particular host cell or, if from the same source, is modified from its original form. Thus, a heterologous gene in a host cell includes a gene that is endogenous to the particular host cell but has been modified through, for example, the use of DNA shuffling or other recombinant techniques (for example, cloning the gene into a vector). The terms also include non-naturally occurring multiple copies of a naturally occurring DNA sequence. Thus, the terms refer to a DNA segment that is foreign or heterologous to the cell, or homologous to the cell but in a position or form within the host cell in which the element is not ordinarily found. Similarly, when used in the context of a polypeptide or amino acid sequence, an exogenous polypeptide or amino acid sequence is a polypeptide or amino acid sequence that originates from a source foreign to the particular host cell or, if from the same source, is modified from its original form. Thus, exogenous DNA segments can be expressed to yield exogenous polypeptides.
A "homologous" nucleic acid (or amino acid) sequence is a nucleic acid (or amino acid) sequence naturally associated with a host cell into which it is introduced.
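The "percent complementarity" arithmetic defined earlier in this section (for example, 5 out of 10 residues being 50% complementary) can be illustrated with a short sketch. It makes simplifying assumptions that are not part of the definition: both sequences are written 5' to 3', have equal length, and are compared in a single ungapped antiparallel alignment, and only Watson-Crick pairs (A-T, A-U, G-C) are counted. The name `percent_complementarity` is hypothetical.

```python
# Illustrative sketch only; ungapped, equal-length comparison is an assumption.
WATSON_CRICK_PAIRS = {
    ("A", "T"), ("T", "A"),
    ("A", "U"), ("U", "A"),
    ("G", "C"), ("C", "G"),
}


def percent_complementarity(first: str, second: str) -> float:
    """Percentage of residues in `first` that pair with `second` in antiparallel alignment."""
    first, second = first.upper(), second.upper()
    if not first or len(first) != len(second):
        raise ValueError("sequences must be non-empty and of equal length")
    antiparallel = second[::-1]  # read the second strand 3' to 5' against the first
    paired = sum((x, y) in WATSON_CRICK_PAIRS for x, y in zip(first, antiparallel))
    return 100.0 * paired / len(first)


print(percent_complementarity("AAAAAGGGGG", "CCCCCTTTTT"))  # all 10 of 10 residues pair
print(percent_complementarity("AAAAAGGGGG", "CCCCCAAAAA"))  # only 5 of 10 residues pair
```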
As used herein, the terms "host cells" and "recombinant host cells" are used interchangeably and refer to cells (for example, plant cells) into which the compositions of the presently disclosed subject matter (for example, an expression vector) can be introduced. Furthermore, the terms refer not only to the particular plant cell into which an expression construct is initially introduced, but also to the progeny or potential progeny of such a cell. Because certain modifications can occur in succeeding generations due to either mutation or environmental influences, such progeny might not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.
The phrase "hybridizing specifically to" refers to the binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under stringent conditions when that sequence is present in a complex mixture (e.g., total cellular) DNA or RNA. The phrase "bind(s) substantially" refers to complementary hybridization between a probe nucleic acid and a target nucleic acid and embraces minor mismatches that can be accommodated by reducing the stringency of the hybridization media to achieve the desired detection of the target nucleic acid sequence.
As used herein, the term "inhibitor" refers to a chemical substance that inactivates or decreases the biological activity of a polypeptide such as a biosynthetic and catalytic activity, receptor, signal transduction polypeptide, structural gene product, or transport polypeptide. The term "herbicide" (or "herbicidal compound") is used herein to define an inhibitor applied to a plant at any stage of development, whereby the herbicide inhibits the growth of the plant or kills the plant.
An "isolated" nucleic acid molecule or protein, or biologically active portion thereof, is substantially free of other cellular material, or culture medium when produced by recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized. Thus, the term "isolated nucleic acid" refers to a polynucleotide of genomic, cDNA, or synthetic origin or some combination thereof, which (1) is not associated with the cell in which the "isolated nucleic acid" is found in nature, or (2) is operatively linked to a polynucleotide to which it is not linked in nature. Similarly, the term "isolated polypeptide" refers to a polypeptide, in certain embodiments prepared from recombinant DNA or RNA, or of synthetic origin, or some combination thereof, which (1) is not associated with proteins that it is normally found with in nature, (2) is isolated from the cell in which it normally occurs, (3) is isolated free of other proteins from the same cellular source, (4) is expressed by a cell from a different species, or (5) does not occur in nature.
In certain embodiments, an "isolated" nucleic acid is free of sequences (e.g., protein encoding or regulatory sequences) that naturally flank the nucleic acid (i.e., sequences located at the 5' and 3' ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of the nucleotide sequences that naturally flank the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived.
A protein that is substantially free of cellular material includes preparations of protein or polypeptide having less than about 30%, 20%, 10%, or 5% (by dry weight) of contaminating protein.
When the protein of the presently disclosed subject matter, or biologically active portion thereof, is recombinantly produced, culture medium represents less than about 30%, 20%, 10%, or 5% (by dry weight) of chemical precursors or non-protein-of-interest chemicals.
Thus, the term "isolated", when used in the context of an isolated DNA molecule or an isolated polypeptide, refers to a DNA molecule or polypeptide that, by the hand of man, exists apart from its native environment and is therefore not a product of nature. An isolated DNA molecule or polypeptide can exist in a purified form or can exist in a non-native environment such as, for example, in a transgenic host cell.
The term "isolated", when used in the context of an "isolated cell", refers to a cell that has been removed from its natural environment, for example, as a part of an organ, tissue, or organism.
As used herein, the term "mature polypeptide" refers to a polypeptide from which the transit peptide, signal peptide, and/or propeptide portions have been removed.
As used herein, the term "minimal promoter" refers to the smallest piece of a promoter, such as a TATA element, that can support any transcription. A minimal promoter typically has greatly reduced promoter activity in the absence of upstream or downstream activation. In the presence of a suitable transcription factor, a minimal promoter can function to permit transcription.
As used herein, the term "modified enzyme activity" refers to enzyme activity that is different from that which naturally occurs in a plant (i.e., enzyme activity that occurs naturally in the absence of direct or indirect manipulation of such activity by man). In one embodiment, a modified enzyme activity is displayed by a non-naturally occurring enzyme that is tolerant to inhibitors that inhibit the cognate naturally occurring enzyme activity.
As used herein, the term "modulate" refers to an increase, decrease, or other alteration of any, or all, chemical and biological activities or properties of a biochemical entity, e.g., a wild-type or mutant nucleic acid molecule. As such, the term "modulate" can refer to a change in which the expression level of a gene, the level of an RNA molecule or equivalent RNA molecules encoding one or more proteins or protein subunits, or the activity of one or more proteins or protein subunits is upregulated or downregulated, such that expression, level, or activity is greater than or less than that observed in the absence of the modulator. For example, the term "modulate" can mean "inhibit" or "suppress", but the use of the word "modulate" is not limited to this definition.
As used herein, the terms "inhibit", "suppress", "down regulate", and grammatical variants thereof are used interchangeably and refer to an activity whereby gene expression or a level of an RNA encoding one or more gene products is reduced below that observed in the absence of a nucleic acid molecule of the presently disclosed subject matter. In one embodiment, inhibition with a nucleic acid molecule (for example, a dsRNA, an antisense RNA, or an siRNA) results in a decrease in the steady state level of a target RNA. In another embodiment, inhibition with a nucleic acid molecule (for example, a dsRNA, an antisense RNA, or an siRNA) results in an expression level of a target gene that is below that level observed in the presence of an inactive or attenuated molecule that is unable to mediate an RNAi response. In another embodiment, inhibition of gene expression with a nucleic acid molecule (for example, a dsRNA, an antisense RNA, or an siRNA) of the presently disclosed subject matter is greater in the presence of the nucleic acid molecule than in its absence. In still another embodiment, inhibition of gene expression is associated with an enhanced rate of degradation of the mRNA encoded by the gene (for example, by RNAi mediated by an siRNA, a dsRNA, or an antisense RNA).
The term "modulation" as used herein refers to both upregulation (i.e., activation or stimulation) and downregulation (i.e., inhibition or suppression) of a response. Thus, the term "modulation", when used in reference to a functional property or biological activity or process (e.g., enzyme activity or receptor binding), refers to the capacity to upregulate (e.g., activate or stimulate), downregulate (e.g., inhibit or suppress), or otherwise change a quality of such property, activity, or process.
In certain instances, such regulation can be contingent on the occurrence of a specific event, such as activation of a signal transduction pathway, and/or can be manifest only in particular cell types.
The term "modulator" refers to a polypeptide, nucleic acid, macromolecule, complex, molecule, small molecule, compound, species, or the like (naturally occurring or non-naturally occurring), or an extract made from biological materials such as bacteria, plants, fungi, or animal cells or tissues, that can be capable of causing modulation. Modulators can be evaluated for potential activity as inhibitors or activators (directly or indirectly) of a functional property, biological activity or process, or a combination thereof (e.g., agonist, partial antagonist, partial agonist, inverse agonist, antagonist, anti-microbial agents, inhibitors of microbial infection or proliferation, and the like) by inclusion in assays. In such assays, many modulators can be screened at one time. The activity of a modulator can be known, unknown, or partially known.
Modulators can be either selective or non-selective. As used herein, the term "selective" when used in the context of a modulator (e.g., an inhibitor) refers to a measurable or otherwise biologically relevant difference in the way the modulator interacts with one molecule (e.g., a gene of interest) versus another similar but not identical molecule (e.g., a member of the same gene family as the gene of interest).
It must be understood that it is not required that the degree to which the interactions differ be completely opposite. Put another way, the term selective modulator encompasses not only those molecules that only bind to mRNA transcripts from a gene of interest and not those of related family members.
The term is also intended to include modulators that are characterized by interactions with transcripts from genes of interest and from related family members that differ to a lesser degree. For example, selective modulators include modulators for which conditions can be found (such as the degree of sequence identity) that would allow a biologically relevant difference in the binding of the modulator to transcripts from the gene of interest versus transcripts from related genes.
When a selective modulator is identified, the modulator will bind to one molecule (for example, an mRNA transcript of a gene of interest) in a manner that is different (for example, stronger) than it binds to another molecule (for example, an mRNA transcript of a gene related to the gene of interest). As used herein, the modulator is said to display "selective binding" or "preferential binding" to the molecule to which it binds more strongly.
As used herein, the term "mutation" carries its traditional connotation and refers to a change, inherited, naturally occurring or introduced, in a nucleic acid or polypeptide sequence, and is used in its sense as generally known to those of skill in the art.
As used herein, the term "native" refers to a gene that is naturally present in the genome of an untransformed plant cell. Similarly, when used in the context of a polypeptide, a "native polypeptide" is a polypeptide that is encoded by a native gene of an untransformed plant cell's genome.
As used herein, the term "naturally occurring" refers to an object that is found in nature as distinct from being artificially produced by man.
For example, a polypeptide or nucleotide sequence that is present in an organism (including a virus) in its natural state, which has not been intentionally modified or isolated by man in the laboratory, is naturally occurring. As such, a polypeptide or nucleotide sequence is considered "non-naturally occurring" if it is encoded by or present within a recombinant molecule, even if the amino acid or nucleic acid sequence is identical to an amino acid or nucleic acid sequence found in nature.
As used herein, the terms "nucleic acid" and "nucleic acid molecule" refer to any of deoxyribonucleic acid (DNA), ribonucleic acid (RNA), oligonucleotides, fragments generated by the polymerase chain reaction (PCR), and fragments generated by any of ligation, scission, endonuclease action, and exonuclease action. Nucleic acids can be composed of monomers that are naturally occurring nucleotides (such as deoxyribonucleotides and ribonucleotides), or analogs of naturally occurring nucleotides (e.g., α-enantiomeric forms of naturally occurring nucleotides), or a combination of both. Modified nucleotides can have modifications in sugar moieties and/or in pyrimidine or purine base moieties. Sugar modifications include, for example, replacement of one or more hydroxyl groups with halogens, alkyl groups, amines, and azido groups, or sugars can be functionalized as ethers or esters. Moreover, the entire sugar moiety can be replaced with sterically and electronically similar structures, such as aza-sugars and carbocyclic sugar analogs. Examples of modifications in a base moiety include alkylated purines and pyrimidines, acylated purines or pyrimidines, or other well-known heterocyclic substitutes. Nucleic acid monomers can be linked by phosphodiester bonds or analogs of such linkages.
Analogs of phosphodiester linkages include phosphorothioate, phosphorodithioate, phosphoroselenoate, phosphorodiselenoate, phosphoroanilothioate, phosphoranilidate, phosphoramidate, and the like. The term "nucleic acid" also includes so-called "peptide nucleic acids", which comprise naturally occurring or modified nucleic acid bases attached to a polyamide backbone. Nucleic acids can be either single stranded or double stranded.
The term "operatively linked", when describing the relationship between two nucleic acid regions, refers to a juxtaposition wherein the regions are in a relationship permitting them to function in their intended manner. For example, a control sequence "operatively linked" to a coding sequence is ligated in such a way that expression of the coding sequence is achieved under conditions compatible with the control sequences, such as when the appropriate molecules (e.g., inducers and polymerases) are bound to the control or regulatory sequence(s). Thus, in one embodiment, the phrase "operatively linked" refers to a promoter connected to a coding sequence in such a way that the transcription of that coding sequence is controlled and regulated by that promoter. Techniques for operatively linking a promoter to a coding sequence are well known in the art; the precise orientation and location relative to a coding sequence of interest is dependent, inter alia, upon the specific nature of the promoter.
Thus, the term "operatively linked" can refer to a promoter region that is connected to a nucleotide sequence in such a way that the transcription of that nucleotide sequence is controlled and regulated by that promoter region. Similarly, a nucleotide sequence is said to be under the "transcriptional control" of a promoter to which it is operatively linked. Techniques for operatively linking a promoter region to a nucleotide sequence are known in the art.
The term "operatively linked" can also refer to a transcription termination sequence or other nucleic acid that is connected to a nucleotide sequence in such a way that termination of transcription of that nucleotide sequence is controlled by that transcription termination sequence. Additionally, the term "operatively linked" can refer to an enhancer, silencer, or other nucleic acid regulatory sequence that, when operatively linked to an open reading frame, modulates the expression of that open reading frame, either in a positive or negative fashion.
As used herein, the phrase "percent identical", in the context of two nucleic acid or polypeptide sequences, refers to two or more sequences or subsequences that have in one embodiment 60%, in another embodiment 70%, in another embodiment 80%, in another embodiment 90%, in another embodiment 95%, and in still another embodiment at least 99% nucleotide or amino acid residue identity, respectively, when compared and aligned for maximum correspondence, as measured using one of the following sequence comparison algorithms or by visual inspection. The percent identity exists in one embodiment over a region of the sequences that is at least about 50 residues in length, in another embodiment over a region of at least about 100 residues, and in another embodiment, the percent identity exists over at least about 150 residues. In still another embodiment, the percent identity exists over the entire length of the sequences.
For sequence comparison, typically one sequence acts as a reference sequence to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are input into a computer, subsequence coordinates are designated if necessary, and sequence algorithm program parameters are designated.
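As a minimal illustration of the percent identity defined above, the following Python sketch counts identical residues across two sequences that have already been aligned. The function name, the "-" gap character, and the choice to leave gapped columns out of the denominator are assumptions made for illustration only; alignment programs differ in how they treat gaps, and the sketch does not reproduce any particular program.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumption: columns with a gap in either sequence are excluded from the
    comparison, so the denominator is the number of residue-residue columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Keep only the columns in which both sequences carry a residue.
    compared = [(a, b) for a, b in zip(aligned_a, aligned_b)
                if a != gap and b != gap]
    if not compared:
        return 0.0
    matches = sum(1 for a, b in compared if a == b)
    return 100.0 * matches / len(compared)


if __name__ == "__main__":
    # Three of the four compared columns are identical.
    print(percent_identity("AC-GT", "ACTGA"))
```

A caller could compare the returned value against whichever embodiment threshold (60%, 70%, and so on) is of interest, over a region of the desired length.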
The sequence comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters.
Optimal alignment of sequences for comparison can be conducted, for example, by the local homology algorithm disclosed in Smith & Waterman, 1981, by the homology alignment algorithm disclosed in Needleman & Wunsch, 1970, by the search for similarity method disclosed in Pearson & Lipman, 1988, by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the GCG Wisconsin Package, available from Accelrys, Inc., San Diego, California, United States of America), or by visual inspection. See generally, Ausubel et al., 1988.
One example of an algorithm that is suitable for determining percent sequence identity and sequence similarity is the BLAST algorithm, which is described in Altschul et al., 1990. Software for performing BLAST analysis is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold. See generally, Altschul et al., 1990. These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always > 0) and N (penalty score for mismatching residues; always < 0).
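The seeding step described above can be sketched in Python as follows. This is a simplified illustration and not the BLAST implementation: it reports only exact word matches of length W between a query and a single database sequence, and it omits the neighborhood word score threshold T altogether. The function name and the returned coordinate pairs are hypothetical.

```python
from collections import defaultdict


def find_word_hits(query: str, subject: str, w: int = 11) -> list[tuple[int, int]]:
    """Return (query_pos, subject_pos) pairs where a word of length w matches exactly.

    Simplification: only identical words count as hits; scoring neighbors
    against a threshold T is not modeled here.
    """
    # Index every word of length w in the query by its start position.
    index: dict[str, list[int]] = defaultdict(list)
    for i in range(len(query) - w + 1):
        index[query[i:i + w]].append(i)

    # Scan the subject and look each of its words up in the query index.
    hits = []
    for j in range(len(subject) - w + 1):
        for i in index.get(subject[j:j + w], []):
            hits.append((i, j))
    return sorted(hits)


if __name__ == "__main__":
    # The word "ACGT" starts at query position 0 and subject position 2.
    print(find_word_hits("ACGTAC", "TTACGT", w=4))
```

Each reported pair would then serve as a seed from which an alignment is extended in both directions.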
For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction is halted when the cumulative alignment score falls off by the quantity X from its maximum achieved value, the cumulative score goes to zero or below due to the accumulation of one or more negative scoring residue alignments, or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) of 10, a cutoff of 100, M = 5, N = -4, and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix. See Henikoff & Henikoff, 1992.
In addition to calculating percent sequence identity, the BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see e.g., Karlin & Altschul, 1993). One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a test nucleic acid sequence is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid sequence to the reference nucleic acid sequence is in one embodiment less than about 0.1, in another embodiment less than about 0.01, and in still another embodiment less than about 0.001.
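The three halting conditions for extending a word hit can be illustrated with a short Python sketch of an ungapped, one-directional extension. It is a simplified model under stated assumptions, not the BLAST source: the function name, the starting score passed in for the seed, and the example X value are hypothetical, and the reward of +5 and penalty of -4 are used only as sample nucleotide values for M and N.

```python
def extend_right(query: str, subject: str, q_pos: int, s_pos: int,
                 start_score: int, reward: int = 5, penalty: int = -4,
                 x_drop: int = 10) -> tuple[int, int]:
    """Extend an ungapped alignment rightward from (q_pos, s_pos).

    Returns (best_score, best_length), where best_length is the number of
    residues added at the point of the maximum cumulative score.
    """
    score = best = start_score
    best_len = 0
    i = 0
    # Condition 3: stop when the end of either sequence is reached.
    while q_pos + i < len(query) and s_pos + i < len(subject):
        score += reward if query[q_pos + i] == subject[s_pos + i] else penalty
        i += 1
        if score > best:
            best, best_len = score, i
        # Condition 1: score fell by at least X from its maximum.
        # Condition 2: cumulative score dropped to zero or below.
        if best - score >= x_drop or score <= 0:
            break
    return best, best_len


if __name__ == "__main__":
    # A 4-residue seed at (0, 0) scored 4 * 5 = 20; extend from position 4.
    print(extend_right("ACGTACGTTT", "ACGTACGAAA", 4, 4, start_score=20))
```

A leftward extension would mirror this loop with decreasing indices; the two results together delimit the high scoring sequence pair around the seed.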
The phrase "hybridizing substantially to" refers to complementary hybridization between a probe nucleic acid molecule and a target nucleic acid molecule and embraces minor mismatches (for example, polymorphisms) that can be accommodated by reducing the stringency of the hybridization and/or wash media to achieve the desired hybridization.
"Stringent hybridization conditions" and "stringent hybridization wash conditions" in the context of nucleic acid hybridization experiments such as Southern and Northern blot analysis are both sequence- and environment-dependent. Longer sequences hybridize specifically at higher temperatures. An extensive guide to the hybridization of nucleic acids is found in Tijssen, 1993. Generally, high stringency hybridization and wash conditions are selected to be about 5°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. Typically, under "highly stringent conditions" a probe will hybridize specifically to its target subsequence, but to no other sequences. Similarly, medium stringency hybridization and wash conditions are selected to be more than about 5°C lower than the Tm for the specific sequence at a defined ionic strength and pH. Exemplary medium stringency conditions include hybridizations and washes as for high stringency conditions, except that the temperatures for the hybridization and washes are in one embodiment 8°C, in another embodiment 10°C, in another embodiment 12°C, and in still another embodiment 15°C lower than the Tm for the specific sequence at a defined ionic strength and pH.
The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Very stringent conditions are selected to be equal to the Tm for a particular probe.
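The relationship between temperature and Tm described above can be sketched as a small Python helper that labels a hybridization or wash temperature by how far it sits below the Tm. The function name, the category labels, and the 0.5°C tolerance used to interpret "about" are assumptions for illustration; the Tm itself must be supplied by the caller, since no formula for it is given here.

```python
def classify_stringency(temp_c: float, tm_c: float, tolerance: float = 0.5) -> str:
    """Label a temperature relative to the Tm of a specific sequence.

    Assumed reading of the text: equal to Tm is very stringent, up to about
    5 degrees C below Tm is high stringency, and more than about 5 degrees C
    below Tm is medium (or lower) stringency.
    """
    offset = tm_c - temp_c  # positive when the temperature is below Tm
    if offset < -tolerance:
        return "above Tm"
    if offset <= tolerance:
        return "very stringent"
    if offset <= 5.0 + tolerance:
        return "high stringency"
    return "medium or lower stringency"


if __name__ == "__main__":
    tm = 65.0  # hypothetical Tm in degrees C for some probe
    for temp in (65.0, 60.0, 55.0):
        print(temp, classify_stringency(temp, tm))
```

The exemplary medium stringency offsets of 8°C, 10°C, 12°C, and 15°C all fall in the last category under this reading.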
An example of highly stringent hybridization conditions for Southern or Northern Blot analysis of complementary nucleic acids having more than about 100 complementary residues is overnight hybridization in 50% formamide with 1 mg of heparin at 42°C. An example of highly stringent wash conditions is 15 minutes in 0.1x standard saline citrate (SSC), 0.1% (w/v) SDS at 65°C. Another example of highly stringent wash conditions is 15 minutes in 0.2x SSC buffer at 65°C (see Sambrook and Russell, 2001 for a description of SSC buffer and other stringency conditions). Often, a high stringency wash is preceded by a lower stringency wash to remove background probe signal. An example of medium stringency wash conditions for a duplex of more than about 100 nucleotides is 15 minutes in 1X SSC at 45°C. Another example of medium stringency wash for a duplex of more than about 100 nucleotides is 15 minutes in 4-6X SSC at 40°C. For short probes (e.g., about 10 to 50 nucleotides), stringent conditions typically involve salt concentrations of less than about 1 M Na+ ion, typically about 0.01 to 1 M Na+ ion concentration (or other salts) at pH 7.0-8.3, and the temperature is typically at least about 30°C. Stringent conditions can also be achieved with the addition of destabilizing agents such as formamide. In general, a signal to noise ratio of 2-fold (or higher) than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization.
The following are examples of hybridization and wash conditions that can be used to clone homologous nucleotide sequences that are substantially similar to reference nucleotide sequences of the presently disclosed subject matter: a probe nucleotide sequence hybridizes in one example to a target nucleotide sequence in 7% sodium dodecyl sulfate (NaDS), 0.5 M NaPO4, 1 mM ethylene diamine tetraacetic acid (EDTA) at 50°C followed by washing in 2X SSC, 0.1% NaDS at 50°C; in another example, a probe and target sequence hybridize in 7% NaDS, 0.5 M NaPO4, 1 mM EDTA at 50°C followed by washing in 1X SSC, 0.1% NaDS at 50°C; in another example, a probe and target sequence hybridize in 7% NaDS, 0.5 M NaPO4, 1 mM EDTA at 50°C followed by washing in 0.5X SSC, 0.1% NaDS at 50°C; in another example, a probe and target sequence hybridize in 7% NaDS, 0.5 M NaPO4, 1 mM EDTA at 50°C followed by washing in 0.1X SSC, 0.1% NaDS at 50°C; in yet another example, a probe and target sequence hybridize in 7% NaDS, 0.5 M NaPO4, 1 mM EDTA at 50°C followed by washing in 0.1X SSC, 0.1% NaDS at 65°C. In one embodiment, hybridization conditions comprise hybridization in a roller tube for at least 12 hours at 42°C.
The term "phenotype" refers to the entire physical, biochemical, and physiological makeup of a cell or an organism, e.g., having any one trait or any group of traits. As such, phenotypes result from the expression of genes within a cell or an organism, and relate to traits that are potentially observable or assayable.
As used herein, the terms "polypeptide", "protein", and "peptide", which are used interchangeably herein, refer to a polymer of the 20 protein amino acids, or amino acid analogs, regardless of its size or function. Although "protein" is often used in reference to relatively large polypeptides, and "peptide" is often used in reference to small polypeptides, usage of these terms in the art overlaps and varies.
The term "polypeptide" as used herein refers to peptides, polypeptides and proteins, unless otherwise noted. The terms "protein", "polypeptide" and "peptide" are used interchangeably herein when referring to a gene product. The term "polypeptide" encompasses proteins of all functions, including enzymes.
Thus, exemplary polypeptides include gene products, naturally occurring proteins, homologs, orthologs, paralogs, fragments, and other equivalents, variants and analogs of the foregoing.
The terms "polypeptide fragment" and "fragment", when used in reference to a reference polypeptide, refer to a polypeptide in which amino acid residues are deleted as compared to the reference polypeptide itself, but where the remaining amino acid sequence is usually identical to the corresponding positions in the reference polypeptide. Such deletions can occur at the amino-terminus or carboxy-terminus of the reference polypeptide, or alternatively both. Fragments typically are at least 5, 6, 8 or 10 amino acids long, at least 14 amino acids long, at least 20, 30, 40 or 50 amino acids long, at least 75 amino acids long, or at least 100, 150, 200, 300, 500 or more amino acids long. A fragment can retain one or more of the biological activities of the reference polypeptide. In certain embodiments, a fragment can comprise a domain or feature, and optionally additional amino acids on one or both sides of the domain or feature, which additional amino acids can number from 5, 10, 15, 20, 30, 40, 50, or up to 100 or more residues. Further, fragments can include a sub-fragment of a specific region, which sub-fragment retains a function of the region from which it is derived. In another embodiment, a fragment can have immunogenic properties.
As used herein, the term "pre-polypeptide" refers to a polypeptide that is normally targeted to a cellular organelle, such as a chloroplast, and still comprises a transit peptide.
As used herein, the term "primer" refers to a sequence comprising in one embodiment two or more deoxyribonucleotides or ribonucleotides, in another embodiment more than three, in another embodiment more than eight, and in yet another embodiment at least about 20 nucleotides of an exonic or intronic region. Such oligonucleotides are in one embodiment between ten and thirty bases in length.
The terms "promoter" and "promoter region" each refer to a nucleotide sequence within a gene that is positioned 5' to a coding sequence and functions to direct transcription of the coding sequence. The promoter region comprises a transcriptional start site, and can additionally include one or more transcriptional regulatory elements. In one embodiment, a method of the presently disclosed subject matter employs an RNA polymerase III promoter.
A "minimal promoter" is a nucleotide sequence that has the minimal elements required to enable basal level transcription to occur. As such, minimal promoters are not complete promoters but rather are subsequences of promoters that are capable of directing a basal level of transcription of a reporter construct in an experimental system. Minimal promoters include but are not limited to the CMV minimal promoter, the HSV-tk minimal promoter, the simian virus 40 (SV40) minimal promoter, the human β-actin minimal promoter, the human EF2 minimal promoter, the adenovirus E1B minimal promoter, and the heat shock protein (hsp) 70 minimal promoter. Minimal promoters are often augmented with one or more transcriptional regulatory elements to influence the transcription of an operatively linked gene. For example, cell-type-specific or tissue-specific transcriptional regulatory elements can be added to minimal promoters to create recombinant promoters that direct transcription of an operatively linked nucleotide sequence in a cell-type-specific or tissue-specific manner.
Different promoters have different combinations of transcriptional regulatory elements. Whether or not a gene is expressed in a cell is dependent on a combination of the particular transcriptional regulatory elements that make up the gene's promoter and the different transcription factors that are present within the nucleus of the cell.
As such, promoters are often classified as "constitutive", "tissue-specific", "cell-type-specific", or "inducible", depending on their functional activities in vivo or in vitro. For example, a constitutive promoter is one that is capable of directing transcription of a gene in a variety of cell types. Exemplary constitutive promoters include the promoters for the following genes which encode certain constitutive or "housekeeping" functions: hypoxanthine phosphoribosyl transferase (HPRT), dihydrofolate reductase (DHFR; Scharfmann et al., 1991), adenosine deaminase, phosphoglycerate kinase (PGK), pyruvate kinase, phosphoglycerate mutase, the β-actin promoter (see e.g., Williams et al., 1993), and other constitutive promoters known to those of skill in the art. "Tissue-specific" or "cell-type-specific" promoters, on the other hand, direct transcription in some tissues and cell types but are inactive in others. Exemplary tissue-specific promoters include those promoters described in more detail hereinbelow, as well as other tissue-specific and cell-type-specific promoters known to those of skill in the art.
When used in the context of a promoter, the term "linked" as used herein refers to a physical proximity of promoter elements such that they function together to direct transcription of an operatively linked nucleotide sequence.
The terms "transcriptional regulatory sequence" and "transcriptional regulatory element", as used herein, each refer to a nucleotide sequence within the promoter region that enables responsiveness to a regulatory transcription factor. Responsiveness can encompass a decrease or an increase in transcriptional output and is mediated by binding of the transcription factor to the DNA molecule comprising the transcriptional regulatory element.
In one embodiment, a transcriptional regulatory sequence is a transcription termination sequence, alternatively referred to herein as a transcription termination signal.
The term "transcription factor" generally refers to a protein that modulates gene expression by interaction with the transcriptional regulatory element and cellular components for transcription, including RNA Polymerase, Transcription Associated Factors (TAFs), chromatin-remodeling proteins, and any other relevant protein that impacts gene transcription.
As used herein, "significance" or "significant" relates to a statistical analysis of the probability that there is a non-random association between two or more entities. To determine whether or not a relationship is "significant" or has "significance", statistical manipulations of the data can be performed to calculate a probability, expressed as a "p-value". Those p-values that fall below a user-defined cutoff point are regarded as significant. In one example, a p-value less than or equal to 0.05, in another example less than 0.01, in another example less than 0.005, and in yet another example less than 0.001, is regarded as significant.
The term "purified" refers to an object species that is the predominant species present (i.e., on a molar basis it is more abundant than any other individual species in the composition). A "purified fraction" is a composition wherein the object species comprises at least about 50 percent (on a molar basis) of all species present. In making the determination of the purity of a species in solution or dispersion, the solvent or matrix in which the species is dissolved or dispersed is usually not included in such determination; instead, only the species (including the one of interest) dissolved or dispersed are taken into account.
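The molar-basis determination described above can be illustrated with a short Python sketch. The function names, the species labels, and the example amounts are hypothetical; the only rules taken from the text are that the solvent or matrix is excluded, that a "purified" species is more abundant than any other individual species, and that a "purified fraction" contains the object species at about 50 percent or more on a molar basis.

```python
def molar_fractions(moles: dict[str, float],
                    solvent: frozenset[str] = frozenset()) -> dict[str, float]:
    """Mole fraction of each dissolved or dispersed species, solvent excluded."""
    solutes = {name: n for name, n in moles.items() if name not in solvent}
    total = sum(solutes.values())
    if total <= 0:
        raise ValueError("no dissolved or dispersed species present")
    return {name: n / total for name, n in solutes.items()}


def is_predominant(fractions: dict[str, float], species: str) -> bool:
    """True if the species is more abundant than any other individual species."""
    return all(fractions[species] > f
               for name, f in fractions.items() if name != species)


def is_purified_fraction(fractions: dict[str, float], species: str,
                         minimum: float = 0.50) -> bool:
    """True if the species makes up at least `minimum` of all species present."""
    return fractions[species] >= minimum


if __name__ == "__main__":
    # Hypothetical composition in arbitrary molar units; water is the solvent.
    sample = {"target": 6.0, "contaminant_a": 3.0,
              "contaminant_b": 1.0, "water": 1000.0}
    fractions = molar_fractions(sample, solvent=frozenset({"water"}))
    print(is_predominant(fractions, "target"),
          is_purified_fraction(fractions, "target"))
```

Raising the `minimum` argument allows the same check to be applied to stricter purity cutoffs.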
Generally, a purified composition will have one species that comprises more than about 80 percent of all species present in the composition, or more than about 85%, 90%, 95%, 99% or more of all species present. The object species can be purified to essential homogeneity (contaminant species cannot be detected in the composition by conventional detection methods) wherein the composition consists essentially of a single species. A skilled artisan can purify a polypeptide of the presently disclosed subject matter using standard techniques for protein purification in light of the teachings herein. Purity of a polypeptide can be determined by a number of methods known to those of skill in the art, including for example, amino-terminal amino acid sequence analysis, gel electrophoresis, and mass-spectrometry analysis.
A "reference sequence" is a defined sequence used as a basis for a sequence comparison. A reference sequence can be a subset of a larger sequence, for example, as a segment of a full-length nucleotide or amino acid sequence, or can comprise a complete sequence. Generally, when used to refer to a nucleotide sequence, a reference sequence is at least 200, 300 or 400 nucleotides in length, frequently at least 600 nucleotides in length, and often at least 800 nucleotides in length. Because two proteins can each (1) comprise a sequence (i.e., a portion of the complete protein sequence) that is similar between the two proteins, and (2) can further comprise a sequence that is divergent between the two proteins, sequence comparisons between two (or more) proteins are typically performed by comparing sequences of the two proteins over a "comparison window" (defined hereinabove) to identify and compare local regions of sequence similarity.
The term "regulatory sequence" is a generic term used throughout the specification to refer to polynucleotide sequences, such as initiation signals, enhancers, regulators, promoters, and termination sequences, which are necessary or desirable to affect the expression of coding and non-coding sequences to which they are operatively linked. Exemplary regulatory sequences are described in Goeddel, 1990, and include, for example, the early and late promoters of simian virus 40 (SV40), adenovirus or cytomegalovirus immediate early promoter, the lac system, the trp system, the TAC or TRC system, T7 promoter whose expression is directed by T7 RNA polymerase, the major operator and promoter regions of phage lambda, the control regions for fd coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase, e.g., Pho5, the promoters of the yeast α-mating factors, the polyhedron promoter of the baculovirus system and other sequences known to control the expression of genes of prokaryotic or eukaryotic cells or their viruses, and various combinations thereof. The nature and use of such control sequences can differ depending upon the host organism. In prokaryotes, such regulatory sequences generally include promoter, ribosomal binding site, and transcription termination sequences. The term "regulatory sequence" is intended to include, at a minimum, components whose presence can influence expression, and can also include additional components whose presence is advantageous, for example, leader sequences and fusion partner sequences.
In certain embodiments, transcription of a polynucleotide sequence is under the control of a promoter sequence (or other regulatory sequence) that controls the expression of the polynucleotide in a cell-type in which expression is intended.
It will also be understood that the polynucleotide can be under the control of regulatory sequences that are the same or different from those sequences which control expression of the naturally occurring form of the polynucleotide.
The term "reporter gene" refers to a nucleic acid comprising a nucleotide sequence encoding a protein that is readily detectable either by its presence or activity, including, but not limited to, luciferase, fluorescent protein (e.g., green fluorescent protein), chloramphenicol acetyl transferase, β-galactosidase, secreted placental alkaline phosphatase, β-lactamase, human growth hormone, and other secreted enzyme reporters. Generally, a reporter gene encodes a polypeptide not otherwise produced by the host cell, which is detectable by analysis of the cell(s), e.g., by the direct fluorometric, radioisotopic or spectrophotometric analysis of the cell(s) and typically without the need to kill the cells for signal analysis. In certain instances, a reporter gene encodes an enzyme, which produces a change in fluorometric properties of the host cell, which is detectable by qualitative, quantitative, or semiquantitative function or transcriptional activation. Exemplary enzymes include esterases, β-lactamase, phosphatases, peroxidases, proteases (tissue plasminogen activator or urokinase) and other enzymes whose function can be detected by appropriate chromogenic or fluorogenic substrates known to those skilled in the art or developed in the future.
As used herein, the term "sequencing" refers to determining the ordered linear sequence of nucleic acids or amino acids of a DNA or protein target sample, using conventional manual or automated laboratory techniques.
As used herein, the term "substantially pure" means that the polynucleotide or polypeptide is substantially free of the sequences and molecules with which it is associated in its natural state, and those molecules used in the isolation procedure. The term "substantially free" means that the sample is in one embodiment at least 50%, in another embodiment at least 70%, in another embodiment 80% and in still another embodiment 90% free of the materials and compounds with which it is associated in nature.
As used herein, the term "target cell" refers to a cell, into which it is desired to insert a nucleic acid sequence or polypeptide, or to otherwise effect a modification from conditions known to be standard in the unmodified cell. A nucleic acid sequence introduced into a target cell can be of variable length. Additionally, a nucleic acid sequence can enter a target cell as a component of a plasmid or other vector or as a naked sequence.
As used herein, the term "transcription" refers to a cellular process involving the interaction of an RNA polymerase with a gene that directs the expression as RNA of the structural information present in the coding sequences of the gene. The process includes, but is not limited to, the following steps: (a) transcription initiation; (b) transcript elongation; (c) transcript splicing; (d) transcript capping; (e) transcript termination; (f) transcript polyadenylation; (g) nuclear export of the transcript; (h) transcript editing; and (i) stabilizing the transcript.
As used herein, the term "transcription factor" refers to a cytoplasmic or nuclear protein which binds to a gene, or binds to an RNA transcript of a gene, or binds to another protein which binds to a gene or an RNA transcript or another protein which in turn binds to a gene or an RNA transcript, so as to thereby modulate expression of the gene.
Such modulation can additionally be achieved by other mechanisms; the essence of a "transcription factor for a gene" pertains to a factor that alters the level of transcription of the gene in some way.
The term "transfection" refers to the introduction of a nucleic acid, e.g., an expression vector, into a recipient cell, which in certain instances involves nucleic acid-mediated gene transfer. The term "transformation" refers to a process in which a cell's genotype is changed as a result of the cellular uptake of exogenous nucleic acid. For example, a transformed cell can express a recombinant form of a polypeptide of the presently disclosed subject matter or antisense expression can occur from the transferred gene so that the expression of a naturally occurring form of the gene is disrupted.
The term "vector" refers to a nucleic acid capable of transporting another nucleic acid to which it has been linked. One type of vector that can be used in accord with the presently disclosed subject matter is an episome, i.e., a nucleic acid capable of extra-chromosomal replication. Other vectors include those capable of autonomous replication and expression of nucleic acids to which they are linked. Vectors capable of directing the expression of genes to which they are operatively linked are referred to herein as "expression vectors". In general, expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. In the present specification, "plasmid" and "vector" are used interchangeably as the plasmid is the most commonly used form of vector. However, the presently disclosed subject matter is intended to include such other forms of expression vectors which serve equivalent functions and which become known in the art subsequently hereto.
The term "expression vector" as used herein refers to a DNA sequence capable of directing expression of a particular nucleotide sequence in an appropriate host cell, comprising a promoter operatively linked to the nucleotide sequence of interest which is operatively linked to transcription termination sequences.
It also typically comprises sequences required for proper translation of the nucleotide sequence. The construct comprising the nucleotide sequence of interest can be chimeric. The construct can also be one that is naturally occurring but has been obtained in a recombinant form useful for heterologous expression. The nucleotide sequence of interest, including any additional sequences designed to effect proper expression of the nucleotide sequences, can also be referred to as an "expression cassette".
The terms "heterologous gene", "heterologous DNA sequence", "heterologous nucleotide sequence", "exogenous nucleic acid molecule", or "exogenous DNA segment", as used herein, each refer to a sequence that originates from a source foreign to an intended host cell or, if from the same source, is modified from its original form. Thus, a heterologous gene in a host cell includes a gene that is endogenous to the particular host cell but has been modified, for example by mutagenesis or by isolation from native transcriptional regulatory sequences. The terms also include non-naturally occurring multiple copies of a naturally occurring nucleotide sequence. Thus, the terms refer to a DNA segment that is foreign or heterologous to the cell, or homologous to the cell but in a position within the host cell nucleic acid wherein the element is not ordinarily found.
Two nucleic acids are "recombined" when sequences from each of the two nucleic acids are combined in a progeny nucleic acid. Two sequences are "directly" recombined when both of the nucleic acids are substrates for recombination. Two sequences are "indirectly recombined" when the sequences are recombined using an intermediate such as a cross-over oligonucleotide. For indirect recombination, no more than one of the sequences is an actual substrate for recombination, and in some cases, neither sequence is a substrate for recombination.
As used herein, the term "regulatory elements" refers to nucleotide sequences involved in controlling the expression of a nucleotide sequence. Regulatory elements can comprise a promoter operatively linked to the nucleotide sequence of interest and termination signals. Regulatory sequences also include enhancers and silencers. They also typically encompass sequences required for proper translation of the nucleotide sequence.
As used herein, the term "significant increase" refers to an increase in activity (for example, enzymatic activity) that is larger than the margin of error inherent in the measurement technique, in one embodiment an increase by about 2 fold or greater over a baseline activity (for example, the activity of the wild type enzyme in the presence of the inhibitor), in another embodiment an increase by about 5 fold or greater, and in still another embodiment an increase by about 10 fold or greater.
As used herein, the terms "significantly less" and "significantly reduced" refer to a result (for example, an amount of a product of an enzymatic reaction) that is reduced by more than the margin of error inherent in the measurement technique, in one embodiment a decrease by about 2 fold or greater with respect to a baseline activity (for example, the activity of the wild type enzyme in the absence of the inhibitor), in another embodiment, a decrease by about 5 fold or greater, and in still another embodiment a decrease by about 10 fold or greater.
As used herein, the terms "specific binding" and "immunological cross-reactivity" refer to an indicator that two molecules are substantially similar. An indication that two nucleic acid sequences or polypeptides are substantially similar is that the polypeptide encoded by the first nucleic acid is immunologically cross-reactive with, or specifically binds to, the polypeptide encoded by the second nucleic acid. Thus, a polypeptide is typically substantially similar to a second polypeptide, for example, where the two polypeptides differ only by conservative substitutions.
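The fold-change tiers in the definitions of "significant increase" and "significantly reduced" above can be illustrated with a short sketch. It is only an illustration under stated assumptions: the function name `classify_change`, the tuple it returns, and the treatment of "about 2 fold" as exactly 2.0 are hypothetical choices, not part of the specification.

```python
from typing import Optional, Tuple

# Fold-change tiers named in the definitions (about 2-, 5-, and 10-fold),
# ordered from strictest to loosest. Treating "about" as an exact cutoff
# is an assumption made for this sketch.
FOLD_TIERS = (10.0, 5.0, 2.0)


def classify_change(baseline: float, observed: float,
                    margin_of_error: float) -> Tuple[str, Optional[float]]:
    """Return (direction, highest fold tier met) for a measured activity.

    A change counts as significant only if it exceeds the margin of error
    of the measurement technique; the tier is None when the change is
    significant but smaller than 2-fold.
    """
    if baseline <= 0 or observed <= 0:
        raise ValueError("activities must be positive to compute a fold change")
    diff = observed - baseline
    if abs(diff) <= margin_of_error:
        return ("not significant", None)
    if diff > 0:
        direction, fold = "increase", observed / baseline
    else:
        direction, fold = "decrease", baseline / observed
    tier = next((t for t in FOLD_TIERS if fold >= t), None)
    return (direction, tier)


if __name__ == "__main__":
    print(classify_change(10.0, 55.0, 1.0))  # 5.5-fold increase
    print(classify_change(10.0, 4.0, 1.0))   # 2.5-fold decrease
```

The margin of error is passed in by the caller because the definitions tie it to the measurement technique rather than to a fixed number.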
The phrase "specifically (or selectively) binds to an antibody," or "specifically (or selectively) immunoreactive with," when referring to a polypeptide or peptide, refers to a binding reaction which is determinative of the presence of the polypeptide in the presence of a heterogeneous population of polypeptides and other biologics. Thus, under designated immunoassay conditions, the specified antibodies bind to a particular polypeptide and do not bind in a significant amount to other polypeptides present in the sample. Specific binding to an antibody under such conditions can require an antibody that is selected for its specificity for a particular polypeptide. For example, antibodies raised to the polypeptide with the amino acid sequence encoded by any of the nucleic acid sequences of the presently disclosed subject matter can be selected to obtain antibodies specifically immunoreactive with that polypeptide and not with other polypeptides except for polymorphic variants. A variety of immunoassay formats can be used to select antibodies specifically immunoreactive with a particular polypeptide. For example, solid phase ELISA immunoassays, Western blots, or immunohistochemistry are routinely used to select monoclonal antibodies specifically immunoreactive with a polypeptide. See Harlow & Lane, 1988, for a description of immunoassay formats and conditions that can be used to determine specific immunoreactivity. Typically a specific or selective reaction will be at least twice background signal or noise and more typically more than 10 to 100 times background.
As used herein, the term "subsequence" refers to a sequence of nucleic acids or amino acids that comprises a part of a longer sequence of nucleic acids or amino acids (e.g., polypeptide), respectively.
As used herein, the term "substrate" refers to a molecule that an enzyme naturally recognizes and converts to a product in the biochemical pathway in which the enzyme naturally carries out its function; or is a modified version of the molecule, which is also recognized by the enzyme and is converted by the enzyme to a product in an enzymatic reaction similar to the naturally-occurring reaction.
As used herein, the term "suitable growth conditions" refers to growth conditions that are suitable for a certain desired outcome, for example, the production of a recombinant polypeptide or the expression of a nucleic acid molecule.
As used herein, the term "transformation" refers to a process for introducing heterologous DNA into a plant cell, plant tissue, or plant. Transformed plant cells, plant tissue, or plants are understood to encompass not only the end product of a transformation process, but also transgenic progeny thereof.
As used herein, the terms "transformed", "transgenic", and "recombinant" refer to a host organism such as a bacterium or a plant into which a heterologous nucleic acid molecule has been introduced. The nucleic acid molecule can be stably integrated into the genome of the host or the nucleic acid molecule can also be present as an extrachromosomal molecule. Such an extrachromosomal molecule can be auto-replicating. Transformed cells, tissues, or plants are understood to encompass not only the end product of a transformation process, but also transgenic progeny thereof. A "non-transformed," "non-transgenic", or "non-recombinant" host refers to a wild-type organism, e.g., a bacterium or plant, which does not contain the heterologous nucleic acid molecule.
As used herein, the term "viability" refers to a fitness parameter of a plant. Plants are assayed for their homozygous performance of plant development, indicating which polypeptides are essential for plant growth.
III. Nucleic Acids and Polypeptides
In one aspect, the presently disclosed subject matter provides an isolated nucleic acid molecule encoding a stress-related polypeptide, wherein the polypeptide binds to a fragment of a protein selected from the group consisting of OsGF14-c (SEQ ID NO: 113), OsDAD1 (SEQ ID NO: 128), Os006819-2510 (SEQ ID NO: 20), OsCRTC (SEQ ID NO: 134), OsSGT1 (SEQ ID NO: 144), OsERP (SEQ ID NO: 146), OsCHIB1 (SEQ ID NO: 152), OsCS (SEQ ID NO: 156), OsPP2A-2 (SEQ ID NO: 164), and OsCAA90866 (SEQ ID NO: 170). In certain embodiments, the isolated nucleic acid molecule is derived from rice (i.e., Oryza sativa).
As used herein, the phrase "stress-related polypeptide" refers to a protein or polypeptide (note that these two terms are used interchangeably throughout) that is involved in stress, particularly plant stress. Such a polypeptide can be involved in an increase in stress response; conversely, such a polypeptide can be involved in the abrogation or inhibition of stress response. Moreover, the polypeptide can be involved in stress response, for example, when the cell is exposed to a biotic or abiotic stress. A "stress-related polypeptide" of the presently disclosed subject matter is identified by the ability of an increase or decrease in the level of expression of such a polypeptide in a cell to modulate that cell's response to stress.
As used herein, the term "binds" means that a stress-related polypeptide preferentially interacts with a stated target molecule. In some embodiments, that interaction allows a biological read-out (e.g., a positive in the yeast two-hybrid system). In some embodiments, that interaction is measurable (e.g., a KD of at least 10- M).
Disclosed herein are rice (O.
sativa)-derived cDNAs encoding plant proteins that interact with OsGF14-c (SEQ ID NO: 113), OsDAD1 (SEQ ID NO: 128), Os006819-2510 (SEQ ID NO: 20), OsCRTC (SEQ ID NO: 134), OsSGT1 (SEQ ID NO: 144), OsERP (SEQ ID NO: 146), OsCHIB1 (SEQ ID NO: 152), OsCS (SEQ ID NO: 156), OsPP2A-2 (SEQ ID NO: 164), and OsCAA90866 (SEQ ID NO: 170) in the yeast two-hybrid system.
In certain embodiments, the presently disclosed subject matter provides an isolated nucleic acid molecule comprising a nucleotide sequence substantially similar to the nucleotide sequence of the nucleic acid molecule encoding a stress-related polypeptide disclosed herein.
In a broad sense, the term "substantially similar", as used herein with respect to a nucleotide sequence, refers to a nucleotide sequence corresponding to a reference nucleotide sequence (i.e., a nucleotide sequence of a nucleic acid molecule encoding a stress-related protein of the presently disclosed subject matter), wherein the corresponding sequence encodes a polypeptide having substantially the same structure as the polypeptide encoded by the reference nucleotide sequence. In some embodiments, the substantially similar nucleotide sequence encodes the polypeptide encoded by the reference nucleotide sequence (i.e., although the nucleotide sequence is different, the encoded protein has the same amino acid sequence). In some embodiments, "substantially similar" refers to nucleotide sequences having at least 50% sequence identity, or at least 60%, 70%, 80% or 85%, or at least 90% or 95%, or at least 96%, 97% or 99% sequence identity, compared to a reference sequence containing nucleotide sequences encoding one of the stress-related proteins of the presently disclosed subject matter (e.g., the proteins described below in the Examples).
"Substantially similar" also refers to nucleotide sequences having at least 50% identity, or at least 80% identity, or at least 95% identity, or at least 99% identity, to a region of nucleotide sequence encoding a BIOPATH protein and/or a Functional Protein Domain (FPD), wherein the nucleotide sequence comparisons are conducted using GAP analysis as described herein. The term "substantially similar" is specifically intended to include nucleotide sequences wherein the sequence has been modified to optimize expression in particular cells.
A polynucleotide including a nucleotide sequence "substantially similar" to the reference nucleotide sequence hybridizes to a polynucleotide including the reference nucleotide sequence in one embodiment in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM ethylenediamine tetraacetic acid (EDTA) at 50°C with washing in 2X standard saline citrate (SSC), 0.1% SDS at 50°C, in another embodiment in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C with washing in 1X SSC, 0.1% SDS at 50°C, in another embodiment in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C with washing in 0.5X SSC, 0.1% SDS at 50°C, or in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C with washing in 0.1X SSC, 0.1% SDS at 50°C, or in still another embodiment in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C with washing in 0.1X SSC, 0.1% SDS at 65°C.
The term "substantially similar", when used herein with respect to a protein or polypeptide, refers to a protein or polypeptide corresponding to a reference protein (i.e., a stress-related protein of the presently disclosed subject matter), wherein the protein has substantially the same structure and function as the reference protein, where only changes in amino acid sequence that do not materially affect the polypeptide function occur.
When used for a protein or an amino acid sequence, the percentage of identity between the substantially similar and the reference protein or amino acid sequence is at least 30%, or at least 40%, 50%, 60%, 70%, 80%, 85%, or 90%, or at least 95%, or at least 99%, with every individual number falling within this range of at least 30% to at least 99% also being part of the presently disclosed subject matter, using default GAP analysis parameters with the GCG Wisconsin Package SEQWEB® application of GAP, based on the algorithm of Needleman & Wunsch, 1970.
In one embodiment, the polypeptide is involved in a function such as abiotic stress tolerance, disease resistance, enhanced yield or nutritional quality or composition. In one embodiment, the polypeptide is involved in drought resistance.
In one embodiment, isolated polypeptides comprise the amino acid sequences set forth in even numbered SEQ ID NOs: 2-112, and variants having conservative amino acid modifications. The term "conservative modified variants" refers to polypeptides that can be encoded by nucleic acid sequences having degenerate codon substitutions wherein at least one position of one or more selected (or all) codons is substituted with mixed base and/or deoxyinosine residues (Batzer et al., 1991; Ohtsuka et al., 1985; Rossolini et al., 1994). Additionally, one skilled in the art will recognize that individual substitutions, deletions, or additions to a nucleic acid, peptide, polypeptide, or polypeptide sequence that alters, adds, or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a "conservative modification" where the modification results in the substitution of an amino acid with a chemically similar amino acid. Conservative modified variants provide similar biological activity as the unmodified polypeptide. Conservative substitution tables listing functionally similar amino acids are known in the art. See Creighton, 1984.
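The percent-identity cutoffs used above to define "substantially similar" sequences can be made concrete with a minimal global-alignment sketch in the spirit of Needleman & Wunsch. This is not the GCG GAP program and does not use its default parameters: the scoring values (match +1, mismatch -1, linear gap -1), the choice of alignment length as the identity denominator, and the function names are all assumptions made for illustration.

```python
from typing import Tuple


def global_align(a: str, b: str, match: int = 1, mismatch: int = -1,
                 gap: int = -1) -> Tuple[str, str]:
    """Globally align two sequences with a simple linear gap penalty."""
    n, m = len(a), len(b)
    # score[i][j] is the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one best alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                out_a.append(a[i - 1])
                out_b.append(b[j - 1])
                i -= 1
                j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a: str, b: str) -> float:
    """Identical aligned positions as a percentage of alignment length."""
    al_a, al_b = global_align(a, b)
    if not al_a:
        return 0.0
    same = sum(1 for x, y in zip(al_a, al_b) if x == y and x != "-")
    return 100.0 * same / len(al_a)


def is_substantially_similar(a: str, b: str, threshold: float = 50.0) -> bool:
    """Apply one of the identity cutoffs (e.g., 50, 80, 95, or 99 percent)."""
    return percent_identity(a, b) >= threshold


if __name__ == "__main__":
    print(global_align("ACGT", "AGT"))
    print(percent_identity("ACGT", "ACGA"))
```

Because different tools count gaps and alignment length differently, identity values from this sketch are not expected to match those reported by GAP for the same pair of sequences.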
The term "conservatively modified variant" also refers to a peptide having an amino acid residue sequence substantially similar to a sequence of a polypeptide of the presently disclosed subject matter in which one or more residues have been conservatively substituted with a functionally similar residue. Examples of conservative substitutions include the substitution of one non-polar (hydrophobic) residue such as isoleucine, valine, leucine or methionine for another; the substitution of one polar (hydrophilic) residue for another such as between arginine and lysine, between glutamine and asparagine, between glycine and serine; the substitution of one basic residue such as lysine, arginine or histidine for another; or the substitution of one acidic residue, such as aspartic acid or glutamic acid for another.
Amino acid substitutions, such as those which might be employed in modifying the polypeptides described herein, are generally based on the relative similarity of the amino acid side-chain substituents, for example, their hydrophobicity, hydrophilicity, charge, size, and the like. An analysis of the size, shape and type of the amino acid side-chain substituents reveals that arginine, lysine and histidine are all positively charged residues; that alanine, glycine and serine are all of similar size; and that phenylalanine, tryptophan and tyrosine all have a generally similar shape. Therefore, based upon these considerations, arginine, lysine and histidine; alanine, glycine and serine; and phenylalanine, tryptophan and tyrosine; are defined herein as biologically functional equivalents. Other biologically functionally equivalent changes will be appreciated by those of skill in the art.
In making biologically functional equivalent amino acid substitutions, the hydropathic index of amino acids can be considered.
Each amino acid has been assigned a hydropathic index on the basis of its hydrophobicity and charge characteristics; these are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine (+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5).
The importance of the hydropathic amino acid index in conferring interactive biological function on a protein is generally understood in the art (Kyte & Doolittle, 1982, incorporated herein by reference). It is known that certain amino acids can be substituted for other amino acids having a similar hydropathic index or score and still retain a similar biological activity. Substitutions of amino acids involve amino acids for which the hydropathic indices are in one embodiment within ±2 of the original value, in another embodiment within ±1 of the original value, and in still another embodiment within ±0.5 of the original value in making changes based upon the hydropathic index.
It is also understood in the art that the substitution of like amino acids can be made effectively on the basis of hydrophilicity. U.S. Pat. No. 4,554,101, incorporated herein by reference, states that the greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino acids, correlates with its immunogenicity and antigenicity, i.e. with a biological property of the protein. It is understood that an amino acid can be substituted for another having a similar hydrophilicity value and still obtain a biologically equivalent protein. As detailed in U.S. Patent No.
4,554,101, the following hydrophilicity values have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ± 1); glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5 ± 1); alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4).
Substitutions of amino acids involve amino acids for which the hydrophilicity values are in one embodiment within ±2 of the original value, in another embodiment within ±1 of the original value, and in still another embodiment within ±0.5 of the original value in making changes based upon similar hydrophilicity values.
While discussion has focused on functionally equivalent polypeptides arising from amino acid changes, it will be appreciated that these changes can be effected by alteration of the encoding DNA, taking into consideration also that the genetic code is degenerate and that two or more codons can code for the same amino acid.
In one embodiment, the polypeptide is expressed in a specific location or tissue of a plant. In one embodiment, the location or tissue includes, but is not limited to, epidermis, vascular tissue, meristem, cambium, cortex, or pith. In another embodiment, the location or tissue is leaf or sheath, root, flower, and developing ovule or seed. In another embodiment, the location or tissue can be, for example, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, or flower. In yet another embodiment, the location or tissue is a seed.
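The substitution windows described above for the hydropathic index and for hydrophilicity values (within ±2, ±1, or ±0.5 of the original value) can be sketched as a simple table lookup. The numeric values are the ones listed in the text; the three-letter keys, the function names, and the decision to ignore the "± 1" uncertainty attached to aspartate, glutamate, and proline are assumptions made for this illustration.

```python
from typing import Dict, List

# Hydropathic index values as listed in the text (Kyte & Doolittle, 1982).
HYDROPATHY: Dict[str, float] = {
    "Ile": 4.5, "Val": 4.2, "Leu": 3.8, "Phe": 2.8, "Cys": 2.5,
    "Met": 1.9, "Ala": 1.8, "Gly": -0.4, "Thr": -0.7, "Ser": -0.8,
    "Trp": -0.9, "Tyr": -1.3, "Pro": -1.6, "His": -3.2, "Glu": -3.5,
    "Gln": -3.5, "Asp": -3.5, "Asn": -3.5, "Lys": -3.9, "Arg": -4.5,
}

# Hydrophilicity values as listed in the text (U.S. Pat. No. 4,554,101).
# The "+/- 1" ranges given for Asp, Glu, and Pro are ignored here.
HYDROPHILICITY: Dict[str, float] = {
    "Arg": 3.0, "Lys": 3.0, "Asp": 3.0, "Glu": 3.0, "Ser": 0.3,
    "Asn": 0.2, "Gln": 0.2, "Gly": 0.0, "Thr": -0.4, "Pro": -0.5,
    "Ala": -0.5, "His": -0.5, "Cys": -1.0, "Met": -1.3, "Val": -1.5,
    "Leu": -1.8, "Ile": -1.8, "Tyr": -2.3, "Phe": -2.5, "Trp": -3.4,
}


def within_window(table: Dict[str, float], original: str,
                  replacement: str, window: float) -> bool:
    """True if the two residues' values differ by no more than `window`."""
    # Round to one decimal so float noise does not flip boundary cases.
    return round(abs(table[original] - table[replacement]), 1) <= window


def candidates(table: Dict[str, float], original: str,
               window: float) -> List[str]:
    """Residues whose value lies within `window` of the original residue."""
    return sorted(r for r in table
                  if r != original and within_window(table, original, r, window))


if __name__ == "__main__":
    for w in (2.0, 1.0, 0.5):
        print(w, candidates(HYDROPATHY, "Ile", w))
```

Passing the table as an argument lets the same check run against either scale, which mirrors how the text applies the same three windows to both.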
The polypeptides of the presently disclosed subject matter, fragments thereof, or variants thereof, can comprise any number of contiguous amino acid residues from a polypeptide of the presently disclosed subject matter, wherein the number of residues is selected from the group of integers consisting of from 10 to the number of residues in a full-length polypeptide of the presently disclosed subject matter. In one embodiment, the portion or fragment of the polypeptide is a functional polypeptide. The presently disclosed subject matter includes active polypeptides having specific activity of at least in one embodiment 20%, in another embodiment 30%, in another embodiment 40%, in another embodiment 50%, in another embodiment 60%, in another embodiment 70%, in another embodiment 80%, in another embodiment 90%, and in still another embodiment 95% that of the native (non-synthetic) endogenous polypeptide. Further, the substrate specificity (kcat/Km) can be substantially similar to the native (non-synthetic), endogenous polypeptide. Typically the Km will be at least in one embodiment 30%, in another embodiment 40%, in another embodiment 50% of the native, endogenous polypeptide; and in another embodiment at least 60%, in another embodiment 70%, in another embodiment 80%, and in yet another embodiment 90% of the native, endogenous polypeptide. Methods of assaying and quantifying measures of activity and substrate specificity are well known to those of skill in the art.
The isolated polypeptides of the presently disclosed subject matter can elicit production of an antibody specifically reactive to a polypeptide of the presently disclosed subject matter when presented as an immunogen.
Therefore, the polypeptides of the presently disclosed subject matter can be employed as immunogens for constructing antibodies immunoreactive to a polypeptide of the presently disclosed subject matter for such purposes including, but not limited to, immunoassays or polypeptide purification techniques. Immunoassays for determining binding are well known to those of skill in the art and include, but are not limited to, enzyme-linked immunosorbent assays (ELISAs) and competitive immunoassays.

IV. The Yeast Two-Hybrid System

The yeast two-hybrid system is a well known system which is based on the finding that most eukaryotic transcription activators are modular (see e.g., Gyuris et al., 1993; Bartel & Fields, 1997; Feys et al., 2001). The yeast two-hybrid system uses: 1) a plasmid that directs the synthesis of a "bait" (a known protein which is brought to the yeast's DNA by being fused to a DNA-binding domain); 2) one or more reporter genes ("reporters") with upstream binding sites for the bait; and 3) a plasmid that directs the synthesis of proteins fused to activation domains and other useful moieties ("activation-tagged proteins", or "prey").

In all of the Examples described below, an automated, high throughput yeast two-hybrid assay technology (provided by Myriad Genetics Inc., Salt Lake City, Utah, United States of America) was used to search for protein interactions with the bait proteins.

Briefly, the target protein (e.g., OsE2F1) was expressed in yeast as a fusion to the DNA-binding domain of the yeast Gal4p polypeptide. DNA encoding the target protein or a fragment of this protein was amplified from cDNA by PCR or prepared from an available clone. The resulting DNA fragment was cloned by ligation or recombination into a DNA-binding domain vector (e.g., pGBT9, pGBT.C, pAS2-1) such that an in-frame fusion between the Gal4p and target protein sequences was created.
The resulting construct, the target gene construct, was introduced by transformation into a haploid yeast strain. A screening protocol was then used to search the individual baits against two activation domain libraries of assorted peptide motifs of greater than five million cDNA clones. The libraries were derived from RNA isolated from leaves, stems, and roots of rice plants grown in normal conditions, plus tissues from plants exposed to various stresses (input trait library), and from various seed stages, callus, and early and late panicle (output trait library). To screen, a library of activation domain fusions (i.e., O. sativa cDNA cloned into an activation domain vector) was introduced by transformation into a haploid yeast strain of the opposite mating type. The yeast strain that carried the activation domain constructs contained one or more Gal4p-responsive reporter genes, the expression of which can be monitored. Non-limiting examples of some yeast reporter strains include Y190, PJ69, and CBY1 4a.

Yeast carrying the target gene construct was combined with yeast carrying the activation domain library. The two yeast strains mated to form diploid yeast and were plated on media that selected for expression of one or more Gal4p-responsive reporter genes. Thus, both hybrid proteins (i.e., the target "bait" protein and the activation domain "prey" protein) were expressed in a yeast reporter strain where an interaction between the test proteins results in transcription of the reporter genes TRP1 and LEU2, allowing growth on selective medium lacking tryptophan and leucine. Colonies that arose after incubation were selected for further characterization. The activation domain plasmid was isolated from each colony obtained in the two-hybrid search.
The sequence of the insert in this construct was obtained by sequence analysis (e.g., Sanger's dideoxy nucleotide chain termination method; see Ausubel et al., 1988, including updates up to 2002). Thus, the identity of positives obtained from these searches was determined by sequence analysis against proprietary and public (e.g., GENBANK®) nucleic acid and protein databases.

Interaction of the activation domain fusion with the target protein was confirmed by testing for the specificity of the interaction. The activation domain construct was co-transformed into a yeast reporter strain with either the original target protein construct or a variety of other DNA-binding domain constructs. Expression of the reporter genes in the presence of the target protein but not with other test proteins indicated that the interaction was genuine.

To further characterize the genes encoding the interacting proteins, the nucleic acid sequences of the baits and preys were compared with nucleic acid sequences present on Torrey Mesa Research Institute (TMRI)'s proprietary GENECHIP® Rice Genome Array (Affymetrix, Santa Clara, California, United States of America; see Zhu et al., 2001). The rice genome array contained 25-mer oligonucleotide probes with sequences corresponding to the 3' ends of 21,000 predicted open reading frames found in approximately 42,000 contigs that make up the rice genome map (see Goff et al., 2002). Sixteen different probes were used to measure the expression level of each nucleic acid. The sequences of the probes are available at http://tmri.org/geneexpweb/. The calculated expression value was determined based on the observed expression level minus the noise background associated with each probe. Experiments included evaluating the differential gene expression from various plant tissues comprising seed, root, leaf and stem, panicle, and pollen.
Gene expression was also measured in plants exposed to environmental cold (i.e., 14°C), osmotic pressure (growth media supplemented with 260 mM mannitol), drought (media supplemented with 25% polyethylene glycol 8000), salt (media supplemented with 150 mM NaCl), abscisic acid (ABA)-inducible stresses (media supplemented with 50 µM ABA; see Chen et al., 2002), infection by the fungal pathogen Magnaporthe grisea, and treatment with plant hormones (jasmonic acid (JA; 100 µM), gibberellin (GA3; 50 µM), and abscisic acid) and with herbicides benzylamino purine (BAP; 10 µM), 2,4-dichlorophenoxyacetic acid (2,4-D; 2 mg/l), and BL2 (10 pM).

Many of the stress-related proteins of the presently disclosed subject matter interact with one another.

V. Controlling and Modulating the Expression of Nucleic Acid Molecules

A. General Considerations

One aspect of the presently disclosed subject matter provides compositions and methods for modulating (i.e., increasing or decreasing) the level of nucleic acid molecules and/or polypeptides of the presently disclosed subject matter in plants. In particular, the nucleic acid molecules and polypeptides of the presently disclosed subject matter are expressed constitutively, temporally, or spatially (e.g., at developmental stages), in certain tissues, and/or quantities, which are uncharacteristic of non-recombinantly engineered plants. Therefore, the presently disclosed subject matter provides utility in such exemplary applications as altering the specified characteristics identified above.

The isolated nucleic acid molecules of the presently disclosed subject matter are useful for expressing a polypeptide of the presently disclosed subject matter in a recombinantly engineered cell such as a bacterial, yeast, insect, mammalian, or plant cell.
Expressing cells can produce the polypeptide in a non-natural condition (e.g., in quantity, composition, location and/or time) because they have been genetically altered to do so. Those skilled in the art are knowledgeable in the numerous expression systems available for expression of nucleic acids encoding a polypeptide of the presently disclosed subject matter.
In another aspect, the presently disclosed subject matter features a stress-related polypeptide encoded by a nucleic acid molecule disclosed herein. In certain embodiments, the stress-related polypeptide is isolated.

The presently disclosed subject matter further provides a method for modifying (i.e., increasing or decreasing) the concentration or composition of a polypeptide of the presently disclosed subject matter in a plant or part thereof. Modification can be effected by increasing or decreasing the concentration and/or the composition (i.e., the ratio of the polypeptides of the presently disclosed subject matter) in a plant. The method comprises introducing into a plant cell an expression cassette comprising a nucleic acid molecule of the presently disclosed subject matter as disclosed above to obtain a transformed plant cell or tissue, and culturing the transformed plant cell or tissue. The nucleic acid molecule can be under the regulation of a constitutive or inducible promoter. The method can further comprise inducing or repressing expression of a nucleic acid molecule of a sequence in the plant for a time sufficient to modify the concentration and/or composition in the plant or plant part.

A plant or plant part having modified expression of a nucleic acid molecule of the presently disclosed subject matter can be analyzed and selected using methods known to those skilled in the art including, but not limited to, Southern blotting, DNA sequencing, or PCR analysis using primers specific to the nucleic acid molecule and detecting amplicons produced therefrom.
In general, a concentration or composition is increased or decreased by at least in one embodiment 5%, in another embodiment 10%, in another embodiment 20%, in another embodiment 30%, in another embodiment 40%, in another embodiment 50%, in another embodiment 60%, in another embodiment 70%, in another embodiment 80%, and in still another embodiment 90% relative to a native control plant, plant part, or cell lacking the expression cassette.

B. Modulation of Expression of Nucleic Acid Molecules

The compositions of the presently disclosed subject matter include plant nucleic acid molecules, and the amino acid sequences of the polypeptides or partial-length polypeptides encoded by nucleic acid molecules comprising an open reading frame. These sequences can be employed to alter the expression of a particular gene corresponding to the open reading frame by decreasing or eliminating expression of that plant gene or by overexpressing a particular gene product. Methods of this embodiment of the presently disclosed subject matter include stably transforming a plant with a nucleic acid molecule of the presently disclosed subject matter that includes an open reading frame operatively linked to a promoter capable of driving expression of that open reading frame (sense or antisense) in a plant cell. By "portion" or "fragment", as it relates to a nucleic acid molecule that comprises an open reading frame or a fragment thereof encoding a partial-length polypeptide having the activity of the full-length polypeptide, is meant a sequence having in one embodiment at least 80 nucleotides, in another embodiment at least 150 nucleotides, and in still another embodiment at least 400 nucleotides.
If not employed for expression, a "portion" or "fragment" means in representative embodiments at least 9, or 12, or 15, or at least 20, consecutive nucleotides (e.g., probes and primers or other oligonucleotides) corresponding to the nucleotide sequence of the nucleic acid molecules of the presently disclosed subject matter. Thus, to express a particular gene product, the method comprises introducing into a plant, plant cell, or plant tissue an expression cassette comprising a promoter operatively linked to an open reading frame so as to yield a transformed differentiated plant, transformed cell, or transformed tissue. Transformed cells or tissue can be regenerated to provide a transformed differentiated plant. The transformed differentiated plant or cells thereof can express the open reading frame in an amount that alters the amount of the gene product in the plant or cells thereof, which product is encoded by the open reading frame. The presently disclosed subject matter also provides a transformed plant prepared by the methods disclosed herein, as well as progeny and seed thereof.

The presently disclosed subject matter further includes a nucleotide sequence that is complementary to one (hereinafter "test" sequence) that hybridizes under stringent conditions to a nucleic acid molecule of the presently disclosed subject matter, as well as an RNA molecule that is transcribed from the nucleic acid molecule. When hybridization is performed under stringent conditions, either the test or nucleic acid molecule of the presently disclosed subject matter can be present on a support: e.g., on a membrane or on a DNA chip.
Thus, either a denatured test or nucleic acid molecule of the presently disclosed subject matter is first bound to a support and hybridization is effected for a specified period of time at a temperature of, in one embodiment, between 55°C and 70°C, in 2X SSC containing 0.1% SDS, followed by rinsing the support at the same temperature but with a buffer having a reduced SSC concentration. Depending upon the degree of stringency required, such reduced concentration buffers are typically 1X SSC containing 0.1% SDS, 0.5X SSC containing 0.1% SDS, or 0.1X SSC containing 0.1% SDS.

In a further embodiment, the presently disclosed subject matter provides a transformed plant host cell, or one obtained through breeding, capable of over-expressing, under-expressing, or having a knockout of a polypeptide-encoding gene and/or its gene product(s). The plant cell is transformed with at least one such expression vector wherein the plant host cell can be used to regenerate plant tissue or an entire plant, or seed therefrom, in which the effects of expression, including overexpression and underexpression, of the introduced sequence or sequences can be measured in vitro or in planta.

In another aspect, the presently disclosed subject matter features an isolated stress-related polypeptide, wherein the polypeptide binds to a fragment of a protein selected from the group consisting of OsGF14-c (SEQ ID NO: 113), OsDAD1 (SEQ ID NO: 128), Os006819-2510 (SEQ ID NO: 20), OsCRTC (SEQ ID NO: 134), OsSGT1 (SEQ ID NO: 144), OsERP (SEQ ID NO: 146), OsCHIB1 (SEQ ID NO: 152), OsCS (SEQ ID NO: 156), OsPP2A-2 (SEQ ID NO: 164), and OsCAA90866 (SEQ ID NO: 170). In some embodiments, the presently disclosed subject matter features an isolated polypeptide comprising or consisting of an amino acid sequence substantially similar to the amino acid sequence of an isolated stress-related polypeptide of the presently disclosed subject matter.
Because the proteins of the presently disclosed subject matter have a role in stress, in certain embodiments, a cell into which a nucleic acid molecule of the presently disclosed subject matter has been introduced has a different stress response as compared to a cell into which the nucleic acid molecule has not been introduced.

In another aspect, the presently disclosed subject matter features a method for modulating stress response of a plant cell, the method comprising introducing an isolated nucleic acid molecule encoding a stress-related polypeptide into the plant cell, wherein the polypeptide binds to a fragment of a protein selected from the group consisting of OsGF14-c (SEQ ID NO: 113), OsDAD1 (SEQ ID NO: 128), Os006819-2510 (SEQ ID NO: 20), OsCRTC (SEQ ID NO: 134), OsSGT1 (SEQ ID NO: 144), OsERP (SEQ ID NO: 146), OsCHIB1 (SEQ ID NO: 152), OsCS (SEQ ID NO: 156), OsPP2A-2 (SEQ ID NO: 164), and OsCAA90866 (SEQ ID NO: 170), wherein the polypeptide is expressed by the cell.

In another aspect, the presently disclosed subject matter features a method for modulating stress response of a plant cell comprising introducing an isolated nucleic acid molecule encoding a stress-related polypeptide into the plant cell, wherein the polypeptide binds to a fragment of a protein selected from the group consisting of OsGF14-c (SEQ ID NO: 113), OsDAD1 (SEQ ID NO: 128), Os006819-2510 (SEQ ID NO: 20), OsCRTC (SEQ ID NO: 134), OsSGT1 (SEQ ID NO: 144), OsERP (SEQ ID NO: 146), OsCHIB1 (SEQ ID NO: 152), OsCS (SEQ ID NO: 156), OsPP2A-2 (SEQ ID NO: 164), and OsCAA90866 (SEQ ID NO: 170), wherein expression of the polypeptide encoded by the nucleic acid molecule is reduced in the cell.

As discussed herein, the stress-related proteins described herein can affect a cell under conditions of stress (e.g., when the plant is exposed to biotic or abiotic stress).
Accordingly, by changing the amount of a stress-related protein of the presently disclosed subject matter in a plant cell, the response of that plant cell to stress can be modulated.

In some situations, increasing expression of a stress-related protein of the presently disclosed subject matter in a cell will cause that cell to increase its stress response (in some cases, rate of proliferation). In other situations, increasing expression of a stress-related protein of the presently disclosed subject matter in a cell causes that cell to reduce its stress response (in some cases, rate of proliferation). Similarly, decreasing the expression of a stress-related protein of the presently disclosed subject matter in a cell can increase or decrease that cell's stress response (in some cases, rate of proliferation). What is relevant is that the stress response of the cell changes if the level of expression of a stress-related protein of the presently disclosed subject matter is either increased or decreased.

Increasing the level of expression of a stress-related protein of the presently disclosed subject matter in a cell is a relatively simple matter. For example, overexpression of the protein can be accomplished by transforming the cell with a nucleic acid molecule encoding the protein according to standard methods such as those described above.

Reducing the level of expression of a stress-related protein of the presently disclosed subject matter in a cell is likewise simply accomplished using standard methods. For example, an antisense RNA or DNA oligonucleotide that is complementary to the sense strand (i.e., the mRNA strand) of a nucleic acid molecule encoding the protein can be administered to the cell to reduce expression of that protein in that cell (see e.g., Agrawal, 1993; U.S. Patent No. 5,929,226).
The modulation in expression of the nucleic acid molecules of the presently disclosed subject matter can be achieved, for example, in one of the following ways:

1. "Sense" Suppression

Alteration of the expression of a nucleotide sequence of the presently disclosed subject matter, in one embodiment reduction of its expression, is obtained by "sense" suppression (referenced in e.g., Jorgensen et al., 1996). In this case, the entirety or a portion of a nucleotide sequence of the presently disclosed subject matter is comprised in a DNA molecule. The DNA molecule can be operatively linked to a promoter functional in a cell comprising the target gene, in one embodiment a plant cell, and introduced into the cell, in which the nucleotide sequence is expressible. The nucleotide sequence is inserted in the DNA molecule in the "sense orientation", meaning that the coding strand of the nucleotide sequence can be transcribed. In one embodiment, the nucleotide sequence is fully translatable and all the genetic information comprised in the nucleotide sequence, or portion thereof, is translated into a polypeptide. In another embodiment, the nucleotide sequence is partially translatable and a short peptide is translated. In one embodiment, this is achieved by inserting at least one premature stop codon in the nucleotide sequence, which brings translation to a halt. In another embodiment, the nucleotide sequence is transcribed but no translation product is made. This is usually achieved by removing the start codon, i.e., the "ATG", of the polypeptide encoded by the nucleotide sequence. In a further embodiment, the DNA molecule comprising the nucleotide sequence, or a portion thereof, is stably integrated in the genome of the plant cell. In another embodiment, the DNA molecule comprising the nucleotide sequence, or a portion thereof, is comprised in an extrachromosomally replicating molecule.
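The three translational outcomes described for sense constructs (fully translatable, partially translatable after a premature stop codon, and untranslated after removal of the "ATG") can be illustrated with a minimal Python sketch. The 18-nucleotide sequence is a made-up toy example, the `translate` function is hypothetical, and the simplification that translation begins at the first ATG and reads in frame to the first stop codon is an assumption made only for illustration.

```python
from itertools import product

# Standard genetic code in compact form; codons are ordered T, C, A, G
# at each position, and "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(coding_strand):
    """Translate from the first ATG to the first in-frame stop codon.

    Returns an empty string when the sequence contains no ATG.
    """
    seq = coding_strand.upper()
    start = seq.find("ATG")
    if start == -1:
        return ""
    peptide = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        peptide.append(aa)
    return "".join(peptide)


# Toy open reading frame (not a sequence from the disclosure).
full_length = "ATGGCTAAAGGTTGGTAA"
# Third codon (AAA) replaced by a stop codon (TAA).
premature_stop = full_length[:6] + "TAA" + full_length[9:]
# Start codon removed.
no_start_codon = full_length[3:]

if __name__ == "__main__":
    print(translate(full_length))      # full-length toy peptide
    print(translate(premature_stop))   # truncated peptide
    print(translate(no_start_codon))   # no translation product
```

The same codon table also shows the degeneracy of the genetic code: for example, `CODON_TABLE["GCT"]` and `CODON_TABLE["GCC"]` both map to alanine.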
In transgenic plants containing one of the DNA molecules disclosed immediately above, the expression of the nucleotide sequence corresponding to the nucleotide sequence comprised in the DNA molecule can be reduced. The nucleotide sequence in the DNA molecule in one embodiment is at least 70% identical to the nucleotide sequence the expression of which is reduced, in another embodiment is at least 80% identical, in another embodiment is at least 90% identical, in another embodiment is at least 95% identical, and in still another embodiment is at least 99% identical.

2. "Antisense" Suppression

In another embodiment, the alteration of the expression of a nucleotide sequence of the presently disclosed subject matter, for example the reduction of its expression, is obtained by "antisense" suppression. The entirety or a portion of a nucleotide sequence of the presently disclosed subject matter is comprised in a DNA molecule. The DNA molecule can be operatively linked to a promoter functional in a plant cell, and introduced in a plant cell, in which the nucleotide sequence is expressible. The nucleotide sequence is inserted in the DNA molecule in the "antisense orientation", meaning that the reverse complement (sometimes also called the non-coding strand) of the nucleotide sequence can be transcribed. In one embodiment, the DNA molecule comprising the nucleotide sequence, or a portion thereof, is stably integrated in the genome of the plant cell. In another embodiment the DNA molecule comprising the nucleotide sequence, or a portion thereof, is comprised in an extrachromosomally replicating molecule. Several publications describing this approach are cited for further illustration (Green et al., 1986; van der Krol et al., 1991; Powell et al., 1989; Ecker & Davis, 1986).
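The difference between the "sense" and "antisense" orientations can be sketched in a few lines of Python. The function names and the nine-nucleotide example are illustrative assumptions; the sketch only shows that an insert cloned as the reverse complement yields a transcript able to base-pair with the sense mRNA.

```python
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")
_RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.upper().translate(_DNA_COMPLEMENT)[::-1]


def transcribe(insert):
    """RNA with the same sequence as the insert strand read 5'->3'
    downstream of the promoter (T replaced by U)."""
    return insert.upper().replace("T", "U")


def can_base_pair(rna_a, rna_b):
    """True if rna_b is the exact reverse complement of rna_a."""
    return rna_a.translate(_RNA_COMPLEMENT)[::-1] == rna_b


# Toy coding-strand fragment (not a sequence from the disclosure).
coding = "ATGGCCTAA"
sense_rna = transcribe(coding)                          # sense orientation
antisense_rna = transcribe(reverse_complement(coding))  # antisense orientation

if __name__ == "__main__":
    print(sense_rna, antisense_rna)
    print(can_base_pair(sense_rna, antisense_rna))
```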
In transgenic plants containing one of the DNA molecules disclosed immediately above, the expression of the nucleotide sequence corresponding to the nucleotide sequence comprised in the DNA molecule can be reduced. The nucleotide sequence in the DNA molecule is in one embodiment at least 70% identical to the nucleotide sequence the expression of which is reduced, in another embodiment at least 80% identical, in another embodiment at least 90% identical, in another embodiment at least 95% identical, and in still another embodiment at least 99% identical.

3. Homologous Recombination

In another embodiment, at least one genomic copy corresponding to a nucleotide sequence of the presently disclosed subject matter is modified in the genome of the plant by homologous recombination as further illustrated in Paszkowski et al., 1988. This technique uses the ability of homologous sequences to recognize each other and to exchange nucleotide sequences between respective nucleic acid molecules by a process known in the art as homologous recombination. Homologous recombination can occur between the chromosomal copy of a nucleotide sequence in a cell and an incoming copy of the nucleotide sequence introduced in the cell by transformation. Specific modifications are thus accurately introduced in the chromosomal copy of the nucleotide sequence. In one embodiment, the regulatory elements of the nucleotide sequence of the presently disclosed subject matter are modified. Such regulatory elements are easily obtainable by screening a genomic library using the nucleotide sequence of the presently disclosed subject matter, or a portion thereof, as a probe. The existing regulatory elements are replaced by different regulatory elements, thus altering expression of the nucleotide sequence, or they are mutated or deleted, thus abolishing the expression of the nucleotide sequence.
In another embodiment, the nucleotide sequence is modified by deletion of a part of the nucleotide sequence or the entire nucleotide sequence, or by mutation. Expression of a mutated polypeptide in a plant cell is also provided in the presently disclosed subject matter. Recent refinements of this technique to disrupt endogenous plant genes have been disclosed (Kempin et al., 1997 and Miao & Lam, 1995).

In one embodiment, a mutation in the chromosomal copy of a nucleotide sequence is introduced by transforming a cell with a chimeric oligonucleotide composed of a contiguous stretch of RNA and DNA residues in a duplex conformation with double hairpin caps on the ends. An additional feature of the oligonucleotide is, for example, the presence of 2'-O-methylation at the RNA residues. The RNA/DNA sequence is designed to align with the sequence of a chromosomal copy of a nucleotide sequence of the presently disclosed subject matter and to contain the desired nucleotide change. For example, this technique is further illustrated in U.S. Patent No. 5,501,967 and Zhu et al., 1999.

4. Ribozymes

In a further embodiment, an RNA coding for a polypeptide of the presently disclosed subject matter is cleaved by a catalytic RNA, or ribozyme, specific for such RNA. The ribozyme is expressed in transgenic plants and results in reduced amounts of RNA coding for the polypeptide of the presently disclosed subject matter in plant cells, thus leading to reduced amounts of polypeptide accumulated in the cells. This method is further illustrated in U.S. Patent No. 4,987,071.

5. Dominant-Negative Mutants

In another embodiment, the activity of a polypeptide encoded by the nucleotide sequences of the presently disclosed subject matter is changed. This is achieved by expression of dominant negative mutants of the polypeptides in transgenic plants, leading to the loss of activity of the endogenous polypeptide.

6. Aptamers

In a further embodiment, the activity of a polypeptide of the presently disclosed subject matter is inhibited by expressing in transgenic plants nucleic acid ligands, so-called aptamers, which specifically bind to the polypeptide. Aptamers can be obtained by the SELEX (Systematic Evolution of Ligands by Exponential Enrichment) method. In the SELEX method, a candidate mixture of single stranded nucleic acids having regions of randomized sequence is contacted with the polypeptide and those nucleic acids having an increased affinity to the target are partitioned from the remainder of the candidate mixture. The partitioned nucleic acids are amplified to yield a ligand-enriched mixture. After several iterations a nucleic acid with optimal affinity to the polypeptide is obtained and is used for expression in transgenic plants. This method is further illustrated in U.S. Patent No. 5,270,163.

7. Zinc Finger Polypeptides

A zinc finger polypeptide that binds a nucleotide sequence of the presently disclosed subject matter or its regulatory region can also be used to alter expression of the nucleotide sequence. In alternative embodiments, transcription of the nucleotide sequence is reduced or increased. Zinc finger polypeptides are disclosed in, for example, Beerli et al., 1998, or in WO 95/19431, WO 98/54311, or WO 96/06166, all incorporated herein by reference in their entirety.

8. dsRNA

Alteration of the expression of a nucleotide sequence of the presently disclosed subject matter can also be obtained by double stranded RNA (dsRNA) interference (RNAi) as disclosed, for example, in WO 99/32619, WO 99/53050, or WO 99/61631, all incorporated herein by reference in their entireties. In one embodiment, the alteration of the expression of a nucleotide sequence of the presently disclosed subject matter, in one embodiment the reduction of its expression, is obtained by dsRNA interference.
The entirety, or in one embodiment a portion, of a nucleotide sequence of the presently disclosed subject matter, can be comprised in a DNA molecule. The size of the DNA molecule is in one embodiment from 100 to 1000 nucleotides or more, the optimal size to be determined empirically. Two copies of the identical DNA molecule are linked, separated by a spacer DNA molecule, such that the first and second copies are in opposite orientations. In one embodiment, the first copy of the DNA molecule is the reverse complement (also known as the non-coding strand) and the second copy is the coding strand; in another embodiment, the first copy is the coding strand, and the second copy is the reverse complement. The size of the spacer DNA molecule is in one embodiment 200 to 10,000 nucleotides, in another embodiment 400 to 5000 nucleotides, and in yet another embodiment 600 to 1500 nucleotides in length. The spacer is in one embodiment a random piece of DNA, in another embodiment a random piece of DNA without homology to the target organism for dsRNA interference, and in still another embodiment a functional intron that is effectively spliced by the target organism. The two copies of the DNA molecule separated by the spacer are operatively linked to a promoter functional in a plant cell, and introduced in a plant cell in which the nucleotide sequence is expressible. In one embodiment, the DNA molecule comprising the nucleotide sequence, or a portion thereof, is stably integrated in the genome of the plant cell. In another embodiment, the DNA molecule comprising the nucleotide sequence, or a portion thereof, is comprised in an extrachromosomally replicating molecule. Several publications describing this approach are cited for further illustration (Waterhouse et al., 1998; Chuang & Meyerowitz, 2000; Smith et al., 2000).
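The inverted-repeat arrangement described above (two copies of the same fragment in opposite orientations, separated by a spacer) can be sketched in Python as follows. The function `build_inverted_repeat`, its parameter names, and the placeholder sequences are hypothetical; the default length limits are taken from the broadest embodiments stated above (a fragment of at least 100 nucleotides and a spacer of 200 to 10,000 nucleotides), and a real construct would also need a promoter and a biologically sensible spacer.

```python
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.upper().translate(_DNA_COMPLEMENT)[::-1]


def build_inverted_repeat(fragment, spacer, sense_first=True,
                          min_fragment=100, spacer_range=(200, 10_000)):
    """Join two copies of `fragment` in opposite orientations around `spacer`.

    sense_first=True  -> coding strand, spacer, reverse complement
    sense_first=False -> reverse complement, spacer, coding strand
    Raises ValueError if a length falls outside the assumed limits.
    """
    if len(fragment) < min_fragment:
        raise ValueError("fragment is shorter than the assumed minimum")
    low, high = spacer_range
    if not low <= len(spacer) <= high:
        raise ValueError("spacer length is outside the assumed range")
    sense = fragment.upper()
    antisense = reverse_complement(sense)
    first, second = (sense, antisense) if sense_first else (antisense, sense)
    return first + spacer.upper() + second


if __name__ == "__main__":
    # Placeholder sequences only: a 120-nt repeat unit and a 300-nt spacer.
    fragment = "ATGGCC" * 20
    spacer = "A" * 300
    construct = build_inverted_repeat(fragment, spacer)
    # The two arms are reverse complements of each other, so the transcript
    # can fold back on itself to form the double-stranded region.
    print(len(construct))
    print(reverse_complement(construct[-120:]) == construct[:120])
```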
In another non-limiting example, RNA interference (RNAi) or post-transcriptional gene silencing (PTGS) can be employed to reduce the level of expression of a stress-related protein of the presently disclosed subject matter in a cell. As used herein, the terms "RNA interference" and "post-transcriptional gene silencing" are used interchangeably and refer to a process of sequence-specific modulation of gene expression mediated by a small interfering RNA (siRNA; see generally Fire et al., 1998), resulting in null or hypomorphic phenotypes. Thus, because nucleotide sequences encoding the stress-related proteins of the presently disclosed subject matter are described herein, RNAi molecules can be readily designed. Indeed, constructs encoding an RNAi molecule have been developed which continuously synthesize an RNAi molecule, resulting in prolonged repression of expression of the targeted gene (Brummelkamp et al., 2002).

In transgenic plants containing one of the DNA molecules disclosed immediately above, the expression of the nucleotide sequence corresponding to the nucleotide sequence comprised in the DNA molecule is in one embodiment reduced. In one embodiment, the nucleotide sequence in the DNA molecule is at least 70% identical to the nucleotide sequence the expression of which is reduced, in another embodiment it is at least 80% identical, in another embodiment it is at least 90% identical, in another embodiment it is at least 95% identical, and in still another embodiment it is at least 99% identical.

9. Insertion of a DNA Molecule (Insertional Mutagenesis)

In one embodiment, a DNA molecule is inserted into a chromosomal copy of a nucleotide sequence of the presently disclosed subject matter, or into a regulatory region thereof. In one embodiment, such DNA molecule comprises a transposable element capable of transposition in a plant cell, such as, for example, Ac/Ds, Em/Spm, mutator.
Alternatively, the DNA molecule comprises a T-DNA border of an Agrobacterium T-DNA. The DNA molecule can also comprise a recombinase or integrase recognition site that can be used to remove part of the DNA molecule from the chromosome of the plant cell. Methods of insertional mutagenesis using T-DNA, transposons, oligonucleotides, or other methods known to those skilled in the art are also encompassed. Methods of using T-DNA and transposons for insertional mutagenesis are disclosed in Winkler & Feldmann, 1989, and Martienssen, 1998, incorporated herein by reference in their entireties.

10. Deletion Mutagenesis

In yet another embodiment, a mutation of a nucleic acid molecule of the presently disclosed subject matter is created in the genomic copy of the sequence in the cell or plant by deletion of a portion of the nucleotide sequence or regulatory sequence. Methods of deletion mutagenesis are known to those skilled in the art. See e.g., Miao & Lam, 1995.

In yet another embodiment, a deletion is created at random in a large population of plants by chemical mutagenesis or irradiation and a plant with a deletion in a gene of the presently disclosed subject matter is isolated by forward or reverse genetics. Irradiation with fast neutrons or gamma rays is known to cause deletion mutations in plants (Silverstone et al., 1998; Bruggemann et al., 1996; Redei & Koncz, 1992). Deletion mutations in a gene of the presently disclosed subject matter can be recovered in a reverse genetics strategy using PCR with pooled sets of genomic DNAs as has been shown in C. elegans (Liu et al., 1999). A forward genetics strategy involves mutagenesis of a line bearing a trait of interest followed by screening the M2 progeny for the absence of the trait. Among these mutants would be expected to be some that disrupt a gene of the presently disclosed subject matter.
This could be assessed by Southern blotting or PCR using primers designed for a gene of the presently disclosed subject matter with genomic DNA from these mutants.

11. Overexpression in a Plant Cell

In yet another embodiment, a nucleotide sequence of the presently disclosed subject matter encoding a polypeptide is overexpressed. Examples of nucleic acid molecules and expression cassettes for over-expression of a nucleic acid molecule of the presently disclosed subject matter are disclosed above. Methods known to those skilled in the art of over-expression of nucleic acid molecules are also encompassed by the presently disclosed subject matter.

In one embodiment, the expression of the nucleotide sequence of the presently disclosed subject matter is altered in every cell of a plant. This can be obtained, for example, through homologous recombination or by insertion into a chromosome. This can also be obtained, for example, by expressing a sense or antisense RNA, zinc finger polypeptide, or ribozyme under the control of a promoter capable of expressing the sense or antisense RNA, zinc finger polypeptide, or ribozyme in every cell of a plant. Constitutive, inducible, tissue-specific, cell type-specific, or developmentally-regulated expression are also within the scope of the presently disclosed subject matter and result in a constitutive, inducible, tissue-specific, or developmentally-regulated alteration of the expression of a nucleotide sequence of the presently disclosed subject matter in the plant cell.

Constructs for expression of the sense or antisense RNA, zinc finger polypeptide, or ribozyme, or for over-expression of a nucleotide sequence of the presently disclosed subject matter, can be prepared and transformed into a plant cell according to the teachings of the presently disclosed subject matter, for example, as disclosed herein.

C. Construction of Plant Expression Vectors

Further encompassed within the presently disclosed subject matter is a recombinant vector comprising an expression cassette according to the embodiments of the presently disclosed subject matter. Also encompassed are plant cells comprising expression cassettes according to the present disclosure, and plants comprising these plant cells. In one embodiment, the plant is a dicot. In another embodiment, the plant is a gymnosperm. In another embodiment, the plant is a monocot. In one embodiment, the monocot is a cereal. In one embodiment, the cereal is, for example, maize, wheat, barley, oats, rye, millet, sorghum, triticale, secale, einkorn, spelt, emmer, teff, milo, flax, gramma grass, Tripsacum or teosinte. In another embodiment, the cereal is sorghum.

In one embodiment, the expression cassette is expressed throughout the plant. In another embodiment, the expression cassette is expressed in a specific location or tissue of a plant. In one embodiment, the location or tissue includes, but is not limited to, epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, flower, and combinations thereof. In another embodiment, the location or tissue is a seed.

In one embodiment, the expression cassette is involved in a function including, but not limited to, disease resistance, yield, biotic or abiotic stress resistance, nutritional quality, carbon metabolism, photosynthesis, signal transduction, cell growth, reproduction, disease processes (for example, pathogen resistance), gene regulation, and differentiation. In one embodiment, the polypeptide is involved in a function such as biotic or abiotic stress tolerance, enhanced yield or proliferation, disease resistance, or nutritional composition.
For example, a nucleic acid molecule of the presently disclosed subject matter can be introduced, under conditions for expression, into a host cell such that the host cell transcribes and translates the nucleic acid molecule to produce a stress-related polypeptide. By "under conditions for expression" is meant that a nucleic acid molecule is positioned in the cell such that it will be expressed in that cell. For example, a nucleic acid molecule can be located downstream of a promoter that is active in the cell, such that the promoter will drive the expression of the polypeptide encoded by the nucleic acid molecule in the cell. Any regulatory sequence (e.g., promoter, enhancer, inducible promoter) can be linked to the nucleic acid molecule; alternatively, the nucleic acid molecule can include its own regulatory sequence(s) such that it will be expressed (i.e., transcribed and/or translated) in a cell.

Where the nucleic acid molecule of the presently disclosed subject matter is introduced into a cell under conditions of expression, that nucleic acid molecule can be included in an expression cassette. Thus, the presently disclosed subject matter further provides a host cell comprising an expression cassette comprising a nucleic acid molecule encoding a stress-related polypeptide as disclosed herein. Such an expression cassette can include, in addition to the nucleic acid molecule encoding a stress-related polypeptide of the presently disclosed subject matter, at least one regulatory sequence (e.g., a promoter and/or an enhancer).

As such, coding sequences intended for expression in transgenic plants can be first assembled in expression cassettes operatively linked to a suitable promoter expressible in plants. The expression cassettes can also comprise any further sequences required or selected for the expression of the transgene.
Such sequences include, but are not limited to, transcription terminators, extraneous sequences to enhance expression such as introns, viral sequences, and sequences intended for the targeting of the gene product to specific organelles and cell compartments. These expression cassettes can then be easily transferred to the plant transformation vectors disclosed below. The following is a description of various components of typical expression cassettes.
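As a rough orientation before the individual components, the linear layout of such a cassette can be sketched in Python. This is a minimal illustration under stated assumptions, not a description of any actual vector: the element kinds, the ordering rule (promoter first, exactly one coding sequence, terminator last, optional leader, intron, or targeting elements in between), and the placeholder sequences are all hypothetical.

```python
# Illustrative sketch only; element names and sequences are hypothetical.
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Element:
    kind: str       # "promoter", "leader", "intron", "targeting", "coding", "terminator"
    name: str       # human-readable label
    sequence: str   # placeholder DNA sequence


CORE_ORDER = ("promoter", "coding", "terminator")
OPTIONAL_KINDS = {"leader", "intron", "targeting"}


def assemble_cassette(elements: Sequence[Element]) -> str:
    """Concatenate elements 5' to 3' after checking a simple layout rule."""
    kinds = [e.kind for e in elements]
    unknown = set(kinds) - set(CORE_ORDER) - OPTIONAL_KINDS
    if unknown:
        raise ValueError(f"unknown element kinds: {sorted(unknown)}")
    core = tuple(k for k in kinds if k in CORE_ORDER)
    if core != CORE_ORDER:
        raise ValueError("need exactly one promoter, coding sequence, and terminator, in that order")
    if kinds[0] != "promoter" or kinds[-1] != "terminator":
        raise ValueError("promoter must be first and terminator last")
    return "".join(e.sequence for e in elements)


if __name__ == "__main__":
    cassette = assemble_cassette([
        Element("promoter", "example promoter", "AAAA"),
        Element("leader", "example 5' leader", "CC"),
        Element("coding", "example coding sequence", "ATGTAA"),
        Element("terminator", "example terminator", "GGGG"),
    ])
    print(cassette)
```

Treating each component as a separate record reflects the modular design described in the text, in which a promoter or terminator can be exchanged without altering the coding sequence.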
1. Promoters

The selection of the promoter used in expression cassettes can determine the spatial and temporal expression pattern of the transgene in the transgenic plant. Selected promoters can express transgenes in specific cell types (such as leaf epidermal cells, mesophyll cells, root cortex cells) or in specific tissues or organs (roots, leaves, or flowers, for example) and the selection can reflect the desired location for accumulation of the gene product. Alternatively, the selected promoter can drive expression of the gene under various inducing conditions. Promoters vary in their strength; i.e., their abilities to promote transcription. Depending upon the host cell system utilized, any one of a number of suitable promoters can be used, including the gene's native promoter. The following are non-limiting examples of promoters that can be used in expression cassettes.

In one non-limiting example, a plant promoter fragment can be employed that will direct expression of the gene in all tissues of a regenerated plant. Such promoters are referred to herein as "constitutive" promoters and are active under most environmental conditions and states of development or cell differentiation. Examples of constitutive promoters include the cauliflower mosaic virus (CaMV) 35S transcription initiation region, the 1'- or 2'-promoter derived from T-DNA of Agrobacterium tumefaciens, and other transcription initiation regions from various plant genes known to those of ordinary skill in the art. Such genes include, for example, the AP2 gene, ACT11 from Arabidopsis (Huang et al., 1996), Cat3 from Arabidopsis (GENBANK® Accession No. U43147; Zhong et al., 1996), the gene encoding stearoyl-acyl carrier protein desaturase from Brassica napus (GENBANK® Accession No. X74782; Solocombe et al., 1994), GPc1 from maize (GENBANK® Accession No. X15596; Martinez et al., 1989), and Gpc2 from maize (GENBANK® Accession No.
U45855; Manjunath et al., 1997).

Alternatively, the plant promoter can direct expression of the nucleic acid molecules of the presently disclosed subject matter in a specific tissue or can be otherwise under more precise environmental or developmental control. Examples of environmental conditions that can affect transcription by inducible promoters include anaerobic conditions, elevated temperature, or the presence of light. Such promoters are referred to herein as "inducible", "cell type-specific", or "tissue-specific" promoters. One of ordinary skill in the art will recognize that a tissue-specific promoter can drive expression of operatively linked sequences in tissues other than the target tissue. Thus, as used herein a tissue-specific promoter is one that drives expression preferentially in the target tissue, but can also lead to some expression in other tissues as well.

Examples of promoters under developmental control include promoters that initiate transcription only (or preferentially) in certain tissues, such as fruit, seeds, or flowers. Promoters that direct expression of nucleic acids in ovules, flowers, or seeds are particularly useful in the presently disclosed subject matter. As used herein a seed-specific or preferential promoter is one that directs expression specifically or preferentially in seed tissues. Such promoters can be, for example, ovule-specific, embryo-specific, endosperm-specific, integument-specific, seed coat-specific, or some combination thereof. Examples include a promoter from the ovule-specific BEL1 gene described in Reiser et al., 1995 (GENBANK® Accession No. U39944). Non-limiting examples of seed-specific promoters are derived from the following genes: MAC1 from maize (Sheridan et al., 1996), Cat3 from maize (GENBANK® Accession No. L05934; Abler et al., 1993), the gene encoding oleosin 18 kD from maize (GENBANK® Accession No.
J05212; Lee et al., 1994), viviparous-1 from Arabidopsis (GENBANK® Accession No. U93215), the gene encoding oleosin from Arabidopsis (GENBANK® Accession No. Z17657), Atmyc1 from Arabidopsis (Urao et al., 1996), the 2s seed storage protein gene family from Arabidopsis (Conceicao et al., 1994), the gene encoding oleosin 20 kD from Brassica napus (GENBANK® Accession No. M63985), napA from Brassica napus (GENBANK® Accession No. J02798; Josefsson et al., 1987), the napin gene family from Brassica napus (Sjodahl et al., 1995), the gene encoding the 2S storage protein from Brassica napus (Dasgupta et al., 1993), the genes encoding oleosin A (GENBANK® Accession No. U09118) and oleosin B (GENBANK® Accession No. U09119) from soybean, and the gene encoding low molecular weight sulphur rich protein from soybean (Choi et al., 1995).

Alternatively, particular sequences that provide the promoter with desirable expression characteristics, or the promoter with expression enhancement activity, could be identified and these or similar sequences introduced into the sequences via cloning or via mutation. It is further contemplated that these sequences can be mutagenized in order to enhance the expression of transgenes in a particular species.

Furthermore, it is contemplated that promoters combining elements from more than one promoter can be employed. For example, U.S. Patent No. 5,491,288 discloses combining a Cauliflower Mosaic Virus (CaMV) promoter with a histone promoter. Thus, the elements from the promoters disclosed herein can be combined with elements from other promoters.

a. Constitutive Expression: the Ubiquitin Promoter

Ubiquitin is a gene product known to accumulate in many cell types and its promoter has been cloned from several species for use in transgenic plants (e.g., sunflower - Binet et al., 1991; maize - Christensen et al., 1989; and Arabidopsis - Callis et al., 1990; Norris et al., 1993).
The maize ubiquitin promoter has been developed in transgenic monocot systems and its sequence and vectors constructed for monocot transformation are disclosed in the patent publication EP 0 342 926 (to Lubrizol), which is herein incorporated by reference. Taylor et al., 1993, describes a vector (pAHC25) that comprises the maize ubiquitin promoter and first intron and its high activity in cell suspensions of numerous monocotyledons when introduced via microprojectile bombardment. The Arabidopsis ubiquitin promoter is suitable for use with the nucleotide sequences of the presently disclosed subject matter. The ubiquitin promoter is suitable for gene expression in transgenic plants, both monocotyledons and dicotyledons. Suitable vectors are derivatives of pAHC25 or any of the transformation vectors disclosed herein, modified by the introduction of the appropriate ubiquitin promoter and/or intron sequences.

b. Constitutive Expression: the CaMV 35S Promoter

Construction of the plasmid pCGN1761 is disclosed in the published patent application EP 0 392 225 (Example 23), which is hereby incorporated by reference. pCGN1761 contains the "double" CaMV 35S promoter and the tml transcriptional terminator with a unique EcoRI site between the promoter and the terminator and has a pUC-type backbone. A derivative of pCGN1761 is constructed which has a modified polylinker that includes NotI and XhoI sites in addition to the existing EcoRI site. This derivative is designated pCGN1761ENX. pCGN1761ENX is useful for the cloning of cDNA sequences or coding sequences (including microbial ORF sequences) within its polylinker for the purpose of their expression under the control of the 35S promoter in transgenic plants.
The entire 35S promoter-coding sequence-tml terminator cassette of such a construction can be excised by HindIII, SphI, SalI, and XbaI sites 5' to the promoter and XbaI, BamHI and BglI sites 3' to the terminator for transfer to transformation vectors such as those disclosed below. Furthermore, the double 35S promoter fragment can be removed by 5' excision with HindIII, SphI, SalI, XbaI, or PstI, and 3' excision with any of the polylinker restriction sites (EcoRI, NotI or XhoI) for replacement with another promoter. If desired, modifications around the cloning sites can be made by the introduction of sequences that can enhance translation. This is particularly useful when overexpression is desired. For example, pCGN1761ENX can be modified by optimization of the translational initiation site as disclosed in Example 37 of U.S. Patent No. 5,639,949, incorporated herein by reference.

c. Constitutive Expression: the Actin Promoter

Several isoforms of actin are known to be expressed in most cell types and consequently the actin promoter can be used as a constitutive promoter. In particular, the promoter from the rice Act1 gene has been cloned and characterized (McElroy et al., 1990). A 1.3 kilobase (kb) fragment of the promoter was found to contain all the regulatory elements required for expression in rice protoplasts. Furthermore, numerous expression vectors based on the Act1 promoter have been constructed specifically for use in monocotyledons (McElroy et al., 1991). These incorporate the Act1-intron 1, Adh1 5' flanking sequence (from the maize alcohol dehydrogenase gene) and Adh1-intron 1 and sequence from the CaMV 35S promoter. Vectors showing highest expression were fusions of 35S and Act1 intron or the Act1 5' flanking sequence and the Act1 intron. Optimization of sequences around the initiating ATG (of the β-glucuronidase (GUS) reporter gene) also enhanced expression.
The promoter expression cassettes disclosed in McElroy et al., 1991, can be easily modified for gene expression and are particularly suitable for use in monocotyledonous hosts. For example, promoter-containing fragments are removed from the McElroy constructions and used to replace the double 35S promoter in pCGN1761ENX, which is then available for the insertion of specific gene sequences. The fusion genes thus constructed can then be transferred to appropriate transformation vectors. In a separate report, the rice Act1 promoter with its first intron has also been found to direct high expression in cultured barley cells (Chibbar et al., 1993).

d. Inducible Expression: PR-1 Promoters

The double 35S promoter in pCGN1761ENX can be replaced with any other promoter of choice that will result in suitably high expression levels. By way of example, one of the chemically regulatable promoters disclosed in U.S. Patent No. 5,614,395, such as the tobacco PR-1a promoter, can replace the double 35S promoter. Alternatively, the Arabidopsis PR-1 promoter disclosed in Lebel et al., 1998, can be used. The promoter of choice can be excised from its source by restriction enzymes, but can alternatively be PCR-amplified using primers that carry appropriate terminal restriction sites. Should PCR-amplification be undertaken, the promoter can be re-sequenced to check for amplification errors after the cloning of the amplified promoter in the target vector. The chemically/pathogen regulatable tobacco PR-1a promoter is cleaved from plasmid pCIB1004 (for construction, see Example 21 of EP 0 332 104, which is hereby incorporated by reference) and transferred to plasmid pCGN1761ENX (Uknes et al., 1992). pCIB1004 is cleaved with NcoI and the resulting 3' overhang of the linearized fragment is rendered blunt by treatment with T4 DNA polymerase.
The fragment is then cleaved with HindIII and the resultant PR-1a promoter-containing fragment is gel purified and cloned into pCGN1761ENX from which the double 35S promoter has been removed. This is accomplished by cleavage with XhoI and blunting with T4 polymerase, followed by cleavage with HindIII, and isolation of the larger vector-terminator containing fragment into which the pCIB1004 promoter fragment is cloned. This generates a pCGN1761ENX derivative with the PR-1a promoter and the tml terminator and an intervening polylinker with unique EcoRI and NotI sites. The selected coding sequence can be inserted into this vector, and the fusion products (i.e., promoter-gene-terminator) can subsequently be transferred to any selected transformation vector, including those disclosed herein. Various chemical regulators can be employed to induce expression of the selected coding sequence in the plants transformed according to the presently disclosed subject matter, including the benzothiadiazole, isonicotinic acid, and salicylic acid compounds disclosed in U.S. Patent Nos. 5,523,311 and 5,614,395.

e. Inducible Expression: an Ethanol-Inducible Promoter

A promoter inducible by certain alcohols or ketones, such as ethanol, can also be used to confer inducible expression of a coding sequence of the presently disclosed subject matter. Such a promoter is, for example, the alcA gene promoter from Aspergillus nidulans (Caddick et al., 1998). In A. nidulans, the alcA gene encodes alcohol dehydrogenase I, the expression of which is regulated by the AlcR transcription factors in the presence of the chemical inducer.
For the purposes of the presently disclosed subject matter, the CAT coding sequences in plasmid palcA:CAT comprising an alcA gene promoter sequence fused to a minimal 35S promoter (Caddick et al., 1998) are replaced by a coding sequence of the presently disclosed subject matter to form an expression cassette having the coding sequence under the control of the alcA gene promoter. This is carried out using methods known in the art.

f. Inducible Expression: a Glucocorticoid-Inducible Promoter

Induction of expression of a nucleic acid sequence of the presently disclosed subject matter using systems based on steroid hormones is also provided. For example, a glucocorticoid-mediated induction system is used (Aoyama & Chua, 1997) and gene expression is induced by application of a glucocorticoid, for example a synthetic glucocorticoid such as dexamethasone, at a concentration ranging in one embodiment from 0.1 mM to 1 mM, and in another embodiment from 10 mM to 100 mM. For the purposes of the presently disclosed subject matter, the luciferase gene sequences of Aoyama & Chua are replaced by a nucleic acid sequence of the presently disclosed subject matter to form an expression cassette having a nucleic acid sequence of the presently disclosed subject matter under the control of six copies of the GAL4 upstream activating sequences fused to the 35S minimal promoter. This is carried out using methods known in the art. The trans-acting factor comprises the GAL4 DNA-binding domain (Keegan et al., 1986) fused to the transactivating domain of the herpes viral polypeptide VP16 (Triezenberg et al., 1988) fused to the hormone-binding domain of the rat glucocorticoid receptor (Picard et al., 1988). The expression of the fusion polypeptide is controlled either by a promoter known in the art or disclosed herein.
A plant comprising an expression cassette comprising a nucleic acid sequence of the presently disclosed subject matter fused to the 6x GAL4/minimal promoter is also provided. Thus, tissue- or organ-specificity of the fusion polypeptide is achieved, leading to inducible tissue- or organ-specificity of the nucleic acid sequence to be expressed.

g. Root Specific Expression

Another pattern of gene expression is root expression. A suitable root promoter is the promoter of the maize metallothionein-like (MTL) gene disclosed in de Framond, 1991, and also in U.S. Patent No. 5,466,785, each of which is incorporated herein by reference. This "MTL" promoter is transferred to a suitable vector such as pCGN1761ENX for the insertion of a selected gene and subsequent transfer of the entire promoter-gene-terminator cassette to a transformation vector of interest.

h. Wound-Inducible Promoters

Wound-inducible promoters can also be suitable for gene expression. Numerous such promoters have been disclosed (e.g., Xu et al., 1993; Logemann et al., 1989; Rohrmeier & Lehle, 1993; Firek et al., 1993; Warner et al., 1993) and all are suitable for use with the presently disclosed subject matter. Logemann et al. describe the 5' upstream sequences of the dicotyledonous potato wun1 gene. Xu et al. show that a wound-inducible promoter from the dicotyledon potato (pin2) is active in the monocotyledon rice. Further, Rohrmeier & Lehle describe the cloning of the maize Wip1 cDNA that is wound induced and which can be used to isolate the cognate promoter using standard techniques. Similarly, Firek et al. and Warner et al. have disclosed a wound-induced gene from the monocotyledon Asparagus officinalis, which is expressed at local wound and pathogen invasion sites.
Using cloning techniques well known in the art, these promoters can be transferred to suitable vectors, fused to the genes pertaining to the presently disclosed subject matter, and used to express these genes at the sites of plant wounding.

i. Pith-Preferred Expression

PCT International Publication WO 93/07278, which is herein incorporated by reference, describes the isolation of the maize trpA gene, which is preferentially expressed in pith cells. The gene sequence and promoter extending up to -1726 basepairs (bp) from the start of transcription are presented. Using standard molecular biological techniques, this promoter, or parts thereof, can be transferred to a vector such as pCGN1761 where it can replace the 35S promoter and be used to drive the expression of a foreign gene in a pith-preferred manner. In fact, fragments containing the pith-preferred promoter or parts thereof can be transferred to any vector and modified for utility in transgenic plants.

j. Leaf-Specific Expression

A maize gene encoding phosphoenolpyruvate carboxylase (PEPC) has been disclosed by Hudspeth & Grula, 1989. Using standard molecular biological techniques, the promoter for this gene can be used to drive the expression of any gene in a leaf-specific manner in transgenic plants.

k. Pollen-Specific Expression

WO 93/07278 describes the isolation of the maize calcium-dependent protein kinase (CDPK) gene that is expressed in pollen cells. The gene sequence and promoter extend up to 1400 bp from the start of transcription. Using standard molecular biological techniques, this promoter or parts thereof can be transferred to a vector such as pCGN1761 where it can replace the 35S promoter and be used to drive the expression of a nucleic acid sequence of the presently disclosed subject matter in a pollen-specific manner.

2. Transcriptional Terminators

A variety of 5' and 3' transcriptional regulatory sequences are available for use in the presently disclosed subject matter. Transcriptional terminators are responsible for the termination of transcription and correct mRNA polyadenylation. The 3' nontranslated regulatory DNA sequence includes in one embodiment from about 50 to about 1,000, and in another embodiment from about 100 to about 1,000, nucleotide base pairs and contains plant transcriptional and translational termination sequences. Appropriate transcriptional terminators and those that are known to function in plants include the CaMV 35S terminator, the tml terminator, the nopaline synthase terminator, the pea rbcS E9 terminator, the terminator for the T7 transcript from the octopine synthase gene of Agrobacterium tumefaciens, and the 3' end of the protease inhibitor I or II genes from potato or tomato, although other 3' elements known to those of skill in the art can also be employed. Alternatively, a gamma coixin, oleosin 3, or other terminator from the genus Coix can be used.

Non-limiting 3' elements include those from the nopaline synthase gene of Agrobacterium tumefaciens (Bevan et al., 1983), the terminator for the T7 transcript from the octopine synthase gene of Agrobacterium tumefaciens, and the 3' end of the protease inhibitor I or II genes from potato or tomato.

As the DNA sequence between the transcription initiation site and the start of the coding sequence (i.e., the untranslated leader sequence, also referred to as the 5' untranslated region) can influence gene expression, a particular leader sequence can also be employed. Non-limiting leader sequences are contemplated to include those that include sequences predicted to direct optimum expression of the operatively linked gene; i.e., to include a consensus leader sequence that can increase or maintain mRNA stability and prevent inappropriate initiation of translation.
The choice of such sequences will be known to those of skill in the art in light of the present disclosure. Sequences that are derived from genes that are highly expressed in plants are useful in the presently disclosed subject matter.

Thus, a variety of transcriptional terminators are available for use in expression cassettes. These are responsible for termination of transcription and correct mRNA polyadenylation. Appropriate transcriptional terminators are those that are known to function in plants and include the CaMV 35S terminator, the tml terminator, the nopaline synthase terminator, and the pea rbcS E9 terminator. These can be used in both monocotyledons and dicotyledons. In addition, a gene's native transcription terminator can be used.

3. Other Sequences for the Enhancement or Regulation of Expression

Numerous sequences have been found to enhance gene expression from within the transcriptional unit and these sequences can be used in conjunction with the genes of the presently disclosed subject matter to increase their expression in transgenic plants.

Other sequences that have been found to enhance gene expression in transgenic plants include intron sequences (e.g., from Adh1, bronze, actin1, actin 2 (PCT International Publication No. WO 00/760067), or the sucrose synthase intron), and viral leader sequences (e.g., from Tobacco Mosaic Virus (TMV), Maize Chlorotic Mottle Virus (MCMV), or Alfalfa Mosaic Virus (AMV)). For example, a number of non-translated leader sequences derived from viruses are known to enhance the expression of operatively linked nucleic acids. Specifically, leader sequences from Tobacco Mosaic Virus (TMV), Maize Chlorotic Mottle Virus (MCMV), and Alfalfa Mosaic Virus (AMV) have been shown to be effective in enhancing expression (e.g., Gallie et al., 1987; Skuzeski et al., 1990).
Other leaders known in the art include, but are not limited to, picornavirus leaders, for example, encephalomyocarditis virus (EMCV) leader (encephalomyocarditis 5' noncoding region; Elroy-Stein et al., 1989); potyvirus leaders (e.g., Tobacco Etch Virus (TEV) leader and Maize Dwarf Mosaic Virus (MDMV) leader); human immunoglobulin heavy-chain binding protein (BiP) leader (Macejak et al., 1991); untranslated leader from the coat protein mRNA of AMV (AMV RNA 4; Jobling et al., 1987); TMV leader (Gallie et al., 1989); and maize chlorotic mottle virus leader (Lommel et al., 1991). See also, Della-Cioppa et al., 1987. Regulatory elements such as Adh intron 1 (Callis et al., 1987), sucrose synthase intron (Vasil et al., 1989), or TMV omega element (Gallie et al., 1989) can further be included where desired. Non-limiting examples of enhancers include elements from the CaMV 35S promoter, octopine synthase genes (Ellis et al., 1987), the rice actin 1 gene, the maize alcohol dehydrogenase gene (Callis et al., 1987), the maize shrunken 1 gene (Vasil et al., 1989), TMV omega element (Gallie et al., 1989), and promoters from non-plant eukaryotes (e.g., yeast; Ma et al., 1988).

A number of non-translated leader sequences derived from viruses are also known to enhance expression, and these are particularly effective in dicotyledonous cells. Specifically, leader sequences from Tobacco Mosaic Virus (TMV; the "Ω-sequence"), Maize Chlorotic Mottle Virus (MCMV), and Alfalfa Mosaic Virus (AMV) have been shown to be effective in enhancing expression (see e.g., Gallie et al., 1987; Skuzeski et al., 1990). Other leader
Other leader sequences known in the art include, but are not limited to, picornavirus leaders, for example, EMCV (encephalomyocarditis virus) leader (5' noncoding region; see Elroy-Stein et al., 1989); potyvirus leaders, for example, from Tobacco Etch Virus (TEV; see Allison et al., 1986); Maize Dwarf Mosaic Virus (MDMV; see Kong & Steinbiss 1998); human immunoglobulin heavy-chain binding polypeptide (BiP) leader (Macejak & Sarnow, 1991); untranslated leader from the coat polypeptide mRNA of alfalfa mosaic virus (AMV; RNA 4; see Jobling & Gehrke, 1987); tobacco mosaic virus (TMV) leader (Gallie et al., 1989); and Maize Chlorotic Mottle Virus (MCMV) leader (Lommel et al., 1991). See also, Della-Cioppa et al., 1987.

In addition to incorporating one or more of the aforementioned elements into the 5' regulatory region of a target expression cassette of the presently disclosed subject matter, other elements can also be incorporated. Such elements include, but are not limited to, a minimal promoter. By minimal promoter it is intended that the basal promoter elements are inactive or nearly so in the absence of upstream or downstream activation. Such a promoter has low background activity in plants when there is no transactivator present or when enhancer or response element binding sites are absent. One minimal promoter that is particularly useful for target genes in plants is the Bz1 minimal promoter, which is obtained from the bronze gene of maize. The Bz1 core promoter is obtained from the "myc" mutant Bz1-luciferase construct pBz1LucR98 via cleavage at the NheI site located at positions -53 to -58 (Roth et al., 1991). The derived Bz1 core promoter fragment thus extends from positions -53 to +227 and includes the Bz1 intron-1 in the 5' untranslated region. Also useful for the presently disclosed subject matter is a minimal promoter created by use of a synthetic TATA element.
The TATA element allows recognition of the promoter by RNA polymerase factors and confers a basal level of gene expression in the absence of activation (see generally, Mukumoto et al., 1993; Green, 2000).

4. Targeting of the Gene Product Within the Cell

Various mechanisms for targeting gene products are known to exist in plants and the sequences controlling the functioning of these mechanisms have been characterized in some detail. For example, the targeting of gene products to the chloroplast is controlled by a signal sequence found at the amino terminal end of various polypeptides that is cleaved during chloroplast import to yield the mature polypeptides (see e.g., Comai et al., 1988). These signal sequences can be fused to heterologous gene products to effect the import of heterologous products into the chloroplast (Van den Broeck et al., 1985). DNA encoding appropriate signal sequences can be isolated from the 5' end of the cDNAs encoding the ribulose-1,5-bisphosphate carboxylase/oxygenase (RUBISCO) polypeptide, the chlorophyll a/b binding (CAB) polypeptide, the 5-enol-pyruvyl shikimate-3-phosphate (EPSP) synthase enzyme, the GS2 polypeptide and many other polypeptides which are known to be chloroplast localized. See also, the section entitled "Expression With Chloroplast Targeting" in Example 37 of U.S. Patent No. 5,639,949, herein incorporated by reference.

Other gene products can be localized to other organelles such as the mitochondrion and the peroxisome (e.g., Unger et al., 1989). The cDNAs encoding these products can also be manipulated to effect the targeting of heterologous gene products to these organelles. Examples of such sequences are the nuclear-encoded ATPases and specific aspartate amino transferase isoforms for mitochondria. Targeting to cellular polypeptide bodies has been disclosed by Rogers et al., 1985.

In addition, sequences have been characterized that control the targeting of gene products to other cell compartments.
Amino terminal sequences are responsible for targeting to the endoplasmic reticulum (ER), the apoplast, and extracellular secretion from aleurone cells (Koehler & Ho, 1990). Additionally, amino terminal sequences in conjunction with carboxy terminal sequences are responsible for vacuolar targeting of gene products (Shinshi et al., 1990).

By the fusion of the appropriate targeting sequences disclosed above to transgene sequences of interest it is possible to direct the transgene product to any organelle or cell compartment. For chloroplast targeting, for example, the chloroplast signal sequence from the RUBISCO gene, the CAB gene, the EPSP synthase gene, or the GS2 gene is fused in frame to the amino terminal ATG of the transgene. The signal sequence selected can include the known cleavage site, and the fusion constructed can take into account any amino acids after the cleavage site that are required for cleavage. In some cases this requirement can be fulfilled by the addition of a small number of amino acids between the cleavage site and the transgene ATG or, alternatively, replacement of some amino acids within the transgene sequence. Fusions constructed for chloroplast import can be tested for efficacy of chloroplast uptake by in vitro translation of in vitro transcribed constructions followed by in vitro chloroplast uptake using techniques disclosed by Bartlett et al., 1982 and Wasmann et al., 1986. These construction techniques are well known in the art and are equally applicable to mitochondria and peroxisomes.

The above-disclosed mechanisms for cellular targeting can be utilized not only in conjunction with their cognate promoters, but also in conjunction with heterologous promoters so as to effect a specific cell-targeting goal under the transcriptional regulation of a promoter that has an expression pattern different from that of the promoter from which the targeting signal derives.

D.
Construction of Plant Transformation Vectors

1. Introduction

Numerous transformation vectors available for plant transformation are known to those of ordinary skill in the plant transformation art, and the genes pertinent to the presently disclosed subject matter can be used in conjunction with any such vectors. The selection of vector will depend upon the selected transformation technique and the target species for transformation. For certain target species, different antibiotic or herbicide selection markers might be employed. Selection markers used routinely in transformation include the nptII gene, which confers resistance to kanamycin and related antibiotics (Messing & Vieira, 1982; Bevan et al., 1983); the bar gene, which confers resistance to the herbicide phosphinothricin (White et al., 1990; Spencer et al., 1990); the hph gene, which confers resistance to the antibiotic hygromycin (Blochinger & Diggelmann, 1984); the dhfr gene, which confers resistance to methotrexate (Bourouis & Jarry, 1983); the EPSP synthase gene, which confers resistance to glyphosate (U.S. Patent Nos. 4,940,935 and 5,188,642); and the mannose-6-phosphate isomerase gene, which provides the ability to metabolize mannose (U.S. Patent Nos. 5,767,378 and 5,994,629).

The compositions of the presently disclosed subject matter include plant nucleic acid molecules, and the amino acid sequences of the polypeptides or partial-length polypeptides encoded by nucleic acid molecules comprising an open reading frame. These sequences can be employed to alter the expression of a particular gene corresponding to the open reading frame by decreasing or eliminating expression of that plant gene or by overexpressing a particular gene product.
Methods of this embodiment of the presently disclosed subject matter include stably transforming a plant with a nucleic acid molecule of the presently disclosed subject matter that includes an open reading frame operatively linked to a promoter capable of driving expression of that open reading frame (sense or antisense) in a plant cell. By "portion" or "fragment", as it relates to a nucleic acid molecule that comprises an open reading frame or a fragment thereof encoding a partial-length polypeptide having the activity of the full length polypeptide, is meant a sequence having in one embodiment at least 80 nucleotides, in another embodiment at least 150 nucleotides, and in still another embodiment at least 400 nucleotides. If not employed for expression, a "portion" or "fragment" means in representative embodiments at least 9, or 12, or 15, or at least 20, consecutive nucleotides (e.g., probes and primers or other oligonucleotides) corresponding to the nucleotide sequence of the nucleic acid molecules of the presently disclosed subject matter. Thus, to express a particular gene product, the method comprises introducing into a plant, plant cell, or plant tissue an expression cassette comprising a promoter operatively linked to an open reading frame so as to yield a transformed differentiated plant, transformed cell, or transformed tissue. Transformed cells or tissue can be regenerated to provide a transformed differentiated plant. The transformed differentiated plant or cells thereof can express the open reading frame in an amount that alters the amount of the gene product in the plant or cells thereof, which product is encoded by the open reading frame. The presently disclosed subject matter also provides a transformed plant prepared by the methods disclosed herein, as well as progeny and seed thereof.
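The length thresholds given above for a "portion" or "fragment" can be summarized in a short sketch. This is only an illustration of the stated minimums; the function name, the tuple names, and the idea of indexing the embodiments are assumptions made here and are not part of the disclosure.

```python
# Minimum fragment lengths (in nucleotides) quoted in the text.
# The names below are hypothetical and used for illustration only.
EXPRESSION_MIN_NT = (80, 150, 400)   # successive embodiments, fragment used for expression
PROBE_MIN_NT = (9, 12, 15, 20)       # probes, primers, or other oligonucleotides


def qualifies_as_fragment(length_nt: int, for_expression: bool, embodiment: int = 0) -> bool:
    """Return True if a sequence of length_nt meets the minimum for the chosen embodiment."""
    thresholds = EXPRESSION_MIN_NT if for_expression else PROBE_MIN_NT
    return length_nt >= thresholds[embodiment]


# An 18-nucleotide primer meets the 15-nucleotide minimum but not the 20-nucleotide one.
print(qualifies_as_fragment(18, for_expression=False, embodiment=2))
print(qualifies_as_fragment(18, for_expression=False, embodiment=3))
```

The check covers length only; whether a partial-length polypeptide retains the activity of the full length polypeptide has to be established experimentally.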
The presently disclosed subject matter further includes a nucleotide sequence that is complementary to one (hereinafter "test" sequence) that hybridizes under stringent conditions to a nucleic acid molecule of the presently disclosed subject matter, as well as an RNA molecule that is transcribed from the nucleic acid molecule. When hybridization is performed under stringent conditions, either the test or nucleic acid molecule of the presently disclosed subject matter can be present on a support: e.g., on a membrane or on a DNA chip. Thus, either a denatured test or nucleic acid molecule of the presently disclosed subject matter is first bound to a support and hybridization is effected for a specified period of time at a temperature of, in one embodiment, between 55°C and 70°C, in 2X SSC containing 0.1% SDS, followed by rinsing the support at the same temperature but with a buffer having a reduced SSC concentration. Depending upon the degree of stringency required, such reduced concentration buffers are typically 1X SSC containing 0.1% SDS, 0.5X SSC containing 0.1% SDS, or 0.1X SSC containing 0.1% SDS.
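The hybridization and wash conditions described above can be laid out as a small lookup, shown here as a minimal sketch. The stringency labels ("lower", "intermediate", "higher") and all identifiers are assumptions introduced for illustration; the text itself names only the three wash buffers and the 55°C to 70°C range.

```python
# Conditions quoted in the text; labels and names are illustrative assumptions.
HYB_SSC_X = 2.0                     # hybridization buffer: 2X SSC
SDS_PERCENT = 0.1                   # 0.1% SDS in every buffer
HYB_TEMP_RANGE_C = (55.0, 70.0)     # temperature range of one embodiment
WASH_SSC_X = {"lower": 1.0, "intermediate": 0.5, "higher": 0.1}


def describe_protocol(temp_c: float, stringency: str) -> str:
    """Describe hybridization and the wash, which is done at the same temperature."""
    low, high = HYB_TEMP_RANGE_C
    if not low <= temp_c <= high:
        raise ValueError(f"temperature {temp_c} C is outside {low}-{high} C")
    wash = WASH_SSC_X[stringency]   # lower SSC concentration means a more stringent wash
    return (
        f"hybridize at {temp_c:g} C in {HYB_SSC_X:g}X SSC / {SDS_PERCENT:g}% SDS; "
        f"wash at {temp_c:g} C in {wash:g}X SSC / {SDS_PERCENT:g}% SDS"
    )


print(describe_protocol(65, "higher"))
```

Keeping the wash temperature equal to the hybridization temperature follows the text; only the SSC concentration changes between stringency levels.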
In a further embodiment, the presently disclosed subject matter provides a transformed plant host cell, or one obtained through breeding, capable of over-expressing, under-expressing, or having a knockout of a polypeptide-encoding gene and/or its gene product(s). The plant cell is transformed with at least one such expression vector wherein the plant host cell can be used to regenerate plant tissue or an entire plant, or seed therefrom, in which the effects of expression, including overexpression and underexpression, of the introduced sequence or sequences can be measured in vitro or in planta.

In another aspect, the presently disclosed subject matter features an isolated stress-related polypeptide, wherein the polypeptide binds to a fragment of a protein selected from the group consisting of OsGF14-c (SEQ ID NO: 113), OsDAD1 (SEQ ID NO: 128), Os006819-2510 (SEQ ID NO: 20), OsCRTC (SEQ ID NO: 134), OsSGT1 (SEQ ID NO: 144), OsERP (SEQ ID NO: 146), OsCHIB1 (SEQ ID NO: 152), OsCS (SEQ ID NO: 156), OsPP2A-2 (SEQ ID NO: 164), and OsCAA90866 (SEQ ID NO: 170). In some embodiments, the presently disclosed subject matter features an isolated polypeptide comprising or consisting of an amino acid sequence substantially similar to the amino acid sequence of an isolated stress-related polypeptide of the presently disclosed subject matter.

Because the proteins of the presently disclosed subject matter have a role in stress response, in certain embodiments, a cell introduced with a nucleic acid molecule of the presently disclosed subject matter has a different stress response as compared to a cell not introduced with the nucleic acid molecule.
In another aspect, the presently disclosed subject matter features a method for modulating stress response of a plant cell comprising introducing an isolated nucleic acid molecule encoding a stress-related polypeptide into the plant cell, wherein the polypeptide binds to a fragment of a protein selected from the group consisting of OsGF14-c (SEQ ID NO: 113), OsDAD1 (SEQ ID NO: 128), Os006819-2510 (SEQ ID NO: 20), OsCRTC (SEQ ID NO: 134), OsSGT1 (SEQ ID NO: 144), OsERP (SEQ ID NO: 146), OsCHIB1 (SEQ ID NO: 152), OsCS (SEQ ID NO: 156), OsPP2A-2 (SEQ ID NO: 164), and OsCAA90866 (SEQ ID NO: 170), wherein the polypeptide is expressed by the cell.

In another aspect, the presently disclosed subject matter features a method for modulating stress response of a plant cell comprising introducing an isolated nucleic acid molecule encoding a stress-related polypeptide into the plant cell, wherein the polypeptide binds to a fragment of a protein selected from the group consisting of OsGF14-c (SEQ ID NO: 113), OsDAD1 (SEQ ID NO: 128), Os006819-2510 (SEQ ID NO: 20), OsCRTC (SEQ ID NO: 134), OsSGT1 (SEQ ID NO: 144), OsERP (SEQ ID NO: 146), OsCHIB1 (SEQ ID NO: 152), OsCS (SEQ ID NO: 156), OsPP2A-2 (SEQ ID NO: 164), and OsCAA90866 (SEQ ID NO: 170), wherein expression of the polypeptide encoded by the nucleic acid molecule is reduced in the cell.

As discussed herein, the stress-related proteins described herein affect stress response (e.g., when the plant is exposed to biotic or abiotic stress). Accordingly, by changing the amount of a stress-related protein of the presently disclosed subject matter in a plant cell, the stress response of that plant cell can be modulated.

In some situations, increasing expression of a stress-related protein of the presently disclosed subject matter in a cell will cause that cell to increase its stress response (in some cases, rate of proliferation).
In other situations, increasing expression of a stress-related protein of the presently disclosed subject matter in a cell causes that cell to reduce its stress response (in some cases, rate of proliferation). Similarly, decreasing the expression of a stress-related protein of the presently disclosed subject matter in a cell can increase or decrease that cell's stress response (in some cases, rate of proliferation). What is relevant is that the stress response of the cell changes if the level of expression of a stress-related protein of the presently disclosed subject matter is either increased or decreased.
Increasing the level of expression of a stress-related protein of the presently disclosed subject matter in a cell is a relatively simple matter. For example, overexpression of the protein can be accomplished by transforming the cell with a nucleic acid molecule encoding the protein according to standard methods such as those described above.

Once a nucleic acid sequence of the presently disclosed subject matter has been cloned into an expression system, it is transformed into a plant cell. The receptor and target expression cassettes of the presently disclosed subject matter can be introduced into the plant cell in a number of art-recognized ways. Methods for regeneration of plants are also well known in the art. For example, Ti plasmid vectors have been utilized for the delivery of foreign DNA, as well as direct DNA uptake, liposomes, electroporation, microinjection, and microprojectiles. In addition, bacteria from the genus Agrobacterium can be utilized to transform plant cells. Below are descriptions of representative techniques for transforming both dicotyledonous and monocotyledonous plants, as well as a representative plastid transformation technique.

Transformation of a plant can be undertaken with a single DNA molecule or multiple DNA molecules (i.e., co-transformation), and both these techniques are suitable for use with the expression cassettes of the presently disclosed subject matter. Numerous transformation vectors are available for plant transformation, and the expression cassettes of the presently disclosed subject matter can be used in conjunction with any such vectors. The selection of vector will depend upon the transformation technique and the species targeted for transformation.

A variety of techniques are available and known for introduction of nucleic acid molecules and expression cassettes comprising such nucleic acid molecules into a plant cell host.
These techniques include, but are not limited to, transformation with DNA employing A. tumefaciens or A. rhizogenes as the transforming agent, liposomes, PEG precipitation, electroporation, DNA injection, direct DNA uptake, microprojectile bombardment, particle acceleration, and the like (see e.g., EP 0 295 959 and EP 0 138 341; see also below). However, cells other than plant cells can be transformed with the expression cassettes of the presently disclosed subject matter. General descriptions of plant expression vectors and reporter genes, and Agrobacterium and Agrobacterium-mediated gene transfer, can be found in Gruber et al., 1993, incorporated herein by reference in its entirety.

Expression vectors containing genomic or synthetic fragments can be introduced into protoplasts or into intact tissues or isolated cells. In some embodiments, expression vectors are introduced into intact tissue. "Plant tissue" includes differentiated and undifferentiated tissues or entire plants, including but not limited to roots, stems, shoots, leaves, pollen, seeds, tumor tissue, and various forms of cells and cultures such as single cells, protoplasts, embryos, and callus tissues. The plant tissue can be in plants or in organ, tissue, or cell culture. General methods of culturing plant tissues are provided, for example, by Maki et al., 1993 and by Phillips et al., 1988. In some embodiments, expression vectors are introduced into maize or other plant tissues using a direct gene transfer method such as microprojectile-mediated delivery, DNA injection, electroporation, or the like. In some embodiments, expression vectors are introduced into plant tissues using microprojectile-mediated delivery with a biolistic device (see e.g., Tomes et al., 1995).
The vectors of the presently disclosed subject matter can not only be used for expression of structural genes but can also be used in exon-trap cloning or in promoter trap procedures to detect differential gene expression in varieties of tissues (Lindsey et al., 1993; Auch & Reth, 1990).

In some embodiments, the binary type vectors of the Ti and Ri plasmids of Agrobacterium spp. are employed. Ti-derived vectors can be used to transform a wide variety of higher plants, including monocotyledonous and dicotyledonous plants including, but not limited to, soybean, cotton, rape, tobacco, and rice (Pacciotti et al., 1985; Byrne et al., 1987; Sukhapinda et al., 1987; Lorz et al., 1985; Potrykus, 1985; Park et al., 1985; Hiei et al., 1994). The use of T-DNA to transform plant cells has received extensive study and is amply described (European Patent Application No. EP 0 120 516; Hoekema, 1985; Knauf et al., 1983; and An et al., 1985, each of which is incorporated by reference in its entirety). For introduction into plants, the nucleic acid molecules of the presently disclosed subject matter can be inserted into binary vectors as described in the examples.

Other transformation methods are available to those skilled in the art, such as direct uptake of foreign DNA constructs (see European Patent Application No. EP 0 295 959), electroporation (Fromm et al., 1986), or high velocity ballistic bombardment of plant cells with metal particles coated with the nucleic acid constructs (Klein et al., 1987; U.S. Patent No. 4,945,050). Once transformed, the cells can be regenerated using techniques familiar to those of skill in the art.
Of particular relevance are the recently described methods to transform foreign genes into commercially important crops, such as rapeseed (De Block et al., 1989), sunflower (Everett et al., 1987), soybean (McCabe et al., 1988; Hinchee et al., 1988; Chee et al., 1989; Christou et al., 1989; European Patent Application No. EP 0 301 749), rice (Hiei et al., 1994), and corn (Gordon-Kamm et al., 1990; Fromm et al., 1990).

Of course, the choice of method might depend on the type of plant, i.e., monocotyledonous or dicotyledonous, targeted for transformation. Suitable methods of transforming plant cells include, but are not limited to, microinjection (Crossway et al., 1986), electroporation (Riggs et al., 1986), Agrobacterium-mediated transformation (Hinchee et al., 1988), direct gene transfer (Paszkowski et al., 1984), and ballistic particle acceleration using devices available from Agracetus, Inc. (Madison, Wisconsin, United States of America) and BioRad (Hercules, California, United States of America). See e.g., U.S. Patent No. 4,945,050; McCabe et al., 1988; Weissinger et al., 1988; Sanford et al., 1987 (onion); Christou et al., 1988 (soybean); McCabe et al., 1988 (soybean); Datta et al., 1990 (rice); Klein et al., 1988 (maize); Fromm et al., 1990 (maize); Gordon-Kamm et al., 1990 (maize); Svab et al., 1990 (tobacco chloroplast); Koziel et al., 1993 (maize); Shimamoto et al., 1989 (rice); Christou et al., 1991 (rice); European Patent Application EP 0 332 581 (orchardgrass and other Pooideae); Vasil et al., 1993 (wheat); Weeks et al., 1993 (wheat). In one embodiment, the protoplast transformation method for maize is employed (see European Patent Application EP 0 292 435; U.S. Patent No. 5,350,689).

2.
Vectors Suitable for Agrobacterium Transformation

Agrobacterium tumefaciens cells containing a vector comprising an expression cassette of the presently disclosed subject matter, wherein the vector comprises a Ti plasmid, are useful in methods of making transformed plants. Plant cells are infected with an Agrobacterium tumefaciens as described above to produce a transformed plant cell, and then a plant is regenerated from the transformed plant cell. Numerous Agrobacterium vector systems useful in carrying out the presently disclosed subject matter are known to those of ordinary skill in the art.

Many vectors are available for transformation using Agrobacterium tumefaciens. These typically carry at least one T-DNA border sequence and include vectors such as pBIN19 (Bevan, 1984). Below, the construction of two typical vectors suitable for Agrobacterium transformation is disclosed.

a. pCIB200 and pCIB2001

The binary vectors pCIB200 and pCIB2001 are used for the construction of recombinant vectors for use with Agrobacterium and are constructed in the following manner. pTJS75kan is created by NarI digestion of pTJS75 (Schmidhauser & Helinski, 1985) allowing excision of the tetracycline-resistance gene, followed by insertion of an AccI fragment from pUC4K carrying an NPTII sequence (Messing & Vieira, 1982; Bevan et al., 1983; McBride & Summerfelt, 1990). XhoI linkers are ligated to the EcoRV fragment of pCIB7, which contains the left and right T-DNA borders, a plant selectable nos/nptII chimeric gene and the pUC polylinker (Rothstein et al., 1987), and the XhoI-digested fragment is cloned into SalI-digested pTJS75kan to create pCIB200 (see also EP 0 332 104, example 19).
pCIB200 contains the following unique polylinker restriction sites: EcoRI, SstI, KpnI, BglII, XbaI, and SalI. pCIB2001 is a derivative of pCIB200 created by the insertion into the polylinker of additional restriction sites. Unique restriction sites in the polylinker of pCIB2001 are EcoRI, SstI, KpnI, BglII, XbaI, SalI, MluI, BclI, AvrII, ApaI, HpaI, and StuI. pCIB2001, in addition to containing these unique restriction sites, also has plant and bacterial kanamycin selection, left and right T-DNA borders for Agrobacterium-mediated transformation, the RK2-derived trfA function for mobilization between E. coli and other hosts, and the OriT and OriV functions also from RK2. The pCIB2001 polylinker is suitable for the cloning of plant expression cassettes containing their own regulatory signals.

b. pCIB10 and Hygromycin Selection Derivatives Thereof

The binary vector pCIB10 contains a gene encoding kanamycin resistance for selection in plants, T-DNA right and left border sequences, and incorporates sequences from the wide host-range plasmid pRK252 allowing it to replicate in both E. coli and Agrobacterium. Its construction is disclosed by Rothstein et al., 1987. Various derivatives of pCIB10 can be constructed which incorporate the gene for hygromycin B phosphotransferase disclosed by Gritz & Davies, 1983. These derivatives enable selection of transgenic plant cells on hygromycin only (pCIB743), or hygromycin and kanamycin (pCIB715, pCIB717).

3. Vectors Suitable for non-Agrobacterium Transformation

Transformation without the use of Agrobacterium tumefaciens circumvents the requirement for T-DNA sequences in the chosen transformation vector, and consequently vectors lacking these sequences can be utilized in addition to vectors such as the ones disclosed above that contain T-DNA sequences.
Transformation techniques that do not rely on Agrobacterium include transformation via particle bombardment, protoplast uptake (e.g., polyethylene glycol (PEG) and electroporation), and microinjection. The choice of vector depends largely on the species being transformed. Below, the construction of typical vectors suitable for non-Agrobacterium transformation is disclosed.

a. pCIB3064

pCIB3064 is a pUC-derived vector suitable for direct gene transfer techniques in combination with selection by the herbicide BASTA® (glufosinate ammonium or phosphinothricin). The plasmid pCIB246 comprises the CaMV 35S promoter in operational fusion to the E. coli β-glucuronidase (GUS) gene and the CaMV 35S transcriptional terminator and is disclosed in the PCT International Publication WO 93/07278. The 35S promoter of this vector contains two ATG sequences 5' of the start site. These sites are mutated using standard PCR techniques in such a way as to remove the ATGs and generate the restriction sites SspI and PvuII. The new restriction sites are 96 and 37 bp away from the unique SalI site and 101 and 42 bp away from the actual start site. The resultant derivative of pCIB246 is designated pCIB3025. The GUS gene is then excised from pCIB3025 by digestion with SalI and SacI, the termini rendered blunt and religated to generate plasmid pCIB3060. The plasmid pJIT82 is obtained from the John Innes Centre, Norwich, England, and the 400 bp SmaI fragment containing the bar gene from Streptomyces viridochromogenes is excised and inserted into the HpaI site of pCIB3060 (Thompson et al., 1987). This generated pCIB3064, which comprises the bar gene under the control of the CaMV 35S promoter and terminator for herbicide selection, a gene for ampicillin resistance (for selection in E. coli) and a polylinker with the unique sites SphI, PstI, HindIII, and BamHI.
This vector is suitable for the cloning of plant expression cassettes containing their own regulatory signals.

b. pSOG19 and pSOG35

pSOG35 is a transformation vector that utilizes the E. coli dihydrofolate reductase (DHFR) gene as a selectable marker conferring resistance to methotrexate. PCR is used to amplify the 35S promoter (~800 bp), intron 6 from the maize Adh1 gene (~550 bp), and 18 bp of the GUS untranslated leader sequence from pSOG10. A 250-bp fragment encoding the E. coli dihydrofolate reductase type II gene is also amplified by PCR and these two PCR fragments are assembled with a SacI-PstI fragment from pBI221 (BD Biosciences Clontech, Palo Alto, California, United States of America) that comprises the pUC19 vector backbone and the nopaline synthase terminator. Assembly of these fragments generates pSOG19 that contains the 35S promoter in fusion with the intron 6 sequence, the GUS leader, the DHFR gene, and the nopaline synthase terminator. Replacement of the GUS leader in pSOG19 with the leader sequence from Maize Chlorotic Mottle Virus (MCMV) generates the vector pSOG35. pSOG19 and pSOG35 carry the pUC gene for ampicillin resistance and have HindIII, SphI, PstI, and EcoRI sites available for the cloning of foreign substances.

4. Selectable Markers for Transformation Approaches

Methods using either a form of direct gene transfer or Agrobacterium-mediated transfer usually, but not necessarily, are undertaken with a selectable marker that can provide resistance to an antibiotic (e.g., kanamycin, hygromycin, or methotrexate) or a herbicide (e.g., phosphinothricin). The choice of selectable marker for plant transformation is not, however, critical to the presently disclosed subject matter.

For certain plant species, different antibiotic or herbicide selection markers can be employed.
Selection markers used routinely in transformation include the nptII gene, which confers resistance to kanamycin and related antibiotics (Messing & Vieira, 1982; Bevan et al., 1983), the bar gene, which confers resistance to the herbicide phosphinothricin (White et al., 1990, Spencer et al., 1990), the hph gene, which confers resistance to the antibiotic hygromycin (Blochinger & Diggelmann, 1984), and the dhfr gene, which confers resistance to methotrexate (Bourouis et al., 1983).

Selection markers resulting in positive selection, such as a phosphomannose isomerase (PMI) gene (described in PCT International Publication No. WO 93/05163) can also be used. Other genes that can be used for positive selection are described in PCT International Publication No. WO 94/20627 and encode xyloisomerases and phosphomanno-isomerases such as mannose-6-phosphate isomerase and mannose-1-phosphate isomerase; phosphomanno mutase; mannose epimerases such as those that convert carbohydrates to mannose or mannose to carbohydrates such as glucose or galactose; phosphatases such as mannose or xylose phosphatase, mannose-6-phosphatase and mannose-1-phosphatase, and permeases that are involved in the transport of mannose, or a derivative or a precursor thereof, into the cell. An agent is typically used to reduce the toxicity of the compound to the cells, and is typically a glucose derivative such as methyl-3-O-glucose or phloridzin. Transformed cells are identified without damaging or killing the non-transformed cells in the population and without co-introduction of antibiotic or herbicide resistance genes. As described in PCT International Publication No. WO 93/05163, in addition to the fact that the need for antibiotic or herbicide resistance genes is eliminated, it has been shown that the positive selection method is often far more efficient than traditional negative selection.
As noted above, one vector useful for direct gene transfer techniques in combination with selection by the herbicide BASTA® (or phosphinothricin) is pCIB3064. This vector is based on the plasmid pCIB246, which comprises the CaMV 35S promoter operatively linked to the E. coli β-glucuronidase (GUS) gene and the CaMV 35S transcriptional terminator, and is described in PCT International Publication No. WO 93/07278. One gene useful for conferring resistance to phosphinothricin is the bar gene from Streptomyces viridochromogenes (Thompson et al., 1987). This vector is suitable for the cloning of plant expression cassettes containing their own regulatory signals.

As noted above, an additional transformation vector is pSOG35, which utilizes the E. coli dihydrofolate reductase (DHFR) gene as a selectable marker conferring resistance to methotrexate. Polymerase chain reaction (PCR) was used to amplify the 35S promoter (about 800 basepairs (bp)), intron 6 from the maize Adh1 gene (about 550 bp), and 18 bp of the GUS untranslated leader sequence from pSOG10. A 250 bp fragment encoding the E. coli dihydrofolate reductase type II gene was also amplified by PCR and these two PCR fragments were assembled with a SacI-PstI fragment from pBI221 (BD Biosciences - Clontech, Palo Alto, California, United States of America), which comprised the pUC19 vector backbone and the nopaline synthase terminator. Assembly of these fragments generated pSOG19, which contains the 35S promoter in fusion with the intron 6 sequence, the GUS leader, the DHFR gene and the nopaline synthase terminator. Replacement of the GUS leader in pSOG19 with the leader sequence from Maize Chlorotic Mottle Virus (MCMV) generated the vector pSOG35. pSOG19 and pSOG35 carry the pUC-derived gene for ampicillin resistance, and have HindIII, SphI, PstI and EcoRI sites available for the cloning of foreign sequences.
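The order of elements in the pSOG19 cassette, and the leader swap that yields pSOG35, can be sketched as an ordered list. This is an illustrative model only: the class, function, and variable names are hypothetical, the lengths are the approximate values stated in the text, and elements whose length is not stated (the nopaline synthase terminator and the MCMV leader) are recorded as unknown rather than guessed.

```python
from typing import List, NamedTuple, Optional


class Element(NamedTuple):
    name: str
    approx_bp: Optional[int]  # None where the text gives no length


# pSOG19 cassette in 5'-to-3' order, as described in the text.
PSOG19: List[Element] = [
    Element("CaMV 35S promoter", 800),
    Element("maize Adh1 intron 6", 550),
    Element("GUS untranslated leader", 18),
    Element("E. coli DHFR type II gene", 250),
    Element("nopaline synthase terminator", None),
]


def replace_element(cassette: List[Element], old_name: str, new: Element) -> List[Element]:
    """Return a copy of the cassette with the named element swapped for a new one."""
    return [new if e.name == old_name else e for e in cassette]


def stated_length_bp(cassette: List[Element]) -> int:
    """Sum only the lengths that the text actually states."""
    return sum(e.approx_bp for e in cassette if e.approx_bp is not None)


# pSOG35: the GUS leader is replaced by the MCMV leader (length not given in the text).
PSOG35 = replace_element(PSOG19, "GUS untranslated leader", Element("MCMV leader", None))

print([e.name for e in PSOG35])
print(stated_length_bp(PSOG19))
```

The sum covers only the four elements with stated sizes, so it is a lower bound on the cassette length and not a measurement of the finished plasmid.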
Binary backbone vector pNOV2117 contains the T-DNA portion flanked by the right and left border sequences, and including the POSITECH™ (Syngenta Corp., Wilmington, Delaware, United States of America) plant selectable marker and the "candidate gene" gene expression cassette. The POSITECH™ plant selectable marker confers resistance to mannose and in this instance consists of the maize ubiquitin promoter driving expression of the PMI (phosphomannose isomerase) gene, followed by the cauliflower mosaic virus transcriptional terminator.

5. Vector Suitable for Chloroplast Transformation

For expression of a nucleotide sequence of the presently disclosed subject matter in plant plastids, plastid transformation vector pPH143 (PCT International Publication WO 97/32011, example 36) is used. The nucleotide sequence is inserted into pPH143, thereby replacing the protoporphyrinogen oxidase (Protox) coding sequence. This vector is then used for plastid transformation and selection of transformants for spectinomycin resistance. Alternatively, the nucleotide sequence is inserted in pPH143 so that it replaces the aadH gene. In this case, transformants are selected for resistance to PROTOX inhibitors.

6. Transformation of Plastids

In another embodiment, a nucleotide sequence of the presently disclosed subject matter is directly transformed into the plastid genome. Plastid transformation technology is described in U.S. Patent Nos. 5,451,513; 5,545,817; and 5,545,818; in PCT International Publication No. WO 95/16783; and in McBride et al., 1994. The basic technique for chloroplast transformation involves introducing regions of cloned plastid DNA flanking a selectable marker together with the gene of interest into a suitable target tissue, e.g., using biolistics or protoplast transformation (e.g., calcium chloride or PEG-mediated transformation).
The 1 to 1.5 kilobase (kb) flanking regions, termed targeting sequences, facilitate homologous recombination with the plastid genome and thus allow the replacement or modification of specific regions of the plastome. Initially, point mutations in the chloroplast 16S rRNA and rps12 genes conferring resistance to spectinomycin and/or streptomycin were utilized as selectable markers for transformation (Svab et al., 1990; Staub et al., 1992). This resulted in stable homoplasmic transformants at a frequency of approximately one per 100 bombardments of target leaves. The presence of cloning sites between these markers allowed creation of a plastid targeting vector for introduction of foreign genes (Staub et al., 1993). Substantial increases in transformation frequency are obtained by replacement of the recessive rRNA or r-protein antibiotic resistance genes with a dominant selectable marker, the bacterial aadA gene encoding the spectinomycin-detoxifying enzyme aminoglycoside 3'-adenyltransferase (Staub et al., 1993). Other selectable markers useful for plastid transformation are known in the art and encompassed within the scope of the presently disclosed subject matter. Typically, approximately 15-20 cell division cycles following transformation are required to reach a homoplastidic state.

Plastid expression, in which genes are inserted by homologous recombination into all of the several thousand copies of the circular plastid genome present in each plant cell, takes advantage of the enormous copy number advantage over nuclear-expressed genes to permit expression levels that can readily exceed 10% of the total soluble plant protein. In one embodiment, a nucleotide sequence of the presently disclosed subject matter is inserted into a plastid targeting vector and transformed into the plastid genome of a desired plant host.
Plants homoplastic for plastid genomes containing a nucleotide sequence of the presently disclosed subject matter are obtained, and are in one embodiment capable of high expression of the nucleotide sequence.

An example of plastid transformation follows. Seeds of Nicotiana tabacum c.v. 'Xanthi nc' are germinated seven per plate in a 1-inch circular array on T agar medium and bombarded 12-14 days after sowing with 1 µm tungsten particles (M10, Biorad, Hercules, California, United States of America) coated with DNA from plasmids pPH143 and pPH145 essentially as disclosed (Svab & Maliga, 1993). Bombarded seedlings are incubated on T medium for two days, after which leaves are excised and placed abaxial side up in bright light (350-500 µmol photons/m²/s) on plates of RMOP medium (Svab et al., 1990) containing 500 µg/ml spectinomycin dihydrochloride (Sigma, St. Louis, Missouri, United States of America). Resistant shoots appearing underneath the bleached leaves three to eight weeks after bombardment are subcloned onto the same selective medium, allowed to form callus, and secondary shoots isolated and subcloned. Complete segregation of transformed plastid genome copies (homoplasmicity) in independent subclones is assessed by standard techniques of Southern blotting (Sambrook & Russell, 2001). BamHI/EcoRI-digested total cellular DNA (Mettler, 1987) is separated on 1% Tris-borate-EDTA (TBE) agarose gels, transferred to nylon membranes (Amersham Biosciences, Piscataway, New Jersey, United States of America), and probed with ³²P-labeled random primed DNA sequences corresponding to a 0.7 kb BamHI/HindIII DNA fragment from pC8 containing a portion of the rps7/12 plastid targeting sequence. Homoplasmic shoots are rooted aseptically on spectinomycin-containing MS/IBA medium (McBride et al., 1994) and transferred to the greenhouse.
7. Transformation of Dicotyledons

Transformation techniques for dicotyledons are well known in the art and include Agrobacterium-based techniques and techniques that do not require Agrobacterium. Non-Agrobacterium techniques involve the uptake of exogenous genetic material directly by protoplasts or cells. This can be accomplished by PEG or electroporation-mediated uptake, particle bombardment-mediated delivery, or microinjection. Examples of these techniques are disclosed in Paszkowski et al., 1984; Potrykus et al., 1985; Reich et al., 1986; and Klein et al., 1987. In each case the transformed cells are regenerated to whole plants using standard techniques known in the art.

Agrobacterium-mediated transformation is a useful technique for transformation of dicotyledons because of its high efficiency of transformation and its broad utility with many different species. Agrobacterium transformation typically involves the transfer of the binary vector carrying the foreign DNA of interest (e.g., pCIB200 or pCIB2001) to an appropriate Agrobacterium strain, the choice of which can depend on the complement of vir genes carried by the host Agrobacterium strain either on a co-resident Ti plasmid or chromosomally (e.g., strain CIB542 for pCIB200 and pCIB2001 (Uknes et al., 1993)). The transfer of the recombinant binary vector to Agrobacterium is accomplished by a triparental mating procedure using E. coli carrying the recombinant binary vector and a helper E. coli strain that carries a plasmid such as pRK2013 and is able to mobilize the recombinant binary vector to the target Agrobacterium strain. Alternatively, the recombinant binary vector can be transferred to Agrobacterium by DNA transformation (Höfgen & Willmitzer, 1988).

Transformation of the target plant species by recombinant Agrobacterium usually involves co-cultivation of the Agrobacterium with explants from the plant and follows protocols well known in the art.
Transformed tissue is regenerated on selective medium containing the antibiotic or herbicide that corresponds to the resistance marker present between the binary plasmid T-DNA borders.
Another approach to transforming plant cells with a gene involves propelling inert or biologically active particles at plant tissues and cells. This technique is disclosed in U.S. Patent Nos. 4,945,050; 5,036,006; and 5,100,792; all to Sanford et al. Generally, this procedure involves propelling inert or biologically active particles at the cells under conditions effective to penetrate the outer surface of the cell and afford incorporation within the interior thereof. When inert particles are utilized, the vector can be introduced into the cell by coating the particles with the vector containing the desired gene. Alternatively, the target cell can be surrounded by the vector so that the vector is carried into the cell by the wake of the particle. Biologically active particles (e.g., dried yeast cells, dried bacterium, or a bacteriophage, each containing DNA sought to be introduced) can also be propelled into plant cell tissue.

8. Transformation of Monocotyledons

Transformation of most monocotyledon species has now also become routine. Exemplary techniques include direct gene transfer into protoplasts using PEG or electroporation, and particle bombardment into callus tissue. Transformations can be undertaken with a single DNA species or multiple DNA species (i.e., co-transformation), and both these techniques are suitable for use with the presently disclosed subject matter. Co-transformation can have the advantage of avoiding complete vector construction and of generating transgenic plants with unlinked loci for the gene of interest and the selectable marker, enabling the removal of the selectable marker in subsequent generations, should this be regarded as desirable. However, a disadvantage of the use of co-transformation is the less than 100% frequency with which separate DNA species are integrated into the genome (Schocher et al., 1986).
Patent Applications EP 0 292 435, EP 0 392 225, and WO 93/07278 describe techniques for the preparation of callus and protoplasts from an elite inbred line of maize, transformation of protoplasts using PEG or electroporation, and the regeneration of maize plants from transformed protoplasts. Gordon-Kamm et al., 1990 and Fromm et al., 1990 have published techniques for transformation of an A188-derived maize line using particle bombardment. Furthermore, WO 93/07278 and Koziel et al., 1993 describe techniques for the transformation of elite inbred lines of maize by particle bombardment. This technique utilizes immature maize embryos of 1.5-2.5 mm length excised from a maize ear 14-15 days after pollination and a PDS-1000He Biolistic particle delivery device (DuPont Biotechnology, Wilmington, Delaware, United States of America) for bombardment.

Transformation of rice can also be undertaken by direct gene transfer techniques utilizing protoplasts or particle bombardment. Protoplast-mediated transformation has been disclosed for Japonica-types and Indica-types (Zhang et al., 1988; Shimamoto et al., 1989; Datta et al., 1990) of rice. Both types are also routinely transformable using particle bombardment (Christou et al., 1991). Furthermore, WO 93/21335 describes techniques for the transformation of rice via electroporation. Casas et al., 1993 discloses the production of transgenic sorghum plants by microprojectile bombardment.

Patent Application EP 0 332 581 describes techniques for the generation, transformation, and regeneration of Pooideae protoplasts. These techniques allow the transformation of Dactylis and wheat. Furthermore, wheat transformation has been disclosed in Vasil et al., 1992 using particle bombardment into cells of type C long-term regenerable callus, and also by Vasil et al., 1993 and Weeks et al., 1993 using particle bombardment of immature embryos and immature embryo-derived callus.
A representative technique for wheat transformation, however, involves the transformation of wheat by particle bombardment of immature embryos and includes either a high sucrose or a high maltose step prior to gene delivery. Prior to bombardment, embryos (0.75-1 mm in length) are plated onto MS medium with 3% sucrose (Murashige & Skoog, 1962) and 3 mg/l 2,4-dichlorophenoxyacetic acid (2,4-D) for induction of somatic embryos, which is allowed to proceed in the dark. On the chosen day of bombardment, embryos are removed from the induction medium and placed onto the osmoticum (i.e., induction medium with sucrose or maltose added at the desired concentration, typically 15%). The embryos are allowed to plasmolyze for 2-3 hours and are then bombarded. Twenty embryos per target plate are typical, although not critical. An appropriate gene-carrying plasmid (such as pCIB3064 or pSOG35) is precipitated onto micrometer-size gold particles using standard procedures. Each plate of embryos is shot with the DuPont BIOLISTICS® helium device using a burst pressure of about 1000 pounds per square inch (psi) and a standard 80 mesh screen. After bombardment, the embryos are placed back into the dark to recover for about 24 hours (still on osmoticum). After 24 hours, the embryos are removed from the osmoticum and placed back onto induction medium, where they stay for about a month before regeneration. Approximately one month later, the embryo explants with developing embryogenic callus are transferred to regeneration medium (MS + 1 mg/liter NAA, 5 mg/liter GA), further containing the appropriate selection agent (10 mg/l BASTA® in the case of pCIB3064 and 2 mg/l methotrexate in the case of pSOG35). After approximately one month, developed shoots are transferred to larger sterile containers known as "GA7s", which contain half-strength MS, 2% sucrose, and the same concentration of selection agent.
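The quantities in the wheat bombardment procedure above lend themselves to a few simple bench calculations. The following Python sketch is illustrative only and is not part of the disclosed method. It assumes that the percentages quoted for sucrose or maltose are weight/volume (grams per 100 ml), which the text does not state, and all function names are hypothetical.

```python
# Illustrative bench arithmetic for the wheat bombardment protocol.
# Assumption: "15%" osmoticum means 15 g per 100 ml (w/v); the text does not say.

PSI_TO_KPA = 6.894757  # 1 pound per square inch expressed in kilopascals


def psi_to_kpa(psi):
    """Convert a burst pressure from psi to kPa."""
    return psi * PSI_TO_KPA


def grams_for_percent_wv(percent, volume_ml):
    """Grams of solute needed for a given % (w/v) in volume_ml of medium."""
    return percent / 100.0 * volume_ml


def selection_agent_mg(conc_mg_per_liter, volume_ml):
    """Milligrams of selection agent needed for volume_ml at conc_mg_per_liter."""
    return conc_mg_per_liter * volume_ml / 1000.0


if __name__ == "__main__":
    print(f"1000 psi is about {psi_to_kpa(1000):.0f} kPa")
    print(f"15% sugar in 500 ml: {grams_for_percent_wv(15, 500):.1f} g")
    print(f"10 mg/l agent in 250 ml: {selection_agent_mg(10, 250):.2f} mg")
```

The same helpers apply to the 2 mg/l methotrexate case by changing the concentration argument.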
Transformation of monocotyledons using Agrobacterium has also been disclosed. See WO 94/00977 and U.S. Patent No. 5,591,616, both of which are incorporated herein by reference. See also Negrotto et al., 2000, incorporated herein by reference. Zhao et al., 2000 specifically discloses transformation of sorghum with Agrobacterium. See also U.S. Patent No. 6,369,298.

Rice (Oryza sativa) can be used for generating transgenic plants. Various rice cultivars can be used (Hiei et al., 1994; Dong et al., 1996; Hiei et al., 1997). Also, the various media constituents disclosed below can be either varied in quantity or substituted. Embryogenic responses are initiated and/or cultures are established from mature embryos by culturing on MS-CIM medium (MS basal salts, 4.3 g/liter; B5 vitamins (200x), 5 ml/liter; sucrose, 30 g/liter; proline, 500 mg/liter; glutamine, 500 mg/liter; casein hydrolysate, 300 mg/liter; 2,4-D (1 mg/ml), 2 ml/liter; pH adjusted to 5.8 with 1 N KOH; Phytagel, 3 g/liter). Either mature embryos at the initial stages of culture response or established culture lines are inoculated and co-cultivated with the Agrobacterium tumefaciens strain LBA4404 (Agrobacterium) containing the desired vector construct. Agrobacterium is cultured from glycerol stocks on solid YPC medium (plus 100 mg/L spectinomycin and any other appropriate antibiotic) for about 2 days at 28°C. Agrobacterium is resuspended in liquid MS-CIM medium. The Agrobacterium culture is diluted to an OD600 of 0.2-0.3 and acetosyringone is added to a final concentration of 200 µM. Acetosyringone is added before mixing the solution with the rice cultures to induce Agrobacterium for DNA transfer to the plant cells. For inoculation, the plant cultures are immersed in the bacterial suspension. The liquid bacterial suspension is removed and the inoculated cultures are placed on co-cultivation medium and incubated at 22°C for two days.
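The medium recipe and inoculum preparation described above reduce to scaling and dilution arithmetic. The Python sketch below is a minimal illustration under stated assumptions, not a validated laboratory tool: it assumes optical density scales linearly with cell density over the working range, and the 100 mM acetosyringone stock used in the example is a hypothetical value that does not appear in the text. All function names are hypothetical.

```python
# Per-liter MS-CIM recipe as listed in the text: name -> (amount, unit).
MS_CIM_PER_LITER = {
    "MS basal salts": (4.3, "g"),
    "B5 vitamins (200x)": (5.0, "ml"),
    "sucrose": (30.0, "g"),
    "proline": (500.0, "mg"),
    "glutamine": (500.0, "mg"),
    "casein hydrolysate": (300.0, "mg"),
    "2,4-D (1 mg/ml stock)": (2.0, "ml"),
    "Phytagel": (3.0, "g"),
}


def scale_recipe(volume_liters):
    """Scale every component of the per-liter recipe to volume_liters."""
    return {
        name: (amount * volume_liters, unit)
        for name, (amount, unit) in MS_CIM_PER_LITER.items()
    }


def dilution_volumes(measured_od, target_od, final_volume_ml):
    """Return (culture_ml, diluent_ml) to reach target_od.

    Assumes OD is proportional to cell density (C1 * V1 = C2 * V2).
    """
    if not 0 < target_od <= measured_od:
        raise ValueError("target OD must be positive and not exceed measured OD")
    culture_ml = final_volume_ml * target_od / measured_od
    return culture_ml, final_volume_ml - culture_ml


def stock_volume_ul(final_conc_um, stock_conc_mm, final_volume_ml):
    """Microliters of stock (mM) giving final_conc_um (uM) in final_volume_ml."""
    # ml of stock = final_uM * final_ml / (stock_mM * 1000); times 1000 gives ul.
    return final_conc_um * final_volume_ml / stock_conc_mm


if __name__ == "__main__":
    for name, (amount, unit) in scale_recipe(0.5).items():
        print(f"{name}: {amount:g} {unit}")
    print(dilution_volumes(measured_od=1.0, target_od=0.25, final_volume_ml=40))
    # Hypothetical 100 mM stock; 200 uM final concentration is from the text.
    print(stock_volume_ul(final_conc_um=200, stock_conc_mm=100, final_volume_ml=50))
```

The pH adjustment to 5.8 is omitted from the recipe table because it is a titration step rather than a fixed quantity.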
The cultures are then transferred to MS-CIM medium with ticarcillin (400 mg/liter) to inhibit the growth of Agrobacterium. For constructs utilizing the PMI selectable marker gene (Reed et al., 2001), cultures are transferred to selection medium containing mannose as a carbohydrate source (MS with 2% mannose, 300 mg/liter ticarcillin) after 7 days, and cultured for 3-4 weeks in the dark. Resistant colonies are then transferred to regeneration induction medium (MS with no 2,4-D, 0.5 mg/liter IAA, 1 mg/liter zeatin, 200 mg/liter TIMENTIN®, 2% mannose, and 3% sorbitol) and grown in the dark for 14 days. Proliferating colonies are then transferred to another round of regeneration induction media and moved to the light growth room. Regenerated shoots are transferred to GA7 containers with GA7-1 medium (MS with no hormones and 2% sorbitol) for 2 weeks and then moved to the greenhouse when they are large enough and have adequate roots. Plants are transplanted to soil in the greenhouse (T0 generation) and grown to maturity, and the T1 seed is harvested.

E. Growth and Screening of Transformed Cells

Transgenic plant cells are then placed in an appropriate selective medium for selection of transgenic cells, which are then grown to callus. Shoots are grown from callus and plantlets generated from the shoot by growing in rooting medium. The various constructs normally are joined to a marker for selection in plant cells. Conveniently, the marker can be resistance to a biocide (for example, an antibiotic including, but not limited to, kanamycin, G418, bleomycin, hygromycin, chloramphenicol, herbicide, or the like). The particular marker used is designed to allow for the selection of transformed cells (as compared to cells lacking the DNA that has been introduced).
Components of DNA constructs including transcription cassettes of the presently disclosed subject matter are prepared from sequences that are native (endogenous) or foreign (exogenous) to the host. As used herein, the terms "foreign" and "exogenous" refer to sequences that are not found in the wild-type host into which the construct is introduced, or alternatively, have been isolated from the host species and incorporated into an expression vector. Heterologous constructs contain in one embodiment at least one region that is not native to the gene from which the transcription initiation region is derived.

To confirm the presence of the transgenes in transformed cells and plants, a variety of assays can be performed. Such assays include, for example, "molecular biological" assays well known to those of skill in the art, such as Southern and Northern blotting, in situ hybridization, and nucleic acid-based amplification methods such as PCR or RT-PCR; "biochemical" assays, such as detecting the presence of a protein product, e.g., by immunological means (enzyme-linked immunosorbent assays (ELISAs) and Western blots) or by enzymatic function; plant part assays, such as seed assays; and also by analyzing the phenotype of the whole regenerated plant, e.g., for disease or pest resistance.
DNA can be isolated from cell lines or any plant parts to determine the presence of the preselected nucleic acid segment through the use of techniques well known to those skilled in the art. Note that intact sequences will not always be present, presumably due to rearrangement or deletion of sequences in the cell.

The presence of nucleic acid elements introduced through the methods of the presently disclosed subject matter can be determined by the polymerase chain reaction (PCR). Using this technique, discrete fragments of nucleic acid are amplified and detected by gel electrophoresis. This type of analysis permits one to determine whether a preselected nucleic acid segment is present in a stable transformant. It is contemplated that using PCR techniques it would be possible to clone fragments of the host genomic DNA adjacent to an introduced preselected DNA segment.

Positive proof of DNA integration into the host genome and the independent identities of transformants can be determined using the technique of Southern hybridization. Using this technique, specific DNA sequences that are introduced into the host genome and flanking host DNA sequences can be identified. Hence, the Southern hybridization pattern of a given transformant serves as an identifying characteristic of that transformant. In addition, it is possible through Southern hybridization to demonstrate the presence of introduced preselected DNA segments in high molecular weight DNA, e.g., to confirm that the introduced preselected DNA segment has been integrated into the host cell genome. Southern hybridization provides certain information that can also be obtained using PCR, e.g., the presence of a preselected DNA segment, but can also demonstrate integration of an exogenous nucleic acid molecule into the genome and can characterize each individual transformant.
It is contemplated that using the techniques of dot or slot blot hybridization, which are modifications of Southern hybridization techniques, the same information that is derived from PCR could be obtained (e.g., the presence of a preselected DNA segment).
Both PCR and Southern hybridization techniques can be used to demonstrate transmission of a preselected DNA segment to progeny. In most instances, the characteristic Southern hybridization pattern for a given transformant will segregate in progeny as one or more Mendelian genes (Spencer et al., 1992; Laursen et al., 1994), indicating stable inheritance of the gene. The non-chimeric nature of the callus and the parental transformants (R0) can be suggested by germline transmission and the identical Southern blot hybridization patterns and intensities of the transforming DNA in callus, R0 plants, and R1 progeny that segregated for the transformed gene.

Whereas certain DNA analysis techniques can be conducted using DNA isolated from any part of a plant, specific RNAs might only be expressed in particular cells or tissue types and hence it can be necessary to prepare RNA for analysis from these tissues. PCR techniques can also be used for detection and quantitation of RNA produced from introduced preselected DNA molecules. In this application of PCR, it is first necessary to reverse transcribe RNA into complementary DNA (cDNA) using an enzyme such as a reverse transcriptase, and then through the use of conventional PCR techniques, to amplify the resulting cDNA.

In some instances, PCR techniques might not demonstrate the integrity of the RNA product. Further information about the nature of the RNA product can be obtained by Northern blotting. This technique demonstrates the presence of an RNA species and additionally gives information about the integrity of that RNA. The presence or absence of an RNA species can also be determined using dot or slot blot Northern hybridizations using techniques known in the art. These techniques are modifications of Northern blotting and typically demonstrate only the presence or absence of an RNA species.
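Segregation of a transgene in progeny as a Mendelian gene, as described above, is commonly checked with a chi-square goodness-of-fit test. The Python sketch below is one way such a check might be done and is not taken from the disclosure. It assumes a single dominant locus in a selfed hemizygous parent, giving an expected 3:1 ratio of transgene-positive to transgene-negative progeny, and it uses the standard critical value of 3.841 for one degree of freedom at the 0.05 significance level. The function names and the example counts are hypothetical.

```python
CHI2_CRITICAL_DF1_P05 = 3.841  # chi-square critical value, 1 d.f., alpha = 0.05


def chi_square_segregation(positive, negative, expected_ratio=(3, 1)):
    """Chi-square statistic for observed counts against an expected ratio."""
    total = positive + negative
    if total <= 0:
        raise ValueError("at least one progeny plant must be scored")
    ratio_sum = sum(expected_ratio)
    expected_pos = total * expected_ratio[0] / ratio_sum
    expected_neg = total * expected_ratio[1] / ratio_sum
    return ((positive - expected_pos) ** 2 / expected_pos
            + (negative - expected_neg) ** 2 / expected_neg)


def fits_expected_ratio(positive, negative, expected_ratio=(3, 1)):
    """True if the counts do not differ significantly from the expected ratio."""
    statistic = chi_square_segregation(positive, negative, expected_ratio)
    return statistic < CHI2_CRITICAL_DF1_P05


if __name__ == "__main__":
    # Hypothetical scoring of 100 progeny by PCR or Southern hybridization.
    print(chi_square_segregation(78, 22), fits_expected_ratio(78, 22))
    print(chi_square_segregation(60, 40), fits_expected_ratio(60, 40))
```

A backcross to a non-transgenic parent would instead be tested against a 1:1 ratio by passing expected_ratio=(1, 1).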
Thus, Southern blotting and PCR can be used to detect the presence of a DNA molecule of interest. Expression can be evaluated by specifically identifying the protein products of the introduced preselected DNA segments or evaluating the phenotypic changes brought about by their expression.

Assays for the production and identification of specific proteins can make use of physical-chemical, structural, functional, or other properties of the proteins. Unique physical-chemical or structural properties allow the proteins to be separated and identified by electrophoretic procedures, such as native or denaturing gel electrophoresis or isoelectric focusing, or by chromatographic techniques such as ion exchange or gel exclusion chromatography. The unique structures of individual proteins offer opportunities for use of specific antibodies to detect the presence of individual proteins using art-recognized techniques such as an ELISA. Combinations of approaches can be employed to gain additional information, such as Western blotting, in which antibodies are used to locate individual gene products that have been separated by electrophoretic techniques and transferred to a solid support. Additional techniques can be employed to confirm the identity of the product of interest, such as evaluation by amino acid sequencing following purification. Although these are among the most commonly employed, other procedures known to the skilled artisan can also be used.

Assay procedures can also be used to identify the expression of proteins by their functions, especially the ability of enzymes to catalyze specific chemical reactions involving specific substrates and products. These reactions can be followed by providing the substrates and quantifying their loss, or the generation of reaction products, by physical or chemical procedures.
Examples are as varied as the enzyme to be analyzed, and are known in the art for many different enzymes.

The expression of a gene product can also be determined by evaluating the phenotypic results of its expression. These assays also can take many forms including, but not limited to, analyzing changes in the chemical composition, morphology, or physiological properties of the plant. Morphological changes can include greater stature or thicker stalks.
Changes in the response of plants or plant parts to imposed treatments are typically evaluated under carefully controlled conditions termed bioassays.

As such, protein expression levels can be measured by any standard method. For example, antibodies (monoclonal or polyclonal) that specifically bind to a stress-related protein of the presently disclosed subject matter can be generated by standard methods (see methods for making antibodies in, e.g., Ausubel et al., 1988, including updates up to 2002; Harlow & Lane, 1988). Using such a stress-related protein-specific antibody, protein levels can be determined by any immunological method including, without limitation, Western blotting, immunoprecipitation, and ELISA.

Another non-limiting method for measuring protein level is by measuring mRNA levels. For example, total mRNA can be isolated from a cell into which a nucleic acid molecule of the presently disclosed subject matter (or an antisense of such a nucleic acid molecule) has been introduced, and from an untreated cell. Northern blotting analysis using the nucleic acid molecule that was introduced to the treated cell as a probe can indicate if the treated cell expresses the nucleic acid molecule at a different level (at both the mRNA and polypeptide levels) as compared to the untreated cell.

Changes in stress response (either in unchallenged cells and plants, or in cells and plants challenged with, for example, exposure to salt or pathogen infection) can be readily determined by any standard method, such as cell counting. For example, cells can be manually counted using a hemacytometer or microscope. Callus growth and plant growth can be measured by weight and/or height. Individual cell growth can be determined by any standard stress response assay (e.g., ³H incorporation).
The presently disclosed subject matter further includes the manipulation of stress response by modulation of the expression of more than one of the stress-related proteins described herein. For example, an increase in the level of expression of a first stress-related protein coupled with a decrease in the level of expression of a second stress-related protein can result in a greater change in the stress response of a cell (or plant including such a cell) than either the increase in the level of expression of the first stress-related protein or the decrease in the level of expression of the second stress-related protein alone. The presently disclosed subject matter has provided numerous stress-related proteins and their interrelations with one another. Manipulation of expression of one or more of the stress-related proteins of the presently disclosed subject matter enables the development of genetically engineered plants (i.e., transgenic plants) that have superior stress response under stress (e.g., biotic or abiotic stress).

V. Plants, Breeding, and Seed Production

A. Plants

A host cell is any type of cell including, without limitation, a bacterial cell, a yeast cell, a plant cell, an insect cell, and a mammalian cell. Numerous such cells are commercially available, for example, from the American Type Culture Collection, Manassas, Virginia, United States of America.

In certain embodiments, the cell is a plant cell, which can be regenerated to form a transgenic plant. Thus, the presently disclosed subject matter provides a transformed (transgenic) plant cell, in planta or ex planta, including a transformed plastid or other organelle (e.g., nucleus, mitochondria or chloroplast).
As used herein, a "transgenic plant" is a plant having one or more plant cells that contain an exogenous nucleic acid molecule (e.g., a nucleic acid molecule encoding a stress-related polypeptide of the presently disclosed subject matter). Thus, a transgenic plant can comprise a nucleic acid molecule comprising a foreign nucleic acid sequence (i.e., a nucleic acid sequence derived from a different plant species). Alternatively or in addition, a transgenic plant can comprise a nucleic acid molecule comprising a nucleic acid sequence from the same plant species, wherein the nucleic acid sequence has been isolated from that plant species. In the latter example, the nucleic acid sequence can be the same or different from the wild-type sequence, and can optionally include regulatory sequences that are the same or different from those that are found in the naturally occurring plant.

The presently disclosed subject matter can be used for transforming cells of any plant species, including, but not limited to, corn (Zea mays), Brassica sp. (e.g., B. napus, B. rapa, B.
juncea), particularly those Brassica species useful as sources of seed oil, alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum vulgare), millet (e.g., pearl millet (Pennisetum glaucum), proso millet (Panicum miliaceum), foxtail millet (Setaria italica), finger millet (Eleusine coracana)), sunflower (Helianthus annuus), safflower (Carthamus tinctorius), wheat (Triticum aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanut (Arachis hypogaea), cotton (Gossypium barbadense, Gossypium hirsutum), sweet potato (Ipomoea batatas), cassava (Manihot esculenta), coffee (Coffea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa spp.), avocado (Persea americana), fig (Ficus carica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia integrifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), sugarcane (Saccharum spp.), oats, duckweed (Lemna), barley, vegetables, ornamentals, and conifers.

Duckweed (Lemna, see PCT International Publication No. WO 00/07210) includes members of the family Lemnaceae. Four genera and 34 species of duckweed are known, as follows: genus Lemna (L. aequinoctialis, L. disperma, L. ecuadoriensis, L. gibba, L. japonica, L. minor, L. miniscula, L. obscura, L. perpusilla, L. tenera, L. trisulca, L. turionifera, L. valdiviana); genus Spirodela (S. intermedia, S. polyrrhiza, S. punctata); genus Wolffia (Wa. angusta, Wa. arrhiza, Wa. australina, Wa. borealis, Wa. brasiliensis, Wa. columbiana, Wa. elongata, Wa. globosa, Wa.
microscopica, Wa. neglecta) and genus Wolffiella (Wl. ultila, Wl. ultilanen, Wl. gladiata, Wl. ultila, Wl. lingulata, Wl. repunda, Wl. rotunda, and Wl. neotropica). Any other genera or species of Lemnaceae, if they exist, are also aspects of the presently disclosed subject matter. In one embodiment, Lemna gibba is employed in the presently disclosed subject matter, and in other embodiments, Lemna minor and Lemna miniscula are employed. Lemna species can be classified using the taxonomic scheme described by Landolt, 1986.

Vegetables within the scope of the presently disclosed subject matter include tomatoes (Lycopersicon esculentum), lettuce (e.g., Lactuca sativa), green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis), peas (Lathyrus spp.), and members of the genus Cucumis such as cucumber (C. sativus), cantaloupe (C. cantalupensis), and musk melon (C. melo). Ornamentals include azalea (Rhododendron spp.), hydrangea (Hydrangea macrophylla), hibiscus (Hibiscus rosa-sinensis), roses (Rosa spp.), tulips (Tulipa spp.), daffodils (Narcissus spp.), petunias (Petunia hybrida), carnations (Dianthus caryophyllus), poinsettias (Euphorbia pulcherrima), and chrysanthemums. Conifers that can be employed in practicing the presently disclosed subject matter include, for example, pines such as loblolly pine (Pinus taeda), slash pine (Pinus elliottii), ponderosa pine (Pinus ponderosa), lodgepole pine (Pinus contorta), and Monterey pine (Pinus radiata); Douglas fir (Pseudotsuga menziesii); Western hemlock (Tsuga ultilane); Sitka spruce (Picea glauca); redwood (Sequoia sempervirens); true firs such as silver fir (Abies amabilis) and balsam fir (Abies balsamea); and cedars such as Western red cedar (Thuja plicata) and Alaska yellow-cedar (Chamaecyparis nootkatensis).

Leguminous plants that can be employed in the presently disclosed subject matter include beans and peas.
Representative beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean, lima bean, fava bean, lentils, chickpea, etc. Legumes include, but are not limited to, Arachis (e.g., peanuts), Vicia (e.g., crown vetch, hairy vetch, adzuki bean, mung bean, and chickpea), Lupinus (e.g., lupine, trifolium), Phaseolus (e.g., common bean and lima bean), Pisum (e.g., field bean), Melilotus (e.g., clover), Medicago (e.g., alfalfa), Lotus (e.g., trefoil), Lens (e.g., lentil), and false indigo. Non-limiting examples of forage and turf grasses for use in the methods of the presently disclosed subject matter include alfalfa, orchard grass, tall fescue, perennial ryegrass, creeping bent grass, and redtop.
Other plants within the scope of the presently disclosed subject matter include Acacia, aneth, artichoke, arugula, blackberry, canola, cilantro, clementines, escarole, eucalyptus, fennel, grapefruit, honey dew, jicama, kiwifruit, lemon, lime, mushroom, nut, okra, orange, parsley, persimmon, plantain, pomegranate, poplar, radiata pine, radicchio, Southern pine, sweetgum, tangerine, triticale, vine, yams, apple, pear, quince, cherry, apricot, melon, hemp, buckwheat, grape, raspberry, chenopodium, blueberry, nectarine, peach, plum, strawberry, watermelon, eggplant, pepper, cauliflower, Brassica (e.g., broccoli, cabbage, Brussels sprouts), onion, carrot, leek, beet, broad bean, celery, radish, pumpkin, endive, gourd, garlic, snapbean, spinach, squash, turnip, ultilane, and zucchini.
Ornamental plants within the scope of the presently disclosed subject matter include impatiens, Begonia, Pelargonium, Viola, Cyclamen, Verbena, Vinca, Tagetes, Primula, Saintpaulia, Ageratum, Amaranthus, Antirrhinum, Aquilegia, Cineraria, Clover, Cosmos, Cowpea, Dahlia, Datura, Delphinium, Gerbera, Gladiolus, Gloxinia, Hippeastrum, Mesembryanthemum, Salpiglossis, and Zinnia.
In certain embodiments, transgenic plants of the presently disclosed subject matter are crop plants and in particular cereals. Such crop plants and cereals include, but are not limited to, corn, alfalfa, sunflower, rice, Brassica, canola, soybean, barley, sugarbeet, cotton, safflower, peanut, sorghum, wheat, millet, and tobacco.
The presently disclosed subject matter also provides plants comprising the disclosed compositions. In one embodiment, the plant is characterized by a modification of a phenotype or measurable characteristic of the plant, the modification being attributable to the expression cassette. In one embodiment, the modification involves, for example, nutritional enhancement, increased nutrient uptake efficiency, enhanced production of endogenous compounds, or production of heterologous compounds. In another embodiment, the modification includes having increased or decreased resistance to an herbicide, an abiotic stress, or a pathogen. In another embodiment, the modification includes having enhanced or diminished requirement for light, water, nitrogen, or trace elements. In another embodiment, the modification includes being enriched for an essential amino acid as a proportion of a polypeptide fraction of the plant. In another embodiment, the polypeptide fraction can be, for example, total seed polypeptide, soluble polypeptide, insoluble polypeptide, water-extractable polypeptide, or lipid-associated polypeptide. In another embodiment, the modification includes overexpression, underexpression, antisense modulation, sense suppression, inducible expression, inducible repression, or inducible modulation of a gene.
B. Breeding
The plants obtained via transformation with a nucleic acid sequence of the presently disclosed subject matter can be any of a wide variety of plant species, including monocots and dicots; however, the plants used in the method for the presently disclosed subject matter are selected in one embodiment from the list of agronomically important target crops set forth hereinabove. The expression of a gene of the presently disclosed subject matter in combination with other characteristics important for production and quality can be incorporated into plant lines through breeding. Breeding approaches and techniques are known in the art. See e.g., Welsh, 1981; Wood, 1983; Mayo, 1987; Singh, 1986; Wricke & Weber, 1986.
The genetic properties engineered into the transgenic seeds and plants disclosed above are passed on by sexual reproduction or vegetative growth and can thus be maintained and propagated in progeny plants. Generally, the maintenance and propagation make use of known agricultural methods developed to fit specific purposes such as tilling, sowing, or harvesting. Specialized processes such as hydroponics or greenhouse technologies can also be applied. As the growing crop is vulnerable to attack and damage caused by insects or infections as well as to competition by weed plants, measures are undertaken to control weeds, plant diseases, insects, nematodes, and other adverse conditions to improve yield. These include mechanical measures such as tillage of the soil or removal of weeds and infected plants, as well as the application of agrochemicals such as herbicides, fungicides, gametocides, nematicides, growth regulants, ripening agents, and insecticides.
Use of the advantageous genetic properties of the transgenic plants and seeds according to the presently disclosed subject matter can further be made in plant breeding, which aims at the development of plants with improved properties such as tolerance of pests, herbicides, or biotic or abiotic stress, improved nutritional value, increased yield or proliferation, or improved structure causing less loss from lodging or shattering. The various breeding steps are characterized by well-defined human intervention such as selecting the lines to be crossed, directing pollination of the parental lines, or selecting appropriate progeny plants.
Depending on the desired properties, different breeding measures are taken. The relevant techniques are well known in the art and include, but are not limited to, hybridization, inbreeding, backcross breeding, multiline breeding, variety blend, interspecific hybridization, aneuploid techniques, etc. Hybridization techniques can also include the sterilization of plants to yield male or female sterile plants by mechanical, chemical, or biochemical means. Cross-pollination of a male sterile plant with pollen of a different line assures that the genome of the male sterile but female fertile plant will uniformly obtain properties of both parental lines. Thus, the transgenic seeds and plants according to the presently disclosed subject matter can be used for the breeding of improved plant lines that, for example, increase the effectiveness of conventional methods such as herbicide or pesticide treatment or allow one to dispense with said methods due to their modified genetic properties. Alternatively, new crops with improved stress tolerance can be obtained, which, due to their optimized genetic "equipment", yield harvested product of better quality than products that were not able to tolerate comparable adverse developmental conditions (for example, drought).
The presently disclosed subject matter also provides a transgenic plant, a seed from such a plant, and progeny plants from such a plant including hybrids and inbreds. In representative embodiments, transgenic plants are transgenic maize, soybean, barley, alfalfa, sunflower, canola, cotton, peanut, sorghum, tobacco, sugarbeet, rice, wheat, rye, turfgrass, millet, sugarcane, tomato, or potato.
A transformed (transgenic) plant of the presently disclosed subject matter includes a plant, the genome of which is augmented by an exogenous nucleic acid molecule, or in which a gene has been disrupted, e.g., to result in a loss, a decrease, or an alteration in the function of the product encoded by the gene, which plant can also have increased yields and/or produce a better-quality product than the corresponding wild-type plant. The nucleic acid molecules of the presently disclosed subject matter are thus useful for targeted gene disruption, as well as for use as markers and probes.
The presently disclosed subject matter also provides a method of plant breeding, e.g., to prepare a crossed fertile transgenic plant. The method comprises crossing a fertile transgenic plant comprising a particular nucleic acid molecule of the presently disclosed subject matter with itself or with a second plant, e.g., one lacking the particular nucleic acid molecule, to prepare the seed of a crossed fertile transgenic plant comprising the particular nucleic acid molecule. The seed is then planted to obtain a crossed fertile transgenic plant. The plant can be a monocot or a dicot. In a particular embodiment, the plant is a cereal plant.
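The expected outcome of crossing a transgenic plant with itself or with a second plant lacking the nucleic acid molecule can be illustrated with a small calculation. The following Python sketch assumes, purely for illustration, that the nucleic acid molecule is present as a single insertion that segregates as a simple Mendelian locus. The genotype notation ("T" for a chromosome carrying the insertion, "-" for one lacking it) and the function name are hypothetical and are not part of the disclosure.

```python
from fractions import Fraction
from itertools import product


def progeny_carrying_transgene(parent1: str, parent2: str) -> Fraction:
    """Expected fraction of progeny that inherit at least one transgene copy.

    Each parent is written as a two-character genotype:
    'T-' (hemizygous), 'TT' (homozygous), or '--' (non-transgenic).
    Assumes a single insertion locus and equal transmission of both alleles.
    """
    # Enumerate the four equally likely gamete combinations.
    combos = list(product(parent1, parent2))
    carriers = sum(1 for a, b in combos if "T" in (a, b))
    return Fraction(carriers, len(combos))


# Selfing a hemizygous plant, and crossing it to a plant lacking the transgene.
selfed = progeny_carrying_transgene("T-", "T-")
outcrossed = progeny_carrying_transgene("T-", "--")
```

Under these assumptions, three quarters of the selfed progeny and half of the outcrossed progeny would be expected to carry the nucleic acid molecule; multiple insertions or unequal transmission would change these ratios.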
The crossed fertile transgenic plant can have the particular nucleic acid molecule inherited through a female parent or through a male parent. The second plant can be an inbred plant. The crossed fertile transgenic plant can be a hybrid. Also included within the presently disclosed subject matter are seeds of any of these crossed fertile transgenic plants.
C. Seed Production
Some embodiments of the presently disclosed subject matter also provide seed and isolated product from plants that comprise an expression cassette comprising a promoter sequence operatively linked to an isolated nucleic acid as disclosed herein. In some embodiments, the isolated nucleic acid molecule is selected from the group consisting of:
a. a nucleic acid molecule encoding a polypeptide comprising an amino acid sequence of one of even numbered SEQ ID NOs: 2-112;
b. a nucleic acid molecule comprising a nucleic acid sequence of one of odd numbered SEQ ID NOs: 1-111;
c. a nucleic acid molecule that has a nucleic acid sequence at least 90% identical to the nucleic acid sequence of the nucleic acid molecule of (a) or (b);
d. a nucleic acid molecule that hybridizes to (a) or (b) under conditions of hybridization selected from the group consisting of:
i. 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM ethylenediamine tetraacetic acid (EDTA) at 50°C with a final wash in 2X standard saline citrate (SSC), 0.1% SDS at 50°C;
ii. 7% SDS, 0.5 M NaPO4, 1 mM EDTA at 50°C with a final wash in 1X SSC, 0.1% SDS at 50°C;
iii. 7% SDS, 0.5 M NaPO4, 1 mM EDTA at 50°C with a final wash in 0.5X SSC, 0.1% SDS at 50°C;
iv. 7% SDS, 0.5 M NaPO4, 1 mM EDTA at 50°C with a final wash in 0.1X SSC, 0.1% SDS at 50°C; and
v. 7% SDS, 0.5 M NaPO4, 1 mM EDTA at 50°C with a final wash in 0.1X SSC, 0.1% SDS at 65°C;
e. a nucleic acid molecule comprising a nucleic acid sequence fully complementary to (a); and
f. a nucleic acid molecule comprising a nucleic acid sequence that is the full reverse complement of (a).
In one embodiment the isolated product comprises an enzyme, a nutritional polypeptide, a structural polypeptide, an amino acid, a lipid, a fatty acid, a polysaccharide, a sugar, an alcohol, an alkaloid, a carotenoid, a propanoid, a steroid, a pigment, a vitamin, or a plant hormone.
Embodiments of the presently disclosed subject matter also relate to isolated products produced by expression of an isolated nucleic acid containing a nucleotide sequence selected from the group consisting of:
(a) a nucleotide sequence that hybridizes under conditions of hybridization of 45°C in 1 M NaCl, followed by a final washing step at 50°C in 0.1 M NaCl, to a nucleotide sequence listed in odd numbered sequences of SEQ ID NOs: 1-185, or a fragment, domain, or feature thereof;
(b) a nucleotide sequence encoding a polypeptide that is an ortholog of a polypeptide listed in even numbered sequences of SEQ ID NOs: 2-186, or a fragment, domain, or feature thereof;
(c) a nucleotide sequence complementary (for example, fully complementary) to (a) or (b); and
(d) a nucleotide sequence that is the reverse complement (for example, its full reverse complement) of (a) or (b) according to the present disclosure.
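The sequence relationships recited in the lists above (a complementary sequence, a reverse complement, and a sequence at least 90% identical to a reference) can be illustrated with a short Python sketch. It is a minimal illustration only: the function names and the example sequences are hypothetical, and percent identity is computed here over two pre-aligned, equal-length strings, which is one of several conventions in use and is not stated in the disclosure.

```python
# Translation table for DNA bases (upper- and lower-case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def complement(seq: str) -> str:
    """Base-by-base complement, read in the same direction as the input."""
    return seq.translate(_COMPLEMENT)


def reverse_complement(seq: str) -> str:
    """Complement read in the opposite direction (the other strand, 5' to 3')."""
    return complement(seq)[::-1]


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns with identical residues.

    Both inputs must already be aligned and of equal length; gap
    characters ('-') never count as matches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


# Hypothetical 10-base sequences differing at one position.
reference = "ATGCATGCAT"
candidate = "ATGCATGCAA"
meets_threshold = percent_identity(reference, candidate) >= 90.0
```

In practice the alignment itself would come from a dedicated alignment program; the sketch only shows how the identity threshold is applied once an alignment exists.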
In one embodiment, the product is produced in a plant. In another embodiment, the product is produced in cell culture. In another embodiment, the product is produced in a cell-free system. In one embodiment, the product comprises an enzyme, a nutritional polypeptide, a structural polypeptide, an amino acid, a lipid, a fatty acid, a polysaccharide, a sugar, an alcohol, an alkaloid, a carotenoid, a propanoid, a steroid, a pigment, a vitamin, or a plant hormone. In another embodiment, the product is a polypeptide comprising an amino acid sequence listed in even numbered sequences of SEQ ID NOs: 2-112, or an ortholog thereof. In one embodiment, the polypeptide comprises an enzyme.
In seed production, germination quality and uniformity of seeds are essential product characteristics. As it is difficult to keep a crop free from other crop and weed seeds, to control seedborne diseases, and to produce seed with good germination, fairly extensive and well-defined seed production practices have been developed by seed producers who are experienced in the art of growing, conditioning, and marketing of pure seed. Thus, it is common practice for the farmer to buy certified seed meeting specific quality standards instead of using seed harvested from his own crop. Propagation material to be used as seeds is customarily treated with a protectant coating comprising herbicides, insecticides, fungicides, bactericides, nematicides, molluscicides, or mixtures thereof. Customarily used protectant coatings comprise compounds such as captan, carboxin, thiram (tetramethylthiuram disulfide; TMTD®; available from R. T. Vanderbilt Company, Inc., Norwalk, Connecticut, United States of America), metalaxyl (APRON XL®; available from Syngenta Corp., Wilmington, Delaware, United States of America), and pirimiphos-methyl (ACTELLIC®; available from Agriliance, LLC, St. Paul, Minnesota, United States of America).
If desired, these compounds are formulated together with further carriers, surfactants, and/or application-promoting adjuvants customarily employed in the art of formulation to provide protection against damage caused by bacterial, fungal, or animal pests. The protectant coatings can be applied by impregnating propagation material with a liquid formulation or by coating with a combined wet or dry formulation. Other methods of application are also possible, such as treatment directed at the buds or the fruit.
The presently disclosed subject matter will be further described by reference to the following detailed examples. These examples are provided for purposes of illustration only, and are not intended to be limiting unless otherwise specified.
Examples
The following Examples have been included to illustrate modes of the presently disclosed subject matter. In light of the present disclosure and the general level of skill in the art, those of skill will appreciate that the following Examples are intended to be exemplary only and that numerous changes, modifications, and alterations can be employed without departing from the scope of the presently disclosed subject matter.
Example 1
The example describes the identification and characterization of rice proteins that interact at the thylakoid of chloroplasts and other cellular membranes. Specifically, described in this example are newly characterized rice proteins interacting with the rice 14-3-3 protein homolog GF14-c (OsGF14-c) and with Defender Against Apoptotic Death 1 (OsDAD1).
The 14-3-3 proteins (reviewed in Muslin & Xing, 2000) interact with a variety of regulators of cellular signaling, cell cycle, and apoptosis by binding to their partner proteins. The high potential for specific protein-protein interactions makes these proteins suitable for two-hybrid assays. The 14-3-3 proteins are known to participate in protein complexes within the nucleus and are commonly found in the cytoplasm. Studies using yeast two-hybrid assays have also localized GF14 isoforms to the chloroplast stroma and the stromal side of thylakoid membranes (Sehnke et al., 2000). However, the subcellular localization of GF14-c had not been directly assessed to date. Investigation of the protein interactions involving OsGF14-c can lead to the identification of its location within the cell.
OsDAD1 is encoded by the rice homolog of the highly conserved DAD gene, a suppressor of endogenous programmed cell death, or apoptosis, in animals and plants (Apte et al., 1995; Gallois et al., 1997). In support of this role for DAD, expression of a DAD plant homolog has been shown to be down-regulated during flower petal senescence (an example of programmed cell death) and by the plant hormone ethylene, which is associated with a variety of stress responses and developmental processes (Orzaez & Granell, 1997). While these studies have been conducted with DAD homologs from Arabidopsis and pea, the rice DAD1 has not been described in the literature. The interaction studies provided below were aimed at further characterizing this protein.
An automated, high-throughput yeast two-hybrid assay technology (as described above) was used to search for rice proteins that interacted with the bait proteins OsGF14-c and OsDAD1. The sequences encoding the protein fragments used in the search were then compared by BLAST analysis against databases to determine the sequences of the full-length genes.
The proteins found appear to be localized to the thylakoid of chloroplasts, the vacuolar membrane, and the plasma membrane. The results indicate that OsGF14-c is a membrane component in rice. The subset of proteins interacting with OsGF14-c at the thylakoid forms a novel chloroplast protein complex involved in the photosynthetic processes. This interaction study also identifies the rice OsDAD1 as a membrane protein, in agreement with previously characterized DAD homologs from other species. Elucidation of the role of proteins interacting at the thylakoid and other cellular membranes in rice chloroplasts can allow the development of herbicides specifically targeted to disrupting the structure and function of the thylakoid or endomembrane system.
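The step of comparing prey fragment sequences against databases by BLAST can be illustrated with a short Python sketch that keeps, for each query, the best-scoring hit below an expectation-value cutoff. This is only a sketch under stated assumptions: it assumes the hits are available as tab-separated text in the standard 12-column tabular layout of current BLAST+ releases, which may differ from the software actually used in this example, and the cutoff, clone names, and database identifiers are hypothetical.

```python
import csv
import io

# Standard 12 columns of BLAST+ tabular output (an assumption about the input).
FIELDS = ["qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
          "qstart", "qend", "sstart", "send", "evalue", "bitscore"]


def best_hits(tabular_text: str, max_evalue: float = 1e-10) -> dict:
    """Return {query: (subject, evalue, bitscore)} for the top hit per query."""
    best = {}
    for row in csv.reader(io.StringIO(tabular_text), delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank and comment lines
        hit = dict(zip(FIELDS, row))
        evalue, bitscore = float(hit["evalue"]), float(hit["bitscore"])
        if evalue > max_evalue:
            continue  # match too weak to be informative
        current = best.get(hit["qseqid"])
        if current is None or bitscore > current[2]:
            best[hit["qseqid"]] = (hit["sseqid"], evalue, bitscore)
    return best


# Hypothetical hits for two prey clones (values invented for illustration).
rows = [
    ["prey_clone_1", "db_seq_A", "98.5", "130", "2", "0", "1", "130", "27", "156", "1e-60", "250"],
    ["prey_clone_1", "db_seq_B", "60.0", "100", "40", "0", "1", "100", "5", "104", "1e-5", "45"],
    ["prey_clone_2", "db_seq_C", "91.0", "88", "8", "0", "1", "88", "1", "88", "4e-18", "90"],
]
example = "\n".join("\t".join(r) for r in rows)
hits = best_hits(example)
```

A smaller expectation value indicates a match less likely to arise by chance, so the cutoff discards weak matches before the best remaining hit is chosen by bit score.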
This example provides newly characterized rice proteins interacting with the rice 14-3-3 protein homolog GF14-c (OsGF14-c) and with Defender Against Apoptotic Death 1 (OsDAD1). An automated, high-throughput yeast two-hybrid assay technology (provided by Myriad Genetics Inc., Salt Lake City, UT) was used to search for protein interactions with the bait proteins OsGF14-c and OsDAD1.
The 14-3-3 proteins (reviewed in Muslin & Xing, 2000) interact with a variety of regulators of cellular signaling, cell cycle, and apoptosis by binding to their partner proteins. The high potential for specific protein-protein interactions makes these proteins suitable for two-hybrid assays. The 14-3-3 proteins are known to participate in protein complexes within the nucleus and are commonly found in the cytoplasm. Studies using yeast two-hybrid assays have also localized GF14 isoforms to the chloroplast stroma and the stromal side of thylakoid membranes (Sehnke et al., 2000). However, the subcellular localization of GF14-c had not been directly assessed to date. Investigation of the protein interactions involving OsGF14-c can lead to the identification of its location within the cell.
OsDAD1 is encoded by the rice homolog of the highly conserved DAD gene, a suppressor of endogenous programmed cell death, or apoptosis, in animals and plants (Apte et al., 1995; Gallois et al., 1997). In support of this role for DAD, expression of a DAD plant homolog has been shown to be down-regulated during flower petal senescence (an example of programmed cell death) and by the plant hormone ethylene, which is associated with a variety of stress responses and developmental processes (Orzaez & Granell, 1997). While these studies have been conducted with DAD homologs from Arabidopsis and pea, the rice DAD1 has not been described. The interaction studies provided in this example are aimed at characterizing this protein.
Results
GF14-c was found to interact with EPSP synthase, an enzyme in the shikimate pathway (OsBAB61062); two enzymes with roles in the Calvin cycle reactions in chloroplasts, a rice chloroplastic aldolase (OsBAA02730) and the chloroplast enzyme RUBISCO (OsRBCL); the RUBISCO activase precursor (OsRCAA1); and two rice photosystem proteins, putative 33 kDa oxygen-evolving protein of photosystem II (OsPN23059) and photosystem II 10 kDa polypeptide (OsAAB46718). Eight additional interactors for GF14-c are novel rice proteins: a photosystem protein (OsPN23061) similar to barley (Hordeum vulgare) photosystem I reaction center subunit II, chloroplast precursor; a protein (OsPN22858) similar to Arabidopsis thaliana GTP cyclohydrolase II, an enzyme involved in the biosynthesis of the B vitamin riboflavin (a cofactor in the shikimate pathway); a protein (OsPN22874) similar to A. thaliana phosphatidylinositol-4-phosphate 5-kinase (PI4P5K), an enzyme involved in signaling events associated with water-stress response in plants; two H+-ATPases, similar to A. thaliana vacuolar ATP synthase subunit C (OsPN22866) and to barley plasma membrane H+-ATPase (OsPN23022); a putative dynamin homolog (OsPN30846) that is likely localized to the chloroplast, as are other plant dynamin family members; and two proteins of unknown function (OsPN29982 and OsPN30974).
OsDAD1 was found to interact with three membrane proteins: rice beta-expansin (OsEXPB2), which is localized to the plasma membrane adjacent to the cell wall; a novel putative phosphate cotransporter (OsPN23053); and the H+-ATPase-like protein OsPN23022 that also interacts with GF14-c.
The proteins that interacted with OsGF14-c (14-3-3 protein homolog GF14-c) and OsDAD1 are listed in Tables 1 and 2, respectively, followed by detailed information on each protein and a discussion of the significance of the interactions. A diagram of the interactions is provided in Figure 1.
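The interactions summarized above can be represented as a simple bait-to-prey mapping, which makes it easy to ask which prey proteins are shared between baits. The Python sketch below is an illustration only: the gene names are those reported in this example, whereas the data structure and function names are hypothetical and are not part of the two-hybrid method itself.

```python
from collections import defaultdict

# Bait -> prey proteins, as reported in the Results of this example.
INTERACTIONS = {
    "OsGF14-c": [
        "OsBAB61062", "OsBAA02730", "OsRBCL", "OsRCAA1", "OsPN23059",
        "OsAAB46718", "OsPN23061", "OsPN22858", "OsPN22874", "OsPN22866",
        "OsPN23022", "OsPN30846", "OsPN29982", "OsPN30974",
    ],
    "OsDAD1": ["OsEXPB2", "OsPN23053", "OsPN23022"],
}


def baits_by_prey(interactions: dict) -> dict:
    """Invert the mapping: prey name -> set of baits it interacts with."""
    index = defaultdict(set)
    for bait, preys in interactions.items():
        for prey in preys:
            index[prey].add(bait)
    return dict(index)


def shared_preys(interactions: dict) -> list:
    """Preys found with more than one bait, in alphabetical order."""
    return sorted(p for p, baits in baits_by_prey(interactions).items()
                  if len(baits) > 1)


shared = shared_preys(INTERACTIONS)
```

With the interactions listed here, the only prey recovered with both baits is OsPN23022, the H+-ATPase-like protein that links the OsGF14-c and OsDAD1 interaction sets.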
The nucleotide and amino acid sequences of the proteins of the Example are provided in SEQ ID NOs: 1-18 and 114-130.
Nine of the proteins identified represent rice proteins previously uncharacterized. Based on their presumed biological function and on the ability of the prey proteins to specifically interact with the bait proteins OsGF14-c and OsDAD1, it was speculated that OsGF14-c is a membrane component. Based on the results described below, OsGF14-c is presumably localized to the thylakoid of rice chloroplasts and to other cellular membranes. The proteins interacting in the thylakoid are part of a novel protein complex and are involved in the photosynthetic processes occurring in the chloroplasts. Knowledge of the role of proteins interacting at the thylakoid in rice could be exploited for the development of herbicides specifically targeted to disrupting the structure and function of the thylakoid membrane. The interactions found in this study also identify OsDAD1 as a likely membrane component in rice, an observation consistent with previous reports on other animal and plant DAD homologs.
Table 1
Interacting Proteins Identified for OsGF14-c (14-3-3 protein homolog GF14-c).
The names of the clones of the proteins used as baits and found as preys are given. Nucleotide/protein sequence accession numbers for the proteins of the Example (or related proteins) are shown in parentheses under the protein name. The bait and prey coordinates (Coord) are the amino acids encoded by the bait fragment(s) used in the search and by the interacting prey clone(s), respectively. The source is the library from which each prey clone was retrieved.
Gene Name | Protein Name (GENBANK® Accession No.) | Bait Coord | Prey Coord (source)
BAIT PROTEIN
OsGF14-c PN12464 (SEQ ID NO: 114) | O. sativa 14-3-3 Protein Homolog GF14-c (U65957) | 1-257# |
INTERACTORS
OsBAB61062 PN22844 (SEQ ID NO: 116) | O. sativa 3-Phosphoshikimate 1-carboxyvinyltransferase (a.k.a. EPSP Synthase) (AB052962; BAB61062.1) | 1-150 | 463-511 (input trait)
OsPN22858 (SEQ ID NO: 2) | Novel Protein 22858, Fragment, similar to Arabidopsis GTP Cyclohydrolase II (BAB09512.1; e=0) | 1-150 | 27-154 (input trait)
OsPN22874 (SEQ ID NO: 4) | Novel Protein 22874, Fragment, similar to Arabidopsis Putative Phosphatidylinositol-4-phosphate 5-kinase (NP_187603.1; 4e-18) | 1-150 | 1-88 (input trait)
OsBAA02730 PN22832 (Contig4280.fasta.Contig1) (SEQ ID NO: 118) | O. sativa Fructose-Bisphosphate Aldolase, Chloroplast Precursor (Q40677) | 1-150 | 206-269 (input trait)
OsRBCL PN23426 (SEQ ID NO: 120) | O. sativa Chloroplast Ribulose Bisphosphate Carboxylase, Large Chain (D00207; P12089) | 1-150 | 287-462 (input trait)
OsRCAA1 PN19842 (SEQ ID NO: 122) | O. sativa Ribulose Bisphosphate Carboxylase/Oxygenase Activase, Large Isoform A1 (AB034698, BAA97583) | 1-150 | 68-210 (input trait)
OsPN22866 (Contig388.fasta.Contig2) (SEQ ID NO: 6) | Novel Protein PN22866, Fragment, Similar to A. thaliana Vacuolar ATP Synthase Subunit C (V-ATPase C subunit) (Vacuolar proton pump C subunit) (Q9SDS7; e-152) | 1-150 | 95-305 (input trait)
OsPN23022$ (SEQ ID NO: 8) | Novel Protein PN23022, Fragment, similar to H. vulgare Plasma Membrane H+-ATPase (CAC50884; e=0.0) | 1-150 | 149-285 (input trait)
OsPN23061 (Contig3864.fasta.Contig1) (SEQ ID NO: 10) | Hypothetical Protein OsContig3864, Similar to H. vulgare Photosystem I Reaction Center Subunit II, Chloroplast Precursor (P36213; 6e-87) | 1-150 | 94-203 (input trait)
OsPN23059 (Contig4331.fasta.Contig1) (SEQ ID NO: 132) | OsContig4331, O. sativa Putative 33 kDa Oxygen-Evolving Protein of Photosystem II (BAB64069) | 1-150 | 193-333; 90-169 (input trait)
OsAAB46718 PN22840 (FLR01_003_H20.g.la.Sp6a TMRI) (SEQ ID NO: 126) | O. sativa Photosystem II 10 kDa Polypeptide (U86018; T04177) | 1-150 | 82-126 (input trait)
OsPN29982 (SEQ ID NO: 12) | Novel Protein PN29982 | 1-150 | 201-300 (input trait)
OsPN30846 (SEQ ID NO: 14) | Novel Protein PN30846 | 1-150 | 1-266 (input trait)
OsPN30974 (SEQ ID NO: 16) | Novel Protein PN30974 | 1-150 | 38-178 (input trait)
NOTE: Interactions of GF14-c with the maize transcription factor Viviparous 1 (ZmVP1) and with Em binding protein (EmBp) are also reported in the literature (Schultz et al., 1998).
# Self-activating clone, i.e., it activates the reporter genes in the two-hybrid system in the absence of a prey protein, and thus it was not used in the search.
$ A prey clone of OsPN23022 also interacts with a clone of Defender Against Apoptotic Death 1 (OsDAD1) used as a bait, and the bait OsDAD1 interacts with Beta-Expansin EXPB2 (OsEXPB2) and with Novel Protein 23053, Fragment, Similar to Arabidopsis Putative Na+-Dependent Inorganic Phosphate Cotransporter (OsPN23053). These interactions are shown in Table 2 below.
Table 2
Interacting Proteins Identified for OsDAD1 (Defender Against Apoptotic Death 1).
Gene Name | Protein Name (GENBANK® Accession No.) | Bait Coord | Prey Coord (source)
BAIT PROTEIN
OsDAD1 PN20251 (SEQ ID NO: 128) | O. sativa Defender Against Apoptotic Death 1 (D89727; BAA24104) | |
INTERACTORS
OsPN23022 (SEQ ID NO: 8) | Novel Protein PN23022, Fragment, similar to H. vulgare Plasma Membrane H+-ATPase (CAC50884; e=0.0) | 30-115 | 37-371 (input trait)
OsPN23053 (SEQ ID NO: 18) | Novel Protein 23053, Fragment, Similar to Arabidopsis Putative Na+-Dependent Inorganic Phosphate Cotransporter (NP_181341.1; e-105) | 30-115 | 2x 1-180 (input trait)
OsEXPB2 PN19902 (SEQ ID NO: 130) | Beta-Expansin EXPB2 (U95968; AAB61710) | 1-115 | 80-207 (input trait)
OsEXPB2 (continued) | | 30-115 | 183-261; 2x 80-218 (input trait)
Two-hybrid system using OsGF14-c as bait
GF14-c (GENBANK® Accession No. U65957) is a 256-amino acid protein that has been reported to interact with site-specific DNA-binding proteins (i.e., basic leucine zipper factor EmBP1) and tissue-specific regulatory factors (i.e., viviparous-1; VP-1; Schultz et al., 1998). It can act to form complexes with EmBP1 and VP-1 to mediate gene expression. The 14-3-3 proteins are found in virtually every eukaryotic organism and tissue and usually consist, in any given organism, of multiple protein isoforms (De Lille et al., 2001). They are thought to act as molecular scaffolds or chaperones and to regulate the cytoplasmic and nuclear localization of proteins with which they interact by regulating their nuclear import/export (Zilliacus et al., 2001; reviewed by Muslin & Xing, 2000). The 14-3-3 proteins bind to a multitude of functionally diverse regulatory proteins involved in cellular signaling pathways, cell cycling, and apoptosis. In plants, proteins under the control of 14-3-3 proteins include starch synthase, Glu synthase, F1 ATP synthase, ascorbate peroxidase, caffeate O-methyl transferase, plasma membrane H+-ATPase, light- and substrate-regulated metabolic enzymes of the nitrogen and carbon assimilation pathways, and factors involved in transcriptional regulation such as the G-box complex and core transcription factors TBP, TFIIB, and EmBP. However, the specific 14-3-3 isoforms required by each of these pathways have not been fully characterized (De Lille et al., supra). The 14-3-3 proteins have previously been detected as participants in protein complexes within the nucleus (Bihn et al., 1997; Imhof & Wolffe, 1999; Zilliacus et al., supra), in the cytoplasm, and in mitochondria (De Lille et al., supra).
Plant 14-3-3 proteins have also been localized to the chloroplast stroma and the stromal side of thylakoid membranes (Sehnke et al., supra). However, subcellular localization of GF14-c has not been directly assessed and thus its location within the cell is yet to be precisely defined.
Analysis of the amino acid sequence of GF14-c identified a cAMP- and cGMP-dependent protein kinase phosphorylation site at amino acids 107 to 110, six protein kinase C phosphorylation sites (amino acids 10 to 12, 29 to 31, 56 to 61, 29 to 31, 59 to 61, and 74 to 76), three casein kinase II phosphorylation sites (amino acids 110 to 113, 120 to 123, and 177 to 180), an N-myristoylation site (amino acids 9 to 14), and two amidation sites (amino acids 77 to 80 and 105 to 108). The bait fragment used in this search encodes amino acids 1 to 150 of GF14-c.
A BLAST analysis comparing the nucleotide sequence of GF14-c against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS009195_at (e-48 expectation value) as the closest match. Gene expression experiments indicated that this gene is not specifically expressed in several different tissue types and is not specifically induced by a broad range of stresses, herbicides, and applied hormones.
The bait protein encoding amino acids 1 to 150 of GF14-c was found to interact with O. sativa 3-phosphoshikimate 1-carboxyvinyltransferase (a.k.a. EPSP Synthase) (OsBAB61062). OsBAB61062 is a 511-amino acid protein that contains an EPSP synthase signature 1 site (amino acids 162 to 176) and an EPSP synthase signature 2 site (amino acids 423 to 441), and it is alanine-rich at the N-terminus. A BLAST analysis of the amino acid sequence of OsBAB61062 determined that this protein is the rice 3-phosphoshikimate 1-carboxyvinyltransferase (also commonly referred to as EPSP synthase) (GENBANK® Accession No. BAB61062.1, 83.9% identity, e = 0.0). This 511-amino acid enzyme is located in the chloroplasts, where it catalyzes an essential step in the shikimate pathway of aromatic amino acid synthesis. Because EPSP synthase is essential to algae, higher plants, bacteria, and fungi, but is not present in mammals, this enzyme is a useful herbicide and antimicrobial target.
A BLAST analysis comparing the nucleotide sequence of EPSP synthase against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS020639.1_at (e-156 expectation value) as the closest match. Gene expression experiments indicated that this gene is induced by jasmonic acid, a plant hormone involved in signal transduction events associated with a plant's stress response, and by M. grisea, the fungus that causes rice blast disease. The gene is repressed under drought conditions.
The bait protein encoding amino acids 1 to 150 of GF14-c was found to interact with protein 22858, a fragment which is similar to A. thaliana GTP cyclohydrolase II (OsPN22858).
This prey clone of OsPN22858 is a 460-amino acid protein fragment with a transmembrane region spanning amino acids 182 to 198 and a possible cleavage site between amino acids 24 and 25, although no N-terminal signal peptide is present. A BLAST analysis of OsPN22858 determined that its amino acid sequence most nearly matches that of GTP cyclohydrolase II; 3,4-dihydroxy-2-butanone-4-phosphate synthase from A. thaliana (GENBANK® Accession No. BAB09512.1, 74.4% identity, e = 0). GTP cyclohydrolase II catalyzes the first committed reaction in the biosynthesis of the B vitamin riboflavin (Ritz et al., 2001).

A BLAST analysis comparing the nucleotide sequence of Novel Protein 22858 against TMRI's GENECHIP® Rice Genome Array sequence database identified OS015318_s_at (5e-10 expectation value) as the closest match. The expectation value is not significant enough for this probeset to be a reliable indicator of the gene expression of this GTP cyclohydrolase.

The bait protein encoding amino acids 1 to 150 of GF14-c was found to interact with Protein 22874, a fragment that is similar to A. thaliana putative phosphatidylinositol-4-phosphate 5-kinase (OsPN22874). A BLAST analysis of OsPN22874 determined that its 89-amino acid sequence most nearly matches that of phosphatidylinositol-4-phosphate 5-kinase (PI4P5K) from A. thaliana (GENBANK® Accession No. NP_187603.1, 65.5% identity, 4e-18). PI4P5K is an enzyme that plays a well-defined role in many signaling events in many species, including the endoplasmic reticulum (ER) stress response in plants (Shank et al., 2001). Animal and yeast PI4P5K phosphorylates phosphatidylinositol-4-phosphate to produce phosphatidylinositol-4,5-bisphosphate, which serves as a precursor of two second messengers, inositol-1,4,5-trisphosphate and diacylglycerol, and as a regulator of many cellular proteins involved in signal transduction and cytoskeletal organization (reviewed in Mikami et al., 1998). Mikami et al.
identified a full-length cDNA clone encoding a PI4P5K protein in A. thaliana whose mRNA expression is induced by treatment of the plant with drought, salt and abscisic acid, suggesting that this protein is involved in water-stress signal transduction (Mikami et al., supra). Elge et al. report that A. thaliana PI4P5K is expressed predominantly in vascular tissues of leaves, flowers and roots, namely in cells of the lateral meristem, i.e., the procambium (Elge et al., 2001).

The bait protein encoding amino acids 1 to 150 of GF14-c was also found to interact with O. sativa fructose-bisphosphate aldolase, a chloroplast precursor (OsBAA02730). OsBAA02730 (GENBANK® Accession No. Q40677) is a 388-amino acid protein that includes a fructose-bisphosphate aldolase class-I active site (amino acids 44 and 388), as determined by analysis of the amino acid sequence (8.5e-22). A BLAST analysis of the amino acid sequence of OsBAA02730 indicated that this protein is the rice fructose-bisphosphate aldolase, chloroplast precursor (GENBANK® Accession No. Q40677). The gene encoding chloroplastic aldolase was isolated along with that encoding the cytoplasmic form of the enzyme (Tsutsumi et al., 1994). The chloroplastic aldolase is encoded at a single locus, while the cytoplasmic form is distributed between three loci on the genome. Aldolases are present in higher plants as two isoforms, the cytosolic and the chloroplastic types. The cytoplasmic form is highly conserved among plants and appears to be regulated through a Ca2+-mediated protein kinase/phosphatase pathway (Nakamura et al., 1996). This enzyme is thought to have a role in the fruit ripening process (Schwab et al., 2001). The chloroplastic enzyme is involved in two major sugar phosphate metabolic pathways of green chloroplasts: the C3 photosynthetic carbon reaction cycle (Calvin cycle) and reactions of the starch biosynthetic pathway.
In both cases, aldolase catalyzes the formation of fructose 1,6-bisphosphate from dihydroxyacetone 3-phosphate and glyceraldehyde 3-phosphate. These topics are reviewed by Michelis et al., 2000, who also identified a 44-kDa heat-induced isoform of the fructose-bisphosphate aldolase in oat chloroplast, confirming its localization to the thylakoid membrane and suggesting that this enzyme is not embedded but rather tends to adhere to the chloroplast membranes. Similar heat-induced thylakoid-associated aldolase homologues were found in other plant species.

A BLAST analysis comparing the nucleotide sequence of the aldolase protein against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS006916.1_at (e-156 expectation value) as the closest match. Our gene expression experiments indicate that this gene is down-regulated by jasmonic acid and drought.

In addition, the bait protein encoding amino acids 1 to 150 of GF14-c was found to interact with O. sativa ribulose bisphosphate carboxylase large chain precursor (RUBISCO Large Subunit; OsRBCL). A BLAST analysis of the amino acid sequence of OsRBCL determined that this protein is the rice chloroplast ribulose bisphosphate carboxylase, large chain precursor (RuBP carboxylase/oxygenase, also called RUBISCO for short; GENBANK® Accession No. P12089). RUBISCO is a 477-amino acid protein present in the chloroplast of higher plants, with an active site at positions 196 to 204. The chloroplast RuBP carboxylase/oxygenase is part of the CO2-fixing multienzyme complexes bound to the thylakoid membrane (Suss et al., 1993) with roles in the Calvin cycle reactions that occur in the stroma of the chloroplast during photosynthesis. The starting and ending compound in the Calvin cycle is the five-carbon sugar ribulose 1,5-bisphosphate (RuBP). As its name indicates, RuBP carboxylase/oxygenase catalyzes two types of reactions that involve RuBP.
In the presence of high carbon dioxide and low oxygen concentrations, the carboxylase activity of RUBISCO is favored and the enzyme catalyzes the initial reaction in the Calvin cycle, the carboxylation of RuBP, leading to the formation of 3-phosphoglyceric acid (PGA). However, in the presence of low carbon dioxide and high oxygen concentrations, oxygen competes with carbon dioxide as a substrate for RUBISCO and the enzyme's oxygenase activity also occurs, resulting in condensation of oxygen with RuBP to form 3-phosphoglycerate and phosphoglycolate. RUBISCO is the world's most abundant enzyme, accounting for as much as 40 percent of total soluble protein in leaves (these topics are discussed in Raven et al., 1999).

A BLAST analysis comparing the nucleotide sequence of the RUBISCO protein against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS000296_s_at (e = 0 expectation value) as the closest match. Gene expression experiments indicated that this gene is down-regulated by BAP, 2,4-D, BL2, jasmonic acid, gibberellin, and abscisic acid. The gene is up-regulated under osmotic stress conditions.

The bait protein encoding amino acids 1 to 150 of GF14-c was found to interact with O. sativa ribulose bisphosphate carboxylase/oxygenase activase, large isoform A1 (OsRCAA1). A BLAST analysis of the amino acid sequence of OsRCAA1 determined that this 466-amino acid protein is the rice RUBISCO activase large isoform precursor (GENBANK® Accession No. BAA97583). It contains two active sites (amino acids 31 to 38 and 156 to 163). RUBISCO activase is an AAA+ (ATPases associated with a variety of cellular activities) protein that facilitates the ATP-dependent removal of sugar phosphates from RUBISCO active sites. This action frees the active site of RUBISCO for spontaneous carbamylation by CO2 and metal binding, prerequisites for activity (reviewed in Salvucci et al., 2001; Salvucci & Ogren, 1996).
The bait protein encoding amino acids 1 to 150 of GF14-c was found to interact with protein PN22866, a fragment similar to A. thaliana vacuolar ATP synthase subunit C (V-ATPase C subunit; vacuolar proton pump C subunit) (OsPN22866). OsPN22866 is a 408-amino acid protein fragment. Its amino acid sequence most nearly matches that of A. thaliana vacuolar ATP synthase subunit C (V-ATPase C subunit; vacuolar proton pump C subunit) (Q9SDS7, 72.7% identity, e-152), as determined by BLAST analysis. The H+-translocating ATPases (H+-ATPase, V-ATPase) are multi-subunit enzymes that function as essential proton pumps in eukaryotes. The catalytic site of human V-ATPase consists of a hexamer of three A subunits and three B subunits that bind and hydrolyze ATP and are regulated by accessory subunits C, D, and E (van Hille et al., 1993).

ATPases are essential cellular energy converters that transduce the chemical energy of ATP hydrolysis into transmembrane ionic electrochemical potential differences. The plant ATPases are present in chloroplasts, mitochondria and vacuoles. In vacuoles, ATPases regulate the contents and volume of the vacuoles, which depend on the coordinated activities of transporters and channels located in the tonoplast (vacuolar membrane). The V-ATPase uses the energy released during cleavage of the phosphate group of cytosolic ATP to pump protons into the vacuolar lumen, thereby creating an electrochemical H+ gradient that is the driving force for transport of ions and metabolites. Thus V-ATPase is important both as a 'house-keeping' enzyme and as a stress response enzyme. Expression of V-ATPase has been shown to be highly regulated depending on metabolic conditions. The V-ATPase consists of several polypeptide subunits that are located in two major domains, a membrane peripheral domain (V1) and a membrane integral domain (V0). Subunit C is a highly hydrophobic protein containing four membrane-spanning domains.
The function of subunit C is unknown, although it is suggested to be directly involved in H+ transport and might be involved in stabilization of V1. The structure, function and regulation of the plant V-ATPase are reviewed in Ratajczak, 2000.

The bait protein encoding amino acids 1 to 150 of GF14-c was also found to interact with protein PN23022, a fragment similar to H. vulgare plasma membrane H+-ATPase (OsPN23022). Protein PN23022 is a 534-amino acid fragment that includes seven transmembrane domains (amino acids 170 to 186, 202 to 218, 226 to 242, 266 to 282, 308 to 324, 337 to 353, and 373 to 389), as predicted by analysis of its amino acid sequence. A BLAST analysis of the amino acid sequence of OsPN23022 determined that this protein is similar to H. vulgare plasma membrane H+-ATPase (GENBANK® Accession No. CAC50884; 88.2% identity, e = 0 expectation value), an enzyme that translocates protons into intracellular organelles or across the plasma membrane of eukaryotic cells. A BLAST analysis comparing the nucleotide sequence of Novel protein PN23022 against TMRI's GENECHIP® Rice Genome Array sequence database identified OS000972_f_at (e-" expectation value) as the closest match. The expectation value is not significant enough for this probeset to be a reliable indicator of the gene expression of this ATPase. OsPN23022 was also found to interact with Defender Against Apoptotic Death 1 (OsDAD1; see Table 22).

The bait protein encoding amino acids 1 to 150 of GF14-c was found to interact with protein OsContig3864, which is similar to H. vulgare photosystem I reaction center subunit II, chloroplast precursor (OsPN23061). Analysis of the OsContig3864 amino acid sequence predicted that it is a 203-amino acid protein containing a possible cleavage site between amino acids 21 and 22, although there appears to be no N-terminal signal peptide.
A BLAST analysis determined that the OsContig3864 clone has an amino acid sequence that most nearly matches that of H. vulgare photosystem I reaction center subunit II, chloroplast precursor (Photosystem I 20 kDa subunit; PSI-D; GENBANK® Accession No. P36213, 80% identity, 3e-86). The photosystems (photosystems I and II) are large multi-subunit protein complexes embedded in the photosynthetic thylakoid membrane. They operate in series and catalyze the primary step in oxygenic photosynthesis, the light-induced charge separation process by which light energy from the sun is used to convert carbon dioxide to carbohydrates in plants and cyanobacteria. Photosystem I catalyzes the light-induced electron transfer from plastocyanin/cytochrome c6 on the lumenal side of the membrane (inside the thylakoids) to ferredoxin/flavodoxin at the stromal side by a chain of electron carriers (reviewed in Fromme et al., 2001).

A BLAST analysis comparing the nucleotide sequence of OsContig3864 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS000721_at (e = 0 expectation value) as the closest match. Gene expression experiments indicated that this gene is not specifically expressed in several different plant tissue types and is not specifically induced by a broad range of stresses, herbicides and applied hormones.

The bait protein encoding amino acids 1 to 150 of GF14-c was also found to interact with OsContig4331, an O. sativa putative 33 kDa oxygen-evolving protein of photosystem II (OsPN23059). The two prey clones retrieved from the input trait library encode amino acids 193 to 333 and 90 to 169 of OsContig4331. These clones are non-overlapping, suggesting that multiple GF14-c-binding sites exist within OsContig4331. Analysis of the OsContig4331 protein sequence predicted that it codes for a 333-amino acid protein.
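The non-overlap of the two prey clones can be checked directly from the reported coordinates. The following is a minimal sketch, assuming the coordinates are inclusive, 1-based amino acid positions; the function name `ranges_overlap` and the variable names are hypothetical and are not part of the described assay.

```python
def ranges_overlap(a, b):
    """Return True if two inclusive (start, end) residue ranges share a residue."""
    return a[0] <= b[1] and b[0] <= a[1]


# Prey clone coordinates reported for OsContig4331 (assumed inclusive, 1-based).
prey_clone_1 = (193, 333)
prey_clone_2 = (90, 169)

# Length of the predicted full-length protein, as stated in the text.
protein_length = 333

# Both clones must lie within the predicted protein.
for start, end in (prey_clone_1, prey_clone_2):
    assert 1 <= start <= end <= protein_length

# The two clones share no residue, so each must carry its own binding region.
clones_overlap = ranges_overlap(prey_clone_1, prey_clone_2)
print(clones_overlap)
```

Because the two ranges are separated by residues 170 to 192, neither clone can owe its interaction to a region present in the other, which is the basis for inferring more than one binding site.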
The analysis also indicated that OsContig4331 contains a possible cleavage site between amino acids 37 and 38, although no N-terminal signal peptide is evident. A BLAST analysis of the OsContig4331 amino acid sequence determined that this protein is the rice putative 33 kDa oxygen-evolving protein of photosystem II (GENBANK® Accession No. BAB64069, 90.6% identity, e-169). Photosystem II uses photooxidation to convert water to molecular oxygen, thereby releasing electrons into the photosynthetic electron transfer chain.
A BLAST analysis comparing the nucleotide sequence of OsContig4331, rice Photosystem I Reaction Center Subunit II Precursor, against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS000372_at (e = 0 expectation value) as the closest match. Our gene expression experiments indicate that this gene is down-regulated during cold stress.

The bait protein encoding amino acids 1 to 150 of GF14-c was also found to interact with O. sativa photosystem II 10 kDa polypeptide (OsAAB46718). OsAAB46718 is a 126-amino acid protein fragment that includes a predicted transmembrane domain (amino acids 102 to 118). A BLAST analysis against the Genpept database revealed that OsAAB46718 is the Oryza sativa photosystem II 10 kDa polypeptide (GENBANK® Accession No. T04177, 91.2% identity, 2e-61).

The bait protein encoding amino acids 1 to 150 of GF14-c was also found to interact with protein PN29982 (OsPN29982). The 300-amino acid sequence of the protein OsPN29982 most nearly matches that of a putative protein of unknown function from A. thaliana (GENBANK® Accession No. NP_196688.1, 47% identity, 3e-54), as determined by BLAST analysis. The second best match was chick LIM/homeobox protein Lhx1 (Homeobox protein LIM-1) (GENBANK® Accession No. P53411, 28% identity, e = 0.002). Based on the homeobox domain, this interaction can be similar to 14-3-3 protein interactions with transcription factors like VP1.

The bait protein encoding amino acids 1 to 150 of GF14-c was also found to interact with protein PN30846 (OsPN30846). A BLAST analysis of protein OsPN30846 determined that its 266-amino acid sequence most nearly matches that of a dynamin homolog from the leguminous plant Astragalus sinicus (GENBANK® Accession No. AAF19398.1, 70.6% identity, 2e-99).
Since the discovery of the GTP-binding dynamin in rat brain, dynamin-like proteins have been isolated from various organisms and tissues and shown to be involved in diverse and seemingly unrelated biological processes. Many different isoforms of dynamin-like proteins have been identified in plant cells, and these plant homologs can be grouped into several subfamilies, such as G68/ADL1, ADL2 and ADL3, based on their amino acid sequence similarity (reviewed in Kim et al., 2001). The biological roles have been characterized for a few of these plant dynamin-like proteins. The dynamin-like protein ADL1 from Arabidopsis has been shown to be localized to and to be involved in biogenesis of the thylakoid membranes of chloroplasts (Park et al., 1998). Another Arabidopsis dynamin-like protein, ADL2, is targeted to the plastid, and its recombinant form expressed in E. coli binds specifically to phosphatidylinositol 4-phosphate through the pleckstrin homology (PH) domain present in ADL2 (Kim et al., supra). Based on the similarity between the biochemical properties of ADL2 and those of dynamin and other related proteins, ADL2 can be involved in vesicle formation at the chloroplast envelope membrane.

The bait protein encoding amino acids 1 to 150 of GF14-c was also found to interact with protein PN30974 (OsPN30974). A BLAST analysis of the novel protein OsPN30974 determined that its 476-amino acid sequence most nearly matches that of an Arabidopsis hypothetical protein of unknown function (GENBANK® Accession No. NP_173623.1, 49% identity, e-137). The next 13 best hits with an expectation value < e-15 are all Arabidopsis or rice proteins of unknown function annotated in the public domain.

Two-hybrid system using OsDAD1 as bait

A second bait protein, namely O. sativa Defender Against Apoptotic Death 1 (OsDAD1), was used to identify interactors. OsDAD1 (GENBANK® Accession No.
BAA24104) is a 114-amino acid protein that includes three predicted transmembrane domains (amino acids 33 to 49, 59 to 75, and 94 to 110). DAD1 is a suppressor of programmed cell death, or apoptosis, a process in which unwanted cells are eliminated during growth and development. DAD1 is a highly conserved protein with homologs identified in animals and plants (Apte et al., 1995; Gallois et al., 1997). Dysfunction and down-regulation of this gene have been linked to programmed cell death in these organisms (Lindholm et al., 2000). DAD1 is an essential subunit of the oligosaccharyltransferase that is located in the ER membrane (Lindholm et al., supra). DAD1 expression declines dramatically upon flower anthesis, disappearing in senescent petals, and is down-regulated by the plant hormone ethylene (Orzaez & Granell, 1997), which is involved in a variety of stress responses and developmental processes including petal senescence (Shibuya et al., 2000), cell elongation, cell fate patterning in the root epidermis, and fruit ripening (Ecker, 1995).

Two clones, encoding amino acids 1-115 and 30-115 of OsDAD1, were used as baits in this Example.

OsDAD1 was found to interact with protein 23053, a fragment which is similar to Arabidopsis putative Na+-dependent inorganic phosphate cotransporter (OsPN23053). OsPN23053 is a protein fragment; however, its available 379-amino acid sequence contains five predicted transmembrane regions (amino acids 100 to 116, 118 to 134, 226 to 242, 259 to 275, and 324 to 340) and a cleavable signal peptide (amino acids 1 to 46). A BLAST analysis determined that OsPN23053 is similar to an Arabidopsis putative Na+-dependent inorganic phosphate cotransporter (GENBANK® Accession No. NP_181341.1, 55.4% identity, e-105).
In mammals, Na+-dependent inorganic phosphate cotransporter is present in neuronal synaptic vesicles and endocrine synaptic-like microvesicles as a vesicular glutamate transporter and is responsible for storage of glutamate, the major excitatory neurotransmitter in the mammalian central nervous system (CNS; Takamori et al., 2000). At least two isoforms of Na+-dependent inorganic phosphate cotransporter exist (Takamori et al., supra; Aihara et al., 2000) and are expressed in pancreas and brain (Hayashi et al., 2001; Fujiyama et al., 2001). OsPN23053 is the first of a family of Na+-dependent inorganic phosphate cotransporters to be discovered in rice. Plants utilize glutamate in important biological processes including protein synthesis and glutamate-mediated signaling (Lacombe et al., 2001). The formation of glutamate from glutamine during nitrogen recycling (Singh et al., 1998) and the control of nitrogen assimilatory pathways by light-signaling (Oliveira et al., 2001) in plants suggest a link between glutamate formation and light-signal transduction.

OsDAD1 was found to interact with beta-expansin EXPB2 (OsEXPB2). A BLAST analysis of the amino acid sequence of OsEXPB2 determined that this protein is rice beta-expansin (GENBANK® Accession No. AAB61710, 99.6% identity, e-156). Expansins promote cell wall extension in plants. Shcherban et al. isolated two cDNA clones from cucumber that encode expansins with signal peptides predicted to direct protein secretion to the cell wall (Shcherban et al., 1995). These authors identified at least four distinct expansin cDNAs in rice and at least six in Arabidopsis from collections of anonymous cDNAs (Expressed Sequence Tags). They determined that expansins are highly conserved in size and sequence and suggest that this multigene family formed before the evolutionary divergence of monocotyledons and dicotyledons.
Their analyses indicate no similarities to known functional domains that might account for the action of expansins on wall extension, though a series of highly conserved tryptophans can mediate expansin binding to cellulose or other glycans.

Summary

The thylakoid membrane of the chloroplasts contains the photosynthetic pigments, reaction centres and electron transport chains associated with photosynthesis. Localization of OsGF14-c to this site is consistent with the interactions of OsGF14-c with the photosystem proteins of this Example. The photosystems (photosystems I and II) are large multi-subunit protein complexes embedded in the thylakoid membrane. As part of a larger group of protein-pigment complexes, the photosynthetic reaction centers, they catalyze the light-induced charge separation associated with photosynthesis. Both photosystems use the energy of photons from sunlight to translocate electrons across the thylakoid membrane via a chain of electron carriers. The electron transfer processes are coupled to a build-up of a difference in proton concentration across the thylakoid membrane. The resulting electrochemical membrane potential drives the synthesis of ATP, which is used to reduce CO2 to carbohydrates in the subsequent dark reactions. OsGF14-c is found to interact with OsContig3864, similar to photosystem I reaction center subunit II, chloroplast precursor; with OsContig4331, the rice putative 33 kDa oxygen-evolving protein of photosystem II; and with rice photosystem II 10 kDa polypeptide. The validity of these interactions is supported by results in a report by Sehnke et al., 2000, in which yeast two-hybrid technology was used to identify an interaction between a plant 14-3-3 protein and another photosystem I subunit protein, A. thaliana photosystem I N-subunit At pPSI-N.
The interactions of OsGF14-c with OsPN23061 (OsContig3864), OsPN23059 (OsContig4331), and OsAAB46718 (photosystem II 10 kDa polypeptide) suggest that OsGF14-c has a role in coupling the physical contact between proteins in or on the periphery of thylakoid membranes.

Given the interactions of OsGF14-c and components of the chloroplast photosystem, some of the other proteins found to interact with OsGF14-c in this study are likely to be localized to the chloroplast as well, and they are possibly co-located to the thylakoid membrane as interaction complexes. For example, OsGF14-c interacts with EPSP synthase (OsBAB61062), a shikimate pathway enzyme located in the chloroplast, where aromatic amino acid synthesis initiates. It is interesting to note that an enzyme in the shikimate pathway requires a flavin as a cofactor (Bornemann et al., Biochemistry 35(30): 9907-9916, 1996) and that OsGF14-c also interacts with OsPN22858, a novel protein fragment similar to A. thaliana GTP cyclohydrolase II. GTP cyclohydrolase II participates in the biosynthesis of the B vitamin riboflavin, which is a cofactor for enzymes functioning in the shikimate pathway. The interactions of these proteins with OsGF14-c can keep key proteins of the shikimate pathway in close proximity in or at the thylakoid. The interactions of OsGF14-c with chloroplastic aldolase (OsBAA02730), an enzyme shown to be localized to the thylakoid membrane and involved in the sugar phosphate metabolic pathway of chloroplasts, and with the Calvin cycle enzyme RUBISCO (OsRBCL) and RUBISCO activase large isoform precursor (OsRCAA1) further support localization of OsGF14-c and these interactors to the thylakoid membrane. Previous reports have identified a fructose-bisphosphate aldolase isoform at the thylakoid membrane in oat chloroplasts (Michelis et al., supra).

In addition, a novel interactor identified for OsGF14-c is a putative dynamin homolog (OsPN30846).
Plant dynamin-like proteins have been localized to the thylakoid and envelope membranes of chloroplasts (Park et al., 1998; Kim et al., 2001). Thus it is likely that this rice dynamin homolog is a membrane protein that resides in the chloroplast. This and the fact that other interactors identified for OsGF14-c are present in the thylakoid of chloroplasts substantiate the notion that the 14-3-3 protein functions as a component of the thylakoid or envelope membrane of chloroplasts. In further support of this hypothesis, a recombinant Arabidopsis dynamin-like protein member of the ADL2 subfamily binds specifically to phosphatidylinositol 4-phosphate. The interactions between dynamins and phosphoinositides documented in the literature (reviewed in Kim et al., supra) are consistent with the concomitant presence of the dynamin-like protein OsPN30846 and the phosphatidylinositol-4-phosphate 5-kinase OsPN22874 (rice PI4P5K), both interacting with OsGF14-c, at the thylakoid. We speculate that the interactors described above are part of a protein complex involved in the photosynthetic processes at the thylakoid membrane.

In addition to components of the chloroplast thylakoid, OsGF14-c was found to interact with proteins similar to a plasma membrane H+-ATPase (OsPN23022) and to a vacuolar ATPase (OsPN22866), which suggests that OsGF14-c is also present in plasma and vacuolar membranes. The interactions of OsGF14-c with the ATPases can represent 14-3-3 regulation of the plant turgor pressure. This hypothesis is corroborated by reports of 14-3-3 proteins accomplishing this function via regulation of at least one form of a plasma membrane H+-ATPase (reviewed in DeLille et al., 2001). The interaction of the vacuolar ATPase with OsGF14-c can occur in the vacuolar membrane, but also in membranes of the ER, Golgi bodies, coated vesicles, and provacuoles.
The biological significance of the interaction of OsGF14-c with the novel protein OsPN22874 (rice PI4P5K) can be defined based on functional homology with A. thaliana PI4P5K, which is induced under water-stress conditions and is expressed in leaves. Given the interaction of OsGF14-c with components of the thylakoid and vacuolar membranes, the rice PI4P5K can be located in the chloroplast but it can also reside at the vacuole, with the vacuolar ATPase. In either case, the rice PI4P5K can direct synthesis of molecules involved in kinase signaling events associated with chloroplast protection or vacuole size regulation under abiotic stress.

Two additional interactors, OsPN29982 and OsPN30974, found for OsGF14-c are proteins of unknown function. Nevertheless, because 14-3-3 proteins act as chaperones, these interactions can represent a process in which the prey proteins achieve proper protein folding, or OsGF14-c can be responsible for proper subcellular localization of OsPN29982 and OsPN30974. Because all other interactors for OsGF14-c appear to be membrane-associated proteins, OsPN29982 and OsPN30974 are likely to be membrane proteins and can reside at the thylakoid or other cellular membrane structures.

In summary, some of the rice proteins found to interact with OsGF14-c appear to be located at the thylakoid membrane where they participate in photosynthetic processes occurring in the chloroplast; these interactions are consistent with previously reported localization of 14-3-3 proteins to the chloroplast stroma and the stromal side of thylakoid membranes (Sehnke et al., 2000). Other interactors identified are associated with the plasma or vacuolar membrane. OsGF14-c is, thus, likely to be a membrane component in rice.
Because 14-3-3 proteins participate in many types of signaling pathways and are thought to act as molecular chaperones necessary for the assembly, unfolding or transport of proteins through membranes, it is likely that OsGF14-c functions as a molecular glue or stabilizer to regulate the function of the proteins with which it interacts at the thylakoid or other membrane structures. The identification of OsGF14-c as a membrane component represents a novel observation and the first functional characterization of the GF14-c protein in rice. In particular, the proteins identified in this Example as interacting at the thylakoid membrane of chloroplasts represent a novel rice protein complex.

Three interactors were identified in this study for OsDAD1. One is the putative plasma membrane H+-ATPase (OsPN23022) that interacts with OsGF14-c. Evidence exists that both OsDAD1 and H+-ATPase are integral membrane proteins (Lindholm et al., 2000; Ratajczak et al., 2000). H+-ATPase translocates protons into intracellular organelles or across the plasma membrane of specialized cells, its activity resulting in acidification of intracellular compartments in eukaryotic cells. The acidic interior of lysosomes has been shown to be necessary for apoptosis under some conditions (Kagedal et al., 2001; Bursch, 2001). Thus, the activities of these two enzymes can be necessary for regulation of programmed cell death, and their physical interaction can represent a step in control of this event. Furthermore, 14-3-3 proteins have been implicated in regulation of many cellular processes including apoptosis (van Hemert et al., 2001). It is possible that the interactions of OsPN23022 with GF14-c and with OsDAD1 represent steps in such regulation.

Another novel interactor found for OsDAD1 is the novel rice Na+-dependent inorganic phosphate cotransporter.
We speculate that the rice phosphate cotransporter is also a membrane protein based on functional homology with its mammalian homologs, which are localized to neuronal and endocrine vesicles and have a role in glutamate storage (Takamori et al., 2000). It is likely that glutamate participates in apoptosis regulation in plants as it does in mammals (Bezzi et al., 2001), and that this occurs in rice through the association of the phosphate cotransporter OsPN23053 with OsDAD1.
Finally, OsDAD1 was found to interact with the rice beta-expansin. Expansins are localized to the plasma membrane adjacent to the cell wall, from which they mediate cell wall extension. Since genes regulating cell death are part of the defense response, this interaction can be associated with structural changes in the cell wall in response to cell death.

The interactions here reported represent the first characterization of the DAD1 protein homolog in rice. Notably, the fact that OsDAD1 and its interactors appear to be membrane proteins and that one of them, OsPN23022, interacts with OsGF14-c lends further support to the notion that OsGF14-c is a membrane component.

Example 11

The rice senescence-associated protein (Os006819-2510) shares 61.4% amino acid sequence similarity with daylily Senescence-Associated Protein 5, a protein encoded by one (DSA5) of six cDNA sequences the levels of which increase during petal senescence. Transcripts of these genes are found predominantly in petals, their expression increases during petal but not leaf senescence, and they are induced by a concentration of abscisic acid (ABA) that causes premature senescence of the petals. Petal senescence is an example of endogenous programmed cell death, or apoptosis, a process in which unwanted cells are eliminated during growth and development. Genes performing a regulatory function in cell death or survival are important to developmental processes. The rice senescence-associated protein Os006819-2510 was chosen as a bait for these interaction studies based on its potential relevance to plant growth and development.

To identify proteins that interacted with the rice senescence-associated protein Os006819-2510, an automated, high-throughput yeast two-hybrid assay technology (provided by Myriad Genetics Inc., Salt Lake City, UT) was employed, as has been described above.
Results

The rice senescence-associated protein Os006819-2510 was found to interact with eight rice proteins. Five interactors are known, namely, the rice histone deacetylase HD1 (OsAAK01712), an enzyme involved in regulation of core histone acetylation; the calcium-binding protein calreticulin precursor (OsCRTC), which also interacts with the starch biosynthetic enzyme soluble starch synthase (OsSSS) and with a novel protein (OsPN29950) of unknown function; low temperature-induced protein 5 (OsLIP5); the dehydrin RAB 16B, which is induced by water stress; and rice putative myosin (OsPN23878), an actin motor protein which also interacts with a putative calmodulin-kinase that is associated with a network of proteins involved in cell cycle regulation (see Examples I and II). Three interactors for senescence-associated protein are novel proteins including a putative callose synthase (OsPN23226), an enzyme involved in the biosynthesis of the glucan callose; a protein similar to barley coproporphyrinogen III oxidase, chloroplast precursor, an enzyme of the chlorophyll biosynthetic pathway (OsPN23485); and a protein similar to Arabidopsis Gamma Hydroxybutyrate Dehydrogenase.

The interacting proteins of this Example are listed in Tables 3-5, followed by detailed information on each protein and a discussion of the significance of the interactions. The nucleotide and amino acid sequences of the proteins of the Example are provided in SEQ ID NOs: 19-30 and 131-138.

Note that several prey proteins identified are, like the bait protein Os006819-2510, membrane-associated molecules (OsCRTC, OsPN23226, OsLIP5). Several appear to be associated with cell cycle processes in rice (OsPN23878, Os003118-3674, OsCRTC, OsSSS, OsPN23226, OsAAK01712), while others are involved in the plant stress response (OsRAB16B, OsLIP5, OsCRTC). Some of the proteins identified represent rice proteins previously uncharacterized.
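The functional groupings just listed overlap, and the overlap is what motivates treating the bait as a possible link between processes. As an illustrative sketch only (the dictionary layout and the function names `categories_by_protein` and `multi_category` are assumptions made for this example, not part of the disclosed method), the groupings can be tabulated and the proteins named in more than one group picked out:

```python
# Category membership exactly as listed in the text above.
# The data layout and helper names are hypothetical, for illustration only.
CATEGORIES = {
    "membrane-associated": {"OsCRTC", "OsPN23226", "OsLIP5"},
    "cell cycle": {
        "OsPN23878", "Os003118-3674", "OsCRTC",
        "OsSSS", "OsPN23226", "OsAAK01712",
    },
    "stress response": {"OsRAB16B", "OsLIP5", "OsCRTC"},
}


def categories_by_protein(categories):
    """Invert the mapping: protein name -> sorted list of its categories."""
    inverted = {}
    for category, proteins in categories.items():
        for protein in proteins:
            inverted.setdefault(protein, []).append(category)
    return {protein: sorted(cats) for protein, cats in inverted.items()}


def multi_category(categories):
    """Return only the proteins that appear in more than one category."""
    inverted = categories_by_protein(categories)
    return {p: cats for p, cats in inverted.items() if len(cats) > 1}


if __name__ == "__main__":
    for protein, cats in sorted(multi_category(CATEGORIES).items()):
        print(protein, "->", ", ".join(cats))
```

Under these groupings, OsCRTC is the only protein named in all three categories, while OsPN23226 and OsLIP5 each appear in two.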
Based on the presumed biological function of the prey proteins and on their ability to specifically interact with the bait protein Os006819-2510, Os006819-2510 is speculated to be involved in cell cycle/mitotic processes and in plant resistance to stress, and can actually represent a link between these processes in rice.

Proteins that participate in cell cycle regulation in rice can be targets for genetic manipulation or for compounds that modify their level or activity, thereby modulating the plant cell cycle. The identification of genes encoding these proteins can allow genetic manipulation of crops or application of compounds to effect agronomically desirable changes in plant development or growth. Likewise, genes that are involved in conferring plants resistance to stress have important commercial applications, as they could be used to facilitate the generation and yield of crops.

Table 3
Interacting Proteins Identified for Os006819-2510 (Hypothetical Protein 006819-2510, Similar to Hemerocallis Senescence-Related Protein 5).

The names of the clones of the proteins used as baits and found as preys are given. Nucleotide/protein sequence accession numbers for the proteins of the Example (or related proteins) are shown in parentheses under the protein name. The bait and prey coordinates (Coord) are the amino acids encoded by the bait fragment(s) used in the search and by the interacting prey clone(s), respectively. The source is the library from which each prey clone was retrieved.

Gene Name | Protein Name (GENBANK® Accession No.) | Bait Coord | Prey Coord (source)
BAIT PROTEIN
Os006819-2510, PN20462 (SEQ ID NO: 20) | Hypothetical Protein 006819-2510, Similar to Senescence-Related Protein 5 from Hemerocallis Hybrid Cultivar (AAC34855.1; e-97) | |
INTERACTORS
OsAAK01712, PN24059 (SEQ ID NO: 132) | O. sativa Histone Deacetylase HD1 (AF332875; AAK01712.1) | 1-150 | 90-221 (output trait)
OsCRTC*, PN20544 (SEQ ID NO: 134) | O. sativa Calreticulin Precursor (AB021259; BAA88900) | 1-273 | 283-301 (output trait)
OsLIP5, PN22883 (SEQ ID NO: 136) | Oryza sativa Low Temperature-Induced Protein 5 (AB011368; BAA24979.1) | 1-150 | 29-60 (input trait)
OsPN23878# (SEQ ID NO: 138) | Oryza sativa Putative Myosin (AC090120; AAL31066.1) | 1-150 | 685-888 (output trait)
OsRAB16B, PN20554 (SEQ ID NO: 140) | O. sativa DEHYDRIN RAB 16B (P22911) | 1-273 | 147-164 (output trait)
OsPN23226 (SEQ ID NO: 22) | Novel Protein PN23226, Callose synthase | 1-273 | 345-432 (output trait)
OsPN23485 (SEQ ID NO: 24) | Novel Protein PN23485, Similar to Hordeum vulgare Coproporphyrinogen III Oxidase, chloroplast precursor (Q42840; e-169) | 1-273 | 90-243 (output trait)
OsPN29037 (SEQ ID NO: 26) | Novel Protein PN29037 | 1-150 | 73-165 (input trait)

* Additional interactions identified for OsCRTC are listed in Table 4
# Additional interactions identified for OsPN23878 are listed in Table 5

Table 4

Gene Name | Protein Name (GENBANK® Accession No.) | Bait Coord | Prey Coord (source)
BAIT PROTEIN
OsCRTC, PN20544 (SEQ ID NO: 134) | Calreticulin Precursor (AB021259; BAA88900) | |
INTERACTORS
OsPN29950 (SEQ ID NO: 28) | Novel Protein PN29950 | 1-150 | 7-103; 2x 138-343; 50-343 (output trait)
OsSSS, PN19701 (SEQ ID NO: 142) | Soluble Starch Synthase (AF165890; AAD49850) | 250-425 | 68-270 (input trait); 97-263 (output trait)

Table 5

Gene Name | Protein Name (GENBANK® Accession No.) | Bait Coord | Prey Coord (source)
PREY PROTEIN
OsPN23878 (SEQ ID NO: 138) | Oryza sativa Putative Myosin (AC090120; AAL31066.1) | |
BAIT PROTEIN
Os003118-3674, PN20551 (SEQ ID NO: 30) | Hypothetical Protein 003118-3674, Similar to Lycopersicon esculentum Calmodulin | 75-149 | 824-935 (output trait)

Os006819-2510 is a 276-amino acid protein that includes a cleavable signal peptide (amino acids 1 to 27) and three transmembrane domains (amino acids 48 to 64, 82 to 98, and 233 to 249), as predicted by analysis of its amino acid sequence. The analysis also predicted two endoplasmic reticulum retention motifs, one N-terminal (AFRL) and the other C-terminal (KGGY), and a prokaryotic membrane lipoprotein lipid attachment site beginning with amino acid 57 (Prosite). This site, when functional, is a region of protein processing. Analysis by Pfam also identified a transmembrane superfamily domain, also called a tetraspanin family domain, typically found in a group of eukaryotic cell surface antigens that are evolutionarily related and include transmembrane domains.

A BLAST analysis against the Genpept database indicated that Os006819-2510 is similar to Senescence-Associated Protein 5 from Hemerocallis hybrid cultivar (daylily; GENBANK® Accession No. AAC34855.1; 61.4% identity; e-97). In agreement with this result, the protein with the amino acid sequence most similar (63% identity) to that of Os006819-2510 in Myriad's proprietary database is Hypothetical Protein 005991-3479, Similar to Hemerocallis Senescence-Associated Protein 5 (Os005991-3479). In an effort to identify the components of the genetic program that leads daylily petals to senescence and cell death ca. 24 hours after the flower opens, the cDNA encoding senescence-associated protein 5 in petals was isolated as one of six cDNAs (designated DSA3, 4, 5, 6, 12 and 15) whose levels increase during petal senescence (Panavas et al., 1999).
However, no sequence homology was identified in the public database for the DSA5 gene product, which remains as yet unidentified.
The levels of DSA mRNAs in leaves were determined to be less than 4% of the maximum detected in petals, with no differences between younger and older leaves, and the DSA genes (except DSA12) are expressed at low levels in daylily roots and (except DSA4) induced by a concentration of abscisic acid that causes premature senescence of the petals.

Two bait fragments, encoding amino acids 1-273 and 1-150 of Os006819-2510, were used in the yeast two-hybrid screen.

A bait fragment encoding amino acids 1-150 of Os006819-2510 was found to interact with O. sativa histone deacetylase HD1 (OsAAK01712). A BLAST analysis of the amino acid sequence of OsAAK01712 indicated that this prey protein is the rice Histone Deacetylase HD1 (GENBANK® Accession No. AAK01712.1, 100% identity, e = 0.0). Histone deacetylase (HD) enzymes have been isolated from plants, fungi and animals (reviewed by Lechner et al., 1996). The enzymatic activity of histone deacetylase and that of histone acetyltransferase maintain the enzymatic equilibrium of reversible core histone acetylation. Core histones are a group of highly conserved nuclear proteins in eukaryotic cells; they represent the main component of chromatin, the DNA-protein complex in which chromosomal DNA is organized. Besides their role in chromatin structural organization, core histones participate in gene regulation, their regulatory function being ascribed to their ability to undergo reversible posttranslational modifications such as acetylation, phosphorylation, glycosylation, ADP-ribosylation, and ubiquitination. Histone deacetylase exists as multiple enzyme forms, and this multiplicity reflects the complex regulation of core histone acetylation. Four nuclear HDs have been identified and characterized from germinating maize embryos (HD1-A, HD1-BI, HD1-BII, and HD2), based on their expression during germination, molecular weight, physicochemical properties and inhibition by various compounds.
Based on these data, Lechner et al., supra, suggest that HD enzymes have a role in establishing and maintaining histone-protein interactions, and that acetylation can modulate the binding of proteins with anionic domains to certain chromatin areas.
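The two bait fragments (amino acids 1-273 and 1-150) differ in how much of the predicted Os006819-2510 structure they carry. The following is a minimal sketch, assuming that a predicted feature counts as present only when it lies entirely within the fragment; the names `Feature`, `FEATURES`, `BAITS`, and `features_in_fragment` are hypothetical and introduced here only for illustration, while the coordinates are those stated in the text.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    name: str
    start: int  # first residue, 1-based, inclusive
    end: int    # last residue, 1-based, inclusive


# Predicted features of the 276-amino acid bait, as given in the text.
FEATURES = [
    Feature("signal peptide", 1, 27),
    Feature("transmembrane domain 1", 48, 64),
    Feature("transmembrane domain 2", 82, 98),
    Feature("transmembrane domain 3", 233, 249),
]

# The two bait fragments used in the yeast two-hybrid screen.
BAITS = {"1-273": (1, 273), "1-150": (1, 150)}


def features_in_fragment(features, start, end):
    """Names of features fully contained in the fragment [start, end].

    Full containment is an assumption of this sketch; a partial-overlap
    criterion would be an equally defensible choice.
    """
    return [f.name for f in features if start <= f.start and f.end <= end]


if __name__ == "__main__":
    for label, (start, end) in BAITS.items():
        print(label, features_in_fragment(FEATURES, start, end))
```

With these coordinates the 1-150 fragment carries the signal peptide and the first two predicted transmembrane domains but not the third, whereas the 1-273 fragment carries all four features.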
Os006819-2510 was found to interact with O. sativa Calreticulin Precursor (OsCRTC). A BLAST analysis of the amino acid sequence of the prey clone OsCRTC indicated that this protein is the rice Calreticulin Precursor (GENBANK® Accession No. BAA88900/SwissProt #Q9SLY8, 100% identity, e=0.0). OsCRTC is a 424-amino acid protein with a cleavable signal peptide (amino acids 1 to 29), a calreticulin family repeat motif (amino acids 218 to 230), and an endoplasmic reticulum targeting sequence (amino acids 421 to 424), as predicted by analysis of the OsCRTC amino acid sequence (see Munro & Pelham, 1987; Pelham, 1990). In agreement with its designation as a calreticulin precursor, the analysis identified a calreticulin family signature (amino acids 31 to 343, 1.3e-166; see Michalak et al., 1992; Bergeron et al., 1994; Watanabe et al., 1994). The analysis also predicted a transmembrane domain (amino acids 7 to 29) and a coiled coil (amino acids 360 to 389). The cDNA encoding the rice calreticulin OsCRTC was first identified by Li & Komatsu, who found this gene to be involved in the regeneration of rice cultured suspension cells. These authors report that the rice calreticulin protein is highly conserved, showing high homology (70-93%) to other plant calreticulins, but only 50-53% homology to mammalian calreticulins. Calreticulin (CRT) is an endoplasmic reticulum (ER) calcium-binding protein thought to be involved in many functions in eukaryotic cells, including Ca2+ signaling, regulation of intracellular Ca2+ storage and store-operated Ca2+ fluxes through the plasma membrane, modulation of endoplasmic reticulum Ca2+-ATPase function, chaperone activity to promote protein folding, control of cell adhesion, gene expression, and apoptosis (reviewed by Michalak et al., 1998 and by Persson et al.).
In plants, CRT has been localized to the endoplasmic reticulum, Golgi, plasmodesmata, and plasma membrane (Borisjuk et al., 1998; Hassan et al., 1995; Baluska et al., 2001), and it has been shown to affect cellular calcium homeostasis, as reported by Persson et al., supra. This study shows that induction of calreticulin expression in transgenic tobacco and Arabidopsis plants enhances the ATP-dependent Ca2+ accumulation of the endoplasmic reticulum, and that this CRT-mediated alteration of the ER Ca2+ pool regulates ER-derived Ca2+ signals. These results demonstrate that CRT plays a key role as a regulator of calcium storage in the ER, and that the ER, in addition to the vacuole, is an important Ca2+ store in plant cells. A role for the Arabidopsis calreticulin homolog in anther maturation or dehiscence has also been proposed (Nelson et al., 1997) based on localization of this protein in anthers which are degenerating at the time of maximum CRT expression. Furthermore, the tobacco homolog of mammalian CRTC participates in protein-protein interactions in a stress- and ATP-dependent fashion (Denecke et al., 1995). This notion supports the use of the yeast two-hybrid technology to identify proteins that interact with OsCRTC.

OsCRTC was also used as bait and found to interact with rice Soluble Starch Synthase (OsSSS; see Table 24) and Novel Protein PN29950 (OsPN29950). OsSSS is the rice homolog of soluble starch synthase (SSS), one of the three enzymes involved in starch biosynthesis in plants. Starch is the major component of yield in the world's main crop plants and one of the most important products synthesized by plants that is used in industrial processes. It consists of two kinds of glucose polymers: highly branched amylopectin and relatively unbranched amylose. Starch synthase contributes to the synthesis of amylopectin. The enzyme utilizes the glucosyl donor ADPGlc to add glucosyl units to the nonreducing end of a glucan chain through α(1→4) linkages, thus elongating the linear chains (reviewed by Cao et al., 2000; Kossman & Lloyd, 2000). Distinct classes of isoforms of starch synthase were defined on the basis of similarity in amino acid sequence, molecular mass, and antigenic properties.
Plant organs vary greatly in the classes they possess and in the relative contribution of the classes to soluble starch synthase activity (Smith et al., 1997 cited in Cao et al., supra). OsPN29950 is a protein of unknown function determined by BLAST analysis to be similar to a putative protein from Arabidopsis thaliana (GENBANK® Accession No. NP_199037.1, 32% identity, 2e-29).
Os006819-2510 was found to interact with low temperature-induced protein 5 (OsLIP5). OsLIP5 is a 276-amino acid protein with a cleavable signal peptide (amino acids 1 to 27) and three putative transmembrane regions (amino acids 48 to 64, 82 to 98, and 233 to 249). A BLAST analysis of the amino acid sequence of this prey clone determined that it is the rice LIP5 protein (GENBANK® Accession No. BAA24979.1, 100% identity, 8e-052). The rice LIP5 protein is a direct submission to the public database and is not described in the literature. In yeast, LIP5 is involved in lipoic acid metabolism (Sulo & Martin, 1993). The BLAST analysis shows that the rice LIP5-like protein OsLIP5 is also similar to rice WS1724 (GENBANK® Accession No. T07613, 98% identity, 3e-05), a protein encoded by one of nine cDNAs induced by short-term water stress and thought to be responsible for acquired resistance to chilling in a chilling-sensitive variety of rice (Takahashi et al., 1994). Among the proteins encoded by these cDNAs, which were found to be differentially expressed following water stress, expression of the WS1724 protein remained relatively fixed. A BLAST analysis comparing the nucleotide sequence of OsLIP5 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS000070_r-at (e=4e-75) as the closest match. Gene expression experiments indicated that this gene is down-regulated by the herbicide BL2.

Os006819-2510 was also found to interact with Oryza sativa putative myosin (OsPN23878). A BLAST analysis of the amino acid sequence of OsPN23878 indicated that this prey protein is the rice putative myosin (GENBANK® Accession No. AAL31066.1, 99% identity, e=0.0). OsPN23878 is also similar to Myosin VIII, ZMM3 - maize (fragment) from Z. mays (GENBANK® Accession No. A59311, 89% identity, e=0.0). Myosins are discussed in Example I.
Based on current knowledge of plant myosins, the myosin VIII prey protein OsPN23878 can be a cytoskeletal component that participates in events relating to cytokinesis.

The prey protein OsPN23878 also interacts with hypothetical protein 003118-3674, which is similar to Lycopersicon esculentum Calmodulin (Os003118-3674; see Table 25). Os003118-3674 is a 148-amino acid protein with two EF-hand calcium-binding domains (amino acids 22 to 34 and 93 to 105). In agreement with the observation that Os003118-3674 includes EF-hand calcium-binding domains, a BLAST analysis of the Genpept database indicated that this protein shares 72% identity with A. thaliana putative calmodulin (GENBANK® Accession No. NP_1764705, e-57), although the top hit in this search is A. thaliana putative serine/threonine kinase (GENBANK® Accession No. NP_172695.1, 76% identity, 7e-60). Therefore, the possibility that this calmodulin-like protein possesses kinase activity is worth consideration.

A BLAST analysis comparing the nucleotide sequence of OsPN23878 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS002190_lat (e-165) as the closest match. Our gene expression experiments indicate that this gene is not specifically induced under a range of given conditions.

Additionally, Os006819-2510 was found to interact with OsRAB16B (OsRAB16B), a 164-amino acid protein that has a possible cleavage site between amino acids 51 and 52, although it does not appear to have a cleavable signal peptide. Analysis of its amino acid sequence predicted (2.6e-81) this protein to be a member of a group of plant proteins called dehydrins, which are induced in plants by water stress (see Close et al., 1989; Robertson & Chandler, 1992; Dure et al., 1989). Dehydrins include the basic, glycine-rich RAB (responsive to abscisic acid) proteins.
In agreement with this notion, the analysis indicated that OsRAB16B is a basic, glycine-rich protein. A BLAST analysis against the public database revealed that OsRAB16B is the rice DEHYDRIN RAB 16B (GENBANK® Accession No. P22911, 100% identity, 4e-9). The cDNA encoding this protein was isolated by Yamaguchi-Shinozaki et al. (1990) as one of four rice RAB genes that are differentially expressed in rice tissues. In agreement with the notion that OsRAB16B is a rice RAB protein, a BLAST analysis against Myriad's proprietary database indicated that OsRAB16B shares 57% identity with OsRAB25. While expression data for OsRAB16B are not available, the rice RAB16B promoter contains two abscisic acid (ABA)-responsive elements required for ABA induction (Ono et al., 1996). Among other rice RAB proteins, the RAB16A gene has been linked to salt stress (Saijo et al., 2001), and the activity of the RAB16A promoter is also induced by ABA and by osmotic stresses in various tissues of vegetative and floral organs (Ono et al., supra). Another rice RAB protein, RAB21, is induced in rice embryos, leaves, roots and callus-derived suspension cells treated with NaCl and/or ABA (Mundy & Chua, 1988). Based on these data, it is likely that the OsRAB16B prey protein has a role in the stress response.

Os006819-2510 was found to interact with protein PN23226 (OsPN23226). A BLAST analysis against the public database indicated that OsPN23226 is similar to putative glucan synthase (GENBANK® Accession No. NP_563743.1, 78% identity, e=0.0) and to callose synthase 1 catalytic subunit (GENBANK® Accession No. NP_563743.1, 78% identity, e=0.0) from A. thaliana. Callose synthase (CalS) from higher plants is a multisubunit membrane-associated enzyme involved in callose synthesis (reviewed in Hong et al., 2001). Callose is a linear 1,3-β-glucan with some 1,6-branches and differs from cellulose, the major component of the plant cell wall.
Callose is synthesized on the forming cell plate and several other locations in the plant, and its deposition at the cell plate precedes the synthesis of cellulose. Callose synthesis can also be induced by wounding, pathogen infection, and physiological stress. The activity of callose synthase is highly regulated during plant development and can be affected by various biotic and abiotic factors. CalS, like cellulose synthase, is a large transmembrane protein. Its structure includes a large hydrophilic loop that is relatively conserved among the CalS isoforms, a less conserved, long N-terminal segment, and a short C-terminal segment, all located on the cytoplasmic side. The central loop is thought to act as a receptacle to hold other proteins that are essential for CalS catalytic activity (see below); the N-terminal segment can contain subdomains for interaction with proteins that regulate 1,3-β-glucan synthase activity.

The cDNA encoding the callose synthase (CalS1) catalytic subunit from Arabidopsis was identified by Hong et al., supra, who demonstrated that higher plants encode multiple forms of CalS enzymes and that the Arabidopsis CalS1 is a cell plate-specific isoform. In addition, these authors used yeast two-hybrid and in vitro experiments to show that CalS1 interacts with two other cell plate-specific proteins, phragmoplastin and a UDP-glucose transferase, and suggest that it can form a large complex with these and other proteins to facilitate callose deposition on the cell plate. Moreover, the plasma membrane CalS is strictly Ca2+-dependent, and Ca2+ plays a key role in cell plate formation and can activate the cell plate-specific CalS1. The prey protein OsPN23226 is likely a rice callose synthase homolog that can function similarly to the Arabidopsis CalS1 catalytic subunit.
In addition to the cell plate, callose is synthesized in a variety of specialized tissues and in response to mechanical and physiological stresses. Multiple CalS isozymes are thought to be required in higher plants to catalyze callose synthesis in different locations and in response to different physiological and developmental signals (Hong et al., supra).

Os006819-2510 was also found to interact with protein PN23485, which is similar to Hordeum vulgare coproporphyrinogen III oxidase, chloroplast precursor (OsPN23485). A BLAST analysis of the amino acid sequence of OsPN23485 determined that this protein is similar to barley (H. vulgare) Coproporphyrinogen III Oxidase, Chloroplast Precursor (coprogen oxidase) (GENBANK® Accession No. Q42840, 89.3% identity, e-169). Coproporphyrinogen III oxidase (CPO) catalyzes a step in the pathway from 5-amino-levulinate to protoporphyrin IX, a common reaction in the biosynthesis of heme in animals and chlorophyll in photosynthetic organisms. The N-terminal sequences of plant CPOs are characteristic of plastid transit peptides. CPO is exclusively located in the stroma of plastids, and in vitro transcribed and translated CPO is imported into the stroma of pea plastids and truncated by a stromal endopeptidase (reviewed by Ishikawa et al., 2001). Plant cDNA sequences encoding CPO were obtained from soybean, tobacco and barley (Kruse et al., 1995). They found that the plant coprogen oxidase mRNA was expressed to different extents in various tissues, with maximum amounts in developing cells and drastically decreased amounts in completely differentiated cells, suggesting differing requirements for tetrapyrroles in different organs.
Based on these results, these authors propose that enzymes involved in tetrapyrrole (porphyrin) synthesis are regulated developmentally rather than by light, and that regulation of these enzymes guarantees a constant flux of metabolic intermediates and helps avoid photodynamic damage by accumulating porphyrins. Inhibition of the pathway for chlorophyll synthesis causes lesion formation such as that found in the pale green and lesion-formation phenotype of lin2 plants. Ishikawa et al., supra, found that a deficiency of coproporphyrinogen III oxidase causes lesion formation in these Arabidopsis mutants. Furthermore, based on the observation that transgenic tobacco plants with reduced CPO activity accumulate photosensitizing tetrapyrrole intermediates and exhibit antioxidative responses and necrotic leaf lesions, these authors suggest that CPO inhibition causes lesion formation leading to induction of a set of defense responses that resemble the HR observed after pathogen attack. These lesions are the equivalent of diseases known as porphyrias in humans. If accumulated, coproporphyrin(ogen), as a photosensitizer, induces damage through generation of reactive oxidative species, which play a key role in the initiation of cell death and lesion formation both in the HR and in certain lesion mimic mutants. They suggest that in lin2 mutants, the generation of an oxidative burst triggered by coproporphyrin accumulation leads to cell death.

Os006819-2510 was found to interact with protein PN29037 (OsPN29037). A BLAST analysis of the amino acid sequence of OsPN29037 indicated that this prey protein is similar to Gamma Hydroxybutyrate Dehydrogenase from A. thaliana (GENBANK® Accession No. AAK94781.1, 80.7% identity, e-127). This enzyme oxidizes gamma-hydroxybutyrate.
As a minor brain metabolite directly or indirectly involved in scavenging oxygen-derived free radicals in animals, gamma-hydroxybutyrate demonstrates similarities with melatonin (Cash, 1996).

Summary

Thus, the senescence-associated protein Os006819-2510 interacts with several proteins that have possible roles in cell cycle processes. One of these is OsPN23878, a protein annotated in the public domain as the rice putative myosin. Myosins are cytoskeletal proteins that function as molecular motors in ATP-dependent interactions with actin filaments in various cellular events. Based on the similarity of the prey protein to a class VIII myosin and on the reported role of plant myosin VIII in maturation of the cell plate and in organization of the actin cytoskeleton at cytokinesis, we speculate that the myosin OsPN23878 is a cytoskeletal component that participates in events occurring at cytokinesis in rice. The association of the myosin OsPN23878 with senescence-associated protein can be a step in cell-cycle-dependent events involving cytoskeleton organization and senescence. Specific expression of the gene encoding OsPN23878 in panicle (our gene expression experiments) is consistent with an interaction between this protein and Os006819-2510, and with a role for the latter in flower senescence, as suggested for the gene encoding the daylily homolog of this protein (Panavas et al., 1999). Localization of senescence-associated protein to the ER suggests that some of the events in which OsPN23878 functions could be associated with plasmodesmata function.

Note that the myosin protein OsPN23878 also interacts with a novel calmodulin-kinase-like protein Os003118-3674 (see Table 25), and that the latter interacts with a myosin heavy chain (OsAAK98715) found to interact with rice cyclin OsCYCOS2 and presumed to be involved in cytoskeleton organization during mitotic events.
The interactions of myosins with a calcium-binding calmodulin-like protein are consistent with published evidence of regulation of myosin function by calcium (Yokota et al., 1999, reviewed in Reddy, 2001). The possibility that Os003118-3674 possesses kinase activity raises the probability that these interactions propagate a cell cycle-related signaling event. The calmodulin-like protein Os003118-3674 thus provides a link between the senescence-associated protein and interacting partners of this Example and the cell cycle network.

Another interactor with a possible role in cell cycle regulation is the rice histone deacetylase OsAAK01712. This enzyme includes a transmembrane domain and is involved in regulation of core histone acetylation. The acetylation/deacetylation of histones, the main protein component of chromatin, is connected to replication during the cell cycle in plants, as it is in other eukaryotes (Jasencakova et al., 2001). Thus, the Os006819-2510-OsAAK01712 interaction likely participates in mitotic events involving chromatin organization.

Another novel interactor found for senescence-associated protein is OsPN23485, similar to coproporphyrinogen III oxidase, chloroplast precursor, an enzyme of the pathway leading to the biosynthesis of chlorophyll in plants. The observation that the lesion formation in the lin2 mutant Arabidopsis plants is the result of loss-of-function of CPO (Ishikawa et al., 2001) links the gene encoding CPO to regulation of cell death pathways. Moreover, plant CPO enzymes are regulated developmentally and by light (reviewed by Ishikawa et al., supra). Based on these reports, the interaction of rice CPO (OsPN23485) with senescence-associated protein can participate in regulation of programmed cell death in a development-dependent manner in rice.
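The chain of interactions that connects the bait to the cell cycle network can be made explicit by treating the reported interactions as a graph. The sketch below is illustrative only: it assumes each bait-prey pair can be read as an undirected edge (a yeast two-hybrid result is strictly directional), takes its pairs from Tables 3-5 and from the summary text, and uses the hypothetical names `INTERACTIONS`, `build_graph`, and `shortest_path`.

```python
from collections import deque

# Interaction pairs from Tables 3-5 and the summary text.
# Treating them as undirected edges is an assumption of this sketch.
INTERACTIONS = [
    ("Os006819-2510", "OsAAK01712"),
    ("Os006819-2510", "OsCRTC"),
    ("Os006819-2510", "OsLIP5"),
    ("Os006819-2510", "OsPN23878"),
    ("Os006819-2510", "OsRAB16B"),
    ("Os006819-2510", "OsPN23226"),
    ("Os006819-2510", "OsPN23485"),
    ("Os006819-2510", "OsPN29037"),
    ("OsCRTC", "OsPN29950"),
    ("OsCRTC", "OsSSS"),
    ("OsPN23878", "Os003118-3674"),
    ("Os003118-3674", "OsAAK98715"),
    ("OsAAK98715", "OsCYCOS2"),
]


def build_graph(pairs):
    """Build an undirected adjacency map from a list of (a, b) pairs."""
    graph = {}
    for a, b in pairs:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def shortest_path(graph, source, target):
    """Breadth-first search; returns a list of nodes, or None if unreachable."""
    if source not in graph or target not in graph:
        return None
    queue = deque([[source]])
    seen = {source}
    while queue:
        path = queue.popleft()
        node = path[-1]
        if node == target:
            return path
        for neighbour in sorted(graph[node]):
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append(path + [neighbour])
    return None


if __name__ == "__main__":
    graph = build_graph(INTERACTIONS)
    print(" - ".join(shortest_path(graph, "Os006819-2510", "OsCYCOS2")))
```

In this representation the only route from the bait to the cyclin OsCYCOS2 passes through the myosin OsPN23878 and the calmodulin-like protein Os003118-3674, which is the linking role the text attributes to the latter.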
The senescence-associated protein Os006819-2510, which is presumed to be a transmembrane protein based on analysis of its amino acid sequence, interacts with the rice calreticulin OsCRTC which, like other plant calreticulins, is likely an ER transmembrane protein. The presence of two endoplasmic reticulum retention motifs in Os006819-2510 and of an endoplasmic reticulum targeting sequence in OsCRTC suggests that both proteins are localized in the ER. This notion is in agreement with the possibility of an interaction between Os006819-2510 and OsCRTC in planta. Os006819-2510 can participate in events controlled by OsCRTC within the endoplasmic reticulum. This interaction is consistent with the suggested role of plant CRT in anther maturation and dehiscence, which was proposed by Nelson et al., 1997 based on the observation that maximum expression of the Arabidopsis CRT in the anthers coincides with anther degeneration. Moreover, Denecke et al., 1995 reported detection of another plant CRT homolog in the nuclear envelope, in the ER, and in mitotic cells in association with the spindle apparatus and the phragmoplast. Given the interaction of senescence-associated protein with proteins having roles in mitosis, it is possible that the rice CRT of this Example functions in mitotic events. However, Nelson et al., supra, indicate possible additional roles for plant CRT in developmental processes, including a chaperone function that can be reconciled with CRT localization in the developing endosperm, a site characterized by high protein synthesis rates, and in secreting nectaries, which are associated with heavy traffic of secretory proteins through the ER. Note that OsCRTC also interacts with the rice soluble starch synthase homolog OsSSS. Soluble starch synthase enzymes have been isolated from plant endosperm cells (Cao et al., 2000).
These data suggest that the rice CRT homolog of this Example can also be found in this tissue, where it is conceivable that it interacts with the soluble starch synthase OsSSS in a chaperone role to promote proper folding of this protein during protein synthesis.

To further corroborate the notion that the rice senescence-associated protein Os006819-2510 is a membrane-associated protein, a novel interactor identified for this protein is a putative callose synthase catalytic subunit (OsPN23226), another transmembrane enzyme involved in glucan synthesis. Plasma membrane proteins participate in a variety of interactions with the cell wall, including synthesis and assembly of cell wall polymers (Biochemistry and Molecular Biology of Plants, Buchanan, Gruissem and Jones (eds.), John Wiley & Sons, New York, NY 2002, p. 13). The prey protein OsPN23226 likely functions as its Arabidopsis homolog, a plasma membrane enzyme that utilizes UDP-glucose as substrate to synthesize callose for deposition in the cell wall. The interactions of senescence-associated protein with the rice putative callose synthase OsPN23226 and with the calreticulin OsCRTC, and the interaction between OsCRTC and the soluble starch synthase OsSSS all involve membrane-associated proteins. While there is no evidence that such interactions occur at the same time, they can be associated with the traffic that sorts, distributes and targets membrane proteins and other molecules between compartments of the endomembrane system (Biochemistry and Molecular Biology of Plants, Buchanan, Gruissem and Jones (eds.), John Wiley & Sons, New York, NY 2002, p. 14) during the different stages of the cell cycle/development and in response to different physiological and developmental signals. Moreover, the interactions identified in this Example link the senescence-associated bait protein to glucan synthesis, a process that is vital to normal plant growth.
For example, the formation of a functional callose synthase 1 catalytic subunit (CalS1) complex is vital to cell plate formation. Functional characterization of the various components of the CalS1 complex and CalS-associated proteins has been proposed as a means to reveal how the activity of this enzyme is regulated during cell plate formation and to clarify callose synthesis and deposition in plants (Hong et al., Plant Cell 13(4): 755-768, 2001). The interaction identified here between senescence-associated protein and the novel putative callose synthase catalytic subunit (OsPN23226) provides new insight into this process in rice.
Other interactors identified for senescence-associated protein link this protein to the plant stress response. OsRAB16B is a member of the RAB family of proteins known to be induced by water stress and treatment with the plant hormone abscisic acid (ABA). ABA levels increase during seed development in many plant species, stimulating production of seed storage proteins and preventing premature germination; ABA is also induced by water stress and is thought to regulate stomatal transpiration (Raven, Evert and Eichhorn, p. 684). Based on functional homology with other RAB proteins and on the presence of the ABA-responsive elements in the OsRAB16B promoter, we presume that OsRAB16B has a role in the response to abiotic stress in rice and that its function can be regulated by Ca2+. Another interactor correlated with stress is low temperature-induced protein 5 (OsLIP5), which in yeast is involved in lipoic acid metabolism. Lipoic acid in animals has been shown to help minimize the effects of systemic stress (Kelly, 1999) and to provide animal cells with significant protection against the cytotoxic effects of repin, a sesquiterpene lactone isolated from Russian knapweed (Robles et al., 1997).
The high similarity (98%) of the rice LIP5-like protein to rice WS1724, a protein encoded by a gene induced by water stress and linked to resistance to chilling in rice, points to similar roles for the OsLIP5 prey protein. Gene expression experiments indicate that the gene encoding OsLIP5 is down-regulated upon treatment with the herbicide BL2. This finding suggests a role for OsLIP5 in the response to abiotic stress. While the specific function of the interactions between Os006819-2510 and the prey proteins OsRAB16B and OsLIP5 is not obvious, these interactions can participate in biological processes related to flower senescence and response to water stress and chilling.
In addition, the rice calreticulin OsCRTC discussed above can also have a role in the stress response. This hypothesis is based on functional homology with the tobacco CRT protein studied by Denecke et al., 1995 and found to participate in protein-protein interactions in a stress-dependent fashion.
In summary, among the interactors identified for the rice senescence-associated protein Os006819-2510 are several membrane-associated proteins, which supports the notion that the rice Os006819-2510 is a transmembrane protein. Among the interactors identified are proteins involved in cell cycle processes/mitosis and proteins with functions in the plant stress response. Some are newly characterized rice proteins. The interactions identified for rice senescence-associated protein with proteins involved in cell cycle/development and in resistance to stress suggest overlapping roles for the bait protein. Indeed, Os006819-2510 can constitute a link between stress tolerance and processes for cell division in rice.
Example III
OsSGT1 is a 367-amino acid protein that includes a tetratricopeptide repeat domain, two variable regions, the CS motif present in metazoan CHORD and SGT1 proteins, and the SGS motif.
In yeast, SGT1 is required for cell-cycle signaling. It associates with the kinetochore complex and the SCF-type E3 ubiquitin ligase by interacting with SKP1. The COP9 signalosome interacts with SCF E3 ubiquitin ligases. By its interaction with SCF complexes, SGT1 exerts its essential activity in the degradation of SIC1 and CLN1. Thus, one possible role of SGT1 could be to target proteins for degradation by the 26S proteasome via specific SCF complexes; alternatively, the SGT1 complex can participate in the modification of protein activity, or can have a dual role in activation and degradation of the target via ubiquitylation. A. thaliana has two SGT1 homologs. At nonpermissive temperatures, AtSGT1a and AtSGT1b can complement G1 and G2 arrest in temperature-sensitive sgt1 yeast mutants. However, SGT1b interacts with RAR1, which is required for RPP5-regulated disease resistance to downy mildew. In this scenario, target proteins involved in disease resistance can be targeted for protein degradation by the SGT1 pathway. Barley encodes an SGT1 homolog that also interacts with barley RAR1, which is implicated in disease resistance in barley to downy mildew (Austin et al., 2002; Azevedo et al., 2002).
A BLAST analysis comparing the nucleotide sequence of OsSGT1 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS016424.1 (98%) as the closest match. Gene expression experiments indicated that this gene is up-regulated by blast infection.
The rice SGT1 protein shares 74 and 75% amino acid sequence similarity with two Arabidopsis thaliana SGT1 homologs and 45% amino acid sequence similarity with Saccharomyces cerevisiae SGT1. In yeast, SGT1 is required for cell-cycle progression at the G1/S-phase and G2/M-phase transitions. In A. thaliana, SGT1b interacts with Rar1 and mediates disease resistance.
Thus, in plants, SGT1 likely controls processes that are fundamental to disease resistance and development. The rice OsSGT1 protein was chosen as a bait for these interaction studies based on its potential relevance to disease resistance and development. One bait fragment encoding amino acids 200-368 of OsSGT1 was used in the yeast two-hybrid screen, as described above.
Results
OsSGT1 was found to interact with ten rice proteins. Three interactors have been previously described, namely OsSGT1, a Ras GTPase (gi|730510), and elicitor responsive protein (gi|11358958). The remaining seven interactors are novel proteins with identifiable protein domains, or are similar to other proteins. These are an L-aspartase-like protein, an RNA binding domain protein, an auxin induced-like protein, an archain delta COP-like protein, a fibrillin-like protein, an HSP70-like protein, and a proline rich protein. The elicitor responsive protein was also used as a bait and interacted with 12 novel proteins with identifiable protein domains, with similarity to known proteins, or that are unidentifiable by sequence similarity. These were an NAD(P) binding domain protein, a gamma adaptin-like protein, a pectinesterase-like protein, a receptor-like protein kinase-like protein, a pyruvate orthophosphate dikinase-like protein, an Isp-4-like protein, a xanthine dehydrogenase-like protein, a ubiquitin specific protease-like protein and 4 unknown proteins.
The interacting proteins of this Example are listed in Tables 6-8, followed by detailed information on each protein and a discussion of the significance of the interactions. The nucleotide and amino acid sequences of the proteins of the Example are provided in SEQ ID NOs: 31-70 and 143-150.
Based on the biological function of SGT1, it is possible that the interacting proteins are also involved in cell cycle/mitotic processes and/or in plant resistance to stress.
Likewise, the interactors with the elicitor responsive protein can also be involved in plant resistance to stress. Proteins that participate in cell cycle regulation in rice can be targets for genetic manipulation or for compounds that modify their level or activity, thereby modulating the plant cell cycle. The identification of genes encoding these proteins can allow genetic manipulation of crops or application of compounds to effect agronomically desirable changes in plant development or growth. Likewise, genes that are involved in conferring plants resistance to stress have important commercial applications, as they could be used to facilitate the generation and yield of stress-resistant crops.

Table 6
Interacting Proteins Identified for Os006819-2510 (Hypothetical Protein 006819-2510, Similar to Hemerocallis Senescence-Related Protein 5).
The names of the clones of the proteins used as baits and found as preys are given. Nucleotide/protein sequence accession numbers for the proteins of the Example (or related proteins) are shown in parentheses under the protein name. The bait and prey coordinates (Coord) are the amino acids encoded by the bait fragment(s) used in the search and by the interacting prey clone(s), respectively. The source is the library from which each prey clone was retrieved.

Gene Name | Protein Name (GENBANK® Accession No.) | Bait Coord | Prey Coord (source)
BAIT PROTEIN
PN20285 (SEQ ID NO: 144) | OsSGT1 (gi|6581058) | |
INTERACTORS
PN24060 (SEQ ID NO: 32) | L-aspartase-like protein-like | 200-368 | 176-315 (output trait)
PN20696* (SEQ ID NO: 146) | Elicitor responsive protein (OsERP) (gi|11358958) | 200-368 | 54-144 (input trait)
PN23914 (SEQ ID NO: 34) | RNA binding domain protein | 200-368 | 1-263 x 3 (output trait)
PN23221# (SEQ ID NO: 36) | Proline rich protein | 200-368 | 182-366 x 2 (output trait); 207-344 (input trait); 134-254 (output trait)
PN20285 (SEQ ID NO: 144) | OsSGT1 (gi|6581058) | 200-368 | 9-227 (output trait)
PN24061 (SEQ ID NO: 38) | Auxin induced protein-like | 200-368 | 34-236 (output trait)
PN24063 (SEQ ID NO: 148) | RAS GTPase (gi|730510) | 200-368 | 63-202 (output trait)
PN23949 (SEQ ID NO: 40) | HSP70-like | 200-368 | 244-418 (output trait)
PN28982 (SEQ ID NO: 42) | Archain delta COP-like | |
PN29042 (SEQ ID NO: 44) | Fibrillin-like | |
* Additional interactions identified for elicitor responsive protein are shown in Table 7
# Additional interactions identified for PN23221 are shown in Table 8

Table 7

Gene Name | Protein Name (GENBANK® Accession No.) | Bait Coord | Prey Coord (source)
BAIT PROTEIN
PN20696 (OsERP) (SEQ ID NO: 146) | Elicitor responsive protein (gi|11358958) | |
INTERACTORS
PN29984 (SEQ ID NO: 46) | Novel protein PN29984 | 50-145 | 1-38; 5-41 (input trait)
PN30844 (SEQ ID NO: 48) | Novel protein PN30844 | 50-145 | 1-64 (input trait)
PN30868 (SEQ ID NO: 50) | NAD(P) binding domain protein | 50-145 | 167-336 (input trait)
PN24292 (SEQ ID NO: 52) | Gamma adaptin-like | 23-120 | 737-918 (output)
PN29983 (SEQ ID NO: 54) | Novel protein PN29983 | 50-145 | 1-131 (input trait)
PN30845 (SEQ ID NO: 56) | Pectinesterase-like | 50-145 | 1-64 (input trait)
PN31085 (SEQ ID NO: 58) | Receptor-like protein kinase-like | 23-120 | 378-553 (output trait)
PN20674 (SEQ ID NO: 60) | Pyruvate orthophosphate dikinase-like | 50-145 | 64-263; 71-298 (input trait)
PN30870 (SEQ ID NO: 62) | Isp-4 like | 50-145 | 1-446 (input trait)
PN29997 (SEQ ID NO: 64) | Xanthine dehydrogenase-like | 23-120 | 737/918 (output trait)
PN30843 (SEQ ID NO: 66) | Ubiquitin specific protease-like | 50-145 | 164-221 (input trait)
PN30857 (SEQ ID NO: 68) | Novel protein PN30857 | 50-145 | 1-148 (input trait)

Table 8

Gene Name | Protein Name (GENBANK® Accession No.) | Bait Coord | Prey Coord (source)
PREY PROTEIN
PN23221 (SEQ ID NO: 36) | Proline rich protein | |
BAIT PROTEIN
PN20621 (SEQ ID NO: 150) | Shaggy kinase (gi|13677093) | 120-435 | 175-311 (output trait)
PN20115 (SEQ ID NO: 70) | Ring zinc finger protein | 5-140 | 84-302; 191-324 (output trait)

Yeast Two-Hybrid Using OsSGT1 as Bait
The bait fragment encoding amino acids 200-368 of OsSGT1 was found to interact with L-aspartase-like protein PN24060. A BLAST analysis of the amino acid sequence of PN24060 indicated that this prey protein has 36.5% similarity to A. thaliana L-aspartase (gi|18394135). The enzyme L-aspartate ammonia-lyase (aspartase) catalyzes the reversible deamination of the amino acid L-aspartic acid, using a carbanion mechanism to produce fumaric acid and ammonium ion.
While the catalytic activity of this enzyme has been known for nearly 100 years, a number of recent studies have revealed some interesting and unexpected new properties of this reasonably well-characterized enzyme. The non-linear kinetics that are seen under certain conditions have been shown to be caused by the presence of a separate regulatory site. The substrate, aspartic acid, can also play the role of an activator, binding at this site along with a required divalent metal ion. It is therefore possible that PN24060 catalyses a reaction that pertains to protein modification, and that this modification can be important for disease resistance or cell cycling.
The bait fragment encoding amino acids 200-368 of OsSGT1 was also found to interact with elicitor responsive protein, PN20696. A BLAST analysis of the amino acid sequence of the prey clone PN20696 indicated that this protein is the rice elicitor responsive protein (gi|11358958; OsERP). OsERP is a 144-amino acid protein that, according to GENBANK®, is expressed by rice culture cells in the presence of the rice blast fungal elicitor. Thus, OsERP can have a role in disease responses in rice. OsERP was also used as bait and found to interact with 12 other proteins (see Table 7). These prey are described in this Example below.
An A. thaliana homologue to OsERP was identified by BLAST. At1g63220 shares 75% amino acid similarity with OsERP. To see if Arabidopsis homologues of OsERP have roles in disease resistance, Arabidopsis thaliana with T-DNA insertions in At1g63220 (line SAIL_320_D02) was identified from a random insertion seed library. Sequencing of the DNA regions surrounding the insertions revealed that the T-DNAs were located within exon 5 of At1g63220. Plants were backcrossed and plants homozygous for the T-DNA insertion were identified by PCR. Homozygous mutants and wild type plants were challenged with Pseudomonas syringae pv.
maculicola ES4326 and plants were assayed for amount of P. syringae bacteria accumulation 3 days post inoculation (Glazebrook et al., 1996). These experiments were repeated twice on at least six plants. Data are reported as means and standard deviations of the log of colony forming units per cm² of leaf. By three days after inoculation, the mutant plants accumulated more than 10 times as much bacteria as wild type plants (wt = 3.94 log cfu/leaf disk, std. 0.57; at1g63220 = 5.34, std. 0.63). Hence, At1g63220 contributes to disease resistance in A. thaliana. It is possible that the At1g63220 mutation inhibits defense responses that are dependent upon SGT1 interactions.
In addition, the bait fragment encoding amino acids 200-368 of OsSGT1 was found to interact with RNA-binding domain protein, PN23914. PN23914 is a 164-amino acid protein. A BLAST analysis of the amino acid sequence of this prey shows that it has 35.9% sequence identity to tFZR1 from Oncorhynchus mykiss (gi|2982698). tFZR1 is an orphan nuclear receptor family member that has an FTZ-F1 box. The amino acid sequences of its zinc finger domain and FTZ-F1 box have 92.8% and 100% identity, respectively, with those of zebrafish FTZ-F1. On the other hand, the overall homology between tFZR1 and zebrafish FTZ-F1 is low (33.0%). The results indicate that tFZR1 is a new member of the fushi tarazu factor 1 (FTZ-F1) subfamily. It is possible that PN23914 shares functionality through the zinc finger domain.
In addition, the bait fragment encoding amino acids 200-368 of OsSGT1 was found to interact with proline rich protein, PN23221. A BLAST analysis of the amino acid sequence of PN23221 indicated that this prey protein is 40.3% similar to a rice repetitive proline rich protein (gi|18478606). Proline rich proteins can mediate interaction among proteins (Zhao et al., 2001).
Note that proline rich protein PN23221 also interacts with shaggy kinase PN20621 and ring zinc finger protein-like PN20115 (see Table 8). Thus, the proline rich protein PN23221 can serve to bring these proteins together with OsSGT1.
The bait fragment encoding amino acids 200-368 of OsSGT1 was also found to interact with OsSGT1. In other words, OsSGT1 interacts with itself. The bait for OsSGT1 included amino acids 200-368, whereas the prey included amino acids 9-227. Although OsSGT1 can be a self-regulator through aggregation, the interaction between these bait and prey domains can instead reflect the natural folding of a single native OsSGT1 protein.
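To make the geometry of the OsSGT1 self-interaction concrete, the following minimal Python sketch computes the residues shared by the bait fragment (amino acids 200-368) and the prey clone (amino acids 9-227). The coordinates are taken from the text; the helper name `overlap` is purely illustrative, and the calculation says nothing about which residues actually mediate the contact.

```python
def overlap(a, b):
    """Return the inclusive residue range shared by two inclusive ranges, or None."""
    start, end = max(a[0], b[0]), min(a[1], b[1])
    return (start, end) if start <= end else None


BAIT = (200, 368)  # OsSGT1 bait fragment (amino acid coordinates from the text)
PREY = (9, 227)    # OsSGT1 prey clone (amino acid coordinates from the text)

SHARED = overlap(BAIT, PREY)
SHARED_LENGTH = 0 if SHARED is None else SHARED[1] - SHARED[0] + 1

if __name__ == "__main__":
    print(f"Shared residues: {SHARED}, length {SHARED_LENGTH}")
```

Under these coordinates the two fragments share only a short stretch, so the prey is mostly the N-terminal part of the protein and the bait mostly the C-terminal part, which is compatible with either intermolecular association or intramolecular folding.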
Additionally, the bait fragment encoding amino acids 200-368 of OsSGT1 was found to interact with an auxin-induced protein-like protein, PN24061. A BLAST analysis against the public database indicated that PN24061 is 63.5% similar to a rice putative IAA1 protein (gi|17154533). Indole acetic acid (IAA) is a plant growth hormone and is classified as an auxin. IAA is associated with a variety of physiological processes, including apical dominance, tropisms, shoot elongation, induction of cambial cell division and root initiation. Thus, genes that are induced by IAA likely produce proteins that respond to developmental changes. This association goes hand in hand with regulation of cell division by interaction with SGT1.
The bait fragment encoding amino acids 200-368 of OsSGT1 was also found to interact with Ras GTPase, PN24063. A BLAST analysis of the amino acid sequence of PN24063 determined that this protein is a ras-related GTP binding protein possessing GTPase activity (gi|730510). This protein has four conserved regions involved in GTP binding and hydrolysis, which are characteristic of the ras and ras-related small GTP-binding protein genes. In addition, two consecutive cysteine residues near the carboxyl terminal end, required for membrane anchoring, are also present. This protein, when synthesized in Escherichia coli, possessed GTPase activity (i.e., hydrolysis of GTP to GDP; Kidou et al., 1993). Ras GTPases are likely involved in signaling processes for development. ORFX from tomato is expressed early in floral development, controls carpel cell number, and has a sequence suggesting structural similarity to the human oncogene c-H-ras p21 (fw2.2: a quantitative trait locus key to the evolution of tomato fruit size; Frary et al., 2000). The Rho family of GTPases is also involved in control of cell morphology, and its members are also thought to mediate signals from cell membrane receptors (Winge et al., 1997).
An A.
thaliana homologue to PN24063 was identified by BLAST. At1g02130 shares 90% amino acid similarity with PN24063. To see if Arabidopsis homologues of PN24063 have roles in disease resistance, Arabidopsis thaliana with T-DNA insertions in At1g02130 (line SAIL_680_D03) was identified from a random insertion seed library. Sequencing of the DNA regions surrounding the insertions revealed that the T-DNAs were located within the promoter of At1g02130. Plants were backcrossed and plants homozygous for the T-DNA insertion were identified by PCR. Homozygous mutants and wild type plants were challenged with Pseudomonas syringae pv. maculicola ES4326 and plants were assayed for amount of P. syringae bacteria accumulation 3 days post inoculation (Glazebrook et al., supra). These experiments were repeated twice on at least six plants. Data are reported as means and standard deviations of the log of colony forming units per cm² of leaf. By three days after inoculation, the mutant plants accumulated more than 10 times as much bacteria as wild type plants (wt = 3.93 log cfu/leaf disk, std. 0.57; at1g02130 = 5.22, std. 0.9). Hence, At1g02130 contributes to disease resistance in A. thaliana. It is possible that the At1g02130 mutation inhibits defense responses that are dependent upon SGT1 interactions.
The bait fragment encoding amino acids 200-368 of OsSGT1 was found to interact with Archain delta COP, PN28982. A BLAST analysis of the amino acid sequence of PN28982 indicated that this prey protein is 92% similar to rice archain delta COP (gi|2506139). Cytosolic coat proteins that bind reversibly to membranes have a central function in membrane transport within the secretory pathway. One well-studied example is COPI or coatomer, a heptameric protein complex that is recruited to membranes by the GTP-binding protein Arf1.
Assembly into an electron-dense coat then helps in budding off membrane to be transported between the endoplasmic reticulum (ER) and Golgi apparatus. Activated Arf1 brings coatomer to membranes. However, once associated with membranes, Arf1 and coatomer have different residence times: coatomer remains on membranes after Arf1-GTP has been hydrolysed and dissociated. Rapid membrane binding and dissociation of coatomer and Arf1 occur stochastically, even without vesicle budding. This continuous activity of coatomer and Arf1 generates kinetically stable membrane domains that are connected to the formation of COPI-containing transport intermediates. This role for Arf1/coatomer might provide a model for investigating the behaviour of other coat protein systems within cells (Presley et al., 2002). It is possible that this delta COP interacts with OsSGT1 and a Ras GTPase to coordinate membrane transport for proteolytically processed proteins.
An A. thaliana homologue to PN28982 was identified by BLAST. At5g05010 shares 77% amino acid similarity with PN28982. To see if Arabidopsis homologues of PN28982 have roles in disease resistance, Arabidopsis thaliana with T-DNA insertions in At5g05010 (line SAIL_84_C10) was identified from a random insertion seed library. Sequencing of the DNA regions surrounding the insertions revealed that the T-DNAs were located within the promoter of At5g05010. Plants were backcrossed and plants homozygous for the T-DNA insertion were identified by PCR. Homozygous mutants and wild type plants were challenged with Pseudomonas syringae pv. maculicola ES4326 and plants were assayed for amount of P. syringae bacteria accumulation 3 days post inoculation (Glazebrook et al., supra). These experiments were repeated twice on at least six plants. Data are reported as means and standard deviations of the log of colony forming units per cm² of leaf.
By three days after inoculation, the mutant plants accumulated more than 10 times as much bacteria as wild type plants (wt = 3.93 log cfu/leaf disk, std. 0.57; at5g05010 = 5.24, std. 0.52). Hence, At5g05010 contributes to disease resistance in A. thaliana. It is possible that the At5g05010 mutation inhibits defense responses that are dependent upon SGT1 interactions.
The bait fragment encoding amino acids 200-368 of OsSGT1 was found to interact with fibrillin-like protein, PN29042. A BLAST analysis of the amino acid sequence of OsPN29037 indicated that this prey protein is 75% similar to the potato fibrillin homolog CDSP34 precursor from chloroplasts (gi|7489242). Plastid lipid-associated proteins, also termed fibrillin/CDSP34 proteins, are known to accumulate in fibrillar-type chromoplasts such as those of ripening pepper fruit, and in leaf chloroplasts from Solanaceae plants under abiotic stress conditions. Further, substantially increased levels of fibrillin/CDSP34 proteins are shown in various dicotyledonous and monocotyledonous plants in response to water deficit (Langenkamper et al., 2001). In water-stressed tomato plants, similar increases in the CDSP 34-related transcript amount were noticed in wild-type and the ABA-deficient flacca mutant, but protein accumulation was observed only in wild-type, suggesting a posttranscriptional role of ABA in CDSP 34 synthesis regulation. Substantial increases in CDSP 34 transcript and protein abundances were also observed in potato plants subjected to high illumination. The CDSP 34 protein is proposed to play a structural role in stabilizing stromal lamellae thylakoids upon osmotic or oxidative stress (Gillet et al., 1998).
A BLAST analysis comparing the nucleotide sequence of PN29042 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS011738 (100%) as the closest match.
Gene expression experiments indicated that this gene is up-regulated by ABA treatment.
An A. thaliana homologue to PN29042 was identified by BLAST. At4g22240 shares 79% amino acid similarity with PN29042. To see if Arabidopsis homologues of PN29042 have roles in disease resistance, Arabidopsis thaliana with T-DNA insertions in At4g22240 (line SAIL_691_B11) was identified from a random insertion seed library. Sequencing of the DNA regions surrounding the insertions revealed that the T-DNAs were located within exon 1 of At4g22240. Plants were backcrossed and plants homozygous for the T-DNA insertion were identified by PCR. Homozygous mutants and wild type plants were challenged with Pseudomonas syringae pv. maculicola ES4326 and plants were assayed for amount of P. syringae bacteria accumulation 3 days post inoculation (Glazebrook et al., supra). These experiments were repeated twice on at least six plants. Data are reported as means and standard deviations of the log of colony forming units per cm² of leaf. By three days after inoculation, the mutant plants accumulated more than 10 times as much bacteria as wild type plants (wt = 3.93 log cfu/leaf disk, std. 0.57; at4g22240 = 5.21, std. 0.43). Hence, At4g22240 contributes to disease resistance in A. thaliana. It is possible that the At4g22240 mutation inhibits defense responses that are dependent upon SGT1 interactions.
Additionally, the bait fragment encoding amino acids 200-368 of OsSGT1 was found to interact with HSP70-like protein, PN23949. A BLAST analysis of the amino acid sequence of OsPN3949 indicated that this prey protein is 71% similar to the cucumber 70K heat shock protein found in chloroplasts (gi|7441856). Heat shock proteins (reviewed in Bierkens et al., 2000) are stress proteins that function as intracellular chaperones to facilitate protein folding/unfolding and assembly/disassembly.
They are selectively expressed in plant cells in response to a range of stimuli, including heat and a variety of chemicals. As regulators, HSP proteins are thus part of the plant protective stress response. A BLAST analysis comparing the nucleotide sequence of PN23949 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS015016 (97%) as the closest match. Gene expression experiments indicated that this gene is down-regulated by herbicide and JA treatment.
Yeast Two-Hybrid Using OsERP (PN20696) as Bait
Next, one of the proteins found to interact with OsSGT1, namely the elicitor responsive protein PN20696 (gi|11358958; OsERP), was used as a bait. As shown in Table 7, the rice elicitor responsive protein PN20696 (gi|11358958; OsERP) was found to interact with a receptor-like protein kinase-like protein, PN31085. A BLAST analysis of the amino acid sequence of OsPN31085 indicated that this prey protein is 48% similar to a rice receptor-like protein kinase (gi|7434420). The receptor protein kinases include a large group of proteins, and most contain a cytoplasmic protein kinase catalytic domain, a transmembrane region, and/or an extracellular domain consisting of leucine-rich repeats, which are thought to interact with other macromolecules. Cell to cell communication is likely mediated by receptor kinases, which have important roles in plant morphogenesis.
OsERP was also found to interact with pyruvate orthophosphate dikinase, PN20674. A BLAST analysis of the amino acid sequence of PN20674 indicates that this prey protein is 97% similar to rice pyruvate orthophosphate dikinase (gi|743444). Pyruvate orthophosphate dikinase (PPDK) is known for its role in C4 photosynthesis but has no established function in C3 plants. Abscisic acid, PEG and submergence were found to markedly induce a protein of about 97 kDa, identified by microsequencing as PPDK, in rice roots (C3).
One rice PPDK is an ABA-induced protein from roots. Western blot analysis showed PPDK induction in roots of rice seedlings during gradual drying, cold, high salt and mannitol treatment, indicating a water deficit response. PPDK was also induced in the roots and sheath of submerged rice seedlings, and in etiolated rice seedlings exposed to an oxygen-free N2 atmosphere, which indicated a low-oxygen stress response. None of the stress treatments induced PPDK protein accumulation in the lamina of green rice seedlings. Ppdk transcripts were found to accumulate in roots of submerged seedlings, concomitant with the induction of alcohol dehydrogenase 1. Low-oxygen stress triggered an increase in PPDK activity in roots and etiolated rice seedlings, accompanied by increases in phosphoenolpyruvate carboxylase and malate dehydrogenase activities. The results indicate that cytosolic PPDK is involved in a metabolic response to water deficit and low-oxygen stress in rice, an anoxia-tolerant species (Moons et al., 1998).
Additionally, OsERP was found to interact with gamma adaptin, PN24292. A BLAST analysis of the amino acid sequence of PN24292 indicated that this prey protein is 97% similar to the Arabidopsis gamma adaptin (gi|5091510). Eukaryotic vesicular transport requires the recognition of membranes through specific protein complexes. The heterotetrameric adaptor protein complexes 1, 2, and 3 (AP1/2/3) are composed of two large, one small, and one medium adaptin subunit. Large subunits of AP1/2/3 are homologous, and two subunits of the heptameric coatomer I (COPI) complex belong to this gene family. In addition, all small subunits and the aminoterminal domain of the medium subunits of the heterotetramers are homologous to each other; this also holds for two corresponding subunits of the COPI complex.
AP1/2/3 and a substructure (heterotetrameric, F-COPI subcomplex) of the heptameric COPI have a common ancestral complex (called pre-F-COPI). Since all large and all small/medium subunits share sequence similarity, the ancestor of this complex is inferred to have been a heterodimer composed of one large and one small subunit (Schledzewski et al., 1999). An archain delta COP interacts with OsSGT1, which in turn interacts with OsERP, the bait that identified gamma adaptin.
OsERP was also found to interact with xanthine dehydrogenase, PN29997. A BLAST analysis of the amino acid sequence of PN29997 indicated that this prey protein is 66% similar to the Arabidopsis xanthine dehydrogenase (gi|15236216). Xanthine dehydrogenase is the enzyme responsible for xanthine degradation. Xanthine dehydrogenase is involved in purine catabolism and stress reactions. A BLAST analysis comparing the nucleotide sequence of PN29997 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS013724 (100%) as the closest match. Gene expression experiments indicated that this gene is expressed in seeds.
OsERP was also found to interact with ubiquitin specific protease, PN30843. A BLAST analysis of the amino acid sequence of PN30843 indicated that this prey protein is 40% similar to an Arabidopsis ubiquitin specific protease (gi|11993486). The ubiquitin/26S proteasome pathway is a major route for selectively degrading cytoplasmic and nuclear proteins in eukaryotes. In this pathway, chains of ubiquitins become attached to short-lived proteins, signaling recognition and breakdown of the modified protein by the 26S proteasome. During or following target degradation, the attached multi-ubiquitin chains are released and subsequently disassembled by ubiquitin-specific proteases (UBPs) to regenerate free ubiquitin monomers for re-use.
T-DNA insertion mutations in an Arabidopsis ubiquitin protease cause an embryonic lethal phenotype, with the homozygous embryos arresting at the globular stage. The arrested seeds have substantially increased levels of multi-ubiquitin chains, indicative of a defect in ubiquitin recycling. Thus, there is an essential role for the ubiquitin/26S proteasome pathway in general and for AtUBP14 in particular during early plant development (Doelling et al., Plant J. 27(5): 393-405, 2001). SGT1 also interacts with components of the ubiquitin/26S proteasome pathway, and the ERP that interacts with this ubiquitin specific protease interacts with OsSGT. This protease can have roles in disease resistance as well as development.

OsERP was also found to interact with pectinesterase, PN30845. A BLAST analysis of the amino acid sequence of PN30845 indicated that this prey protein is 71% similar to a rice pectinesterase (gi|15528783). Pectinesterases catalyse the de-esterification of cell wall polygalacturonans. In dicot plants, these ubiquitous cell wall enzymes are involved in important developmental processes including cellular adhesion and stem elongation. A BLAST analysis comparing the nucleotide sequence of PN30845 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS007057 (99%) as the closest match. Gene expression experiments indicated that this gene is up-regulated as a result of JA treatment, high saline growth conditions and herbicide treatment.

OsERP was also found to interact with several proteins, namely PN30870, PN29984, PN30844, PN29983, PN30868 and PN30857. A BLAST analysis of the amino acid sequences of PN30870, PN29984, PN30844, PN29983, PN30868 and PN30857 indicates that these prey proteins have no sufficient homology to any other characterized proteins. However, based on association with the rice elicitor responsive protein PN20696, these proteins can have roles in disease resistance or cell cycling.
A BLAST analysis comparing the nucleotide sequence of PN30857 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS008661.1 (99%) as the closest match. Gene expression experiments indicated that this gene is up-regulated as a result of blast infection.

An A. thaliana homologue to PN29983 was identified by BLAST. At2g36950 shares 52% amino acid similarity with PN29983.

To see if Arabidopsis homologues of PN29983 have roles in disease resistance, Arabidopsis thaliana with T-DNA insertions in At2g36950 (line SAIL_779_E11) was identified from a random insertion seed library. DNA regions surrounding the insertions were sequenced and revealed that the T-DNAs were located within exon 3 of At2g36950. Plants were backcrossed and plants homozygous for the T-DNA insertion were identified by PCR. Homozygous mutants and wild type plants were challenged with Pseudomonas syringae pv. maculicola ES4326 and plants were assayed for the amount of P. syringae bacteria accumulation 3 days post inoculation (Glazebrook et al., supra). These experiments were repeated twice on at least six plants. Data are reported as means and standard deviations of the log of colony forming units per leaf cm². By three days after inoculation, the mutant plants accumulated more than 10 times as much bacteria as wild type plants (wt = 3.94 log cfu/leaf disk std. 0.57, at2g36950 = 5.95 std. 0.72). Hence, At2g36950 contributes to disease resistance in A. thaliana. It is possible that the At2g36950 mutation inhibits defense responses that are dependent upon ERP/SGT1 interactions.

It should be noted that all of the following bait proteins, namely OsSGT, ring zinc finger, PN20115, and shaggy kinase, PN20621, identified proline rich protein, PN23221, as their prey. OsSGT and PN23221 have been described earlier in this Example.
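The "more than 10 times" statement for the At2g36950 mutant can be checked directly from the reported log values. The following is a minimal sketch, assuming the reported logs are base 10 (the usual convention for cfu counts, but not stated in the text); the function name is illustrative only.

```python
# Mean log(cfu) values reported in the text for wild type and the at2g36950 mutant.
# Assumption: the logarithms are base 10.
WT_LOG_CFU = 3.94
MUTANT_LOG_CFU = 5.95


def fold_difference(log_reference: float, log_sample: float) -> float:
    """Convert a difference between two log10 values into a fold change."""
    return 10 ** (log_sample - log_reference)


log_difference = MUTANT_LOG_CFU - WT_LOG_CFU
fold = fold_difference(WT_LOG_CFU, MUTANT_LOG_CFU)

print(f"difference: {log_difference:.2f} log units, roughly {fold:.0f}-fold")
```

Under that assumption, a difference of about two log units corresponds to roughly a hundredfold difference in mean bacterial counts, which is consistent with the "more than 10 times" wording.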
A BLAST analysis of the amino acid sequence of ring zinc finger PN20115 indicated that this bait protein is 65% similar to A. thaliana ring zinc finger protein At1g63170. The RING domain is a conserved zinc finger motif, which serves as a protein-protein interaction interface. This protein can interact with other proteins to control developmental or stress tolerance processes. A BLAST analysis comparing the nucleotide sequence of PN20115 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS015830 (90%) as the closest match. Gene expression experiments indicated that this gene is up-regulated under drought conditions.

A BLAST analysis of the amino acid sequence of shaggy kinase PN20621 indicated that this bait protein is the rice shaggy kinase (gi|131677093). GSK3/SHAGGY is a highly conserved serine/threonine kinase implicated in many signaling pathways in eukaryotes. Many GSK3/SHAGGY-like kinases have been identified in plants. The Arabidopsis BRASSINOSTEROID-INSENSITIVE 2 (BIN2) gene encodes a GSK3/SHAGGY-like kinase. Gain-of-function mutations within its coding sequence or its overexpression inhibit brassinosteroid (BR) signaling, resulting in plants that resemble BR-deficient and BR-response mutants. In contrast, reduced BIN2 expression via cosuppression partially rescues a weak BR-signaling mutation. Thus, BIN2 acts as a negative regulator to control steroid signaling in plants (Li and Nam, Science 295(5558): 1299-1301, 2002).

Summary

As one of the major human staples, rice has been a target of genetic engineering for higher yields and resistance to diseases, pests, and environmental stresses of various kinds. The proteins identified in the present Example have presumed roles in cell cycle processes and/or the stress response.
Knowledge of the proteins and molecular interactions associated with cell cycle processes and stress response in rice could lead to important applications in agriculture. Modulation of these interactions can be exploited to effect changes in plant development or growth that would result in increased crop yield and tolerance to environmental stress conditions.
Plant disease response often mimics certain normal developmental processes. For example, plant responses to fungal gibberellic acid and fusicoccin toxin are similar to responses to plant-produced gibberellin and auxin, respectively (Hedden and Kamiya, Annual Rev. Plant Physiol. Plant Mol. Biol. 48: 431, 1997; Baunsgaard et al., Plant J. 13: 661, 1998). The same can be said for abiotic stress responses and certain stages of plant development. Leaf cells undergoing dehydration stress express some of the same genes that embryonic cells express during development or seed desiccation (Medina et al., Plant Physiol. 125: 1655, 2001). Since systematic regulation of gene expression drives developmental processes and stress responses (Chen et al., Plant Cell 14: 559, 2002), it is likely that there is a broader overlapping set of genes and their cognate proteins involved in such responses. This Example describes one such overlapping set of genes.

The results described in this Example are useful for predicting gene function in rice or other plants. For example, rice has a homolog (OsSGT1; gb|AAF18438) to the barley SGT1 and A. thaliana SGT1b proteins that participate in pathogen defense through interactions with resistance gene and ubiquitinylation protein degradation pathways. OsSGT1 is inducible by blast infection and likely participates in pathogen defense. OsSGT1 interacted with several undefined and known proteins, including one whose transcript is induced upon treatment with a rice blast fungal elicitor (gb|AF090698). The elicitor-responsive protein (OsERP) interacted with other undefined proteins and a ubiquitin protease-related protein, which implicates OsERP in SGT1-mediated protein degradation. These rice proteins, as well as other plant homologs, are suspected to have associated roles in disease resistance. A.
thaliana proteins homologous to OsERP (PN20696), Ras GTPase (PN24063), Archain delta COP-like (28982), fibrillin-like (PN29042) and to one of the undefined proteins that interacted with OsERP (PN29983) have also been identified. A. thaliana plants homozygous for insertion mutations in the cognate genes were challenged with Pseudomonas syringae. By three days after inoculation, the mutant plants accumulated more than 10 times as many bacteria as wild type plants. Hence, these Arabidopsis homologs contribute to disease resistance in A. thaliana. It is possible that these mutations inhibit defense responses that are dependent upon SGT1 interactions. Based upon homology and the interaction map, the rice homologs associated with these Arabidopsis genes can also be involved in disease resistance and other processes utilizing SGT1 as a factor. These results demonstrate that the combined datasets can be used to predict gene functions that can be verified using phenotypes of mutants.

Example IV

This Example describes the identification and characterization of rice proteins that interact at the cell wall in response to biotic stress. As has been described above, an automated, high-throughput yeast two-hybrid assay technology was used to identify proteins interacting with rice chitinase, class III, and with cellulose synthase catalytic subunit. The sequences encoding the protein fragments used in the search were then compared by BLAST analysis against proprietary and public databases to determine the sequences of the full-length genes. The proteins found appear to be localized or targeted to the cell wall and to participate in the plant pathogen-induced defense response. The identification and characterization of proteins participating in pathways and biochemical reactions associated with defense against pathogens in rice can allow the development of genetically modified crops with enhanced or reduced disease resistance.
Chitinases are glycohydrolases that degrade chitin, a structural component of insects and plant pathogens such as nematodes, fungi, and bacteria. These enzymes are involved in multiple biological functions that include defense against chitin-containing pathogens, with class III chitinases having a substrate specificity for bacterial cell walls (Brunner et al., Plant J. 14(2): 225-34, 1998). Chitinase was chosen as a bait for these interaction studies based on its relevance to TMRI's plant health programs. The high potential for specific enzyme-substrate interactions makes these proteins suitable for two-hybrid assays. The identification of rice genes encoding proteins involved in the plant response to pathogens is important to agriculture, as their discovery can allow genetic manipulation of crops to obtain plants with enhanced or reduced disease resistance.

The second bait used in this Example, namely cellulose synthase catalytic subunit, is part of a membrane-bound enzyme complex involved in the synthesis of cellulose, an essential component of the cell wall of higher plants whose production is central to morphogenesis and many other biological processes in plants (reviewed in Perrin R.M., Curr. Biol. 11(6): R213-R216, 2001).

This Example provides newly characterized rice proteins interacting with a rice chitinase, class III (OsCHIB1), and with rice cellulose synthase catalytic subunit, RSW1-like (OsCS). An automated, high-throughput yeast two-hybrid assay technology (provided by Myriad Genetics Inc., Salt Lake City, UT) was used to search for protein interactions with the chitinase and cellulose synthase bait proteins.

Results

Chitinase, class III, was found to interact with rice catalase A, an antioxidant enzyme that is part of the plant's detoxification mechanism against molecules induced in response to environmental stresses.
A second interactor, cellulose synthase catalytic subunit, is an enzyme involved in cellulose biosynthesis and is the second bait protein of this Example. The search also identified four novel rice proteins interacting with chitinase: a protein similar to plant ABC transporter proteins, which play an important role in defense responses by eliminating toxins from tissues; a peptidase similar to Arabidopsis thaliana glutamyl aminopeptidase, whose proteolytic activity can be associated with activation of signaling molecules during the response of the plant to pathogens; a protein similar to a putative ATPase from A. thaliana; and one unknown protein, similar to a putative protein from A. thaliana.
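The bait-prey relationships of this Example can be held in a simple adjacency mapping, which makes it easy to count interactors per bait and to spot proteins that appear both as bait and as prey. The sketch below is illustrative only: the gene names are transcribed from Tables 9 and 10 of this Example, while the dictionary layout and the function names are assumptions made for the illustration.

```python
# Bait -> prey lists transcribed from Tables 9 and 10 of this Example.
# The data structure and helper names are hypothetical, for illustration only.
INTERACTIONS = {
    "OsCHIB1": [
        "OsCATA", "OsCS", "OsPN22823", "OsPN22154", "OsPN29041", "OsPN22020",
    ],
    "OsCS": [
        "OsCS", "OsAAB53810", "OsPIP2a", "OsPN22825", "OsPN29076",
        "OsPN29077", "OsPN29084", "OsPN29113", "OsPN29115", "OsPN29116",
        "OsPN29117", "OsPN29118", "OsPN29119",
    ],
}


def partners(bait):
    """Distinct interactors of a bait, excluding any self-interaction."""
    return sorted(set(INTERACTIONS[bait]) - {bait})


def self_interacting(bait):
    """True when the bait was also recovered as its own prey."""
    return bait in INTERACTIONS[bait]


def preys_also_used_as_bait():
    """Preys of one bait that were themselves screened as a bait."""
    return sorted(
        prey
        for bait, preys in INTERACTIONS.items()
        for prey in preys
        if prey in INTERACTIONS and prey != bait
    )


for bait in INTERACTIONS:
    print(bait, len(partners(bait)), "partners; self-interaction:", self_interacting(bait))
print("prey also used as bait:", preys_also_used_as_bait())
```

With the table entries as transcribed, the counts agree with the text: six interactors for the chitinase bait, and a self-interaction plus twelve other proteins for the cellulose synthase bait.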
The cellulose synthase catalytic subunit bait clone was found to interact with itself and with twelve proteins. These include three known rice proteins: the DNAJ homologue, a type of molecule known to participate in the plant protective stress response as a regulator of heat shock proteins, and two proteins that function as membrane-spanning pumps: the product of the salT gene, which is induced by salt and stress, and the channel protein aquaporin. Nine interactors are novel proteins: a DNA-damage inducible-like protein with a putative role in the plant defense mechanism against nucleic acid damage; a putative BAG protein which presumably participates in the plant stress response by regulating heat shock proteins; a protein similar to the riboflavin precursor 6,7-dimethyl-8-ribityllumazine synthase precursor from A. thaliana and possibly involved in biosynthesis of riboflavin during oxidative stress; a protein similar to soybean calcium-dependent protein kinase and one similar to A. thaliana putative zinc finger protein, with likely roles as mediators of molecular signaling or transcription following damage to the cell wall; and four proteins of unknown function.

The interacting proteins of the Example are listed in Table 9 and Table 10 below, followed by detailed information on each protein and a discussion of the significance of the interactions. A diagram of the interactions is provided in Figure 2. The nucleotide and amino acid sequences of the proteins of the Example are provided in SEQ ID NOs: 71-96 and 151-162.

Some of the proteins identified represent rice proteins previously uncharacterized. These proteins appear to participate in the plant defense mechanism against pathogens.
Based on their presumed biological function and on their ability to specifically interact with the chitinase and cellulose synthase bait proteins, the interacting proteins can be localized or targeted to the cell wall, where they are involved in biochemical reactions and gene induction associated with local or systemic defense against pathogens.

Table 9
Interacting Proteins Identified for OsCHIB1 (Chitinase, Class III).

The names of the clones of the proteins used as baits and found as preys are given. Nucleotide/protein sequence accession numbers for the proteins of the Example (or related proteins) are shown in parentheses under the protein name. The bait and prey coordinates (Coord) are the amino acids encoded by the bait fragment(s) used in the search and by the interacting prey clone(s), respectively. The source is the library from which each prey clone was retrieved.

| Gene Name | Protein Name (GENBANK® Accession No.) | Bait Coord | Prey Coord (Source) |
|---|---|---|---|
| BAIT PROTEIN | | | |
| OsCHIB1 PN19651 (SEQ ID NO: 152) | O. sativa Chitinase, Class III (AF296279; AAG02504) | | |
| INTERACTORS | | | |
| OsCATA PN20899 (SEQ ID NO: 154) | O. sativa Catalase A Isozyme (D29966; BAA06232) | 10-200 | 332-433 (input trait) |
| OsCS* PN19707 (SEQ ID NO: 156) | O. sativa Cellulose Synthase Catalytic Subunit, RSW1-Like (AF030052; AAC39333) | 10-200 | 411-489 (input trait) |
| OsPN22823 (SEQ ID NO: 72) | Novel Protein PN22823, Similar to ABC Transporter Proteins (T02187, AB043999.1, NP_171753; e=0) | 10-200 | 25-106 (input trait) |
| OsPN22154 (SEQ ID NO: 74) | Novel Protein PN22154, Similar to A. thaliana Glutamyl Aminopeptidase (AL035525; e=0) | 10-200 | 390-562 (input trait) |
| OsPN29041 (SEQ ID NO: 76) | Novel Protein PN29041, Fragment, Similar to A. thaliana Putative ATPase (AAG52137; e-17) | 10-200 | 2x 5-108 (input trait) |
| OsPN22020 (FL_R01_P005_C09.g.1a.Sp6a) (SEQ ID NO: 78) | Novel Protein PN22020, Fragment, Similar to A. thaliana Putative Protein (NP_197783; 3e-34) | 10-200 | 3x 76-170; 128-170 (input trait) |

* The cellulose synthase catalytic subunit was also used as a bait; its interactions are shown in Table 10.

Table 10
Interacting Proteins Identified for OsCS (Cellulose Synthase Catalytic Subunit, RSW1-Like)

| Gene Name | Protein Name (GENBANK® Accession No.) | Bait Coord | Prey Coord (Source) |
|---|---|---|---|
| BAIT PROTEIN | | | |
| OsCS PN19707 (SEQ ID NO: 156) | O. sativa Cellulose Synthase Catalytic Subunit, RSW1-Like (AF030052; AAC39333) | | |
| INTERACTORS | | | |
| OsCS PN19707 (SEQ ID NO: 156) | O. sativa Cellulose Synthase Catalytic Subunit, RSW1-Like (AF030052; AAC39333) | 316-583 | 316-582 (input trait) |
| OsAAB53810 PN29086 (SEQ ID NO: 158) | O. sativa salT Gene Product (AF001395; AAB53810.1) | 316-583 | 6-145 (output trait) |
| OsPIP2A PN29098 (SEQ ID NO: 160) | O. sativa Aquaporin (AF062393) | 316-583 | 123-290 (output trait) |
| OsPN22825 (SEQ ID NO: 80) | Novel Protein PN22825, Fragment | 316-583 | 5-129 (input trait) |
| OsPN29076 (SEQ ID NO: 82) | Novel Protein PN29076, Fragment | 316-583 | 1-187; 43-388; 122-304 (output trait) |
| OsPN29077 (SEQ ID NO: 84) | Novel Protein PN29077, Fragment, Similar to A. thaliana DNA-Damage Inducible Protein DDI1-Like (BAB02792; 5e-94) | 316-583 | 4x 1-242 (output trait) |
| OsPN29084 (SEQ ID NO: 86) | Novel Protein PN29084, Fragment, Similar to Soybean (Glycine max) Calcium-Dependent Protein Kinase (A43713, 2e-79) | 316-583 | 3x 1-253 (output trait) |
| OsPN29113 (SEQ ID NO: 162) | O. sativa DNAJ Homologue (BAB70509.1) | 316-583 | 1-92 (output trait) |
| OsPN29115 (SEQ ID NO: 88) | Novel Protein PN29115, Fragment, Similar to A. thaliana 6,7-Dimethyl-8-Ribityllumazine Synthase Precursor (AAK93590, 6e-17) | 316-583 | 1-188 (output trait) |
| OsPN29116 (SEQ ID NO: 90) | Novel Protein PN29116, Fragment | 316-583 | 1-169 (output trait) |
| OsPN29117 (FL_R01_P078_N11.fasta.contig1)* (SEQ ID NO: 92) | Novel Protein PN29117 | 316-583 | -7-151 (output trait) |
| OsPN29118 (SEQ ID NO: 94) | Novel Protein PN29118, Fragment | 316-583 | 1-136 (output trait) |
| OsPN29119 (FL_R01_P084_P01.g.1a.Sp6a) (SEQ ID NO: 96) | Novel Protein PN29119, Fragment | 316-583 | -53 to 155 (output trait) |

* OsPN29117 also interacts with heat shock protein hsp70 (OsHSP70, PN20775): three prey clones of OsPN29117 (one encoding amino acids 11-160, two encoding amino acids 29-160) from the output trait library interacted with a clone (amino acids 138-360) of OsHSP70 used as bait.

Yeast Two-Hybrid Using OsCHIB1 (Chitinase, Class III) as Bait

The rice class III chitinase (GENBANK® Accession No. AF296279) is a 286-amino acid protein. Chitinases are glycohydrolases that degrade chitin. Chitin is a structural component of insects, nematodes, fungi, and bacteria. Chitinases are one of the several kinds of pathogenesis-related (PR) proteins induced in higher plants in response to infection by pathogens (reviewed in Stintzi et al., Biochimie 75(8): 687-706, 1993). While chitinases perform multiple biological functions, the class III chitinases' substrate specificity for bacterial cell walls suggests a main role for these enzymes as defense proteins (Brunner et al., supra). The enzyme directly attacks the pathogen by degrading the fungal or bacterial cell wall. The bait fragment used in this search encodes amino acids 10 to 200 of OsCHIB1 (Chitinase, Class III). This region of the protein includes the active site of the enzyme (amino acids 127 to 135). There is no match for the gene encoding OsCHIB1 on TMRI's GENECHIP® Rice Genome Array.

OsCHIB1 (Chitinase, Class III) was found to interact with OsCATA (PN20899; O.
sativa Catalase A Isozyme (D29966; BAA06232)). Catalase A (GENBANK® Accession No. D29966) is the product of the rice CatA gene, which was identified by Higo and Higo, Plant Mol. Biol. 30(3): 505-521, 1996 as the homologue of the Cat-3 gene from Indian corn (Zea mays; GENBANK® Accession No. L05934). Both rice CatA and Z. mays Cat-3 genes belong to the monocot-specific group, one of three groups into which plant catalase genes have been classified based on their molecular evolution from a common ancestor (Guan and Scandalios, J. Mol. Evol. 42(5): 570-579, 1996). Rice catalase A contains 491 amino acids with two catalytic sites in position H65 and N138, and a heme binding-site in position Y348. The heme group is a cofactor for catalases' enzymatic activity. Higo and Higo, supra, showed that the CatA gene is expressed at high levels in seeds during early development and also in young seedlings, and that this gene is induced by the herbicide paraquat, but not, or only slightly, by abscisic acid (ABA), wounding, salicylic acid, and hydrogen peroxide.
Catalases are stress-induced enzymes found in almost all aerobic organisms. They are part of the enzymatic detoxification mechanism against active oxygen species (AOS) in plant cells. AOS are induced in response to environmental stress and act as signaling molecules to activate multiple defense responses through induction of PR genes and of other signaling molecules (e.g., salicylic acid, SA), leading to increased stress tolerance (Lamb and Dixon, Ann. Rev. Plant Biol. 48 (1): 251, 1997). AOS, however, can also damage proteins, membrane lipids, DNA and other cellular components of the plant. The balance between these two diverging effects depends on the tight control of cellular levels of AOS, which is achieved through a diverse battery of oxidant scavengers. Among these antioxidant molecules, catalases protect plant cells from the toxic effects of the AOS precursor hydrogen peroxide generated in the oxidative burst by converting it to dioxygen and water (reviewed in Dat et al., Redox Rep. 6(1): 37-42, 2001).

OsCHIB1 (Chitinase, Class III) was found to interact with O. sativa Cellulose Synthase Catalytic Subunit, RSW1-Like (OsCS; PN19707). The prey clone found in our search, retrieved from the input trait library, encodes amino acids 411 to 489 of rice cellulose synthase catalytic subunit. This region of the 583-amino acid protein is C-terminal to the transmembrane domains and is predicted by amino acid sequence analysis to be on the cytoplasmic side of the plasma membrane.

Cellulose synthase is a membrane-bound enzyme complex comprising multiple isoforms. Cellulose synthase catalytic subunit (GENBANK® Accession No. AF030052) is involved in the synthesis of cellulose, a polysaccharide that is an essential component of the cell wall of higher plants. Cellulose imparts mechanical properties to plants which determine plant growth and cell shape, and its production impacts many aspects of plant biology.
Most plants synthesize cellulose at the plasma membrane through the activity of cellulose synthase. As part of a structure called the rosette, the enzyme extends nascent cellulose chains by adding a sugar nucleotide precursor, and these chains then assemble into microfibrils that align in the same direction on the surface of the plasma membrane. This process seems to depend on a precise organization and orientation of the rosette (Perrin, R.M., Curr. Biol. 11(6): R213-6, 2001). A mutation in the A. thaliana rsw1 gene that causes cellulose disassembly results in altered root morphogenesis (Baskin et al., Aust. J. Plant Physiol. 19(4): 427-437, 1992), indicating that proper cellulose synthesis is critical to plant development and morphology. Arioli et al., Science 279(5351): 717-720, 1998 showed that the rsw1 gene in A. thaliana encodes a catalytic subunit of cellulose synthase. However, genetic and biochemical evidence now supports the concept that a family of genes encodes the catalytic subunit of cellulose synthase in higher plants, with various members showing tissue-specific expression or being differentially expressed in response to various conditions. These topics are reviewed in Perrin, R.M., supra. That review indicates that the presence of many genes for the cellulose synthase catalytic subunit in plants suggests that multiple isoforms of cellulose synthase can be needed in the same cell for the formation of functional multimeric complexes, most likely dimers. In addition, many other polypeptides have been detected within the rosette whose identities have not been determined. Interaction studies aimed at identifying the proteins interacting with cellulose synthase can help elucidate the organization of the cellulose synthase rosette machinery and address some of the questions that still remain about the biosynthesis of cellulose. There is no match for the gene encoding OsCS on TMRI's GENECHIP® Rice Genome Array.
Cellulose synthase catalytic subunit was also used as a bait protein. Its interactors are shown in Table 10 and discussed later in this Example.

OsCHIB1 (Chitinase, Class III) was found to interact with Protein PN22823, which is similar to ABC Transporter Proteins (OsPN22823). Protein PN22823 is a 1239-amino acid protein that includes ten predicted transmembrane domains (amino acids 45 to 61, 154 to 170, 174 to 190, 253 to 269, 295 to 311, 671 to 687, 715 to 731, 794 to 810, 818 to 834, and 933 to 949) and two ATP/GTP-binding site motifs A (P-loops) (amino acids 383 to 390 and 1031 to 1038). A BLAST analysis against the Genpept database indicated that PN22823 shares 55% identity with Japanese goldthread (Coptis japonica) CjMDR1 (GENBANK® Accession No. AB043999.1; e=0.0). CjMDR1 is a multidrug resistance gene expressed in the rhizome, where alkaloids are highly accumulated compared to other organs (Yazaki et al., J. Exp. Bot. 52(357): 877-9, 2001). Other proteins highly similar to PN22823 include A. thaliana putative ABC transporter (GENBANK® Accession No. T02187; e=0) and putative P-glycoprotein (GENBANK® Accession No. NP_171753; e=0). These types of proteins contain ATP-binding cassettes (ABC) and belong to a family that includes P-glycoprotein (P-gp) and multidrug resistance-associated protein 2 (MRP2) (reviewed by Fardel et al., Toxicology 167(1): 37-46, 2001). ABC proteins are membrane-spanning proteins that transport a wide variety of compounds across biological membranes, including phospholipids, ions, peptides, steroids, polysaccharides, amino acids, organic anions, drugs and other xenobiotics. In mammals, ABC transporters participate in the biliary elimination of exogenous compounds and xenobiotics, and their expression can be up-regulated by these toxins. The large number of ABC transporter protein family members identified in A. thaliana (129 according to Sanchez-Fernandez et al., J. Biol. Chem.
276(32): 30231-30244, 2001) suggests an important role for these proteins in plants. In agreement with this notion, ABC transporters were among the immediate early genes found to be up-regulated in a tropical japonica rice cultivar (Oryza sativa cv. Drew) in response to jasmonic acid, benzothiadiazole, and/or blast infection (Xiong et al., Mol. Plant Microbe Interact. 14(5): 685-692, 2001). This suggests that ABC proteins play a role in defense against toxins in plants as they do in mammals. Most of the ABC transporters characterized in plants to date have been localized in the vacuolar membrane and are considered to be involved in the intracellular sequestration of cytotoxins (reviewed in Leslie et al., Toxicology 167(1): 3-23, 2001). Furthermore, plant ABC transporters appear to have a role equivalent to that of the mammalian ABC transporter in multidrug resistance, as shown in a study in which an ABC transporter protein was up-regulated in a Nicotiana plumbaginifolia cell culture following treatment with a close analog of the antifungal diterpene sclareol (Jasinski et al., Plant Cell 13(5): 1095-107, 2001). MRP homologues isolated from A. thaliana (AtMRPs) are implicated in providing herbicide resistance to plants (Rea et al., Annu. Rev. Plant Physiol. Plant Mol. Biol. 49: 727-760, 1998). There is also evidence that ABC transporter proteins act as hormone transporters as they do in mammals. Specifically, a mutation in one of the ABC transporters in A. thaliana, AtMRP5, results in decreased root growth and increased lateral root formation, possibly due to the inability of the mutant AtMRP5 to act as an auxin conjugate transporter (Gaedeke et al., EMBO J. 20(8): 1875-1887, 2001).

A BLAST analysis comparing the nucleotide sequence of Novel Protein PN22823 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OSORF012127_at (e-45 expectation value) as the closest match.
Gene expression experiments indicated that this gene is induced by the fungal pathogen M. grisea.

OsCHIB1 (Chitinase, Class III) was found to interact with protein PN22154, which is similar to A. thaliana Glutamyl Aminopeptidase (OsPN22154). OsPN22154 is a 173-amino acid protein fragment that is 65% identical to a protein from A. thaliana (GENBANK® Accession No. AL035525) described as a homologue of mouse aminopeptidase (GENBANK® Accession No. U35646). The cDNA sequence of the A. thaliana aminopeptidase-like protein and the rice genome sequence (as a template) were used to generate a rice DNA sequence coding for a protein of 874 amino acids, which is 54.7% identical to the A. thaliana aminopeptidase-like protein. Indeed, domain analysis of the novel rice protein detected a peptidase M1 domain (amino acids 17 to 402) and a zinc binding domain (amino acids 311 to 320), suggesting that this protein is a metallo-aminopeptidase. It is unclear whether this protein is encoded by an orthologue or an analogue of the A. thaliana aminopeptidase-like gene. A BLAST analysis comparing the nucleotide sequence of Novel Protein PN22154 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS_004263_at (4e-83 expectation value) as the closest match. Gene expression experiments indicated that this gene is expressed in panicle.

OsCHIB1 (Chitinase, Class III) was found to interact with protein PN29041 (OsPN29041). A BLAST analysis indicated that this protein fragment is similar to a putative ATPase from A. thaliana (GENBANK® Accession No. AAG52137; e-17). ATPases can be localized to the plasma membrane, which is adjacent to the cell wall. There is no match for this gene on TMRI's GENECHIP® Rice Genome Array, and thus no gene expression data that would allow prediction of its function during stress or infection. It is possible that this protein has no role in pathogen invasion.
However, it is part of the chitinase multiprotein complex identified in this Example through the yeast two-hybrid interactions, which we suggest exists at the cell wall interface. One hypothesis is that the ATPase-like protein can reside in the plasma membrane and participate in cell wall synthesis. Further interaction data can help elucidate the biological significance of its participation in the chitinase multiprotein complex.

OsCHIB1 (Chitinase, Class III) was found to interact with protein PN22020 (OsPN22020). Protein PN22020 is a 175-amino acid protein fragment that shares 55% identity with A. thaliana putative protein (GENBANK® Accession No. NP_197783; 3e-34). Analysis of the amino acid sequence identified a C2 domain (amino acids 5 to 90, e=0.037), as found in protein kinase C isozymes, which suggests that PN22020 can participate in signaling pathways similar to those modulated by protein kinase C. Perhaps its interaction with chitinase represents a signaling event that occurs in response to pathogen or toxin exposure. However, this domain has been detected in other kinases and nonkinase proteins (Ponting and Parker, Protein Sci. 5(1): 162-166, 1996). Identification of the full amino acid sequence of novel protein PN22020 can make it possible to determine the class of C2 domain-containing proteins to which it belongs.

A BLAST analysis comparing the nucleotide sequence of Novel Protein PN22020 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS008182_r_at (e-102 expectation value) as the closest match. Gene expression experiments indicated that this gene is constitutively expressed in leaves, stems, roots, seeds, panicle and pollen.

Yeast Two-Hybrid Using OsCS as Bait

A second bait, namely O. sativa Cellulose Synthase Catalytic Subunit, RSW1-Like (OsCS; PN19707; GENBANK® Accession No. AF030052), was also used.
This protein is described earlier in this Example because it was found to interact with the bait protein O. sativa Chitinase, Class III (OsCHIB1; PN19651). The bait fragment used in the search encodes amino acids 316 to 583 of OsCS.
OsCS was found to interact with O. sativa Cellulose Synthase Catalytic Subunit, RSW1-like (OsCS). In other words, OsCS was found to interact with itself. The prey clone was retrieved from the input trait library, and encoded almost the same amino acids as the bait clone (the prey clone encoded amino acids 316 to 582). The self-interaction supports the concept of cellulose synthase acting as a dimer, as has been suggested (see Perrin, R.M., Curr. Biol. 11(6): R213-R216, 2001).
OsCS was also found to interact with O. sativa salT Gene Product (OsAAB53810). A BLAST analysis of the 145-amino acid OsAAB53810 amino acid sequence indicated that this protein is the rice salT Gene Product (AAB53810.1; 100% identity; 3e-8). This protein is encoded by a cDNA clone, salT, which was isolated from rice roots subjected to salinity stress, as reported by Claes et al. (Plant Cell 2(1): 19-27, 1990). These authors showed that the salT mRNA is specifically expressed in sheaths and roots from mature plants and seedlings in response to salt stress and drought. Expression data reported previously by Garcia et al. (Planta 207(2): 172-80, 1998) indicate that expression of salT in each region of the plant is dependent on the metabolic activity of the cells as well as on whether or not they are responding to stress. These authors also found that the salT gene is induced by gibberellic acid and abscisic acid and suggest that induction by these growth regulators occurs through independent and possibly antagonistic pathways. Analysis of the OsAAB53810 protein sequence predicted a jacalin-like lectin domain (amino acids 14 to 145, 2.3e-32).
Jacalin interacts with carbohydrates in a highly specific manner (Sankaranarayanan et al., Nat. Struct. Biol. 3(7): 596-603, 1996).
OsCS was also found to interact with Aquaporin (OsPIP2a). Aquaporin (GENBANK® Accession No. AF062393) is a 290-amino acid protein that includes six predicted transmembrane domains (amino acids 48 to 64, 83 to 99, 131 to 147, 175 to 191, 207 to 223, and 254 to 270) and a Major Intrinsic Protein (MIP) family signature (amino acids 34 to 271), as determined by amino acid sequence analysis. The prey clone retrieved from the output trait library encodes amino acids 123 to 290 of OsPIP2a, a region that includes the four most C-terminal predicted transmembrane domains and part of the MIP family signature. Aquaporin is thought to be a plasma membrane intrinsic protein (Malz and Sauter, Plant Mol. Biol. 40(6): 985-995, 1999). Such proteins facilitate movement of small molecules, often functioning as water channels, which is why OsPIP2a is also called aquaporin. Malz and Sauter identified OsPIP2a along with OsPIP1a and report that these two proteins possess several hallmark motifs and homologies that justify their assignment to their respective PIP subfamilies. They report that OsPIP2a and OsPIP1a display similar, but not identical, expression patterns in rice, both being expressed at higher levels in seedlings than in adult plants, and that expression in the primary root is regulated by light. Furthermore, their study indicates that gibberellic acid also regulates the expression of these OsPIP transcripts in internodes of deepwater rice plants induced to grow rapidly by submergence, although expression did not correlate with growth. In A. thaliana, different PIP proteins are expressed in response to different agonists and conditions, e.g., salt stress induces tonoplast intrinsic protein (SITIP), as reported by Pih et al., Mol. Cells 9(1): 84-90, 1999.
These authors suggest that PIP proteins can be responsible for osmoregulation in plants under high osmotic stress such as a high salt condition.
OsCS was also found to interact with protein PN22825 (OsPN22825). OsPN22825 is a 229-amino acid protein fragment for which the complete sequence is not known. A BLAST analysis against the public and Myriad's proprietary databases indicated that OsPN22825 is similar to two unknown proteins from A. thaliana (GENBANK® Accession No. NP_188565, 67% identity, 3e-82; and GENBANK® Accession No. AB025624, 37% identity, 3e-82). There is no match for the gene encoding OsPN22825 on TMRI's GENECHIP® Rice Genome Array, and thus no gene expression data that would allow prediction of its function during stress or infection.
OsCS was also found to interact with protein PN29076 (OsPN29076). OsPN29076 is a 389-amino acid protein fragment for which the complete sequence is not known. Analysis of the available amino acid sequence identified a cytochrome c family heme-binding site (amino acids 142 to 147). A BLAST analysis revealed no proteins with high similarity to OsPN29076, the best hit being an A. thaliana unknown protein (GENBANK® Accession No. AAF24616, 34% identity, 3e-46). Three prey clones encoding amino acids 1 to 187, 42 to 389, and 121 to 304 of OsPN29076 were retrieved from the output trait library. The clones share an overlapping region which spans amino acids 121 to 187 of OsPN29076 and which includes the cytochrome c family heme-binding site. There is no match for the gene encoding OsPN29076 on TMRI's GENECHIP® Rice Genome Array, and thus no gene expression data that would allow prediction of its function during stress or infection. The lack of information about OsPN29076 makes it difficult to determine its function. Identification of the complete amino acid sequence for OsPN29076 can contribute to clarifying the function of this protein and the biological significance of the OsCS-OsPN29076 interaction.
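The overlapping region reported for the three OsPN29076 prey clones can be checked with simple interval arithmetic. The following is a minimal sketch and not part of the described method: the function names `shared_region` and `contains` are hypothetical, and the only data used are the clone and heme-binding site coordinates stated above, treated as inclusive amino acid positions.

```python
from typing import Iterable, Optional, Tuple

# Inclusive (start, end) amino acid coordinates; an illustrative convention.
Interval = Tuple[int, int]


def shared_region(clones: Iterable[Interval]) -> Optional[Interval]:
    """Return the inclusive interval common to all clones, or None if there is none."""
    clone_list = list(clones)
    if not clone_list:
        return None
    # The shared region starts at the latest start and ends at the earliest end.
    start = max(s for s, _ in clone_list)
    end = min(e for _, e in clone_list)
    return (start, end) if start <= end else None


def contains(outer: Interval, inner: Interval) -> bool:
    """True if `inner` lies entirely within `outer` (both inclusive)."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# Coordinates taken from the text for OsPN29076.
prey_clones = [(1, 187), (42, 389), (121, 304)]
heme_binding_site = (142, 147)

overlap = shared_region(prey_clones)
print(overlap)                                # (121, 187)
print(contains(overlap, heme_binding_site))   # True
```

Under these assumptions the computed overlap agrees with the region stated in the text (amino acids 121 to 187) and confirms that it spans the cytochrome c family heme-binding site.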
OsCS was also found to interact with protein PN29077, which is similar to A. thaliana DNA-Damage Inducible Protein DDI1-Like (OsPN29077). OsPN29077 is a 243-amino acid protein fragment for which the complete sequence is not known. A BLAST analysis indicated that OsPN29077 shares 73% identity with A. thaliana DNA-damage inducible protein DDI1-like (GENBANK® Accession No. BAB02792; 5e-94). DDI1 is thought to be a cell-cycle checkpoint protein in yeast and its expression is induced by a variety of DNA-damaging agents. Such proteins arrest cells at certain stages and regulate the transcriptional response to DNA damage (Zhu and Xiao, Nucleic Acids Res. 26(23): 5402-5408, 1998). DDI1 has been reported to interact with ubiquitin (Bertolaet et al., Nat. Struct. Biol. 8(5): 417-422, 2001), an observation that supports the use of the yeast two-hybrid approach to study such proteins.
A BLAST analysis comparing the nucleotide sequence of OsPN29077 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS016688.1_at (e-83 expectation value) as the closest match. Gene expression experiments indicated that this gene is not specifically expressed in several different tissue types and is not specifically induced by a broad range of plant stresses, herbicides, and applied hormones.
OsCS was also found to interact with protein PN29084, which is similar to G. max calcium-dependent protein kinase (OsPN29084). OsPN29084 is a 284-amino acid protein fragment for which the complete sequence is not known. Analysis of the available amino acid sequence identified four EF-hand calcium-binding domains (amino acids 110 to 122, 146 to 158, 182 to 194, and 216 to 228). In agreement with the presence of these domains, a BLAST analysis indicated that OsPN29084 is highly similar to many calcium-dependent protein kinases including soybean (G. max) calcium-dependent protein kinase (GENBANK® Accession No.
A43713, 81% identity, 2e-79). This soybean protein also includes four EF-hand calcium-binding domains and requires calcium but not calmodulin or phospholipids for activity (Harper et al., Science 252(5008): 951-954, 1991). Calcium can function as a second messenger through stimulation of such calcium-dependent protein kinases.
A BLAST analysis comparing the nucleotide sequence of OsPN29084 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS004083.1_at (e-83 expectation value) as the closest match. Gene expression experiments indicated that this gene is not specifically expressed in several different tissue types and is not specifically induced by a broad range of plant stresses, herbicides, and applied hormones.
OsCS was also found to interact with O. sativa DNAJ homologue (OsPN29113). OsPN29113 is a 92-amino acid protein whose sequence includes an ATP/GTP-binding site motif A (P-loop, amino acids 43 to 50). A BLAST analysis of the available amino acid sequence indicated that OsPN29113 is the rice DNAJ homologue (GENBANK® Accession No. BAB70509.1; 100% identity; 5e-39). In eukaryotic cells, DnaJ-like proteins regulate the chaperone (protein folding) function of Hsp70 heat-shock proteins through direct interaction of different Hsp70 and DnaJ-like protein pairs (Cyr et al., Trends Biochem. Sci. 19(4): 176-181, 1994). Heat shock proteins (reviewed in Bierkens, J.G., Toxicology 153(1-3): 61-72, 2000) are stress proteins that function as intracellular chaperones to facilitate protein folding/unfolding and assembly/disassembly. They are selectively expressed in plant cells in response to a range of stimuli, including heat and a variety of chemicals. As regulators of heat shock proteins, DnaJ-like proteins are thus part of the plant protective stress response.
A BLAST analysis comparing the nucleotide sequence of OsPN29113 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS002926_at (e-24 expectation value) as the closest match. Gene expression experiments indicated that this gene is not specifically expressed in several different tissue types and is not specifically induced by a broad range of plant stresses, herbicides, and applied hormones.
OsCS was also found to interact with protein PN29115, which is similar to A. thaliana 6,7-dimethyl-8-ribityllumazine synthase precursor (OsPN29115). OsPN29115 is a 188-amino acid protein fragment for which the complete sequence is not known. The available sequence includes an ATP/GTP-binding site motif A (P-loop, amino acids 94 to 101) and a 6,7-dimethyl-8-ribityllumazine synthase family signature (amino acids 42 to 186), as determined by analysis of the available amino acid sequence. The presence of the latter domain is in agreement with the results of a BLAST analysis indicating that OsPN29115 shares 50% identity with A. thaliana putative 6,7-dimethyl-8-ribityllumazine synthase precursor (GENBANK® Accession No. AAK93590, 6e-37). The cofactor riboflavin is synthesized from the precursor 6,7-dimethyl-8-ribityllumazine (Nielsen et al., J. Biol. Chem. 261(8): 3661-3669, 1986). Flavins are involved in numerous biological processes (reviewed by Massey, V., Biochem. Soc. Trans. 28(4): 283-296, 2000). For example, they participate in electron transfer reactions and thereby contribute to oxidative stress through their ability to produce superoxide, but at the same time flavins participate in the reduction of hydroperoxides, the products of oxygen-derived radical reactions. Flavins also contribute to soil detoxification and are linked to light-induced DNA repair in plants. The chemical versatility of flavoproteins is controlled by specific interactions with the proteins to which they are bound.
A BLAST analysis comparing the nucleotide sequence of OsPN29115 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS015577_at (e-41 expectation value) as the closest match. Gene expression experiments indicated that this gene is not specifically expressed in several different tissue types and is not specifically induced by a broad range of plant stresses, herbicides, and applied hormones.
OsCS was also found to interact with protein PN29116 (OsPN29116). OsPN29116 is a 170-amino acid protein fragment for which the complete sequence is not known. Analysis of the available amino acid sequence identified a WD40 domain (amino acids 82 to 118), which is reported to participate in protein-protein interactions (Ajuh et al., J. Biol. Chem. 276(45): 42370-42381, 2001). A BLAST analysis indicated that OsPN29116 shares identity with two unknown proteins from A. thaliana (GENBANK® Accession No. T45879, 67% identity, e-64; and GENBANK® Accession No. NP_181253, 69% identity, e-58). The lack of information about OsPN29116 makes it difficult to determine its function. Identification of the complete amino acid sequence for OsPN29116 can clarify the function of this protein and the biological relevance of the OsCS-OsPN29116 interaction.
A BLAST analysis comparing the nucleotide sequence of OsPN29116 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS016500_r_at (e-12 expectation value) as the closest match. This match is not significant enough for the probeset to be a reliable indicator of the gene expression of OsPN29116.
OsCS was also found to interact with protein PN29117 (OsPN29117). OsPN29117 is a 237-amino acid protein that includes a ubiquitin domain (amino acids 12 to 84). Analysis of the amino acid sequence identified a BAG domain (amino acids 106 to 187, 2.le"), which is known to bind and regulate Hsp70/Hsc70 molecular chaperones (Briknarova et al., Nat. Struct. Biol. 8(4): 349-352, 2001). The BAG family of cochaperones functionally regulates signal-transducing proteins and transcription factors important for cell stress responses, apoptosis, proliferation, cell migration and hormone action (Briknarova et al., supra; Antoku et al., Biochem. Biophys. Res. Commun. 286(5): 1003-1010, 2001). A BLAST analysis indicated that OsPN29117 shares identity with an A. 
thaliana unknown protein (GENBANK® Accession No. AAC14405, 44% identity, 4e-52). In agreement with the notion that OsPN29117 is a member of the BAG family of proteins, it was also found to interact with hsp70 (OsHSP70) (see note * under Table 30). Heat shock proteins (discussed above) are stress proteins which function as ATP-dependent intracellular chaperones and which are selectively expressed in plant cells in response to a range of stimuli, including heat and a variety of chemicals. As a regulator of heat shock proteins, the BAG protein OsPN29117 can thus be part of the plant protective stress response.
The prey clone retrieved in the search encodes amino acids 1 to 151 of OsPN29117, a region that includes the ubiquitin domain. Note that the prey clone includes a small portion (-7 to 0) of the 5' untranslated region, and thus its coordinates are shown in Table 2 as amino acids -7 to 151.
A BLAST analysis comparing the nucleotide sequence of OsPN29117 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS017803_at (e-73 expectation value) as the closest match. Gene expression experiments indicated that this gene is not specifically expressed in several different tissue types and is not specifically induced by a broad range of plant stresses, herbicides, and applied hormones.
OsCS was also found to interact with protein PN29118 (OsPN29118). OsPN29118 is a 136-amino acid protein fragment for which the complete sequence is not known. A BLAST analysis indicated that OsPN29118 has only weak similarity to proteins in the public domain and in Myriad's proprietary database, the best hit being an A. thaliana putative zinc finger protein SHI-like (GENBANK® Accession No. NP_201436, 42% identity, 5e-15). The protein with the next highest identity is an A. thaliana hypothetical protein (GENBANK® Accession No. T04595, 38% identity, 9e-5).
Discovery of the complete amino acid sequence for OsPN29118 can contribute to clarifying the function of this protein and the biological relevance of the OsCS-OsPN29118 interaction.
A BLAST analysis comparing the nucleotide sequence of OsPN29118 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS004996.1_at (e-38 expectation value) as the closest match. Gene expression experiments indicated that this gene is not specifically expressed in several different tissue types and is not specifically induced by a broad range of plant stresses, herbicides, and applied hormones.
OsCS was also found to interact with protein PN29119 (OsPN29119). OsPN29119 is a 327-amino acid protein fragment for which the complete sequence is not known. A BLAST analysis indicated that OsPN29119 shares 38% identity with an A. thaliana unknown protein, T17H3.9 (GENBANK® Accession No. AAD45997, 7e-54). Discovery of the complete amino acid sequence for OsPN29119 can contribute to clarifying the function of this protein and the biological relevance of the OsCS-OsPN29119 interaction. One prey clone encoding amino acids 1 to 155 of OsPN29119 was retrieved from the output trait library. This prey clone includes a portion of the 5' untranslated region and thus its coordinates are shown in Table 2 as amino acids -53 to 155.
A BLAST analysis comparing the nucleotide sequence of OsPN29119 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS014829.1_at (e-131 expectation value) as the closest match. Gene expression experiments indicated that this gene is not specifically expressed in several different tissue types and is not specifically induced by a broad range of plant stresses, herbicides, and applied hormones.
Summary
Proteins that Interact with OsCHIB1 (Chitinase, Class III)
The yeast two-hybrid assay designed to search for proteins interacting with the chitinase bait proteins led to the isolation of proteins that appear to be associated with the plant defense response to pathogens. Resistance to disease occurs on several levels that include local and nonspecific systemic responses. The hypersensitive response (HR) in plants is a mechanism of local resistance to pathogenic microbes characterized by a rapid and localized tissue collapse and cell death at the infection site, resulting in immobilization of the intruding pathogen. This process is triggered by pathogen elicitors and orchestrated by an oxidative burst, which occurs rapidly after the attack (Lamb and Dixon, Ann. Rev. Plant Biol. 48(1): 251, 1997). The accumulation of active oxygen species (AOS) is a central theme during plant responses to both biotic and abiotic stresses. AOS are generated at the onset of the HR and might be instrumental in killing host tissue during the initial stages of infection. AOS also act as signaling molecules that induce expression of PR genes and production of other signaling molecules which participate in the signal cascade that leads to PR gene induction. The triggering of defense genes can extend to the uninfected tissues and the whole plant, leading to local resistance (LR) and systemic acquired resistance (SAR; reviewed in Martinez et al., Plant Physiol. 122(3): 757-766, 2000). As a result of SAR, other portions of the plant are provided with long-lasting protection against the same and unrelated pathogens.
Hydrogen peroxide from the oxidative burst plays an important role in the localized HR not only by driving the cross-linking of cell wall structural proteins, but also by triggering cell death in challenged cells and by acting as a diffusible signal for the induction in adjacent cells of genes encoding cellular protectants such as glutathione S-transferase and glutathione peroxidase, and for the production of salicylic acid (SA). SA is thought to act as a signaling molecule in LR and SAR through generation of SA radicals, a likely by-product of the interaction of SA with catalases and peroxidases, as reported by Martinez et al. (supra). These authors showed that recognition of a bacterial pathogen by cotton triggers the oxidative burst that precedes the production of SA in cells undergoing the HR, and that hydrogen peroxide is required for local and systemic accumulation of SA, thus acting as the initiating signal for LR and SAR. The involvement of catalase in SA-mediated induction of SAR in plants was previously demonstrated by Chen et al. (Science 262(5141): 1883-1886, 1993), who showed that binding of catalase to SA results in inhibition of catalase activity, and that consequent accumulation of hydrogen peroxide induces expression of defense-related genes associated with SAR.
In this study, chitinase was found to interact with catalase A. Given the established role of chitinase as a defense protein, this interaction is consistent with the presence of the stress-induced catalase during pathogen attack and suggests that both enzymes can be located at the cell wall, where they participate in PR gene induction. The significance of the chitinase-catalase interaction as part of the defense response against microbes finds further support in the observation that fungal catalase has a role in protecting necrotrophic fungi from the deleterious effects of AOS during colonization of a host expressing the HR (Mayer et al., Phytochemistry 58(1): 33-41, 2001). These organisms were shown to secrete catalase, among other enzymes, to remove or inactivate AOS from the host.
In addition, the cell wall can play a role in defense against bacterial and fungal pathogens by receiving information from the surface of the pathogen from molecules called elicitors, and by transmitting this information to the plasma membrane of plant cells, resulting in gene-activated processes that lead to resistance. One type of biochemical reaction induced by elicitors and associated with the hypersensitive response is the synthesis and accumulation of phytoalexins, antimicrobial compounds produced in the plant after fungal or bacterial infection (reviewed in Hammerschmidt, R., Ann. Rev. Phytopathol. 37: 285-306, 1999). One of the proteins found to interact with chitinase is an ABC transporter. ABC transporters are known to sequester cytotoxins, metabolites and other molecules from plant tissues. It is thus likely that the ABC transporter found to interact with chitinase resides at the cell wall, where it participates in the transport of toxins.
Though the function of phytoalexins in the plant defense response has not been thoroughly elucidated (Hammerschmidt, R., supra), it is tempting to speculate that the ABC transporter can be involved in the elimination of these toxins from the plant cells during the plant pathogen-induced defense response. Furthermore, gene expression experiments indicated that the gene encoding the ABC transporter protein is induced by the fungal pathogen M. grisea. These results are consistent with the putative role of this protein in the defense response induced by pathogenic fungi and bacteria in rice.
Chitinase was also found to interact with novel protein PN22154, which is similar to A. thaliana glutamyl aminopeptidase. While the specific function of this prey protein has not been determined, it is well known that proteolytic activity is a common component of plant defense mechanisms against pathogens. These mechanisms include both chitinases and proteases. Peptidase activity has been associated with regulation of signaling. Carboxypeptidases, for instance, hydrolytically remove the pyroglutamyl group from peptide hormones, thereby activating these signaling molecules. A carboxypeptidase regulates Brassinosteroid-insensitive 1 (BRI1) signaling in A. thaliana by proteolytic processing of a protein (Li et al., Proc. Natl. Acad. Sci. USA 98(10): 5916-5921, 2001). Based on the ability of novel protein PN22154 to interact with chitinase and on the well-established role of the latter in PR defense, we suggest that chitinase and novel protein PN22154 can interact as components of a complex with chitinolytic and proteolytic activities targeted against plant invaders, and that the rice glutamyl aminopeptidase-like protein can have a role in activating signaling molecules at the cell wall that are involved in the plant defense response.
A fourth interactor found for chitinase is cellulose synthase catalytic subunit.
This enzyme acts as a complex at the plasma membrane where it participates in cell wall synthesis, and its regulation can allow the plant to respond with morphological changes to physical insult produced by pathogen attack. This interaction can be significant to maintaining the balance of the metabolism of cell wall components during the defense response. It is possible that either chitinase resides at the cell wall where it interacts with cellulose synthase immediately following pathogen attack, or chitinase is targeted to this site and interacts with cellulose synthase after PR gene induction.
Aside from novel proteins PN22020 and PN29041, the rice proteins found to interact with chitinase appear to be localized at or recruited to the cell wall where they participate in the plant defense response to pathogen attack. Two of the interactors, an ABC transporter and a glutamyl aminopeptidase-like protein, are newly characterized proteins in rice.
As a whole, all of these proteins can interact as a multicomponent complex at the cell wall interface in the plant cell, and all can have roles in controlling AOS levels, inducing PR genes, and synthesizing and maintaining the integrity of the cell wall to protect the plant against the effects of pathogen invasion.
Proteins that Interact with Cellulose Synthase Catalytic Subunit (OsCS)
The interactions involving OsCS expand the stress-response protein network identified for the chitinase bait protein. OsCS interacts with several proteins that appear to participate in the plant response to pathogen-induced stress at the cell wall. Published evidence links some of these proteins to the plant response to various stresses.
These include aquaporin (OsPIP2a) and salt-stress induced protein (OsAAB53810), two molecules that, although they may not have a direct role in disease resistance, can function as membrane-spanning pumps in the protein complex at the cell wall to regulate turgor pressure or transmit solutes. Moreover, the presence of the jacalin-like lectin domain in OsAAB53810 is of particular interest in the context of its interaction with an enzyme that synthesizes carbohydrate chains. Given the carbohydrate-binding property of jacalin (Sankaranarayanan et al., Nat. Struct. Biol. 3(7): 596-603, 1996), OsAAB53810 can specifically bind nascent cellulose chains as they are produced by OsCS, thus playing an active role in OsCS-dependent events relating to cell wall metabolism. The fact that OsAAB53810 is induced by salt and stress supports a role for this protein in such physiological events.
Another interactor, the rice DNAJ homologue OsPN29113, likely participates in the plant protective stress response by regulating the chaperone function of heat shock proteins, which are induced by various forms of stress. It is possible that the interaction of the DNAJ protein with cellulose synthase is part of the plant response to chemicals produced by pathogens or generated in cells undergoing the HR, and that such response is associated with injury to the cell wall that has occurred in response to the stress.
Among the novel proteins found to interact with OsCS, OsPN29077 is similar to A. thaliana DNA-damage inducible protein DDI1-like. Based on the expression of yeast DDI1 in response to DNA damage and on sequence homology, we speculate that OsPN29077 performs the same function as DDI1 and that the OsCS-OsPN29077 interaction is associated with the plant defense mechanism against DNA damage. Likewise, we attribute to the BAG-like protein OsPN29117 a putative role in the plant protective stress response as a regulator of heat shock proteins.
In agreement with this role, OsPN29117 also interacts with hsp70, which our gene expression experiments indicate is expressed constitutively and is down-regulated by jasmonic acid (see chart in Appendix 1), a component of plant defense response pathways. Since OsPN29077 and OsPN29117 interact with the cellulose synthase catalytic subunit, and the latter interacts with the pathogen-induced defense protein chitinase, these interactors can be a part of the same complex at the cell wall where they participate in the response to pathogen attack.
The novel protein OsPN29115 is similar to the riboflavin precursor 6,7-dimethyl-8-ribityllumazine synthase precursor from A. thaliana. Among the roles reported for riboflavin is its association with the redox reactions occurring as a result of oxidative stress (Massey, V., Biochem. Soc. Trans. 28(4): 283-96, 2000). Based on this evidence and on sequence homology for the identified interactor, the OsCS-OsPN29115 interaction can link the plant response to stress and toxins produced by pathogens with structural changes requiring OsCS activity.
Additional novel proteins interacting with OsCS include a protein similar to soybean calcium-dependent protein kinase (OsPN29084) and a protein similar to A. thaliana putative zinc finger protein (OsPN29118). The similarities of these interactors to protein kinases and zinc finger proteins suggest that they function as mediators of molecular signaling and transcription, respectively. Their interactions with OsCS can represent signaling or transcriptional events occurring after disruption following damage to the cell wall by pathogens, and these prey proteins can move from the cell wall to other parts of the cell to mediate such events.
The OsCS-OsPN29084 interaction likely represents a step in the transduction of an extracellular signal that results in a physiological response, while the OsCS-OsPN29118 interaction can be associated with transcriptional regulation also in response to an extracellular signal. This signal can be in the form of an insult to the plant produced by pathogen attack.
For the remaining proteins found to interact with OsCS (OsPN22825, OsPN29076, OsPN29116, and OsPN29119), their association with cellulose synthase and chitinase suggests that these prey proteins can also be important factors for pathogen defense, cell wall integrity, or for holding together protein complexes. Thus, the results presented in this Example show that proteins interacting with the cellulose synthase catalytic subunit are also part of the chitinase multiprotein complex localized at the cell wall interface.
Example V
Janssens and Goris teach that type 2A serine/threonine protein phosphatases (PP2A) are important regulators of signal transduction, which they affect by dephosphorylation of other proteins (Janssens and Goris, Biochem J. 353(Pt 3): 417-439, 2001). Members of the protein phosphatase 2A (PP2A) family of serine/threonine phosphatases contain a well-conserved catalytic subunit, the activity of which is highly regulated (Janssens and Goris, supra). There are multiple PP2A isoforms in plants and other organisms, and they appear to be differentially expressed in various tissues and at different stages of development (Arino et al., Plant Mol. Biol. 21(3): 475-485, 1993). Harris et al. cite a number of reports describing the association of PP2A subunits with a variety of cellular proteins in addition to regulatory subunits, suggesting that PP2As function as regulators of various signaling pathways associated with protein synthesis, cell cycle and apoptosis (Harris et al., Plant Physiol. 121(2): 609-617, 1999).
PP2A enzymes have been implicated as mediators of a number of plant growth and developmental processes.

In addition, PP2A enzymes play a role in pathogen invasion. In animals, a variety of viral proteins target specific PP2A enzymes to deregulate chosen cellular pathways in the host and promote viral progeny (Sontag, E., Cell Signal 13(1): 7-16, 2001; Garcia et al., Microbes Infect. 2(4): 401-407, 2000). PP2A enzymes interact with many cellular and viral proteins, and these protein-protein interactions are critical to modulation of PP2A signaling (Sontag, supra). The proteins interacting with PP2A (e.g., PP2A) can, for example, target PP2A to different subcellular compartments, or affect PP2A enzyme activity. Moreover, PP2A enzymes play a role in plants in their response to viral infection (Dunigan and Madlener, Virology 207(2): 460-466, 1995). Indeed, serine/threonine protein phosphatase is required for tobacco mosaic virus-mediated programmed cell death (Dunigan and Madlener, supra).

OsPP2A-2 (GENBANK® Accession No. AF134552) is a 308-amino acid subunit of a family of protein phosphatases that contains a serine/threonine protein phosphatase signature (amino acids 112 to 117). As described above, a yeast two-hybrid approach was taken to dissect PP2A-mediated signaling events. The bait fragments used in this search and found to have interactors encode amino acids 1 to 308 and 150 to 308 of OsPP2A-2.

The second bait used in this Example, OsCAA90866, is a protein encoded by a complete cDNA sequence that is only known to be inducible by chilling in rice. OsCAA90866 was chosen as a bait for these interaction studies based on its relevance to abiotic stress. Investigation into the interactions involving OsCAA90866 will provide insight into the function of this poorly defined protein.
The identification of rice genes involved in modulating the response of the plant to an environmental challenge, thus conferring on it a selective advantage, would facilitate the generation and yield of crops resistant to abiotic stress.

Results

OsPP2A-2 was found to interact with rice putative proline-rich protein, which is possibly a transcriptional regulator, and with the seed storage protein glutelin. The search also identified five novel rice proteins interacting with OsPP2A-2: a putative PP2A regulatory subunit protein also similar to rice chilling-inducible protein CAA90866 (the second bait protein of this Example); an enzyme similar to phosphoribosylanthranilate transferase that is likely involved in the plant response to pathogen infection; a disulfide isomerase, with a putative role in protein folding; a voltage-dependent ion channel protein; and a DnaJ-like protein with a putative role in the pathogen-induced defense response.

The second bait protein of this Example, chilling-inducible protein CAA90866, was found to interact with itself and with six proteins. One of these is the same putative PP2A regulatory subunit protein (similar to the bait protein itself) found to interact with the bait OsPP2A-2 described in this Example. This interaction links the two networks of proteins identified in this Example (i.e., links proteins associated with biotic and abiotic stress to phosphatases).
The other interactors identified in this search include a 14-3-3-like protein that is induced under various abiotic stress conditions; a pyrrolidone carboxyl peptidase-like protein with a putative role in activating signaling peptides involved in the plant's response to cold stress; a novel protein containing an inositol phosphate domain likely involved in regulation of signaling events associated with cold tolerance; a novel rice homolog of wheat initiation factor (iso)4f p82 subunit with a putative role in RNA decay pathways associated with stress conditions; and a novel protein similar to plant 2-dehydro-3-deoxyphosphooctonate aldolase.

The interacting proteins of the Example are listed in Table 11 and Table 12 below, followed by detailed information on each protein and a discussion of the significance of the interactions. A diagram of the interactions is provided in Figure 3. The nucleotide and amino acid sequences of the proteins of the Example are provided in SEQ ID NOs: 97-112 and 163-174.

Some of the proteins identified represent rice proteins previously uncharacterized. Based on their presumed biological function and on their ability to specifically interact with the bait proteins OsPP2A-2 or OsCAA90866, we speculate that the proteins interacting with OsPP2A-2 represent a network involved in the rice defense response to biotic stress, and those interacting with OsCAA90866 are associated with the abiotic stress response. Importantly, the interactions identified suggest that phosphatases play a role in the regulation of both biotic and abiotic stress response in rice.

Table 11
Interacting Proteins Identified for OsPP2A-2 (Serine/Threonine Protein Phosphatase PP2A-2).

The names of the clones of the proteins used as baits and found as preys are given. Nucleotide/protein sequence accession numbers for the proteins of the Example (or related proteins) are shown in parentheses under the protein name.
The bait and prey coordinates (Coord) are the amino acids encoded by the bait fragment(s) used in the search and by the interacting prey clone(s), respectively. The source is the library from which each prey clone was retrieved.

| Gene Name | Protein Name (GENBANK® Accession No.) | Bait Coord | Prey Coord (Source) |
|---|---|---|---|
| BAIT PROTEIN | | | |
| OsPP2A-2 PN20254 (AF134552-OS002763) (SEQ ID NO: 164) | O. sativa Serine/Threonine Protein Phosphatase PP2A-2, Catalytic Subunit (AF134552, AAD22116) | | |
| INTERACTORS | | | |
| OsAAK63900 PN23266 (SEQ ID NO: 166) | O. sativa Putative Proline-Rich Protein AAK63900 (AC084884) | 1-308 | 122-224 (input trait) |
| OsORF020300-2233.2 PN21639 (2233(2)-OS-ORF020300 novel) (SEQ ID NO: 98) | Hypothetical Protein ORF020300-2233.2, Putative PP2A Regulatory Subunit, Similar to OsCAA90866 (AAD39930; 5e-92) (CAA90866; 5e-53) | 1-308 | 93-387; 118-388 (input trait) |
| OsPN23268 PN23268 novel (SEQ ID NO: 100) | Novel Protein 23268, Similar to Phosphoribosylanthranilate Transferase, Chloroplast Precursor, Fragment (AAB02913.1; 5e-95) | 1-308 | 2x 12-200 (input trait) |
| OsCAA33838 PN24775 (SEQ ID NO: 168) | O. sativa Glutelin CAA33838 (X15833) | 150-308 | 5-155 (output trait) |
| OsPN26645 (Contig3412.fasta.Contig1) (novel) (SEQ ID NO: 102) | Novel Protein PN26645, Putative Protein Disulfide Isomerase-Related Protein Precursor (BAB09470.1; e-28) | 1-308 | 24-164 (input trait) |
| OsPN24162 (Contig3453.fasta.Contig1) (novel) (SEQ ID NO: 104) | Novel Protein PN24162, Porin-like, Voltage-Dependent Anion Channel Protein (NP_201551; 3e-86) | 150-308 | 28-164 (output trait) |
| Os011994-D16 PN20618 (FLRO1_P028_D16OSO11994) (novel) (SEQ ID NO: 106) | Hypothetical Protein 011994-D16, Similar to Z. mays DnaJ protein (T01643; e=0) | 150-308 | 99-368 (output trait) |

Table 12
Interacting Proteins Identified for OsCAA90866 (O. sativa Chilling-Inducible Protein CAA90866).

The names of the clones of the proteins used as baits and found as preys are given. Nucleotide/protein sequence accession numbers for the proteins of the Example (or related proteins) are shown in parentheses under the protein name. The bait and prey coordinates (Coord) are the amino acids encoded by the bait fragment(s) used in the search and by the interacting prey clone(s), respectively. The source is the library from which each prey clone was retrieved.

| Gene Name | Protein Name (GENBANK® Accession No.) | Bait Coord | Prey Coord (Source) |
|---|---|---|---|
| BAIT PROTEIN | | | |
| OsCAA90866 PN20311 (984756_OS015052) (SEQ ID NO: 170) | O. sativa Chilling-Inducible Protein CAA90866 (Z54153, CAA90866) | | |
| INTERACTORS | | | |
| OsCAA90866 PN20311 (SEQ ID NO: 170) | O. sativa Chilling-Inducible Protein CAA90866 (Z54153, CAA90866) | 100-250 | 1-126 (output trait) |
| Os008938-3209 PN20215 (3209-OS208938) (SEQ ID NO: 172) | O. sativa Putative 14-3-3 Protein (AAK38492) | 100-250 | 4x 53-259 (input trait) |
| OsAAG46136 PN23186 (SEQ ID NO: 174) | O. sativa Putative Pyrrolidone Carboxyl Peptidase (AAG46136) | 100-250 | 2x 92-222 (input trait) |
| OsORF020300-223 PN21639 (SEQ ID NO: 98) | Hypothetical Protein ORF020300-2233.2, Putative PP2A Regulatory Subunit, Similar to OsCAA90866 (AAD39930; 5e-92) (CAA90866; 5e-13) | 100-250 | 3x 1-206; 3x 1-190 (output trait) |
| OsPN23045 (SEQ ID NO: 108) | Novel Protein PN23045 | 100-250 | 2x 240-287 (input trait) |
| OsPN23225 (SEQ ID NO: 110) | Novel Protein PN23225, Similar to Triticum aestivum Initiation Factor (iso)4f p82 Subunit (AAA74724; e=0) | 100-250 | 639-792 (input trait) |
| OsPN29883 (SEQ ID NO: 112) | Novel Protein PN29883, Fragment | 100-250 | 58-175 (output trait) |

Two-Hybrid Using OsPP2A as a Bait

The bait fragment encoding amino acids 1 to 308 of O. sativa Serine/Threonine Protein Phosphatase PP2A-2, Catalytic Subunit (OsPP2A-2) was found to interact with O. sativa (rice) putative proline-rich protein, which is possibly a transcriptional regulator. The bait fragment (i.e., aa 1-308 of OsPP2A-2) includes the serine/threonine protein phosphatase signature of OsPP2A-2. One prey clone encoding amino acids 122 to 224 of OsAAK63900 was retrieved from the input trait library. Somewhat surprisingly, this prey clone does not code for the HLH domain of OsAAK63900.

O. sativa Putative Proline-Rich Protein AAK63900 (OsAAK63900) (GENBANK® Accession No. AC084884) is a 224-amino acid protein that includes a putative transmembrane spanning region (amino acids 7 to 23). It also contains a gntR family signature (amino acids 10 to 34) common to a group of DNA-binding transcriptional regulation proteins in bacteria (see Buck and Guest, Biochem. J. 260: 737-747, 1989; Haydon and Guest, FEMS Microbiol. Lett. 79: 291-296, 1991; and Reizer et al., Mol. Microbiol. 5: 1081-1089, 1991).
This signature includes a helix loop helix (HLH) protein dimerization domain (amino acids 5 to 20) that is often found in transcription factors (see Murre et al., Cell 56: 777-783, 1989; Garrel and Campuzano, BioEssays 13: 493-498, 1991; Kato and Dang, FASEB J. 6: 3065-3072, 1992; Krause et al., Cell 63: 907-919, 1990; and Riechmann et al., Nuc. Acids Res. 22: 749-755, 1994). However, no DNA-binding motif is detectable.

Note that analysis of the amino acid sequence of OsAAK63900 also detected an Ole e I family signature (amino acids 30 to 162) including six conserved cysteines that are involved in disulfide bonds. This signature is a conserved region found in a group of plant pollen proteins of unknown function which tend to be secreted and consist of about 145 amino acids (and thus are shorter than OsAAK63900). The first of the Ole e I family of proteins to be discovered was Ole e I (IUIS nomenclature), a constitutive protein in the olive tree Olea europaea pollen and a major allergen (Villalba et al., Eur. J. Biochem. 216(3): 863-869, 1993).

The bait fragment encoding amino acids 1 to 308 of OsPP2A-2 (which includes the serine/threonine protein phosphatase signature of OsPP2A-2) was also found to interact with O. sativa OsORF020300-2233.2, a novel 418-amino acid protein which has a putative PP2A regulatory subunit, similar to OsCAA90866. Two prey clones encoding amino acids 93 to 387 and 118 to 388 of OsORF020300-223 were retrieved from the input trait library, which indicates that OsORF020300-223 interacts with OsPP2A-2 through a region within amino acids 118 to 387, the segment shared by both clones. OsORF020300-223 includes a possible cleavage site between amino acids 50 and 51, although it appears to have no N-terminal signal peptide. OsORF020300-223 is similar to A. thaliana PP2A regulatory subunit (GENBANK® Accession No. AAD39930.1; 44.5% amino acid sequence identity; 5e-91 expectation value).
OsORF020300-223 is also similar to rice chilling-inducible protein CAA90866 (GENBANK® Accession No. CAA90866; 68% sequence identity; 9e-48 expectation value), a protein related to chilling tolerance in rice, with which OsORF020300-223 also interacts. CAA90866 was also used as a bait protein, and the interactions identified for it are discussed later in this Example.

A BLAST analysis comparing the nucleotide sequence of OsORF020300-223 against TMRI's GENECHIP® Rice Genome Array sequence database (http://tmri.org/gene expweb/) identified probeset OS015607_at (e-135 expectation value) as the closest match. Gene expression experiments indicated that this gene is induced by the fungal pathogen M. grisea.

The bait fragment encoding amino acids 1 to 308 of OsPP2A-2 (which includes the serine/threonine protein phosphatase signature of OsPP2A-2) was also found to interact with a novel protein (23268), an enzyme similar to phosphoribosylanthranilate transferase that is likely involved in the plant response to pathogen infection. The novel protein, which was named OsPN23268, is similar to anthranilate phosphoribosyltransferase, a chloroplast precursor. Two prey clones encoding amino acids 12 to 200 of novel protein OsPN23268 were retrieved from the input trait library.

OsPN23268 is a novel 320-amino acid protein with a possible cleavage site between amino acids 43 and 44, although there does not appear to be an N-terminal peptide sequence. Analysis of the OsPN23268 protein sequence detected two domains originally defined in E. coli thymidine phosphorylase (Walter et al., J. Biol. Chem. 265(23): 14016-14022, 1990): the glycosyl transferase family, helical bundle domain (amino acids 1 to 61) and the glycosyl transferase family, a/b domain (amino acids 66 to 303). The latter contains a beta-sheet that is splayed open to accommodate a putative phosphate-binding site (Walter et al., supra).
Two prey clones of OsPN23268 retrieved from the input trait library and found to interact with OsPP2A-2 included sequence encoding amino acids 12 to 200 of novel protein OsPN23268. This sequence of OsPN23268 includes most of the glycosyl transferase family helical bundle domain and part of the a/b domain.

The glycosyl transferase family includes thymidine phosphorylase and anthranilate phosphoribosyltransferase enzymes. In mammalian cells, thymidine phosphorylase is identical to the angiogenic factor, platelet-derived endothelial cell growth factor (Morita et al., Curr. Pharm. Biotechnol. 2(3): 257-267, 2001; Brown and Bicknell, Biochem. J. 334(Pt 1): 1-8, 1998), and it also controls the effectiveness of the chemotherapeutic drug capecitabine by converting it to its active form (Ackland and Peters, Drug Resist. Updat. 2(4): 205-214, 1999). As its name indicates, novel protein 23268 is similar to A. thaliana phosphoribosylanthranilate transferase (GENBANK® Accession No. AAB02913.1; 56.6% identity; 5e-95), an enzyme with a role in the tryptophan biosynthetic pathway which is also found in bacteria (Edwards et al., J. Mol. Biol. 203(2): 523-524, 1988). In A. thaliana, this tryptophan biosynthetic enzyme is synthesized as a higher-molecular weight precursor and then imported into chloroplasts to be processed into its mature form (Zhao and Last, J. Biol. Chem. 270(11): 6081-6087, 1995). The A. thaliana anthranilate phosphoribosyltransferase is also similar to DESCA11 (GENBANK® Accession No. B1534445; e-17), one of the genes identified in Chenopodium amaranticolor (a plant with broad-spectrum virus resistance) which are induced during the hypersensitive response (HR) of the plant subsequent to infection with tobacco mosaic virus and tobacco rattle tobravirus (Cooper, B., Plant J. 26(3): 339-349, 2001).
A BLAST analysis comparing the nucleotide sequence of OsPN23268 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS015603_s_at (3e-41 expectation value) as the closest match. Our gene expression experiments indicate that this gene is induced by the fungal pathogen M. grisea.

The bait fragment of OsPP2A-2 containing amino acids 150 to 308 was also found to interact with the seed storage protein glutelin CAA33838 (OsCAA33838). Glutelin CAA33838 is the major seed storage protein in rice. Its cDNA sequence was identified by Wen et al., Nucleic Acids Res. 17(22): 9490, 1989, and the accumulation of the protein in rice endosperm occurs between five and seven days after flowering (Udaka et al., J. Nutr. Sci. Vitaminol. (Tokyo) 46(2): 84-90, 2000). One prey clone encoding amino acids 5 to 155 of OsCAA33838 was retrieved from the output trait library.

OsCAA33838 (GENBANK® Accession No. X15833) is a 499-amino acid protein that includes a cleavable signal peptide (amino acids 1 to 24), as determined by analysis of the amino acid sequence. The analysis identified an 11S plant seed storage protein domain (amino acids 1 to 469; 1e-243). The 11S plant seed storage proteins tend to be glycosylated proteins that form hexameric structures. They are composed of two peptides linked by disulfide bonds and are also members of the cupin superfamily of proteins by virtue of their two beta-barrel domains. The analysis also detected this domain but localized it to a narrower region (amino acids 302 to 324). In addition, a 7S seed storage protein, C-terminal domain (amino acids 319 to 478; 6.02e-04), was identified which is also found in members of the cupin superfamily. In agreement with the evidence that OsCAA33838 is a glycosylated protein, an N-glycosylation site (amino acids 491 to 494) was identified.
A BLAST analysis comparing the nucleotide sequence of OsCAA33838 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS000688.1_at (e=0 expectation value) as the closest match. Our gene expression experiments indicate that this gene is not specifically expressed in several different tissue types and is not specifically induced by a broad range of plant stresses, herbicides and applied hormones.

The bait fragment of OsPP2A-2 was also found to interact with novel protein PN26645, a putative protein disulfide isomerase-related protein precursor (also called OsPN26645). The bait fragment used in this search encodes amino acids 1 to 308 of OsPP2A-2, which includes the serine/threonine protein phosphatase signature of OsPP2A-2. One prey clone encoding amino acids 24 to 164 of OsPN26645 was retrieved from the input trait library. OsPN26645 is a 311-amino acid protein that includes a cleavable signal peptide (amino acids 1 to 17) and a predicted transmembrane domain (amino acids 210 to 226), as determined by analysis of the amino acid sequence. A BLAST analysis against the Genpept database revealed that OsPN26645 is similar to an A. thaliana protein (GENBANK® Accession No. BAB09470.1; 32.8% identity; e-28) that is similar to the rat protein disulfide isomerase-related protein precursor (GENBANK® Accession No. gi5668777; 46% identity; 1e-63). As its name indicates, disulfide isomerase catalyzes the formation of disulfide bonds. This enzyme can therefore be important for proper protein folding. In mammals, disulfide isomerase in the lumen of the endoplasmic reticulum creates disulfide bonds in secretory and cell-surface proteins, and microsomes deficient in this enzyme are unable to conduct cotranslational formation of disulfide bonds (Bulleid and Freedman, Nature 335(6191): 649-651, 1988).
Although the activity of this enzyme is not as well characterized in plants, it is likely that it serves in a similar capacity.

A BLAST analysis comparing the nucleotide sequence of OsPN26645 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS002485.1_at (e-105 expectation value) as the closest match. Gene expression experiments indicated that this gene is not specifically expressed in several different tissue types and is not specifically induced by a broad range of plant stresses, herbicides and applied hormones.

The bait fragment of OsPP2A-2 was also found to interact with novel protein PN24162 (OsPN24162), a porin-like, voltage-dependent anion channel protein. The bait fragment used in this search encodes amino acids 150 to 308 of OsPP2A-2. One prey clone encoding amino acids 28 to 164 of OsPN24162 was retrieved from the output trait library. BLAST analysis of the OsPN24162 amino acid sequence indicated that this protein is most similar to a porin-like protein from A. thaliana (GENBANK® Accession No. NP_201551; 53% amino acid sequence identity; 3e-86). OsPN24162 is also similar to a rice mitochondrial voltage-dependent anion channel (GENBANK® Accession No. Y18104; 44% identity; 2e-61), a 274-amino acid protein encoded by a cDNA found to belong to a small multigene family in the rice genome (Roosens et al., Biochim. Biophys. Acta 1463(2): 470-476, 2000). Expression of this gene was found to vary with plantlet maturation and between organs, and not to respond to osmotic stress (Roosens et al., supra). Mitochondrial voltage-dependent ion channels are also called mitochondrial porins by analogy with the proteins forming pores in the outer membrane of Gram-negative bacteria.
A BLAST analysis comparing the nucleotide sequence of OsPN24162 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS007036.1_at (e-65 expectation value) as the closest match. Our gene expression experiments indicate that this gene is not specifically expressed in several different tissue types and is not specifically induced by a broad range of plant stresses, herbicides and applied hormones.

The bait fragment of OsPP2A-2 was also found to interact with a DnaJ-like protein with a putative role in the pathogen-induced defense response. The bait fragment used in this search encodes amino acids 150 to 308 of OsPP2A-2. One prey clone encoding amino acids 99 to 368 of Os011994-D16 was retrieved from the output trait library. This new protein was named 011994-D16 or, because it was identified from O. sativa, Os011994-D16.

BLAST analysis of the Os011994-D16 amino acid sequence indicated that this protein is similar to maize (Zea mays) DnaJ protein homolog ZMDJ1 (GENBANK® Accession No. T01643; 84% identity; e=0). In eukaryotic cells, DnaJ-like proteins regulate the chaperone (protein folding) function of Hsp70 heat-shock proteins through direct interaction of different Hsp70 and DnaJ-like protein pairs (Cyr et al., Trends Biochem. Sci. 19(4): 176-181, 1994). Heat shock proteins (reviewed in Bierkens et al., Toxicology 153(1-3): 61-72, 2000) are stress proteins which function as intracellular chaperones to facilitate protein folding and assembly and which are selectively expressed in plant cells in response to a range of stimuli, including heat and a variety of chemicals. As regulators of heat shock proteins, DnaJ-like proteins are thus part of the plant protective stress response.
A BLAST analysis comparing the nucleotide sequence of Os011994-D16 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS009139.1_at (e=0 expectation value) as the closest match. Gene expression experiments indicated that expression of this gene is repressed by the plant hormone jasmonic acid.

Yeast Two-Hybrid Using O. sativa Chilling-Inducible Protein CAA90866 (OsCAA90866) as Bait

The bait protein, namely O. sativa chilling-inducible protein CAA90866 (OsCAA90866), is a 379-amino acid protein encoded by a complete cDNA sequence related to chilling tolerance in rice. BLAST analysis indicated that OsCAA90866 is similar to the same PP2A regulatory subunit from A. thaliana (GENBANK® Accession No. AAD39930; 35% amino acid sequence identity; e-57 expectation value) that was found similar to OsORF020300-223, an interactor for the bait protein PP2A-2 (see Example III, page). A BLAST analysis comparing the nucleotide sequence of the chilling-inducible protein against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS015052_at (4e-78 expectation value) as the closest match. Gene expression experiments indicated that this gene is induced by cold stress.

As described in Table 12, a bait clone encoding amino acids 100 to 250 of O. sativa Chilling-Inducible Protein CAA90866 (OsCAA90866) was found to interact with a prey clone encoding amino acids 1 to 126 of the same protein retrieved from the output trait library.

In addition, the bait clone encoding amino acids 100 to 250 of O. sativa Chilling-Inducible Protein CAA90866 (OsCAA90866) was found to interact with Os008938-3209. Four prey clones encoding amino acids 53 to 259 of Os008938-3209 were retrieved from the input trait library.
Os008938-3209 is a 260-amino acid protein that includes a 14-3-3 protein signature 1 (amino acids 48 to 60) and a 14-3-3 protein signature 2 (amino acids 220 to 260), which suggests that Os008938-3209 is a member of the 14-3-3 family. BLAST analysis indicated that the amino acid sequence of Os008938-3209 shares 100% identity with that of rice putative 14-3-3 protein (GENBANK® Accession No. AAK38492; 8e-145). The 14-3-3 proteins interact with regulators of cellular signaling, cell cycle regulation, and apoptosis. They are thought to act as molecular scaffolds or chaperones and to regulate the cytoplasmic and nuclear localization of proteins with which they interact by regulating their nuclear import/export (Zilliacus et al., Mol. Endocrinol. 15(4): 501-511, 2001; reviewed by Muslin et al., Cell Signal 12(11-12): 703-709, 2000). Since 14-3-3 proteins participate in protein complexes within the nucleus (Imhof and Wolffe, Biochemistry 38(40): 13085-13093, 1999; Zilliacus et al., supra), cytoplasm (De Lille et al., Plant Physiol. 126(1): 35-38, 2001), mitochondria (De Lille et al., supra) and chloroplast (Sehnke et al., Plant Physiol. 122(1): 235-242, 2000), additional information would be necessary to determine where Os008938-3209 resides within the cell. Cellular localization of this prey protein could lead to a better interpretation of the significance of its interaction with chilling-inducible protein CAA90866.
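The coordinate comparisons made throughout this Example (for instance, inferring from prey clones spanning amino acids 93 to 387 and 118 to 388 that an interaction maps within amino acids 118 to 387, or asking whether a prey clone spans an annotated signature) reduce to simple interval arithmetic. The following is a minimal Python sketch of that bookkeeping. The function names are illustrative assumptions and are not part of the methods described here, and shared coordinates only narrow down a candidate region; they do not by themselves establish which residues mediate binding.

```python
from typing import List, Optional, Tuple

# Inclusive amino-acid coordinates (start, end), as reported in Tables 11 and 12.
Interval = Tuple[int, int]


def overlap(a: Interval, b: Interval) -> Optional[Interval]:
    """Return the inclusive region shared by two intervals, or None if disjoint."""
    start, end = max(a[0], b[0]), min(a[1], b[1])
    return (start, end) if start <= end else None


def common_region(clones: List[Interval]) -> Optional[Interval]:
    """Intersect all prey-clone intervals; None if the clones share no region."""
    region: Optional[Interval] = clones[0]
    for clone in clones[1:]:
        region = overlap(region, clone)
        if region is None:
            return None
    return region


def covers(clone: Interval, feature: Interval) -> bool:
    """True only if the clone encodes the whole annotated feature."""
    return clone[0] <= feature[0] and feature[1] <= clone[1]


if __name__ == "__main__":
    # Two OsORF020300-223 prey clones retrieved with the OsPP2A-2 bait.
    print(common_region([(93, 387), (118, 388)]))  # (118, 387)
    # OsAAK63900 prey clone (122-224) versus its HLH domain (5-20).
    print(covers((122, 224), (5, 20)))             # False
    # Os008938-3209 prey clones (53-259) versus 14-3-3 signature 1 (48-60).
    print(overlap((53, 259), (48, 60)))            # (53, 60)
```

Under these assumptions, the first call reproduces the 118 to 387 region stated for OsORF020300-223, the second agrees with the observation that the OsAAK63900 prey clone does not encode the HLH domain, and the third shows that the Os008938-3209 prey clones contain only part of 14-3-3 signature 1.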
A BLAST analysis comparing the nucleotide sequence of the Os008938-3209 protein against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS008938_s_at (e-61 expectation value) as the closest match. Gene expression experiments indicated that this gene is induced by salicylic acid, ABA, BAP, BL2, and 2,4D, during cold stress, and under drought conditions.

In addition, the bait clone encoding amino acids 100 to 250 of O. sativa Chilling-Inducible Protein CAA90866 (OsCAA90866) was found to interact with OsAAG46136, a pyrrolidone carboxyl peptidase from O. sativa. Two prey clones encoding amino acids 92 to 222 of OsAAG46136 were retrieved from the input trait library. These clones include part of the pyroglutamyl peptidase I motif of OsAAG46136.

OsAAG46136 is a 222-amino acid protein that contains a pyroglutamyl peptidase I motif (amino acids 11 to 221). This motif is found in the N-terminal regions of peptide hormones (including thyrotropin-releasing hormone and luteinizing hormone releasing hormone), and it confers protease resistance to the protein (Odagaki et al., Structure Fold Des. 7(4): 399-411, 1999). BLAST analysis indicated that the amino acid sequence of OsAAG46136 shares 100% identity with that of rice putative pyrrolidone carboxyl peptidase (GENBANK® Accession No. AAG46136; 4e-126). OsAAG46136 is also similar to two unknown proteins from A. thaliana (GENBANK® Accession Nos. NP_176063, 8e-80, and AAK25976.1, e-76), both not described in the literature. The similarity of OsAAG46136 to pyrrolidone carboxyl peptidase gives some suggestion as to the function of this poorly defined rice protein. Pyrrolidone carboxyl peptidase (Pcps) is an enzyme that removes an N-terminal pyroglutamyl group from some proteins.
It is present in many species (reviewed by Awade et al., Proteins 20(1): 34-51, 1994) and is a valuable tool for bacterial diagnosis (most of the literature describing this protein addresses bacterial homologs). The active site of the Pseudomonas fluorescens Pcps has been characterized and the nature of this site (Cys-144 and His-166 are necessary for activity) suggests that it can represent a new class of thiol aminopeptidases (Le Saux et al., J. Bacteriol. 178(11): 3308-3313, 1996). Peptidases in this protein family are necessary for processing and activation of important bioactive peptides including amyloid precursor protein (APP), strongly implicated in Alzheimer's disease (Lefterov et al., FASEB J. 14(12): 1837-1847, 2000). Furthermore, this enzyme deaminates and thus inactivates the glycopeptide anticancer agent bleomycin (Schwartz et al., Proc. Natl. Acad. Sci. USA 96(8): 4680-4685, 1999).

A BLAST analysis comparing the nucleotide sequence of OsAAG46136 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS013894_s_at (e- expectation value) as the closest match. The expectation value is too low for this probeset to be a reliable indicator of the gene expression of OsAAG46136.

The bait clone encoding amino acids 100 to 250 of O. sativa Chilling-Inducible Protein CAA90866 (OsCAA90866) was also found to interact with protein ORF020300-2233.2 (OsORF020300-223), having a putative PP2A regulatory subunit and being similar to OsCAA90866 (see description in Example III). Three prey clones encoding amino acids 1 to 206 and three prey clones encoding amino acids 1 to 190 of OsORF020300-223 were retrieved from the output trait library.

Additionally, the bait clone encoding amino acids 100 to 250 of O. sativa Chilling-Inducible Protein CAA90866 (OsCAA90866) was found to interact with protein PN23045 (OsPN23045).
Two prey clones encoding amino acids 240 to 287 of OsPN23045 were retrieved from the input trait library.

OsPN23045 is a 287-amino acid protein that includes an inositol P domain (amino acids 233 to 272). This domain was identified in bovine inositol polyphosphate 1-phosphatase protein, which is involved in signal transduction (see York et al., Biochemistry 33(45): 13164-13171, 1994). Mikami et al. showed that phosphatidylinositol-4-phosphate 5-kinase (AtPIP5KII) is induced by water stress and abscisic acid (ABA) in A. thaliana, suggesting a link between phosphoinositide signaling cascades and water-stress responses in plants (Mikami et al., Plant J. 15(4): 563-568, 1998). Xiong et al. reported that FRY1, a mutant gene in A. thaliana encoding an inositol polyphosphate 1-phosphatase, is a negative regulator of ABA and stress signaling in this plant (Xiong et al., Genes Dev. 15(15): 1971-1984, 2001), providing evidence that phosphoinositols mediate ABA and stress signal transduction in plants.

A BLAST analysis comparing the nucleotide sequence of OsPN23045 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS006742.1_at (e=0 expectation value) as the closest match. Gene expression experiments indicated that this gene is specifically expressed in leaf and stem.

The bait clone encoding amino acids 100 to 250 of O. sativa Chilling-Inducible Protein CAA90866 (OsCAA90866) was also found to interact with protein PN23225, which is a novel 792-amino acid protein similar to T. aestivum initiation factor (iso)4f p82 subunit (p82) (GENBANK® Accession No. AAA74724; 69.6% amino acid sequence identity; e=0). One prey clone encoding amino acids 639 to 792 of OsPN23225 was retrieved from the input trait library. The wheat protein contains possible motifs for ATP binding, metal binding, and phosphorylation (Allen et al., J. Biol. Chem. 267(32): 23232-23236, 1992). OsPN23225 contains an MIF4G domain (amino acids 207 to 434), named after the middle domain of eukaryotic initiation factor 4G (eIF4G), and an MA3 domain (amino acids 627 to 739) also found in eIF proteins (Ponting, C.P., Trends Biochem. Sci. 25(9): 423-426, 2000). These domains are found in molecules that participate in mRNA decay pathways.
Although the function of the bait chilling-inducible protein CAA90866 is not well defined, it appears to be a nuclear protein and its interaction with the eIF-like protein OsPN23225 supports the notion that CAA90866 participates in the rice transcriptional machinery. The identification of the OsPN23225 prey protein likely represents the discovery of a novel rice eIF.
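As a reading aid, the residue ranges quoted above for the prey clone fragments and the annotated domains can be compared directly. The following minimal Python sketch computes which residues of each annotated domain are covered by the retrieved prey fragment. The `Region` type and the `overlap` helper are illustrative names introduced here and are not part of the disclosure, and all coordinates are assumed to be 1-based and inclusive.

```python
from typing import NamedTuple, Optional, Tuple


class Region(NamedTuple):
    """A named stretch of a protein (assumed 1-based, inclusive coordinates)."""
    name: str
    start: int
    end: int


def overlap(a: Region, b: Region) -> Optional[Tuple[int, int]]:
    """Return the residue range shared by two regions, or None if they are disjoint."""
    lo = max(a.start, b.start)
    hi = min(a.end, b.end)
    return (lo, hi) if lo <= hi else None


# Residue ranges as quoted in the text for OsPN23045.
prey_23045 = Region("OsPN23045 prey clone", 240, 287)
inositol_p = Region("inositol P domain", 233, 272)

# Residue ranges as quoted in the text for OsPN23225.
prey_23225 = Region("OsPN23225 prey clone", 639, 792)
mif4g = Region("MIF4G domain", 207, 434)
ma3 = Region("MA3 domain", 627, 739)

if __name__ == "__main__":
    for prey, domain in [(prey_23045, inositol_p),
                         (prey_23225, mif4g),
                         (prey_23225, ma3)]:
        print(prey.name, "vs", domain.name, "->", overlap(prey, domain))
```

Under these assumptions, the OsPN23045 prey fragment covers residues 240 to 272 of the inositol P domain, and the OsPN23225 prey fragment covers residues 639 to 739 of the MA3 domain but none of the MIF4G domain.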
A BLAST analysis comparing the nucleotide sequence of OsPN23225 against TMRI's GENECHIP® Rice Genome Array sequence database identified probeset OS003249_at (e-17 expectation value) as the closest match. The significance of this match is too low for the probeset to be a reliable indicator of the gene expression of OsPN23225.
The bait clone encoding amino acids 100 to 250 of O. sativa Chilling Inducible Protein CAA90866 (OsCAA90866) was also found to interact with OsPN29883, a 340-amino acid fragment that is similar to A. thaliana putative 2-dehydro-3-deoxyphosphooctonate aldolase (GENBANK® Accession No. NP_178068; 3e-142 expectation value) and pea (Pisum sativum) 2-dehydro-3-deoxyphosphooctonate aldolase (Kdo8P synthase) (GENBANK® Accession No. O50044; 3e-142 expectation value). One prey clone encoding amino acids 58 to 175 of OsPN29883 was retrieved from the output trait library. Kdo8P synthase in pea catalyzes the biosynthesis of Kdo-8-P, a component of lipopolysaccharide of plant cell walls, with high structural and functional similarities to enterobacterial Kdo8P synthase (Brabetz et al., Planta 212(1): 136-143, 2000).
Summary
The interactors identified for the OsPP2A-2 bait protein (i.e., proteins that bind to OsPP2A-2) comprise a network that is speculated to be associated with the plant defense response to pathogens. Among the five novel rice proteins identified as interactors for OsPP2A-2, Os23268 is similar to the A. thaliana tryptophan biosynthetic enzyme anthranilate phosphoribosyltransferase. This enzyme is encoded by a gene that is similar to the DESCA11 gene involved in resistance to virus infection (Cooper, B., Plant J. 26(3): 339-49, 2001). While the role of tryptophan in disease resistance is unknown, tryptophan is used in the biosynthesis of indole-3-acetic acid, a plant hormone and signaling molecule. Tryptophan can thus have a role in modulation of gene expression in plants.
Moreover, the glycosyl transferase function in Os23268 can be associated with disease resistance signaling pathways or with phytoalexin cellular distribution.
Phytoalexins are low-molecular-weight antimicrobial compounds that accumulate in plants as a result of infection or stress, and the rapidity of their accumulation is associated with resistance in plants to diseases caused by fungi and bacteria. Taken altogether, these data suggest that anthranilate phosphoribosyltransferases play a role in the plant response to pathogen infection. Moreover, gene expression experiments confirmed that this gene is induced by the fungal pathogen M. grisea. Thus, the anthranilate phosphoribosyltransferase-like novel protein Os23268 is believed to be involved in the signaling and regulation pathways that mediate the response of rice to biotic stress.
Novel protein Os011994-D16, similar to DnaJ protein, is another interactor for OsPP2A-2 with a likely role in the pathogen-induced defense response. DnaJ-like proteins are known to be regulators of heat shock proteins and are thus part of the plant protective stress response. Gene expression experiments support this notion, indicating that the gene encoding the DnaJ-like protein of this Example is repressed by jasmonic acid, a component of signaling networks that provide the specificity of plant pathogen-induced defense responses (reviewed in Nurnberger and Scheel, Trends Plant Sci. 6(8): 372-379, 2001).
OsPP2A-2 was also found to interact with the novel protein OsORF020300-2233.2, which is similar to A. thaliana PP2A regulatory subunit and to rice chilling-inducible protein CAA90866 (OsCAA90866) (the second bait protein of this Example). The similarity of OsORF020300-223 to PP2A regulatory subunit validates its interaction with the PP2A-2 catalytic subunit, this interaction being consistent with the subunit composition of PP2A enzymes (Awotunde et al., Biochim Biophys Acta 1480(1-2): 65-76, 2000).
The OsORF020300-223-OsPP2A-2 interaction suggests that OsORF020300-223 participates in signaling events that involve OsPP2A-2 enzymatic activity, and the similarity of OsORF020300-223 to rice chilling-inducible protein OsCAA90866 suggests that cold tolerance can involve one of these signaling events.
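The bait-prey relationships reported in this Example for the two baits, OsPP2A-2 and OsCAA90866, can be collected into a simple lookup structure in order to see where the two networks meet. The sketch below is an illustrative Python representation only: the dictionary layout and the function names `shared_interactors` and `self_interacting` are assumptions introduced here and are not part of the disclosure. The protein identifiers are those used in this Example.

```python
from typing import Dict, List, Set

# Bait -> set of interacting proteins, as reported in this Example.
# The dictionary layout is a hypothetical convenience, not a disclosed data format.
INTERACTIONS: Dict[str, Set[str]] = {
    "OsPP2A-2": {
        "Os23268",          # anthranilate phosphoribosyltransferase-like
        "Os011994-D16",     # DnaJ-like
        "OsORF020300-223",  # PP2A regulatory subunit-like
        "OsAAK63900",       # putative proline-rich protein
        "OsPN26645",        # putative protein disulfide isomerase
        "OsPN24162",        # voltage-dependent ion channel protein
        "OsCAA33838",       # glutelin
    },
    "OsCAA90866": {
        "OsCAA90866",       # self-interaction
        "Os008938-3209",    # putative 14-3-3 protein
        "OsAAG46136",       # pyrrolidone carboxyl peptidase-like
        "OsPN23045",        # putative inositol phosphatase
        "OsPN23225",        # eIF-like
        "OsORF020300-223",  # PP2A regulatory subunit-like
        "OsPN29883",        # aldolase-like
    },
}


def shared_interactors(net: Dict[str, Set[str]], bait_a: str, bait_b: str) -> Set[str]:
    """Proteins reported as interactors of both baits."""
    return net[bait_a] & net[bait_b]


def self_interacting(net: Dict[str, Set[str]]) -> List[str]:
    """Baits that appear among their own interactors."""
    return sorted(bait for bait, preys in net.items() if bait in preys)


if __name__ == "__main__":
    print(shared_interactors(INTERACTIONS, "OsPP2A-2", "OsCAA90866"))
    print(self_interacting(INTERACTIONS))
```

In this representation the only protein common to both interactor sets is OsORF020300-223, and OsCAA90866 is the only bait listed among its own interactors.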
OsPP2A-2 was also found to interact with rice putative proline-rich protein OsAAK63900. Though it has no known DNA-binding motif, there are indications that OsAAK63900 can play a role as a transcriptional regulator. It has an HLH domain common to transcription factors, although this domain mediates protein dimerization only. It also has a gntR family signature common to bacterial DNA-binding transcriptional regulators, although the function of this domain is not known. The existence of the Ole e I signature suggests that OsPP2A-2 can dephosphorylate OsAAK63900, thus regulating its function as a pollen protein, although the lack of data on the Ole e I signature function makes this possibility more difficult to argue. Evidence also exists that PP2A proteins regulate the DNA-binding activity of transcription factors in plants (Vazquez-Tello et al., Mol. Gen. Genet. 257(2): 157-166, 1998) and mammalian cells (Wadzinski et al., Mol. Cell Biol. 13(5): 2822-2834, 1993). Therefore, it is most likely that the OsPP2A-2-OsAAK63900 interaction occurs in the nucleus and that it plays a role in regulating transcriptional events in rice.
Other proteins found to interact with OsPP2A-2 include a disulfide isomerase with a putative role in protein folding (novel protein OsPN26645), a voltage-dependent ion channel protein (novel protein OsPN24162) and the seed storage protein glutelin (OsCAA33838). The biological significance of these interactions is unclear. Analysis of the amino acid sequence of glutelin identified several protein kinase C and casein kinase II phosphorylation sites. It is possible that the phosphorylation state of glutelin determines its function or stability, and its interaction with OsPP2A-2 can occur during dephosphorylation of glutelin. Alternatively, this interaction can result in localization of OsPP2A-2 and thereby affect events downstream of OsPP2A-2-dependent dephosphorylation.
Given the presence of a disulfide bond between the two peptide chains of typical plant seed storage proteins, it is interesting that OsPP2A-2 also interacts with a putative protein disulfide isomerase (OsPN26645). Perhaps OsPP2A-2 interacts with other enzymes to create a co-translational modification complex. Additional yeast two-hybrid data can clarify the purpose of these interactions. However, given the association of PP2A with other proteins involved in biotic stress responses, the aforementioned associations could also be involved in biotic stress responses.
The chilling-inducible protein CAA90866 was found to interact with itself and with six proteins. These proteins are speculated to interact as components of a network of proteins relevant to the rice response to cold stress. This hypothesis finds support in gene expression experiments, which confirmed that the gene encoding the chilling-inducible protein is induced by cold. One of the interactors is the putative 14-3-3 protein Os008938-3209. The relationship to chilling tolerance of the bait protein OsCAA90866 suggests that its interaction with Os008938-3209 can be associated with cold tolerance. Gene expression experiments showed that this protein is induced under a broad range of stress conditions. Its activation probably allows its interaction with a number of stress proteins. Given the function of 14-3-3 proteins as molecular chaperones, Os008938-3209 can act as a molecular glue for these interactions to preserve protein complex stability in membranes, or it can coordinate interactions involving transcription factors associated with stress genes. Subcellular localization of Os008938-3209 can further clarify the significance of its interaction with OsCAA90866.
Another interactor for OsCAA90866 is a pyrrolidone carboxyl peptidase-like protein (OsAAG46136).
The putative pyrrolidone carboxyl peptidase function of OsAAG46136 suggests that it participates in processing and/or activation of substrate proteins, and these proteins can be important to the plant response to chilling. Peptidase activity has been associated with regulation of signaling. Carboxypeptidases, for instance, hydrolytically remove the pyroglutamyl group from peptide hormones, thereby activating these signaling molecules. A carboxypeptidase regulates Brassinosteroid-insensitive 1 (BRI1) signaling in A. thaliana by proteolytic processing of a protein (Li et al., Proc. Natl. Acad. Sci. USA 98(10): 5916-5921, 2001). Based on its ability to interact with chilling-inducible protein and on the role of the latter in chilling tolerance, it is speculated that the carboxypeptidase-like protein OsAAG46136 can have a role in activating signaling molecules/hormonal peptides that are involved in the plant response to cold stress.
The interactions of OsCAA90866 with OsPN23045, a protein with a putative inositol phosphate function, and with OsPN23225, a rice homolog of wheat initiation factor (iso)4f p82 subunit, provide further insight into the function of the bait protein. Phosphoinositols are known to mediate ABA and stress signal transduction in plants (Mikami et al., Plant J. 15(4): 563-568, 1998; Xiong et al., Genes Dev. 15(15): 1971-1984, 2001). The putative inositol phosphatase protein OsPN23045 can function in a similar way and its interaction with the chilling-inducible protein can be associated with regulation of cell signaling events that relate to cold tolerance. The prey protein OsPN23225 likely represents a novel rice eIF. The eIF proteins have a role in RNA processing pathways (Ponting C.P., Trends Biochem. Sci. 25(9): 423-426, 2000) and stress is typically associated with an abundance of RNA transcripts.
Based on this information and on the relationship that CAA90866 has to chilling tolerance, the OsCAA90866-OsPN23225 interaction is speculated to control translational events related to cold stress.
Finally, OsCAA90866 interacts with and is similar to the same putative PP2A regulatory subunit protein OsORF020300-223 found to interact with the bait protein OsPP2A-2. This interaction provides a link between the two networks of this Example and suggests the involvement of OsPP2A-2 in both biotic and abiotic stress response pathways (see diagram in Appendix 1). Based on the observed interactions and on sequence similarities among the proteins involved in these interactions, OsPP2A-2 appears to regulate both biotic and abiotic stress response pathways. Thus, the two pathways, though independent, are speculated to be linked through protein phosphatases, and these enzymes likely mediate the plant's stress response by dephosphorylation of the proteins participating in these pathways. In this scenario, it is possible that the self-interaction observed for OsCAA90866 participates in the creation of multicomponent phosphatase complexes. Furthermore, the interaction of OsCAA90866 with the aldolase-like protein OsPN29883 suggests that the aldolase needs to be dephosphorylated for activation/inactivation, and that this novel protein can have roles during stress responses based upon the other interactions and the gene expression patterns of the chilling-inducible protein.
Moreover, the A. thaliana regulatory A subunit of protein phosphatase 2A (PP2A-A), to which OsORF020300-223 is similar, has been implicated in the regulation of auxin transport in A. thaliana (Garbers et al., EMBO J. 15(9): 2115-2124, 1996). The phytohormone auxin controls processes such as cell elongation, root hair development and root branching.
Since OsORF020300-223 is also similar to and interacts with chilling-inducible protein CAA90866, it is possible that the latter can be involved in auxin transport.
References
The references listed below as well as all references cited in the specification are incorporated herein by reference to the extent that they supplement, explain, provide a background for or teach methodology, techniques and/or compositions employed herein.
Aasland et al. (1995) Trends Biochem. Sci. 20: 56-59.
Abdel-Ghany et al. (2000) DNA Cell Biol. 19: 567-578.
Abler et al. (1993) Plant Mol. Biol. 22: 1031-1038.
Ach & Gruissem (1997) Proc. Natl. Acad. Sci. USA 91: 5863-5867.
Ackland & Peters (1999) Drug Resist. Update 2: 205-214.
Agrawal S (ed.) (1993) Methods in Molecular Biology, volume 20, Humana Press, Totowa, New Jersey, United States of America.
Agueli et al. (2001) Biochem. J. 360: 413-419.
Aihara et al. (2000) J. Neurochem. 74: 2622-2625.
Ajuh et al. (2001) J. Biol. Chem. 276: 42370-42381.
Allen et al. (1992) J. Biol. Chem. 267: 23232-23236.
Allison et al. (1986) Virology 154: 9-20.
Altschul et al. (1990) J. Mol. Biol. 215: 403.
Altschul et al. (1997) Nucl. Acids Res. 25: 3389.
An et al. (1985) EMBO J. 4: 277.
Antoku et al. (2001) Biochem. Biophys. Res. Commun. 286: 1003-1010.
Aoyama & Chua (1997) Plant J 11: 605-612.
Apte et al. (1995) FEBS Lett. 363: 304-306.
Arino et al. (1993) Plant Mol. Biol. 21: 475-485.
Arioli et al. (1998) Science 279: 717-720.
Auch & Reth (1990) Nucl. Acids Res. 18: 6743.
Austin et al. (2002) Science 295: 2077-2080.
Ausubel et al. (1988) Current Protocols in Molecular Biology, John Wiley & Sons, New York, New York, United States of America.
Awade et al. (1994) Proteins 20: 34-51.
Awotunde et al. (2000) Biochim Biophys Acta 1480: 65-76.
Azevedo et al. (2002) Science 295: 2073-2076.
Aznar & Lacal (2001) Prog. Nucleic Acid Res. Mol. Biol. 67: 193-234.
Baluska et al. (2001) Plant Physiol. 126: 39-46.
Bartel & Fields (eds.) (1997) The Yeast Two-Hybrid System, Oxford Press.
Bartlett et al. (1982) in Methods in Chloroplast Molecular Biology (Edelman et al., eds.) Elsevier Biomedical Press, New York, New York, United States of America, pp. 1081-1091.
Baskin et al. (1992) Aust. J. Plant Physiol. 19: 427-437.
Batzer et al. (1991) Nucleic Acid Res. 19: 5081.
Baunsgaard et al. (1998) Plant J. 13: 661.
Beerli et al. (1998) Proc Natl Acad Sci U S A 95: 14628-14633.
Bergeron et al. (1994) Trends Biochem. Sci. 19: 124-128.
Bertolaet et al. (2001) Nat. Struct. Biol. 8: 417-422.
Bevan (1984) Nucl. Acids Res. 12: 8711.
Bevan et al. (1983) Nature 304: 184-187.
Bezzi et al. (2001) Nat. Neurosci. 4: 702-710.
Biedenkapp et al. (1988) Nature 335: 835-837.
Bierkens et al. (2000) Toxicology 153: 61-72.
Bihn et al. (1997) Plant J. 12: 1439-1445.
Binet et al. (1991) Plant Mol Biol 17: 395-407.
Bischoff et al. (1994) Proc. Natl. Acad. Sci. USA 91: 2587-2591.
Bisikirska et al. (1997) Z. Naturforsch 52: 180-186.
Blochinger & Diggelmann (1984) Mol Cell Biol 4: 2929-2931.
Bolhuis et al. (1998) J. Biol. Chem. 273: 21217-21224.
Borisjuk et al. (1998) Planta 206: 504-14.
Bornemann et al. (1996) Biochemistry 35: 9907-9916.
Bourouis & Jarry (1983) EMBO J 2: 1099-1104.
Brabetz et al. (2000) Planta 212: 136-143.
Bradwell et al. (1994) Genes Dev. 8: 1664-1677.
Braun (2001) Plant Physiol. 125: 1611-1619.
Briknarova et al. (2001) Nat. Struct. Biol. 8: 349-352.
Browns & Bicknell (1998) Biochem. J. 334: 1-8.
Bruggemann et al. (1996) Plant J 10: 755-760.
Brummelkamp et al. (2002) Science 296: 550-3.
Brunner et al. (1998) Plant J. 14: 225-34.
Buchanan et al. (eds.) (2002) Biochemistry and Molecular Biology of Plants, John Wiley & Sons, New York, New York, United States of America.
Buck & Guest (1989) Biochem. J. 260: 737-747.
Bulledi & Freedman (1988) Nature 335: 649-651.
Burglin et al. (1997) Nucl. Acids Res. 25: 4173-4180.
Bursch (2001) Cell Death Differ. 8: 569-81.
Busch et al. (1990) Trends Genet. 6: 36-40.
Byrne et al. (1987) Plant Cell Tissue Org Culture 8: 3.
Caddick et al. (1998) Nat Biotechnol 16: 177-180.
Callebaut & Mornon (1997) FEBS Lett. 400: 25-30.
Callis et al. (1987) Genes Dev. 1: 1183.
Callis et al. (1990) J Biol Chem 265: 12486-12493.
Cao et al. (2000) Arch Biochem Biophys. 373: 135-46.
Casas et al. (1993) Proc Natl Acad Sci U S A 90: 11212-6.
Cash (1996) Med Hypotheses 47: 455-459.
Chan et al. (1998) Biochim. Biophys. Acta 1442: 1-19.
Chee et al. (1989) Plant Physiol. 91: 1212.
Chen et al. (1993) Science 262: 1883-1886.
Chen et al. (2002) Plant Cell 14: 559-574.
Chibbar et al. (1993) Plant Cell Rep 12: 506-509.
Choi et al. (1995) Mol. Gen. Genet. 246: 266.
Christensen et al. (1989) Plant Mol Biol 12: 619-632.
Christou et al. (1988) Plant Physiol. 87: 671.
Christou et al. (1989) Proc. Natl. Acad. Sci. USA 86: 7500.
Christou et al. (1991) Bio/Technology 9: 957-962.
Chuang & Meyerowitz (2000) Proc Natl Acad Sci U S A 97: 4985-90.
Chung et al. (1994) Plant Mol. Biol. 26: 657-665.
Claes et al. (1990) Plant Cell 2: 19-27.
Close et al. (1989) Plant Mol. Biol. 13: 95-108.
Comai et al. (1988) J Biol Chem 263: 15104-15109.
Conceicao et al. (1994) Plant 5: 493.
Corpet et al. (1988) Nucl. Acids Res. 16: 10881.
Creighton (1984) Proteins, WH Freeman & Co., New York, New York, United States of America.
Crossway et al. (1986) Bio/Techniques 4: 320.
Cruz et al. (1998) P. R. Health Sci. J. 17: 323-326.
Cyr et al. (1994) Trends Biochem. Sci. 19: 176-181.
Dasgupta et al. (1993) Gene 133: 301.
Dasso (2000) Cell 104: 321-324.
Dat et al. (2001) Redox Rep. 6: 37-42.
Datta et al. (1990) Bio/Technology 8: 736.
Davies et al. (1996) EMBO J. 15: 4330-4343.
De Block et al. (1989) Plant Physiol. 91: 694.
de Framond (1991) FEBS Lett 290: 103-6.
Della-Cioppa et al. (1987) Plant Physiol. 84: 965.
De Lille et al. (2001) Plant Physiol. 126: 35-38.
Denecke et al. (1995) Plant Cell 7: 391-406.
Doelling et al. (2001) Plant J. 27: 393-405.
Dong et al. (1996) Mol Breeding 2: 267-276.
Doonan et al. (1997) Curr. Opin. Cell. Biol. 9: 824-830.
Dunigan & Madlener (1995) Virology 207: 460-466.
Dure et al. (1989) Plant Mol. Biol. 12: 475-486.
Dwyer et al. (1996) Biochim Biophys Acta 1289: 231-237.
Ecker (1995) Science 268: 667-675.
Ecker & Davis (1986) Proc Natl Acad Sci USA 83: 5372-5376.
Edwards et al. (1988) J. Mol. Biol. 203: 523-524.
Elbashir et al. (2001) EMBO J 20: 6877-88.
Elge et al. (2001) Plant J. 26: 561-571.
Ellenberger (1994) Curr. Opin. Struct. Biol. 4: 12-21.
Ellis et al. (1987) EMBO J. 6: 3203.
Elroy-Stein et al. (1989) Proc. Natl. Acad. Sci. USA 86: 6126.
EP 0 120 516
EP 0 138 341
EP 0 292 435
EP 0 295 959
EP 0 301 749
EP 0 332 104
EP 0 332 581
EP 0 342 926
EP 0 392 225
EP 1 116 793
Everett et al. (1987) Bio/Technology 5: 1201.
Fardel et al. (2001) Toxicology 167: 37-46.
Feys et al. (2001) EMBO J. 20: 5400-5411.
Fire et al. (1998) Nature 391: 806-811.
Firek et al. (1993) Plant Mol Biol 22: 129-142.
Frary et al. (2000) Science 289: 85-88.
Freier et al. (1986) Proc Natl Acad Sci USA 83: 9373-9377.
Fromental-Ramain et al. (1996) Development 122: 461-472.
Fromm et al. (1986) Nature 319: 791.
Fromm et al. (1990) Bio/Technology 8: 833.
Fromme et al. (2001) Biochim. Biophys Acta 1507: 5-31.
Fujiyama et al. (2001) J. Comp. Neurol. 435: 379-387.
Gaedeke et al. (2001) EMBO J. 20: 1875-1887.
Gallie et al. (1987) Nucl. Acids Res. 15: 3257.
Gallie et al. (1989) Plant Cell 1: 301.
Gallois et al. (1997) Plant J. 11: 1325-1331.
Garbers et al. (1996) EMBO J. 15: 2115-2124.
Garcia et al. (1998) Planta 207: 172-80.
Garcia et al. (2000) Microbes Infect. 2: 401-407.
Garrel & Campuzano (1991) BioEssays 13: 493-498.
Gehring (1992) Trends Biochem. Sci. 17: 277-280.
Gehring & Hiromi (1986) Ann. Rev. Genet. 20: 147-173.
Gillet et al. (1998) Plant J. 16: 257-262.
Glazebrook et al. (1996) Genetics 143: 973-982.
Godwin et al. (1998) Proc. Natl. Acad. Sci. USA 95: 13042-13047.
Goeddel (1990) Methods in Enzymology, Volume 185, Academic Press, San Diego, California, United States of America.
Goff (2001) Plant J. 26: 339-349.
Goff et al. (2002) Science 296: 92-100.
Gocal et al. (2001) Plant Physiol. 125: 1788-1801.
Gordon Kamm et al. (1990) Plant Cell 2: 603.
Greco et al. (1997) Mol. Gen. Genet. 253: 615-623.
Green (2000) Trends Biochem Sci 25: 59-63.
Green et al. (1986) Ann Rev Biochem 55: 569-597.
Gritz et al. (1983) Gene 25: 179.
Grotewold et al. (1991) Proc. Natl. Acad. Sci. USA 88: 4587-4591.
Groves et al. (1999) Curr. Opin. Struct. Biol. 9: 383-389.
Gruber et al. (1993) Vectors for Plant Transformation, in Methods in Plant Molecular Biology, Glich et al. eds, pp. 89-119, CRC Press.
Guan & Scandalios (1996) J. Mol. Evol. 42: 570-579.
Guo et al. (2000) EMBO J. 19: 6891-6899.
Gyuris et al. (1993) Cell 75: 791-803.
Hake et al. (1995) Philos Trans. R Soc. Lond. B Biol. Sci. 350: 45-51.
Hammerschmidt (1999) Ann. Rev. Phytopathol. 37: 285-306.
Hannig et al. (1995) Bioessays 17: 915-919.
Haraven et al. (1996) Cell 84: 735-44.
Harlow & Lane (1988) Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, United States of America.
Harper et al. (1991) Science 252: 951-954.
Harris et al. (1999) Plant Physiol. 121: 609-617.
Hartwig (1995) Protein Profile 2: 703-800.
Hassan et al. (1995) Biochem. Biophys. Res. Commun. 211: 54-49.
Hatzfeld (1999) Int. Rev. Cytol. 186: 179-224.
Hayashi et al. (2001) J. Biol. Chem. 276: 43400-43406.
Haydon & Guest (1991) FEMS Microbiol. Lett. 79: 291-296.
Hedden & Kamiya (1977) Annual Rev. Plant Physiol. Plant Mol. Biol. 48: 431.
Henikoff & Henikoff (1989) Proc. Natl. Acad. Sci. USA 89: 10915.
Hiei et al. (1994) Plant J. 6: 271.
Hiei et al. (1997) Plant Mol Biol 35: 205-18.
Higgins et al. (1988) Gene 73: 237.
Higgins et al. (1989) CABIOS 5: 151.
Higo & Higo (1996) Plant Mol. Biol. 30: 505-521.
Hinchee et al. (1988) Bio/Technology 6: 915.
Hoekema (1985) in The Binary Plant Vector System, Offset-drukkerij Kanters B.V.
Holk et al. (1996) Plant Mol. Biol. 31: 1153-1161.
Hong et al. (2001) Plant Cell 13: 755-768.
Huang et al. (1992) CABIOS 8: 155.
Huang et al. (1996) Plant Mol. Biol. 33: 125.
Huber et al. (1997) Cell 90: 871-882.
Hudspeth & Grula (1989) Plant Molec Biol 12: 579-589.
Hurst (1995) Protein Prof. 2: 105-168.
Imhof & Wolffe (1999) Biochemistry 38: 13085-13093.
Ishikawa et al. (2001) Plant J. 27: 89-99.
Janssens & Goris (2001) Biochem J. 353: 417-439.
Jasencakova et al. (2001) Chromosoma 110: 83-92.
Jasinski et al. (2001) Plant Cell 13: 1095-107.
Jia et al. (2000) Plant Sci. 155: 115-122.
Jin & Martin (1999) Plant Mol. Biol. 4: 577-585.
Jobling & Gehrke (1987) Nature 325: 622.
John et al. (2001) Protoplasma 216: 119-142.
Johnson et al. (1997) J. Biol. Chem. 272: 7106-7113.
Jorgensen et al. (1996) Plant Mol Biol 31: 957-973.
Josefsson et al. (1987) J. Biol. Chem. 262: 12196.
Kagedal et al. (2001) Biochem J. 359: 335-343.
Kaitna et al. (2000) Curr. Biol. 10: 1172-1181.
Kang et al. (1995) Plant Mol. Biol. 29: 1-10.
Kang et al. (1997) Mol. Cells 7: 45-51.
Kao et al. (2000) Biochem. Biophys. Res. Commun. 267: 201-207.
Karlin & Altschul (1990) Proc. Natl. Acad. Sci. USA 87: 2264.
Karlin & Altschul (1993) Proc. Natl. Acad. Sci. USA 90: 5873.
Kato & Dang (1992) FASEB J. 6: 3065-3072.
Kawalleck et al. (1992) Proc. Natl. Acad. Sci. USA 89: 4713-7.
Keegan et al. (1986) Science 231: 699-704.
Kehres et al. (1998) Microb. Comp. Genomics 3: 151-169.
Kelly (1999) Altern. Med. Rev. 4: 249-265.
Kempin et al. (1997) Nature 389: 802-803.
Kerstetter et al. (1997) Development 124: 3045-3054.
Kidou et al. (1993) FEBS Lett. 332: 282-286.
Kim et al. (2001) Plant Physiol. 127: 1243-1255.
Kitade et al. (1999) Nucl. Acids Symp. Ser. 42: 25-26.
Klein et al. (1988) Bio/Technology 6: 559.
Kline et al. (1987) Nature 327: 70.
Knauf et al. (1983) Analysis of Host Range Expression by Agrobacterium, in Molecular Genetics of the Bacteria-Plant Interaction, Puhler (ed.), Springer-Verlag.
Kobe et al. (1994) Trends Biochem. Sci. 19: 415-421.
Kong & Steinbiss (1998) Arch Virol 143: 1791-1799.
Koonin (1993) J. Mol. Biol. 229: 1165-1174.
Korfhage et al. (1994) Plant Cell 6: 695-708.
Kossman & Lloyd (2000) Crit. Rev. Biochem. Mol. Biol. 35: 141-196.
Kostyal et al. (1998) Clin. Exp. Immunol. 112: 355-362.
Kosugi & Ohashi (2002) Plant Physiol. 128: 833-843.
Koziel et al. (1993) Biotechnology 11: 194.
Krause et al. (1990) Cell 63: 907-919.
Kruse et al. (1995) Planta 196: 796-803.
Kyozuka et al. (2000) Plant Cell Physiol. 41: 710-718.
Kyte & Doolittle (1982) J Mol Biol 157: 105-132.
Lacombe et al. (2001) Science 292: 1486-1487.
Lamb & Dixon (1997) Ann. Rev. Plant Biol. 48: 251.
Landolt (1986) Biosystematic Investigation on the Family of Duckweeds: The family of Lemnaceae - A Monograph Study, Geobotanischen Institut ETH, Stiftung Rubel, Zurich.
Landschulz et al. (1988) Science 240: 1759-1764.
Langenkamper et al. (2001) J. Exp. Bot. 52: 1545-1554.
Laursen et al. (1994) Plant Mol. Biol. 24: 51.
Lebel et al. (1998) Plant J 16: 223-233.
Lechner et al. (1996) Biochim Biophys Acta 1296: 181-188.
Lee et al. (1994) Plant Mol. Biol. 26: 1981.
Lefterov et al. (2000) FASEB J. 14: 1837-1847.
Lenny et al. (1997) Mol. Biol. Rep. 24: 157-168.
Le Saux et al. (1996) J. Bacteriol. 178: 3308-3313.
Leslie et al. (2001) Toxicology 167: 3-23.
Levine et al. (1994) Cell 79: 583-593.
Li & Komatsu (2000) Eur. J. Biochem. 267: 737-745.
Li & Nam (2002) Science 295: 1299-1301.
Li et al. (2001) Proc. Natl. Acad. Sci. USA 98: 5916-5921.
Lim et al. (1999) Plant Physiol. 120: 1193-1204.
Lim et al. (2000) Plant Mol. Biol. 44: 513-527.
Lincoln et al. (1994) Plant Cell 6: 1859-1876.
Lindholm et al. (2000) Mech. Dev. 93: 169-173.
Lindsey et al. (1993) Transgen. Res. 2: 3347.
Liu et al. (1992) Antiviral Res. 19: 247-265.
Liu et al. (1999) Genome Res 9: 859-867.
Logemann et al. (1989) Plant Cell 1: 151-158.
Lommel et al. (1991) Virology 181: 382.
Lopez-Dee et al. (1999) Dev. Genet. 25: 237-244.
Lorz et al. (1985) Mol. Gen. Genet. 199: 178.
Ma et al. (1988) Nature 334: 631.
Macejak & Sarnow (1991) Nature 353: 90-94.
Magyar et al. (2000) FEBS Lett. 486: 79-87.
Maki et al. (1993) Methods in Plant Molecular Biology, Glich et al. eds, pp. 67-88, CRC Press.
Malz & Sauter (1999) Plant Mol. Biol. 40: 985-995.
Manjunath et al. (1997) Plant Mol. Biol. 33: 97.
Mann & Affolter (1998) Curr. Opin. Genet. Dev. 8: 423-429.
Mannervick (1999) Bioessays 21: 267-270.
Martinez et al. (1989) J. Mol. Biol. 208: 551.
Martinez et al. (2000) Plant Physiol. 122: 757-766.
Massey (2000) Biochem. Soc. Trans. 28: 283-296.
Mayer et al. (2001) Phytochemistry 58: 33-41.
Mayo (1987) The Theory of Plant Breeding, Second Edition, Clarendon Press, New York, New York, United States of America.
McBride et al. (1994) Proc. Natl. Acad. Sci. USA 91: 7301.
McBride et al. (1990) Plant Mol. Biol. 14: 266.
McCabe et al. (1988) Bio/Technology 6: 923.
McElroy et al. (1990) Plant Cell 2: 163-71.
McEwen et al. (2001) Mol. Biol. Cell. 12: 2776-89.
McLeod (1986) Bioessays 6: 208-212.
Medina et al. (2001) Plant Physiol. 125: 1655.
Merkle et al. (1994) Plant J. 6: 555-565.
Messing & Vierra (1982) Gene 19: 259.
Miao & Lam (1995) Plant J 7: 359-365.
Michalak et al. (1992) Biochem. J. 285: 681-692.
Michalak et al. (1998) Biochem. Cell. Biol. 76: 779-785.
Michelis et al. (2000) Plant Mol. Biol. 44: 487-498.
Mikami et al. (1998) Plant J. 15: 563-568.
Moon et al. (1999) Plant Physiol. 120: 1193-1204.
Moons et al. (1998) Plant J. 15: 89-98.
Morita et al. (2001) Curr. Pharm. Biotechnol. 2: 257-267.
Muehlbauer et al. (1999) Plant Physiol. 119: 651-62.
Mukumoto et al. (1993) Plant Mol Biol 23: 995-1003.
Muller et al. (1995) Nature 374: 727-730.
Mundy & Chua (1988) EMBO J. 7: 2279-2286.
Munro & Pelham (1987) Cell 48: 899-907.
Munster et al. (2001) Gene 262: 1-130.
Murre et al. (1989) Cell 56: 777-783.
Muslin & Xing (2000) Cell Signal 12: 703-709.
Myers & Miller (1988) CABIOS 4: 11.
Nagamara-Inoue et al. (2001) Int. Rev. Immunol. 20: 83-105.
Nakamura et al. (1996) Plant Mol. Biol. 30: 381-385.
Needleman & Wunsch (1970) J. Mol. Biol. 48: 443.
Negrotto et al. (2000) Plant Cell Reports 19: 798-803.
Nelson et al. (1997) Plant Physiol. 114: 29-37.
Nepveu (2001) Gene 270: 1-15.
Ng & Yanofsky (2001) Nat. Rev. Genet. 2: 186-195.
Nielsen et al. (1986) J. Biol. Chem. 261: 3661-3669.
Norris et al. (1993) Plant Mol Biol 21: 895-906.
Nurnberger & Scheel (2001) Trends Plant Sci. 6: 372-379.
O'Connell et al. (2001) J. Biol. Chem. 276: 43065-43073.
Odagaki et al. (1999) Structure Fold Des. 7: 399-411.
Ohtsuka et al. (1985) J. Biol. Chem. 260: 2605-2608.
Oliveira et al. (2001) Braz. J. Med. Biol. Res. 34: 567-575.
Ono et al. (1996) Plant Physiol. 112: 483-491.
Ono et al. (2001) Proc. Natl. Acad. Sci. USA 98: 759-764.
Oppenheimer et al. (1991) Cell 67: 483-493.
Orzaez & Granell (1997) FEBS Lett. 404: 275-278.
O'Shea et al. (1989) Science 243: 538-542.
Pacciotti et al. (1985) Bio/Technology 3: 241.
Panavas et al. (1999) Plant Mol. Biol. 40: 237-248.
Park et al. (1985) J. Plant Biol. 38: 365.
Park et al. (1998) EMBO J. 17: 859-867.
Pascual et al. (2000) J. Mol. Biol. 304: 723-729.
Paszkowski et al. (1984) EMBO J. 3: 2717.
PCT International Publication No. WO 93/05163
PCT International Publication No. WO 93/07278
PCT International Publication No. WO 93/21335
PCT International Publication No. WO 94/00977
PCT International Publication No. WO 94/20627
PCT International Publication No. WO 95/16783
PCT International Publication No. WO 95/19431
PCT International Publication No. WO 96/06166
PCT International Publication No. WO 98/54311
PCT International Publication No. WO 99/32619
PCT International Publication No. WO 99/53050
PCT International Publication No. WO 99/61631
PCT International Publication No. WO 00/07210
PCT International Publication No. WO 00/760067
PCT International Publication No. WO 01/07618
Pearson & Lipman (1988) Proc. Natl. Acad. Sci. USA 85: 2444.
Pearson et al. (1994) Meth. Mol. Biol. 24: 307.
Peifer et al. (1990) Cell 63: 1167-76.
Peifer et al. (1994) Cell 76: 789-791.
Pelham (1990) Trends Biochem. Sci. 15: 483-486.
Perrin (2001) Curr. Biol. 11: R213-R216.
Persson et al. (2001) Plant Physiol. 126: 1092-1104.
Phillips et al. (1988) in Corn and Corn Improvement, 3rd ed., Sprague et al. eds., Amer. Soc of Agronomy.
Picard et al. (1988) Cell 54: 1073-1080.
Pih et al. (1999) Mol. Cells 9: 84-90.
Ponting (2000) Trends Biochem. Sci. 25: 423-426.
Ponting & Parker (1996) Protein Sci. 5: 162-166.
Postma-Haarsma et al. (2002) Plant Mol Biol 48: 423-41.
Potrykus (1985) Trends Biotech. 7: 269.
Potuschak & Doerner (2001) Curr. Opin. Plant Biol. 4: 501-506.
Powell et al. (1989) Proc. Natl. Acad. Sci. USA 86: 6949-6952.
Presley et al. (2002) Nature 417: 187-193.
Purugganan et al. (1995) Genetics 140: 345-356.
Ratajczak (2000) Biochim Biophys Acta 1465: 17-36.
Raven et al. (1999) Biology of Plants, Freeman/Worth.
Rea et al. (1998) Annu. Rev. Plant Physiol. Plant Mol. Biol. 49: 727-760.
Reddy (2001) Int. Rev. Cytol. 204: 97-178.
Redei & Koncz (1992) in Methods in Arabidopsis Research (Koncz C, Chua N-H & Schell J, eds.) World Scientific Press, River Edge, New Jersey, United States of America, pp. 16-82.
Reed et al. (2001) In Vitro Cell Dev Biol-Plant 37: 127-132.
Reichelt et al. (1999) Plant J. 19: 555-567.
Reiser et al. (1995) Cell 83: 735.
Reizer et al. (1991) Mol. Microbiol. 5: 1081-1089.
Riechmann et al. (1994) Nucl. Acids Res. 22: 749-755.
Riechmann & Meyerowitz (1997) Biol. Chem. 378: 1079-1101.
Riggs et al. (1986) Proc. Natl. Acad. Sci. USA 83: 5602.
Ritz et al. (2001) J. Biol. Chem. 276: 22273-22277.
Robertson & Chandler (1992) Plant Mol. Biol. 19: 1031-1044.
Robins et al. (1998) J. Med. Chem. 41: 3857-3864.
Robles et al. (1997) J. Neurosci. Res. 47: 90-97.
Rogers et al. (1985) Proc. Natl. Acad. Sci. USA 82: 6512-6516.
Rogers et al. (2001) J. Biol. Chem. 276: 30914-30922.
Rohrmeier & Lehle (1993) Plant Mol Biol 22: 783-792.
Roosens et al. (2000) Biochim. Biophys. Acta 1463: 470-476.
Rossolini et al. (1994) Mol Cell Probes 8: 91-98.
Roth et al. (1991) Plant Cell 3: 317-325.
Rothstein et al. (1987) Gene 53: 153.
Ruberti et al. (1991) EMBO J. 10: 1787-1791.
Sabelli et al. (1999) Mol. Gen. Genet. 261: 820-830.
Saijo et al. (2001) Plant Cell Physiol. 42: 1228-1233.
Salvucci & Ogren (1996) Photosynthesis Res. 47(1): 1-11.
Salvucci et al. (2001) Plant Physiol. 127: 1053-1064.
Sanchez-Fernandez et al. (2001) J. Biol. Chem. 276: 30231-30244.
Sanderfoot et al. (1999) Plant Physiol. 121: 929-938.
Sanford et al. (1987) Particulate Sci. Tech. 5: 27.
Sankaranarayanan et al. (1996) Nat. Struct. Biol. 3: 596-603.
WO 2004/061080 PCT/US2003/041098 256 Saraste et al. (1990) Trends Biochem. Sci. 15: 430-434. Sato et at. (1997) J. Biol. Chem. 272:24530-5. Sauter et al. (1995) Plant J. 7: 623-632. Savidge et al. (1995) Plant Cell 7: 721-33. 5 Sazer & Dasso (2000) J. Cell Sci. 113: 1111-1118. Scharfmann et al., (1991) Proc Nat/ Acad Sci U S A 88:4626-4630. Schledzewski et al. (1999) J. Mo. Evol. 48: 770-778. Schmidhauser & Helinski (1985) J. Bacteriol. 164:446. Schneeberger et al. (1998) Development 125: 2857-2865. 10 Schocher et al., (1986) Bilo/Technology 4:1093-1096. Schofield (1987) Trends Neurosc. 10: 3-6. Schultz et al. (1998) Plant Cell 10: 837-47. Schwab et al. (2001) Phytochemistry 56: 407-415. Schwartz et al. (1999) Proc. Nat. A cad. Sci. USA 96: 4680-4685. 15 Sehnke et al. (2000) Plant Physiol. 122: 235-242. Sganga et at. (1992) Proc. Nat/. Acad. Sci. USA 89: 6328-6332. Shalev et al. (2001) J. Bio/. Chem. 276: 34948-34957. Shank et al. (2001) Plant Physiol. 126: 267-277. Shcherban et al. (1995) Proc. Nat/. A cad. Sci. USA 92: 9245-9249. 20 Sheridan et al. (1996) Genetics 142:1009. Shibuya et al. (2000) J. Exp. Bot. 51: 2067-2073. Shimamoto et al. (1989) Nature 338:274. Shinshi et al, (1990) Plant Mol Biol 14:357-368. Silverstone et al., (1998) Plant Cell 10:155-169. 25 Singh, (1986) Breeding for Resistance to Diseases and Insect Pests, Springer-Verlag, New York, New York, United States of America. Singh et at. (1998) J. Plant. Physiol. 153: 316-323. Sinha et al. (1993) Genes Dev. 7: 787-795. Sjodahl et al. (1995) Planta 197:264. 30 Skuzeski et al. (1990) Plant Mol. Biol., 15: 65. Smilie (1979) Trends Biochem. Sci. 4: 151-155.
WO 2004/061080 PCT/US2003/041098 257 Smith et al., (2000) Nature 407:319-320. Smith & Waterman (1981) Adv. Apple. Math. 2: 482. Smith et al. (1997) Ann. Rev. Plant Biol. 48: 67. Solocombe et al. (1994) Plant Physiol. 104:1167. 5 Sontag (2001) Cell Signal 13: 7-16. Spencer et al. (1990) Theor. Apple. Genet. 79:625. Staub et al. (1992) Plant Cell 4:39. Staub et al. (1993) EMBO J. 12:601. Stintzi et al. (1993) Biochimie. 75: 687-706. 10 Sukhapinda et al. (1987) Plant Mol. Biol. 8:209. Sulo & Martin (1993) J. Bio/. Chem. 268: 17634-17639. Sung et al. (2001) Mol. Cells 11: 352-359. Suss et al. (1993) Proc. Nat. Acad. Sci. USA 90: 5514-5518. Svab et al. (1990) Proc. Nat. Acad. Sci. USA 87:8526. 15 Takahashi et al. (1994) Plant Mol. Biol. 26: 339-352. Takamori et al. (2000) Nature 407: 189-194. Tanaka et al. (1997) Plant Mol. Biol. 35: 981-986. Tapon & Hall (1997) Curr. Opin. Cell. Biol. 9: 86-92. Taylor et al., (1993) Plant Cell Rep 12:491-495. 20 Theissen et al. (2000) Plant Mol. Biol. 42: 115-149. Thompson et al. (1987) EMBO J. 6:2519. Tomes et al. (1995) Plant Cell, Tissue and Organ Culture: Fundamental Methods, Springer-Verlag. Triezenberg et al., (1988) Genes Dev 2:718-729. 25 Trimarchi & Lees (2002) Nat. Rev. Mol. Cell. Blo1. 3: 11-20. Tsutsumi et al. (1994) Gene 141: 215-220. Turner et al., (1987) Cold Spring Harb Symp Quant Biot L1l:123-133. Udaka et al. (2000) J. Nutr. Sci. Vitaminol. (Tokyo) 46: 84-90. Uknes et al., (1992) Plant Cell4:645-656. 30 Umeda et al. (1999) Mol. Gen. Genet. 262: 230-238. Unger et al., (1989) Plant Mol Biol13:411-418.
WO 2004/061080 PCT/US2003/041098 258 Urao et al. (1996) Plant Mol. Biol. 32:571. U.S. Patent Application No. 20010049831 U.S. Patent No. 4,554,101 U.S. Patent No. 4,940,935 5 U.S. Patent No. 4,945,050 U.S. Patent No. 4,987,071 U.S. Patent No. 5,036,006 U.S. Patent No. 5,100,792 U.S. Patent No. 5,188,642 10 U.S. Patent No. 5,270,163 U.S. Patent No. 5,350,689 U.S. Patent No. 5,451,513 U.S. Patent No. 5,466,785 U.S. Patent No. 5,491,288 15 U.S. Patent No. 5,501,967 U.S. Patent No. 5,523,311 U.S. Patent No. 5,545,817 U.S. Patent No. 5,545,818 U.S. Patent No. 5,591,616 20 U.S. Patent No. 5,614,395 U.S. Patent No. 5,639,949 U.S. Patent No. 5,767,378 U.S. Patent No. 5,929,226 U.S. Patent No. 5,990,386 25 U.S. Patent No. 5,994,629 U.S. Patent No. 6,087,175 U.S. Patent No. 6,369,298 Van Breusegem et al. (1994) Planta 193: 57-66. Van den Broeck et al., (1985) Nature 313:358-363.
WO 2004/061080 PCT/US2003/041098 259 van der Krol et al., (1991) In Antisense nucleic acids and proteins (Joseph M & van der Krol A, eds.) Marcel Dekker Inc, New York, New York, United States of America, pp. 125-141. van Hemert et al. (2001) Bioessays 23: 936-946. 5 van Hille et al. (1993) Biochem Biophys Res. Commun. 197: 15-21. Vasil et al. (1989) Mol. Microbiol. 3:371. Vasil et al., (1992) Bio/Technology 10:667-674. Vasil et al. (1993) Biotechnology 11:1553. Vazquez-Tello et al. (1998) Mol. Gen. Genet. 257: 157-166. 10 Villalba et al. (1993) Eur. J. Biochem. 216: 863-869. Vollbrecht et al. (1991) Nature 350: 241-243. Vos et al. (2000) Plant Cell 12: 979-990. Wadzinski et al. (1993) Mol. Ce// Bio/. 13: 2822-2834. Walter et al. (1990) J. Biol. Chem. 265: 14016-22. 15 Warner et al., (1993) Plant J 3:191-201. Wasmann et al., (1986) Mol Gen Genet 205:446-453. Watanabe et al. (1994) J. Bio/l. Chem. 269: 7744-7749. Waterhouse et al., (1998) Proc NatlA cad Sc/ U S A 95:13959-13964. Weeks et al. (1993) Plant Physio/. 102: 1077. 20 Weissinger et al. (1988) Ann. Rev. Genet. 22: 421. Welsh, (1981) Fundamentals of Plant Genetics and Breeding, John Wiley & Sons, New York, New York, United States of America. Wen et al. (1989) Nuc/. Acids Res. 17: 9490. White et al. (1990) Nucl. Acids Res. 18:1062. 25 Williams et al., (1993) J Clin Invest 92:503-508. Williams-Carrier et al. (1997) Development 124: 3737-3745. Winge et al. (1997) Plant Mol. Biol. 35: 483-495. Wolf & Borchardt (1991) J. Med. Chem. 34: 1521-1530. Wood, (1983) Crop Breeding, American Society of Agronomy, Madison, 30 Wisconsin, United States of America.
WO 2004/061080 PCT/US2003/041098 260 Wricke & Weber, (1986) Quantitative Genetics and Selection Plant Breeding, Walter de Gruyter and Co., Berlin, Germany. Wu et al. (1995) Mol. Cell. Biot. 15: 2536-2546. Xia et al. (1996) Plant J. 10: 761-769. 5 Xiong et al. (2001a) Genes Dev. 15: 1971-1984. Xiong et al. (2001b) Mol. Plant Microbe Interact. 14: 685-692. Xu et al., (1993) Plant Mo/ Bio/ 22:573-588. Yamaguchi-Shinozaki et al. (1990) Plant Mol. Biol. 14: 29-39. Yao et al. (2001) Proc. Nat. Acad. Sci. USA 98: 1306-1311. 10 Yazaki et al. (2001) J. Exp. Bot. 52: 877-9. Yokota et al. (1999a) Plant Physiol. 119: 231-240. Yokota et al. (1999b) Plant Physiol. 121: 525-534. York et al. (1994) Biochemistry 33: 13164-13171. Yucel (2000) J. Cell. Biol. 150: 1-11. 15 Zhang et al. (1998) EMBO J. 17: 6404-6411. Zhao & Last, (1995) J. Biol. Chem. 270: 6081-6087. Zhao et al., (2000) Plant Mol Biot 44:789-98. Zhao et al. (2001) EMBO J. 20: 2315-2325. Zhong et al. (1996) Mo/. Gen. Genet. 251:196. 20 Zhu & Xiao (1998) Nucl. Acids Res. 26: 5402-5408. Zhu et al., (1999) Proc NatlAcad Sci U S A 96:8768-8773. Zhu et al. (2001) Plant Physiol. Biochem. 39: 221-242. Zilliacus et al. (2001) Mol. Endocrino. 15: 501-511. Zollman, et al. (1994) Proc. Nati. Acad. Sci. USA 91: 10717-21. 25 Those skilled in the art will recognize, or be able to ascertain, using no more than routine experimentation, numerous equivalents to the specific embodiments described specifically herein. Such equivalents are intended to be encompassed in the scope of the following claims. 30
Claims (46)
1. An isolated nucleic acid molecule encoding a stress-related polypeptide, wherein the polypeptide binds in a yeast two hybrid assay to a fragment of a protein selected from the group consisting of OsGF14-c (SEQ ID NO: 113), OsDAD1 (SEQ ID NO: 128), Os006819-2510 (SEQ ID NO: 20), OsCRTC (SEQ ID NO: 134), OsSGT1 (SEQ ID NO: 144), OsERP (SEQ ID NO: 146), OsCHIB1 (SEQ ID NO: 152), OsCS (SEQ ID NO: 156), OsPP2A-2 (SEQ ID NO: 164), and OsCAA90866 (SEQ ID NO: 170).
2. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid molecule is derived from rice (Oryza sativa).
3. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence selected from the group consisting of odd numbered SEQ ID NOs: 1-111.
4. The isolated nucleic acid molecule of claim 3, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 1-15 and the protein comprises an amino acid sequence of SEQ ID NO: 114.
5. The isolated nucleic acid molecule of claim 3, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence of one of SEQ ID NOs: 7 and 17 and the protein comprises an amino acid sequence of SEQ ID NO: 128.
6. The isolated nucleic acid molecule of claim 3, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 21-25 and the protein comprises an amino acid sequence of SEQ ID NO: 20.
7. The isolated nucleic acid molecule of claim 3, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence of SEQ ID NO: 27 and the protein comprises an amino acid sequence of SEQ ID NO: 134.
8. The isolated nucleic acid molecule of claim 3, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence of SEQ ID NO: 29 and the protein comprises an amino acid sequence of SEQ ID NO: 138.
9. The isolated nucleic acid molecule of claim 3, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 31-43 and the protein comprises an amino acid sequence of SEQ ID NO: 144.
10. The isolated nucleic acid molecule of claim 3, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 45-67 and the protein comprises an amino acid sequence of SEQ ID NO: 146.
11. The isolated nucleic acid molecule of claim 3, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence of SEQ ID NO: 69 and the protein comprises an amino acid sequence of SEQ ID NO: 36.
12. The isolated nucleic acid molecule of claim 3, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 71-77 and the protein comprises an amino acid sequence of SEQ ID NO: 152.
13. The isolated nucleic acid molecule of claim 3, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 79-95 and the protein comprises an amino acid sequence of SEQ ID NO: 156.
14. The isolated nucleic acid molecule of claim 3, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 97-105 and the protein comprises an amino acid sequence of SEQ ID NO: 164.
15. The isolated nucleic acid molecule of claim 3, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 97 and 107-111 and the protein comprises an amino acid sequence of SEQ ID NO: 170.
16. An isolated nucleic acid molecule encoding a stress-related polypeptide, wherein the nucleic acid molecule is selected from the group consisting of:
    (a) a nucleic acid molecule encoding a polypeptide comprising an amino acid sequence of one of even numbered SEQ ID NOs: 2-112;
    (b) a nucleic acid molecule comprising a nucleic acid sequence of one of odd numbered SEQ ID NOs: 1-111;
    (c) a nucleic acid molecule that has a nucleic acid sequence at least 90% identical to the nucleic acid sequence of the nucleic acid molecule of (a) or (b);
    (d) a nucleic acid molecule that hybridizes to (a) or (b) under conditions of hybridization selected from the group consisting of:
        (i) 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM ethylenediamine tetraacetic acid (EDTA) at 50°C with a final wash in 2X standard saline citrate (SSC), 0.1% SDS at 50°C;
        (ii) 7% SDS, 0.5 M NaPO4, 1 mM EDTA at 50°C with a final wash in 1X SSC, 0.1% SDS at 50°C;
        (iii) 7% SDS, 0.5 M NaPO4, 1 mM EDTA at 50°C with a final wash in 0.5X SSC, 0.1% SDS at 50°C;
        (iv) 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C with a final wash in 0.1X SSC, 0.1% SDS at 50°C; and
        (v) 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C with a final wash in 0.1X SSC, 0.1% SDS at 65°C;
    (e) a nucleic acid molecule comprising a nucleic acid sequence fully complementary to (a); and
    (f) a nucleic acid molecule comprising a nucleic acid sequence that is the full reverse complement of (a).
17. An isolated stress-related polypeptide encoded by the isolated nucleic acid molecule of claim 16, or a functional fragment, domain, or feature thereof.
18. A method for producing a polypeptide of claim 17, comprising the steps of: (a) growing cells comprising an expression cassette under suitable growth conditions, the expression cassette comprising a nucleic acid molecule of claim 16; and (b) isolating the polypeptide from the cells.
19. A transgenic plant cell comprising an isolated nucleic acid molecule of claim 1.
20. The transgenic plant of claim 19, wherein the plant is selected from the group consisting of corn (Zea mays), Brassica sp., alfalfa (Medicago sativa), rice (Oryza sativa ssp.), rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum vulgare), pearl millet (Pennisetum glaucum), proso millet (Panicum miliaceum), foxtail millet (Setaria italica), finger millet (Eleusine coracana), sunflower (Helianthus annuus), safflower (Carthamus tinctorius), wheat (Triticum aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanut (Arachis hypogaea), cotton, sweet potato (Ipomoea batatus), cassava (Manihot esculenta), coffee (Cofea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa spp.), avocado (Persea ultilane), fig (Ficus casica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia integrifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), sugarcane (Saccharum spp.), oats, duckweed (Lemna), barley, a vegetable, an ornamental, and a conifer.
21. The transgenic plant of claim 20, wherein the plant is rice (Oryza sativa ssp.).
22. The transgenic plant of claim 20, wherein the duckweed is selected from the group consisting of genus Lemna, genus Spirodela, genus Woffia, and genus Wofiella.
23. The transgenic plant of claim 20, wherein the vegetable is selected from the group consisting of tomatoes, lettuce, guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean, lima bean, fava bean, lentils, chickpea, green bean, lima bean, pea, and members of the genus Cucumis.
24. The transgenic plant of claim 20, wherein the ornamental is selected from the group consisting of impatiens, Begonia, Pelargonium, Viola, Cyclamen, Verbena, Vinca, Tagetes, Primula, Saint Paulia, Agertum, Amaranthus, Antihirrhinum, Aquilegia, Cineraria, Clover, Cosmo, Cowpea, Dahlia, Datura, Delphinium, Gerbera, Gladiolus, Gloxinia, Hippeastrum, Mesembryanthemum, Salpiglossos, and Zinnia, azalea, hydrangea, hibiscus, rose, tulip, daffodil, petunia, carnation, poinsettia, and chrysanthemum.
25. The transgenic plant of claim 20, wherein the conifer is selected from the group consisting of loblolly pine, slash pine, ponderosa pine, lodgepole pine, Monterey pine, Douglas-fir, Western hemlock, Sitka spruce, redwood, silver fir, balsam fir, Western red cedar, and Alaska yellow-cedar.
26. The transgenic plant of claim 19, wherein the transgenic plant is a plant selected from the group consisting of Acacia, aneth, artichoke, arugula, blackberry, canola, cilantro, clementines, escarole, eucalyptus, fennel, grapefruit, honey dew, jicama, kiwifruit, lemon, lime, mushroom, nut, okra, orange, parsley, persimmon, plantain, pomegranate, poplar, radiata pine, radicchio, Southern pine, sweetgum, tangerine, triticale, vine, yams, apple, pear, quince, cherry, apricot, melon, hemp, buckwheat, grape, raspberry, chenopodium, blueberry, nectarine, peach, plum, strawberry, watermelon, eggplant, pepper, cauliflower, Brassica, broccoli, cabbage, ultilan sprouts, onion, carrot, leek, beet, broad bean, celery, radish, pumpkin, endive, gourd, garlic, snapbean, spinach, squash, turnip, ultilane, and zucchini.
27. An isolated stress-related polypeptide, wherein the polypeptide binds in a yeast two hybrid assay to a fragment of a protein selected from the group consisting of OsGF14-c (SEQ ID NO: 113), OsDAD1 (SEQ ID NO: 128), Os006819-2510 (SEQ ID NO: 20), OsCRTC (SEQ ID NO: 134), OsSGT1 (SEQ ID NO: 144), OsERP (SEQ ID NO: 146), OsCHIB1 (SEQ ID NO: 152), OsCS (SEQ ID NO: 156), OsPP2A-2 (SEQ ID NO: 164), and OsCAA90866 (SEQ ID NO: 170).
28. The isolated stress-related polypeptide of claim 17, wherein the isolated stress-related polypeptide is selected from the group consisting of: (a) a polypeptide comprising an amino acid sequence of even numbered SEQ ID NOs: 2-112; and (b) a polypeptide comprising an amino acid sequence at least 80% similar to the polypeptide of (a) using the GCG Wisconsin Package SEQWEB® application of GAP with the default GAP analysis parameters.
29. The isolated stress-related polypeptide of claim 28, wherein the polypeptide comprises an amino acid sequence of one of even numbered SEQ ID NOs: 2-112.
30. An expression cassette comprising a nucleic acid molecule encoding a stress-related polypeptide of claim 1.
31. The expression cassette of claim 30, wherein the nucleic acid molecule encoding a stress-related polypeptide comprises a nucleic acid sequence selected from odd numbered SEQ ID NOs: 1-111.
32. The expression cassette of claim 30, wherein the expression cassette further comprises a regulatory element operatively linked to the nucleic acid molecule.
33. The expression cassette of claim 32, wherein the regulatory element comprises a promoter.
34. The expression cassette of claim 33, wherein the promoter is a plant promoter.
35. The expression cassette of claim 33, wherein the promoter is a constitutive promoter.
36. The expression cassette of claim 33, wherein the promoter is a tissue-specific or a cell type-specific promoter.
37. The expression cassette of claim 36, wherein the tissue-specific or cell type-specific promoter directs expression of the expression cassette in a location selected from the group consisting of epidermis, root, vascular tissue, meristem, cambium, cortex, pith, leaf, flower, seed, and combinations thereof.
38. A transgenic plant cell comprising the expression cassette of claim 30.
39. The transgenic plant cell of claim 38, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence of one of odd numbered SEQ ID NOs: 1-111.
40. A transgenic plant comprising the expression cassette of claim 30.
41. Transgenic seeds or progeny of the transgenic plant of claim 40.
42. A method for modulating stress response of a plant cell comprising introducing into the plant cell an expression cassette comprising an isolated nucleic acid molecule encoding a stress-related polypeptide, wherein the polypeptide binds in a yeast two hybrid assay to a fragment of a protein selected from the group consisting of OsGF14-c (SEQ ID NO: 113), OsDAD1 (SEQ ID NO: 128), Os006819-2510 (SEQ ID NO: 20), OsCRTC (SEQ ID NO: 134), OsSGT1 (SEQ ID NO: 144), OsERP (SEQ ID NO: 146), OsCHIB1 (SEQ ID NO: 152), OsCS (SEQ ID NO: 156), OsPP2A-2 (SEQ ID NO: 164), and OsCAA90866 (SEQ ID NO: 170).
43. The method of claim 42, wherein expression of the polypeptide in the cell results in an enhancement of a rate or extent of proliferation of the cell.
44. The method of claim 42, wherein expression of the polypeptide in the cell results in a decrease in a rate or extent of proliferation of the cell.
45. The method of claim 42, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence selected from one of odd numbered SEQ ID NOs: 1-173.
46. The method of claim 45, wherein the isolated nucleic acid molecule comprises a nucleic acid sequence selected from one of odd numbered SEQ ID NOs: 1-111.
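As an informal illustration of the sequence relationships recited in claim 16 (percent identity in item (c), complement and reverse complement in items (e) and (f)), the following Python sketch shows the string operations these terms usually denote. The function names are hypothetical and do not come from the specification. Percent identity is computed here over two sequences assumed to be already aligned to equal length, which is a simplification of what alignment software does, and the sketch does not address how the claim terms would be construed.

```python
# Illustrative helpers only; names are hypothetical, not from the specification.

# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def complement(seq: str) -> str:
    """Return the base-by-base complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)


def reverse_complement(seq: str) -> str:
    """Return the complement read in the opposite direction."""
    return complement(seq)[::-1]


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over two pre-aligned, equal-length sequences.

    Assumption: '-' marks a gap. Columns where both sequences have a gap
    are ignored; a gap opposite a base counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    print(complement("ATGC"))          # TACG
    print(reverse_complement("ATGC"))  # GCAT
    # Nine of ten aligned positions match.
    print(percent_identity("ACGTACGTAC", "ACGTACGTAA"))  # 90.0
```

Under these assumptions, a toy sequence pair that differs at one of ten aligned positions sits exactly at the 90% figure named in item (c); real comparisons would depend on the alignment method and its parameters.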
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AU2008202017A AU2008202017A1 (en) | 2002-12-26 | 2008-05-07 | Stress-related polypeptides and uses therefor |
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US43656402P | 2002-12-26 | 2002-12-26 | |
| US60/436,564 | 2002-12-26 | ||
| PCT/US2003/041098 WO2004061080A2 (en) | 2002-12-26 | 2003-12-23 | Stress-related polypeptides and uses therefor |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AU2008202017A Division AU2008202017A1 (en) | 2002-12-26 | 2008-05-07 | Stress-related polypeptides and uses therefor |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| AU2003299859A1 (en) | 2004-07-29 |
| AU2003299859B2 (en) | 2008-05-22 |
Family
ID=32713069
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AU2003299859A Ceased AU2003299859B2 (en) | 2002-12-26 | 2003-12-23 | Stress-related polypeptides and uses therefor |
Country Status (6)
| Country | Link |
|---|---|
| US (2) | US20060235215A1 (en) |
| EP (1) | EP1578971A4 (en) |
| CN (1) | CN1922323A (en) |
| AU (1) | AU2003299859B2 (en) |
| CA (1) | CA2507868A1 (en) |
| WO (1) | WO2004061080A2 (en) |
Families Citing this family (30)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CA2511824A1 (en) * | 2002-12-26 | 2004-07-22 | Syngenta Participations Ag | Cell proliferation-related polypeptides and uses therefor |
| EP3078749B1 (en) * | 2004-12-21 | 2019-10-09 | Monsanto Technology LLC | Transgenic plants with enhanced agronomic traits |
| AR051866A1 (en) * | 2004-12-24 | 2007-02-14 | Cropdesign Nv | PLANTS WITH INCREASED PERFORMANCE AND PREPARATION METHOD |
| WO2007091634A1 (en) | 2006-02-09 | 2007-08-16 | Japan Science And Technology Agency | Plant having improved growth ability and disease resistance and method for production thereof |
| MX2008015742A (en) | 2006-06-06 | 2008-12-19 | Monsanto Technology Llc | Method for selection of transformed cells. |
| US7855326B2 (en) * | 2006-06-06 | 2010-12-21 | Monsanto Technology Llc | Methods for weed control using plants having dicamba-degrading enzymatic activity |
| WO2008087932A1 (en) | 2007-01-16 | 2008-07-24 | Japan Science And Technology Agency | Plant having increased yield of seeds |
| US7838729B2 (en) * | 2007-02-26 | 2010-11-23 | Monsanto Technology Llc | Chloroplast transit peptides for efficient targeting of DMO and uses thereof |
| WO2009095922A1 (en) * | 2008-01-31 | 2009-08-06 | Yeda Research And Development Co. Ltd. | Method for protecting plants from stress and senescence |
| WO2009095881A2 (en) | 2008-01-31 | 2009-08-06 | National Institute For Biological Sciences | Plants having altered growth and/or development and a method for making the same |
| CA2722806A1 (en) * | 2008-05-05 | 2009-11-12 | Basf Plant Science Gmbh | Plants having enhanced yield-related traits and a method for making the same |
| CN102046797A (en) * | 2008-05-05 | 2011-05-04 | 巴斯夫植物科学有限公司 | Plants having enhanced yield-related traits and a method for making the same |
| WO2010087805A2 (en) * | 2008-07-10 | 2010-08-05 | Carnegie Institution Of Washington | Insult resistant plants and methods of producing and using the same |
| KR101183112B1 (en) | 2010-05-18 | 2012-09-27 | 대한민국 | Mshsp23.3 gene from Medicago sativa and uses thereof |
| KR101416506B1 (en) | 2012-08-10 | 2014-07-09 | 연세대학교 산학협력단 | Gene Implicated in Abiotic Stress Tolerance and Growth Accelerating and Use Thereof |
| CN103073626B (en) * | 2012-12-11 | 2014-06-04 | 贵州省油菜研究所 | Wheat calreticulin fragment TaCRT1-206, and coding sequence and application thereof |
| WO2014190532A1 (en) * | 2013-05-31 | 2014-12-04 | 创世纪转基因技术有限公司 | Cotton pp2ac-type protein phosphatase pp2ac-4, coding gene of same, and application thereof |
| WO2018169787A1 (en) * | 2017-03-13 | 2018-09-20 | Ball Horticultural Company | Downy mildew resistant impatiens |
| US11166424B2 (en) | 2017-03-13 | 2021-11-09 | Ball Horticultural Company | Downy mildew resistant Impatiens |
| CN109609512B (en) * | 2018-12-25 | 2022-06-07 | 郑州师范学院 | Application of Phalaenopsis PP2A Gene as Internal Reference Gene |
| CN109913478A (en) * | 2019-04-09 | 2019-06-21 | 贵州大学 | A kind of sorghum E3 ubiquitin ligase SbBAG4 gene and its recombinant vector and expression method |
| CN110343154B (en) * | 2019-07-18 | 2021-02-09 | 中国农业科学院生物技术研究所 | Cloning and application of a key gene SEM1 controlling the source and flow of rice |
| CN110468142B (en) * | 2019-09-27 | 2022-06-07 | 西北农林科技大学 | Negative regulator AtRTP5 gene and its application against Phytophthora phytophthora |
| CN111073904B (en) * | 2019-12-10 | 2023-12-22 | 北大荒垦丰种业股份有限公司 | Genetic transformation, gene editing and analysis method of soybean main cultivar |
| CN112552387B (en) * | 2020-12-31 | 2022-11-11 | 山西大学 | A kind of antitumor active protein of daylily and its preparation method and application |
| CN113817038B (en) * | 2021-10-29 | 2023-09-19 | 海南大学 | Application of VaVPAC, a protein derived from adzuki bean and its encoding gene, in enhancing drought resistance of tobacco |
| CN116768991B (en) * | 2022-03-10 | 2024-07-05 | 中国科学院遗传与发育生物学研究所 | Soybean four-transmembrane-region protein GmTET270,270 related to lipid metabolism regulation and encoding gene and application thereof |
| CN116179379B (en) * | 2022-07-05 | 2025-07-04 | 浙江工业大学 | Recombinant Pichia pastoris highly expressing DNase I, construction method and application thereof |
| CN115820660A (en) * | 2022-07-13 | 2023-03-21 | 东北农业大学 | Hemerocallis fulva PDS gene VIGS silencing system and application thereof |
| CN120064671B (en) * | 2025-04-29 | 2025-07-29 | 吉林大学 | Composition for detecting content of infant saliva PRB1 and application |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP1033405A3 (en) * | 1999-02-25 | 2001-08-01 | Ceres Incorporated | Sequence-determined DNA fragments and corresponding polypeptides encoded thereby |
| US6677502B1 (en) * | 1999-07-12 | 2004-01-13 | E.I. Du Pont De Nemours And Company | Plant metabolism genes |
| AR025996A1 (en) * | 1999-10-07 | 2002-12-26 | Valigen Us Inc | NON-TRANSGENIC PLANTS RESISTANT TO HERBICIDES. |
| JP2006507819A (en) * | 2002-11-15 | 2006-03-09 | ポスコ | Rice organ preferential gene identification method using T-DNA insertion mutation and gene identified by the method |
2003
- 2003-12-23 EP EP03800133A patent/EP1578971A4/en not_active Withdrawn
- 2003-12-23 AU AU2003299859A patent/AU2003299859B2/en not_active Ceased
- 2003-12-23 CA CA002507868A patent/CA2507868A1/en not_active Abandoned
- 2003-12-23 US US10/533,176 patent/US20060235215A1/en not_active Abandoned
- 2003-12-23 CN CNA2003801077621A patent/CN1922323A/en active Pending
- 2003-12-23 WO PCT/US2003/041098 patent/WO2004061080A2/en not_active Ceased
2008
- 2008-10-03 US US12/245,449 patent/US20090178162A1/en not_active Abandoned
Also Published As
| Publication number | Publication date |
|---|---|
| EP1578971A2 (en) | 2005-09-28 |
| WO2004061080A2 (en) | 2004-07-22 |
| CN1922323A (en) | 2007-02-28 |
| EP1578971A4 (en) | 2006-06-28 |
| AU2003299859B2 (en) | 2008-05-22 |
| US20090178162A1 (en) | 2009-07-09 |
| WO2004061080A3 (en) | 2004-11-04 |
| CA2507868A1 (en) | 2004-07-22 |
| US20060235215A1 (en) | 2006-10-19 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| AU2003299859B2 (en) | Stress-related polypeptides and uses therefor | |
| US20090178157A1 (en) | Cell proliferation-related polypeptides and uses therefor | |
| US20100218282A1 (en) | Nucleic acid molecules from rice controlling abiotic stress tolerance | |
| US20040219675A1 (en) | Nucleic acid molecules from rice encoding proteins for abiotic stress tolerance, enhanced yeild, disease resistance and altered nutritional quality and uses thereof | |
| US20090328255A1 (en) | Nitrogen limitation adaptability gene and protein and modulation thereof | |
| US8586825B2 (en) | Nitrogen-regulated sugar sensing gene and protein and modulation thereof | |
| US8742201B2 (en) | Nitrogen-regulated sugar sensing gene and protein and modulation thereof | |
| US7230159B2 (en) | Isolated BOS1 gene promoters from arabidopsis and uses thereof | |
| US6956115B2 (en) | Nucleic acid molecules from rice encoding RAR1 disease resistance proteins and uses thereof | |
| AU2008202017A1 (en) | Stress-related polypeptides and uses therefor | |
| WO2007036045A1 (en) | Method of modulating flowering time and shoot branching |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| FGA | Letters patent sealed or granted (standard patent) | ||
| MK14 | Patent ceased section 143(a) (annual fees not paid) or expired |