WO2024233528A2 - Genetic markers associated with increased fertility in maize - Google Patents
Genetic markers associated with increased fertility in maize Download PDFInfo
- Publication number
- WO2024233528A2 WO2024233528A2 PCT/US2024/028116 US2024028116W WO2024233528A2 WO 2024233528 A2 WO2024233528 A2 WO 2024233528A2 US 2024028116 W US2024028116 W US 2024028116W WO 2024233528 A2 WO2024233528 A2 WO 2024233528A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- seq
- plant
- marker
- marker loci
- position number
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H1/00—Processes for modifying genotypes ; Plants characterised by associated natural traits
- A01H1/02—Methods or apparatus for hybridisation; Artificial pollination ; Fertility
- A01H1/022—Genic fertility modification, e.g. apomixis
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H1/00—Processes for modifying genotypes ; Plants characterised by associated natural traits
- A01H1/04—Processes of selection involving genotypic or phenotypic markers; Methods of using phenotypic markers for selection
- A01H1/045—Processes of selection involving genotypic or phenotypic markers; Methods of using phenotypic markers for selection using molecular markers
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H5/00—Angiosperms, i.e. flowering plants, characterised by their plant parts; Angiosperms characterised otherwise than by their botanic taxonomy
- A01H5/10—Seeds
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H6/00—Angiosperms, i.e. flowering plants, characterised by their botanic taxonomy
- A01H6/46—Gramineae or Poaceae, e.g. ryegrass, rice, wheat or maize
- A01H6/4684—Zea mays [maize]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/6895—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for plants, fungi or algae
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/13—Plant traits
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
Definitions
- This disclosure relates to the field of plant biotechnology.
- sequence listing is submitted electronically as an XML formatted sequence listing with a file named 109770_82681_SL, created on April 23, 2024, and having a size of 304,751 bytes, and is filed concurrently with the specification.
- sequence listing contained in this xml formatted document is part of the specification and is herein incorporated by reference in its entirety.
- the MIR162 event is a commercially important event for controlling certain pests in maize, such as fall armyworm.
- the MIR162 locus has been observed to be associated with decreased male fertility in certain inbred maize genetic backgrounds.
- the degree to which male fertility is impacted is inbred specific: some inbreds exhibit little or no reduction in male fertility when homozygous for the MIR162 locus; and other inbreds exhibit a significant reduction in male fertility when homozygous for the MIR162 locus.
- the degree to which male fertility is decreased is also affected by environmental factors, such as water availability and temperature.
- drought and high temperature conditions can increase MIR162-associated reductions in male fertility, which can make seed production more challenging and costly.
- a method of producing an MIR162 plant with increased fertility comprising: selecting a first maize plant comprising one or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1-223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9, crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, wherein the first maize plant and/or the second maize plant comprises maize event MIR162, and selecting from the plurality of progeny plants a selected progeny MIR162 plant comprising one or more fertile QTLs.
- a method of producing a MIR162 plant with increased fertility comprising: selecting a first maize plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and
- a method of producing a MIR162 plant with increased fertility comprising: selecting a first maize plant comprising one or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1-223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9, and introducing a MIR162 event into the selected maize plant.
- a method of producing a MIR162 plant with increased fertility comprising: selecting a first maize plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712
- a method of producing a MIR162 plant with increased fertility comprising: selecting a first maize plant comprising a transposon element (TE) comprising SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233, crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, wherein the first maize plant and/or the second maize plant comprises maize MIR162 event, and selecting from the plurality of progeny plants a selected progeny MIR162 plant comprising the transposon element.
- TE transposon element
- a method of producing a plant compatible with a MIR162 event comprising: selecting a first maize plant comprising a transposon element (TE) comprising SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233, crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, and selecting from the plurality of progeny plants a selected progeny plant comprising the transposon element.
- TE transposon element
- a MIR162 plant comprising one or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1-223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9
- a MIR162 plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO
- kits for genotyping a plant comprising reagents for detecting the presence of one or more marker loci of SEQ ID Nos: 1- 223 or 255; one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located
- a method of detecting one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL comprising genotyping one or more fertile QTLs from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56
- a method of detecting one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL comprising genotyping one or more fertile QTLs from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56
- plants having genomic loci identified as associated with decreased fertility can be excluded from such a breeding program. These loci can so be used to rescue the conversion failure phenotype by introducing a favorable allele at a genomic locus (e.g., by breeding or genome editing) into a genetic background that contains the unfavorable allele at the genomic locus.
- a “coding sequence” is a nucleic acid sequence that is transcribed into RNA such as mRNA, rRNA, tRNA, snRNA, sense RNA or antisense RNA. Preferably the RNA is then translated in an organism to produce a protein.
- locus is a chromosome region or chromosomal region where a polymorphic nucleic acid, trait determinant, gene, or marker is located.
- a locus can represent a single nucleotide, a few nucleotides or a large number of nucleotides in a genomic region.
- the loci of this disclosure comprise one or more polymorphisms in a population (e.g., alternative alleles are present in some individuals).
- allele refers to one of two or more different nucleotides or nucleotide sequences (or the absence thereof) that occur at a specific locus or set of contiguous loci. In some embodiments, the term “allele” may be used interchangeably with the term “marker.”
- a “centimorgan” is a unit of measure of recombination frequency and genetic distance between two loci. One cM is equal to a 1% chance that a marker at one genetic locus will be separated from a marker at a second locus due to crossing over in a single generation.
- “closely linked” means that the marker or locus is within about 20 cM, 15 cM, 10 cM, 5 cM, 4 cM, 3 cM, 2 cM, 1 cM, 0.5 cM, or less than 0.5 cM of another marker or locus.
- 20 cM means that recombination occurs between the marker and the locus with a frequency of equal to or less than about 20%.
- cross refers to the fusion of gametes via pollination to produce progeny (e.g., cells, seeds or plants).
- progeny e.g., cells, seeds or plants.
- the term encompasses both sexual crosses (the pollination of one plant by another) and selfing (self-pollination, e.g., when the pollen and ovule are from the same plant).
- crossing refers to the act of fusing gametes via pollination to produce progeny.
- the plant disclosed herein may be a whole plant, or may be a plant cell, seed, or tissue, or a plant part such as leaf, stem, pollen, or cell that can be cultivated into a whole plant.
- backcross and “backcrossing” refer to the process whereby a progeny plant is crossed back to one of its parents for one or more generations (e.g., 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, or 7 or more times, etc.).
- the “donor” parent refers to the parental plant with the desired gene or locus to be introgressed.
- the “recipient” parent (used one or more times) or “recurrent” parent (used two or more times) refers to the parental plant into which the gene or locus is being introgressed. For example, see Ragot et al.
- At least one or more generations of progeny are identified and/or selected for the presence of the desired gene or locus (e.g., in a nucleic acid sample from the progeny plant or plant part). In embodiments, two or more generations (or even all generations) of progeny are identified and/or selected for the presence of the desired gene or locus.
- elite and “elite line” refer to any line that has resulted from breeding and selection for desirable agronomic performance.
- An elite line may be substantially homozygous. Numerous elite lines are available and known to those of skill in the art, for example, those described in Table 5.
- QTL quantitative trait locus
- QTLs quantitative trait loci
- transgenic “event” refers to a recombinant plant produced by transformation and regeneration of a plant cell or tissue with heterologous DNA, for example, an expression cassette that includes a gene of interest.
- the term “event” refers to the original transformant and/or progeny of the transformant that include the heterologous DNA.
- the term “event” also refers to progeny produced by a sexual outcross between the transformant and another com line. Even after repeated backcrossing to a recurrent parent, the inserted DNA and the flanking DNA from the transformed parent is present in the progeny of the cross at the same chromosomal location.
- event also refers to DNA from the original transformant comprising the inserted DNA and flanking genomic sequence immediately adjacent to the inserted DNA that would be expected to be transferred to a progeny that receives inserted DNA including the transgene of interest as the result of a sexual cross of one parental line that includes the inserted DNA (e.g., the original transformant and progeny resulting from selfing) and a parental line that does not contain the inserted DNA.
- transformation of plant tissue produces multiple events, each of which represent insertion of a DNA construct into a different location in the genome of a plant cell. Based on the expression of the transgene or other desirable characteristics, a particular event is selected.
- fertile refers to a plant that is fertile enough for use in a breeding and/or seed production program.
- fertile plants are plants that release at least about 100,000; 150,000; 200,000; 250,000, 300,000, 350,000, 400,000 or 450,000 pollen grains per tassel per day in the three-day period immediately following anther extrusion.
- the term “infertile” refers to a plant that is insufficiently fertile for use in a breeding program.
- “infertile” plants are plants that release fewer than 25,000; 50,000; 75,000 or 100,000 pollen grains per tassel per day in the three-day period immediately following anther extrusion.
- “Infertile” plants may produce and/or release viable pollen grains. Indeed, in some embodiments, “infertile” plants produce and release viable pollen grains but do so a rate that is insufficient for effective use in a breeding and/or seed production program.
- the terms “increase,” “increases,” “increasing” and similar terms refer to an augmentation of at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 100%, 125%, 150%, 175%, 200%, 300% or more.
- the term “increased fertility” refers to an improvement in one or more fertility traits of a subject plant as compared to one or more control plants.
- a control plant used in this context, refers to a plant having the identical genetic background as compared to the subject plant except for the lack of one or more markers/alleles associated with increased fertility. For example, an increase in fertility of an MIR162 plant having a fertile QTL is relative to a control plant and said control plant is an MIR162 plant lacking such a fertile QTL.
- Exemplary fertility traits include, but are not limited to, pollen count, pollen morphology, pollen production per anther, anther count, anther morphology, anthers per tassel, tassel count, tassel morphology, tassels per plant, silk count, silk morphology, silk production per plant, kernel count, kernel morphology, kernel production per ear, prevalence of kernel abortion, kernel production per plant and kernel viability.
- a control plant e.g., a native plant/germplasm of the same species, one or both parents, a near
- the term “increased male fertility” refers to an improvement in one or more male fertility traits as compared to one or more controls (e.g., a native plant/germplasm of the same species; one or both parents, a near isogenic plant that lacks one or more markers/alleles associated with increased fertility, a near isogenic plant that lacks MIR162, a plant that contains MIR162 and one or more unfavorable alleles such as those in the Examples, etc.).
- a native plant/germplasm of the same species e.g., a native plant/germplasm of the same species; one or both parents, a near isogenic plant that lacks one or more markers/alleles associated with increased fertility, a near isogenic plant that lacks MIR162, a plant that contains MIR162 and one or more unfavorable alleles such as those in the Examples, etc.
- Exemplary male fertility traits include, but are not limited to, pollen count, pollen morphology, pollen production per anther, anther count, anther morphology, anthers per tassel, tassel count, tassel morphology and tassels per plant.
- a control plant e.g., a native plant/germplasm of the same species, one or both parents, a near isogenic plant that lacks one or more markers/alleles associated with increased fertility, etc.
- the term “increased male fertility” refers to an improvement in one or more male fertility traits in a plant that arises from that plant part.
- the term “unique” to MIR162 means distinctively characteristic of MIR162. Therefore, nucleic acids unique to event MIR162 are not found in other non- MIR162 maize plants.
- Vip3A class of proteins comprises, for example, Vip3Aa, Vip3Ab, Vip3Ac, Vip3Ad, Vip3Ae, VipAf, and Vip3Ag, , and their homologues.
- “Homologue” means that the indicated protein or polypeptide bears a defined relationship to other members of the Vip3A class of proteins.
- “Vip3Aa20” is a Vip3A homologue unique to event MIR162. It was generated by spontaneous mutations introduced into the maize-optimized Vip3Aal9 gene comprised in pNOV1300 (SEQ ID NO: 234) during the plant transformation process as described in U.S. Pat. No. 8,232,456 and below.
- a marker associated with a trait refers to when the marker and trait are linked such that the presence of the marker is an indicator of the presence and/or extent the desired trait or trait form will occur in a plant/plant part comprising the marker.
- a marker is “associated with” an allele when the marker and allele are linked such that the presence of the marker is an indicator of the presence of the allele in a plant/plant part comprising the marker.
- a marker associated with increased male fertility refers to a marker whose presence or absence can be used to predict whether and/or to what extent a plant/plant part will display increased male fertility.
- SNP single nucleotide polymorphism
- allele of interest and “favorable allele” are used interchangeably to refer to an allele that is linked to a desired trait, e.g., increased fertility.
- An “allele of interest” may be associated with either an increase or decrease of or in a given trait, depending on the nature of the desired phenotype, and may be associated with a change in morphology, color, etc.
- the “allele of interest” is associated with increased male fertility and may therefore be used as a marker to identify, select and/or produce fertile maize plants; to predict whether and/or to what extent a maize plant will be fertile; to reduce the costs associated with breeding and/or seed production programs; and/or to increase the efficiency of breeding and/or seed production programs.
- unfavorable allele refers to an allele that segregates with an unfavorable plant phenotype, therefore providing the benefit of identifying plants that can be removed from a breeding program or planting.
- “fertile QTL” refers to a QTL comprising a marker locus or marker loci with one or more alleles that, in combination, are associated with baseline or increased fertility of MIR162 plants, e.g., as measured by seed set and/or yield, relative to control plants that do not contain MIR162 or increased fertility relative to plants that contain MIR162 but also contain infertile QTLs. These alleles are referred to as “favorable allele” in this application.
- infertile QTL refers to a QTL comprising a marker locus or marker loci with one or more alleles that, in combination, are associated with decreased fertility of MIR162 plants, e.g., as measured by seed set and/or yield, relative to control plants that do not contain MIR162. These alleles are referred to as “unfavorable allele” in this application.
- the term “plant” may refer to any suitable plant, including, but not limited to, spermatophytes (e.g., angiosperms and gymnosperms) and embryophytes (e.g., bryophytes, ferns and fern allies).
- spermatophytes e.g., angiosperms and gymnosperms
- embryophytes e.g., bryophytes, ferns and fern allies.
- the plant is a monocotyledonous (monocot) plant such as a rice, maize, wheat, barley, sorghum, millet, oat, triticale, rye, buckwheat, fonio, quinoa, sugar cane, bamboo, banana, ginger, onion, lily, daffodil, iris, amaryllis, orchid, canna, bluebell, tulip, garlic, secale, einkom, spelt, emmer, durum, kamut, grass (e.g., gramma grass), teff, milo, flax, Tripsacum sp., or teosinte plant.
- monocot monocot
- the plant is a dicotyledonous (dicot) plant such as a blackberry, raspberry, strawberry, barberry, bearberry, blueberry, coffee berry, cranberry, crowberry, currant, elderberry, gooseberry, goji berry, honeyberry, lemon, lime, lingonberry, mangosteen, orange, pepper, persimmon, pomegranate, prune, cotton, clover, acai, plum, peach, nectarine, cherry, guava, almond, pecan, walnut, amaranth, apple, sweet pea, pear, potato, soybean, sugar beet, sunflower, sweet potato, tamarind, tea, tobacco or tomato plant.
- a dicotyledonous (dicot) plant such as a blackberry, raspberry, strawberry, barberry, bearberry, blueberry, coffee berry, cranberry, crowberry, currant, elderberry, gooseberry, goji berry, honeyberry, lemon, lime, lingonberry, mangosteen, orange
- plant cell refers to a cell existing in, taken from and/or derived from a plant (e.g., derived from a plant cell/tissue culture).
- plant cell may refer to an isolated plant cell, a plant cell in a culture, a plant cell in an isolated tissue/organ and/or a plant cell in a whole plant.
- plant part refers to at least a fragment of a whole plant or to a cell culture or tissue culture derived from a plant.
- plant part may refer to plant cells, plant tissues and plant organs, as well as cell/tissue cultures derived from plant cells, plant tissues and plant cultures.
- Embodiments of the present invention may comprise and/or make use of any suitable plant part, including, but not limited to, anthers, branches, buds, calli, clumps, cobs, cotyledons, ears, embryos, filaments, flowers, fruits, husks, kernels, leaves, lodicules, ovaries, palea, panicles, pedicels, pods, pollen, protoplasts, roots, root tips, seeds, silks, stalks, stems, stigma, styles, and tassels.
- the plant part is a plant germplasm.
- polymorphism refers to a variation in the nucleotide sequence at a locus, where said variation is too common to be due merely to a spontaneous mutation.
- a polymorphism generally has a frequency of at least about 1% in a population.
- a polymorphism can be a single nucleotide polymorphism (SNP), or an insertion/deletion polymorphism, also referred to herein as an “indel.” Additionally, the variation can be in a transcriptional profile or a methylation pattern.
- the polymorphic site or sites of a nucleotide sequence can be determined by comparing the nucleotide sequences at one or more loci in two or more germplasm entries.
- polypeptide As used herein, the terms “polypeptide,” “peptide” and “protein” refer to a polymer of amino acid residues. The terms encompass amino acid polymers in which one or more amino acid residue is an artificial chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers.
- progeny refers to the descendant(s) of a particular cross. Typically, progeny result from breeding of two individuals, although some species (particularly some plants and hermaphroditic animals) can be selfed (i.e., the same plant acts as the donor of both male and female gametes). The descendant s) can be, for example, of the Fl, the F2, or any subsequent generation.
- MIR162 plants are described in U.S. Pat. No. 8,232,456. Methods of generating and identifying MIR162 plants are described in Example 1 of U.S. Pat. No. 8,232,456.
- a MIR162 plant can be hemizygous or homozygous for the MIR162 event.
- the MIR162 plants comprise unique nucleic acid sequences that are not found in plants not comprising the MIR162 event.
- the unique nucleic acid sequences include junction sequences, a Vip3Aa20 coding sequence, which encodes a Vip3Aa20 insecticidal protein that confers insect resistance, and a pmi coding sequence, which encodes a PMI protein that confers upon corn cells the ability to utilize mannose as a carbon source.
- a MIR162 plant typically comprises nucleic acid sequences one or more of SEQ ID NO: 224-231.
- the MIR162 plant comprises nucleic acid sequences SEQ ID NO: 224 (Vip3A coding sequence), SEQ ID NO: 227 (the 5’ genome-insert junction sequence), and 228 (the 3’ genome-insert junction sequence).
- seeds from MIR162 plants were deposited in accordance with the Budapest Treaty at the American Type Culture Collection (ATCC), 1801 University Boulevard, Manassas, Virginia, 20110 under ATCC Accession No. PTA-6188.
- Introducing the MIR162 event into maize plants is effective in controlling lepidopteran insect pests including, but not limited to, black cutworm (BCW, Agrotis ipsilon), fall armyworm (FAW, Spodoptera frugiperda), tobacco budworm (TBW, Heliothis virescens), sugarcane borer (SCB, Diatraea saccharalis), lesser cornstalk borer (LCB, Elasmopalpus lignosellus), corn earworm (CEW, Helicoverpa zea), and western bean cutworm (WBCW, Striacosta albicosta).
- MIR162 event is associated with unsuccessful event conversion in some maize backgrounds, i.e., introduction of the MIR162 event into certain maize backgrounds results in the plants having reduced, in some cases severely reduced, male fertility.
- the overall unsuccessful event conversion rate for MIR162 across maize backgrounds was observed to be about 41-47% (Table 2).
- the disclosure provides various QTLs with marker loci that are associated with fertility of MIR162 plants. These QTLs comprise marker loci associated with fertility of MIR162 plants. These QTLs include, but are not limited to, a QTL on chromosome 5 that comprises one or more marker loci of SEQ ID NOs: 1-5; a QTL on chromosome 2 that comprises one or more of SEQ ID NO: 6-55; a QTL on chromosome 6 that comprises one or more of SEQ ID NO: 56-89; a QTL on chromosome 7 that comprises one or more of SEQ ID NO: 90- 131; a a QTL located on chromosome 8 comprising SEQ ID NO: 255; and a QTL on chromosome 9 that comprises one or more of SEQ ID NO: 132-223.
- a QTL on chromosome 5 that comprises one or more marker loci of SEQ ID NOs: 1-5
- a QTL on chromosome 2 that comprises one or more of SEQ
- the fertile QTL i.e., QTLs comprising marker loci with alleles that are associated with increased fertility of MIR162 plants having these QTLs.
- the fertile QTL is located on chromosome 5 and comprises one or more marker loci of SEQ ID NOs: 1-5, with the position number 100 of the sequences being the favorable alleles shown in Table 3, i.e., G, C, G, A, and C, respectively.
- the fertile QTL comprises SEQ ID NO: 2, with the position number 100 of the sequence being C.
- the fertile QTL comprises SEQ ID NO: 2, with the position number 100 of the sequence being C and one or more of SEQ ID NO: 1 and 3-5, with the position number 100 of the sequences being G, G, A, and C, respectively.
- the fertile QTL is on chromosome 2, comprising one or more of SEQ ID NO: 6-55, the position number 100 of the sequences being the favorable alleles shown in Table 4.
- the fertile QTL is on chromosome 6, comprising one or more of SEQ ID NO: 56-89, the position number 100 of the sequences being the favorable alleles shown in Table 4.
- the fertile QTL is on chromosome 7, comprising one or more of SEQ ID NO: 90- 131, the position number 100 of the sequences being the favorable alleles shown in Table 4.
- the fertile QTL is on chromosome 8 and comprises SEQ ID NO: 255, with position 100 of the sequence being the favorable allele shown in Table 9.
- the fertile QTL is on chromosome 9 and comprises one or more of SEQ ID NO: 132-223, with position 100 of the sequences being the favorable alleles shown in Table 4.
- the one or more fertile QTLs comprise a combination of SEQ ID NO: 2, 52, 88, 139, and 94, wherein position number 100 of each sequence, respectively and in combination, comprises one of the following: (i) C, A, A, T, and T; (ii) C, A, A, C, and T; (iii) C, A, G, C, and T; (iv) C, G, A, C, and T; (v) C, G, G, C, and T; (vi) C, G, G, C, and C; (vii) A, A, A, T, and T; (viii) A, A, G, T, and T; (ix) A, A, A, C, and T; (x) A, G, A, T, and T; (xi) A, G, A, C, and T, or (xii) A, A, G, C, and T.
- the one or more fertile QTLs comprise one of the following: (i) at least one of SEQ ID NO: 2 with a C at position 100, SEQ ID NO: 52 with an A at position 100, SEQ ID NO: 88 with an A at position 100, SEQ ID NO: 139 with a T at position 100, or SEQ ID NO: 94 with a T at position 100; (ii) at least one of SEQ ID NO: 2 with a C at position 100, SEQ ID NO: 52 with an A at position 100, SEQ ID NO: 88 with an A at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO: 94 with a T at position 100; (iii) at least one of SEQ ID NO: 2 with a C at position 100, SEQ ID NO: 52 with an A at position 100, SEQ ID NO: 88 with a G at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO:
- the one or more fertile QTLs comprise SEQ ID NO: 2, wherein position number 100 is C or A; SEQ ID NO: 52, wherein position number 100 is A or G; SEQ ID NO: 88, wherein position number 100 is A or G; SEQ ID NO: 139, wherein position number 100 is T or C; and SEQ ID NO: 94, wherein position number 100 is T or C.
- the one or more fertile QTLs comprise one or more of SEQ ID NO: 2, wherein position number 100 is C or A; SEQ ID NO: 52, wherein position number 100 is A or G; SEQ ID NO: 88, wherein position number 100 is A or G; SEQ ID NO: 139, wherein position number 100 is T or C; or SEQ ID NO: 94, wherein position number 100 is T or C.
- the infertile QTLs include SEQ ID NO: 1-5, with position number 100 of the sequences being the unfavorable alleles in Table 3, i.e., A, A, A, G, and A, respectively.
- the infertile QTL is on chromosome 2, comprising one or more of SEQ ID NO: 6-55, the position number 100 of the sequences being the unfavorable alleles shown in Table 4.
- the infertile QTL is on chromosome 6, comprising one or more of SEQ ID NO: 56-89, the position number 100 of the sequences being the unfavorable alleles shown in Table 4.
- the infertile QTL is on chromosome 7, comprising one or more of SEQ ID NO: 90- 131, the position number 100 of the sequences being the unfavorable alleles shown in Table 4.
- the infertile QTL is on chromosome 8, comprising SEQ ID NO: 255, with position 100 of the sequence being the unfavorable alle shown in Table 9.
- the infertile QTL is on chromosome 9 and comprises one or more of SEQ ID NO: 132-223, with position 100 of the sequences being the unfavorable alleles shown in Table 4.
- the one or more infertile QTLs comprise a combination of SEQ ID NO: 2, 52, 88, 139, and 94, wherein position number 100 of each sequence, respectively and in combination, is the one of the following: (i) A, G, G, T, and T; (ii) A, G, G, C, and T; (iii) A, A, G, C, C; (iv) A, G, G, T, C; or (v) A, G, G, C, C.
- the one or more infertile QTLs comprise one of the following: (i) at least one of SEQ ID NO: 2 with an A at position 100, SEQ ID NO: 52 with a G at position 100, SEQ ID NO: 88 with a G at position 100, SEQ ID NO: 139 with a T at position 100, or SEQ ID NO: 94 with a T at position 100; (ii) at least one of SEQ ID NO: 2 with an A at position 100, SEQ ID NO: 52 with a G at position 100, SEQ ID NO: 88 with a G at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO: 94 with a T at position 100; (iii) at least one of SEQ ID NO: 2 with an A at position 100, SEQ ID NO: 52 with an A at position 100, SEQ ID NO: 88 with a G at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO: 94 with a T
- the presence of the QTL is confirmed by a locus that is closely linked to the QTLs, for example a locus that is within about 20 centimorgan (cM ), 15 cM, 10 cM, 5 cM, 4 cM, 3 cM, 2 cM, I cM, 0.5 cM, or less than 0.5 cM of the QTL.
- a locus that is closely linked to the QTLs for example a locus that is within about 20 centimorgan (cM ), 15 cM, 10 cM, 5 cM, 4 cM, 3 cM, 2 cM, I cM, 0.5 cM, or less than 0.5 cM of the QTL.
- a transposable element that is associated with fertility of MIR162 plants.
- the TE is located on chromosomal 5.
- the TE is located on Chromosome 5: 130436801- 130439201 in B73_ver5.
- the TE comprises SEQ ID NO: 232 or 233.
- the TE comprises a sequence sharing at least 80%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% sequence identity to SEQ ID NO: 232 or 233.
- SEQ ID NO: 232 or 233 the presence of the TE (SEQ ID NO: 232 or 233) on chromosome 5 was shown to positively correlate with conversion success.
- plants with one or more fertile QTLs and/or a TE disclosed herein are selected and used to cross with MIR162 plants to produce further MIR162 plants, e.g., hybrid plants.
- the MIR162 event can be introduced to the plants with one or more fertile QTLs disclosed herein to produce MIR162 plants having increased fertility.
- the MIR162 event can be introduced to the plants with a TE disclosed herein into Chromosome 2 or 5 at the location disclosed above to produce MIR162 plants having increased fertility.
- the genome of MIR162 plants are edited to remove the unfavorable alleles and introduce favorable allele and/or the TE to increase fertility.
- MIR162 plants with one or more fertile QTLs and/or a TE associated with increased fertility can be created by crossing a first maize plant comprising the fertile QTL(s) and/or the TE with a second maize plant to produce a plurality of progeny plants.
- the first maize plant is an MIR162 plant.
- the second maize plant is an MIR162 plant.
- both the first and second maize plant are MIR162 plants.
- the MIR162 maize plant is obtained from growing a seed deposited at the ATCC under the accession No. PTA-6188. The crosses of the first and second maize plants produce a plurality of progeny plants.
- progeny plants are genotyped to confirm (i) the presence of fertile QTLs and/or the absence of infertile QTLs and/or (ii) the presence or absence of the TE associated with increased fertility using the methods disclosed in the section above.
- retention of the MIR162 event in these progeny plants can be confirmed by detecting the presence of one or more nucleic acids (e.g., one or more nucleic acids encoding Vip3Aa) or proteins that are unique to the MIR162 event and/or by detecting the insect resistance in the plant due to the presence of the MIR162 event.
- nucleic acids e.g., one or more nucleic acids encoding Vip3Aa
- the method of detecting one or more nucleic acids that are unique to the MIR162 event comprises (a) contacting the sample with a pair of polynucleotide primers that, when used in a nucleic acid amplification reaction with genomic DNA from the MIR162 event produces an amplicon that is diagnostic for the MIR162 event; (b) performing a nucleic acid amplification reaction, thereby producing the amplicon; and (c) detecting the amplicon.
- the amplicon comprises a nucleotide sequence selected from the group consisting of SEQ ID NO: 224-231 and the complements thereof.
- Vip3A protein e.g., Vip3Aa
- ELISA enzyme-linked immunosorbent assay
- progeny plants comprising the fertile QTL(s) and/or a TE associated with increased fertility and MIR162 events can be backcrossed with a parent plant comprising the MIR162 events and/or the fertile QTL(s) and/or the TE.
- the Fl progeny plants from the cross can be assayed for one or more fertile QTLs or a TE associated with increased fertility disclosed herein.
- a selected Fl progeny plant is then backcrossed with the parent MIR162 plant (recurrent parent). Plants from the BC1 generation are also genotyped for the one or more fertile QTLs or a TE associated with increased fertility disclosed herein.
- a new elite maize inbred line comprising both one or more fertile QTLs or a TE associated with increased fertility disclosed herein and the MIR162 event).
- inbred plants can be used as parent plants, e.g., as male parent plants, to create hybrid plants, which are a preferred commercial embodiment of maize in most countries due to heterosis of the hybrids.
- MIR162 plants with fertile QTL(s) can be created by selecting a maize plant comprising one or more fertile QTLs or a TE associated with increased fertility as disclosed herein, introducing a MIR162 event into said maize plant, and growing the plant to result in a MIR162 plant comprising these fertile QTL(s) and/or TE.
- the maize plant comprising one or more fertile QTL(s) and/or TE disclosed herein is a known maize line, for example, any one of the maize lines listed in Table 5.
- the MIR162 event can be introduced into corn plants by transforming nucleic acid sequences that are unique to the MIR162 event into corn plants using methods well known in the art. See Section II above, entitled “Vip3 A plants and fertility.” One illustrative example is described in Example 1 of U.S. Pat. No. 8,232,456. 3. Genome editing
- one or more fertile QTLs and a TE associated with increased fertility can be introduced into a maize plant by genome editing.
- Various embodiments of the methods described herein use genome editing.
- genome editing is used to modify the genome of a plant to produce plants having one or more of the fertile QTLs and/or a TE associated with increased fertility to increase the fertility of MIR162 plants.
- the genome of an MIR162 plant is edited to replace the one or more unfavorable alleles in SEQ ID NOs: 1-223 or 255 with the corresponding favorable alleles in Tables 3, 4 or 9.
- genome editing can be performed in a maize plant to replace the unfavorable allele A at position 100 in SEQ ID NO: 2 with the favorable allele C.
- genome editing is used to introduce the favorable allele from any of the loci in Table 4, such as at chromosome 2 at position 243529534, chromosome 5 at position 129259011, chromosome 6 at position 158572433, chromosome 7 at position 134444680, or chromosome 9 at position 8013654, or an allelic combination in Table 4, into a maize plant that does not contain such allele(s) or allelic combination and would otherwise not succeed in MIR162 event conversion. It is expected that introduction of the favorable allele(s) or allelic combination will improve the likelihood of a successful MIR162 event conversion.
- the genome of a MIR162 plant is edited to insert a TE associated with increased fertility into the genome.
- the TE is inserted on chromosome 5.
- the TE is inserted into Chromosome 5 at positions 130436801- 130439201 in B73_ver5.
- the TE comprises SEQ ID NO: 232 or 233.
- Methods of editing genomes are well known in the art. Such methods include, but are not limited to, meganucleases designed against the plant genomic sequence of interest CRISPR-Cas9, TALENs, and other technologies for precise editing of genomes (Feng, et al. Cell Research 23: 1229-1232, 2013, WO 2013/026740); Cre-lox site-specific recombination; FLP-FRT recombination (Li et al. (2009) Plant Physiol 151 : 1087-1095); Bxbl -mediated integration (Yau et al. Plant J (2011) 701 : 147-166); zinc-finger mediated integration (Wright et al.
- genome editing may involve transient, inducible, or constitutive expression of the genome editing components or systems in the target plant or bombardment of the genome editing components directly into the plant.
- Genome editing may involve genomic integration or episomal presence of the genome editing components or systems.
- Genome editing generally refers to the use of a site-directed nuclease (including but not limited to CRISPR/Cas, zinc fingers, meganucleases, and the like) to cut a nucleotide sequence at a desired location. This may be to cause a deletion or an insertion/deletion (“indel”) mutation, (i.e., “SDN1”), a base edit (i.e., “SDN2”), or allele insertion or replacement (i.e., “SDN3”).
- indel insertion/deletion
- SDN2 or SDN3 genome editing may comprise the provision of one or more recombination templates (e.g., in a vector) comprising a gene sequence of interest that can be used for homology directed repair (HDR) within the plant (i.e., to be introduced into the plant genome).
- the gene or allele of interest is one that is able to confer increased fertility as disclosed herein.
- the recombination template can be introduced into the plant to be edited either through transformation or through breeding with a donor plant comprising the recombination template. Breaks in the plant genome may be introduced within, upstream, and/or downstream of a target sequence. In some embodiments, a double strand DNA break is made within or near the target sequence locus.
- breaks are made upstream and downstream of the target sequence locus, which may lead to its excision from the genome.
- one or more single strand DNA breaks are made within, upstream, and/or downstream of the target sequence (e.g., using a nickase Cas9 variant). Any of these DNA breaks, as well as those introduced via other methods known to one of skill in the art, may induce HDR.
- the target sequence is replaced by the sequence of the provided recombination template comprising a polynucleotide of interest, e.g., a fertile QTL and/or TE.
- mutations in the genes of interest described herein may be generated without the use of a recombination template via targeted introduction of DNA double strand breaks. Such breaks may be repaired through the process of non-homologous end joining (NHEJ), which can result in the generation of small insertions or deletions (indels) at the repair site. Such indels may lead to frameshift mutations causing premature stop codons or other types of loss-of-function mutations in the targeted genes.
- NHEJ non-homologous end joining
- the nucleic acid modification or mutation is effected by a (modified) zinc-finger nuclease (ZFN) system.
- ZFN zinc-finger nuclease
- the ZFN system uses artificial restriction enzymes generated by fusing a zinc finger DNA-binding domain to a DNA-cleavage domain that can be engineered to target desired DNA sequences. Exemplary methods of genome editing using ZFNs can be found for example in U.S. Patent Nos. 6,534,261; 6,607,882; 6,746,838; 6,794,136; 6,824,978; 6,866,997; 6,933,113; and 6,979,539.
- the nucleic acid modification is effected by a (modified) meganuclease, which are endodeoxyribonucleases characterized by a large recognition site (double-stranded DNA sequences of 12 to 40 base pairs).
- a (modified) meganuclease which are endodeoxyribonucleases characterized by a large recognition site (double-stranded DNA sequences of 12 to 40 base pairs).
- Exemplary method for using meganucleases can be found in US Patent Nos: 8,163,514; 8,133,697; 8,021,867; 8,119,361; 8,119,381; 8,124,369; and 8,129,134, which are specifically incorporated by reference.
- the nucleic acid modification is effected by a (modified) CRISPR/Cas complex or system.
- the CRISPR/Cas system or complex is a class 2 CRISPR/Cas system.
- said CRISPR/Cas system or complex is a type II, type V, or type VI CRISPR/Cas system or complex.
- the CRISPR/Cas system does not require the generation of customized proteins to target specific sequences but rather a single Cas protein can be programmed by an RNA guide (gRNA) to recognize a specific nucleic acid target, in other words the Cas enzyme protein can be recruited to a specific nucleic acid target locus (which may comprise or consist of RNA and/or DNA) of interest using said short RNA guide.
- gRNA RNA guide
- CRISPR/Cas or CRISPR system is as used herein and as described in foregoing documents refers collectively to transcripts and other elements involved in the expression of or directing the activity of CRISPR-associated (“Cas”) genes, including sequences encoding a Cas gene and one or more of, a tracr (trans-activating CRISPR) sequence (e.g.
- RNA(s) as that term is herein used (e.g., RNA(s) to guide Cas, such as Cas9, e.g. CRISPR RNA and, where applicable, transactivating (tracr) RNA or a single guide RNA (sgRNA) (chimeric RNA)) or other sequences and transcripts from a CRISPR locus.
- RNA(s) e.g., RNA(s) to guide Cas, such as Cas9, e.g. CRISPR RNA and, where applicable, transactivating (tracr) RNA or a single guide RNA (sgRNA) (chimeric RNA)) or other sequences and transcripts from a CRISPR locus.
- a CRISPR system is characterized by elements that promote the formation of a CRISPR complex at the site of a target sequence (also referred to as a protospacer in the context of an endogenous CRISPR system).
- target sequence refers to a sequence to which a guide sequence is designed to have complementarity, where hybridization between a target sequence and a guide sequence promotes the formation of a CRISPR complex.
- a target sequence may comprise any polynucleotide, such as DNA or RNA polynucleotides.
- the gRNA is a chimeric guide RNA or single guide RNA (sgRNA).
- the gRNA comprises a guide sequence and a tracr mate sequence (or direct repeat).
- the gRNA comprises a guide sequence, a tracr mate sequence (or direct repeat), and a tracr sequence.
- the CRISPR/Cas system or complex as described herein does not comprise and/or does not rely on the presence of a tracr sequence (e.g. if the Cas protein is Casl2a).
- the Cas protein as referred to herein such as but not limited to Cas9, Cas 12a (formerly referred to as Cpfl), Cas 12b (formerly referred to as C2cl), Cas 13a (formerly referred to as C2c2), C2c3, Cast 3b protein, may originate from any suitable source, and hence may include different orthologues, originating from a variety of (prokaryotic) organisms, as is well documented in the art.
- the Cas protein is (modified) Cas9, preferably (modified) Staphylococcus aureus Cas9 (SaCas9) or (modified) Streptococcus pyogenes Cas9 (SpCas9).
- the Cas protein is Casl2a, optionally from Acidaminococcus sp., such as Acidaminococcus sp. BV3L6 Cpfl (AsCasl2a) or Lachnospiraceae bacterium Cas 12a, such as Lachnospiraceae bacterium MA2020 or Lachnospiraceae bacterium MD2006 (LBCasl2a). See U.S. Pat. No. 10,669,540, incorporated herein by reference in its entirety.
- the Casl2a protein may be from Moraxella bovoculi AAX08_00205 [Mb2Casl2a] or Moraxella bovoculi AAXl l_00205 [Mb3Casl2a], See WO 2017/189308, incorporated herein by reference in its entirety.
- the Cas protein is (modified) C2c2, preferably Leptotrichia wadei C2c2 (LwC2c2) or Listeria newyorkensis FSL M6-0635 C2c2 (LbFSLC2c2).
- the (modified) Cas protein is C2cl.
- the (modified) Cas protein is C2c3.
- the (modified) Cas protein is Cas 13b.
- Other Cas enzymes are available to a person skilled in the art.
- Genome editing methods and compositions are also disclosed in US Pat. Nos. 10,519,456 and 10,285,348 82, the entire content of which is herein incorporated by reference.
- the gene-editing machinery (e.g., the DNA modifying enzyme) introduced into the plants can be controlled by any promoter that can drive recombinant gene expression in plants.
- the promoter is a constitutive promoter.
- the promoter is a tissue-specific promoter, e.g., a pollen-specific promoter or a sperm cell specific promoter, a zygote specific promoter, or a promoter that is highly expressed in sperm, eggs and zygotes (e.g., prOsActinl).
- Suitable promoters are disclosed in U.S. Pat. No. 10,519,456, the entire content of which is herein incorporated by reference.
- the guide RNA and the Cas protein may be delivered in DNA form, e.g., in a suitable vector that can be introduced into a yeast cell.
- DNA encoding the gRNA is cloned into a vector downstream of a promoter for expression.
- the sgRNA and Cas may be expressed from the same vector of the system or from different vectors.
- the vectors are separately transformed into the maize plant to induce genome editing.
- the coding sequence for Cas9 and the coding sequence for the sgRNA are ligated into a single vector, which is then transformed into the maize plant to induce genomic modification.
- the Cas9 vector and the sgRNA vector typically contains a selection marker, for example, spectinomycin, for identification of transformants comprising the genome editing machinery.
- the method of introducing desired genomic modifications comprises using a first maize plant expressing a DNA modification enzyme and at least one optional guide nucleic acid as described above to pollinate a target plant comprising genomic DNA to be edited.
- target nucleic acid sequence e.g., a particular allele (e.g., a favorable allele in Table 3) can be detected by methods including, but not limited to, nucleic acid sequencing, hybridization methods, amplification methods (e.g., PCR-based sequence specific amplification methods), detection of restriction fragment length polymorphisms (RFLP), detection of isozyme markers, detection of polynucleotide polymorphisms by allele specific hybridization (ASH), detection of amplified variable sequences of the plant genome, detection of self-sustained sequence replication, detection of simple sequence repeats (SSRs), detection of single nucleotide polymorphisms (SNPs), detection of amplified fragment length polymorphisms (AFLPs), detection of expressed sequence tags (ESTs).
- the detection of the target nucleic acid sequence can be facilitated through the use of nuclear acid amplification methods.
- nuclear acid amplification methods specifically increase concentration of poly nucleotides that span the polymorphic site, or include that site and sequences located either distal or proximal to it.
- amplified molecules can be readily detected by gel electrophoresis fluorescent detection methods or other means.
- a target nucleic acid sequence can be detected by probe ligation methods as provided in U.S. Pat. No. 5,800,944 where sequence of interest is amplified and hybridized to probes followed by ligation to detect a labeled part of the probe.
- Target nucleic acid sequence can also be detected by probe linking methods, employing at least one pair of probes having sequences homologous to adjacent portions of the target nucleic acid sequence and having side chains which non-covalently bind to form a stem upon base pairing of the probes to the target nucleic acid sequence. See, U.S. Pat. No. 5,616,464.
- Polymorphism in nucleic acid sequences can be detected using various methods.
- polymorphism in nucleic acid sequences is detected by hybridization to allele specific oligonucleotide probes.
- single or multiple nucleotide variations in nucleic acid sequence are detected in nucleic acids by a process in which the sequence containing the nucleotide variation is amplified, spotted on a membrane and treated with a labeled sequence-specific oligonucleotide probe. See, U.S. Pat. Nos. 5,468,613 and 5,217,863.
- polymorphism in nucleic acid sequences is detected using microarray, in which oligonucleotide probe sets are assembled in an overlapping fashion to represent a single sequence such that a difference in the target sequence at one point would result in partial probe hybridization (Borevitz et al., Large-scale identification of single-feature polymorphism in complex genomes. Genome Research, 13:513-523 (2003).
- SBE single base extension
- SBE methods include, but are not limited, to those provided in U.S. Pat. Nos. 6,004,744; 6,013,431; 5,595,890; 5,762,876; and 5,945,283.
- SBE methods are based on extension of a nucleotide primer that is adjacent to a polymorphism to incorporate a detectable nucleotide residue upon extension of the primer. If the polymorphism is present on the template, one of the labeled dideoxynucleosidetriphosphates can be added to the primer in a single base chain extension.
- the allele present is then inferred by determining which of the two differential labels was added to the extension primer. Homozygous samples will result in only one of the two labeled bases being incorporated and thus only one of the two labels will be detected. Heterozygous samples have both alleles present, and will thus direct incorporation of both labels (into different molecules of the extension primer) and thus both labels will be detected.
- polymorphism in nucleic acid sequences can be detected by methods provided in U.S. Pat. Nos. 5,210,015; 5,876,930; and 6,030,787 in which an oligonucleotide probe having a 5’ fluorescent reporter dye and a 3’ quencher dye covalently linked to the 5’ and 3’ ends of the probe.
- an oligonucleotide probe having a 5’ fluorescent reporter dye and a 3’ quencher dye covalently linked to the 5’ and 3’ ends of the probe.
- the proximity of the reporter dye to the quencher dye results in the suppression of the reporter dye fluorescence, e.g., by Forster-type energy transfer.
- forward and reverse primers hybridize to a specific sequence of the target DNA flanking a polymorphism while the hybridization probe hybridizes to polymorphismcontaining sequence within the amplified PCR product.
- DNA polymerase with 5’— >3’ exonuclease activity cleaves the probe and separates the reporter dye from the quencher dye resulting in increased fluorescence of the reporter.
- a target nucleic acid sequence can be detected using nucleic acid sequencing technologies.
- Methods for nucleic acid sequencing are known in the art and include technologies provided by 454 Life Sciences (Branford, Conn.), Agencourt Bioscience (Beverly, Mass.), Applied Biosystems (Foster City, Calif.), LLCOR Biosciences (Lincoln, Nebr.), NimbleGen Systems (Madison, Wis.), Illumina (San Diego, Calif.), Pac-Bio (Menlo Park, Calif.) and VisiGen Biotechnologies (Houston, Tex.).
- Such nucleic acid sequencing technologies comprise formats such as parallel bead arrays, sequencing by ligation, capillary electrophoresis, electronic microchips, “biochips,” microarrays, parallel microchips, and single-molecule arrays.
- kits for genotyping a plant to detect the presence of favorable alleles in the QTLs disclosed herein, or to detect the presence of a TE that is associated with increased fertility.
- the kit may comprise one of more reagents disclosed above, for example, the kit may comprise reagents used for detecting the presence of one or more marker loci of SEQ ID Nos: 1-223 or 255; and/or a transposon element (TE) of SEQ ID NO: 232 or 233.
- TE transposon element
- the kit comprises one or more of the following: primers for sequencing the one or more marker loci and/or the TE (for example, primers comprising at least 10 consecutive nucleotides that are complementary to any one of SEQ ID Nos: 1-223 or 255 or 232); primers for amplifying the one or more marker loci and/or the TE (for example, primers comprising at least 10 consecutive nucleotides that are complementary to any one of SEQ ID Nos: 1-223 or 255 or 232), or a nucleic acid molecule that is complementary to one or more marker loci, amplify and/or sequence one or more marker loci and/or the TE. can be used for amplifying and/or sequencing the alleles.
- primers for sequencing the one or more marker loci and/or the TE for example, primers comprising at least 10 consecutive nucleotides that are complementary to any one of SEQ ID Nos: 1-223 or 255 or 232
- the MIR162 plants with one or more fertile QTLs produced herein can be stacked with one or more polynucleotides encoding a desirable trait such as a polynucleotide that confers, for example, insect, disease or herbicide resistance or other desirable agronomic traits of interest including, but not limited to, traits associated with high oil content; increased digestibility; balanced amino acid content; and/or high energy content.
- a desirable trait such as a polynucleotide that confers, for example, insect, disease or herbicide resistance or other desirable agronomic traits of interest including, but not limited to, traits associated with high oil content; increased digestibility; balanced amino acid content; and/or high energy content.
- Such traits may refer to properties of both seed and non-seed plant tissues, or to food or feed prepared from plants or seeds having such traits.
- Example polynucleotides of interest include_the following events: Btl l (see US Patent No. US6114608), MIR604 (see US Patent No. US8884102), 5307 (see US Patent No. US10428393), MZIR098 (see US Patent Application No. US20200190533), TC1507 (see US Patent No. US7288643), DAS-59122-7 (see US Patent No. US7323556), MON810 (see US6713259), MON863 (see US Patent No. US7705216), MON89034 (see US Patent No. US8062840), MON88017 (see US Patent No. US9556492), DP-4114 (see US Patent No.
- CN112852801 Choinese Patent Application No. CN112831584), LP007-3 (Chinese Patent Application No. CN112877454), LP007-4 (Chinese Patent Application No. CN112831585), LP007-5 (Chinese Patent Application No. CN113151534), LP007-6 (Chinese Patent Application No. CN113151533), LP007-7 (Chinese Patent Application No. CN112852991), LP007-8 (CN113980958), Ruifeng8, ND207, Ruifengl25 (see Chinese Patent Application No. CN105017391), GA21 (see PCT Publication No.
- WO98/44140 NK603 (see US Patent No. US6825400), DAS40278 (see PCT Publication No. WO20 11/022469), DBN9858 (see PCT Publication No. WO2016173508), MON87429 (see PCT Publication No. WO19/152316), LW2-2 (see Chinese Patent Application No. CN113278721), T25 (see USDA/APHIS Petition 94-357-01 for Determination of Nonregulated Status for Glufosinate Resistant Com Transformation Events T14 and T25, June 1995), or the 3272 event (see US Patent No. US7635799).
- gene or trait “stacking” is combining desired genes or traits into one transgenic plant line.
- plant breeders stack transgenic traits by making crosses between parents that each have a desired trait and then identifying offspring that have both of these desired traits (so-called “breeding stacks”).
- Another way to stack genes is by transferring two or more genes into the cell nucleus of a plant at the same time during transformation.
- Another way to stack genes is by re-transforming a transgenic plant with another gene of interest.
- gene stacking can be used to combine two different insect resistance traits, an insect resistance trait and a disease resistance trait, or a herbicide resistance trait (such as, for example, Btl l).
- the use of a selectable marker in addition to a gene of interest would also be considered gene stacking.
- a nucleic acid molecule or vector of the disclosure can include an additional coding sequence for one or more polypeptides or double stranded RNA molecules (dsRNA) of interest for agronomic traits that primarily are of benefit to a seed company, grower or grain processor.
- a polypeptide of interest can be any polypeptide encoded by a nucleotide sequence of interest.
- Non-limiting examples of polypeptides of interest that are suitable for production in plants include those resulting in agronomically important traits such as herbicide resistance (also sometimes referred to as “herbicide tolerance”), virus resistance, bacterial pathogen resistance, insect resistance, nematode resistance, or fungal resistance. See, e.g., U.S.
- the polypeptide also can be one that increases plant vigor or yield (including traits that allow a plant to grow at different temperatures, soil conditions and levels of sunlight and precipitation), or one that allows identification of a plant exhibiting a trait of interest (e.g., a selectable marker, seed coat color, relative maturity group, etc.).
- Various polypeptides of interest, as well as methods for introducing these polypeptides into a plant are described, for example, in US Patent Nos.
- stacking this resistance trait with the MIR162 event by expressing the additional polypeptide in a MIR162 plant, can result in maize plants protected from insect feeding damage to a greater degree than the insect resistance traits conferred by the MIR162 event alone.
- Polynucleotides conferring resistance/tolerance to an herbicide that inhibits the growing point or meristem can also be suitable in some embodiments.
- Exemplary polynucleotides in this category code for mutant ALS and AHAS enzymes as described, e.g., in U.S. Patent Nos. 5,767,366 and 5,928,937.
- U.S. Patent Nos. 4,761,373 and 5,013,659 are directed to plants resistant to various imidazalinone or sulfonamide herbicides.
- 4,975,374 relates to plant cells and plants containing a nucleic acid encoding a mutant glutamine synthetase (GS) resistant to inhibition by herbicides that are known to inhibit GS, e.g., phosphinothricin and methionine sulfoximine.
- GS glutamine synthetase
- U.S. Patent No. 5,162,602 discloses plants resistant to inhibition by cyclohexanedione and aryloxyphenoxypropanoic acid herbicides. The resistance is conferred by an altered acetyl coenzyme A carboxylase (ACCase).
- Polypeptides encoded by nucleotides sequences conferring resistance to glyphosate are also suitable for the disclosure. See, e.g., U.S. Patent No. 4,940,835 and U.S. Patent No. 4,769,061.
- U.S. Patent No. 5,554,798 discloses transgenic glyphosate resistant maize plants, which resistance is conferred by an altered 5-enolpyruvyl-3-phosphoshikimate synthase (EPSPS) gene.
- EPSPS 5-enolpyruvyl-3-phosphoshikimate synthase
- Polynucleotides coding for resistance to phosphono compounds such as glufosinate ammonium or phosphinothricin, and pyridinoxy or phenoxy propionic acids and cyclohexones are also suitable. See, European Patent Application No. 0 242 246. See also, U.S. Patent Nos. 5,879,903, 5,276,268, and 5,561,236.
- suitable polynucleotides include those coding for resistance to herbicides that inhibit photosynthesis, such as a triazine and a benzonitrile (nitrilase) See, U.S. Patent No. 4,810,648. Additional suitable polynucleotides coding for herbicide resistance include those coding for resistance to 2,2-dichloropropionic acid, sethoxydim, haloxyfop, imidazolinone herbicides, sulfonylurea herbicides, triazolopyrimidine herbicides, s-triazine herbicides and bromoxynil.
- polynucleotides conferring resistance to a protox enzyme, or that provide enhanced resistance to plant diseases; enhanced tolerance of adverse environmental conditions (abiotic stresses) including but not limited to drought, excessive cold, excessive heat, or excessive soil salinity or extreme acidity or alkalinity; and alterations in plant architecture or development, including changes in developmental timing. See, e.g., U.S. Patent Publication No. 2001/0016956 and U.S. Patent No. 6,084,155.
- Additional suitable polynucleotides include those coding for insecticidal polypeptides. These polypeptides may be produced in amounts sufficient to control, for example, insect pests (i.e., insect controlling amounts). It is recognized that the amount of production of an insecticidal polypeptide in a plant necessary to control insects or other pests may vary depending upon the cultivar, type of pest, environmental factors and the like. Polynucleotides useful for additional insect or pest resistance include, for example, those that encode toxins identified in Bacillus organisms.
- Bt insecticidal proteins include the Cry proteins such as CrylAa, CrylAb, CrylAc, CrylB, CrylC, CrylD, CrylEa, CrylFa, Cry3A, Cry9A, Cry9B, Cry9C, and the like, as well as vegetative insecticidal proteins such as Vipl, Vip2, Vip3, and the like.
- an additional polypeptide is an insecticidal polypeptide derived from a non-Bt source, including without limitation, an alpha-amylase, a peroxidase, a cholesterol oxidase, a patatin, a protease, a protease inhibitor, a urease, an alpha-amylase inhibitor, a pore-forming protein, a chitinase, a lectin, an engineered antibody or antibody fragment, a Bacillus cereus insecticidal protein, a Xenorhabdus spp. (such as X. nematophila or X. bovienii) insecticidal protein, a Photorhabdus spp.
- a non-Bt source including without limitation, an alpha-amylase, a peroxidase, a cholesterol oxidase, a patatin, a protease, a protease inhibitor, a urease, an alpha-amylase
- insecticidal protein such as P. luminescens or P. asymobiotica
- Brevibacillus spp. such as B. laterosporous insecticidal protein
- Lysinibacillus spp. such as L. sphearicus
- Chromobacterium spp. such as C. subtsugae or C. foundedae
- Yersinia spp. such as Y. entomophaga
- insecticidal protein such as P. propylaea
- Clostridium spp. such as C. bifermentans
- insecticidal protein such as C. bifermentans
- Pseudomonas spp. such as P. fluorescens
- lignin such as P. fluorescens
- Polypeptides that are suitable for production in plants further include those that improve or otherwise facilitate the conversion of harvested plants or plant parts into a commercially useful product, including, for example, increased or altered carbohydrate content or distribution, improved fermentation properties, increased oil content, increased protein content, modified oil profile, improved digestibility, and increased nutraceutical content, e.g., increased phytosterol content, increased tocopherol content, increased stanol content or increased vitamin content.
- Polypeptides of interest also include, for example, those resulting in or contributing to a reduced content of an unwanted component in a harvested crop, e.g., phytic acid, or sugar degrading enzymes. By “resulting in” or “contributing to” is intended that the polypeptide of interest can directly or indirectly contribute to the existence of a trait of interest (e.g., increasing cellulose degradation by the use of a heterologous cellulase enzyme).
- the polypeptide contributes to improved digestibility for food or feed.
- Xylanases are hemicellulolytic enzymes that improve the breakdown of plant cell walls, which leads to better utilization of the plant nutrients by an animal. This leads to improved growth rate and feed conversion. Also, the viscosity of the feeds containing xylan can be reduced. Heterologous production of xylanases in plant cells also can facilitate lignocellulosic conversion to fermentable sugars in industrial processing.
- a polypeptide useful for the disclosure can be a polysaccharide degrading enzyme. Plants of this disclosure producing such an enzyme may be useful for generating, for example, fermentation feedstocks for bioprocessing.
- enzymes useful for a fermentation process include alpha amylases, proteases, pullulanases, isoamylases, cellulases, hemicellulases, xylanases, cyclodextrin glycotransferases, lipases, phytases, laccases, oxidases, esterases, cutinases, granular starch hydrolyzing enzyme and other glucoamylases.
- Polysaccharide-degrading enzymes include: starch degrading enzymes such as a- amylases (EC 3.2.1.1), glucuronidases (E.C. 3.2.1.131); exo-l,4-a-D glucanases such as amyloglucosidases and glucoamylase (EC 3.2.1.3), P-amylases (EC 3.2.1.2), a-glucosidases (EC 3.2.1.20), and other exo-amylases; starch debranching enzymes, such as a) isoamylase (EC 3.2.1.68), pullulanase (EC 3.2.1.41), and the like; b) cellulases such as exo-1,4-3- cellobiohydrolase (EC 3.2.1.91), exo-l,3-P-D-glucanase (EC 3.2.1.39), P-glucosidase (EC 3.2.1.21); c) L-arabin
- proteases such as fungal and bacterial proteases.
- Fungal proteases include, but are not limited to, those obtained from Aspergillus, Trichoderma, Mucor and Rhizopus, such as A. niger, A. awamori, A. oryzae and M. miehei.
- the polypeptides of this disclosure can be cellobiohydrolase (CBH) enzymes (EC 3.2.1.91).
- the cellobiohydrolase enzyme can be CBH1 or CBH2.
- hemicellulases such as mannases and arabinofuranosidases (EC 3.2.1.55); ligninases; lipases (e.g., E.C. 3.1.1.3), glucose oxidases, pectinases, xylanases, transglucosidases, alpha 1,6 glucosidases (e.g., E.C. 3.2.1.20); esterases such as ferulic acid esterase (EC 3.1.1.73) and acetyl xylan esterases (EC 3.1.1.72); and cutinases (e.g. E.C. 3.1.1.74).
- hemicellulases such as mannases and arabinofuranosidases (EC 3.2.1.55); ligninases; lipases (e.g., E.C. 3.1.1.3), glucose oxidases, pectinases, xylanases, transglucosidases
- Double stranded RNA molecules useful with the disclosure include but are not limited to those that suppress target insect genes.
- gene suppression when taken together, are intended to refer to any of the well-known methods for reducing the levels of protein produced as a result of gene transcription to mRNA and subsequent translation of the mRNA. Gene suppression is also intended to mean the reduction of protein expression from a gene or a coding sequence including posttranscriptional gene suppression and transcriptional suppression.
- Posttranscriptional gene suppression is mediated by the homology between of all or a part of a mRNA transcribed from a gene or coding sequence targeted for suppression and the corresponding double stranded RNA used for suppression and refers to the substantial and measurable reduction of the amount of available mRNA available in the cell for binding by ribosomes.
- the transcribed RNA can be in the sense orientation to effect what is called co-suppression, in the anti-sense orientation to effect what is called anti-sense suppression, or in both orientations producing a dsRNA to effect what is called RNA interference (RNAi).
- Transcriptional suppression is mediated by the presence in the cell of a dsRNA, a gene suppression agent, exhibiting substantial sequence identity to a promoter DNA sequence or the complement thereof to effect what is referred to as promoter trans suppression.
- Gene suppression may be effective against a native plant gene associated with a trait, e.g., to provide plants with reduced levels of a protein encoded by the native gene or with enhanced or reduced levels of an affected metabolite.
- Gene suppression can also be effective against target genes in plant pests that may ingest or contact plant material containing gene suppression agents, specifically designed to inhibit or suppress the expression of one or more homologous or complementary sequences in the cells of the pest.
- genes targeted for suppression can encode an essential protein, the predicted function of which is selected from the group consisting of muscle formation, juvenile hormone formation, juvenile hormone regulation, ion regulation and transport, digestive enzyme synthesis, maintenance of cell membrane potential, amino acid biosynthesis, amino acid degradation, sperm formation, pheromone synthesis, pheromone sensing, antennae formation, wing formation, leg formation, development and differentiation, egg formation, larval maturation, digestive enzyme formation, hemolymph synthesis, hemolymph maintenance, neurotransmission, cell division, energy metabolism, respiration, and apoptosis.
- Maize plants used in the methods and compositions are plants from Zea mays L. subsp. Mays.
- the maize plant or plant part is from the group Zea mays L. subsp. Mays Identata, sometimes referred to as dent corn.
- the maize plant or plant part is from the group Zea mays L. subsp. Mays Indurata, sometimes referred to as flint corn.
- the maize plant or plant part is from the group Zea mays L. subsp. Mays Saccharata, sometimes referred to as sweet com.
- the maize plant or plant part is from the group Zea mays L. subsp.
- the maize plant or plant part is from the group Zea mays L. subsp. Mays Everts, sometimes referred to as popcorn.
- Maize plants that can be identified selected and/or produced with methods and compositions of the present invention include hybrids, inbreds, partial inbreds, members of defined populations and members of undefined populations.
- the maize plant is an elite maize line.
- the elite line is one of NP2222, NP2660, NP2276, NP2391, NP2460, or ID3461 or one of the lines disclosed in Table 5.
- a plant cell, seed, or plant part or harvest product can be obtained from a plant produced as above and the plant cell, seed, or plant part can be screened using methods disclosed above for the evidence of stable incorporation of the polynucleotide.
- plant parts include, but are not limited to, single cells and tissues from pollen, ovules, zygotes, leaves, embryos, roots, root tips, anthers, flowers, flower parts, fruits, stems, shoots, cuttings, and seeds; as well as pollen, ovules, egg cells, zygotes, leaves, embryos, roots, root tips, anthers, flowers, flower parts, fruits, stems, shoots, cuttings, scions, rootstocks, seeds, protoplasts, calli, and the like.
- plant products can be harvested from the plant disclosed above and processed to produce processed products, such as meal, oil, plant extract, starch, fermentation products, digestion products, and the like. These processed products are also within the scope of this invention if they comprise a polynucleotide or polypeptide or variant thereof disclosed herein.
- this disclosure also provides a corn meal.
- this disclosure also provides a method of providing a com meal by crushing oilseed of any of the plants provided herein.
- a MIR162 maize plant can be bred by crossing a first parental maize plant that is a transgenic MIR162 maize plant comprising one or more fertile QTLs and/or TE as described above and a second parental maize, thereby producing a plurality of first progeny plants.
- the second parental maize plant does not comprise MIR162 event. The method further comprises selecting a first progeny plant that comprises the MIR162 event and the one or more fertile QTLs.
- the method further comprises selfing the first progeny plant, thereby producing a plurality of second progeny plants; and then selecting from the second progeny plants that comprise both the MIR162 event and the fertile QTL(s) and/or TE.
- the first progeny plant is repeatedly crossed back to one of its parents through a process referred to herein as “backcrossing”.
- the progeny is backcrossed for at least two generations to the maize parental plant (e.g., the second maize parental plant).
- the “donor” parent refers to the parental plant with the desired gene or locus to be introgressed.
- the “recipient” parent (used one or more times) or “recurrent” parent (used two or more times) refers to the parental plant into which the gene or locus is being introgressed. For example, see Ragot, M. et al.
- Embodiment 1 is a method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1- 223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9, crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, wherein the first maize plant and/or the second maize plant comprises maize event MIR162, and selecting from the plurality of progeny plants a selected progeny MIR162 plant comprising one or more fertile QTLs.
- Embodiment 2 is a method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1- 5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4.
- cM centimorgans
- Embodiment 3 is a method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1- 223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9, and introducing a MIR162 event into the selected maize plant.
- Embodiment 4 is a method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1- 5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,
- Embodiment 5 is the method of any one of embodiments 1-4, wherein the MIR162 event is introduced to the selected maize plant by transforming or breeding in a Vip3A coding sequence into the selected maize plant.
- Embodiment 6 is the method of any one of embodiments 1-4, wherein the method further comprises producing an inbred MIR162 plant from the selected maize plant containing MIR162.
- Embodiment 7 is the method of any one of embodiments 1-6, wherein the first maize plant comprises a QTL associated with increased fertility and the QTL comprises SEQ ID NO: 2, with a C at the position number 100.
- Embodiment 8 is the method of any one of embodiments 1-6 , wherein the one or more fertile QTLs comprise one or more of: SEQ ID NO: 1 with a G at position number 100, SEQ ID NO: 2 with a C at position number 100, SEQ ID NO: 3 with a G at position number 100, SEQ ID NO: 4 with an A at position number 100, or SEQ ID NO: 5 with a C at position number 100.
- Embodiment 9 is the method of any one of embodiments 1-6, wherein the one or more fertile QTLs comprise SEQ ID NO: 2, 52, 88, 139, and 94, wherein position number 100 of each sequence, respectively and in combination, is the one of the following: (i) C, A, A, T, and T; (ii) C, A, A, C, and T; (iii) C, A, G, C, and T; (iv) C, G, A, C, and T; (v) C, G, G, C, and T; (vi) C, G, G, C, and C; (vii) A, A, A, T, and T; (viii) A, A, G, T, and T; (ix) A, A, A, C, and T; (x) A, G, A, T, and T; (xi) A, G, A, C, and T; or (xii) A, A, G, C, and T.
- Embodiment 10 is a method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising a transposon element (TE) comprising SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233, crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, wherein the first maize plant and/or the second maize plant comprises maize MIR162 event, and selecting from the plurality of progeny plants a selected progeny MIR162 plant comprising the transposon element.
- TE transposon element
- Embodiment 11 is the method of example(s) 10, wherein the transposon element is on chromosome 5 130436801-130439201 in B73_ver5.
- Embodiment 12 is the method of any one of embodiments 1-11, wherein the one or more fertile QTLs or the TE is introduced to the first maize plant by genome editing or by breeding.
- Embodiment 13 is the method of example(s) 12, wherein the genome editing is performed using a site-directed nuclease selected from the group consisting of Cas9 nuclease, Cpfnucleas(Casl2a), meganucleases (MNs), zinc-finger nucleases, (ZFNs), transcriptionactivator like effector nucleases (TALENs), dCas9-Fokl, dCpfl-Fokl, chimeric Cas9-cytidine deaminase, chimeric Cas9-adenine deaminase, chimeric FENl-FokI, MegaTALs, a nickase Cas9 (nCas9), chimeric dCas9 non-Fokl nuclease, dCpfl non-Fokl nuclease, chimeric Cpfl- cytidine deaminase, and Cpfl-adenine
- Embodiment 14 is the method of any one of embodiments 1-13, wherein the first maize plant and/or the second maize plant is an elite maize line.
- Embodiment 15 is the method of example(s) 1, wherein the increased fertility is increased male fertility.
- Embodiment 16 is the method of any one of embodiments 1-15, wherein the method further comprises using the selected progeny MIR162 plant as a pollinator in a second cross with itself or a third maize plant.
- Embodiment 17 is the method of example(s) 16, wherein the third maize plant expresses a polynucleotide or polypeptide of interest.
- Embodiment 18 is the method of example(s) 17, wherein the polynucleotide or polypeptide of interest confers insect resistance, disease resistance, herbicide resistance, high oil content, increased digestibility; balanced amino acid content, and/or high energy content.
- Embodiment 19 is the method of example(s) 16, wherein the second cross produces increased seed production as compared to a control cross.
- Embodiment 20 is a method of producing a plant compatible with a MIR162 event, said method comprising: selecting a first maize plant comprising a transposon element (TE) comprising SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233, crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, and selecting from the plurality of progeny plants a selected progeny plant comprising the transposon element.
- TE transposon element
- Embodiment 21 is the method of example(s) 20, wherein the method further comprises crossing the selected progeny plant comprising the transposon element with a second maize plant comprising maize MIR162 event.
- Embodiment 22 is a breeding program comprising the method of any one of embodiments 1-21.
- Embodiment 23 is a MIR162 plant produced according to any one of embodiments 1-19.
- Embodiment 24 is a MIR162 plant comprising one or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1-223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9.
- Embodiment 25 is a MIR162 plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-
- Embodiment 26 is the MIR162 plant of example(s) 24 or 25, wherein the one or more fertile QTLs comprise SEQ ID NO: 2, with a C at the position number 100.
- Embodiment 27 is the MIR162 plant of example(s) 24 or 25, wherein the one or more fertile QTLs comprise one or more of: SEQ ID NO: 1, with a G at position number 100, SEQ ID NO: 2, with a C at position number 100, SEQ ID NO: 3, with a G at position number 100, SEQ ID NO: 4, with an A at position number 100, or SEQ ID NO: 5, with a C at position number 100.
- Embodiment 28 is the method of example(s) 24 or 25, wherein one or more fertile QTLs comprise SEQ ID NO: 2, 52, 88, 139, and 94, wherein position number 100 of each sequence, respectively and in combination, is the one of the following: (i) C, A, A, T, and T; (ii) C, A, A, C, and T; (iii) C, A, G, C, and T; (iv) C, G, A, C, and T; (v) C, G, G, C, and T; (vi) C, G, G, C, and C; (vii) A, A, A, T, and T; (viii) A, A, G, T, and T; (ix) A, A, A, C, and T; (x) A, G, A, T, and T; (xi) A, G, A, C, and T; or (xii) A, A, G, C, and T.
- Embodiment 29 is a MIR162 plant comprising a transposon element (TE) comprising SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233.
- TE transposon element
- Embodiment 30 is the MIR162 plant of example(s) 29 wherein the transposon element is on chromosome 5 130436801- 130439201 in B73_ver5.
- Embodiment 31 is the MIR162 plant of example(s) 23-31, wherein the MIR162 plant is an inbred MIR162 plant.
- Embodiment 32 is a plant cell, seed, or plant part derived from the MIR162 plant of any one of embodiments 24-33.
- Embodiment 33 is a harvested product derived from the MIR162 plant of any one of embodiments 24-33 or the plant cell, seed, or plant part of example(s) 32.
- Embodiment 34 is a kit for genotyping a plant wherein the kit comprises reagents for detecting the presence of one or more marker loci of SEQ ID Nos: 1-223 or 255; one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 2
- Embodiment 35 is the kit of example(s) 34, wherein reagents comprise one or more of the following: primers for sequencing the one or more marker loci and/or the TE, primers for amplifying the one or more marker loci and/or the TE, or a oligonucleotide probe that is complementary to one or more marker loci.
- Embodiment 36 is a method of detecting one or more fertile QTLs in an MIR162 plant, the method comprising genotyping the plant at one or more marker loci linked to one or more fertile QTL, wherein the fertile QTL comprises one or more marker loci of SEQ ID Nos 1-223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9
- Embodiment 37 is a method of detecting one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, the method comprising genotyping one or more fertile QTLs from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1- 5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, where
- Embodiment 38 is the method of example(s) 36 or 37, wherein the one or more fertile QTLs comprise one or more of: SEQ ID NO: 1 with a G at position number 100, SEQ ID NO: 2 with a C at position number 100, SEQ ID NO: 3 with a G at position number 100, SEQ ID NO: 4 with an A at position number 100, or SEQ ID NO: 5 with a C at position number 100.
- Embodiment 39 is the method of example(s) 36-38 wherein the one or more fertile QTLs comprise SEQ ID NO: 2, 52, 88, 139, and 94, wherein position number 100 of each sequence, respectively and in combination, is the one of the following: (i) C, A, A, T, and T; (ii) C, A, A, C, and T; (iii) C, A, G, C, and T; (iv) C, G, A, C, and T; (v) C, G, G, C, and T; (vi) C, G, G, C, and C; (vii) A, A, A, T, and T; (viii) A, A, G, T, and T; (ix) A, A, A, C, and T; (x) A, G, A, T, and T; (xi) A, G, A, C, and T; or (xii) A, A, G, C, and T.
- Embodiment 40 is the method of any one of embodiments 36-39, wherein the genotyping is performed by sequencing, polymerase chain reaction (PCR), probe hybridization, or single base extension.
- genotyping is performed by sequencing, polymerase chain reaction (PCR), probe hybridization, or single base extension.
- the MIR162 event is known to be associated with a reduced male fertility phenotype in certain inbred genetic backgrounds, which is thought to be attributed to reduced pollen shed.
- a first approach was to use a genome-wide association study (GWAS) (Yu et al. 2006) to identify genomic loci associated with the MIR162 inbred pollen shed phenotype.
- GWAS genome-wide association study
- a bi-parental QTL mapping approach was used.
- Loci from the second study are disclosed, e.g., in U.S. Patent No. 10,214,784. However, these loci did not contribute to an understanding of the full variation of the MIR162-associated male fertility phenotype observed in certain maize backgrounds.
- Genotype and phenotype data of 338 inbred lines that went through the trait introgression program from 2017 to 2020 in maize was used (Table 2). Phenotypic data was classified as a binary trait, as either “failure” or “success” for each MIR162 conversion project.
- a MIR162 conversion project typically involved the initial crossing of an elite inbred line with a MIR162-containing donor line followed by repeated backcrossing to the elite inbred line to introgress the MIR162 event.
- Each inbred was evaluated based on 1) if they met the minimum seed quantity (e.g., producing at least 130 seeds per conversion event) and number of versions of conversion events resulted from individual backcrosses to be considered a successful trait introgression and 2) if it did not meet the key performance indicators (KPIs) in 1), were there other factors besides MIR162 that contributed to the failure. If the answer to the first question was “Yes”, it was defined as “success”, and if the answer to both questions was “No”, it was defined as a “failure”. If the answer to the question 2 was “Yes”, it was eliminated from the training data set.
- KPIs key performance indicators
- VT Version Testing
- SPR Seed Production Research
- a version is a distinct instance of a finished conversion that derives from an individual backcross plant.
- backcrossing is performed until a certain % recurrent parent recovery (%RP) (e.g., >95% RP) is achieved (typically 2-4 backcrossed) and then self pollination of backcrossed plants (e.g., BC2, BC3 or BC4) is performed to start fixing the traits.
- %RP % recurrent parent recovery
- Each version was derived from a unique BC plant that was selfed twice to create a BxF3 converted line, e.g., 4 B3F3 versions handed off and each one derived back to a different BC3 plant that was selfed.
- the minimum number of versions that was considered “successful” is 2 versions.
- the minimum seed quantity per version that is considered successful was 130 seeds per version.
- the MIR162 failure rate averaged 0.43 with a range from 0.41 to 0.47. This failed conversion of MIR162 is thought to attributed to a decrease in male fertility in inbreds.
- the failure rate refers to the number of elite inbreds for which a MIR162 conversion was attempted and did not succeed (i.e. the plants remained infertile or had reduced fertility) out of the total number MIR162 conversions that were attempted in a MIR162 conversion project. For example, in one experiment, it was attempted to convert 123 inbreds as described herein and 58 of them were not successful.
- the term “failure rate” in this context refers to the number of MIR162 conversion attempts (e.g., elite inbred or recurrent parents) that are infertile out the total number of MIR162 conversions s produced from a predetermined number of MIR162 projects.
- the term “success rate” refers to the number of MIR162 conversions that are fertile (e.g., successful trait stacking) over the total number of MIR162 conversions produced from a predetermined number of MIR162 projects.
- the predetermined number is at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 110, or at least 120.
- the sum of the success rate and the failure rate for particular event equals 1 or 100%.
- a total of 5,030 SNPs were used to genotype each inbred line before MIR162 conversion, and the genotypic data from these inbreds were used for subsequent analysis. The genotypes from each untraited inbred line were used for the training and prediction of the GRAF model.
- the GRAF model was used to identify novel loci associated with MIR162 event conversion.
- One locus on Chromosome 5 was highly correlated with the conversion failure (see Table 3 below).
- Additional further analysis using the GRAF model also identified SNPs on chromosomes 2 and 8 as being highly correlated with MIR162 event conversion (see Table 9 below).
- the SNP at position 129259011 of Chromosome 5 was the most significantly correlated with the phenotype within the locus according to the GRAF model.
- These loci and/or the SNPs contained therein are considered to provide stronger predictive power than previous loci and SNPs disclosed in US 10,214,784.
- Table 3 Chromosome 5 SNPs significantly associated with MIR162 event conversion success
- the importance index in Table 9 is a measure of the relative importance of each SNP in predicting the outcome of the model. It is calculated by measuring the decrease in the model's accuracy when the values of a particular feature are randomly permuted. The higher the decrease in accuracy, the more important the feature is considered to be in the model. This index can be used to identify the most important genomic markers in the model and to gain insights into the relationships between the markers and the outcome of MIR162 event conversion.
- GWAS was performed using a precomputed K (kinship) similarity matrix. Loci were detected on Chromosomes, 2, 6, 7 and 9 having SNPs with a significant correlation to the conversion phenotype (see Table 4 below). In these loci, the following SNPs were the most significantly correlated with the conversion phenotype: Chromosome 2 at position 243529534, chromosome 6 at position 158572433, chromosome 7 at position 134444680, and chromosome 9 at position 8013654.
- the favorable alleles in Table 4 could be used to positively select varieties that are likely to have a successful MIR162 conversion and the unfavorable alleles could be used to eliminate varieties that are likely to have an unsuccessful MIR162 conversion.
- logP refers to the strength of the association for the locus with the MIR162 conversion trait measured as -log 10 transformed P (probability of the occurrence given the event). The P value was calculated with the Wald test based on the T-statistic. Locus. logFDR is a measure of false discovery rate (FDR), which is the expected ratio of the number of false positive results to the number of total positive test results.
- the progeny are then backcrossed for at least two generations to the second parent maize plant to fix the favorable allele(s) or allelic combination or TE into the second parent maize plant’s genetic background. It is expected that fixation of the favorable allele(s) or allelic combination or TE will improve the likelihood of a successful MIR162 event conversion.
- genome editing e.g., using a CRISPR enzyme such as Cas9 or Cpfl, or a variant thereof
- a CRISPR enzyme such as Cas9 or Cpfl, or a variant thereof
- Cas9 or Cpfl a CRISPR enzyme
- a variant thereof genome editing is used to introduce the favorable allele from any of the loci in Table 4, such as at chromosome 2 at position 243529534, chromosome 5 at position 129259011, chromosome 6 at position 158572433, chromosome 7 at position 134444680, or chromosome 9 at position 8013654, or an allelic combination in Table 4, or introduce the TE into a maize plant that does not contain such allele(s) or allelic combination and would otherwise fail MIR162 event conversion. It is expected that introduction of the favorable allele(s) or allelic combination will improve the likelihood of a successful MIR162 event conversion.
- the locus on chromosome 5 is devoid of any genes, but there are potential function regions based on low/no methylation.
- the region has no accessible chromatin regions (ACR) or histone modifications.
- the region is also devoid of any known miRNA.
- the region does have several predicted repetitive elements, notably a long terminal repeat (LTR) transposable element (TE).
- LTR long terminal repeat
- the sequence of the TE is SEQ ID NO: 232 and is located at Chromosome 5: 130436801-130439201 in B73_ver5.
- a blast search of the TE identified a region consisting of 2,340 nucleotides on chromosome 2 on B73_v5, and the region has 100% identity along its length to the TE on chromosome 5.
- This region on chromosome 2 is upstream of indole-3- acetic acid amido synthetase, aka., aas2, or auxin synthetase2).
- the maize allele of aas2 is Zm00001d006753, which was found to be highly expressed in maize embryo, anther, tassel, and spikelets.
- the region on chromosome 2 also contains ACRs from maize ear tissue, as well as H3K56Ac marks that suggest active transcription in ear primordia and young leaf tissue. The results related to the region on chromosome 2 are consistent with the hypothesis that the region mapped by GRAF identified a TE on chromosome 5 that carries a regulatory region.
- TE SEQ ID NO: 232, present in certain genetic backgrounds like NP2222
- SEQ ID NO: 233 synthetic TE
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Botany (AREA)
- Developmental Biology & Embryology (AREA)
- Environmental Sciences (AREA)
- Genetics & Genomics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- General Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Analytical Chemistry (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- Physics & Mathematics (AREA)
- Physiology (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Mycology (AREA)
- Natural Medicines & Medicinal Plants (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Peptides Or Proteins (AREA)
Abstract
Compositions and methods to increased fertility of MIR162 plants are provided. Compositions include maize plants comprising fertile QTLs and/or transposable elements. Methods and kits are provided for identifying and producing these plants via transgenic means, breeding or genomic editing approaches.
Description
GENETIC MARKERS ASSOCIATED WITH INCREASED FERTILITY IN MAIZE
RELATED APPLICATION
[0001] This application claims priority to US Provisional Application No. 63/501,094, filed on May 9, 2023. The entire content of said provisional application is herein incorporated by reference for all purposes.
FIELD
[0002] This disclosure relates to the field of plant biotechnology.
SEQUENCE LISTING
[0003] The official copy of the sequence listing is submitted electronically as an XML formatted sequence listing with a file named 109770_82681_SL, created on April 23, 2024, and having a size of 304,751 bytes, and is filed concurrently with the specification. The sequence listing contained in this xml formatted document is part of the specification and is herein incorporated by reference in its entirety.
BACKGROUND
[0004] The MIR162 event is a commercially important event for controlling certain pests in maize, such as fall armyworm. However, the MIR162 locus has been observed to be associated with decreased male fertility in certain inbred maize genetic backgrounds. The degree to which male fertility is impacted is inbred specific: some inbreds exhibit little or no reduction in male fertility when homozygous for the MIR162 locus; and other inbreds exhibit a significant reduction in male fertility when homozygous for the MIR162 locus. The degree to which male fertility is decreased is also affected by environmental factors, such as water availability and temperature. Furthermore, drought and high temperature conditions can increase MIR162-associated reductions in male fertility, which can make seed production more challenging and costly. There remains a need for methods to identify and use genetic backgrounds that are more likely to be associated with increased male fertility in the presence of the MIR162 locus.
BRIEF SUMMARY
[0005] In one aspect, provided herein is a method of producing an MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1-223 or 255, wherein at least one marker locus of the one or more marker
loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9, crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, wherein the first maize plant and/or the second maize plant comprises maize event MIR162, and selecting from the plurality of progeny plants a selected progeny MIR162 plant comprising one or more fertile QTLs.
[0006] In another aspect, provided herein is a method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4, and crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, wherein the first maize plant and/or the second maize plant comprises maize MIR162 event; and selecting from the plurality of progeny plants a selected progeny MIR162 plant comprising the one or more fertile QTL using the one or more marker loci linked to and within 10 centimorgans (cM) of the one or more fertile QTL.
[0007] In yet another aspect, provided herein is a method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one
or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1-223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9, and introducing a MIR162 event into the selected maize plant.
[0008] In yet another aspect, provided herein is a method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4, and introducing a MIR162 event into the selected maize plant.
[0009] In yet another aspect, provided herein is a method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising a transposon element (TE) comprising SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233, crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, wherein the first maize plant and/or the second maize plant comprises maize MIR162 event, and selecting from the plurality of progeny plants a selected progeny MIR162 plant comprising the transposon element.
[0010] In yet another aspect, provided herein is a method of producing a plant compatible with a MIR162 event, said method comprising: selecting a first maize plant comprising a transposon element (TE) comprising SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233, crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, and selecting from the plurality of progeny plants a selected progeny plant comprising the transposon element.
[0011] In yet another aspect, provided herein is a MIR162 plant comprising one or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1-223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9
[0012] In yet another aspect, provided herein is a MIR162 plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4
[0013] In yet another aspect, provided herein is a MIR162 plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in
Table 4
[0014] In yet another aspect, provided herein is a kit for genotyping a plant wherein the kit comprises reagents for detecting the presence of one or more marker loci of SEQ ID Nos: 1- 223 or 255; one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus
of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; and/or a transposon element (TE) of SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233.
[0015] In yet another aspect, provided herein is a method of detecting one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, the method comprising genotyping one or more fertile QTLs from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4
[0016] In yet another aspect, provided herein is a method of detecting one or more fertile QTLs in an MIR162 plant, the method comprising genotyping the plant at one or more marker loci linked to one or more fertile QTL, wherein the fertile QTL comprises one or more marker loci of SEQ ID Nos 1-223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9.
[0017] In yet another aspect, provided herein is a method of detecting one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, the method comprising genotyping one or more fertile QTLs from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4
DETAILED DESCRIPTION
[0018] All technical and scientific terms used herein, unless otherwise defined below, are intended to have the same meaning as commonly understood by one of ordinary skill in the art. References to techniques employed herein are intended to refer to the techniques as commonly understood in the art, including variations on those techniques and/or substitutions of equivalent techniques that would be apparent to one of skill in the art.
[0019] Provided herein are methods and compositions related to identifying genomic loci associated with increased or decreased fertility in maize plants comprising MIR162 event. These loci can be used to select plants having genetic backgrounds that have a higher likelihood of a successful MIR162 event conversion. Plants having genomic loci identified as associated with increased fertility can be selected and used in a breeding program to produce maize MIR162 plants having increased fertility as compared to control plants. Conversely, plants having genomic loci identified as associated with decreased fertility can be excluded from such a breeding program. These loci can so be used to rescue the conversion failure phenotype by introducing a favorable allele at a genomic locus (e.g., by breeding or genome editing) into a genetic background that contains the unfavorable allele at the genomic locus.
I. Terminology
[0020] A “coding sequence” is a nucleic acid sequence that is transcribed into RNA such as mRNA, rRNA, tRNA, snRNA, sense RNA or antisense RNA. Preferably the RNA is then translated in an organism to produce a protein.
[0021] As used herein, “locus” is a chromosome region or chromosomal region where a polymorphic nucleic acid, trait determinant, gene, or marker is located. A locus can represent a single nucleotide, a few nucleotides or a large number of nucleotides in a genomic region. The loci of this disclosure comprise one or more polymorphisms in a population (e.g., alternative alleles are present in some individuals).
[0022] As used herein, the term “allele” refers to one of two or more different nucleotides or nucleotide sequences (or the absence thereof) that occur at a specific locus or set of contiguous loci. In some embodiments, the term “allele” may be used interchangeably with the term “marker.”
[0023] As used herein, a “centimorgan” (cM) is a unit of measure of recombination frequency and genetic distance between two loci. One cM is equal to a 1% chance that a marker at one genetic locus will be separated from a marker at a second locus due to crossing over in a single generation. As used herein, “closely linked” means that the marker or locus is within about 20 cM, 15 cM, 10 cM, 5 cM, 4 cM, 3 cM, 2 cM, 1 cM, 0.5 cM, or less than 0.5 cM of another marker or locus. For example, 20 cM means that recombination occurs between the marker and the locus with a frequency of equal to or less than about 20%.
[0024] As used herein, the terms “cross” or “crossed” refer to the fusion of gametes via pollination to produce progeny (e.g., cells, seeds or plants). The term encompasses both
sexual crosses (the pollination of one plant by another) and selfing (self-pollination, e.g., when the pollen and ovule are from the same plant). The term “crossing” refers to the act of fusing gametes via pollination to produce progeny. The plant disclosed herein may be a whole plant, or may be a plant cell, seed, or tissue, or a plant part such as leaf, stem, pollen, or cell that can be cultivated into a whole plant.
[0025] As used herein, the terms “backcross” and “backcrossing” refer to the process whereby a progeny plant is crossed back to one of its parents for one or more generations (e.g., 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, or 7 or more times, etc.). In a backcrossing scheme, the “donor” parent refers to the parental plant with the desired gene or locus to be introgressed. The “recipient” parent (used one or more times) or “recurrent” parent (used two or more times) refers to the parental plant into which the gene or locus is being introgressed. For example, see Ragot et al. Marker-assisted Backcrossing: A Practical Example, in TECHNIQUES ET UTILISATIONS DES MARQUEURS MOLECULAIRES LES COLLOQUES, Vol. 72, pp. 45-56 (1995); and Openshaw et al., Marker-assisted Selection in Backcross Breeding, in PROCEEDINGS OF THE SYMPOSIUM “ANALYSIS OF MOLECULAR MARKER DATA,” pp. 41-43 (1994). The initial cross gives rise to the Fl generation. The term “BC1” refers to the second use of the recurrent parent, “BC2” refers to the third use of the recurrent parent, and so on. In embodiments, at least one or more generations of progeny are identified and/or selected for the presence of the desired gene or locus (e.g., in a nucleic acid sample from the progeny plant or plant part). In embodiments, two or more generations (or even all generations) of progeny are identified and/or selected for the presence of the desired gene or locus.
[0026] As used herein, the terms “elite” and “elite line” refer to any line that has resulted from breeding and selection for desirable agronomic performance. An elite line may be substantially homozygous. Numerous elite lines are available and known to those of skill in the art, for example, those described in Table 5.
[0027] As used herein, “quantitative trait locus” (QTL) or “quantitative trait loci” (QTLs) refer to a genetic domain that effects a phenotype that can be described in quantitative terms and can be assigned a “phenotypic value” which corresponds to a quantitative value for the phenotypic trait.
[0028] As used herein the term transgenic “event” refers to a recombinant plant produced by transformation and regeneration of a plant cell or tissue with heterologous DNA, for
example, an expression cassette that includes a gene of interest. The term “event” refers to the original transformant and/or progeny of the transformant that include the heterologous DNA. The term “event” also refers to progeny produced by a sexual outcross between the transformant and another com line. Even after repeated backcrossing to a recurrent parent, the inserted DNA and the flanking DNA from the transformed parent is present in the progeny of the cross at the same chromosomal location. The term “event” also refers to DNA from the original transformant comprising the inserted DNA and flanking genomic sequence immediately adjacent to the inserted DNA that would be expected to be transferred to a progeny that receives inserted DNA including the transgene of interest as the result of a sexual cross of one parental line that includes the inserted DNA (e.g., the original transformant and progeny resulting from selfing) and a parental line that does not contain the inserted DNA. Normally, transformation of plant tissue produces multiple events, each of which represent insertion of a DNA construct into a different location in the genome of a plant cell. Based on the expression of the transgene or other desirable characteristics, a particular event is selected. Thus, in the context of this disclosure, the terms “event MIR162”, “MIR162” and “MIR162 event” are used interchangeably to refer to the event described in U.S. Pat. No. 8,232,456 and below. MIR162 plants, or cells or parts thereof, comprise the MIR162 event.
[0029] As used herein, the term “fertile” refers to a plant that is fertile enough for use in a breeding and/or seed production program. In some embodiments, fertile plants are plants that release at least about 100,000; 150,000; 200,000; 250,000, 300,000, 350,000, 400,000 or 450,000 pollen grains per tassel per day in the three-day period immediately following anther extrusion.
[0030] As used herein, the term “infertile” refers to a plant that is insufficiently fertile for use in a breeding program. In some embodiments, “infertile” plants are plants that release fewer than 25,000; 50,000; 75,000 or 100,000 pollen grains per tassel per day in the three-day period immediately following anther extrusion. “Infertile” plants may produce and/or release viable pollen grains. Indeed, in some embodiments, “infertile” plants produce and release viable pollen grains but do so a rate that is insufficient for effective use in a breeding and/or seed production program.
[0031] As used herein, the terms “increase,” “increases,” “increasing” and similar terms refer to an augmentation of at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%,
50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 100%, 125%, 150%, 175%, 200%, 300% or more.
[0032] As used herein, the term “increased fertility” refers to an improvement in one or more fertility traits of a subject plant as compared to one or more control plants. A control plant, used in this context, refers to a plant having the identical genetic background as compared to the subject plant except for the lack of one or more markers/alleles associated with increased fertility. For example, an increase in fertility of an MIR162 plant having a fertile QTL is relative to a control plant and said control plant is an MIR162 plant lacking such a fertile QTL. Exemplary fertility traits include, but are not limited to, pollen count, pollen morphology, pollen production per anther, anther count, anther morphology, anthers per tassel, tassel count, tassel morphology, tassels per plant, silk count, silk morphology, silk production per plant, kernel count, kernel morphology, kernel production per ear, prevalence of kernel abortion, kernel production per plant and kernel viability. Thus, a plant that exhibits increased pollen production, increased pollen production per anther, improved pollen morphology, increased anther production, increased anther production per tassel, improved anther morphology, increased tassel production, improved tassel morphology, increased silk production, improved silk morphology, increased silk production per plant, increased kernel count, improved kernel morphology, increased kernel production per ear, decreased prevalence of kernel abortion, increased kernel production per plant, increased kernel viability, increased fertility under stress conditions (e.g., drought conditions), increased fertility under elevated daytime temperatures and/or increased fertility under elevated nighttime temperatures as compared to a control plant (e.g., a native plant/germplasm of the same species, one or both parents, a near isogenic plant that lacks one or more markers/alleles associated with increased fertility, a near isogenic plant that lacks MIR162, etc.) displays increased fertility. When used in reference to a plant part (e.g., a germplasm), the term “increased fertility” refers to an improvement in one or more fertility traits in a plant that arises from that plant part.
[0033] As used herein, the term “increased male fertility” refers to an improvement in one or more male fertility traits as compared to one or more controls (e.g., a native plant/germplasm of the same species; one or both parents, a near isogenic plant that lacks one or more markers/alleles associated with increased fertility, a near isogenic plant that lacks MIR162, a plant that contains MIR162 and one or more unfavorable alleles such as those in the Examples, etc.). Exemplary male fertility traits include, but are not limited to, pollen
count, pollen morphology, pollen production per anther, anther count, anther morphology, anthers per tassel, tassel count, tassel morphology and tassels per plant. Thus, a plant that exhibits increased pollen production, increased pollen production per anther, improved pollen morphology, increased anther production, increased anther production per tassel, improved anther morphology, increased tassel production, improved tassel morphology, increased male fertility under stress conditions (e.g., drought conditions), increased male fertility under elevated daytime temperatures and/or increased male fertility under elevated nighttime temperatures as compared to a control plant (e.g., a native plant/germplasm of the same species, one or both parents, a near isogenic plant that lacks one or more markers/alleles associated with increased fertility, etc.), displays increased male fertility. When used in reference to a plant part (e.g., a germplasm), the term “increased male fertility” refers to an improvement in one or more male fertility traits in a plant that arises from that plant part.
[0034] As used herein, the term “unique” to MIR162 means distinctively characteristic of MIR162. Therefore, nucleic acids unique to event MIR162 are not found in other non- MIR162 maize plants.
[0035] The “Vip3A” class of proteins comprises, for example, Vip3Aa, Vip3Ab, Vip3Ac, Vip3Ad, Vip3Ae, VipAf, and Vip3Ag, , and their homologues. “Homologue” means that the indicated protein or polypeptide bears a defined relationship to other members of the Vip3A class of proteins. “Vip3Aa20” is a Vip3A homologue unique to event MIR162. It was generated by spontaneous mutations introduced into the maize-optimized Vip3Aal9 gene comprised in pNOV1300 (SEQ ID NO: 234) during the plant transformation process as described in U.S. Pat. No. 8,232,456 and below.
[0036] As used herein, the phrase “associated with” as in, for example, “a marker associated with a trait” refers to when the marker and trait are linked such that the presence of the marker is an indicator of the presence and/or extent the desired trait or trait form will occur in a plant/plant part comprising the marker. Similarly, a marker is “associated with” an allele when the marker and allele are linked such that the presence of the marker is an indicator of the presence of the allele in a plant/plant part comprising the marker. For example, “a marker associated with increased male fertility” refers to a marker whose presence or absence can be used to predict whether and/or to what extent a plant/plant part will display increased male fertility.
[0037] As used herein, “SNP” or “single nucleotide polymorphism” means a sequence variation that occurs when a single nucleotide (A, T, C, or G) in the genome sequence is altered or variable. “SNP markers” exist when SNPs are mapped to sites on the genome.
[0038] As used herein, the terms “allele of interest,” and “favorable allele” are used interchangeably to refer to an allele that is linked to a desired trait, e.g., increased fertility. An “allele of interest” may be associated with either an increase or decrease of or in a given trait, depending on the nature of the desired phenotype, and may be associated with a change in morphology, color, etc. In some embodiments of the present invention, the “allele of interest” is associated with increased male fertility and may therefore be used as a marker to identify, select and/or produce fertile maize plants; to predict whether and/or to what extent a maize plant will be fertile; to reduce the costs associated with breeding and/or seed production programs; and/or to increase the efficiency of breeding and/or seed production programs.
[0039] As used herein, the term “unfavorable allele” refers to an allele that segregates with an unfavorable plant phenotype, therefore providing the benefit of identifying plants that can be removed from a breeding program or planting.
[0040] As used herein, “fertile QTL” refers to a QTL comprising a marker locus or marker loci with one or more alleles that, in combination, are associated with baseline or increased fertility of MIR162 plants, e.g., as measured by seed set and/or yield, relative to control plants that do not contain MIR162 or increased fertility relative to plants that contain MIR162 but also contain infertile QTLs. These alleles are referred to as “favorable allele” in this application.
[0041] As used herein, “infertile QTL” refers to a QTL comprising a marker locus or marker loci with one or more alleles that, in combination, are associated with decreased fertility of MIR162 plants, e.g., as measured by seed set and/or yield, relative to control plants that do not contain MIR162. These alleles are referred to as “unfavorable allele” in this application.
[0042] As used herein, the term “plant” may refer to any suitable plant, including, but not limited to, spermatophytes (e.g., angiosperms and gymnosperms) and embryophytes (e.g., bryophytes, ferns and fern allies). In some embodiments, the plant is a monocotyledonous (monocot) plant such as a rice, maize, wheat, barley, sorghum, millet, oat, triticale, rye, buckwheat, fonio, quinoa, sugar cane, bamboo, banana, ginger, onion, lily, daffodil, iris, amaryllis, orchid, canna, bluebell, tulip, garlic, secale, einkom, spelt, emmer, durum, kamut,
grass (e.g., gramma grass), teff, milo, flax, Tripsacum sp., or teosinte plant. In some embodiments, the plant is a dicotyledonous (dicot) plant such as a blackberry, raspberry, strawberry, barberry, bearberry, blueberry, coffee berry, cranberry, crowberry, currant, elderberry, gooseberry, goji berry, honeyberry, lemon, lime, lingonberry, mangosteen, orange, pepper, persimmon, pomegranate, prune, cotton, clover, acai, plum, peach, nectarine, cherry, guava, almond, pecan, walnut, amaranth, apple, sweet pea, pear, potato, soybean, sugar beet, sunflower, sweet potato, tamarind, tea, tobacco or tomato plant.
[0043] As used herein, the term “plant cell” refers to a cell existing in, taken from and/or derived from a plant (e.g., derived from a plant cell/tissue culture). Thus, the term “plant cell” may refer to an isolated plant cell, a plant cell in a culture, a plant cell in an isolated tissue/organ and/or a plant cell in a whole plant.
[0044] As used herein, the term “plant part” refers to at least a fragment of a whole plant or to a cell culture or tissue culture derived from a plant. Thus, the term “plant part” may refer to plant cells, plant tissues and plant organs, as well as cell/tissue cultures derived from plant cells, plant tissues and plant cultures. Embodiments of the present invention may comprise and/or make use of any suitable plant part, including, but not limited to, anthers, branches, buds, calli, clumps, cobs, cotyledons, ears, embryos, filaments, flowers, fruits, husks, kernels, leaves, lodicules, ovaries, palea, panicles, pedicels, pods, pollen, protoplasts, roots, root tips, seeds, silks, stalks, stems, stigma, styles, and tassels. In some embodiments, the plant part is a plant germplasm.
[0045] As used herein, the term “polymorphism” refers to a variation in the nucleotide sequence at a locus, where said variation is too common to be due merely to a spontaneous mutation. A polymorphism generally has a frequency of at least about 1% in a population. A polymorphism can be a single nucleotide polymorphism (SNP), or an insertion/deletion polymorphism, also referred to herein as an “indel.” Additionally, the variation can be in a transcriptional profile or a methylation pattern. The polymorphic site or sites of a nucleotide sequence can be determined by comparing the nucleotide sequences at one or more loci in two or more germplasm entries.
[0046] As used herein, the terms “polypeptide,” “peptide” and “protein” refer to a polymer of amino acid residues. The terms encompass amino acid polymers in which one or more amino acid residue is an artificial chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers.
[0047] The term “progeny,” refers to the descendant(s) of a particular cross. Typically, progeny result from breeding of two individuals, although some species (particularly some plants and hermaphroditic animals) can be selfed (i.e., the same plant acts as the donor of both male and female gametes). The descendant s) can be, for example, of the Fl, the F2, or any subsequent generation.
II. MIR162 plants and fertility
[0048] MIR162 plants are described in U.S. Pat. No. 8,232,456. Methods of generating and identifying MIR162 plants are described in Example 1 of U.S. Pat. No. 8,232,456. A MIR162 plant can be hemizygous or homozygous for the MIR162 event. The MIR162 plants comprise unique nucleic acid sequences that are not found in plants not comprising the MIR162 event. The unique nucleic acid sequences include junction sequences, a Vip3Aa20 coding sequence, which encodes a Vip3Aa20 insecticidal protein that confers insect resistance, and a pmi coding sequence, which encodes a PMI protein that confers upon corn cells the ability to utilize mannose as a carbon source. A MIR162 plant typically comprises nucleic acid sequences one or more of SEQ ID NO: 224-231. In some embodiments, the MIR162 plant comprises nucleic acid sequences SEQ ID NO: 224 (Vip3A coding sequence), SEQ ID NO: 227 (the 5’ genome-insert junction sequence), and 228 (the 3’ genome-insert junction sequence). As disclosed in U.S. Pat. No. 8,232,456, seeds from MIR162 plants were deposited in accordance with the Budapest Treaty at the American Type Culture Collection (ATCC), 1801 University Boulevard, Manassas, Virginia, 20110 under ATCC Accession No. PTA-6188.
[0049] Introducing the MIR162 event into maize plants is effective in controlling lepidopteran insect pests including, but not limited to, black cutworm (BCW, Agrotis ipsilon), fall armyworm (FAW, Spodoptera frugiperda), tobacco budworm (TBW, Heliothis virescens), sugarcane borer (SCB, Diatraea saccharalis), lesser cornstalk borer (LCB, Elasmopalpus lignosellus), corn earworm (CEW, Helicoverpa zea), and western bean cutworm (WBCW, Striacosta albicosta).
[0050] However it has been observed that the MIR162 event is associated with unsuccessful event conversion in some maize backgrounds, i.e., introduction of the MIR162 event into certain maize backgrounds results in the plants having reduced, in some cases severely reduced, male fertility. For example, over a three year period, the overall
unsuccessful event conversion rate for MIR162 across maize backgrounds was observed to be about 41-47% (Table 2).
1. Fertile QTLs
[0051] In one aspect, the disclosure provides various QTLs with marker loci that are associated with fertility of MIR162 plants. These QTLs comprise marker loci associated with fertility of MIR162 plants. These QTLs include, but are not limited to, a QTL on chromosome 5 that comprises one or more marker loci of SEQ ID NOs: 1-5; a QTL on chromosome 2 that comprises one or more of SEQ ID NO: 6-55; a QTL on chromosome 6 that comprises one or more of SEQ ID NO: 56-89; a QTL on chromosome 7 that comprises one or more of SEQ ID NO: 90- 131; a a QTL located on chromosome 8 comprising SEQ ID NO: 255; and a QTL on chromosome 9 that comprises one or more of SEQ ID NO: 132-223. One or more of these QTLs can be used in the methods and comprised in the plants described in this disclosure. One or more of any of these QTLs can be used in the methods and comprised in the plants described in this disclosure.
[0052] Also provided by the disclosure are one or more fertile QTLs, i.e., QTLs comprising marker loci with alleles that are associated with increased fertility of MIR162 plants having these QTLs. In some embodiments, the fertile QTL is located on chromosome 5 and comprises one or more marker loci of SEQ ID NOs: 1-5, with the position number 100 of the sequences being the favorable alleles shown in Table 3, i.e., G, C, G, A, and C, respectively. In some embodiments, the fertile QTL comprises SEQ ID NO: 2, with the position number 100 of the sequence being C. In some embodiments, the fertile QTL comprises SEQ ID NO: 2, with the position number 100 of the sequence being C and one or more of SEQ ID NO: 1 and 3-5, with the position number 100 of the sequences being G, G, A, and C, respectively. In some embodiments, the fertile QTL is on chromosome 2, comprising one or more of SEQ ID NO: 6-55, the position number 100 of the sequences being the favorable alleles shown in Table 4. In some embodiments, the fertile QTL is on chromosome 6, comprising one or more of SEQ ID NO: 56-89, the position number 100 of the sequences being the favorable alleles shown in Table 4. In some embodiments, the fertile QTL is on chromosome 7, comprising one or more of SEQ ID NO: 90- 131, the position number 100 of the sequences being the favorable alleles shown in Table 4. In some embodiments, the fertile QTL is on chromosome 8 and comprises SEQ ID NO: 255, with position 100 of the sequence being the favorable allele shown in Table 9. In some embodiments, the fertile QTL is on chromosome 9 and
comprises one or more of SEQ ID NO: 132-223, with position 100 of the sequences being the favorable alleles shown in Table 4.
[0053] In some embodiments, the one or more fertile QTLs comprise a combination of SEQ ID NO: 2, 52, 88, 139, and 94, wherein position number 100 of each sequence, respectively and in combination, comprises one of the following: (i) C, A, A, T, and T; (ii) C, A, A, C, and T; (iii) C, A, G, C, and T; (iv) C, G, A, C, and T; (v) C, G, G, C, and T; (vi) C, G, G, C, and C; (vii) A, A, A, T, and T; (viii) A, A, G, T, and T; (ix) A, A, A, C, and T; (x) A, G, A, T, and T; (xi) A, G, A, C, and T, or (xii) A, A, G, C, and T.
[0054] In some embodiments, the one or more fertile QTLs comprise one of the following: (i) at least one of SEQ ID NO: 2 with a C at position 100, SEQ ID NO: 52 with an A at position 100, SEQ ID NO: 88 with an A at position 100, SEQ ID NO: 139 with a T at position 100, or SEQ ID NO: 94 with a T at position 100; (ii) at least one of SEQ ID NO: 2 with a C at position 100, SEQ ID NO: 52 with an A at position 100, SEQ ID NO: 88 with an A at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO: 94 with a T at position 100; (iii) at least one of SEQ ID NO: 2 with a C at position 100, SEQ ID NO: 52 with an A at position 100, SEQ ID NO: 88 with a G at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO: 94 with a T at position 100; (iv) at least one of SEQ ID NO: 2 with a C at position 100, SEQ ID NO: 52 with a G at position 100, SEQ ID NO: 88 with an A at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO: 94 with a T at position 100; (v) at least one of SEQ ID NO: 2 with a C at position 100, SEQ ID NO: 52 with a G at position 100, SEQ ID NO: 88 with a G at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO: 94 with a T at position 100; (vi) at least one of SEQ ID NO: 2 with a C at position 100, SEQ ID NO: 52 with a G at position 100, SEQ ID NO: 88 with a G at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO: 94 with a C at position 100; (vii) at least one of SEQ ID NO: 2 with an A at position 100, SEQ ID NO: 52 with an A at position 100, SEQ ID NO: 88 with an A at position 100, SEQ ID NO: 139 with a T at position 100, or SEQ ID NO: 94 with a T at position 100; (viii) at least one of SEQ ID NO: 2 with an A at position 100, SEQ ID NO: 52 with an A at position 100, SEQ ID NO: 88 with a G at position 100, SEQ ID NO: 139 with a T at position 100, or SEQ ID NO: 94 with a T at position 100; (ix) at least one of SEQ ID NO: 2 with an A at position 100, SEQ ID NO: 52 with an A at position 100, SEQ ID NO: 88 with an A at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO: 94 with a T at position 100; (x) at least one of SEQ ID NO: 2 with an A at position 100, SEQ ID NO: 52 with a G at position 100, SEQ
ID NO: 88 with an A at position 100, SEQ ID NO: 139 with a T at position 100, or SEQ ID NO: 94 with a T at position 100; (xi) at least one of SEQ ID NO: 2 with an A at position 100, SEQ ID NO: 52 with a G at position 100, SEQ ID NO: 88 with an A at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO: 94 with a T at position 100; or (xii) at least one of SEQ ID NO: 2 with an A at position 100, SEQ ID NO: 52 with an A at position 100, SEQ ID NO: 88 with a G at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO: 94 with a T at position 100.
[0055] In some embodiments, the one or more fertile QTLs comprise SEQ ID NO: 2, wherein position number 100 is C or A; SEQ ID NO: 52, wherein position number 100 is A or G; SEQ ID NO: 88, wherein position number 100 is A or G; SEQ ID NO: 139, wherein position number 100 is T or C; and SEQ ID NO: 94, wherein position number 100 is T or C.
[0056] In some embodiments, the one or more fertile QTLs comprise one or more of SEQ ID NO: 2, wherein position number 100 is C or A; SEQ ID NO: 52, wherein position number 100 is A or G; SEQ ID NO: 88, wherein position number 100 is A or G; SEQ ID NO: 139, wherein position number 100 is T or C; or SEQ ID NO: 94, wherein position number 100 is T or C.
[0057] Plants having infertile QTLs are expected to have reduced male fertility in MIR162 plants, therefore detecting infertile QTLs in plants can be a criterion for excluding plants from being used to generate MIR162 plants. In some embodiments, the infertile QTLs include SEQ ID NO: 1-5, with position number 100 of the sequences being the unfavorable alleles in Table 3, i.e., A, A, A, G, and A, respectively. In some embodiments, the infertile QTL is on chromosome 2, comprising one or more of SEQ ID NO: 6-55, the position number 100 of the sequences being the unfavorable alleles shown in Table 4. In some embodiments, the infertile QTL is on chromosome 6, comprising one or more of SEQ ID NO: 56-89, the position number 100 of the sequences being the unfavorable alleles shown in Table 4. In some embodiments, the infertile QTL is on chromosome 7, comprising one or more of SEQ ID NO: 90- 131, the position number 100 of the sequences being the unfavorable alleles shown in Table 4. In some embodiments, the infertile QTL is on chromosome 8, comprising SEQ ID NO: 255, with position 100 of the sequence being the unfavorable alle shown in Table 9. In some embodiments, the infertile QTL is on chromosome 9 and comprises one or more of SEQ ID NO: 132-223, with position 100 of the sequences being the unfavorable alleles shown in Table 4.
[0058] In some embodiments, the one or more infertile QTLs comprise a combination of SEQ ID NO: 2, 52, 88, 139, and 94, wherein position number 100 of each sequence, respectively and in combination, is the one of the following: (i) A, G, G, T, and T; (ii) A, G, G, C, and T; (iii) A, A, G, C, C; (iv) A, G, G, T, C; or (v) A, G, G, C, C.
[0059] In some embodiments, the one or more infertile QTLs comprise one of the following: (i) at least one of SEQ ID NO: 2 with an A at position 100, SEQ ID NO: 52 with a G at position 100, SEQ ID NO: 88 with a G at position 100, SEQ ID NO: 139 with a T at position 100, or SEQ ID NO: 94 with a T at position 100; (ii) at least one of SEQ ID NO: 2 with an A at position 100, SEQ ID NO: 52 with a G at position 100, SEQ ID NO: 88 with a G at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO: 94 with a T at position 100; (iii) at least one of SEQ ID NO: 2 with an A at position 100, SEQ ID NO: 52 with an A at position 100, SEQ ID NO: 88 with a G at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO: 94 with a C at position 100; (iv) at least one of SEQ ID NO: 2 with an A at position 100, SEQ ID NO: 52 with a G at position 100, SEQ ID NO: 88 with a G at position 100, SEQ ID NO: 139 with a T at position 100, or SEQ ID NO: 94 with a C at position 100; (v) at least one of SEQ ID NO: 2 with an A at position 100, SEQ ID NO: 52 with a G at position 100, SEQ ID NO: 88 with a G at position 100, SEQ ID NO: 139 with a C at position 100, or SEQ ID NO: 94 with a C at position 100.
[0060] In some embodiments, the presence of the QTL is confirmed by a locus that is closely linked to the QTLs, for example a locus that is within about 20 centimorgan (cM ), 15 cM, 10 cM, 5 cM, 4 cM, 3 cM, 2 cM, I cM, 0.5 cM, or less than 0.5 cM of the QTL.
2. TE ssociated with increased fertility
[0061] In another aspect, disclosed herein is a transposable element (TE) that is associated with fertility of MIR162 plants. In some embodiments, the TE is located on chromosomal 5. In some embodiments, the TE is located on Chromosome 5: 130436801- 130439201 in B73_ver5. In one embodiment, the TE comprises SEQ ID NO: 232 or 233. In some embodiments, the TE comprises a sequence sharing at least 80%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% sequence identity to SEQ ID NO: 232 or 233. In one illustrative embodiment as shown in Table 8, the presence of the TE (SEQ ID NO: 232 or 233) on chromosome 5 was shown to positively correlate with conversion success.
[0062] In some embodiments, plants with one or more fertile QTLs and/or a TE disclosed herein are selected and used to cross with MIR162 plants to produce further MIR162 plants, e.g., hybrid plants. In some embodiments, the MIR162 event can be introduced to the plants with one or more fertile QTLs disclosed herein to produce MIR162 plants having increased fertility. In some embodiments, the MIR162 event can be introduced to the plants with a TE disclosed herein into Chromosome 2 or 5 at the location disclosed above to produce MIR162 plants having increased fertility. In some embodiments, the genome of MIR162 plants are edited to remove the unfavorable alleles and introduce favorable allele and/or the TE to increase fertility. These approaches are further described in detail below.
III. Creating MIR162 plants with fertile QTLs
1. Crossing
[0063] In one approach, MIR162 plants with one or more fertile QTLs and/or a TE associated with increased fertility can be created by crossing a first maize plant comprising the fertile QTL(s) and/or the TE with a second maize plant to produce a plurality of progeny plants. In some embodiments, the first maize plant is an MIR162 plant. In some embodiments the second maize plant is an MIR162 plant. In some embodiments, both the first and second maize plant are MIR162 plants. In some embodiments, the MIR162 maize plant is obtained from growing a seed deposited at the ATCC under the accession No. PTA-6188. The crosses of the first and second maize plants produce a plurality of progeny plants.
[0064] In some embodiments, progeny plants are genotyped to confirm (i) the presence of fertile QTLs and/or the absence of infertile QTLs and/or (ii) the presence or absence of the TE associated with increased fertility using the methods disclosed in the section above.
[0065] In some embodiments, retention of the MIR162 event in these progeny plants can be confirmed by detecting the presence of one or more nucleic acids (e.g., one or more nucleic acids encoding Vip3Aa) or proteins that are unique to the MIR162 event and/or by detecting the insect resistance in the plant due to the presence of the MIR162 event. In one illustrative example, the method of detecting one or more nucleic acids that are unique to the MIR162 event comprises (a) contacting the sample with a pair of polynucleotide primers that, when used in a nucleic acid amplification reaction with genomic DNA from the MIR162 event produces an amplicon that is diagnostic for the MIR162 event; (b) performing a nucleic acid amplification reaction, thereby producing the amplicon; and (c) detecting the amplicon. In one aspect of this embodiment, the amplicon comprises a nucleotide sequence selected from
the group consisting of SEQ ID NO: 224-231 and the complements thereof. Other example methods include use of probes to detect one or more nucleic acids that are unique to the MIR162 event or sequencing all or part of the genome of the plant to detect one or more nucleic acids that are unique to the MIR162 event. In another illustrative example, retention of the MIR162 event in these progeny plants can be confirmed by detection of Vip3A protein (e.g,, Vip3Aa), e.g., by mass spectrometry or by enzyme-linked immunosorbent assay (ELISA). In some instances, Vip3 A protein (e.g., Vip3 Aa) is detected.
[0066] These progeny plants comprising the fertile QTL(s) and/or a TE associated with increased fertility and MIR162 events can be backcrossed with a parent plant comprising the MIR162 events and/or the fertile QTL(s) and/or the TE. The Fl progeny plants from the cross can be assayed for one or more fertile QTLs or a TE associated with increased fertility disclosed herein. A selected Fl progeny plant is then backcrossed with the parent MIR162 plant (recurrent parent). Plants from the BC1 generation are also genotyped for the one or more fertile QTLs or a TE associated with increased fertility disclosed herein. After multiple rounds of backcrossing (e.g., 5-7 generations) with the recurrent parent line, a new elite maize inbred line is obtained comprising both one or more fertile QTLs or a TE associated with increased fertility disclosed herein and the MIR162 event). In some embodiments, such inbred plants can be used as parent plants, e.g., as male parent plants, to create hybrid plants, which are a preferred commercial embodiment of maize in most countries due to heterosis of the hybrids.
2. Transformation
[0067] In another approach, MIR162 plants with fertile QTL(s) can be created by selecting a maize plant comprising one or more fertile QTLs or a TE associated with increased fertility as disclosed herein, introducing a MIR162 event into said maize plant, and growing the plant to result in a MIR162 plant comprising these fertile QTL(s) and/or TE. In some embodiments, the maize plant comprising one or more fertile QTL(s) and/or TE disclosed herein is a known maize line, for example, any one of the maize lines listed in Table 5.
[0068] The MIR162 event can be introduced into corn plants by transforming nucleic acid sequences that are unique to the MIR162 event into corn plants using methods well known in the art. See Section II above, entitled “Vip3 A plants and fertility.” One illustrative example is described in Example 1 of U.S. Pat. No. 8,232,456.
3. Genome editing
[0069] In another approach, one or more fertile QTLs and a TE associated with increased fertility can be introduced into a maize plant by genome editing. Various embodiments of the methods described herein use genome editing. In some embodiments, genome editing is used to modify the genome of a plant to produce plants having one or more of the fertile QTLs and/or a TE associated with increased fertility to increase the fertility of MIR162 plants. In some embodiments, the genome of an MIR162 plant is edited to replace the one or more unfavorable alleles in SEQ ID NOs: 1-223 or 255 with the corresponding favorable alleles in Tables 3, 4 or 9. For example, genome editing can be performed in a maize plant to replace the unfavorable allele A at position 100 in SEQ ID NO: 2 with the favorable allele C. In some embodiments, genome editing is used to introduce the favorable allele from any of the loci in Table 4, such as at chromosome 2 at position 243529534, chromosome 5 at position 129259011, chromosome 6 at position 158572433, chromosome 7 at position 134444680, or chromosome 9 at position 8013654, or an allelic combination in Table 4, into a maize plant that does not contain such allele(s) or allelic combination and would otherwise not succeed in MIR162 event conversion. It is expected that introduction of the favorable allele(s) or allelic combination will improve the likelihood of a successful MIR162 event conversion.
[0070] In some embodiments, the genome of a MIR162 plant is edited to insert a TE associated with increased fertility into the genome. In some embodiments, the TE is inserted on chromosome 5. In some embodiments, the TE is inserted into Chromosome 5 at positions 130436801- 130439201 in B73_ver5. In one embodiment, the TE comprises SEQ ID NO: 232 or 233.
[0071] Methods of editing genomes are well known in the art. Such methods include, but are not limited to, meganucleases designed against the plant genomic sequence of interest CRISPR-Cas9, TALENs, and other technologies for precise editing of genomes (Feng, et al. Cell Research 23: 1229-1232, 2013, WO 2013/026740); Cre-lox site-specific recombination; FLP-FRT recombination (Li et al. (2009) Plant Physiol 151 : 1087-1095); Bxbl -mediated integration (Yau et al. Plant J (2011) 701 : 147-166); zinc-finger mediated integration (Wright et al. (2005) Plant J 44:693-705); Cai et al. (2009) Plant Mol Biol 69:699-709); homologous recombination (Lieberman-Lazarovich and Levy (2011) Methods Mol Biol 701 :51-65); prime editing and transposases (Anzalone, A. et al. (2020) Nat Biotechnol. 38(7):824-844); translocation; and inversion.
[0072] In some embodiments, provided herein are plants transformed with and expressing genome-editing machinery as described above, which, when crossed with a target plant, result in genome editing in the target plant.
[0073] In general, genome editing may involve transient, inducible, or constitutive expression of the genome editing components or systems in the target plant or bombardment of the genome editing components directly into the plant. Genome editing may involve genomic integration or episomal presence of the genome editing components or systems.
[0074] Genome editing generally refers to the use of a site-directed nuclease (including but not limited to CRISPR/Cas, zinc fingers, meganucleases, and the like) to cut a nucleotide sequence at a desired location. This may be to cause a deletion or an insertion/deletion (“indel”) mutation, (i.e., “SDN1”), a base edit (i.e., “SDN2”), or allele insertion or replacement (i.e., “SDN3”). SDN2 or SDN3 genome editing may comprise the provision of one or more recombination templates (e.g., in a vector) comprising a gene sequence of interest that can be used for homology directed repair (HDR) within the plant (i.e., to be introduced into the plant genome). In some embodiments, the gene or allele of interest is one that is able to confer increased fertility as disclosed herein. The recombination template can be introduced into the plant to be edited either through transformation or through breeding with a donor plant comprising the recombination template. Breaks in the plant genome may be introduced within, upstream, and/or downstream of a target sequence. In some embodiments, a double strand DNA break is made within or near the target sequence locus. In some embodiments, breaks are made upstream and downstream of the target sequence locus, which may lead to its excision from the genome. In some embodiments, one or more single strand DNA breaks (nicks) are made within, upstream, and/or downstream of the target sequence (e.g., using a nickase Cas9 variant). Any of these DNA breaks, as well as those introduced via other methods known to one of skill in the art, may induce HDR. Through HDR, the target sequence is replaced by the sequence of the provided recombination template comprising a polynucleotide of interest, e.g., a fertile QTL and/or TE. By designing the system such that one or more single strand or double strand breaks are introduced within, upstream, and/or downstream of the corresponding region in the genome of a plant not comprising the gene sequence of interest, this region can be replaced with the template.
[0075] In some embodiments, mutations in the genes of interest described herein may be generated without the use of a recombination template via targeted introduction of DNA double strand breaks. Such breaks may be repaired through the process of non-homologous
end joining (NHEJ), which can result in the generation of small insertions or deletions (indels) at the repair site. Such indels may lead to frameshift mutations causing premature stop codons or other types of loss-of-function mutations in the targeted genes.
[0076] In certain embodiments, the nucleic acid modification or mutation is effected by a (modified) zinc-finger nuclease (ZFN) system. The ZFN system uses artificial restriction enzymes generated by fusing a zinc finger DNA-binding domain to a DNA-cleavage domain that can be engineered to target desired DNA sequences. Exemplary methods of genome editing using ZFNs can be found for example in U.S. Patent Nos. 6,534,261; 6,607,882; 6,746,838; 6,794,136; 6,824,978; 6,866,997; 6,933,113; and 6,979,539.
[0077] In certain embodiments, the nucleic acid modification is effected by a (modified) meganuclease, which are endodeoxyribonucleases characterized by a large recognition site (double-stranded DNA sequences of 12 to 40 base pairs). Exemplary method for using meganucleases can be found in US Patent Nos: 8,163,514; 8,133,697; 8,021,867; 8,119,361; 8,119,381; 8,124,369; and 8,129,134, which are specifically incorporated by reference.
[0078] In certain embodiments, the nucleic acid modification is effected by a (modified) CRISPR/Cas complex or system. In certain embodiments, the CRISPR/Cas system or complex is a class 2 CRISPR/Cas system. In certain embodiments, said CRISPR/Cas system or complex is a type II, type V, or type VI CRISPR/Cas system or complex. The CRISPR/Cas system does not require the generation of customized proteins to target specific sequences but rather a single Cas protein can be programmed by an RNA guide (gRNA) to recognize a specific nucleic acid target, in other words the Cas enzyme protein can be recruited to a specific nucleic acid target locus (which may comprise or consist of RNA and/or DNA) of interest using said short RNA guide.
[0079] In general, the CRISPR/Cas or CRISPR system is as used herein and as described in foregoing documents refers collectively to transcripts and other elements involved in the expression of or directing the activity of CRISPR-associated (“Cas”) genes, including sequences encoding a Cas gene and one or more of, a tracr (trans-activating CRISPR) sequence (e.g. tracrRNA or an active partial tracrRNA), a tracr-mate sequence (encompassing a “direct repeat” and a tracrRNA-processed partial direct repeat in the context of an endogenous CRISPR system), a guide sequence (also referred to as a “spacer” in the context of an endogenous CRISPR system), or “RNA(s)” as that term is herein used (e.g., RNA(s) to guide Cas, such as Cas9, e.g. CRISPR RNA and, where applicable, transactivating (tracr)
RNA or a single guide RNA (sgRNA) (chimeric RNA)) or other sequences and transcripts from a CRISPR locus. In general, a CRISPR system is characterized by elements that promote the formation of a CRISPR complex at the site of a target sequence (also referred to as a protospacer in the context of an endogenous CRISPR system). In the context of formation of a CRISPR complex, “target sequence” refers to a sequence to which a guide sequence is designed to have complementarity, where hybridization between a target sequence and a guide sequence promotes the formation of a CRISPR complex. A target sequence may comprise any polynucleotide, such as DNA or RNA polynucleotides.
[0080] In certain embodiments, the gRNA is a chimeric guide RNA or single guide RNA (sgRNA). In certain embodiments, the gRNA comprises a guide sequence and a tracr mate sequence (or direct repeat). In certain embodiments, the gRNA comprises a guide sequence, a tracr mate sequence (or direct repeat), and a tracr sequence. In certain embodiments, the CRISPR/Cas system or complex as described herein does not comprise and/or does not rely on the presence of a tracr sequence (e.g. if the Cas protein is Casl2a).
[0081] The Cas protein as referred to herein, such as but not limited to Cas9, Cas 12a (formerly referred to as Cpfl), Cas 12b (formerly referred to as C2cl), Cas 13a (formerly referred to as C2c2), C2c3, Cast 3b protein, may originate from any suitable source, and hence may include different orthologues, originating from a variety of (prokaryotic) organisms, as is well documented in the art. In certain embodiments, the Cas protein is (modified) Cas9, preferably (modified) Staphylococcus aureus Cas9 (SaCas9) or (modified) Streptococcus pyogenes Cas9 (SpCas9). In certain embodiments, the Cas protein is Casl2a, optionally from Acidaminococcus sp., such as Acidaminococcus sp. BV3L6 Cpfl (AsCasl2a) or Lachnospiraceae bacterium Cas 12a, such as Lachnospiraceae bacterium MA2020 or Lachnospiraceae bacterium MD2006 (LBCasl2a). See U.S. Pat. No. 10,669,540, incorporated herein by reference in its entirety. Alternatively, the Casl2a protein may be from Moraxella bovoculi AAX08_00205 [Mb2Casl2a] or Moraxella bovoculi AAXl l_00205 [Mb3Casl2a], See WO 2017/189308, incorporated herein by reference in its entirety. In certain embodiments, the Cas protein is (modified) C2c2, preferably Leptotrichia wadei C2c2 (LwC2c2) or Listeria newyorkensis FSL M6-0635 C2c2 (LbFSLC2c2). In certain embodiments, the (modified) Cas protein is C2cl. In certain embodiments, the (modified) Cas protein is C2c3. In certain embodiments, the (modified) Cas protein is Cas 13b. Other Cas enzymes are available to a person skilled in the art.
[0082] Genome editing methods and compositions are also disclosed in US Pat. Nos. 10,519,456 and 10,285,348 82, the entire content of which is herein incorporated by reference.
[0083] The gene-editing machinery (e.g., the DNA modifying enzyme) introduced into the plants can be controlled by any promoter that can drive recombinant gene expression in plants. In some embodiments, the promoter is a constitutive promoter. In some embodiments, the promoter is a tissue-specific promoter, e.g., a pollen-specific promoter or a sperm cell specific promoter, a zygote specific promoter, or a promoter that is highly expressed in sperm, eggs and zygotes (e.g., prOsActinl). Suitable promoters are disclosed in U.S. Pat. No. 10,519,456, the entire content of which is herein incorporated by reference.
[0084] In some embodiments, the guide RNA and the Cas protein (or any other suitable nucleases) may be delivered in DNA form, e.g., in a suitable vector that can be introduced into a yeast cell. Generally, DNA encoding the gRNA is cloned into a vector downstream of a promoter for expression. The sgRNA and Cas may be expressed from the same vector of the system or from different vectors. In some embodiments, the vectors are separately transformed into the maize plant to induce genome editing. In some embodiments, the coding sequence for Cas9 and the coding sequence for the sgRNA are ligated into a single vector, which is then transformed into the maize plant to induce genomic modification. The Cas9 vector and the sgRNA vector typically contains a selection marker, for example, spectinomycin, for identification of transformants comprising the genome editing machinery.
[0085] In some embodiments, the method of introducing desired genomic modifications comprises using a first maize plant expressing a DNA modification enzyme and at least one optional guide nucleic acid as described above to pollinate a target plant comprising genomic DNA to be edited.
IV. Detecting presence of QTLs and Tes in plants
[0086] Also provided herein are methods of detecting the presence of a fertile QTL and/or a TE associated with increased fertility (“target nucleic acid sequence”) that can increase the fertility of MIR162 plants. A target sequence, e.g., a particular allele (e.g., a favorable allele in Table 3) can be detected by methods including, but not limited to, nucleic acid sequencing, hybridization methods, amplification methods (e.g., PCR-based sequence specific amplification methods), detection of restriction fragment length polymorphisms (RFLP), detection of isozyme markers, detection of polynucleotide polymorphisms by allele specific
hybridization (ASH), detection of amplified variable sequences of the plant genome, detection of self-sustained sequence replication, detection of simple sequence repeats (SSRs), detection of single nucleotide polymorphisms (SNPs), detection of amplified fragment length polymorphisms (AFLPs), detection of expressed sequence tags (ESTs). SSR markers derived from EST sequences and/or randomly amplified polymorphic DNA (RAPD).
[0087] In one aspect the detection of the target nucleic acid sequence can be facilitated through the use of nuclear acid amplification methods. Such methods specifically increase concentration of poly nucleotides that span the polymorphic site, or include that site and sequences located either distal or proximal to it. Such amplified molecules can be readily detected by gel electrophoresis fluorescent detection methods or other means.
[0088] In another aspect, a target nucleic acid sequence can be detected by probe ligation methods as provided in U.S. Pat. No. 5,800,944 where sequence of interest is amplified and hybridized to probes followed by ligation to detect a labeled part of the probe.
[0089] Target nucleic acid sequence can also be detected by probe linking methods, employing at least one pair of probes having sequences homologous to adjacent portions of the target nucleic acid sequence and having side chains which non-covalently bind to form a stem upon base pairing of the probes to the target nucleic acid sequence. See, U.S. Pat. No. 5,616,464.
[0090] Polymorphism in nucleic acid sequences (e.g., SNPs disclosed in Table 3 or Table 4 or Table 9) can be detected using various methods. In one embodiment, polymorphism in nucleic acid sequences is detected by hybridization to allele specific oligonucleotide probes. In one exemplary method, single or multiple nucleotide variations in nucleic acid sequence are detected in nucleic acids by a process in which the sequence containing the nucleotide variation is amplified, spotted on a membrane and treated with a labeled sequence-specific oligonucleotide probe. See, U.S. Pat. Nos. 5,468,613 and 5,217,863. In another embodiment, polymorphism in nucleic acid sequences is detected using microarray, in which oligonucleotide probe sets are assembled in an overlapping fashion to represent a single sequence such that a difference in the target sequence at one point would result in partial probe hybridization (Borevitz et al., Large-scale identification of single-feature polymorphism in complex genomes. Genome Research, 13:513-523 (2003).
[0091] Other exemplary methods for detecting polymorphism in nucleic acid sequences include single base extension (SBE) methods. Examples of SBE methods include, but are not
limited, to those provided in U.S. Pat. Nos. 6,004,744; 6,013,431; 5,595,890; 5,762,876; and 5,945,283. SBE methods are based on extension of a nucleotide primer that is adjacent to a polymorphism to incorporate a detectable nucleotide residue upon extension of the primer. If the polymorphism is present on the template, one of the labeled dideoxynucleosidetriphosphates can be added to the primer in a single base chain extension. The allele present is then inferred by determining which of the two differential labels was added to the extension primer. Homozygous samples will result in only one of the two labeled bases being incorporated and thus only one of the two labels will be detected. Heterozygous samples have both alleles present, and will thus direct incorporation of both labels (into different molecules of the extension primer) and thus both labels will be detected.
[0092] In another exemplary method for detecting polymorphisms, polymorphism in nucleic acid sequences can be detected by methods provided in U.S. Pat. Nos. 5,210,015; 5,876,930; and 6,030,787 in which an oligonucleotide probe having a 5’ fluorescent reporter dye and a 3’ quencher dye covalently linked to the 5’ and 3’ ends of the probe. When the probe is intact, the proximity of the reporter dye to the quencher dye results in the suppression of the reporter dye fluorescence, e.g., by Forster-type energy transfer. During PCR, forward and reverse primers hybridize to a specific sequence of the target DNA flanking a polymorphism while the hybridization probe hybridizes to polymorphismcontaining sequence within the amplified PCR product. In the subsequent PCR cycle DNA polymerase with 5’— >3’ exonuclease activity cleaves the probe and separates the reporter dye from the quencher dye resulting in increased fluorescence of the reporter.
[0093] In another aspect, a target nucleic acid sequence can be detected using nucleic acid sequencing technologies. Methods for nucleic acid sequencing are known in the art and include technologies provided by 454 Life Sciences (Branford, Conn.), Agencourt Bioscience (Beverly, Mass.), Applied Biosystems (Foster City, Calif.), LLCOR Biosciences (Lincoln, Nebr.), NimbleGen Systems (Madison, Wis.), Illumina (San Diego, Calif.), Pac-Bio (Menlo Park, Calif.) and VisiGen Biotechnologies (Houston, Tex.). Such nucleic acid sequencing technologies comprise formats such as parallel bead arrays, sequencing by ligation, capillary electrophoresis, electronic microchips, “biochips,” microarrays, parallel microchips, and single-molecule arrays.
[0094] Accordingly, provided herein are kits for genotyping a plant to detect the presence of favorable alleles in the QTLs disclosed herein, or to detect the presence of a TE that is associated with increased fertility. The kit may comprise one of more reagents disclosed
above, for example, the kit may comprise reagents used for detecting the presence of one or more marker loci of SEQ ID Nos: 1-223 or 255; and/or a transposon element (TE) of SEQ ID NO: 232 or 233. In some embodiments, the kit comprises one or more of the following: primers for sequencing the one or more marker loci and/or the TE (for example, primers comprising at least 10 consecutive nucleotides that are complementary to any one of SEQ ID Nos: 1-223 or 255 or 232); primers for amplifying the one or more marker loci and/or the TE (for example, primers comprising at least 10 consecutive nucleotides that are complementary to any one of SEQ ID Nos: 1-223 or 255 or 232), or a nucleic acid molecule that is complementary to one or more marker loci, amplify and/or sequence one or more marker loci and/or the TE. can be used for amplifying and/or sequencing the alleles. Provided below in Table 1 are example primers and probes which can be used, e.g., in a TAQMAN assay, to detect one or more marker loci.
V. Other polynucleotides and polypeptides of interest
[0095] In some embodiments, the MIR162 plants with one or more fertile QTLs produced herein can be stacked with one or more polynucleotides encoding a desirable trait such as a polynucleotide that confers, for example, insect, disease or herbicide resistance or other desirable agronomic traits of interest including, but not limited to, traits associated with high oil content; increased digestibility; balanced amino acid content; and/or high energy content. Such traits may refer to properties of both seed and non-seed plant tissues, or to food or feed prepared from plants or seeds having such traits.
[0096] Example polynucleotides of interest include_the following events: Btl l (see US Patent No. US6114608), MIR604 (see US Patent No. US8884102), 5307 (see US Patent No. US10428393), MZIR098 (see US Patent Application No. US20200190533), TC1507 (see US Patent No. US7288643), DAS-59122-7 (see US Patent No. US7323556), MON810 (see US6713259), MON863 (see US Patent No. US7705216), MON89034 (see US Patent No. US8062840), MON88017 (see US Patent No. US9556492), DP-4114 (see US Patent No. US9725772), MON87411 (see US Patent No. US9441240), DP-032218-9 (see US Patent Application No. US2015361447), DP-033121-3 (see US Patent Application No. US2015361446), DP-023211-2 (see PCT Publication No. WO2019209700), MON95379 (see US Patent Application No. US2020032289), DBN9936 (see PCT Publication No. WO2016173361), DBN9501 (see PCT Publication No. W020207125), GH5112E-117C (see PCT Publication No. WO 17/088480), LP007-1 (see Chinese Patent Application No. CN112852801), LP007-2 (Chinese Patent Application No. CN112831584), LP007-3 (Chinese Patent Application No. CN112877454), LP007-4 (Chinese Patent Application No. CN112831585), LP007-5 (Chinese Patent Application No. CN113151534), LP007-6 (Chinese Patent Application No. CN113151533), LP007-7 (Chinese Patent Application No.
CN112852991), LP007-8 (CN113980958), Ruifeng8, ND207, Ruifengl25 (see Chinese Patent Application No. CN105017391), GA21 (see PCT Publication No. WO98/44140), NK603 (see US Patent No. US6825400), DAS40278 (see PCT Publication No. WO20 11/022469), DBN9858 (see PCT Publication No. WO2016173508), MON87429 (see PCT Publication No. WO19/152316), LW2-2 (see Chinese Patent Application No. CN113278721), T25 (see USDA/APHIS Petition 94-357-01 for Determination of Nonregulated Status for Glufosinate Resistant Com Transformation Events T14 and T25, June 1995), or the 3272 event (see US Patent No. US7635799).
[0097] As used herein, gene or trait “stacking” is combining desired genes or traits into one transgenic plant line. As one approach, plant breeders stack transgenic traits by making crosses between parents that each have a desired trait and then identifying offspring that have both of these desired traits (so-called “breeding stacks”). Another way to stack genes is by transferring two or more genes into the cell nucleus of a plant at the same time during transformation. Another way to stack genes is by re-transforming a transgenic plant with another gene of interest. For example, gene stacking can be used to combine two different insect resistance traits, an insect resistance trait and a disease resistance trait, or a herbicide resistance trait (such as, for example, Btl l). The use of a selectable marker in addition to a gene of interest would also be considered gene stacking.
[0098] In some embodiments, a nucleic acid molecule or vector of the disclosure can include an additional coding sequence for one or more polypeptides or double stranded RNA molecules (dsRNA) of interest for agronomic traits that primarily are of benefit to a seed company, grower or grain processor. A polypeptide of interest can be any polypeptide encoded by a nucleotide sequence of interest. Non-limiting examples of polypeptides of interest that are suitable for production in plants include those resulting in agronomically important traits such as herbicide resistance (also sometimes referred to as “herbicide tolerance”), virus resistance, bacterial pathogen resistance, insect resistance, nematode resistance, or fungal resistance. See, e.g., U.S. Patent Nos. 5,569,823; 5,304,730; 5,495,071; 6,329,504; and 6,337,431. The polypeptide also can be one that increases plant vigor or yield (including traits that allow a plant to grow at different temperatures, soil conditions and levels of sunlight and precipitation), or one that allows identification of a plant exhibiting a trait of interest (e.g., a selectable marker, seed coat color, relative maturity group, etc.). Various polypeptides of interest, as well as methods for introducing these polypeptides into a plant, are described, for example, in US Patent Nos. 4,761,373; 4,769,061; 4,810,648; 4,940,835;
4,975,374; 5,013,659; 5,162,602; 5,276,268; 5,304,730; 5,495,071; 5,554,798; 5,561,236; 5,569,823; 5,767,366; 5,879,903, 5,928,937; 6,084,155; 6,329,504 and 6,337,431; as well as US Patent Publication No. 2001/0016956. In some embodiments, the expression of the polypeptide in the maize plants results in a different insect resistance than the one conferred by the MIR162 event. In such instances, stacking this resistance trait with the MIR162 event, by expressing the additional polypeptide in a MIR162 plant, can result in maize plants protected from insect feeding damage to a greater degree than the insect resistance traits conferred by the MIR162 event alone.
[0099] Polynucleotides conferring resistance/tolerance to an herbicide that inhibits the growing point or meristem, such as an imidazalinone or a sulfonylurea can also be suitable in some embodiments. Exemplary polynucleotides in this category code for mutant ALS and AHAS enzymes as described, e.g., in U.S. Patent Nos. 5,767,366 and 5,928,937. U.S. Patent Nos. 4,761,373 and 5,013,659 are directed to plants resistant to various imidazalinone or sulfonamide herbicides. U.S. Patent No. 4,975,374 relates to plant cells and plants containing a nucleic acid encoding a mutant glutamine synthetase (GS) resistant to inhibition by herbicides that are known to inhibit GS, e.g., phosphinothricin and methionine sulfoximine. U.S. Patent No. 5,162,602 discloses plants resistant to inhibition by cyclohexanedione and aryloxyphenoxypropanoic acid herbicides. The resistance is conferred by an altered acetyl coenzyme A carboxylase (ACCase).
[0100] Polypeptides encoded by nucleotides sequences conferring resistance to glyphosate are also suitable for the disclosure. See, e.g., U.S. Patent No. 4,940,835 and U.S. Patent No. 4,769,061. U.S. Patent No. 5,554,798 discloses transgenic glyphosate resistant maize plants, which resistance is conferred by an altered 5-enolpyruvyl-3-phosphoshikimate synthase (EPSPS) gene.
[0101] Polynucleotides coding for resistance to phosphono compounds such as glufosinate ammonium or phosphinothricin, and pyridinoxy or phenoxy propionic acids and cyclohexones are also suitable. See, European Patent Application No. 0 242 246. See also, U.S. Patent Nos. 5,879,903, 5,276,268, and 5,561,236.
[0102] Other suitable polynucleotides include those coding for resistance to herbicides that inhibit photosynthesis, such as a triazine and a benzonitrile (nitrilase) See, U.S. Patent No. 4,810,648. Additional suitable polynucleotides coding for herbicide resistance include those coding for resistance to 2,2-dichloropropionic acid, sethoxydim, haloxyfop, imidazolinone
herbicides, sulfonylurea herbicides, triazolopyrimidine herbicides, s-triazine herbicides and bromoxynil. Also suitable are polynucleotides conferring resistance to a protox enzyme, or that provide enhanced resistance to plant diseases; enhanced tolerance of adverse environmental conditions (abiotic stresses) including but not limited to drought, excessive cold, excessive heat, or excessive soil salinity or extreme acidity or alkalinity; and alterations in plant architecture or development, including changes in developmental timing. See, e.g., U.S. Patent Publication No. 2001/0016956 and U.S. Patent No. 6,084,155.
[0103] Additional suitable polynucleotides include those coding for insecticidal polypeptides. These polypeptides may be produced in amounts sufficient to control, for example, insect pests (i.e., insect controlling amounts). It is recognized that the amount of production of an insecticidal polypeptide in a plant necessary to control insects or other pests may vary depending upon the cultivar, type of pest, environmental factors and the like. Polynucleotides useful for additional insect or pest resistance include, for example, those that encode toxins identified in Bacillus organisms. Polynucleotides comprising nucleotide sequences encoding Bacillus thuringiensis (Bt) Cry proteins from several subspecies have been cloned and recombinant clones have been found to be toxic to lepidopteran, dipteran and/or coleopteran insect larvae. Examples of such Bt insecticidal proteins include the Cry proteins such as CrylAa, CrylAb, CrylAc, CrylB, CrylC, CrylD, CrylEa, CrylFa, Cry3A, Cry9A, Cry9B, Cry9C, and the like, as well as vegetative insecticidal proteins such as Vipl, Vip2, Vip3, and the like. A full list of Bt-derived proteins can be found on the worldwide web at Bacillus thuringiensis Toxin Nomenclature Database maintained by the University of Sussex (see also, Crickmore et al. (1998) Microbiol. Mol. Biol. Rev. 62:807-813).
[0104] In embodiments, an additional polypeptide is an insecticidal polypeptide derived from a non-Bt source, including without limitation, an alpha-amylase, a peroxidase, a cholesterol oxidase, a patatin, a protease, a protease inhibitor, a urease, an alpha-amylase inhibitor, a pore-forming protein, a chitinase, a lectin, an engineered antibody or antibody fragment, a Bacillus cereus insecticidal protein, a Xenorhabdus spp. (such as X. nematophila or X. bovienii) insecticidal protein, a Photorhabdus spp. (such as P. luminescens or P. asymobiotica) insecticidal protein, a Brevibacillus spp. (such as B. laterosporous) insecticidal protein, a Lysinibacillus spp. (such as L. sphearicus) insecticidal protein, a Chromobacterium spp. (such as C. subtsugae or C. piscinae) insecticidal protein, a Yersinia spp. (such as Y. entomophaga) insecticidal protein, a Paenibacillus spp. (such as P. propylaea) insecticidal
protein, a Clostridium spp. (such as C. bifermentans) insecticidal protein, a Pseudomonas spp. (such as P. fluorescens) and a lignin.
[0105] Polypeptides that are suitable for production in plants further include those that improve or otherwise facilitate the conversion of harvested plants or plant parts into a commercially useful product, including, for example, increased or altered carbohydrate content or distribution, improved fermentation properties, increased oil content, increased protein content, modified oil profile, improved digestibility, and increased nutraceutical content, e.g., increased phytosterol content, increased tocopherol content, increased stanol content or increased vitamin content. Polypeptides of interest also include, for example, those resulting in or contributing to a reduced content of an unwanted component in a harvested crop, e.g., phytic acid, or sugar degrading enzymes. By “resulting in” or “contributing to” is intended that the polypeptide of interest can directly or indirectly contribute to the existence of a trait of interest (e.g., increasing cellulose degradation by the use of a heterologous cellulase enzyme).
[0106] In some embodiments, the polypeptide contributes to improved digestibility for food or feed. Xylanases are hemicellulolytic enzymes that improve the breakdown of plant cell walls, which leads to better utilization of the plant nutrients by an animal. This leads to improved growth rate and feed conversion. Also, the viscosity of the feeds containing xylan can be reduced. Heterologous production of xylanases in plant cells also can facilitate lignocellulosic conversion to fermentable sugars in industrial processing.
[0107] Numerous xylanases from fungal and bacterial microorganisms have been identified and characterized (see, e.g., U.S. Patent No. 5,437,992; Coughlin et al. (1993) “Proceedings of the Second TRICEL Symposium on Trichoderma reesei Cellulases and Other Hydrolases” Espoo; Souminen and Reinikainen, eds. (1993) Foundation for Biotechnical and Industrial Fermentation Research 8: 125-135; U.S. Patent Publication No. 2005/0208178; and PCT Publication No. WO 03/16654). In particular, three specific xylanases (XYL-I, XYL-II, and XYL-III) have been identified in T. reesei (Tenkanen et al. (1992) Enzyme Microb. Technol. 14:566; Torronen et al. (1992) Bio/Technology 10: 1461; and Xu et al. (1998) Appl. Microbiol. Biotechnol. 49:718).
[0108] In other embodiments, a polypeptide useful for the disclosure can be a polysaccharide degrading enzyme. Plants of this disclosure producing such an enzyme may be useful for generating, for example, fermentation feedstocks for bioprocessing. In some
embodiments, enzymes useful for a fermentation process include alpha amylases, proteases, pullulanases, isoamylases, cellulases, hemicellulases, xylanases, cyclodextrin glycotransferases, lipases, phytases, laccases, oxidases, esterases, cutinases, granular starch hydrolyzing enzyme and other glucoamylases.
[0109] Polysaccharide-degrading enzymes include: starch degrading enzymes such as a- amylases (EC 3.2.1.1), glucuronidases (E.C. 3.2.1.131); exo-l,4-a-D glucanases such as amyloglucosidases and glucoamylase (EC 3.2.1.3), P-amylases (EC 3.2.1.2), a-glucosidases (EC 3.2.1.20), and other exo-amylases; starch debranching enzymes, such as a) isoamylase (EC 3.2.1.68), pullulanase (EC 3.2.1.41), and the like; b) cellulases such as exo-1,4-3- cellobiohydrolase (EC 3.2.1.91), exo-l,3-P-D-glucanase (EC 3.2.1.39), P-glucosidase (EC 3.2.1.21); c) L-arabinases, such as endo-l,5-a-L-arabinase (EC 3.2.1.99), a-arabinosidases (EC 3.2.1.55) and the like; d) galactanases such as endo-l,4-P-D-galactanase (EC 3.2.1.89), endo-l,3-P-D-galactanase (EC 3.2.1.90), a-galactosidase (EC 3.2.1.22), P-galactosidase (EC 3.2.1.23) and the like; e) mannanases, such as endo-l,4-P-D-mannanase (EC 3.2.1.78), P- mannosidase (EC 3.2.1.25), a-mannosidase (EC 3.2.1.24) and the like; f) xylanases, such as endo-l,4-P-xylanase (EC 3.2.1.8), P-D-xylosidase (EC 3.2.1.37), 1,3-P-D-xylanase, and the like; and g) other enzymes such as a-L-fucosidase (EC 3.2.1.51), a-L-rhamnosidase (EC 3.2.1.40), levanase (EC 3.2.1.65), inulanase (EC 3.2.1.7), and the like. In one embodiment, the a-amylase is the synthetic a-amylase, Amy797E, described is US Patent No. 8,093,453, herein incorporated by reference in its entirety.
[0110] Further enzymes which may be used with the disclosure include proteases, such as fungal and bacterial proteases. Fungal proteases include, but are not limited to, those obtained from Aspergillus, Trichoderma, Mucor and Rhizopus, such as A. niger, A. awamori, A. oryzae and M. miehei. In some embodiments, the polypeptides of this disclosure can be cellobiohydrolase (CBH) enzymes (EC 3.2.1.91). In one embodiment, the cellobiohydrolase enzyme can be CBH1 or CBH2.
[OHl] Other enzymes useful with the disclosure include, but are not limited to, hemicellulases, such as mannases and arabinofuranosidases (EC 3.2.1.55); ligninases; lipases (e.g., E.C. 3.1.1.3), glucose oxidases, pectinases, xylanases, transglucosidases, alpha 1,6 glucosidases (e.g., E.C. 3.2.1.20); esterases such as ferulic acid esterase (EC 3.1.1.73) and acetyl xylan esterases (EC 3.1.1.72); and cutinases (e.g. E.C. 3.1.1.74).
[0112] Double stranded RNA molecules useful with the disclosure include but are not limited to those that suppress target insect genes. As used herein the words "gene suppression", when taken together, are intended to refer to any of the well-known methods for reducing the levels of protein produced as a result of gene transcription to mRNA and subsequent translation of the mRNA. Gene suppression is also intended to mean the reduction of protein expression from a gene or a coding sequence including posttranscriptional gene suppression and transcriptional suppression. Posttranscriptional gene suppression is mediated by the homology between of all or a part of a mRNA transcribed from a gene or coding sequence targeted for suppression and the corresponding double stranded RNA used for suppression and refers to the substantial and measurable reduction of the amount of available mRNA available in the cell for binding by ribosomes. The transcribed RNA can be in the sense orientation to effect what is called co-suppression, in the anti-sense orientation to effect what is called anti-sense suppression, or in both orientations producing a dsRNA to effect what is called RNA interference (RNAi). Transcriptional suppression is mediated by the presence in the cell of a dsRNA, a gene suppression agent, exhibiting substantial sequence identity to a promoter DNA sequence or the complement thereof to effect what is referred to as promoter trans suppression. Gene suppression may be effective against a native plant gene associated with a trait, e.g., to provide plants with reduced levels of a protein encoded by the native gene or with enhanced or reduced levels of an affected metabolite. Gene suppression can also be effective against target genes in plant pests that may ingest or contact plant material containing gene suppression agents, specifically designed to inhibit or suppress the expression of one or more homologous or complementary sequences in the cells of the pest. Such genes targeted for suppression can encode an essential protein, the predicted function of which is selected from the group consisting of muscle formation, juvenile hormone formation, juvenile hormone regulation, ion regulation and transport, digestive enzyme synthesis, maintenance of cell membrane potential, amino acid biosynthesis, amino acid degradation, sperm formation, pheromone synthesis, pheromone sensing, antennae formation, wing formation, leg formation, development and differentiation, egg formation, larval maturation, digestive enzyme formation, hemolymph synthesis, hemolymph maintenance, neurotransmission, cell division, energy metabolism, respiration, and apoptosis.
VI. Plants, plant cells and plant parts
[0113] Maize plants used in the methods and compositions are plants from Zea mays L. subsp. Mays. In some embodiments, the maize plant or plant part is from the group Zea mays
L. subsp. Mays Identata, sometimes referred to as dent corn. In some embodiments, the maize plant or plant part is from the group Zea mays L. subsp. Mays Indurata, sometimes referred to as flint corn. In some embodiments, the maize plant or plant part is from the group Zea mays L. subsp. Mays Saccharata, sometimes referred to as sweet com. In some embodiments, the maize plant or plant part is from the group Zea mays L. subsp. Mays Amylacea, sometimes referred to as flour com. In some embodiments, the maize plant or plant part is from the group Zea mays L. subsp. Mays Everts, sometimes referred to as popcorn. Maize plants that can be identified selected and/or produced with methods and compositions of the present invention include hybrids, inbreds, partial inbreds, members of defined populations and members of undefined populations. In some embodiments, the maize plant is an elite maize line. In some embodiments, the elite line is one of NP2222, NP2660, NP2276, NP2391, NP2460, or ID3461 or one of the lines disclosed in Table 5.
[0114] In some embodiments, a plant cell, seed, or plant part or harvest product can be obtained from a plant produced as above and the plant cell, seed, or plant part can be screened using methods disclosed above for the evidence of stable incorporation of the polynucleotide. Examples of plant parts include, but are not limited to, single cells and tissues from pollen, ovules, zygotes, leaves, embryos, roots, root tips, anthers, flowers, flower parts, fruits, stems, shoots, cuttings, and seeds; as well as pollen, ovules, egg cells, zygotes, leaves, embryos, roots, root tips, anthers, flowers, flower parts, fruits, stems, shoots, cuttings, scions, rootstocks, seeds, protoplasts, calli, and the like.
[0115] In some embodiments, plant products can be harvested from the plant disclosed above and processed to produce processed products, such as meal, oil, plant extract, starch, fermentation products, digestion products, and the like. These processed products are also within the scope of this invention if they comprise a polynucleotide or polypeptide or variant thereof disclosed herein. In another example, this disclosure also provides a corn meal. In another example, this disclosure also provides a method of providing a com meal by crushing oilseed of any of the plants provided herein.
VII. Breeding program and seed production program
[0116] In some embodiments, a MIR162 maize plant can be bred by crossing a first parental maize plant that is a transgenic MIR162 maize plant comprising one or more fertile QTLs and/or TE as described above and a second parental maize, thereby producing a plurality of first progeny plants. In some embodiments, the second parental maize plant does
not comprise MIR162 event. The method further comprises selecting a first progeny plant that comprises the MIR162 event and the one or more fertile QTLs.
[0117] In some embodiments, the method further comprises selfing the first progeny plant, thereby producing a plurality of second progeny plants; and then selecting from the second progeny plants that comprise both the MIR162 event and the fertile QTL(s) and/or TE.
[0118] In some embodiments, the first progeny plant is repeatedly crossed back to one of its parents through a process referred to herein as “backcrossing”. In some embodiments, the progeny is backcrossed for at least two generations to the maize parental plant (e.g., the second maize parental plant). In a backcrossing scheme, the “donor” parent refers to the parental plant with the desired gene or locus to be introgressed. The “recipient” parent (used one or more times) or “recurrent” parent (used two or more times) refers to the parental plant into which the gene or locus is being introgressed. For example, see Ragot, M. et al. (1995) Marker-assisted Backcrossing: A Practical Example, in Techniques et Utilisations des Marqueurs Moleculaires Les Colloques, Vol. 72, pp. 45-56; and Openshaw et al., (1994) Marker-assisted Selection in Backcross Breeding, in Proceedings of the Symposium “Analysis of Molecular Marker Data,” Joint Plant Breeding Symposia Series, American Society for Horticultural Science/Crop Science of America, Corvallis, Oregon, pp. 41-43. The initial cross gives rise to the Fl generation. The term “BC1” typically refers to the second use of the recurrent parent, “BC2” refers to the third use of the recurrent parent, and so on. The progeny plants may then be genotyped to confirm the presence of the MIR162 event and the fertile QTLs.
Exemplary Embodiments
[0119] Embodiment 1 is a method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1- 223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9, crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, wherein the first maize plant and/or the second maize plant comprises maize event MIR162, and selecting from the plurality of progeny plants a selected progeny MIR162 plant comprising one or more fertile QTLs.
[0120] Embodiment 2 is a method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1- 5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4. SNPs significantly associated with MIR162 event conversion success; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4, and crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, wherein the first maize plant and/or the second maize plant comprises maize MIR162 event; and selecting from the plurality of progeny plants a selected progeny MIR162 plant comprising the one or more fertile QTL using the one or more marker loci linked to and within 10 centimorgans (cM) of the one or more fertile QTL.
[0121] Embodiment 3 is a method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1- 223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9, and introducing a MIR162 event into the selected maize plant.
[0122] Embodiment 4 is a method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1- 5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4, and introducing a MIR162 event into the selected maize plant.
[0123] Embodiment 5 is the method of any one of embodiments 1-4, wherein the MIR162 event is introduced to the selected maize plant by transforming or breeding in a Vip3A coding sequence into the selected maize plant.
[0124] Embodiment 6 is the method of any one of embodiments 1-4, wherein the method further comprises producing an inbred MIR162 plant from the selected maize plant containing MIR162.
[0125] Embodiment 7 is the method of any one of embodiments 1-6, wherein the first maize plant comprises a QTL associated with increased fertility and the QTL comprises SEQ ID NO: 2, with a C at the position number 100.
[0126] Embodiment 8 is the method of any one of embodiments 1-6 , wherein the one or more fertile QTLs comprise one or more of: SEQ ID NO: 1 with a G at position number 100, SEQ ID NO: 2 with a C at position number 100, SEQ ID NO: 3 with a G at position number 100, SEQ ID NO: 4 with an A at position number 100, or SEQ ID NO: 5 with a C at position number 100.
[0127] Embodiment 9 is the method of any one of embodiments 1-6, wherein the one or more fertile QTLs comprise SEQ ID NO: 2, 52, 88, 139, and 94, wherein position number 100 of each sequence, respectively and in combination, is the one of the following: (i) C, A, A, T, and T; (ii) C, A, A, C, and T; (iii) C, A, G, C, and T; (iv) C, G, A, C, and T; (v) C, G, G, C, and T; (vi) C, G, G, C, and C; (vii) A, A, A, T, and T; (viii) A, A, G, T, and T; (ix) A, A, A, C, and T; (x) A, G, A, T, and T; (xi) A, G, A, C, and T; or (xii) A, A, G, C, and T.
[0128] Embodiment 10 is a method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising a transposon element (TE) comprising SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233, crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, wherein the first maize plant and/or the second maize plant comprises maize MIR162 event, and selecting from the plurality of progeny plants a selected progeny MIR162 plant comprising the transposon element.
[0129] Embodiment 11 is the method of example(s) 10, wherein the transposon element is on chromosome 5 130436801-130439201 in B73_ver5.
[0130] Embodiment 12 is the method of any one of embodiments 1-11, wherein the one or more fertile QTLs or the TE is introduced to the first maize plant by genome editing or by breeding.
[0131] Embodiment 13 is the method of example(s) 12, wherein the genome editing is performed using a site-directed nuclease selected from the group consisting of Cas9 nuclease, Cpfnucleas(Casl2a), meganucleases (MNs), zinc-finger nucleases, (ZFNs), transcriptionactivator like effector nucleases (TALENs), dCas9-Fokl, dCpfl-Fokl, chimeric Cas9-cytidine deaminase, chimeric Cas9-adenine deaminase, chimeric FENl-FokI, MegaTALs, a nickase Cas9 (nCas9), chimeric dCas9 non-Fokl nuclease, dCpfl non-Fokl nuclease, chimeric Cpfl- cytidine deaminase, and Cpfl-adenine deaminase.
[0132] Embodiment 14 is the method of any one of embodiments 1-13, wherein the first maize plant and/or the second maize plant is an elite maize line.
[0133] Embodiment 15 is the method of example(s) 1, wherein the increased fertility is increased male fertility.
[0134] Embodiment 16 is the method of any one of embodiments 1-15, wherein the method further comprises using the selected progeny MIR162 plant as a pollinator in a second cross with itself or a third maize plant.
[0135] Embodiment 17 is the method of example(s) 16, wherein the third maize plant expresses a polynucleotide or polypeptide of interest.
[0136] Embodiment 18 is the method of example(s) 17, wherein the polynucleotide or polypeptide of interest confers insect resistance, disease resistance, herbicide resistance, high oil content, increased digestibility; balanced amino acid content, and/or high energy content.
[0137] Embodiment 19 is the method of example(s) 16, wherein the second cross produces increased seed production as compared to a control cross.
[0138] Embodiment 20 is a method of producing a plant compatible with a MIR162 event, said method comprising: selecting a first maize plant comprising a transposon element (TE) comprising SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233, crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, and selecting from the plurality of progeny plants a selected progeny plant comprising the transposon element.
[0139] Embodiment 21 is the method of example(s) 20, wherein the method further comprises crossing the selected progeny plant comprising the transposon element with a second maize plant comprising maize MIR162 event.
[0140] Embodiment 22 is a breeding program comprising the method of any one of embodiments 1-21.
[0141] Embodiment 23 is a MIR162 plant produced according to any one of embodiments 1-19.
[0142] Embodiment 24 is a MIR162 plant comprising one or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1-223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9.
[0143] Embodiment 25 is a MIR162 plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile
QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4.
[0144] Embodiment 26 is the MIR162 plant of example(s) 24 or 25, wherein the one or more fertile QTLs comprise SEQ ID NO: 2, with a C at the position number 100.
[0145] Embodiment 27 is the MIR162 plant of example(s) 24 or 25, wherein the one or more fertile QTLs comprise one or more of: SEQ ID NO: 1, with a G at position number 100, SEQ ID NO: 2, with a C at position number 100, SEQ ID NO: 3, with a G at position number 100, SEQ ID NO: 4, with an A at position number 100, or SEQ ID NO: 5, with a C at position number 100.
[0146] Embodiment 28 is the method of example(s) 24 or 25, wherein one or more fertile QTLs comprise SEQ ID NO: 2, 52, 88, 139, and 94, wherein position number 100 of each sequence, respectively and in combination, is the one of the following: (i) C, A, A, T, and T; (ii) C, A, A, C, and T; (iii) C, A, G, C, and T; (iv) C, G, A, C, and T; (v) C, G, G, C, and T; (vi) C, G, G, C, and C; (vii) A, A, A, T, and T; (viii) A, A, G, T, and T; (ix) A, A, A, C, and T; (x) A, G, A, T, and T; (xi) A, G, A, C, and T; or (xii) A, A, G, C, and T.
[0147] Embodiment 29 is a MIR162 plant comprising a transposon element (TE) comprising SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233.
[0148] Embodiment 30 is the MIR162 plant of example(s) 29 wherein the transposon element is on chromosome 5 130436801- 130439201 in B73_ver5.
[0149] Embodiment 31 is the MIR162 plant of example(s) 23-31, wherein the MIR162 plant is an inbred MIR162 plant.
[0150] Embodiment 32 is a plant cell, seed, or plant part derived from the MIR162 plant of any one of embodiments 24-33.
[0151] Embodiment 33 is a harvested product derived from the MIR162 plant of any one of embodiments 24-33 or the plant cell, seed, or plant part of example(s) 32.
[0152] Embodiment 34 is a kit for genotyping a plant wherein the kit comprises reagents for detecting the presence of one or more marker loci of SEQ ID Nos: 1-223 or 255; one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in
Table 4; and/or a transposon element (TE) of SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233.
[0153] Embodiment 35 is the kit of example(s) 34, wherein reagents comprise one or more of the following: primers for sequencing the one or more marker loci and/or the TE, primers for amplifying the one or more marker loci and/or the TE, or a oligonucleotide probe that is complementary to one or more marker loci.
[0154] Embodiment 36 is a method of detecting one or more fertile QTLs in an MIR162 plant, the method comprising genotyping the plant at one or more marker loci linked to one or more fertile QTL, wherein the fertile QTL comprises one or more marker loci of SEQ ID Nos 1-223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9
[0155] Embodiment 37 is a method of detecting one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, the method comprising genotyping one or more fertile QTLs from the group consisting of a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1- 5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3; a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4.
[0156] Embodiment 38 is the method of example(s) 36 or 37, wherein the one or more fertile QTLs comprise one or more of: SEQ ID NO: 1 with a G at position number 100, SEQ ID NO: 2 with a C at position number 100, SEQ ID NO: 3 with a G at position number 100, SEQ ID NO: 4 with an A at position number 100, or SEQ ID NO: 5 with a C at position number 100.
[0157] Embodiment 39 is the method of example(s) 36-38 wherein the one or more fertile QTLs comprise SEQ ID NO: 2, 52, 88, 139, and 94, wherein position number 100 of each sequence, respectively and in combination, is the one of the following: (i) C, A, A, T, and T; (ii) C, A, A, C, and T; (iii) C, A, G, C, and T; (iv) C, G, A, C, and T; (v) C, G, G, C, and T; (vi) C, G, G, C, and C; (vii) A, A, A, T, and T; (viii) A, A, G, T, and T; (ix) A, A, A, C, and T; (x) A, G, A, T, and T; (xi) A, G, A, C, and T; or (xii) A, A, G, C, and T.
[0158] Embodiment 40 is the method of any one of embodiments 36-39, wherein the genotyping is performed by sequencing, polymerase chain reaction (PCR), probe hybridization, or single base extension.
[0159] The examples presented below are intended to illustrate potential and specific implementations of the disclosure. It can be appreciated that the examples are intended primarily for purposes of illustration of the disclosure for those skilled in the art. There may be variations to these diagrams, or the operations described herein without departing from the spirit of the disclosure. For instance, in certain cases, method steps or operations may be performed or executed in differing order, or operations may be added, deleted or modified.
EXAMPLES
Example 1. Chromosome 5 Locus Identification
[0160] The MIR162 event is known to be associated with a reduced male fertility phenotype in certain inbred genetic backgrounds, which is thought to be attributed to reduced pollen shed. In a prior study, a first approach was to use a genome-wide association study (GWAS) (Yu et al. 2006) to identify genomic loci associated with the MIR162 inbred pollen shed phenotype. However, due to complicated genetic architecture, even with tremendous efforts in the GWAS approach, a portion of the variation of the MIR162 male fertility phenotype observed in certain maize backgrounds remained unexplained. In a second study, a bi-parental QTL mapping approach was used. Loci from the second study are disclosed, e.g., in U.S. Patent No. 10,214,784. However, these loci did not contribute to an
understanding of the full variation of the MIR162-associated male fertility phenotype observed in certain maize backgrounds.
[0161] In the current study, a genomic random forest (GRAF) prediction analytics model was used by integrating random forest into genomic prediction. Genotype and phenotype data of 338 inbred lines that went through the trait introgression program from 2017 to 2020 in maize was used (Table 2). Phenotypic data was classified as a binary trait, as either “failure” or “success” for each MIR162 conversion project. A MIR162 conversion project typically involved the initial crossing of an elite inbred line with a MIR162-containing donor line followed by repeated backcrossing to the elite inbred line to introgress the MIR162 event. Each inbred was evaluated based on 1) if they met the minimum seed quantity (e.g., producing at least 130 seeds per conversion event) and number of versions of conversion events resulted from individual backcrosses to be considered a successful trait introgression and 2) if it did not meet the key performance indicators (KPIs) in 1), were there other factors besides MIR162 that contributed to the failure. If the answer to the first question was “Yes”, it was defined as “success”, and if the answer to both questions was “No”, it was defined as a “failure”. If the answer to the question 2 was “Yes”, it was eliminated from the training data set. Each conversion was also followed through Version Testing (VT) and Seed Production Research (SPR) and if it failed a version test because of low pollen count the previous rating of “success” was updated to “failure”. In the context of this analysis, a version is a distinct instance of a finished conversion that derives from an individual backcross plant. Typically, backcrossing is performed until a certain % recurrent parent recovery (%RP) (e.g., >95% RP) is achieved (typically 2-4 backcrossed) and then self pollination of backcrossed plants (e.g., BC2, BC3 or BC4) is performed to start fixing the traits. 3-4 versions of each conversion were analyzed. Each version was derived from a unique BC plant that was selfed twice to create a BxF3 converted line, e.g., 4 B3F3 versions handed off and each one derived back to a different BC3 plant that was selfed. The minimum number of versions that was considered “successful” is 2 versions. The minimum seed quantity per version that is considered successful was 130 seeds per version. Over four years, the MIR162 failure rate averaged 0.43 with a range from 0.41 to 0.47. This failed conversion of MIR162 is thought to attributed to a decrease in male fertility in inbreds.
[0162] In the context of this analysis, the failure rate refers to the number of elite inbreds for which a MIR162 conversion was attempted and did not succeed (i.e. the plants remained infertile or had reduced fertility) out of the total number MIR162 conversions that were
attempted in a MIR162 conversion project. For example, in one experiment, it was attempted to convert 123 inbreds as described herein and 58 of them were not successful. Thus, the term “failure rate” in this context refers to the number of MIR162 conversion attempts (e.g., elite inbred or recurrent parents) that are infertile out the total number of MIR162 conversions s produced from a predetermined number of MIR162 projects. Conversely, the term “success rate” refers to the number of MIR162 conversions that are fertile (e.g., successful trait stacking) over the total number of MIR162 conversions produced from a predetermined number of MIR162 projects. In some embodiments, the predetermined number is at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 110, or at least 120. The sum of the success rate and the failure rate for particular event equals 1 or 100%.
[0163] A total of 5,030 SNPs were used to genotype each inbred line before MIR162 conversion, and the genotypic data from these inbreds were used for subsequent analysis. The genotypes from each untraited inbred line were used for the training and prediction of the GRAF model.
[0164] The GRAF model was used to identify novel loci associated with MIR162 event conversion. One locus on Chromosome 5 was highly correlated with the conversion failure (see Table 3 below). Additional further analysis using the GRAF model also identified SNPs on chromosomes 2 and 8 as being highly correlated with MIR162 event conversion (see Table 9 below). In particular, the SNP at position 129259011 of Chromosome 5 was the most significantly correlated with the phenotype within the locus according to the GRAF model. These loci and/or the SNPs contained therein are considered to provide stronger predictive power than previous loci and SNPs disclosed in US 10,214,784.
Table 3. Chromosome 5 SNPs significantly associated with MIR162 event conversion success
The importance index in Table 9 is a measure of the relative importance of each SNP in predicting the outcome of the model. It is calculated by measuring the decrease in the model's accuracy when the values of a particular feature are randomly permuted. The higher the decrease in accuracy, the more important the feature is considered to be in the model. This index can be used to identify the most important genomic markers in the model and to gain insights into the relationships between the markers and the outcome of MIR162 event conversion.
SEQ ID NO: 255:
T C T G T AAAAC AAAC AG C C AT T T GAT C C AT T GAG CCTGATTTTTCGTATAGGGT C AAAAAAC T T GGTAAC T T CAAAGCAAGGCAC TAAAT GT T CACAT GAACAAAACAAAGGGCAGACCCAGC T C CCACATGAATGTGGGGTCTTTCCCCCACAAATGCGGAAAGGAAAATCCAGCATGTAAATTCA TCCAAGCAACAGAAACAAAAACTCGGCCNGGGAAGGAAAGACCGCCCTCCCGGTATTCTATT AAGAAGAGACCGAAACATGGTCCCGGCCGAAAAAATCCCCGAACCCTAGCCCCCCATCACTA GTTGGCCGACATCGCGCACTCTGCAAATGCCCAGCCGGAGGGTGGGGTGCATGACATAACCC GAGGGCGGGCGGGGCACAACGAAGGGATTTTTTAACCAAGCCCGAAATTCGCCCCCAAGGGG GATCGAACCCGGGACCTGGAGGTGCTACTTGGAAGCTTTAACCATTACGCTAAAGGCCCTTT C G GAG C GAG AGAAAC AC AC AT AAG T T GAT T C C T AAAC T AAAG C T G T T AC AGAAAAAC AG T AA TGTCCGAACTCCGGATATGTAGATTATCTAACAGAACTGATCACCAGAAGCAAAGGGCCTGT TCAGTTTGGGTTTCAGCTGCTG
Example 2. Other Non-Chromosome 5 Locus Identification
[0165] While lines that did contain the favorable allele at the SNP at position 129259011 of Chromosome 5 had about an 80% success rate, lines that did not contain the favorable allele at the SNP at position 129259011 of Chromosome 5 had about a 50% success rate. This indicates that there are other genomic loci that correlate with successful MIR162 event conversion and could be used instead of or with the SNP on Chromosome 5. To discover possibly new, conditionally distinct loci, fertile individuals carrying the favorable allele at the Chromosome 5 locus were removed from the sample set and a new sample set of 411 lines was used for a new GWAS. Structure analysis was performed to detect and correct for false positives due to population relatedness using a Nei similarity kinship matrix. GWAS was performed using a precomputed K (kinship) similarity matrix. Loci were detected on Chromosomes, 2, 6, 7 and 9 having SNPs with a significant correlation to the conversion phenotype (see Table 4 below). In these loci, the following SNPs were the most significantly correlated with the conversion phenotype: Chromosome 2 at position 243529534, chromosome 6 at position 158572433, chromosome 7 at position 134444680, and chromosome 9 at position 8013654. The favorable alleles in Table 4 could be used to positively select varieties that are likely to have a successful MIR162 conversion and the unfavorable alleles could be used to eliminate varieties that are likely to have an unsuccessful MIR162 conversion. The loci and/or the SNPs contained therein in Table 4 and Table 3, e.g., in combinations of two, three, or more, were considered to be of stronger predictive power than previous loci and SNPs disclosed in US10,214,784.
Table 4. SNPs significantly associated with MIR162 event conversion success
[0166] Key for Table 4: Favorable Allele = allele significantly associated with MIR162 conversion success, Unfavorable Allele = allele not significantly associated with MIR162 conversion success, Position is mapped to the ZmB73 refence genome, version 5 (B73_ver5) available as of the filing date (see, e.g., maizegdb.org/genome/assembly/Zm-B73- REFERENCE-NAM-5.0 or ncbi.nlm.nih.gov/assembly/GCF_902167145.1 for full assembly). Locus. logP refers to the strength of the association for the locus with the MIR162 conversion trait measured as -log 10 transformed P (probability of the occurrence given the event). The P value was calculated with the Wald test based on the T-statistic. Locus. logFDR is a measure of false discovery rate (FDR), which is the expected ratio of the number of false positive results to the number of total positive test results.
Example 3. Publicly available lines that contain the favorable alleles
[0167] Publicly available maize lines were screened and it was determined, using methods described in Section IV of this disclosure, that the following lines comprised one or more of the favorable alleles at respective loci as shown in Table 3 and Table 4. The variety codes and the published patent literature describing these varieties are shown in Table 5. Each of the listed patents and patent applications is incorporated herein by reference in its entirety.
Example 4. Allelic combination data
[0168] The most highly correlated SNPs from each chromosomal loci in Table 4 were used to create allelic combinations that also correlated with the MIR162 conversion phenotype. Each data set had an N of at least 3. The allelic combinations are shown below in Table 6 and Table 7. The combinations in Table 6 could be used to positively select varieties that are likely to have a successful MIR162 conversion and the combinations in Table 7 could be used to eliminate varieties that are likely to have an unsuccessful MIR162 conversion.
Table 7. Allelic combinations correlated with failed MIR162 event conversion
Example 5. Breeding to rescue the conversion failure phenotype
[0169] A first parent maize plant containing a favorable allele from any of the SNPs in Table 4, such as at chromosome 2 at position 243529534, chromosome 5 at position 129259011, chromosome 6 at position 158572433, chromosome 7 at position 134444680, or chromosome 9 at position 8013654, or containing an allelic combination in Table 4, or the TE described in Example 6 is crossed to a second parent maize plant that does not contain such allele(s) or allelic combination and would otherwise fail MIR162 event conversion. The progeny are then backcrossed for at least two generations to the second parent maize plant to fix the favorable allele(s) or allelic combination or TE into the second parent maize plant’s genetic background. It is expected that fixation of the favorable allele(s) or allelic combination or TE will improve the likelihood of a successful MIR162 event conversion.
[0170] Alternatively, genome editing (e.g., using a CRISPR enzyme such as Cas9 or Cpfl, or a variant thereof) is used to introduce the favorable allele from any of the loci in Table 4, such as at chromosome 2 at position 243529534, chromosome 5 at position 129259011, chromosome 6 at position 158572433, chromosome 7 at position 134444680, or chromosome 9 at position 8013654, or an allelic combination in Table 4, or introduce the TE into a maize plant that does not contain such allele(s) or allelic combination and would otherwise fail MIR162 event conversion. It is expected that introduction of the favorable allele(s) or allelic combination will improve the likelihood of a successful MIR162 event conversion.
Example 6. Functional annotation of Chr 5 locus
[0171] The locus on chromosome 5 is devoid of any genes, but there are potential function regions based on low/no methylation. The region has no accessible chromatin regions (ACR) or histone modifications. The region is also devoid of any known miRNA. The region does have several predicted repetitive elements, notably a long terminal repeat (LTR) transposable element (TE). The sequence of the TE is SEQ ID NO: 232 and is located at Chromosome 5: 130436801-130439201 in B73_ver5. A blast search of the TE identified a region consisting of 2,340 nucleotides on chromosome 2 on B73_v5, and the region has 100% identity along its length to the TE on chromosome 5. This region on chromosome 2 is upstream of indole-3- acetic acid amido synthetase, aka., aas2, or auxin synthetase2). The maize allele of aas2 is Zm00001d006753, which was found to be highly expressed in maize embryo, anther, tassel, and spikelets. The region on chromosome 2 also contains ACRs from maize ear tissue, as well as H3K56Ac marks that suggest active transcription in ear primordia and young leaf
tissue. The results related to the region on chromosome 2 are consistent with the hypothesis that the region mapped by GRAF identified a TE on chromosome 5 that carries a regulatory region. It is possible that this regulatory region may impact the area near the locus on chromosome 5. The nearest flanking gene is SWEET4c, known to be involved in sucrose transport to developing kernels. The presence of TE (SEQ ID NO: 232, present in certain genetic backgrounds like NP2222) or a synthetic TE (SEQ ID NO: 233, present in certain genetic backgrounds such as NPCI6621) on chromosome 5 is positively correlated with conversion success (see Table 8). One of ordinary skill in the art would understand that the sequence of the TE can vary to some degree due to the highly repetitive nature of such sequences.
References
1. Bernardo R., and Yu J. (2007). Prospects for genome wide selection for quantitative traits in maize. Crop Sci., 47, 1082-1090. DOI: 10.2135/Cropsci2006.11.0690
2. Breiman, L. (2001). Random forests. Mach. Learn., 45, 5-32. doi: 10.1023/A: 1010933404324
3. Chen, L., et al. (2015). A cascade of sequentially expressed sucrose transporters in the seed coat and endosperm provides nutrition for the Arabidopsis embryo. Plant Cell, 27, 607-19
4. Chen, L., et al. (2012). Sucrose efflux mediated by SWEET proteins as a key step for phloem transport. Science, 335, 207-11
5. Guo, Z., et al. (2012). Evaluation of genome-wide selection efficiency in maize nested association mapping populations. Theor. Appl. Genet., 124, 261-275
6. Meuwissen, T. H., et al. (2001). Prediction of total genetic value using genomewide dense marker maps. Genetics, 157, 1819-1829
7. Yu, J., et al. (2006). A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat. Genet., 38, 203-208
Claims
1. A method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1-223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9, crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, wherein the first maize plant and/or the second maize plant comprises maize event MIR162, and selecting from the plurality of progeny plants a selected progeny MIR162 plant comprising one or more fertile QTLs.
2. A method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of
(i) a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3;
(ii) a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4;
(iii) a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4;
(iv) a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker
locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4;
(v) a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and
(vi) a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4, and crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, wherein the first maize plant and/or the second maize plant comprises maize MIR 162 event; and selecting from the plurality of progeny plants a selected progeny MIR162 plant comprising the one or more fertile QTL using the one or more marker loci linked to and within 10 centimorgans (cM) of the one or more fertile QTL.
3. A method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1-223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9, and introducing a MIR162 event into the selected maize plant.
4. A method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of
(i) a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3;
(ii) a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4;
(iii) a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4;
(iv) a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4;
(v) a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and
(vi) a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4, and introducing a MIR162 event into the selected maize plant.
5. The method of any one of claims 1-4, wherein the MIR162 event is introduced to the selected maize plant by transforming or breeding in a Vip3 A coding sequence into the selected maize plant.
6. The method of any one of claims 1-4 , wherein the method further comprises producing an inbred MIR162 plant from the selected maize plant containing MIR162.
7. The method of any one of claims 1-6 , wherein the first maize plant comprises a QTL associated with increased fertility and the QTL comprises SEQ ID NO: 2, with a C at the position number 100.
8. The method of any one of claims 1-6 , wherein the one or more fertile QTLs comprise one or more of:
SEQ ID NO: 1 with a G at position number 100,
SEQ ID NO: 2 with a C at position number 100,
SEQ ID NO: 3 with a G at position number 100,
SEQ ID NO: 4 with an A at position number 100, or
SEQ ID NO: 5 with a C at position number 100.
9. The method of any one of claims 1-6, wherein the one or more fertile QTLs comprise SEQ ID NO: 2, 52, 88, 139, and 94, wherein position number 100 of each sequence, respectively and in combination, is the one of the following:
(i) C, A, A, T, and T;
(ii) C, A, A, C, and T;
(iii) C, A, G, C, and T;
(iv) C, G, A, C, and T;
(v) C, G, G, C, and T;
(vi) C, G, G, C, and C;
(vii) A, A, A, T, and T;
(viii) A, A, G, T, and T;
(ix) A, A, A, C, and T;
(x) A, G, A, T, and T;
(xi) A, G, A, C, and T; or
(xii) A, A, G, C, and T.
10. A method of producing a MIR162 plant with increased fertility, said method comprising: selecting a first maize plant comprising a transposon element (TE) comprising SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233, crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, wherein the first maize plant and/or the second maize plant comprises maize MIR162 event, and selecting from the plurality of progeny plants a selected progeny MIR162 plant comprising the transposon element.
11. The method of claim 10, wherein the transposon element is on chromosome 5 130436801-130439201 in B73_ver5.
12. The method of any one of claims 1-11, wherein the one or more fertile QTLs or the TE is introduced to the first maize plant by genome editing or by breeding.
13. The method of claim 12, wherein the genome editing is performed using a site-directed nuclease selected from the group consisting of Cas9 nuclease, Cpfl nuclease (Casl2a), meganucleases (MNs), zinc-finger nucleases, (ZFNs), transcription-activator like effector nucleases (TALENs), dCas9-Fokl, dCpfl-Fokl, chimeric Cas9-cytidine deaminase, chimeric Cas9-adenine deaminase, chimeric FENl-FokI, MegaTALs, a nickase Cas9 (nCas9), chimeric dCas9 non-Fokl nuclease, dCpfl non-Fokl nuclease, chimeric Cpfl-cytidine deaminase, and Cpfl-adenine deaminase.
14. The method of any one of claims 1-13, wherein the first maize plant and/or the second maize plant is an elite maize line.
15. The method of claim 1, wherein the increased fertility is increased male fertility.
16. The method of any one of claims 1-15, wherein the method further comprises using the selected progeny MIR162 plant as a pollinator in a second cross with itself or a third maize plant.
17. The method of claim 16, wherein the third maize plant expresses a polynucleotide or polypeptide of interest.
18. The method of claim 17, wherein the polynucleotide or polypeptide of interest confers insect resistance, disease resistance, herbicide resistance, high oil content, increased digestibility; balanced amino acid content, and/or high energy content.
19. The method of claim 16, wherein the second cross produces increased seed production as compared to a control cross.
20. A method of producing a plant compatible with a MIR162 event, said method comprising: selecting a first maize plant comprising a transposon element (TE) comprising SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233,
crossing the first maize plant with a second maize plant to produce a plurality of progeny plants, and selecting from the plurality of progeny plants a selected progeny plant comprising the transposon element.
21. The method of claim 20, wherein the method further comprises crossing the selected progeny plant comprising the transposon element with a second maize plant comprising maize MIR162 event.
22. A breeding program comprising the method of any one of claims 1-21.
23. A MIR162 plant produced according to any one of claims 1-19.
24. A MIR162 plant comprising one or more fertile QTLs, wherein the one or more fertile QTLs comprise one or more marker loci of SEQ ID NOs: 1-223 or 255, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3, Table 4 or Table 9.
25. A MIR162 plant comprising one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of
(i) a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3;
(ii) a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4;
(iii) a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4;
(iv) a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker
locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4;
(v) a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and
(vi) a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4.
26. The MIR162 plant of claim 24 or 25, wherein the one or more fertile QTLs comprise SEQ ID NO: 2, with a C at the position number 100.
27. The MIR162 plant of claim 24 or 25, wherein the one or more fertile QTLs comprise one or more of:
SEQ ID NO: 1, with a G at position number 100,
SEQ ID NO: 2, with a C at position number 100,
SEQ ID NO: 3, with a G at position number 100,
SEQ ID NO: 4, with an A at position number 100, or
SEQ ID NO: 5, with a C at position number 100.
28. The method of claim 24 or 25, wherein one or more fertile QTLs comprise SEQ ID NO: 2, 52, 88, 139, and 94, wherein position number 100 of each sequence, respectively and in combination, is the one of the following:
(i) C, A, A, T, and T;
(ii) C, A, A, C, and T;
(iii) C, A, G, C, and T;
(iv) C, G, A, C, and T;
(v) C, G, G, C, and T;
(vi) C, G, G, C, and C;
(vii) A, A, A, T, and T;
(viii) A, A, G, T, and T;
(ix) A, A, A, C, and T;
(x) A, G, A, T, and T;
(xi) A, G, A, C, and T; or
(xii) A, A, G, C, and T.
29. A MIR162 plant comprising a transposon element (TE) comprising SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233.
30. The MIR162 plant of claim 29 wherein the transposon element is on chromosome 5 130436801- 130439201 in B73_ver5.
31. The MIR162 plant of claim 23-31, wherein the MIR162 plant is an inbred MIR162 plant.
32. A plant cell, seed, or plant part derived from the MIR162 plant of any one of claims 24-33.
33. A harvested product derived from the MIR162 plant of any one of claims 24- 33 or the plant cell, seed, or plant part of claim 32.
34. A kit for genotyping a plant wherein the kit comprises reagents for detecting the presence of
(a) One or more marker loci of SEQ ID Nos: 1-223 or 255;
(b) one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, wherein the one or more fertile QTLs are selected from the group consisting of
(i) a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3;
(ii) a QTL located on chromosome 2 between 220,204,265 and
244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4;
(iii) a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56- 89, wherein at least one marker locus of the one or more marker loci
comprises a favorable allele at position number 100 as shown in Table 4;
(iv) a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90- 131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4;
(v) a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and
(vi) a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4; and/or
(c) a transposon element (TE) of SEQ ID NO: 232, 233, or a variant thereof that is at least 90% identical to SEQ ID NO: 232 or 233.
35. The kit of claim 34, wherein reagents comprise one or more of the following:
(1) primers for sequencing the one or more marker loci and/or the TE,
(2) primers for amplifying the one or more marker loci and/or the TE, or
(3) a oligonucleotide probe that is complementary to one or more marker loci.
36. A method of detecting one or more fertile QTLs in an MIR162 plant, the method comprising genotyping the plant at one or more marker loci linked to one or more fertile QTL, wherein the fertile QTL comprises one or more marker loci of SEQ ID Nos 1- 223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3 or Table 4 or Table 9.
37. A method of detecting one or more marker loci linked to and within 10 centimorgans (cM) of one or more fertile QTL, the method comprising genotyping one or more fertile QTLs from the group consisting of
(i) a QTL located on chromosome 5 between 126407035 and 137510449 comprising one or more marker loci of SEQ ID NOs: 1-5, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 3;
(ii) a QTL located on chromosome 2 between 220,204,265 and 244,411,717 comprising one or more marker loci of SEQ ID NO: 6-55, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4;
(iii) a QTL located on chromosome 6 between 146,526,712 and 168,005,927 comprising one or more marker loci of SEQ ID NO: 56-89, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4;
(iv) a QTL located on chromosome 7 between 130,409,997 and 180,307,479 comprising one or more marker loci of SEQ ID NO: 90-131, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4;
(v) a QTL located on chromosome 8 at position 14,787,385 comprising the marker locus of SEQ ID NO: 255, wherein the marker locus comprises a favorable allele at position number 100 as shown in Table 9; and
(vi) a QTL located on chromosome 9 between 974,242 and 158,560,296 comprising one or more marker loci of SEQ ID NO: 132-223, wherein at least one marker locus of the one or more marker loci comprises a favorable allele at position number 100 as shown in Table 4.
38. The method of claim 36 or 37, wherein the one or more fertile QTLs comprise one or more of:
SEQ ID NO: 1 with a G at position number 100,
SEQ ID NO: 2 with a C at position number 100,
SEQ ID NO: 3 with a G at position number 100,
SEQ ID NO: 4 with an A at position number 100, or
SEQ ID NO: 5 with a C at position number 100.
39. The method of claim 36-38 wherein the one or more fertile QTLs comprise SEQ ID NO: 2, 52, 88, 139, and 94, wherein position number 100 of each sequence, respectively and in combination, is the one of the following:
(i) C, A, A, T, and T;
(ii) C, A, A, C, and T;
(iii) C, A, G, C, and T;
(iv) C, G, A, C, and T;
(v) C, G, G, C, and T;
(vi) C, G, G, C, and C;
(vii) A, A, A, T, and T;
(viii) A, A, G, T, and T;
(ix) A, A, A, C, and T;
(x) A, G, A, T, and T;
(xi) A, G, A, C, and T; or
(xii) A, A, G, C, and T.
40. The method of any one of claims 36-39, wherein the genotyping is performed by sequencing, polymerase chain reaction (PCR), probe hybridization, or single base extension.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202480031484.8A CN121152559A (en) | 2023-05-09 | 2024-05-07 | Genetic markers associated with improved fertility in maize |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202363501094P | 2023-05-09 | 2023-05-09 | |
| US63/501,094 | 2023-05-09 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2024233528A2 true WO2024233528A2 (en) | 2024-11-14 |
| WO2024233528A3 WO2024233528A3 (en) | 2025-04-03 |
Family
ID=93431030
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2024/028116 Pending WO2024233528A2 (en) | 2023-05-09 | 2024-05-07 | Genetic markers associated with increased fertility in maize |
Country Status (4)
| Country | Link |
|---|---|
| CN (1) | CN121152559A (en) |
| AR (1) | AR132636A1 (en) |
| UY (1) | UY40735A (en) |
| WO (1) | WO2024233528A2 (en) |
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| PT3632202T (en) * | 2014-02-21 | 2022-10-03 | Syngenta Participations Ag | Genetic loci associated with increased fertility in maize |
-
2024
- 2024-05-07 WO PCT/US2024/028116 patent/WO2024233528A2/en active Pending
- 2024-05-07 CN CN202480031484.8A patent/CN121152559A/en active Pending
- 2024-05-08 AR ARP240101170A patent/AR132636A1/en unknown
- 2024-05-08 UY UY0001040735A patent/UY40735A/en unknown
Also Published As
| Publication number | Publication date |
|---|---|
| AR132636A1 (en) | 2025-07-16 |
| WO2024233528A3 (en) | 2025-04-03 |
| CN121152559A (en) | 2025-12-16 |
| UY40735A (en) | 2024-12-13 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US10428393B2 (en) | Corn event 5307 | |
| EP2736321B1 (en) | Insect resistant and herbicide tolerant soybean event 9582.814.19.1 | |
| AU2018357926A1 (en) | Wheat comprising male fertility restorer alleles | |
| US10455834B2 (en) | Insect resistant and herbicide tolerant soybean event pDAB9582.816.15.1 | |
| JP2012521782A (en) | Genetically modified rice event 17314 and method of use thereof | |
| WO2024060732A1 (en) | Transgenic zea mays l. event lp026-2 and detection method therefor | |
| US20230383307A1 (en) | Corn Elite Event MZIR098 | |
| US20240011043A1 (en) | Generation of plants with improved transgenic loci by genome editing | |
| CN112204156A (en) | Systems and methods for improving breeding by modulating recombination rates | |
| WO2017214074A1 (en) | Corn elite event mzhg0jg | |
| CN116782762B (en) | Plant haploid induction | |
| WO2024233528A2 (en) | Genetic markers associated with increased fertility in maize | |
| CN116732215A (en) | Nucleic acid sequence for detecting corn plant DBN9229 and detection method thereof | |
| EP4199703A1 (en) | Methods of increasing outcrossing rates in gramineae | |
| WO2025188947A2 (en) | Corn elite event mzir260 | |
| WO2024211809A1 (en) | Methods and compositions for increasing protein and/or oil content and/or oil profile in a plant | |
| CN118291517A (en) | A method for genetic transformation of immature embryos induced by Agrobacterium infection |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 24804135 Country of ref document: EP Kind code of ref document: A2 |
|
| REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112025024389 Country of ref document: BR |
|
| ENP | Entry into the national phase |
Ref document number: 2024804135 Country of ref document: EP Effective date: 20251209 |