WO2025137000A1 - Terpene genetic markers
- Publication number
- WO2025137000A1 (PCT/US2024/060604)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- genotype
- seq
- chromosome
- content
- analyzing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H6/00—Angiosperms, i.e. flowering plants, characterised by their botanic taxonomy
- A01H6/28—Cannabaceae, e.g. cannabis
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H1/00—Processes for modifying genotypes ; Plants characterised by associated natural traits
- A01H1/04—Processes of selection involving genotypic or phenotypic markers; Methods of using phenotypic markers for selection
- A01H1/045—Processes of selection involving genotypic or phenotypic markers; Methods of using phenotypic markers for selection using molecular markers
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H5/00—Angiosperms, i.e. flowering plants, characterised by their plant parts; Angiosperms characterised otherwise than by their botanic taxonomy
- A01H5/02—Flowers
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H5/00—Angiosperms, i.e. flowering plants, characterised by their plant parts; Angiosperms characterised otherwise than by their botanic taxonomy
- A01H5/12—Leaves
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/6895—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for plants, fungi or algae
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/13—Plant traits
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
Definitions
- the present disclosure relates to genes associated with terpene production in Cannabis, and methods of producing Cannabis varieties having high terpene content.
- Sequence Listing is submitted as an XML file in the form of the file named “Sequence.xml” (95,829 bytes), created on December 2, 2024, which is incorporated by reference herein.
- the methods include (i) analyzing one or more genetic markers in a nucleic acid sample from the Cannabis plant or its germplasm; (ii) detecting one or more genetic markers that indicate modified terpene content; and (iii) identifying and/or selecting the Cannabis plant having one or more genetic markers that indicate modified terpene content, thereby identifying/selecting Cannabis plants having modified terpene content. Also disclosed are methods of producing one or more Cannabis plants having modified terpene content (e.g., increased terpene content relative to a control).
- the modified terpene content is increased terpene content relative to a control.
- Modified terpene content can include, for example, modified total terpenes, total monoterpenes, beta-myrcene, total sesquiterpenes, alpha-pinene, beta-pinene, alpha-terpinene, gamma-terpinene, terpinolene, beta-ocimene, camphene, D-limonene, linalool, trans-nerolidol, alpha-humulene, beta-caryophyllene, and/or guaiol content.
- the one or more genetic markers that are analyzed and/or detected can include, for example, a genetic marker disclosed herein (e.g., one or more genetic markers included in Table 15 or Table 16). In some aspects, 2 to 10 genetic markers disclosed herein are analyzed. In some aspects, at least one genetic marker indicating modified terpene content is detected. In some aspects, at least two genetic markers indicating modified terpene content are detected.
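The analyze, detect, and select steps described above can be sketched in a few lines. This is only an illustration under stated assumptions: the marker names, alleles, plant names, and the `"A/G"` genotype notation are hypothetical and are not taken from Table 15 or Table 16, and the `min_markers` threshold simply mirrors the "at least one" and "at least two" aspects.

```python
def count_detected(calls: dict[str, str], beneficial: dict[str, str]) -> int:
    """Number of markers at which a plant carries the trait-associated allele."""
    return sum(
        allele in calls.get(marker, "").split("/")
        for marker, allele in beneficial.items()
    )


def select_plants(
    panel: dict[str, dict[str, str]],
    beneficial: dict[str, str],
    min_markers: int = 1,
) -> list[str]:
    """Return plants with at least `min_markers` detected markers."""
    return [
        plant
        for plant, calls in panel.items()
        if count_detected(calls, beneficial) >= min_markers
    ]


# Hypothetical markers and genotype calls, for illustration only.
beneficial = {"marker_1": "G", "marker_2": "T"}
panel = {
    "plant_A": {"marker_1": "A/G", "marker_2": "T/T"},
    "plant_B": {"marker_1": "A/A", "marker_2": "C/T"},
    "plant_C": {"marker_1": "A/A", "marker_2": "C/C"},
}

print(select_plants(panel, beneficial, min_markers=1))
print(select_plants(panel, beneficial, min_markers=2))
```

Raising `min_markers` narrows the selection to plants that carry more of the trait-associated alleles.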
- products (e.g., a kief, hashish, bubble hash, an edible product, solvent reduced oil, sludge, e-juice, or tincture)
- methods of Cannabis breeding including crossing a Cannabis plant identified, selected, or produced by a method disclosed herein.
- SEQ ID NOs: 1-99 are sequences encompassing SNP markers. Each sequence includes 50 bp of 5’ and 3’ flanking sequence, with the SNP marker at position 51.
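The layout described above (50 bp of flank on each side, SNP at position 51) implies a 101-bp sequence. A minimal sketch of splitting such a sequence into its parts follows. The example sequence is a made-up placeholder, not one of SEQ ID NOs: 1-99, and the function name is hypothetical.

```python
FLANK = 50  # bp of flanking sequence on each side of the SNP


def split_marker(seq: str) -> tuple[str, str, str]:
    """Return (5' flank, SNP base, 3' flank) for a 101-bp marker sequence."""
    expected = 2 * FLANK + 1
    if len(seq) != expected:
        raise ValueError(f"expected {expected} bp, got {len(seq)}")
    # Position 51 in 1-based coordinates is index 50 in 0-based indexing.
    return seq[:FLANK], seq[FLANK], seq[FLANK + 1:]


# Hypothetical placeholder sequence, for illustration only.
example = "A" * 50 + "G" + "T" * 50
five_prime, snp, three_prime = split_marker(example)
print(len(five_prime), snp, len(three_prime))
```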
- Cannabis has long been used for drug and industrial purposes, for fiber (hemp), for seed and seed oils, for medicinal purposes, and for recreational purposes.
- Industrial hemp products are made from Cannabis plants selected to produce an abundance of fiber.
- Some Cannabis varieties have been bred to produce minimal levels of THC, the principal psychoactive constituent responsible for the psychoactivity associated with marijuana.
- Marijuana has historically consisted of the dried flowers of Cannabis plants selectively bred to produce high levels of THC and other psychoactive cannabinoids. As a drug it usually comes in the form of dried flower buds (marijuana), resin (hashish), or various extracts collectively known as hashish oil.
- Cannabis is an annual, dioecious, flowering herb. The leaves are palmately compound or digitate, with serrate leaflets. Cannabis normally has imperfect flowers, with staminate “male” and pistillate “female” flowers occurring on separate plants. It is not unusual, however, for individual plants to separately bear both male and female flowers (i.e., monoecious plants). Although monoecious plants are often referred to as “hermaphrodites,” true hermaphrodites (which are less common in Cannabis) bear staminate and pistillate structures on individual flowers, whereas monoecious plants bear male and female flowers at different locations on the same plant.
- The life cycle of Cannabis varies with each variety but can be generally summarized into germination, vegetative growth, and reproductive stages. Because of heavy breeding and selection by humans, most Cannabis seeds have lost dormancy mechanisms and do not require any pre-treatments or winterization to induce germination. Seeds placed in viable growth conditions are expected to germinate in about 3 to 7 days. The first true leaves of a Cannabis plant contain a single leaflet, with subsequent leaves developing in opposite formation with increasing number of leaflets. Leaflets can be narrow or broad depending on the morphology of the plant grown. Cannabis plants are normally allowed to grow vegetatively for the first 4 to 8 weeks. During this period, the plant responds to increasing light with faster and faster growth. Under ideal conditions, Cannabis plants can grow up to 2.5 inches a day and are capable of reaching heights of up to 20 feet. Indoor growth techniques tend to limit Cannabis size through careful pruning of apical or side shoots.
- the first genome sequence of Cannabis, which is estimated to be 820 Mb in size, was published in 2011 by a team of Canadian scientists (van Bakel et al., “The draft genome and transcriptome of Cannabis sativa,” Genome Biology 12:R102).
- Cannabis plants produce a variety of secondary metabolites, including cannabinoids, terpenoids, and other compounds, which are often secreted by glandular trichomes that occur most abundantly on the floral calyxes and bracts of female plants.
- Cannabinoids are the most studied group of secondary metabolites in Cannabis. Most exist in two forms, as acids and in neutral (decarboxylated) forms. The acid form is designated by an “A” at the end of its acronym (e.g., THCA).
- the phytocannabinoids are synthesized in the plant as acid forms, and while some decarboxylation does occur in the plant, it increases significantly post-harvest and the kinetics increase at high temperatures (Sanchez and Verpoorte 2008).
- the biologically active forms for human consumption are the neutral forms. Decarboxylation is usually achieved by thorough drying of the plant material followed by heating it, often by either combustion, vaporization, or heating or baking in an oven.
- Cannabinoids found in Cannabis plants include, but are not limited to, Δ9-Tetrahydrocannabinol (Δ9-THC), Δ8-Tetrahydrocannabinol (Δ8-THC), Cannabichromene (CBC), Cannabicyclol (CBL), Cannabidiol (CBD), Cannabielsoin (CBE), Cannabigerol (CBG), Cannabinidiol (CBND), Cannabinol (CBN), Cannabitriol (CBT), and their propyl homologs, including, but not limited to, cannabidivarin (CBDV), Δ9-Tetrahydrocannabivarin (THCV), cannabichromevarin (CBCV), and cannabigerovarin (CBGV).
- Δ9-Tetrahydrocannabinol Δ9-THC
- Δ8-Tetrahydrocannabinol Δ8-THC
- Cannabichromene
- Non-THC cannabinoids can be collectively referred to as “CBs”, wherein CBs can be one of THCV, CBDV, CBGV, CBCV, CBD, CBC, CBE, CBG, CBN, CBND, and CBT cannabinoids.
- Terpenes are primarily produced in glandular trichomes of female inflorescences (Livingston et al., “Cannabis glandular trichomes alter morphology and metabolite content during flower maturation,” The Plant Journal 101.1 (2020): 37-56). Besides affecting aroma and fragrance, terpenes may have a synergistic effect with cannabinoids (Sommano et al., “The cannabis terpenes,” Molecules 25.24 (2020): 5792), and have been attributed medicinal properties (Maggini et al., “An Optimized Terpene Profile for a New Medical Cannabis Oil,” Pharmaceutics 14.2 (2022): 298).
- Two main groups of terpenes in Cannabis are the monoterpenes and sesquiterpenes, which are produced in the methylerythritol phosphate pathway (MEP) and mevalonic acid pathway (MEV), respectively (Booth et al., “Terpene synthases from Cannabis sativa,” PLoS ONE 12.3 (2017): e0173911).
- Monoterpenes have a ten-carbon isoprenoid precursor, geranyl diphosphate (GPP).
- Sesquiterpenes have a fifteen-carbon isoprenoid precursor, farnesyl diphosphate (FPP).
- GPP and FPP are converted to different monoterpenes and sesquiterpenes, respectively, by terpene synthases (TPS; Booth et al., “Terpenes in Cannabis sativa-From plant genome to humans,” Plant Science 284 (2019): 67-72).
- TPS terpene synthases
- SNP single nucleotide polymorphism
- a plant includes singular or plural plants and can be considered equivalent to the phrase “at least one plant.”
- the term “comprises” means “includes.” It is further to be understood that any and all base sizes or amino acid sizes, and all molecular weight or molecular mass values, given for nucleic acids or polypeptides are approximate, and are provided for descriptive purposes, unless otherwise indicated. Although many methods and materials similar or equivalent to those described herein can be used, particular suitable methods and materials are described herein. In case of conflict, the present specification, including explanations of terms, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting. To facilitate review of the various aspects, the following explanations of terms are provided:
- Abacus refers to the Cannabis sativa reference genome known as the Abacus reference genome version Csat_AbacusV2 (NCBI assembly accession GCA_025232715.1, incorporated by reference herein), which is also sometimes referred to as CsaAba2.
- alternative nucleotide call is a nucleotide polymorphism relative to a reference nucleotide for a SNP marker that is significantly associated with a desired phenotype (e.g., modified terpene content). Unless otherwise specified, the reference is the Abacus sequence.
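As a rough illustration of comparing a genotype call against the reference and alternative nucleotides, the sketch below counts how many copies of the alternative nucleotide a diploid call carries. The `"A/G"` notation, the function name, and the example alleles are assumptions made for this example; they do not come from the disclosure.

```python
def alt_allele_dosage(genotype: str, alt: str) -> int:
    """Count copies of the alternative nucleotide in a diploid call such as 'A/G'."""
    # Accept both unphased ('/') and phased ('|') separators.
    alleles = genotype.upper().replace("|", "/").split("/")
    if len(alleles) != 2:
        raise ValueError(f"expected a diploid call like 'A/G', got {genotype!r}")
    return sum(allele == alt.upper() for allele in alleles)


# Hypothetical SNP: reference nucleotide A, alternative nucleotide call G.
for call in ("A/A", "A/G", "G|G"):
    print(call, alt_allele_dosage(call, alt="G"))
```

A dosage of 0 means the call matches the reference at both copies, 1 is heterozygous, and 2 is homozygous for the alternative nucleotide.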
- a “beneficial” as used herein refers to a genetic element (e.g., gene, allele, or polymorphism) conferring or associated with modified terpene content (e.g., increased terpene content).
- a “beneficial polymorphism” or “beneficial allele” refers to a polymorphism or allele associated with modified terpene content (e.g., increased terpene content).
- hybridizing specifically to refers to the binding, duplexing, or hybridizing of a nucleic acid molecule preferentially to a particular nucleotide sequence under stringent conditions.
- stringent conditions refers to conditions under which a nucleic acid will hybridize preferentially to a target sequence, and to a lesser extent to, or not at all to, other off-target sequences.
- a “stringent hybridization” and “stringent hybridization wash conditions” in the context of nucleic acid hybridization are sequence dependent, and are different under different environmental parameters.
- line is used broadly to include, but is not limited to, a group of plants vegetatively propagated from a single parent plant, via tissue culture techniques or a group of inbred plants which are genetically very similar due to descent from a common parent(s).
- a plant is said to “belong” to a particular line if it (a) is a primary transformant (T0) plant regenerated from material of that line; (b) has a pedigree comprised of a T0 plant of that line; or (c) is genetically very similar due to common ancestry (e.g., via inbreeding or selfing).
- the term “pedigree” denotes the lineage of a plant, e.g., in terms of the sexual crosses effected such that a gene or a combination of genes, in heterozygous (hemizygous) or homozygous condition, imparts a desired trait to the plant (e.g., modified terpene content).
- a marker probe refers to a probe of any type that is able to distinguish (i.e., genotype) the particular allele that is present at a marker locus.
- a “marker locus” is a locus that can be used to track the presence of a second linked locus, e.g., a linked locus that encodes or contributes to expression of a phenotypic trait.
- a marker locus can be used to monitor segregation of alleles at a locus, such as a QTL, that are genetically or physically linked to the marker locus.
- a “marker allele,” alternatively an “allele of a marker locus,” is one of a plurality of polymorphic nucleotide sequences found at a marker locus in a population that is polymorphic for the marker locus.
- markers include restriction fragment length polymorphism (RFLP) markers, amplified fragment length polymorphism (AFLP) markers, single nucleotide polymorphisms (SNPs), microsatellite markers (e.g. SSRs), sequence-characterized amplified region (SCAR) markers, cleaved amplified polymorphic sequence (CAPS) markers or isozyme markers or combinations of the markers described herein which defines a specific genetic and chromosomal location.
- RFLP restriction fragment length polymorphism
- AFLP amplified fragment length polymorphism
- SNPs single nucleotide polymorphisms
- SCAR sequence-characterized amplified region
- CAPS cleaved amplified polymorphic sequence
- modified Cannabis plant or “modified plant” is not a naturally occurring plant.
- offspring or “progeny” refer to a plant resulting from vegetative or sexual reproduction from one or more parent plants.
- an offspring/progeny plant may be obtained by cloning or selfing of a parent plant or by crossing two parent plants.
- An F1 is a first-generation offspring produced from parents at least one of which is used for the first time as donor of a trait, while offspring of second generation (F2) or subsequent generations (F3, F4, etc.) are specimens produced from selfings of F1’s, F2’s etc.
- An F1 may thus be (and usually is) a hybrid resulting from a cross between two true breeding parents (true-breeding is homozygous for a trait), while an F2 may be (and usually is) an offspring resulting from self-pollination.
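The F1 and F2 generations described above can be illustrated with a single-locus Punnett square. The sketch assumes simple Mendelian segregation at one biallelic locus, with `A` and `a` as placeholder allele names. It is a textbook simplification, not a model of any marker in this disclosure.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent1: str, parent2: str) -> dict[str, Fraction]:
    """Expected offspring genotype frequencies at one biallelic locus."""
    # Each parent contributes one of its two alleles with equal probability.
    counts = Counter(
        "".join(sorted(g1 + g2)) for g1, g2 in product(parent1, parent2)
    )
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


f1 = cross("AA", "aa")  # cross between two true-breeding parents
f2 = cross("Aa", "Aa")  # selfing an F1 plant

print("F1:", f1)
print("F2:", f2)
```

Under these assumptions every F1 plant is heterozygous, and selfing the F1 gives the familiar 1:2:1 genotype ratio in the F2.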
- Plant tissue refers to any tissue of a plant, including but not limited to, tissue from an embryo, shoot, root, stem, seed, stipule, leaf, trichome, petal, flower bud, flower, ovule, bract, branch, petiole, internode, bark, pubescence, tiller, rhizome, frond, blade, pollen, and stamen.
- a plant tissue or plant organ may be a seed, protoplast, callus, or any other group of plant cells that is organized into a structural or functional unit.
- a plant cell or tissue culture may be capable of regenerating a plant having the physiological and morphological characteristics of the plant from which the cell or tissue was obtained, and of regenerating a plant having substantially the same genotype as the plant.
- Regenerable cells in a plant cell or tissue culture may be embryos, protoplasts, meristematic cells, callus, pollen, leaves, anthers, roots, root tips, silk, flowers, kernels, ears, cobs, husks, or stalks.
- Plant parts include harvestable parts and parts useful for propagation of progeny plants. Plant parts useful for propagation include, for example and without limitation: seed; fruit; a cutting; a seedling; a tuber; and a rootstock.
- a harvestable part of a plant may be any useful part of a plant, including, for example and without limitation: flower; pollen; seedling; tuber; leaf; stem; fruit; seed; and root.
- a plant cell is the structural and physiological unit of the plant.
- a plant cell may be in the form of an isolated single cell, or an aggregate of cells (e.g., a friable callus and a cultured cell), and may be part of a higher organized unit (e.g., a plant tissue, plant organ, and plant).
- a plant cell may be a protoplast, a gamete producing cell, or a cell or collection of cells that can regenerate into a whole plant.
- a seed which comprises multiple plant cells and is capable of regenerating into a whole plant, is considered a “plant cell.” Described herein are plants in the genus of Cannabis and plants derived therefrom, which can be produced by asexual or sexual reproduction.
- polymorphism refers to a difference in the nucleotide or amino acid sequence of a given region as compared to a nucleotide or amino acid sequence in a homologous region of another individual, in particular, a difference in the nucleotide or amino acid sequence of a given region which differs between individuals of the same species.
- a polymorphism is generally defined in relation to a reference sequence. Unless indicated otherwise, the reference sequence is the Cannabis Abacus reference genome (version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1) or CDS produced from the Cannabis Abacus reference genome.
- Polymorphisms include single nucleotide differences, differences in sequence of more than one nucleotide, and single or multiple nucleotide insertions, inversions and deletions; as well as single amino acid differences, differences in sequence of more than one amino acid, and single or multiple amino acid insertions, inversions, and deletions.
- polynucleotide “polynucleotide sequence,” “nucleotide sequence,” “nucleic acid sequence,” and “nucleic acid fragment,” are used interchangeably. These terms encompass polymers composed of nucleotide units (ribonucleotides, deoxyribonucleotides, related naturally occurring structural variants, and synthetic non-naturally occurring analogs thereof).
- oligonucleotide typically refers to short polynucleotides, generally no greater than 150 nucleotides, for example, no greater than 125 nucleotides, no greater than 100 nucleotides, no greater than 75 nucleotides, no greater than 50 nucleotides, or no greater than 25 nucleotides. It will be understood that when a nucleic acid sequence is represented as a DNA sequence (i.e., A, T, G, C), this also includes an RNA sequence (i.e., A, U, G, C) in which “U” replaces “T.” Nucleic acids can be single- or double-stranded.
- nucleic acids include cDNA, genomic DNA, synthetic DNA, RNA, or mixtures thereof.
- polypeptide or protein refers to a polymer of amino acid residues. The terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers.
- amino acid residue or “amino acid” includes reference to an amino acid that is incorporated into a protein, polypeptide, or peptide.
- the amino acid can be a naturally occurring amino acid and, unless otherwise limited, can encompass known analogs of natural amino acids that can function in a similar manner as naturally occurring amino acids.
- recombinant includes reference to a protein produced using cells that do not have, in their native state, an endogenous copy of the DNA able to express the protein.
- the cells produce the recombinant protein because they have been genetically altered by the introduction of the appropriate isolated nucleic acid sequence.
- the term also includes reference to a cell, or nucleic acid, or vector, that has been modified by the introduction of a heterologous nucleic acid or the alteration of a native nucleic acid to a form not native to that cell, or that the cell is derived from a cell so modified.
- the primer may vary in length depending on the particular conditions and requirements of the application.
- the oligonucleotide primer is typically 15-25 or more nucleotides in length.
- the primer must be of sufficient complementarity to the desired template to prime the synthesis of the desired extension product, that is, to be able to anneal with the desired template strand in a manner sufficient to provide the 3’ hydroxyl moiety of the primer in appropriate juxtaposition for use in the initiation of synthesis by a polymerase or similar enzyme. It is not required that the primer sequence represent an exact complement of the desired template.
- a non-complementary nucleotide sequence may be attached to the 5’ end of an otherwise complementary primer.
- non-complementary bases may be interspersed within the oligonucleotide primer sequence, provided that the primer sequence has sufficient complementarity with the sequence of the desired template strand to functionally provide a template-primer complex for the synthesis of the extension product.
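To make the idea of partial complementarity concrete, the sketch below scores what fraction of a primer's bases pair with a template site when the primer is aligned by its 3’ end. The sequences and function names are hypothetical. The score is a simple base-by-base count and does not say how much complementarity is "sufficient", which the text above leaves to the application.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def complementarity(primer: str, template_site: str) -> float:
    """Fraction of primer bases that pair with the template site.

    Both sequences are written 5'->3'. The primer is aligned by its 3' end,
    so extra bases at its 5' end (a non-complementary tail) count as unpaired.
    """
    if not primer:
        raise ValueError("primer must not be empty")
    perfect = reverse_complement(template_site)
    # Compare base by base, walking back from the 3' ends.
    matches = sum(
        p == q for p, q in zip(reversed(primer.upper()), reversed(perfect))
    )
    return matches / len(primer)


# Hypothetical template site and primers, for illustration only.
site = "ATGCCGTTAGCATCGGA"
exact = reverse_complement(site)
tailed = "GGG" + exact  # non-complementary 5' tail

print(complementarity(exact, site))
print(complementarity(tailed, site))
```

The tailed primer scores below 1.0 even though its 3’ portion pairs perfectly, which matches the point that an exact complement is not required.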
- product as used in reference to a Cannabis product, is a composition including Cannabis (or an extract thereof).
- Products include, but are not limited to: a kief, hashish, bubble hash, an edible product, solvent reduced oil, sludge, e-juice, tincture, or other compositions including Cannabis (e.g., a Cannabis plant disclosed herein, or an extract thereof).
- promoter refers to a nucleic acid control sequence that directs transcription of a nucleic acid.
- a promoter includes necessary nucleic acid sequences near the start site of transcription, and may include distal enhancer or repressor elements.
- a “constitutive promoter” is a promoter that is continuously active and is not subject to regulation by external signals or molecules. In contrast, the activity of an “inducible promoter” is regulated by an external signal or molecule (for example, a transcription factor).
- Exemplary promoters include pol III promoters (e.g., U6), pol II promoter, ubiquitin promoter, Cauliflower Mosaic Virus (CaMV) 35S promoter, or RuBisCO promoter.
- the terms “initiate transcription,” “initiate expression,” “drive transcription,” and “drive expression” are used interchangeably herein and all refer to the primary function of a promoter.
- purified as used herein relates to the isolation of a molecule or compound in a form that is substantially free of contaminants normally associated with the molecule or compound in a native or natural environment, or substantially enriched in concentration relative to other compounds present when the compound is first formed, and means having been increased in purity as a result of being separated from other components of the original composition.
- purified nucleic acid is used herein to describe a nucleic acid sequence which has been separated, produced apart from, or purified away from other biological compounds including, but not limited to polypeptides, lipids and carbohydrates, while effecting a chemical or functional change in the component (e.g., a nucleic acid may be purified from a chromosome by removing protein contaminants and breaking chemical bonds connecting the nucleic acid to the remaining DNA in the chromosome).
- recombinant refers to a nucleic acid or protein that has a sequence made by an artificial combination of two otherwise separated segments of sequence (e.g., a “chimeric” sequence). This artificial combination can be accomplished by chemical synthesis or by manipulation of isolated segments of nucleic acids, for example, by standard molecular biology techniques (e.g., cloning).
- a “recombinant expression construct” refers to an expression vector into which a nucleic acid sequence or fragment can be moved. Preferably, it is a plasmid vector, or a fragment thereof, comprising a promoter. The choice of plasmid vector is dependent upon the method that will be used to transform host plants.
- the genetic elements that must be present on the plasmid vector to successfully transform, select and propagate host cells containing the chimeric gene are dependent on the specific transformation method. Different independent transformation events typically result in different levels and patterns of expression and thus multiple events must be screened to obtain lines displaying the desired expression level and pattern. Such screening may be accomplished by PCR and Southern analysis of DNA, RT-PCR and Northern analysis of mRNA expression, Western analysis of protein expression, or phenotypic analysis.
- reference plant or “reference genome” refers to a reference sequence that genetic markers or sequences of a test sample can be compared to in order to detect a modification of the sequence in the test sample.
- the reference plant or genome is Abacus (Csat_AbacusV2, NCBI assembly accession GCA_025232715.1).
- sequence identity or “percent identity” are used interchangeably to refer to a sequence comparison based on identical matches between correspondingly identical positions in two or more amino acid or nucleotide sequences that are being compared.
- the percent identity refers to the extent to which two optimally aligned polynucleotide or peptide sequences are invariant throughout a window of alignment of components, e.g., nucleotides or amino acids.
- Hybridization experiments and mathematical algorithms known in the art may be used to determine percent identity.
- Many mathematical algorithms exist as sequence alignment computer programs known in the art that calculate percent identity. These programs may be categorized as either global sequence alignment programs or local sequence alignment programs.
- the NCBI Basic Local Alignment Search Tool (BLAST) is often used and is available from several sources, including the National Center for Biotechnology Information (blast.ncbi.nlm.nih.gov/Blast.cgi).
- BLAST Basic Local Alignment Search Tool
- Various types of BLAST are available, for example, blastp, blastn, blastx, tblastn and tblastx.
- a description of how to determine sequence identity using this program is available on the NCBI website and other resources.
- percent sequence identity is determined by using BLAST with default parameters.
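For intuition only, percent identity over an alignment window can be sketched as the share of aligned columns holding identical residues. This is not BLAST and does not perform the alignment itself: it assumes the two sequences are already aligned to equal length with `-` marking gaps, and both the function name and the example sequences are made up.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns with identical residues (gaps never match)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment must not be empty")
    identical = sum(
        a == b and a != "-"
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
    )
    return 100.0 * identical / len(aligned_a)


# Hypothetical pre-aligned sequences, for illustration only.
print(percent_identity("ACGT", "ACGA"))
print(percent_identity("AC-GT", "ACAGT"))
```

Gap columns count toward the window length here, so an insertion or deletion lowers the score just as a mismatch does.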
- nucleic acid fragments wherein changes in one or more nucleotide bases do not affect the ability of the nucleic acid fragment to mediate gene expression or produce a certain phenotype. These terms also refer to modifications of nucleic acid fragments, such as deletion or insertion of one or more nucleotides that do not substantially alter the functional properties of the resulting nucleic acid fragment relative to the initial, unmodified fragment.
- a “substantially homologous sequence” refers to variants of the disclosed sequences such as those that result from site-directed mutagenesis, as well as synthetically derived sequences.
- a substantially homologous sequence also refers to fragments of a particular promoter nucleotide sequence disclosed herein that operate to promote the constitutive expression of an operably linked heterologous nucleic acid fragment.
- These promoter fragments will include at least about 20 contiguous nucleotides, for example, at least 50 contiguous nucleotides, at least 75 contiguous nucleotides, or at least 100 contiguous nucleotides of the particular promoter nucleotide sequence disclosed herein.
- the nucleotides of such fragments will usually comprise the TATA recognition sequence of the particular promoter sequence.
- Such fragments may be obtained by use of restriction enzymes to cleave the naturally occurring promoter nucleotide sequences disclosed herein; by synthesizing a nucleotide sequence from the naturally occurring promoter DNA sequence; or may be obtained through the use of PCR technology. Functional variants of these promoter fragments, such as those resulting from site-directed mutagenesis, are encompassed by the present disclosure.
- single nucleotide polymorphism refers to a change in which a single base in the DNA differs from the base at the corresponding position of a reference genome or sequence.
- target region refers to a nucleotide sequence that resides at a specific chromosomal location.
- the “target region” or “nucleic acid target” can be specifically recognized by a probe.
- terpene refers to a class of secondary metabolite typically found in plants. Terpenes are hydrocarbons with small isoprene units linked to one another to form chains. Two types of terpenes/terpenoids commonly found in Cannabis include monoterpenes (10C; two isoprenes) and sesquiterpenes (15C; three isoprenes).
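The carbon counts quoted above follow directly from the number of five-carbon isoprene units. A small sketch of that arithmetic, with hypothetical names, is given below.

```python
ISOPRENE_CARBONS = 5  # each isoprene unit contributes five carbons

# Isoprene units per terpene class, as given in the definition above.
ISOPRENE_UNITS = {"monoterpene": 2, "sesquiterpene": 3}


def backbone_carbons(terpene_class: str) -> int:
    """Carbon count of the terpene backbone for a given class."""
    return ISOPRENE_CARBONS * ISOPRENE_UNITS[terpene_class]


for name in ISOPRENE_UNITS:
    print(name, backbone_carbons(name))
```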
- Exemplary monoterpenes include limonene (e.g., L-limonene or D-limonene), myrcene (also referred to as β-myrcene), pinene (e.g., α-pinene or β-pinene), camphene, linalool, terpinolene, terpinene (e.g., α-terpinene or γ-terpinene), and ocimene (also referred to as β-ocimene).
- limonene e.g., L-limonene or D-limonene
- myrcene also referred to as β-myrcene
- pinene e.g., α-pinene or β-pinene
- camphene linalool
- terpinolene terpinene
- terpinene e.g., a-terpinene or y
- Exemplary sesquiterpenes include nerolidol (e.g., cis-nerolidol and/or trans-nerolidol), humulene (also referred to as α-humulene), guaiol, and caryophyllene (also referred to as β-caryophyllene).
- Terpene biosynthesis starts with common isoprenoid diphosphate precursors (5 carbon) through two biosynthetic pathways, the plastidial methylerythritol phosphate (MEP) pathway and the cytosolic mevalonate (MEV) pathway. Both the MEP and MEV pathways provide isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP), which are condensed into longer-chain isoprenoid diphosphates that include geranyl diphosphate (GPP) and farnesyl diphosphate (FPP).
- Linear isoprenoid diphosphates are substrates for monoterpene synthases (mono-TPS) and sesquiterpene synthases (sesqui-TPS), respectively, which diversify these compounds through enzymatic modifications, such as hydroxylation, dehydrogenation, acylation, and glycosylation, resulting in the production of diverse mono- and sesquiterpenes.
- GPP is also a building block of cannabinoid biosynthesis.
- transformant refers to a cell, tissue or organism that has undergone transformation.
- the original transformant is designated as “T0” or “T₀.”
- Selfing the T0 produces a first transformed generation designated as “T1” or “T₁.”
- transgenic refers to any cell, cell line, callus, tissue, plant part or plant, the genome of which has been altered by the presence of a heterologous nucleic acid, such as a recombinant DNA construct, including those initial transgenic events as well as those created by sexual crosses or asexual propagation from the initial transgenic event.
- the term “transgenic” as used herein does not encompass the alteration of the genome (chromosomal or extra-chromosomal) by conventional plant breeding methods or by naturally occurring events such as random cross-fertilization, non-recombinant viral infection, non-recombinant bacterial transformation, non-recombinant transposition, or spontaneous mutation.
- a “transgene” is a gene that has been introduced into the genome by a transformation procedure.
- “variety” means a plant grouping within a single botanical taxon of the lowest known rank, which grouping, irrespective of whether the conditions for the grant of a breeder’s right are fully met, can be i) defined by the expression of the characteristics resulting from a given genotype or combination of genotypes, ii) distinguished from any other plant grouping by the expression of at least one of the said characteristics and iii) considered as a unit with regard to its suitability for being propagated unchanged.
- vector refers to a nucleic acid molecule that can be introduced into a host cell (for example, by transformation), thereby producing a transformed host cell.
- a vector can include nucleic acid sequences that permit it to replicate in a host cell, such as an origin of replication.
- Recombinant DNA vectors are vectors containing recombinant DNA.
- a vector can also include one or more selectable marker genes and other genetic elements. Vectors are often DNA plasmids; however, they can also be viral vectors (DNA or RNA), cosmids, or artificial chromosomes.
- CBDVA cannabidivarinic acid
- Disclosed are methods of producing one or more Cannabis plants having modified terpene content comprising: (i) analyzing one or more genetic markers in a nucleic acid sample from a Cannabis plant or its germplasm; (ii) detecting one or more markers that indicate modified terpene content; (iii) crossing the Cannabis plant comprising the one or more markers; and (iv) obtaining one or more progeny plants comprising the one or more markers; wherein the one or more progeny plants have modified terpene content relative to a control, thereby producing one or more Cannabis plants having modified terpene content.
- Also disclosed are methods of identifying or selecting a Cannabis plant having modified terpene content comprising: (i) analyzing one or more genetic markers in a nucleic acid sample from the Cannabis plant or its germplasm; (ii) detecting one or more genetic markers that indicate modified terpene content; and (iii) identifying or selecting the Cannabis plant, thereby identifying or selecting the Cannabis plant having modified terpene content.
- the Cannabis plant having modified terpene content is selected for further analysis, propagation, crossing, or to make a product.
- the method further includes crossing the Cannabis plant having modified terpene content and producing one or more progeny plants having modified terpene content.
- Terpene content can be measured using standard analytical techniques, e.g., gas chromatography and/or HPLC with mass spectrometry. Modified terpene content can be determined, for example, as a difference in terpene content relative to a control/reference, e.g., a Cannabis plant not having the one or more markers that indicate modified terpene content.
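The comparison against a control described above can be sketched in a few lines of Python. This is only an illustration: the function names and the example values (1.5% and 1.0% by weight) are hypothetical and are not taken from the disclosure, which does not prescribe a particular calculation.

```python
def absolute_difference(sample_pct: float, control_pct: float) -> float:
    """Difference in terpene content (percentage points) between sample and control."""
    return sample_pct - control_pct


def relative_change(sample_pct: float, control_pct: float) -> float:
    """Fractional change in terpene content of the sample relative to the control."""
    if control_pct <= 0:
        raise ValueError("control terpene content must be positive")
    return (sample_pct - control_pct) / control_pct


# Hypothetical measurements, both in % by weight.
sample, control = 1.5, 1.0
print(f"difference: {absolute_difference(sample, control):+.2f} percentage points")
print(f"relative change: {relative_change(sample, control):+.0%}")
```

A positive value corresponds to increased terpene content relative to the control, and a negative value to decreased content.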
- the terpene content is modified in flowers or inflorescence, for example, in female flowers or female inflorescence.
- the terpene content is modified in trichomes (e.g., glandular trichomes).
- the terpene content is modified in leaf or other vegetative tissue.
- the modified terpene content is an increase in terpene content relative to a suitable control (e.g., a sample from a Cannabis plant not having the one or more markers that indicate modified terpene content).
- the Cannabis plant is Cannabis sativa, Cannabis indica, or Cannabis ruderalis.
- the Cannabis plant is Cannabis sativa.
- Modified terpene content as used herein can include, for example, modified total terpenes, total monoterpenes, beta-myrcene, total sesquiterpenes, alpha-pinene, beta-pinene, alpha-terpinene, gamma-terpinene, terpinolene, beta-ocimene, camphene, D-limonene, linalool, trans-nerolidol, alpha-humulene, beta-caryophyllene, and/or guaiol levels.
- modified terpene content includes or consists of modified total terpenes; modified total monoterpenes; modified total monoterpenes absent beta-myrcene; modified total sesquiterpenes; modified alpha-pinene; modified beta-pinene; modified alpha-terpinene, gamma-terpinene, and terpinolene; modified beta-myrcene to total monoterpene ratio; modified beta-ocimene; modified camphene and D-limonene; modified linalool and trans-nerolidol; modified alpha-humulene and beta-caryophyllene; and/or modified guaiol.
- the modified terpene content is modified total terpene content. In some aspects, the modified terpene content is modified total monoterpene content. In some aspects, the modified terpene content is modified total monoterpenes content absent beta-myrcene. In some aspects, the modified terpene content is modified total sesquiterpene content. In some aspects, the modified terpene content is modified alpha-pinene content. In some aspects, the modified terpene content is modified beta-pinene content. In some aspects, the modified terpene content is modified alpha-terpinene, gamma-terpinene, and terpinolene content.
- the modified terpene content is modified beta-myrcene to monoterpene content ratio ((beta-myrcene + 1)/((total monoterpenes - beta-myrcene) + 1)).
- the modified terpene content is modified beta-ocimene content.
- the modified terpene content is modified camphene and D-limonene content.
- the modified terpene content is modified linalool and trans-nerolidol content.
- the modified terpene content is modified alpha-humulene and beta-caryophyllene content.
- the modified terpene content is modified guaiol content.
- a plant produced or selected by a method disclosed herein includes a terpene content (e.g., total terpenes, total monoterpenes, beta-myrcene, total sesquiterpenes, alpha-pinene, beta-pinene, alpha-terpinene, gamma-terpinene, terpinolene, beta-ocimene, camphene, D-limonene, linalool, trans-nerolidol, alpha-humulene, beta-caryophyllene, and/or guaiol content) of at least 0.1% by weight, for example, at least 0.2%, at least 0.3%, at least 0.4%, at least 0.5%, at least 0.6%, at least 0.7%, at least 0.8%, at least 0.9%, at least 1.0%, at least 1.2%, at least 1.4%, at least 1.5%, at least 1.75%, at least 2.0%, at least 2.5%, at least
- the plant has a terpene content (e.g., total terpenes, total monoterpenes, beta-myrcene, total sesquiterpenes, alpha-pinene, beta-pinene, alpha-terpinene, gamma-terpinene, terpinolene, beta-ocimene, camphene, D-limonene, linalool, trans-nerolidol, alpha-humulene, beta-caryophyllene, and/or guaiol content) of at least 0.2% by weight.
- the plant has a terpene content (e.g., total terpenes, total monoterpenes, beta-myrcene, total sesquiterpenes, alpha-pinene, beta-pinene, alpha-terpinene, gamma-terpinene, terpinolene, beta-ocimene, camphene, D-limonene, linalool, trans-nerolidol, alpha-humulene, beta-caryophyllene, and/or guaiol content) of at least 1.0% by weight.
- the plant has a terpene content (e.g., total terpenes, total monoterpenes, beta-myrcene, total sesquiterpenes, alpha-pinene, beta-pinene, alpha-terpinene, gamma-terpinene, terpinolene, beta-ocimene, camphene, D-limonene, linalool, trans-nerolidol, alpha-humulene, beta-caryophyllene, and/or guaiol content) of at least 3% by weight.
- the plant has a terpene content (e.g., total terpenes, total monoterpenes, beta-myrcene, total sesquiterpenes, alpha-pinene, beta-pinene, alpha-terpinene, gamma-terpinene, terpinolene, beta-ocimene, camphene, D-limonene, linalool, trans-nerolidol, alpha-humulene, beta-caryophyllene, and/or guaiol content) of at least 5% by weight.
- a plant produced or selected by a method disclosed herein includes a terpene content (e.g., total terpenes, total monoterpenes, beta-myrcene, total sesquiterpenes, alpha-pinene, beta-pinene, alpha-terpinene, gamma-terpinene, terpinolene, beta-ocimene, camphene, D-limonene, linalool, trans-nerolidol, alpha-humulene, beta-caryophyllene, and/or guaiol content) of 0.1% to 10% by weight, for example, 0.1% to 9%, 0.1% to 8%, 0.1% to 7%, 0.1% to 6%, 0.1% to 5%, 0.1% to 4%, 0.1% to 3%, 0.1% to 2%, 0.1% to 1%, 0.2% to 10%, 0.2% to 9%, 0.2% to 8%, 0.2% to 7%, 0.
- the terpene content (e.g., total terpenes, total monoterpenes, beta-myrcene, total sesquiterpenes, alpha-pinene, beta-pinene, alpha-terpinene, gamma-terpinene, terpinolene, beta-ocimene, camphene, D-limonene, linalool, trans-nerolidol, alpha-humulene, beta-caryophyllene, and/or guaiol content) is 0.5% to 7% by weight.
- the terpene content (e.g., total terpenes, total monoterpenes, beta-myrcene, total sesquiterpenes, alpha-pinene, beta-pinene, alpha-terpinene, gamma-terpinene, terpinolene, beta-ocimene, camphene, D-limonene, linalool, trans-nerolidol, alpha-humulene, beta-caryophyllene, and/or guaiol content) is 0.5% to 3% by weight.
- the terpene content (e.g., total terpenes, total monoterpenes, beta-myrcene, total sesquiterpenes, alpha-pinene, beta-pinene, alpha-terpinene, gamma-terpinene, terpinolene, beta-ocimene, camphene, D-limonene, linalool, trans-nerolidol, alpha-humulene, beta-caryophyllene, and/or guaiol content) is 1% to 7% by weight.
- a plant produced or selected by the methods disclosed herein includes a total terpene, total monoterpene, and/or total sesquiterpene content of at least 0.1% by weight, for example, at least 0.2%, at least 0.5%, at least 0.75%, at least 1.0%, at least 1.5%, at least 2.0%, at least 2.5%, at least 3%, at least 3.5%, at least 4%, at least 4.5%, at least 5%, at least 5.5%, at least 6%, at least 6.5%, at least 7%, at least 7.5%, at least 8%, at least 8.5%, at least 9%, at least 9.5%, at least 10%, at least 11%, at least 12%, at least 13%, at least 14%, at least 15%, at least 16%, at least 17%, at least 18%, at least 19%, at least 20%, at least 25%, or more, by weight in at least one plant part (e.g., leaves, flowers, or trichomes).
- the plant has a total terpene, total monoterpene, and/or total sesquiterpene content of at least 0.2% by weight. In another non-limiting example, the plant has a total terpene, total monoterpene, and/or total sesquiterpene content of at least 1% by weight. In a further example, the plant has a total terpene, total monoterpene, and/or total sesquiterpene content of at least 3% by weight. In another example, the plant has a total terpene, total monoterpene, and/or total sesquiterpene content of at least 5% by weight.
- a plant produced or selected by a method disclosed herein includes a total terpene, total monoterpene, and/or total sesquiterpene of 0.1% to 10% by weight, for example, 0.1% to 9%, 0.1% to 8%, 0.1% to 7%, 0.1% to 6%, 0.1% to 5%, 0.1% to 4%, 0.1% to 3%, 0.1% to 2%, 0.1% to 1%, 0.2% to 10%, 0.2% to 9%, 0.2% to 8%, 0.2% to 7%, 0.2% to 6%, 0.2% to 5%, 0.2% to 4%, 0.2% to 3%, 0.2% to 2%, 0.2% to 1%, 0.5% to 10%, 0.5% to 9%, 0.5% to 8%, 0.5% to 7%, 0.5% to 6%, 0.5% to 5%, 0.5% to 4%, 0.5% to 3%, 0.5% to 2%, 0.5% to 1%, 1% to 10%, 1% to 9%, 1% to 8%, 1% to 7%, 1% to 1% to
- the total terpene, total monoterpene, and/or total sesquiterpene is 0.5% to 7%. In some aspects, the total terpene, total monoterpene, and/or total sesquiterpene is 0.5% to 5%. In some aspects, the total terpene, total monoterpene, and/or total sesquiterpene is 1% to 7%.
- a measure of % by weight in a method disclosed herein can be % by dry weight or % by fresh weight. In several aspects, the % by weight is % by dry weight.
- the modified beta-myrcene to monoterpene content ratio ((beta-myrcene + 1)/((total monoterpenes - beta-myrcene) + 1)) is at least 0.75, for example, at least 0.8, at least 0.9, at least 1.0, at least 1.1, at least 1.2, at least 1.3, at least 1.4, at least 1.5, at least 1.6, at least 1.7, at least 1.8, at least 1.9, at least 2.0, or more.
- the modified beta-myrcene to monoterpene content ratio is 0.75 to 5, for example, 0.75 to 4, 0.75 to 3, 0.75 to 2, 0.75 to 1.5, 0.75 to 1.4, 0.75 to 1.3, 0.75 to 1.2, 0.75 to 1.1, 0.75 to 1.0, 0.8 to 4, 0.8 to 3, 0.8 to 2, 0.8 to 1.5, 0.8 to 1.4, 0.8 to 1.3, 0.8 to 1.2, 0.8 to 1.1, 0.8 to 1.0, 1 to 4, 1 to 3, 1 to 2, 1 to 1.5, 1 to 1.4, 1 to 1.3, 1 to 1.2, 1 to 1.1, 1.2 to 4, 1.2 to 3, 1.2 to 2, 1.2 to 1.5, 1.2 to 1.4, or 1.2 to 1.3. In some aspects, the modified beta-myrcene to monoterpene content ratio is about 0.75 to 1.3. In some aspects, the modified beta-myrcene to monoterpene content ratio is about 1.0 to about 1.4.
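The ratio defined above, (beta-myrcene + 1)/((total monoterpenes - beta-myrcene) + 1), can be computed as in the following sketch. The function name, the assumption that both inputs are expressed in the same unit (e.g., % by weight), and the example values are illustrative only and are not stated in the disclosure.

```python
def myrcene_to_monoterpene_ratio(beta_myrcene: float, total_monoterpenes: float) -> float:
    """Return (beta-myrcene + 1) / ((total monoterpenes - beta-myrcene) + 1).

    Both arguments are assumed to share one unit; total monoterpenes is assumed
    to include beta-myrcene, so it cannot be smaller than beta-myrcene.
    """
    if beta_myrcene < 0 or total_monoterpenes < beta_myrcene:
        raise ValueError("expected 0 <= beta_myrcene <= total_monoterpenes")
    return (beta_myrcene + 1.0) / ((total_monoterpenes - beta_myrcene) + 1.0)


# Hypothetical sample: 0.6 beta-myrcene out of 1.0 total monoterpenes.
ratio = myrcene_to_monoterpene_ratio(0.6, 1.0)
print(f"ratio = {ratio:.3f}")
print("within 1.0 to 1.4:", 1.0 <= ratio <= 1.4)
```

The +1 term in the numerator and denominator keeps the ratio finite when beta-myrcene is the only monoterpene present, and the ratio equals 1.0 when beta-myrcene accounts for exactly half of total monoterpenes.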
- the plant part can be any part of the plant selected or produced by the methods disclosed herein.
- the plant part is a flower (e.g., a female flower) or inflorescence tissue.
- the plant part is a trichome (e.g., glandular trichomes).
- the plant part is a leaf or other vegetative tissue.
- the one or more genetic markers that indicate modified terpene content include or consist of one or more genetic markers disclosed herein, for example, one or more genetic markers described in Table 15.
- the genetic marker is a polymorphism (e.g., SNP) found within one or more of the following haplotypes:
- the methods disclosed herein detect a haplotype associated with modified terpene content, or a haplotype that contains a terpene trait locus.
- the genetic marker is genetically linked to a terpene trait locus.
- analyzing or detecting one or more genetic markers that indicate modified terpene content includes analyzing or detecting at least one SNP disclosed herein, for example, in Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7, Table 8, Table 9, Table 10, Table 11, Table 12, Table 13, Table 15, or Table 16, respectively.
- analyzing or detecting one or more genetic markers that indicate modified terpene content includes analyzing or detecting at least two SNPs described in Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7, Table 8, Table 9, Table 10, Table 11, Table 12, Table 13, Table 15, or Table 16, respectively.
- analyzing or detecting one or more genetic markers that indicate modified terpene content includes analyzing or detecting at least three SNPs described in Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7, Table 8, Table 9, Table 10, Table 11, Table 12, Table 13, Table 15, or Table 16, respectively. In some aspects, analyzing or detecting one or more genetic markers that indicate modified terpene content includes analyzing or detecting at least five SNPs described in Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7, Table 8, Table 9, Table 10, Table 11, Table 12, Table 13, Table 15, or Table 16, respectively.
- analyzing or detecting one or more genetic markers that indicate modified terpene content includes analyzing or detecting all of the SNPs described in Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7, Table 8, Table 9, Table 10, Table 11, Table 12, Table 13, Table 15, or Table 16, respectively.
- Combinations of SNP markers disclosed herein can be useful, for example, for screening increased levels of multiple specific terpenes of interest. While any combination of SNPs disclosed herein could be useful, an exemplary subset of SNPs is provided in Table 16.
- analyzing or detecting one or more genetic markers in any of the methods disclosed here includes analyzing or detecting at least one SNP from the list of SNPs disclosed in Table 16.
- analyzing or detecting one or more genetic markers in any of the methods disclosed here includes analyzing or detecting at least two SNPs from the list of SNPs disclosed in Table 16.
- analyzing or detecting one or more genetic markers in any of the methods disclosed here includes analyzing or detecting at least three SNPs from the list of SNPs disclosed in Table 16.
- analyzing or detecting one or more genetic markers in any of the methods disclosed here includes analyzing or detecting at least four SNPs from the list of SNPs disclosed in Table 16. In some aspects, analyzing or detecting one or more genetic markers in any of the methods disclosed here includes analyzing or detecting at least five SNPs from the list of SNPs disclosed in Table 16. In some aspects, analyzing or detecting one or more genetic markers in any of the methods disclosed here includes analyzing or detecting at least six SNPs from the list of SNPs disclosed in Table 16. In some aspects, analyzing or detecting one or more genetic markers in any of the methods disclosed here includes analyzing or detecting at least seven SNPs from the list of SNPs disclosed in Table 16.
- analyzing or detecting one or more genetic markers in any of the methods disclosed here includes analyzing or detecting at least eight SNPs from the list of SNPs disclosed in Table 16. In some aspects, analyzing or detecting one or more genetic markers in any of the methods disclosed here includes analyzing or detecting at least nine SNPs from the list of SNPs disclosed in Table 16. In some aspects, analyzing or detecting one or more genetic markers in any of the methods disclosed here includes analyzing or detecting all the SNPs from the list of SNPs disclosed in Table 16.
- analyzing one or more genetic markers in the nucleic acid sample includes analyzing one or more of nucleotide positions: 8,871,401; 8,886,933; 9,101,934; 10,446,475; 10,543,062; 10,561,778; 10,633,191; 10,934,458; 11,169,492; or 13,920,896 on chromosome 1;
- the one or more nucleic acid polymorphisms are beneficial polymorphisms associated with increased terpene content in Cannabis.
- analyzing one or more genetic markers in the nucleic acid sample includes analyzing at least 2 genetic markers, for example, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, or more markers.
- analyzing one or more genetic markers in the nucleic acid sample includes analyzing at least 3 genetic markers.
- analyzing one or more genetic markers in the nucleic acid sample includes analyzing at least 5 genetic markers.
- analyzing one or more genetic markers in the nucleic acid sample includes analyzing at least 7 genetic markers.
- analyzing one or more genetic markers in the nucleic acid sample includes analyzing 2 to 50 genetic markers, for example, 2 to 40, 2 to 30, 2 to 20, 2 to 10, 5 to 50, 5 to 40, 5 to 30, 5 to 20, 5 to 10, 10 to 50, 10 to 40, 10 to 30, or 10 to 20 genetic markers.
- 2 to 10 genetic markers e.g., SNPs
- the one or more genetic markers are genetically linked to a terpene trait locus.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing one or more of nucleotide positions: 93,291,929 on chromosome 2; 72,717,623 on chromosome 4; 55,114,152 on chromosome 9; 57,912,635 on chromosome x; and/or 58,545,628 on chromosome x, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified total terpene content.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing one or more of nucleotide positions: 93,291,929 on chromosome 2; 47,140,085 on chromosome 3; 72,717,623 on chromosome 4; 3,807,710 on chromosome 5; 3,842,906 on chromosome 5; 55,475,322 on chromosome 5; and/or 33,592,849 on chromosome 8, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified total monoterpene content.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing one or more of nucleotide positions 1,100,981; 1,840,325; 2,366,529; 2,698,301; 3,081,773; 3,485,895; 3,585,965; 3,599,637; 3,629,225; and/or 4,384,123 on chromosome 5, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing nucleotide position 3,081,773 on chromosome 5, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified total monoterpenes absent beta-myrcene.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing one or more of nucleotide positions 2,774,108; 5,854,661; or 7,307,552 on chromosome 5, or 1,695,817; 1,727,397; 1,960,918; 5,175,087; 14,069,586; 14,329,191; and/or 14,866,064 of chromosome 6 according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified total sesquiterpene content.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing nucleotide position 1,366,137 on chromosome 5, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified beta-pinene content.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing one or more of nucleotide positions 755,967; 1,840,325; 2,302,063; 2,366,529; 2,698,301; 3,485,895; 4,384,123; 11,993,646; 12,418,741; and/or 12,446,524 on chromosome 5 according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing nucleotide position 2,698,301 on chromosome 5, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified alpha-terpinene, gamma-terpinene, and terpinolene content.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing nucleotide position 1,929,134 on chromosome 5, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified monoterpene to beta-myrcene content ratio.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing one or more of nucleotide positions 1,828,050; 2,038,965; 2,120,881; 2,208,629; 2,360,380; 2,364,964; 32,342,917; 32,395,736; 62,911,168; and/or 66,477,802 on chromosome 5 according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing one or more of nucleotide positions 1,331,433; 1,487,633; 1,837,343; 2,288,919; 2,291,467; 2,318,276; 2,774,108; 2,780,345; 4,391,586; and/or 74,391,606 on chromosome 5 according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing nucleotide position 2,288,919 and/or 2,774,108 on chromosome 5, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified camphene and D-limonene content.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing one or more of nucleotide positions 8,871,401; 8,886,933; 9,101,934; 10,446,475; 10,543,062; 10,561,778; 10,633,191; 10,934,458; 11,169,492; and/or 13,920,896 on chromosome 1 according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing nucleotide position 10,633,191 on chromosome 1, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified linalool and trans-nerolidol content.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing one or more of nucleotide positions 1,695,817; 1,727,397; 1,960,918; 5,175,087; 5,468,920; 5,868,053; 6,061,359; 6,120,135; and/or 64,943,914 on chromosome 6; or 14,069,586 on chromosome 8; according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing nucleotide position 5,175,087 on chromosome 6, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified alpha-humulene and beta-caryophyllene content.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing one or more of nucleotide positions 1,220,207; 1,288,012; 1,999,618; 2,012,149; 2,931,923; 3,073,845; 3,091,941; 3,185,660; 6,311,954; and/or 6,589,961 on chromosome 6 according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- analyzing one or more genetic markers in the nucleic acid sample includes or consists of analyzing nucleotide position 6,311,954 on chromosome 6, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified guaiol content.
- detecting one or more genetic markers that indicate modified terpene content includes detecting one or more of the following SNPs:
- Chromosome 2 (k) an A/A or T/A genotype at position 93,291,929;
- analyzing one or more genetic markers comprises analyzing one or more of nucleotide positions: (a) 3,081,773 on chromosome 5; (b) 10,633,191 on chromosome 1; (c) 1,366,137 on chromosome 5; (d) 1,929,134 on chromosome 5; (e) 2,038,965 on chromosome 5; (f) 2,288,919 on chromosome 5; (g) 2,534,579 on chromosome 5; (h) 2,698,301 on chromosome 5; (i) 2,774,108 on chromosome 5; (j) 5,175,087 on chromosome 6; and/or (k) 6,311,954 on chromosome 6.
- analyzing one or more genetic markers comprises analyzing all of nucleotide positions: (a) 3,081,773 on chromosome 5; (b) 10,633,191 on chromosome 1; (c) 1,366,137 on chromosome 5; (d) 1,929,134 on chromosome 5; (e) 2,038,965 on chromosome 5; (f) 2,288,919 on chromosome 5; (g) 2,534,579 on chromosome 5; (h) 2,698,301 on chromosome 5; (i) 2,774,108 on chromosome 5; (j) 5,175,087 on chromosome 6; and/or (k) 6,311,954 on chromosome 6.
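As an illustration of the "one or more" versus "all" language above, the eleven listed positions can be held as a small panel and compared against the loci actually assayed in a sample. This is a hedged sketch: the `PANEL` and `panel_coverage` names and the (chromosome, position) tuple representation are assumptions made for the example, not part of the disclosure.

```python
# Positions (a) through (k) as listed above, as (chromosome, position) pairs.
PANEL = [
    ("5", 3081773), ("1", 10633191), ("5", 1366137), ("5", 1929134),
    ("5", 2038965), ("5", 2288919), ("5", 2534579), ("5", 2698301),
    ("5", 2774108), ("6", 5175087), ("6", 6311954),
]


def panel_coverage(analyzed_loci):
    """Return (panel loci that were analyzed, whether every panel locus was analyzed)."""
    analyzed = set(analyzed_loci)
    covered = [locus for locus in PANEL if locus in analyzed]
    return covered, len(covered) == len(PANEL)


# Hypothetical assay covering one panel locus and one off-panel locus.
covered, complete = panel_coverage([("5", 3081773), ("2", 93291929)])
print(f"{len(covered)} of {len(PANEL)} panel loci analyzed; complete: {complete}")
```

An assay satisfies the "one or more" variant when `covered` is non-empty and the "all" variant only when `complete` is true.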
- detecting one or more genetic markers that indicate modified terpene content includes detecting one or more of the following SNPs: (a) a T/T or C/T genotype at position 3,081,773 on chromosome 5; (b) a C/C or C/A genotype at position 10,633,191 on chromosome 1; (c) an A/A or G/A genotype at position 1,366,137 on chromosome 5; (d) a T/T or T/C genotype at position 1,929,134 on chromosome 5; (e) a C/C or T/C genotype at position 2,038,965 on chromosome 5; (f) a T/T or A/T genotype at position 2,288,919 on chromosome 5; (g) an A/A or A/T genotype at position 2,534,579 on chromosome 5; (h) a G/G or A/G genotype at position 2,698,301 on chromosome 5; (i
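The genotype-based detection step can be sketched as a lookup against the genotypes listed in items (a) through (h) above; later items are truncated in this text and are deliberately left out rather than guessed. The names `INDICATIVE`, `normalize`, and `detect_markers` are hypothetical, and treating allele order as irrelevant (so that "C/T" and "T/C" match) is an assumption about unphased genotype calls, not something the text states.

```python
def normalize(genotype: str) -> tuple:
    """Order-independent form of a diploid call such as 'C/T' (assumes unphased calls)."""
    return tuple(sorted(genotype.upper().split("/")))


# Items (a) through (h) only: (chromosome, position) -> genotypes indicating modified terpene content.
INDICATIVE = {
    ("5", 3081773): {"T/T", "C/T"},
    ("1", 10633191): {"C/C", "C/A"},
    ("5", 1366137): {"A/A", "G/A"},
    ("5", 1929134): {"T/T", "T/C"},
    ("5", 2038965): {"C/C", "T/C"},
    ("5", 2288919): {"T/T", "A/T"},
    ("5", 2534579): {"A/A", "A/T"},
    ("5", 2698301): {"G/G", "A/G"},
}
INDICATIVE_NORM = {
    locus: {normalize(g) for g in genotypes} for locus, genotypes in INDICATIVE.items()
}


def detect_markers(calls: dict) -> list:
    """Return the loci whose observed genotype matches a listed indicative genotype."""
    return [
        locus
        for locus, genotype in calls.items()
        if locus in INDICATIVE_NORM and normalize(genotype) in INDICATIVE_NORM[locus]
    ]


# Hypothetical genotype calls for one plant.
calls = {("5", 3081773): "T/C", ("1", 10633191): "A/A", ("5", 2698301): "G/G"}
hits = detect_markers(calls)
print(f"{len(hits)} indicative marker(s) detected: {hits}")
```

The length of the returned list can then be compared with a minimum marker count (for example, at least 2 or at least 3 detected markers) when applying a selection rule.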
- detecting one or more genetic markers (e.g., polymorphisms) that indicate modified terpene content in the nucleic acid sample includes detecting at least 2 genetic markers, for example, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, or more markers.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting at least 3 genetic markers.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting at least 5 genetic markers.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting at least 7 genetic markers.
- the one or more genetic markers that indicate modified terpene content are beneficial markers that are associated with increased terpene content in Cannabis.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting 2 to 50 genetic markers, for example, 2 to 40, 2 to 30, 2 to 20, 2 to 10, 5 to 50, 5 to 40, 5 to 30, 5 to 20, 5 to 10, 10 to 50, 10 to 40, 10 to 30, or 10 to 20 genetic markers.
- 2 to 10 genetic markers that indicate modified terpene content are detected.
- the one or more genetic markers e.g., SNPs
- the one or more genetic markers that indicate modified terpene content are genetically linked to a terpene trait locus.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting a SNP that indicates modified terpene content at one or more of nucleotide positions: 93,291,929 on chromosome 2; 72,717,623 on chromosome 4; 55,114,152 on chromosome 9; 57,912,635 on chromosome X; and/or 58,545,628 on chromosome X, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified total terpene content.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting a SNP that indicates modified terpene content at one or more of nucleotide positions: 93,291,929 on chromosome 2; 47,140,085 on chromosome 3; 72,717,623 on chromosome 4; 3,807,710 on chromosome 5; 3,842,906 on chromosome 5; 55,475,322 on chromosome 5; and/or 33,592,849 on chromosome 8, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified total monoterpene content.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting a SNP that indicates modified terpene content at one or more of nucleotide positions 1,100,981; 1,840,325; 2,366,529; 2,698,301; 3,081,773; 3,485,895; 3,585,965; 3,599,637; 3,629,225; and/or 4,384,123 on chromosome 5, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- detecting one or more genetic markers that indicate modified terpene content includes detecting a SNP that indicates modified terpene content at nucleotide position 3,081,773 on chromosome 5, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified total monoterpenes absent beta-myrcene.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting a SNP that indicates modified terpene content at one or more of nucleotide positions 2,774,108; 5,854,661; or 7,307,552 on chromosome 5, or 1,695,817; 1,727,397; 1,960,918; 5,175,087; 14,069,586; 14,329,191; and/or 14,866,064 of chromosome 6 according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified total sesquiterpene content.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting a SNP that indicates modified terpene content at one or more of nucleotide positions 516,340; 518,238; 523,626; 1,109,162; 1,366,137; 2,346,000; 2,534,579; 3,247,341; 3,503,143; and/or 3,629,225 on chromosome 5 according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- detecting one or more genetic markers that indicate modified terpene content includes detecting a SNP that indicates modified terpene content at nucleotide position 2,534,579 on chromosome 5, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified alpha-pinene content.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting a SNP that indicates modified terpene content at one or more of nucleotide positions 516,340; 518,238; 523,626; 608,718; 1,109,162; 1,366,137; 1,386,965; 2,003,303; 3,247,341; and/or 3,704,632 on chromosome 5 according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- detecting one or more genetic markers that indicate modified terpene content includes detecting a SNP that indicates modified terpene content at nucleotide position 1,366,137 on chromosome 5, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified beta-pinene content.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting a SNP that indicates modified terpene content at one or more of nucleotide positions 755,967; 1,840,325; 2,302,063; 2,366,529; 2,698,301; 3,485,895; 4,384,123; 11,993,646; 12,418,741; and/or 12,446,524 on chromosome 5 according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- detecting one or more genetic markers that indicate modified terpene content includes detecting a SNP that indicates modified terpene content at nucleotide position 2,698,301 on chromosome 5, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified alpha-terpinene, gamma-terpinene, and terpinolene content.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting a SNP that indicates modified terpene content at one or more of nucleotide positions 330,918; 1,353,878; 1,745,101; 1,828,050; 1,929,134; 2,072,869; 2,339,956; 3,081,773; 3,564,387; and/or 3,585,965 on chromosome 5 according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- detecting one or more genetic markers that indicate modified terpene content includes detecting a SNP that indicates modified terpene content at nucleotide position 1,929,134 on chromosome 5, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified beta-myrcene to total monoterpene content ratio.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting a SNP that indicates modified terpene content at one or more of nucleotide positions 1,828,050; 2,038,965; 2,120,881; 2,208,629; 2,360,380; 2,364,964; 32,342,917; 32,395,736; 62,911,168; and/or 66,477,802 on chromosome 5 according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- detecting one or more genetic markers that indicate modified terpene content includes detecting a SNP that indicates modified terpene content at nucleotide position 2,038,965 on chromosome 5, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified beta-ocimene content.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting a SNP that indicates modified terpene content at one or more of nucleotide positions 1,331,433; 1,487,633; 1,837,343; 2,288,919; 2,291,467; 2,318,276; 2,774,108; 2,780,345; 4,391,586; and/or 74,391,606 on chromosome 5 according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- detecting one or more genetic markers that indicate modified terpene content includes detecting a SNP that indicates modified terpene content at nucleotide position 2,288,919 and/or 2,774,108 on chromosome 5, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified camphene and D-limonene content.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting a SNP that indicates modified terpene content at one or more of nucleotide positions 8,871,401; 8,886,933; 9,101,934; 10,446,475; 10,543,062; 10,561,778; 10,633,191; 10,934,458; 11,169,492; and/or 13,920,896 on chromosome 1 according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- detecting one or more genetic markers that indicate modified terpene content includes detecting a SNP that indicates modified terpene content at nucleotide position 10,633,191 on chromosome 1, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified linalool and trans-nerolidol content.
- detecting one or more genetic markers that indicate modified terpene content in the nucleic acid sample includes detecting a SNP that indicates modified terpene content at one or more of nucleotide positions 1,695,817; 1,727,397; 1,960,918; 5,175,087; 5,468,920; 5,868,053; 6,061,359; 6,120,135; and/or 64,943,914 on chromosome 6; or 14,069,586 on chromosome 8; according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- detecting one or more genetic markers that indicate modified terpene content includes detecting a SNP that indicates modified terpene content at nucleotide position 5,175,087 on chromosome 6, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified alpha-humulene and beta-caryophyllene content.
- detecting one or more genetic markers that indicate modified terpene content includes detecting a SNP that indicates modified terpene content at nucleotide position 6,311,954 on chromosome 6, according to the Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- the modified terpene content is modified guaiol content.
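- For record keeping, the single-position markers named above can be grouped by terpene trait. The Python sketch below is illustrative only. It assumes that each "the modified terpene content is ..." statement refers to the marker statements immediately preceding it, which is an inference from the ordering of this text rather than an express statement. The names `LEAD_MARKERS` and `traits_for_marker` are hypothetical.

```python
# Single-position markers grouped by trait, keyed by trait name.
# ASSUMPTION: each trait statement applies to the marker statements
# that immediately precede it in the text above.
LEAD_MARKERS = {
    "total monoterpenes absent beta-myrcene": [("5", 3081773)],
    "alpha-pinene": [("5", 2534579)],
    "beta-pinene": [("5", 1366137)],
    "alpha-terpinene, gamma-terpinene, and terpinolene": [("5", 2698301)],
    "beta-myrcene to total monoterpene ratio": [("5", 1929134)],
    "beta-ocimene": [("5", 2038965)],
    "camphene and D-limonene": [("5", 2288919), ("5", 2774108)],
    "linalool and trans-nerolidol": [("1", 10633191)],
    "alpha-humulene and beta-caryophyllene": [("6", 5175087)],
    "guaiol": [("6", 6311954)],
}


def traits_for_marker(chromosome, position):
    """Return the sorted trait names whose marker list includes the given locus."""
    locus = (chromosome, position)
    return sorted(trait for trait, loci in LEAD_MARKERS.items() if locus in loci)


print(traits_for_marker("5", 2774108))
```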
- the genetic markers described herein can also be identified based on corresponding SEQ ID NOs disclosed herein, rather than a particular chromosomal location relative to the Abacus Cannabis reference genome. Corresponding SEQ ID NOs are provided in Tables 1-13 and 15. Thus, in some aspects, the one or more genetic markers comprise a polymorphism at position 51 of one or more of SEQ ID NOs: 1-99. In some aspects, detecting one or more markers that indicate modified terpene content includes detecting one or more of the following: Chromosome 1:
- Chromosome 9 (cs) a C/C or G/C genotype at position 51 of SEQ ID NO: 97;
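- As a non-limiting illustration of reading a polymorphism at position 51 of a marker sequence, the Python sketch below uses 1-based positions, as sequence listings do. The function names `allele_at` and `genotype_at` are hypothetical, and the example sequences are invented placeholders of 101 nucleotides, not any SEQ ID NO of this disclosure.

```python
def allele_at(sequence, position=51):
    """Return the base at a 1-based position of a marker sequence."""
    if not 1 <= position <= len(sequence):
        raise ValueError("position lies outside the sequence")
    return sequence[position - 1].upper()


def genotype_at(copy_a, copy_b, position=51):
    """Combine the alleles from two sequence copies into an unphased genotype string."""
    alleles = sorted([allele_at(copy_a, position), allele_at(copy_b, position)])
    return "/".join(alleles)


# Invented placeholder sequences: 50 flanking bases, the variable base, 50 flanking bases.
flank = "A" * 50
copy_a = flank + "C" + flank
copy_b = flank + "G" + flank
print(genotype_at(copy_a, copy_b))  # prints C/G
```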
- Methods of analyzing/detecting genetic markers that are suitable for use in the methods disclosed herein have been described, and can include amplification of a target polynucleotide (e.g., by PCR).
- PCR uses a particular amplification primer pair that specifically hybridize to a target polynucleotide and produce an amplification product (the amplicon).
- Primers can be designed such that the amplicon can contain a nucleic acid polymorphism of interest.
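- Whether a candidate amplicon contains a polymorphism of interest can be checked from its coordinates. The Python sketch below is illustrative only: the `Amplicon` type, the `contains_snp` function, and the example coordinates are hypothetical, and positions are assumed to be 1-based and inclusive on the same reference genome as the marker.

```python
from typing import NamedTuple


class Amplicon(NamedTuple):
    chromosome: str
    start: int  # 1-based position of the first amplified base
    end: int    # 1-based position of the last amplified base (inclusive)

    @property
    def length(self):
        """Amplicon length in nucleotides."""
        return self.end - self.start + 1


def contains_snp(amplicon, chromosome, position):
    """True if the locus lies within the amplified interval."""
    return amplicon.chromosome == chromosome and amplicon.start <= position <= amplicon.end


# Hypothetical amplicon placed around position 3,081,773 on chromosome 5.
amp = Amplicon("5", 3081700, 3081850)
print(amp.length, contains_snp(amp, "5", 3081773))  # prints 151 True
```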
- the primers can be radiolabeled, or labeled by any suitable means (e.g., using a non-radioactive fluorescent tag), to allow for rapid visualization of the different size amplicons following an amplification reaction without any additional labeling step or visualization step.
- nucleic acid amplification methods include, but are not limited to, reverse-transcription PCR (RT-PCR), quantitative real-time PCR (qPCR), quantitative real-time reverse transcriptase PCR (qRT-PCR) (see, e.g., Adams, A beginner’s guide to RT-PCR, qPCR and RT-qPCR, Biochemist (Lond) (2020) 42(3): 48-53), isothermal amplification methods (see, e.g., Zanoli et al., Biosensors (2013) 3(1): 18-43), nucleic acid sequence-based amplification (NASBA) (see, e.g., Deiman and Sillekens, Mol Biotechnol (2002) 20(2): 163-79), loop-mediated isothermal amplification (LAMP) (see, e.g., Notomi et al., (2000) Nucleic Acids Res.
- helicase-dependent amplification (HDA)
- rolling circle amplification (RCA)
- multiple displacement amplification (MDA)
- recombinase polymerase amplification (RPA)
- ligase chain reaction (LCR)
- transcription amplification see e.g., Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86: 1173
- self-sustained sequence replication see e.g., Guatelli et al. (1990) Proc. Natl. Acad. Sci.
- amplification produces an amplicon that is at least 20 nucleotides in length, for example, at least 50 nucleotides in length, at least 100 nucleotides in length, at least 200 nucleotides in length, at least 300, at least 400, at least 500, at least 600, at least 700, at least 800, at least 900, at least 1000, at least 1500, at least 2000, or at least 2500 nucleotides in length.
- the amplicon is no longer than 10000 nucleotides in length, for example, no longer than 3000, no longer than 5000, no longer than 7000, or no longer than 9000 nucleotides.
- marker amplification produces an amplicon that is 20 to 10000 nucleotides in length, for example, 20 to 9000 nucleotides, 20 to 8000 nucleotides, 20 to 7000 nucleotides, 20 to 6000 nucleotides, 20 to 5000 nucleotides, 20 to 4000 nucleotides, 20 to 3000 nucleotides, 20 to 2000 nucleotides, 20 to 1500 nucleotides, 20 to 1000 nucleotides, 20 to 500 nucleotides, 20 to 400 nucleotides, 20 to 300 nucleotides, 20 to 200 nucleotides, 20 to 150 nucleotides, 20 to 100 nucleotides, 20 to 50 nucleotides, 50 to 9000 nucleotides, 50 to 8000 nucleotides, 50 to 7000 nucleotides, 50 to 6000 nucleotides, 50 to 5000 nucleotides, 50 to 4000 nucleotides, 50 to 3000 nucleot
- the amplicon is 100 to 4000 nucleotides. In some aspects, the amplicon is 200 to 3000 nucleotides. In some aspects, the amplicon is at least 51 nucleotides. In some aspects, the amplicon is at least
- the presence of a nucleic acid polymorphism in an amplicon can be determined (detected), for example, by directly sequencing the amplicon, performing a restriction enzyme digest (e.g., restriction fragment length polymorphism (RFLP)), or by using a detection probe.
- detection includes using PCR, quantitative PCR (qPCR), reverse-transcription PCR (RT-PCR), quantitative real-time reverse transcriptase PCR (qRT-PCR), and/or sequencing methods.
- detection includes using PCR, quantitative PCR (qPCR), and/or sequencing based detection methods.
- PCR detection and quantification using dual-labeled fluorogenic oligonucleotide probes can also be performed according to the present disclosure.
- These probes are composed of short (e.g., 20-25 base) oligodeoxynucleotides that are labeled with two different fluorescent dyes. On the 5' terminus of each probe is a reporter dye, and on the 3' terminus of each probe a quenching dye is found.
- the oligonucleotide probe sequence is complementary to an internal target sequence present in a PCR amplicon. When the probe is intact, energy transfer occurs between the two fluorophores and emission from the reporter is quenched by the quencher by FRET.
- the probe is cleaved by 5' nuclease activity of the polymerase used in the reaction, thereby releasing the reporter from the oligonucleotide-quencher and producing an increase in reporter emission intensity.
- TaqMan™ probes are oligonucleotides that have a label and a quencher, where the label is released during amplification by the exonuclease action of the polymerase used in amplification, providing a real time measure of amplification during synthesis.
- a variety of TaqMan™ reagents are commercially available, e.g., from Applied Biosystems as well as from a variety of specialty vendors such as Biosearch Technologies.
- detecting a nucleic acid polymorphism includes use of an oligonucleotide primer or probe.
- synthetic methods for making oligonucleotides, including probes or primers are known.
- oligonucleotides can be synthesized chemically according to the solid phase phosphoramidite triester method.
- Oligonucleotides, including modified oligonucleotides can also be ordered from a variety of commercial sources.
- Nucleic acid probes to the marker loci can be cloned and/or synthesized. Any suitable label can be used with a probe.
- Detectable labels suitable for use with nucleic acid probes include, for example, any composition detectable by spectroscopic, radioisotopic, photochemical, biochemical, immunochemical, electrical, optical, or chemical means.
- Useful labels include biotin for staining with labeled streptavidin conjugate, magnetic beads, fluorescent dyes, radio labels, enzymes, and colorimetric labels.
- Other labels include ligands which bind to antibodies labeled with fluorophores, chemiluminescent agents, and enzymes.
- a probe can also constitute radiolabeled PCR primers that are used to generate a radiolabeled amplicon. It is not intended that the nucleic acid probes be limited to any particular size; however, nucleic acid probes are typically 20-100 base pairs.
- Amplification is not always required for detection of a nucleic acid polymorphism (e.g. Southern blotting or RFLP detection).
- Separate detection probes can also be omitted in amplification/detection methods, e.g., by performing a real time amplification reaction that detects product formation by modification of the relevant amplification primer upon incorporation into a product, incorporation of labeled nucleotides into an amplicon, or by monitoring changes in molecular rotation properties of amplicons as compared to unamplified precursors (e.g., by fluorescence polarization).
- the nucleic acid polymorphism is detected by sequencing a nucleic acid fragment comprising a target sequence of interest, or by whole genome sequencing (or whole transcriptome sequencing).
- suitable sequencing methods include capillary electrophoresis (e.g., Sanger sequencing) and high-throughput sequencing (e.g., Illumina® or 454 Sequencing®). High-throughput sequencing includes short read or long read techniques.
- sequencing includes whole genome sequencing (e.g., sequencing the genome of a Cannabis plant of interest).
- sequencing includes targeted sequencing (sequencing of a particular nucleic acid or amplicon of interest).
- sequencing includes sequencing a transcriptome (RNA-Seq) (e.g., sequencing the transcriptome of a Cannabis plant selected or produced by a method disclosed herein). In some implementations, sequencing does not include sequencing of RNA. In some implementations, the genome is sequenced.
- the methods disclosed herein include a step wherein a Cannabis plant including one or more markers that indicate modified terpene content as disclosed herein is identified and/or selected.
- the Cannabis plant including one or more markers that indicate modified terpene content is selected for further analysis, propagation, crossing, or to make a product (e.g., a kief, hashish, bubble hash, an edible product, solvent reduced oil, sludge, e-juice, or tincture).
- the product is not, or excludes, any naturally occurring products.
- the methods disclosed herein include a step wherein a Cannabis plant identified as including one or more markers that indicate modified terpene content is crossed (e.g., selfed, sibling crossed, outcrossed, or backcrossed). In some aspects, crossing includes marker-assisted selection (MAS) for at least two generations. In some aspects, progeny plants comprising the one or more markers are obtained from the cross.
- progeny plants including the one or more markers have modified terpene content (e.g., increased terpene content) relative to a control, for example (and without limitation), a sibling progeny plant that does not include the one or more markers indicating modified terpene content, or a parent plant that does not include the one or more markers indicating modified terpene content.
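- One selection step of marker-assisted selection can be illustrated as a filter over genotyped progeny. The Python sketch below is a minimal example, not a definitive implementation: the function `select_progeny`, the plant identifiers, and the genotype calls are hypothetical, and allele order within a genotype is assumed to be irrelevant. Repeating the step on the offspring of the selected plants would correspond to selection over successive generations.

```python
def normalize(genotype):
    """Sort the two alleles so 'T/C' and 'C/T' compare equal (assumes unphased calls)."""
    return "/".join(sorted(genotype.upper().split("/")))


def select_progeny(progeny_calls, required):
    """Return sorted IDs of progeny carrying a listed genotype at every required locus."""
    selected = []
    for plant_id, calls in progeny_calls.items():
        carries_all = all(
            locus in calls
            and normalize(calls[locus]) in {normalize(g) for g in allowed}
            for locus, allowed in required.items()
        )
        if carries_all:
            selected.append(plant_id)
    return sorted(selected)


# Marker taken from the text: T/T or C/T at position 3,081,773 on chromosome 5.
locus = ("5", 3081773)
required = {locus: {"T/T", "C/T"}}

# Hypothetical progeny genotype calls (not data from the disclosure).
progeny = {
    "P1": {locus: "C/T"},
    "P2": {locus: "C/C"},
    "P3": {locus: "T/T"},
    "P4": {},  # no call at the locus, so the plant is not selected
}
print(select_progeny(progeny, required))  # prints ['P1', 'P3']
```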
- Cannabis plants identified, selected, or produced by a method disclosed herein are encompassed by this disclosure, as well as material derived from such plants (e.g., a plant part), including seed, tissue, or cells (including protoplasts); and progeny of the plant (e.g., F1-F7, for example, F1 and/or F2 progeny).
- the Cannabis plant is Cannabis sativa, Cannabis indica, or Cannabis ruderalis.
- the Cannabis plant is Cannabis sativa.
- the plant includes one or more genetic markers indicating increased terpene content disclosed herein.
- plants disclosed herein including plants identified, selected, or produced by a method disclosed herein, can be used for plant breeding (e.g., crossing).
- a plant disclosed herein is used to develop a new, unique, and superior variety or hybrid with a desired phenotype (e.g., increased terpene production/content).
- Pedigree breeding and recurrent selection breeding methods may be used to develop cultivars from breeding populations. Breeding programs may combine desirable traits from two or more varieties or various broad-based sources into breeding pools from which cultivars are developed by selfing and selection of desired phenotypes. The new cultivars may be crossed with other varieties and the hybrids from these crosses are evaluated to determine which have commercial potential.
- a plant identified, selected, or produced by a method disclosed herein is crossed.
- Exemplary types of crosses include selfing, sibling crossing, outcrossing, and backcrossing. Suitable methods of crossing are disclosed herein.
- Pedigree selection, where both single plant selection and mass selection practices are employed, may be used for the generation of new varieties.
- Pedigree breeding is used commonly for the improvement of self-pollinating crops or inbred lines of cross-pollinating crops. Two parents which possess favorable, complementary traits are crossed to produce an F1. An F2 population is produced by selfing one or several F1s or by intercrossing two F1s (sib mating). Selection of the best individuals usually begins in the F2 population; then, beginning in the F3, the best individuals in the best families are usually selected. Replicated testing of families, or hybrid combinations involving individuals of these families, often follows in the F4 generation to improve the effectiveness of selection for traits with low heritability. At an advanced stage of inbreeding (e.g., F6 and F7), the best lines or mixtures of phenotypically similar lines are tested for potential release as new cultivars.
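- The reason F6 and F7 count as an advanced stage of inbreeding can be shown with a short calculation. Under strict selfing without selection, the expected heterozygosity at a locus that is heterozygous in the F1 halves each generation. The Python sketch below assumes that simple model. The function name `expected_heterozygosity` is hypothetical, and the sib-mating route mentioned above is not modeled.

```python
from fractions import Fraction


def expected_heterozygosity(generation):
    """Expected heterozygous fraction in generation F_n for loci heterozygous in the F1.

    Assumes every generation after the F1 is produced by selfing, with no selection.
    """
    if generation < 1:
        raise ValueError("generation must be 1 (F1) or later")
    return Fraction(1, 2) ** (generation - 1)


for n in (2, 6, 7):
    het = expected_heterozygosity(n)
    print(f"F{n}: heterozygous {het}, homozygous {float(1 - het):.4f}")
```

- Under these assumptions the expected homozygous fraction is 31/32 in the F6 and 63/64 in the F7.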
- Choice of breeding or selection methods depends on the mode of plant reproduction, the heritability of the trait(s) being improved, and the type of cultivar used commercially (e.g., F1 hybrid cultivar, pureline cultivar, etc.). For highly heritable traits, a choice of superior individual plants evaluated at a single location will be effective, whereas for traits with low heritability, selection should be based on mean values obtained from replicated evaluations of families of related plants.
- Popular selection methods commonly include pedigree selection, modified pedigree selection, mass selection, and recurrent selection.
- Mass and recurrent selections can be used to improve populations of either self- or cross-pollinating crops.
- a genetically variable population of heterozygous individuals may be identified or created by intercrossing several different parents. The best plants may be selected based on individual superiority, outstanding progeny, or excellent combining ability. Preferably, the selected plants are intercrossed to produce a new population in which further cycles of selection are continued.
- Backcross breeding has been used to transfer genes for a simply inherited, highly heritable trait into a desirable homozygous cultivar or line that is the recurrent parent.
- the source of the trait to be transferred is called the donor parent.
- individuals possessing the phenotype of the donor parent may be selected and repeatedly crossed (backcrossed) to the recurrent parent.
- the resulting plant is expected to have the attributes of the recurrent parent (e.g., cultivar) and the desirable trait transferred from the donor parent.
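- The expectation that a backcrossed plant carries the attributes of the recurrent parent can be quantified under a simple model. Without selection and ignoring linkage, the expected fraction of the genome derived from the recurrent parent is 1/2 in the F1 and moves halfway toward 1 with each backcross. The Python sketch below assumes that model, and the function name `recurrent_parent_fraction` is hypothetical.

```python
from fractions import Fraction


def recurrent_parent_fraction(backcrosses):
    """Expected recurrent-parent genome fraction after n backcrosses (n = 0 is the F1).

    Assumes no selection and no linkage; real plants vary around this average.
    """
    if backcrosses < 0:
        raise ValueError("backcrosses must be non-negative")
    return 1 - Fraction(1, 2) ** (backcrosses + 1)


for n in range(5):
    print(f"after {n} backcross(es): {recurrent_parent_fraction(n)}")
```

- Marker-assisted selection can be used alongside backcrossing to retain the donor trait while this fraction increases.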
- a single-seed descent procedure refers to planting a segregating population, harvesting a sample of one seed per plant, and using the one-seed sample to plant the next generation.
- the plants from which lines are derived will each trace to different F2 individuals.
- the number of plants in a population declines each generation due to failure of some seeds to germinate or some plants to produce at least one seed. As a result, not all of the F2 plants originally sampled in the population will be represented by a progeny when generation advance is completed.
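- The decline in population size during single-seed descent can be estimated with a simple expectation. The Python sketch below assumes a constant, independent probability that each line yields a surviving plant in the next generation. That probability, the function name `expected_lines_remaining`, and the example numbers are hypothetical.

```python
def expected_lines_remaining(initial_f2_plants, survival_rate, generations):
    """Expected number of F2-derived lines still represented after generation advance.

    Assumes each line independently survives one generation with probability survival_rate.
    """
    if not 0.0 <= survival_rate <= 1.0:
        raise ValueError("survival_rate must be between 0 and 1")
    if generations < 0:
        raise ValueError("generations must be non-negative")
    return initial_f2_plants * survival_rate ** generations


# Hypothetical example: 200 F2 plants advanced four generations (F2 to F6) at 95% survival.
print(round(expected_lines_remaining(200, 0.95, 4), 1))
```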
- Mutation breeding is another method of introducing new traits into Cannabis varieties. Mutations that occur spontaneously or are artificially induced can be useful sources of variability for a plant breeder. The goal of artificial mutagenesis is to increase the rate of mutation for a desired characteristic. Mutation rates can be increased by many different means including temperature, long-term seed storage, tissue culture conditions, radiation (such as X-rays, Gamma rays, neutrons, Beta radiation, or ultraviolet radiation), chemical mutagens (such as base analogs like 5-bromo-uracil), antibiotics, alkylating agents (such as sulfur mustards, nitrogen mustards, epoxides, ethyleneamines, sulfates, sulfonates, sulfones, or lactones), azide, hydroxylamine, nitrous acid or acridines. Once a desired trait is observed through mutagenesis the trait may then be incorporated into existing germplasm by traditional breeding techniques. Details of mutation breeding can be found, for example, in Principles of Cultivar
- breeding method may be used to transfer one or a few favorable genes for a highly heritable trait into a desirable cultivar. This approach has been used extensively for breeding disease-resistant cultivars.
- Various recurrent selection techniques are used to improve quantitatively inherited traits controlled by numerous genes. The use of recurrent selection in self-pollinating crops depends on the ease of pollination, the frequency of successful hybrids from each pollination, and the number of hybrid offspring from each successful cross.
- Double haploids are produced by the doubling of a set of chromosomes from a heterozygous plant to produce a completely homozygous individual (e.g., see Wan et al., Theor. Appl. Genet., 77:889-892, 1989).
- MAS is a powerful shortcut to selecting for desired phenotypes and for introgressing desired traits into cultivars (e.g., introgressing desired traits into elite lines).
- MAS is easily adapted to high throughput molecular analysis methods that can quickly screen large numbers of plant or germplasm genetic material for the markers of interest and is much more cost effective than raising and observing plants for visible traits.
- MAS can be used in the methods disclosed herein to produce plants with desired traits (e.g., increased terpene content).
- Cannabis plants that have modified terpene content (e.g., increased or decreased terpene levels relative to a control) made by any of the methods disclosed herein.
- Material derived from the Cannabis plants, including seed, tissue, or cells (including protoplasts); or progeny of the plant, such as F1 or F2 progeny, are encompassed by this disclosure.
- the Cannabis plant can be Cannabis sativa, Cannabis indica, or Cannabis ruderalis.
- the Cannabis plant is Cannabis sativa.
- the product may be any product known in the Cannabis arts, and can include, but is not limited to, extracts, a kief, hashish, bubble hash, an edible product, a flower, a seed, solvent reduced oil, sludge, e-juice, or tincture.
- Kief refers to a composition of concentrated Cannabis trichomes, which are accumulated by being sifted from Cannabis flowers or buds using a mesh screen or sieve.
- Hashish or hash refers to a compressed or purified preparation from Cannabis tissue containing trichomes (e.g., flowers).
- Bubble hash refers to a solid concentration of Cannabis trichomes made from a solventless extraction method.
- Cannabis sludges are solvent-free Cannabis extracts made via multigas extraction including the refrigerant 134A, butane, iso-butane and propane in a ratio that delivers a very complete and balanced extraction of cannabinoids and essential oils.
- E-juice vape juice
- a tincture refers to an alcohol-based extract, for example, an extract of Cannabis tissue dissolved in an alcohol.
- the product can be formulated for administration to a subject (e.g., a human), such as by an injection (e.g., intravenous, subcutaneous, intramuscular, parenteral), or by topical, oral, or pulmonary administration.
- the product is a recreational product.
- the product is a therapeutic product (e.g., medicament).
- the composition is for pulmonary administration.
- the compositions include, but are not limited to, dry powder compositions consisting of the powder of a Cannabis oil described herein, and the powder of a suitable carrier and/or lubricant.
- the compositions for pulmonary administration can be inhaled from any suitable dry powder inhaler device.
- compositions may be conveniently delivered in the form of an aerosol spray from pressurized packs or a nebulizer, with the use of a suitable propellant, for example, dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide, or other suitable gas.
- the dosage unit can be determined by providing a valve to deliver a metered amount.
- Capsules and cartridges of, for example, gelatin for use in an inhaler or insufflator can be formulated containing a powder mix of the compound(s) and a suitable powder base, for example, lactose or starch.
- a composition can take the form of, e.g., a tablet or a capsule prepared by conventional means with a pharmaceutically acceptable excipient.
- binders e.g., magnesium aluminum silicate, starch paste, gelatin, tragacanth, methylcellulose, sodium carboxymethylcellulose, polyvinylpyrrolidone and
- Tablets can be either uncoated or coated according to known methods.
- the excipients described herein can also be used for preparation of buccal dosage forms and sublingual dosage forms (e.g., films and lozenges) as described, for example, in U.S. Pat. Nos. 5,981,552 and 8,475,832.
- Formulation in chewing gums as described, for example, in U.S. Pat. No. 8,722,022, is also contemplated.
- Liquid preparations for oral administration can be prepared by conventional means with pharmaceutically acceptable additives, for example, suspending agents, for example, sorbitol syrup, cellulose derivatives, or hydrogenated edible fats; emulsifying agents, for example, lecithin, xanthan gum, or acacia; non-aqueous vehicles, for example, almond oil, sesame oil, hemp seed oil, fish oil, oily esters, ethyl alcohol, or fractionated vegetable oils; and preservatives, for example, methyl or propyl-p-hydroxybenzoates or sorbic acid.
- the preparations can also contain buffer salts, flavoring, coloring, and/or sweetening agents as appropriate.
- Typical formulations for topical administration include creams, ointments, sprays, lotions, hydrocolloid dressings, and patches, as well as eye drops, ear drops, and deodorants.
- Cannabis extracts/oils can be administered via transdermal patches as described, for example, in U.S. Pat. Appl. Pub. No. 2015/0126595 and
- Cannabis oils can be formulated, for example, as suppositories containing conventional suppository bases such as cocoa butter and other glycerides as described in U.S. Pat. Nos. 5,508,037 and 4,933,363.
- Compositions can contain other solidifying agents such as shea butter, beeswax, kokum butter, mango butter, illipe butter, tamanu butter, carnauba wax, emulsifying wax, soy wax, castor wax, rice bran wax, and candelilla wax.
- Compositions can further include clays (e.g., bentonite, French green clays, Fuller's earth, Rhassoul clay, white kaolin clay) and salts (e.g., sea salt, Himalayan pink salt, and magnesium salts such as Epsom salt).
- compositions disclosed herein can be formulated for administration by injection, for example, by bolus injection or continuous infusion.
- Formulations for injection can be presented in unit dosage form, for example, in ampoules or in multi-dose containers, optionally with an added preservative.
- Injectable compositions are preferably aqueous isotonic solutions or suspensions, and suppositories are preferably prepared from fatty emulsions or suspensions.
- the compositions may be sterilized and/or contain adjuvants, such as preserving, stabilizing, wetting or emulsifying agents, solution promoters, salts for regulating the osmotic pressure, buffers, and/or other ingredients.
- the compositions can be in powder form for reconstitution with a suitable vehicle, for example, a carrier oil, before use.
- the compositions may also contain other therapeutic agents or substances.
- compositions can be prepared according to conventional mixing, granulating, and/or coating methods, and contain from about 0.1 to about 75%, for example from about 1% to about 50%, of a Cannabis extract.
- subjects receiving a Cannabis composition orally are administered doses ranging from about 1 to about 2000 mg of Cannabis extract.
- a small dose ranging from about 1 to about 20 mg can typically be administered orally when treatment is initiated, and the dose can be increased (e.g., doubled) over a period of days or weeks until the optimal or maximum dose is reached.
- kits for use in research, breeding, or other application may include oligonucleotide probes and/or primers to detect a genetic marker disclosed herein (e.g., any probes or primers disclosed herein).
- the kit includes seed or germplasm of a Cannabis plant.
- the kit includes DNA from a Cannabis plant, for example, that is useful as a positive or negative control.
- the kits include enzymes (e.g., polymerase), dNTPs, enzymatic substrates, reagents for colorimetric or fluorescent detection, buffers, etc.
- the kit components are in separate containers. The kit can be used to practice any of the methods disclosed herein.
- the kit is for detecting a genetic marker (e.g., SNP) or set of genetic markers disclosed herein.
- the kits may include instructional materials containing directions (i.e., protocols) for the practice of the methods of this disclosure. While the instructional materials typically include written or printed materials, they are not limited to such. Any medium capable of storing such instructions and communicating them to an end user is contemplated. Such media include, but are not limited to, electronic storage media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), cloud-based media, and the like. Such media may include addresses to internet sites that provide such instructional materials.
- a method for producing one or more Cannabis plants having modified terpene content comprising: (i) analyzing one or more genetic markers in a nucleic acid sample from a Cannabis plant or its germplasm; (ii) detecting one or more genetic markers that indicate modified terpene content, (iii) crossing the Cannabis plant comprising the one or more genetic markers indicating modified terpene content, and (iv) obtaining one or more progeny plants comprising the one or more genetic markers indicating modified terpene content, and wherein the one or more progeny plants have modified terpene content relative to a control.
- a method for selecting a Cannabis plant having modified terpene content comprising: (i) analyzing one or more genetic markers in a nucleic acid sample from the Cannabis plant or its germplasm; (ii) detecting one or more genetic markers that indicate modified terpene content; and (iii) selecting the Cannabis plant, thereby selecting the Cannabis plant having modified terpene content.
- Clause 4 The method of any one of the prior clauses, further comprising crossing the Cannabis plant having modified terpene content and producing one or more progeny plants having modified terpene content.
- Clause 7 The method of any one of the prior clauses, wherein the selfing, sibling crossing, outcrossing, or backcrossing comprises marker-assisted selection for at least two generations.
- analyzing one or more genetic markers in the nucleic acid sample comprises analyzing one or more of nucleotide positions: 8,871,401; 8,886,933; 9,101,934; 10,446,475; 10,543,062; 10,561,778; 10,633,191; 10,934,458; 11,169,492; or 13,920,896 on chromosome 1; 93,291,929 on chromosome 2; 47,140,085 on chromosome 3; 72,717,623 on chromosome 4; 330,918; 516,340; 518,238; 523,626; 608,718; 755,967; 1,100,981; 1,109,162; 1,331,433; 1,353,878; 1,366,137; 1,386,965; 1,487,633; 1,745,101; 1,828,050; 1,837,343; 1,840,325; 1,929,
- Clause 16 The method of any one of the prior clauses, wherein the genetic markers that indicate modified terpene content comprise a polymorphism at position 51 of one or more of: SEQ ID NOs: 1-99.
- SEQ ID NO: 48, SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 76, SEQ ID NO: 77, SEQ ID NO: 78, SEQ ID NO: 85, SEQ ID NO: 93, SEQ ID NO: 94, and/or SEQ ID NO: 95;
- SEQ ID NO: 14, SEQ ID NO: 23, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 31, SEQ ID NO: 34, SEQ ID NO: 41, SEQ ID NO: 50, SEQ ID NO: 54, and/or SEQ ID NO: 55;
- SEQ ID NO: 28, SEQ ID NO: 33, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 71, and/or SEQ ID NO: 72;
- SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, and/or SEQ ID NO: 10;
- reference genome is Abacus Cannabis reference genome version Csat_AbacusV2, NCBI assembly accession GCA_025232715.1.
- Clause 21 The method of any one of the prior clauses, wherein the one or more genetic markers are genetically linked to a terpene trait locus.
- the modified terpene content comprises modified total terpenes, total monoterpenes, beta-myrcene, total sesquiterpenes, alpha-pinene, beta-pinene, alpha-terpinene, gamma-terpinene, terpinolene, beta-ocimene, camphene, D-limonene, linalool, trans-nerolidol, alpha-humulene, beta-caryophyllene, and/or guaiol levels.
- Clause 23 The method of any one of the prior clauses, wherein the modified terpene content comprises increased terpene content relative to the control.
- Clause 24 The method of any one of the prior clauses, wherein the modified terpene content comprises increased total terpenes absent beta-myrcene.
- Clause 25 The method of any one of the prior clauses, wherein the modified terpene content comprises increased alpha-pinene.
- Clause 26 The method of any one of the prior clauses, wherein the modified terpene content comprises increased beta-pinene.
- Clause 27 The method of any one of the prior clauses, wherein the modified terpene content comprises increased alpha-terpinene, gamma-terpinene, and/or terpinolene.
- Clause 28 The method of any one of the prior clauses, wherein the modified terpene content comprises increased beta-ocimene.
- Clause 30 The method of any one of the prior clauses, wherein the modified terpene content comprises increased linalool and/or trans-nerolidol.
- Clause 31 The method of any one of the prior clauses, wherein the modified terpene content comprises increased guaiol.
- Clause 32 The method of any one of the prior clauses, wherein the modified terpene content comprises increased beta-myrcene.
- Clause 33 The method of any one of the prior clauses, wherein the one or more genetic markers that indicate modified terpene content indicate increased terpene content relative to a control.
- Clause 34 The method of any one of the prior clauses, wherein the control is a Cannabis plant without the one or more markers that indicate modified terpene content.
- Clause 36 The method of any one of the prior clauses, wherein the modified terpene content is increased terpene content relative to the control.
- Clause 38 A seed, plant part, tissue culture, or protoplast of the plant of clause 37.
- Clause 39 A method of Cannabis breeding, comprising crossing the Cannabis plant of clause 37.
- GC gas chromatography
- terpenes were measured using dried flower tissue of up to three clonal replicates per accession; the average across clonal replicates was used for validation.
- data were collected for 22 terpenes: 15 monoterpenes (alpha-pinene, alpha-terpinene, beta-myrcene, beta-ocimene, beta-pinene, camphene, delta-3-carene, D-limonene, eucalyptol, isopulegol, linalool, p-cymene, and terpinolene) and seven sesquiterpenes (alpha-Bisabolol, alpha-humulene, beta-caryophyllene, caryophyllene oxide, guaiol, cis-nerolidol (also known as nerolidol 1), and trans-nerolido
- mapping and validation were done for combinations of terpenes with highly correlated levels: 1. alpha-terpinene, gamma-terpinene, and terpinolene; 2. camphene and D-limonene; 3. alpha-humulene and beta-caryophyllene; 4.
- mapping set and both validation sets were genotyped with an Illumina bead array. After initial SNP quality control (QC), further filtering steps were performed to filter out known low quality SNPs, followed by filtering for missing data (<10%) and minor allele frequency (>1%) using vcftools (Danecek et al., "The variant call format and VCFtools," Bioinformatics 27.15 (2011): 2156-2158). Missing data were subsequently imputed in the mapping set (R package NAM “snpQC” option; Xavier et al., “NAM: association studies in multiple populations,” Bioinformatics 31.23 (2015): 3862-3864), resulting in 36,073 SNPs for the mapping set.
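The two numeric filters named above (missing data below 10% and minor allele frequency above 1%) can be illustrated with a short sketch. This is not the vcftools implementation; the genotype coding (alternate-allele counts 0/1/2, `None` for a missing call), the function name `filter_snps`, and the toy matrix are assumptions made purely for illustration.

```python
def filter_snps(genotypes, max_missing=0.10, min_maf=0.01):
    """Return the column indices of SNPs that pass both filters.

    genotypes: list of rows (one per accession); each row lists the
    alternate-allele count (0, 1, or 2) per SNP, or None for a missing
    call. This coding is an assumption of the sketch.
    """
    n_samples = len(genotypes)
    n_snps = len(genotypes[0])
    kept = []
    for j in range(n_snps):
        calls = [row[j] for row in genotypes if row[j] is not None]
        missing_rate = 1.0 - len(calls) / n_samples
        # Drop SNPs with no calls or with 10% or more missing data.
        if not calls or missing_rate >= max_missing:
            continue
        # Diploid plants carry two alleles per called genotype.
        alt_freq = sum(calls) / (2.0 * len(calls))
        maf = min(alt_freq, 1.0 - alt_freq)
        if maf > min_maf:
            kept.append(j)
    return kept


# Hypothetical toy data: 4 accessions x 4 SNPs.
toy_genotypes = [
    [0, 0, 1, None],
    [1, 0, 2, None],
    [2, 0, 0, 0],
    [0, 0, 1, 1],
]
kept_snps = filter_snps(toy_genotypes)
```

In the toy data, the second SNP is monomorphic and the fourth has half of its calls missing, so only the first and third columns are retained.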
- nested association mapping was performed on terpene data collected for the mapping set of 900 diversity panel accessions with the R package NAM using seed lots as family structure and a kinship matrix to control for relatedness (GWAS2 function).
- the homozygous genotype with the highest average terpene trait value is referred to as the beneficial genotype.
- the homozygous genotype with the lowest average terpene trait value is referred to as the detrimental genotype.
- the heterozygous genotype is considered beneficial in addition to a homozygous genotype if the heterozygous genotype has either an average terpene trait value intermediate between the homozygous reference allele and homozygous alternate allele genotype average trait values or has an average terpene trait value similar to the beneficial homozygous genotype.
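The three genotype definitions above can be expressed as a small classification routine. The sketch below is illustrative only: the genotype labels (`"ref"`, `"het"`, `"alt"`), the function names, and in particular the `similar_tol` tolerance used to decide what counts as "similar" are assumptions, since the text gives no numeric cutoff.

```python
from statistics import mean


def genotype_means(genotypes, trait_values):
    """Average trait value per genotype class ('ref', 'het', 'alt')."""
    groups = {}
    for genotype, value in zip(genotypes, trait_values):
        groups.setdefault(genotype, []).append(value)
    return {genotype: mean(values) for genotype, values in groups.items()}


def classify_genotypes(means, similar_tol=0.05):
    """Label the homozygous genotypes and decide whether 'het' is beneficial.

    Assumes both homozygous classes were observed and that their means
    differ. similar_tol is a hypothetical relative tolerance.
    """
    beneficial = max(("ref", "alt"), key=lambda g: means[g])
    detrimental = min(("ref", "alt"), key=lambda g: means[g])
    het_beneficial = False
    if "het" in means:
        low, high = means[detrimental], means[beneficial]
        # Heterozygote lies between the two homozygous averages ...
        intermediate = low < means["het"] < high
        # ... or is close to the beneficial homozygous average.
        similar = abs(means["het"] - high) <= similar_tol * abs(high)
        het_beneficial = intermediate or similar
    return {
        "beneficial": beneficial,
        "detrimental": detrimental,
        "het_beneficial": het_beneficial,
    }


# Hypothetical trait values (% of dry weight) for six accessions at one SNP.
calls = ["ref", "ref", "het", "het", "alt", "alt"]
traits = [1.0, 1.2, 1.6, 1.8, 2.0, 2.2]
labels = classify_genotypes(genotype_means(calls, traits))
```

Here the homozygous alternate genotype has the highest average and the heterozygote falls between the two homozygous averages, so both are treated as beneficial.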
- beneficial genotypes for the mapping set and the first validation set were compared.
- a SNP marker was considered validated in the first validation set if the beneficial genotype for the mapping set matched the beneficial genotype for the first validation set.
- the average terpene trait value across all 397 accessions in the second validation set was compared with the average terpene trait value after selecting for the beneficial and the detrimental genotypes, respectively, of all SNP markers that were validated in the first validation set for a given terpene trait.
- the combination of beneficial genotypes is considered validated if the beneficial genotypes of the SNP markers in combination result in an increased average terpene trait value as compared to the average without SNP marker selection.
- the combination of detrimental genotypes is considered validated if the detrimental genotypes of the SNP markers in combination result in a decreased average terpene trait value as compared to the average without SNP marker selection.
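The second-validation-set comparison described above amounts to comparing the mean trait value of all accessions with the mean of only those accessions that carry the selected genotype at every marker. The following is a minimal sketch under assumed data structures; the function name, dictionary layout, marker names, and values are all hypothetical.

```python
from statistics import mean


def validate_combination(accessions, target_genotypes):
    """Compare the mean trait of selected accessions with the overall mean.

    accessions: list of dicts with 'trait' (float) and 'genotypes'
        (marker name -> genotype label).
    target_genotypes: marker name -> set of genotype labels to select for.
    Returns (overall_mean, selected_mean, n_selected); selected_mean is
    None when no accession matches at every marker.
    """
    overall = mean(a["trait"] for a in accessions)
    selected = [
        a["trait"]
        for a in accessions
        if all(a["genotypes"].get(marker) in allowed
               for marker, allowed in target_genotypes.items())
    ]
    selected_mean = mean(selected) if selected else None
    return overall, selected_mean, len(selected)


# Hypothetical accessions and markers.
panel = [
    {"trait": 2.0, "genotypes": {"snp1": "alt", "snp2": "ref"}},
    {"trait": 1.0, "genotypes": {"snp1": "ref", "snp2": "alt"}},
    {"trait": 3.0, "genotypes": {"snp1": "alt", "snp2": "ref"}},
    {"trait": 2.0, "genotypes": {"snp1": "het", "snp2": "alt"}},
]
beneficial = {"snp1": {"alt", "het"}, "snp2": {"ref"}}
detrimental = {"snp1": {"ref"}, "snp2": {"alt"}}

overall, with_beneficial, n_beneficial = validate_combination(panel, beneficial)
_, with_detrimental, n_detrimental = validate_combination(panel, detrimental)
```

Under the stated criterion, the beneficial combination would count as validated when `with_beneficial > overall`, and the detrimental combination when `with_detrimental < overall`.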
- NAM of total Terpenes in the diversity panel resulted in the identification of five significant (p-value < Bonferroni threshold of 1.39E-06) SNP markers on chromosomes 2, 4, 9, and X.
- Three of these five SNPs were validated in the first validation set (Table 1; Table 15).
- the beneficial genotypes for these three SNP markers resulted in increased total Terpenes in the second validation set, therefore validating the combination of beneficial genotypes of these SNP markers (Table 14).
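The Bonferroni threshold of 1.39E-06 quoted above is consistent with dividing a family-wise significance level of 0.05 by the 36,073 SNPs in the mapping set. The significance level is an assumption here, since the text states only the resulting threshold; the function name is likewise illustrative.

```python
def bonferroni_threshold(n_tests, alpha=0.05):
    """Per-test significance threshold under a Bonferroni correction.

    alpha=0.05 is an assumed family-wise error rate; the source reports
    only the resulting threshold.
    """
    return alpha / n_tests


# 36,073 SNPs were retained in the mapping set (from the text).
threshold = bonferroni_threshold(36073)
formatted = f"{threshold:.2E}"  # "1.39E-06"


def is_significant(p_value, cutoff=threshold):
    """A SNP is called significant when its p-value is below the cutoff."""
    return p_value < cutoff
```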
- NAM of total Monoterpenes in the diversity panel resulted in the identification of seven significant SNP markers on chromosomes 2, 3, 4, 5, and 8.
- Four of these seven SNPs were validated in the first validation set (Table 2; Table 15).
- the beneficial genotypes for these four SNP markers resulted in increased total Monoterpenes in the second validation set, therefore validating the combination of beneficial genotypes of these SNP markers (Table 14).
- NAM of total Monoterpenes excluding beta-Myrcene also referred to as total Monoterpenes - beta-Myrcene
- 65 significant SNP markers located on chromosome 5; nine of the top ten SNPs were validated in the first validation set (Table 3; Table 15).
- the beneficial genotypes for these nine SNP markers resulted in increased total Monoterpenes excluding beta-Myrcene in the second validation set, therefore validating the combination of beneficial genotypes of these SNP markers (Table 14).
- NAM of total Sesquiterpenes in the diversity panel resulted in the identification of 33 significant SNP markers located on chromosomes 3, 4, 5, 6, and 8; the majority of these markers, consisting of 11 SNP markers, are located on chromosome 6. All of the top ten SNPs were validated in the first validation set (Table 4; Table 15). The combination of homozygous alternate beneficial genotypes for three of these ten SNP markers resulted in increased total Sesquiterpenes in the second validation set, therefore validating the combination of beneficial genotypes of these SNP markers (Table 14).
- NAM of alpha-Pinene in the diversity panel resulted in the identification of 2157 significant SNP markers on all ten chromosomes; the majority of these markers, consisting of 362 SNP markers, are located on chromosome 5.
- Two of the top ten SNPs were validated in the first validation set (Table 5; Table 15).
- the beneficial genotypes of these two SNP markers resulted in increased alpha-Pinene in the second validation set, therefore validating the combination of beneficial genotypes of these SNP markers (Table 14).
- NAM of beta-Pinene in the diversity panel resulted in the identification of 837 significant SNP markers on all ten chromosomes; 194 of these SNP markers are located on chromosome 5.
- Five of the top ten SNPs were validated in the first validation set (Table 6; Table 15).
- the beneficial genotypes of these five SNP markers resulted in increased beta-Pinene in the second validation set, therefore validating the combination of beneficial genotypes of these SNP markers (Table 14).
- NAM of the combination of alpha-Terpinene, gamma-Terpinene, and Terpinolene (also referred to as alpha-Terpinene + gamma-Terpinene + Terpinolene) in the diversity panel resulted in the identification of 1395 significant SNP markers on all ten chromosomes; the majority of these markers, consisting of 912 SNP markers, are located on chromosome 5. Eight of the top ten SNPs were validated in the first validation set (Table 7; Table 15).
- the beneficial genotypes of these eight SNP markers resulted in increased alpha-Terpinene + gamma-Terpinene + Terpinolene in the second validation set, therefore validating the combination of beneficial genotypes of these SNP markers (Table 14).
- NAM of beta-Myrcene to total Monoterpenes Ratio in the diversity panel resulted in the identification of 121 significant SNP markers on all ten chromosomes; the majority of these markers, consisting of 90 SNP markers, are located on chromosome 5.
- Six of the top ten SNPs were validated in the first validation set (Table 8; Table 15).
- the beneficial genotypes for these six SNP markers resulted in increased beta-Myrcene to total Monoterpenes ratio in the second validation set, therefore validating the combination of beneficial genotypes of these SNP markers (Table 14).
- NAM of beta-Ocimene in the diversity panel resulted in the identification of 864 significant SNP markers on all ten chromosomes; the majority of these markers, consisting of 259 SNP markers, are located on chromosome 5.
- Six of the top ten SNPs were validated in the first validation set (Table 9; Table 15).
- the beneficial genotypes for these six SNP markers resulted in increased beta-Ocimene in the second validation set, therefore validating the combination of beneficial genotypes of these SNP markers (Table 14).
- NAM of the combination of Camphene and D-Limonene (also referred to as Camphene + D-Limonene) in the diversity panel resulted in the identification of 209 significant SNP markers on all ten chromosomes; the majority of these markers, consisting of 103 SNP markers, are located on chromosome 5.
- Nine of the top ten SNPs were validated in the first validation set (Table 10; Table 15).
- the beneficial genotypes for these nine SNP markers resulted in increased Camphene and D-Limonene in the second validation set, therefore validating the combination of beneficial genotypes of these SNP markers (Table 14).
- NAM of the combination of linalool and trans-nerolidol (also referred to as linalool + trans-nerolidol) in the diversity panel resulted in the identification of 481 significant SNP markers on all ten chromosomes; the majority of these markers, consisting of 258, are located at the of chromosome 1. All ten of the top ten SNPs were validated in the first validation set (Table 11; Table 15). In combination, the beneficial genotypes for these ten SNP markers resulted in increased linalool and trans-nerolidol in the second validation set, therefore validating the combination of beneficial genotypes of these SNP markers (Table 14).
- NAM of the combination of alpha-Humulene and beta-Caryophyllene (also referred to as alpha-Humulene + beta-Caryophyllene) in the diversity panel resulted in the identification of 46 significant SNP markers on chromosomes 3, 4, 5, 6, and 8; the majority of these markers, consisting of 23 SNPs, are located on chromosome 6.
- NAM of Guaiol in the diversity panel resulted in the identification of 526 significant SNP markers on chromosomes 3, 4, 5, 6, 7, 8, 9, and X; the majority of these markers, consisting of 488 SNPs, are located on chromosome 6.
- Nine of the top ten SNPs were validated in the first validation set (Table 13; Table 15).
- the beneficial genotypes for these nine SNP markers resulted in increased Guaiol in the second validation set, therefore validating the combination of beneficial genotypes of these SNP markers (Table 14).
- Combinations of SNP markers disclosed herein can be useful, for example, for screening plants having increased levels of terpenes of interest. While any combination of SNPs disclosed herein could be useful, an exemplary subset of SNPs is provided in Table 16.
- at least one SNP selected from the list of SNPs in Table 16 is analyzed and/or detected in a nucleic acid sample from a Cannabis plant, indicating increased terpene levels in that plant.
- at least two SNPs selected from the list of SNPs in Table 16 are analyzed, and at least one SNP indicating increased terpene levels is detected in a nucleic acid sample from a Cannabis plant, indicating increased terpene levels in that plant.
- all of the SNPs from the list of SNPs provided in Table 16 are analyzed, and at least one SNP indicating increased terpene levels is detected in a nucleic acid sample from a Cannabis plant, indicating increased terpene levels in that plant.
- a haplotype surrounding a significantly associated SNP marker consists of the genomic region flanked by the nearest non-significant SNP on either side of the SNP marker.
- First column, SNP marker number;
- Second column, SNP marker name;
- Third column, NAM p-value;
- Fifth column, reference allele call; Sixth column, alternative allele call; Seventh column, Abacus reference genome chromosome;
- Eighth column, Abacus reference genome position (Csat_AbacusV2; NCBI assembly accession GCA_025232715.1); Ninth column, left flanking SNP of haplotype surrounding SNP marker; Tenth column, right flanking SNP of haplotype surrounding SNP marker; Eleventh column, Abacus reference genome position left flanking SNP of haplotype surrounding SNP marker; Twelfth column, Abacus reference genome position right flanking SNP of haplotype surrounding SNP marker.
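The haplotype definition used in these table legends (the region flanked by the nearest non-significant SNP on either side of a marker) can be sketched as a simple scan along one chromosome. The tuple layout, SNP names, positions, and p-values below are hypothetical; only the flanking rule itself comes from the text.

```python
def haplotype_flanks(snps, marker_name, threshold):
    """Find the nearest non-significant SNP on each side of a marker.

    snps: list of (name, position, p_value) tuples for ONE chromosome.
    A SNP is treated as non-significant when p_value >= threshold.
    Returns (left_flank, right_flank), each a (name, position) tuple,
    or None on a side where no non-significant SNP exists.
    """
    ordered = sorted(snps, key=lambda s: s[1])
    idx = next(i for i, s in enumerate(ordered) if s[0] == marker_name)

    def first_non_significant(candidates):
        for name, position, p_value in candidates:
            if p_value >= threshold:
                return (name, position)
        return None

    # Walk outward from the marker in both directions.
    left = first_non_significant(reversed(ordered[:idx]))
    right = first_non_significant(ordered[idx + 1:])
    return left, right


# Hypothetical SNPs on a single chromosome.
chromosome_snps = [
    ("s1", 100, 0.5),
    ("s2", 200, 1e-8),
    ("s3", 300, 1e-9),
    ("s4", 400, 1e-7),
    ("s5", 500, 0.2),
]
flanks = haplotype_flanks(chromosome_snps, "s3", threshold=1.39e-6)
```

In this toy example the neighbours `s2` and `s4` are themselves significant, so the haplotype around `s3` extends outward to `s1` and `s5`.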
- First column, mapped trait;
- Second column, average value (% of dry weight) for mapped terpene trait in second validation set without using markers to make selections;
- Third column, number of SNP markers used in combination to make selections;
- Fourth column average value (% of dry weight) for
- Table 15 50 bp flanking sequences with SNP marker at position 51 bp.
- First column, SNP marker number;
- Second column, SNP marker name;
- Third column, 50 bp flanking sequences with SNP marker at position 51 bp.
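Because each marker sequence carries 50 bp of flanking sequence with the SNP at position 51 (1-based), the polymorphic base can be located by simple indexing. The sketch below assumes the sequence is supplied as a plain string; the function name and the example sequence are hypothetical and are not taken from the sequence listing.

```python
def split_marker_sequence(seq, snp_position=51):
    """Split a marker sequence into (left_flank, snp_base, right_flank).

    snp_position is 1-based, matching "position 51" in the text.
    """
    index = snp_position - 1
    if len(seq) <= index:
        raise ValueError("sequence is shorter than the SNP position")
    return seq[:index], seq[index], seq[index + 1:]


# Hypothetical sequence: 50 bp flank, one SNP base, 50 bp flank.
example_seq = "A" * 50 + "G" + "C" * 50
left_flank, snp_base, right_flank = split_marker_sequence(example_seq)
```

A genotyping assay would compare the base observed at this position with the reference and alternative allele calls listed for the marker.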
Abstract
The invention relates to genetic markers associated with increased terpene production in cannabis. The genetic markers are useful, for example, for identifying, selecting, and/or breeding cannabis plants having modified terpene content.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202363611288P | 2023-12-18 | 2023-12-18 | |
| US63/611,288 | 2023-12-18 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2025137000A1 (fr) | 2025-06-26 |
Family
ID=96138788
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/US2024/060604 Pending WO2025137000A1 (fr) | 2023-12-18 | 2024-12-17 | Terpene genetic markers |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2025137000A1 (fr) |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20190230882A1 (en) * | 2016-05-27 | 2019-08-01 | New West Genetics | Industrial hemp cannabis cultivars and seeds with stable cannabinoid profiles |
| US20200316015A1 (en) * | 2017-02-07 | 2020-10-08 | Elevate Technologies Llc | Terpene-based compositions, methods of preparations and uses thereof |
| WO2022165507A1 (fr) * | 2021-01-28 | 2022-08-04 | Central Coast Agriculture, Inc. | Marker-assisted selection of cannabis plants |
| US20230002779A1 (en) * | 2020-06-29 | 2023-01-05 | Front Range Biosciences, Inc. | Characterization of plant cultivars based on terpene synthase gene profiles |
-
2024
- 2024-12-17 WO PCT/US2024/060604 patent/WO2025137000A1/fr active Pending
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 24908776 Country of ref document: EP Kind code of ref document: A1 |