[go: up one dir, main page]

WO2017095960A1 - Compositions et procédés pour l'expression de gènes dans des algues - Google Patents

Compositions et procédés pour l'expression de gènes dans des algues Download PDF

Info

Publication number
WO2017095960A1
WO2017095960A1 PCT/US2016/064275 US2016064275W WO2017095960A1 WO 2017095960 A1 WO2017095960 A1 WO 2017095960A1 US 2016064275 W US2016064275 W US 2016064275W WO 2017095960 A1 WO2017095960 A1 WO 2017095960A1
Authority
WO
WIPO (PCT)
Prior art keywords
seq
gene
promoter
sequence
heterologous
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2016/064275
Other languages
English (en)
Inventor
Eric R. MOELLERING
Amanda R. EDWARDS
Nicholas BAUMAN
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Viridos Inc
Original Assignee
Synthetic Genomics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Synthetic Genomics Inc filed Critical Synthetic Genomics Inc
Publication of WO2017095960A1 publication Critical patent/WO2017095960A1/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8216Methods for controlling, regulating or enhancing expression of transgenes in plant cells
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/405Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from algae
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8201Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
    • C12N15/8209Selection, visualisation of transformants, reporter constructs, e.g. antibiotic resistance markers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8216Methods for controlling, regulating or enhancing expression of transgenes in plant cells
    • C12N15/8218Antisense, co-suppression, viral induced gene silencing [VIGS], post-transcriptional induced gene silencing [PTGS]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8242Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
    • C12N15/8243Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
    • C12N15/8247Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving modified lipid metabolism, e.g. seed oil composition
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8261Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
    • C12N15/8262Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield involving plant development
    • C12N15/8269Photosynthesis

Definitions

  • the present invention relates generally to the field of genetic engineering of eukaryotic cells, such as microorganisms such as algae.
  • Algal cells are a promising source of biofuels (Wijffels & Barbosa (2010) Science 329:796-799). Their ability to harness solar energy to convert carbon dioxide into carbon-rich lipids already exceeds the abilities of oil-producing agricultural crops, with the added advantage that algae grown for biofuel do not compete with oil-producing crops for agricultural land (Wijffels & Barbosa, 2010).
  • Parachlorella phytoplankton are unicellular green algae (phylum Chlorophya) of the Trebouxiophyceae class that can be cultured easily, rapidly, and economically. In order to maximize algal fuel production, new algal strains will need to be engineered for growth and carbon fixation at an industrial scale (Wijffels & Barbosa, 2010).
  • the disclosure provides an isolated DNA molecule comprising a sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to at least 100, at least 200, at least 300, at least 400, or at least 500 contiguous nucleotides of a sequence selected from the group consisting SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO: 8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID NO:23.
  • the isolated DNA molecule can comprise a sequence having at least 80% at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to at least 100, at least 200, at least 300, at least 400, or at least 500 contiguous nucleotides extending in the 5' direction from the 3' end of a sequence selected from the group consisting of SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, and SEQ ID NO:23.
  • the present disclosure provides a promoter comprising a nucleic acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to at least 100, at least 200, at least 300, at least 400, or at least 500 contiguous nucleotides of a sequence selected from the group consisting of SEQ ID NO: l, SEQ ID NO: 4, SEQ ID NO: 8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, and SEQ ID NO:23.
  • the promoter can comprise a sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to at least 100, at least 200, at least 300, at least 400, or at least 500 contiguous nucleotides extending in the 5' direction from the 3' end of a sequence selected from the group consisting of SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, and SEQ ID NO:23.
  • the promoter comprises a sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to at least 100, at least 200, at least 300, at least 400, or at least 500 contiguous nucleotides extending in the 5' direction from the 3' end of a sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:8, and SEQ ID NO: 17.
  • the disclosure provides an isolated DNA molecule comprising a sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to at least 100, at least 200, at least 300, or at least 400 contiguous nucleotides of a sequence selected from the group consisting of SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, and SEQ ID NO:24, for example, comprising a sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to at least 100, at least 200, at least 300, or at least 400 contiguous nucleotides extending in the 3' direction from the 5' end of a sequence selected from the group consisting of
  • the DNA molecule comprises a terminator.
  • the terminator comprises a sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to at least 100, at least 200, at least 300, or at least 400 contiguous nucleotides of a sequence selected from the group consisting of SEQ ID NO:7, SEQ ID NO:9, and SEQ ID NO: 18.
  • the terminator can be operably linked to a heterologous sequence encoding a polypeptide or functional RNA.
  • engineered genes encoding polypeptides in which the engineered genes include two or more introns that are heterologous with respect to the gene encoding the polypeptide, i.e., two or more introns derived from genes other than the genes from with the polypeptide-encoding seqeuences are derived.
  • the two or more heterologous introns can be derived from the same gene, which is a different from the gene from which the polypeptide-encoding seqeuences are derived.
  • the engineered genes can include two, three, four, five, six, seven, eight, nine, or more than nine heterologous introns, which can optionally all be derived from the same gene.
  • An engineered gene that includes heterologous introns can be operably linked to a heterologous promoter, where the promoter in some embodiments can be derived from the same gene from which the two or more introns are derived.
  • An engineered gene that include heterologous introns can further include a terminator operably linked to the polypeptide encoding sequence, where the terminator in some embodiments can be derived from the same gene the two or more introns and the promoter are derived from.
  • the engineered gene includes two or more introns having at least 80%, or at least 85%, 90%, 95%), 96%), 97%), 98%), or 99% identity to a sequence selected from the group consisting of SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, or SEQ ID NO:43.
  • an engineered gene can include three or more introns selected from the group of introns having at least 80%, 85%, 90%, 95%, 96%. 97%, 98%, or 99% identity to SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, and SEQ ID NO:29.
  • an engineered gene can include three or more introns selected from the group of introns having at least 80%, 85%, 90%, 95%, 96%.
  • the at least two, at least three, at least four, or at least five heterologous introns that are derived from a different gene than the gene from which the amino acid-encoding sequences of the engineered gene are derived can be derived from a species different from the species from which the polypeptide encoded by the gene is derived.
  • the amino acid-encoding sequences of the gene can be codon-optimized, for example, for expression in a host organism of interest.
  • the at least two, at least three, at least four, or at least five heterologous introns can be naturally- occurring introns or can be derived from naturally-occurring introns, for example, the at least two, at least three, at least four, or at least five introns can be at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to a naturally-occurring intron or to an internally deleted variant of a natually-occurring intron, for example, an intron having from 1 to at least 2000 bp internally deleted from the naturally- occurring intron sequence.
  • the at least two, at least three, at least four, or at least five heterologous introns engineered into a gene to be expressed in a transgenic organism can all be derived from the same species, and in some embodiments are all derived from the same gene.
  • a nucleic acid molecule as provided herein can include an engineered gene that includes at least three, at least four, at least five, at least six, at least seven, at least eight, or at least nine heterologous introns, where at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, or all of the heterologous introns are derived from the same gene.
  • all of the introns of the engineered gene may be derived from a species that is not the source of the amino acid-encoding sequences.
  • the polypeptide encoded by the engineered gene is derived from a first species, and the at least three heterologous introns are derived from a second species.
  • the engineered gene can be operably linked to a promoter that in some examples may be derived from the same species as the heterologous introns.
  • a nucleic acid molecule can optionally further comprise a terminator operably linked to the engineered gene, where the terminator may be derived from the same species as the heterologous introns.
  • the engineered gene can optionally be codon-optimized, for example, the engineered gene can be codon-optimized for expression in a species from which the at least three heterologous introns are derived.
  • an engineered gene that is intronylated as disclosed herein can be a gene encoding a prokaryotic protein, such as a protein conferring resistance to an antibiotic or toxin or, for example, an RNA-guided endonuclease such as a cas protein.
  • an engineered gene can encode a selectable marker protein, detectable marker protein, or an RNA-guided endonuclease, such as but not limited to a Class II RNA-guided endonuclease, that may be, for example, a Cas9, Cpfl, C2cl, C2c2, or C2c3 RNA-guided nuclease.
  • RNA-guided endonuclease RNA-guided endonuclease.
  • an engineered gene can encode a metabolic enzyme, a kinase, a phosphatase, a nucleotide cyclase, a phosphodiesteras, a G protein, a protein involved in cell signaling, a transcription factor, a transcriptional activator, a cell cycle protein, a receptor, a porin, a channel protein, a transporter, a protein of a secretory system or protein import system, a ribosomal protein, a translation factor, a structural protein, a DNA binding protein, etc.
  • a nucleic acid molecule that includes an engineered gene that includes at least three heterologous introns can demonstrate a higher level of expression when introduced into a cell than is demonstrated by a nucleic acid molecule that includes an engineered gene that does not include at least three heterologous introns introduced into a cell. Increased expression can be reflected in higher transformation rates, where the engineeered gene encodes a selectable marker, for example, or a higher frequency of transformed cells exhibiting fully penetrant expression of the engineered gene.
  • the disclosure provides an expression cassette comprising a promoter as disclosed herein and a heterologous gene encoding a polypeptide or a functional RNA sequence operably linked to the promoter.
  • the expression cassette further comprises a terminator, such as for example a terminator as disclosed herein having at least 80%, at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to at least 200, at least 300, or at least 400 contiguous nucleotides of a sequence selected from the group consisting of SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, and SEQ ID NO:24, operably linked to the heterologous gene.
  • the heterologous gene encodes a polypeptide, such as but not limited to a polypeptide associated with lipid biosynthesis, a lipase, a polypeptide that participates in photosynthesis, a polypeptide associated with carbon fixation, a metabolic enzyme, a transporter protein, a dehydrogenase, a transcription factor, a transcriptional activator, a kinase, a phosphatase, a polypeptide involved in protein synthesis, or a cell signaling protein.
  • a polypeptide such as but not limited to a polypeptide associated with lipid biosynthesis, a lipase, a polypeptide that participates in photosynthesis, a polypeptide associated with carbon fixation, a metabolic enzyme, a transporter protein, a dehydrogenase, a transcription factor, a transcriptional activator, a kinase, a phosphatase, a polypeptide involved in protein synthesis, or a cell signaling protein.
  • the heterologous gene is a reporter gene (e.g., a fluorescent protein or signal- generating enzyme such as a luciferase) or a selectable marker gene the encodes a polypeptide conferring resistance to an antibiotic, herbicide, or toxin, or a polypeptide that allows growth of an otherwise auxotrophic cell on restrictive media.
  • a gene encoding a polypeptide in an expression cassette as provided herein can be intronylated, i.e., can be engineered to include one or more introns that do not naturally occur in the gene that encodes the polypeptide.
  • the engineered gene can include two, three, four, five, six, seven, eight, nine, or more than nine heterologous introns, such as but not limited to any disclosed herein.
  • the engineered gene is operably linked to a heterologous promoter that is derived from a species of interest and includes two or more, three or more, four or more, five or more heterologous introns derived from the same species of interest.
  • the two, three, four, or five or more heterologous introns engineered into a gene are derived from the same gene that is the source of the heterologous promoter that is operably linked to the gene.
  • the expression cassette that includes an engineered gene that includes two, three, four, five, or more heterologous introns and that is operably linked to a heterologous promoter can further be operably linked to a heterologous terminator, such as but not limited to any disclosed herein.
  • the engineered gene is operably linked to a heterologous promoter and a heterologous terminator that are derived from a species of interest and includes two or more, three or more, four or more, five or more heterologous introns derived from the same species of interest.
  • the two, three, four, or five or more heterologous introns engineered into a gene are derived from the same gene that is the source of the heterologous promoter and heterologous terminator that are operably linked to the engineered gene.
  • the gene can optionally be codon-optimized for expression in the species of interest.
  • an expression cassette as provided herein includes an engineered gene that includes a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:25, a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:26, a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:27, a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:28, and a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:29.
  • the engineered gene can be operably a promoter from a Parachlorella species, and can optionally be operably linked to a promoter having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to at least 100, at least 200, at least 300, at least 400, at least 500, or between 500 and 530 contiguous nucleotides from the 3' end of SEQ ID NO: l .
  • the engineered gene can optionally further be operable linked to a terminator from a Parachlorella species, such as but not limited to any disclosed herein and can optionally be operably linked to a terminator comprising at least 100 contiguous nucleotides of SEQ ID NO:7.
  • an expression cassette as provided herein can include an engineered gene that includes at least five heterologous introns selected from the group consisting of: a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%), or 99%) identity to SEQ ID NO:35; a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:36; a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:37; a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99%) identity to SEQ ID NO:38; a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99%) identity to
  • the engineered gene can be operably linked to a promoter from a Parachlorella species, and can optionally be operably linked to a promoter having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to at least 100, at least 200, at least 300, at least 400, at least 500, or between 500 and 530 contiguous nucleotides from the 3' end of SEQ ID NO: 13.
  • the engineered gene can optionally further be operably linked to a terminator from a Parachlorella species, such as but not limited to any disclosed herein, and can optionally be operably linked to a terminator comprising at least 100 contiguous nucleotides of SEQ ID NO: 14.
  • an expression cassette as provided herein can include a promoter such as any disclosed herein operably linked to a heterologous gene that encodes a functional RNA, such as for example a functional RNA selected from the group consisting of an antisense sequence, an RNAi molecule, a micro RNA, a shRNA, an siRNA, a gRNA, and a ribozyme.
  • a functional RNA such as for example a functional RNA selected from the group consisting of an antisense sequence, an RNAi molecule, a micro RNA, a shRNA, an siRNA, a gRNA, and a ribozyme.
  • the expression cassette can optionlly further include a terminator, such as but not limited to any disclosed herein.
  • the disclosure provides a vector comprising an expression cassette as disclosed herein and one or both of an autonomous replication sequence and a selectable marker gene.
  • the vector includes at least one origin of replication.
  • the vector further comprises an additional promoter, such as but not limited to a promoter as disclosed herein, operably linked to the selectable marker or reporter gene.
  • a selectable marker gene is selected from the group consisting of a gene conferring resistance to an antibiotic (e.g., tetracycline, doxycyclin, or analogs thereof, puromycin, hygromycin, blasticidin, bleomycin or phleomycin (ZeocinTM), nourseothricin), a gene conferring resistance to an herbicide, a gene encoding acetyl CoA carboxylase (ACCase), a gene encoding acetohydroxy acid synthase (ahas), a gene encoding acetolactate synthase, a gene encoding aminoglycoside phosphotransferase, a gene encoding anthranilate synthase, a gene encoding bromoxynil nitrilase, a gene encoding cytochrome P450-NADH-cytochrome P450 oxidoreductase, a gene encoding dalapon dehalogen
  • an antibiotic
  • a detectable marker gene can be, for example, a tyrosinase gene, lacZ, an alkaline phosphatase gene, an a-amylase gene, a horseradish peroxidase gene, an a-galactosidase gene, a luciferin/luciferase gene, a beta-glucuronidase gene (GUS), or a gene encoding a fluorescent protein.
  • the disclosure provides a method for transforming a eukaryotic cell comprising: introducing a vector such as a vector as disclosed herein into the eukaryotic cell; and selecting for a transformed eukaryotic cell.
  • the vector is introduced by electroporation.
  • the field strength is between about 250 kV/m and about 800 kV/m.
  • the field strength can be between about 500 kV/m and about 750 kV/m.
  • the field strength is between about 500 kV/m and about 600 kV/m.
  • the resistance can be between about 100 Ohms and about 400 Ohms, for example, between about 200 Ohms and about 400 Ohms. In some embodiments, the resistance is between about 200 Ohms and about 300 Ohms. In some embodiments, the capacitance is between about 10 ⁇ and about 50 ⁇ for example between about 25 ⁇ and about 50 ⁇ .
  • the eukaryotic cell can be an algal cell, for example, a Chlorophyte algal cell, and can be a cell of an alga belonging to the Trebouxiophyceae class, for example, an algal call of a species of a genus such as Botryococcus, Chlorella, Auxenochlorella, Heveochlorella, Marinichlorella, Parachlorella, Pseudochlorella, Tetrachlorella, Eremosphaera, Franceia, Micractinium, Nannochloris, Oocystis, Picochlorum, or Prototheca.
  • a species of a genus such as Botryococcus, Chlorella, Auxenochlorella, Heveochlorella, Marinichlorella, Parachlorella, Pseudochlorella, Tetrachlorella, Eremosphaera, Franceia, Micractinium, Nannochloris, Oocystis, Picochlorum, or
  • the methods are used to transform an algal species belonging to a genus such as Auxenochlorella, Chlorella, Heveochlorella, Marinichlorella, Parachlorella, Pseudochlorella or Tetrachlorella.
  • the vector is introduced by a biolistic procedure. In some embodiments, about 300 psi or more of pressure is used to impel biolistic microcarriers coated with vector DNA into the eukaryotic cell.
  • Yet another aspect of the invention is a eukaryotic host cell comprising a nucleic acid molecule that includes an engineered gene that includes at least five heterologous introns.
  • the engineered gene can be a gene not derived from the same genus as the eukaryotic host cell, or can be a gene not derived from the same species as the eukaryotic host cell.
  • the at least five heterologous introns are derived from introns of the genus or species of the eukaryotic host cell.
  • the nucleic acid molecule comprises a promoter operably linked to the engineered gene.
  • the promoter can be a promoter derived from an organism of the same genus or species as the eukaryotic host cell.
  • the nucleic acid molecule can include a terminator sequence, where the terminator sequence can optionally be derived from an organism of the same genus or same species as the eukaryotic host cell.
  • a eukaryotic host cell as provided herein that includes a recombinant nucleic acid molecule that includes a gene that includes at least five heterologous introns can exhibit a higher level of expresson of the engineered gene than is demonstrated by a control eukaryotic host cell that comprises a nucleic acid molecule comprising an otherwise identical gene that is does not include at least five heterologous introns.
  • the at least five heterologous introns can be derived from a gene of an organism of the same genus or species as the eukaryotic host cell, and in various embodiments, the engineered gene is operably linked to a heterologous promoter which is derived from a gene of the same genus or species as the eukaryotic host cell. Further, the engineered gene can be operably linked to a heterologous terminator sequence which can be derived from a gene of the same genus or species as the eukaryotic host cell.
  • the engineered gene of the eukaryotic host cell includes at least five heterologous introns that are all derived from the same gene, where the gene from with the introns are sourced is native to the host cell (i.e., of the same species), and the engineered gene is operably linked to a promoter and a terminator that are also both sourced form the same gene as the heterologous introns.
  • the eukaryotic host cell can be a eukaryotic microorganism, and can be, for example, a heterokont or microalga.
  • the algal cell can be a Chlorophyte, Charyophyte, Eustigmatophyte, or Bacillariophyte alga.
  • the algal cell can be from a species selected from the group consisting of Achnanthes, Amphiprora, Amphora, Ankistrodesmus, Asteromonas, Boekelovia, Bolidomonas, Borodinella, Botrydium, Botryococcus, Bracteococcus, Chaetoceros, Carteria, Chlamydomonas, Chlorococcum, Chlorogonium, Chlorella, Chroomonas, Chrysosphaera, Cricosphaera, Crypthecodinium, Cryptomonas, Cyclotella, Dunaliella, Ellipsoidon, Emiliania, Eremosphaera, Ernodesmius, Euglena, Eustigmatos, Franceia, Fragilaria, Gloeothamnion, Haematococcus, Halocafeteria, Heterosigma, Hymenomonas, Isochrysis, Lepocin
  • the eukaryotic host cell can be a Chlorophyte or Charyophyte microalga, and can be, for example, a Chlorophyte alga of the Chlorophyceae, the Trebouxiophyceae, the Chlorodendrophyceae, the Ulvophyceae, the Pedinophyceae, or the Prasinophyceae class.
  • the eukaryotic host cell is a Chlorophyte alga selected from the group consisting of Asteromonas, Ankistrodesmus, Carteria, Chlamydomonas, Chlorococcum, Chlorogonium, Chrysosphaera, Desmodesmus, Dunaliella, Haematococcus, Monoraphidium, Neochloris, Oedogonium, Pelagomonas, Pleurococcus, Pyrobotrys, Scenedesmus, Volvox, Botryococcus, Chlorella, Eremosphaera, Franceia, Micractinium, Nannochloris, Oocystis, Parachlorella, Picochlorum, Prototheca, Pseudochlorella, Stichococcus, Viridiella, Prasinocladus, Scherffelia, and Tetraselmis.
  • Chlorophyte alga selected from the group consisting of Asteromonas, Ankistro
  • the algal cell is a chlorophyte algal cell of the Trebouxiophyceae class, such as, for example, a species of a genus such as Actinastrum, Amphikorikos, Asterochloris, Auxenochlorella, Botryococcus, Chlorella, Choricystis, Coccomyxa, Coenocystis, Closteriopsis, Diacanthos, Dicloster, Dictyosphaerium, Dictoyochloropsis, Didymogenes, Diplosphaera, Eremosphaera, Franceia, Fusochloruis, Gloeotila, Helicosporidium, Heveochlorella, Koliella, Koriellopsis, Lagerheimia, Leptosira, Loboshpaera, Makinoella, Marvania, Muriella, Meyrella, Marinichlorella, Microthamnion, Micractinium, Myrme
  • a eukaryotic host cell can be a species belonging to the genus of Botryococcus, Chlorella, Eremosphaera, Franceia, Micractinium, Nannochloris, Oocystis, Parachlorella, Picochlorum, Prototheca, or Pseudochlorella.
  • the eukaryotic host cell can be a species belonging to the genus of Auxenochlorella, Chlorella, Heveochlorella, Marinichlorella, Parachlorella, Pseudochlorella or Tetrachlorella.
  • a cell used in the methods provided herein or that comprises a nucleic acid molecule, engineered gene, expression cassette, or vector as disclosed herein can optionally be a species of Parachlorella, such as non-limiting examples: Parachlorella kessieri, P. hussii, P. beijerinckii, P. sp. CCAP 206/1, or P. sp. pgu003.
  • Parachlorella kessieri such as non-limiting examples: Parachlorella kessieri, P. hussii, P. beijerinckii, P. sp. CCAP 206/1, or P. sp. pgu003.
  • the engineered host includes an engineered gene that includes a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:25, a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:26, a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:27, a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:28, and a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%), or 99% identity to SEQ ID NO:29.
  • the engineered gene can be operably linked to a promoter having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to at least 100, at least 200, at least 300, at least 400, at least 500, or between 500 and 530 contiguous nucleotides from the 3' end of SEQ ID NO: l .
  • the engineered gene can optionally further be operable linked to a terminator comprising at least 100 contiguous nucleotides of SEQ ID NO:7.
  • the engineered host can include an engineered gene that includes at least five heterologous introns selected from the group consisting of: a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:35; a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%), or 99% identity to SEQ ID NO: 36; a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:37; a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID NO:38; a heterologous intron having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99%)
  • the engineered gene can be operably linked to a promoter having at least 80%, or at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to at least 100, at least 200, at least 300, at least 400, at least 500, or between 500 and 530 contiguous nucleotides from the 3' end of SEQ ID NO: 13.
  • the engineered gene can optionally further be operably linked to a terminator comprising at least 100 contiguous nucleotides of SEQ ID NO: 14.
  • Yet another aspect of the invention is a method of expressing an exogenous gene in a recombinant eukaryotic microorganism comprising introducing a nucleic acid molecule that includes an engineered gene into the recombinant eukaryotic microorganism, where the engineered gene comprises at least three heterologous introns, and culturing the eukaryotic microorganism under conditions in which the eukaryotic microorganism expresses the engineered gene.
  • the engineered gene can include at least three heterologous introns that are derived from a gene of the same species as the eukaryotic microorganism.
  • the eukaryotic microorganism exhibits higher expression of the engineered gene than is exhibited by a control eukaryotic microorganism that includes an otherwise identical gene that does not include the at least three heterologous introns.
  • the engineered gene can be operably linked to a promoter derived from a gene of the host microorganism.
  • the engineered gene can additionally be operably linked to a terminator derived from a gene of the host microorganism.
  • the engineered gene includes at least four heterologous introns or at least five heterologous introns. The at least three, at least four, or at least five introns can all be derived from the same gene.
  • the microorganism can be a heterokont or alga, and in some embodiments is a chlorophyte alga, or example, a chlorophyte alga of the Chlorophyceae, the Trebouxiophyceae, the Chlorodendrophyceae, the Ulvophyceae, the Pedinophyceae, or the Prasinophyceae class, such as any disclosed herein.
  • the eukaryotic host cell is a chlorophyte alga selected from the group consisting of Chlorella, Parachlorella, Pseudochlorella, Auxenochlorella, Heveochlorella, Marinichlorella, or Tetrachlorella.
  • FIG. 1 is a diagram of plasmid pSGE6450 harboring a bleomycin R expression cassette that includes a blecomycin resistance gene (BleR, SEQ ID NO:2) operably linked to the RPS4 promoter (SEQ ID NO: 1) and the Nannochloropsis gaditana T4 terminator (SEQ ID NO:3) which generated one transformant when transformed into WT-1185 (Parachlorella sp.). Promoters are shown in black, open reading frames for genes in white, and terminators in gray.
  • BleR blecomycin resistance gene
  • SEQ ID NO:1 the Nannochloropsis gaditana T4 terminator
  • Figure 2 shows a picture of a gel of colony PCR of strain 6450-1 confirming the presence of plasmid pSGE6450. Sequence specific primers were used to amplify BleR gene (lanes labeled A) GFP (lanes labeled B), and RibD sequence region endogenous to WT-1185 (lanes labeled C). Lane 8 was loaded with the Log 2 DNA ladder from New England Biolabs. Samples tested by PCR are indicated by the group labels Negative (no template control PCR), TWE1185 (PCR template from WT-1185) and 6450-1 (PCR template from strain 6450-1).
  • Figures 3A-3B provides diagrams of A) plasmid pSGE6530 and B) plasmid pSGE6531. Positions of AscI and NotI restriction enzyme sites are indicated.
  • Figure 4 is a diagram of plasmid pSGE6543 that includes an intronylated BleR gene (SEQ ID NO:2) that includes five introns (SEQ ID NOs:25-29) of the RPS4 gene with as summary of construct components in the table below, including the RPS4 promoter (SEQ ID NO: l) and RPS4 terminator (SEQ ID NO:7). Positions of AscI and NotI restriction enzyme sites are indicated.
  • Figure 5 provides diagrams of constructs, including control plasmid pSGE6567 (upper diagram) where the ACPI promoter (SEQ ID NO: 8) and ACPI terminator (SEQ ID NO: 9) elements are positioned to drive the expression of GFP, and are adjacent to the BleR selection marker which is operably linked to the RPS4 promoter (SEQ ID NO: l) and the RPS4 terminator (SEQ ID NO:7).
  • pSGE6640 (middle) has the same organization as pSGE6567, except that the BleR selection marker is intronylated with the five introns (SEQ ID NOs:25-29) of the RPS4 gene as in pSGE6543 ( Figure 4).
  • the bottom diagram is a generalized diagram for plasmids pSGE6633 to pSGE6639, in which the promoter and terminator regions flanking GFP (and surrounded by dashed boxes) vary (see Table 1). As described in Example 7, seven promoter/terminator combinations were tested in this construct that harbored the intronylated BleR selectable marker expression cassette derived from plasmid pSGE6543.
  • Figures 6A-6D shows exemplary flow cytometry data for GFP fluorescence histograms of transformed lines indicating A) High level GFP expression with complete penetrance; B) Low level GFP expression with complete penetrance; C) No GFP fluorescence above wildtype; and D) Partial penetrance of GFP expression. Dark traces are from untransformed WT-1185 cells (background fluorescence), which are overlayed with light traces from different transgenic GFP-expressing strains generated from the experiment described in Example 6.
  • Figure 7 is a diagram of a construct that includes an intronylated version of the blaticidin resistance gene (BsdR; SEQ ID NO:26). This construct was used successfully in transformations (Example 8).
  • Figures 8A-8B provides diagrams of constructs that include A) non-intronylated and B) intronylated engineered nourseothricin resistance genes (natl).
  • Figures 9A-9C shows data from the optimization of the WT-1185 electroporation protocol.
  • A) Graph depicting colony number generated while determining the optimal voltage for electroporation of WT-1185.
  • lug pSGE06640 DNA was mixed with 5xl0
  • B) Graph depicting colony number generated while determining the optimal resistance and capacitance for transformation of WT1185 by electroporation.
  • lug pSGE06640 DNA was mixed with 5xl0 A 8cells in lOOuL 385mM sorbitol and electroporated at IkV applied to a 0.2cm cuvette under the indicated resistances and capacitances. Colonies were counted two weeks after transformation and selection.
  • Figure 10 provides diagrams of four constructs for expression of a codon-optimized, intronylated Cas9 gene.
  • the first construct, pSGE6707 included one FBPase intron
  • the second construct, pSGE6708 included two FBPase introns
  • the third construct, pSGE6709 included five FBPase introns
  • the fourth construct, pSGE6710 included nine FBPase introns. Introns are shown as arrows.
  • FIG. 11 Flow cytometry histograms show GFP fluorescence of putative transformants for Cas9 expression. 6709-1 and 6709-2 were the only two fully penetrant lines with strong GFP shifts. The black peaks represent the wild type strain WE-1185 background level of fluorescence; the gray peaks correspond to respective transformants.
  • FIG. 12 Anti-Cas9 Western Blot analysis of transformants. 6709-1 and 6709-2 had strong Cas9 expression; the band intensities are about 50% of the expression level observed in our Nannochloropsis Cas9 control strain GE-13038. Strain 6709-2 was given strain name GE- 15699. [0037] Figure 13 Overview of the colony PCR screening approach to detect insertion of the BleR cassette into the SRP54 locus.
  • FIG. 14 Colony PCR results. Transformants were screened using primers designed to amplify across the native targeted locus (AE596/AE597), which will produce a 700bp band if there is no integration or "knock-in" of the BleR cassette at the targeted locus, or a 4.3kb band if there is knock-in of a single BleR cassette. Colony PCR was also done using primers designed to amplify from the SRP54 gene (AE597), into our selectable marker (AE405/AE406).
  • a 1.2kb band will result from either amplification by primers 405/597 or 406/597 spanning from within the BleR cassette out to the SRP54 gene.
  • the minus gRNA control gel shows amplification of the native locus but no integration PCR products.
  • the present disclosure generally relates to compositions, methods and related materials for use in genetic engineering of organisms.
  • the disclosure provides methods and materials useful for affecting gene expression in vivo and/or in vitro.
  • Some embodiments disclosed herein relate to isolated, recombinant, or synthetic nucleic acid molecules having transcriptional regulatory activity such as, for example, regulatory elements.
  • Some embodiments disclosed herein relate to methods for modifying, making, and using such regulatory elements.
  • Some embodiments disclosed herein relate to recombinant cells, methods for making and using the same, and biomaterials derived therefrom.
  • the terms “about” or “approximately” when referring to any numerical value are intended to mean a value of plus or minus 10% of the stated value.
  • “about 50 degrees C” encompasses a range of temperatures from 45 degrees C to 55 degrees C, inclusive.
  • “about 100 mM” encompasses a range of concentrations from 90 mM to 110 mM, inclusive.
  • “about” or “approximately” can mean within 5% of the stated value, or in some cases within 2.5% of the stated value, or, “about” can mean rounded to the nearest significant digit. All ranges provided within the application are inclusive of the values of the upper and lower ends of the range.
  • cells include the primary subject cells and any progeny thereof, without regard to the number of transfers. It should be understood that not all progeny are exactly identical to the parental cell (due to deliberate or inadvertent mutations or differences in environment); however, such altered progeny are included in these terms, so long as the progeny retain the same functionality as that of the originally transformed cell.
  • the term "construct” is intended to mean any recombinant nucleic acid molecule such as an expression cassette, plasmid, cosmid, virus, autonomously replicating polynucleotide molecule, phage, or linear or circular, single-stranded or double-stranded, DNA or RNA polynucleotide molecule, derived from any source, capable of genomic integration or autonomous replication, comprising a nucleic acid molecule where one or more nucleic acid sequences has been linked in a functionally operative manner, i.e. operably linked.
  • control organism refers to an organism, microorganism, or cell that is substantially identical to the subject organism, microorganism, or cell, except for the engineered genetic manipulation or introduced mutation disclosed for the subject organism, microorganism, or cell, and can provide a reference point for measuring changes in phenotype of the subject organism or cell.
  • substantially identical thus includes, for example, small random variations in genome sequence (“S Ps") that are not relevant to the genotype, phenotype, parameter, or gene expression level that is of interest in the subject microorganism.
  • a control organism or cell may comprise, for example, (a) a progenitor strain or species, cell or microorganism population, or organism, with respect to the subject organism, microorganism, or cell, where the progenitor lacks the genetically engineered constructs or alterations that were introduced into the progenitor strain, species, organism, or cell or microorganism population to generate the subject organism, microorganism, or cell; b) a wild-type organism or cell, i.e., of the same genotype as the starting material for the genetic alteration which resulted in the subject organism or cell; (c) an organism or cell of the same genotype as the starting material but which has been transformed with a null construct (i.e.
  • control organism may refer to an organism that does not contain the exogenous nucleic acid present in the transgenic organism of interest, but otherwise has the same or very similar genetic background as such a transgenic organism.
  • mutant refers to an organism that has a mutation in a gene that is the result of classical mutagenesis, for example, using gamma irradiation, UV, or chemical mutagens.
  • “Mutant” as used herein also refers to a recombinant cell that has altered structure or expression of a gene as a result of genetic engineering that many include, as non-limiting examples, overexpression, including expression of a gene under different temporal, biological, or environmental regulation and/or to a different degree than occurs naturally and/or expression of a gene that is not naturally expressed in the recombinant cell; homologous recombination, including knock-outs and knock-ins (for example, gene replacement with genes encoding polypeptides having greater or lesser activity than the wild type polypeptide, and/or dominant negative polypeptides); gene attenuation via RNAi, antisense RNA, or ribozymes, or the like; and genome engineering using meganucleases,
  • exogenous with respect to a nucleic acid or gene indicates that the nucleic or gene has been introduced (“transformed") into an organism, microorganism, or cell by human intervention.
  • exogenous nucleic acid is introduced into a cell or organism via a recombinant nucleic acid construct.
  • An exogenous nucleic acid can be a sequence from one species introduced into another species, i.e., a heterologous nucleic acid.
  • An exogenous nucleic acid can also be a sequence that is homologous to an organism (i.e., the nucleic acid sequence occurs naturally in that species or encodes a polypeptide that occurs naturally in the host species) that has been isolated, optionally modified, and subsequently reintroduced into cells of that organism.
  • An exogenous nucleic acid that includes a homologous sequence can often be distinguished from the naturally-occurring sequence by detection of any introduced modifications and/or the presence of non-natural sequences linked to the exogenous nucleic acid, e.g., non- native regulatory sequences flanking the homologous gene sequence in a recombinant nucleic acid construct.
  • a stably transformed exogenous nucleic acid can be detected and/or distinguished from a native gene by its juxtaposition to sequences in the genome where it has integrated. Further, a nucleic acid is considered exogenous if it has been introduced into a progenitor of the cell, organism, or strain under consideration.
  • expression cassette refers to a nucleic acid construct that encodes a protein or functional RNA operably linked to expression control elements, such as a promoter, and optionally, any or a combination of other nucleic acid sequences that affect the transcription or translation of the gene, such as, but not limited to, a transcriptional terminator, a ribosome binding site, a splice site or splicing recognition sequence, an intron, an enhancer, a polyadenylation signal, an internal ribosome entry site, etc.
  • a "functional RNA molecule” is an RNA molecule that can interact with one or more proteins or nucleic acid molecules to perform or participate in a structural, catalytic, or regulatory function that affects the expression or activity of a gene or gene product other than the gene that produced the functional RNA.
  • a functional RNA can be, for example, a transfer RNA (tRNA), ribosomal RNA (rRNA), anti-sense RNA (asRNA), microRNA (miRNA), short-hairpin RNA (shRNA), small interfering RNA (siRNA), a guide RNA (gRNA), crispr RNA (crRNA), or transactivating RNA (tracrRNA) of a CRISPR system, small nucleolar RNAs (snoRNAs), piwi- interacting RNA (piRNA), or a ribozyme.
  • tRNA transfer RNA
  • rRNA ribosomal RNA
  • asRNA anti-sense RNA
  • miRNA microRNA
  • shRNA short-hairpin RNA
  • siRNA small interfering RNA
  • gRNA guide RNA
  • crRNA crispr RNA
  • tracrRNA transactivating RNA
  • genes are used broadly to refer to any segment of a nucleic acid molecule (typically DNA, but optionally RNA) encoding a polypeptide or expressed RNA.
  • genes include sequences encoding expressed RNA (which can include polypeptide coding sequences or, for example, functional RNAs, such as ribosomal RNAs, tRNAs, antisense RNAs, microRNAs, short hairpin RNAs, gRNAs, crRNAs, tracrRNAs, ribozymes, etc.).
  • Genes may further comprise regulatory sequences required for or affecting their expression, as well as sequences associated with the protein or RNA-encoding sequence in its natural state, such as, for example, intron sequences, 5' or 3' untranslated sequences, etc.
  • a gene may only refer to a protein-encoding portion of a DNA or RNA molecule, which may or may not include introns.
  • a gene is preferably greater than 50 nucleotides in length, more preferably greater than 100 nucleotide in length, and can be, for example, between 50 nucleotides and 500,000 nucleotides in length, such as between 100 nucleotides and 100,000 nucleotides in length or between about 200 nucleotides and about 50,000 nucleotides in length, or about 200 nucleotides and about 20,000 nucleotides in length.
  • Genes can be obtained from a variety of sources, including cloning from a source of interest or synthesizing from known or predicted sequence information.
  • nucleic acid refers to, a segment of DNA or RNA (e.g., mRNA), and also includes nucleic acids having modified backbones (e.g., peptide nucleic acids, locked nucleic acids) or modified or non-naturally-occurring nucleobases.
  • the nucleic acid molecules can be double-stranded or single-stranded; a single stranded nucleic acid that comprises a gene or a portion thereof can be a coding (sense) strand or a non-coding (antisense) strand.
  • coding sequence or “coding region” or “amino acid-encoding sequences” as used herein, refer to regions of a nucleic acid sequence which can be transcribed to produce a functional RNA or an RNA transcript that can be translated into a polypeptide when placed under the control of appropriate expression control sequences and in the presence of appropriate cellular machinery or enzymes.
  • non-coding sequence or “non-coding region” refers to regions of a nucleic acid sequence that are not transcribed and translated into amino acids (e.g., introns, untranslated regions, etc.) or are not transcribed or do not form at least a portion of a mature functional RNA sequence.
  • protein or “polypeptide” is intended to encompass a singular “polypeptide” as well as plural “polypeptides,” and refers to a molecule composed of monomers (amino acids) linearly linked by amide bonds (also known as peptide bonds).
  • polypeptide refers to any chain or chains of two or more amino acids, and does not refer to a specific length of the product.
  • a nucleic acid molecule may be "derived from” an indicated source, which includes the isolation (in whole or in part) of a nucleic acid segment from an indicated source.
  • a nucleic acid molecule may also be derived from an indicated source by, for example, direct cloning, PCR amplification, or artificial synthesis from the indicated polynucleotide source or based on a sequence associated with the indicated polynucleotide source.
  • Genes or nucleic acid molecules derived from a particular source or species also include genes or nucleic acid molecules having sequence modifications with respect to the source nucleic acid molecules.
  • a gene or nucleic acid molecule derived from a source can include one or more mutations with respect to the source gene or nucleic acid molecule that are unintended or that are deliberately introduced, and if one or more mutations, including substitutions, deletions, or insertions, are deliberately introduced the sequence alterations can be introduced by random or targeted mutation of cells or nucleic acids, by amplification or other molecular biology techniques, or by chemical synthesis, or any combination thereof.
  • a gene or nucleic acid molecule that is derived from a referenced gene or nucleic acid molecule that encodes a functional RNA or polypeptide can encode a functional RNA or polypeptide having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%), or at least 99% sequence identity with the referenced or source functional RNA or polypeptide, or to a functional fragment thereof.
  • a gene may also be disclosed as encoding a polypeptide derived from an indicated source.
  • amino acid sequence of a polypeptide encoded by a gene as disclosed herein may be an amino acid sequence of a polypeptide from a first organism other than a host organism, and the gene or polypeptide may be referred to as being derived from that organism, even if the gene has been codon-optimized for expression in the second (host) organism.
  • an "isolated" nucleic acid or protein is removed from its natural milieu or the context in which the nucleic acid or protein exists in nature.
  • an isolated protein or nucleic acid molecule is removed from the cell or organism with which it is associated in its native or natural environment.
  • An isolated nucleic acid or protein can be, in some instances, partially or substantially purified, but no particular level of purification is required for isolation.
  • an isolated nucleic acid molecule can be a nucleic acid sequence that has been excised from the chromosome, genome, or episome that it is integrated into in nature.
  • a "purified" nucleic acid molecule or nucleotide sequence, or protein or polypeptide sequence is substantially free of cellular material and cellular components.
  • the purified nucleic acid molecule or protein may be free of chemicals beyond buffer or solvent, for example. "Substantially free” is not intended to mean that other components beyond the novel nucleic acid molecules are undetectable.
  • Naturally-occurring and wild type refer to a form found in nature.
  • a naturally occurring or wild type nucleic acid molecule, nucleotide sequence or protein may be present in and isolated from a natural source, and is not intentionally modified by human manipulation.
  • expression includes the expression of a gene at least at the level of RNA production
  • an "expression product” includes the resultant product, e.g., a polypeptide or functional RNA (e.g., a ribosomal RNA, a tRNA, an antisense RNA, a micro RNA, an shRNA, a ribozyme, etc.), of an expressed gene.
  • a polypeptide or functional RNA e.g., a ribosomal RNA, a tRNA, an antisense RNA, a micro RNA, an shRNA, a ribozyme, etc.
  • increased expression includes an alteration in gene expression to facilitate increased mRNA production and/or increased polypeptide expression.
  • “Increased production” when referring to protein abundance or the abundance of active protein resulting from gene expression, protein turnover rates, protein activation states, and the like, includes an increase in the amount of polypeptide expression, in the level of the enzymatic activity of a polypeptide, or a combination of both, as compared to the native production or enzymatic activity of the polypeptide.
  • Attenuated means reduced in amount, degree, intensity, or strength. Attenuated gene expression may refer to a significantly reduced amount and/or rate of transcription of the gene in question, or of translation, folding, or assembly of the encoded protein. As nonlimiting examples, an attenuated gene may be due to a mutation or a disruption in the gene ⁇ e.g., a gene disrupted by partial or total deletion, truncation, frameshifting, or insertional mutation) or may have decreased expression due to alteration, replacement, and/or elimination of one or more gene regulatory sequences.
  • a mutant alga having attenuated expression of a gene can be a recombinant alga in which the attenuation is the result of genetic engineering, i.e., by human intervention that includes, typically, introduction of one or more non- native nucleic acid molecules or polypeptides into the alga.
  • gene attenuation can be by classical mutagenesis according to protocols known in the art or adapted therefrom.
  • An attenuated gene may also be a gene that is targeted by a "gene knockdown" construct, such as, for example, a construct encoding an antisense RNA, a microRNA, a short hairpin RNA, a guide RNA or transactivating RNA of a CRISPR system, or a ribozyme.
  • a "gene knockdown" construct such as, for example, a construct encoding an antisense RNA, a microRNA, a short hairpin RNA, a guide RNA or transactivating RNA of a CRISPR system, or a ribozyme.
  • Exogenous nucleic acid molecule or “exogenous gene” refers to a nucleic acid molecule or gene that has been introduced (“transformed") into a cell.
  • a transformed cell may be referred to as a recombinant cell, into which additional exogenous gene(s) may be introduced.
  • a descendent of a cell transformed with a nucleic acid molecule is also referred to as “transformed” if it has inherited the exogenous nucleic acid molecule.
  • the exogenous gene may be from a different species (and so “heterologous"), or from the same species (and so “homologous”), relative to the cell being transformed.
  • An "endogenous" nucleic acid molecule, gene or protein is a native nucleic acid molecule, gene or protein as it occurs in, or is naturally produced by, the host.
  • exogenous refers to a gene or protein that is not derived from the host organism species.
  • transgene refers to an exogenous gene, that is, a gene introduced into a microorganism or a progenitor by human intervention.
  • ortholog of a gene or protein as used herein refers to its functional equivalent in another species.
  • Gene and protein Accession numbers are unique identifiers for a sequence record publicly available at the National Center for Biotechnology Information (NCBI) website (ncbi.nlm.nih.gov) maintained by the United States National Institutes of Health.
  • NCBI National Center for Biotechnology Information
  • the "Genlnfo Identifier" (GI) sequence identification number is specific to a nucleotide or amino acid sequence. If a sequence changes in any way, a new GI number is assigned.
  • a Sequence Revision History tool is available to track the various GI numbers, version numbers, and update dates for sequences that appear in a specific GenBank record. Searching and obtaining nucleic acid or gene sequences or protein sequences based on Accession numbers and GI numbers is well known in the arts of, e.g., cell biology, biochemistry, molecular biology, and molecular genetics.
  • percent identity or “homology” with respect to nucleic acid or polypeptide sequences are defined as the percentage of nucleotide or amino acid residues in the candidate sequence that are identical with the known polypeptides, after aligning the sequences for maximum percent identity and introducing gaps, if necessary, to achieve the maximum percent homology.
  • N-terminal or C-terminal insertion or deletions shall not be construed as affecting homology, and internal deletions and/or insertions into the polypeptide sequence of less than about 30, less than about 20, or less than about 10 amino acid residues shall not be construed as affecting homology.
  • BLAST Basic Local Alignment Search Tool
  • blastp blastp, blastn, blastx, tblastn, and tblastx
  • blastp blastp
  • blastn blastn
  • blastx blastx
  • tblastn tblastx
  • tblastx Altschul (1997), Nucleic Acids Res. 25, 3389-3402, and Karlin (1990), Proc. Natl. Acad. Sci. USA 87, 2264-2268
  • the approach used by the BLAST program is to first consider similar segments, with and without gaps, between a query sequence and a database sequence, then to evaluate the statistical significance of all matches that are identified, and finally to summarize only those matches which satisfy a preselected threshold of significance.
  • the scoring matrix is set by the ratios of M (i.e., the reward score for a pair of matching residues) to N (i.e., the penalty score for mismatching residues), wherein the default values for M and N can be +5 and -4, respectively.
  • M i.e., the reward score for a pair of matching residues
  • N i.e., the penalty score for mismatching residues
  • the default values for M and N can be +5 and -4, respectively.
  • sequence identities of at least 40%, at least 45%, at least 50%, at least 55%, of at least 70%, at least 65%, at least 70%, at least 75%, at least 80%, or at least 85%, for example at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%), or about 100% sequence identity with the full-length polypeptide or nucleic acid sequence, or to fragments thereof comprising a consecutive sequence of at least 100, at least 125, at least 150 or more amino acid residues of the entire protein; variants of such sequences, e.g., wherein at least one amino acid residue has been inserted N- and/or C-terminal to, and/or within, the disclosed sequence(s) which contain(
  • Contemplated variants can additionally or alternately include those containing predetermined mutations by, e.g., homologous recombination or site-directed or PCR mutagenesis, and the corresponding polypeptides or nucleic acids of other species, including, but not limited to, those described herein, the alleles or other naturally occurring variants of the family of polypeptides or nucleic acids which contain an insertion and substitution; and/or derivatives wherein the polypeptide has been covalently modified by substitution, chemical, enzymatic, or other appropriate means with a moiety other than a naturally occurring amino acid which contains the insertion and substitution (for example, a detectable moiety such as an enzyme).
  • a detectable moiety such as an enzyme
  • non-native is used herein to refer to nucleic acid sequences or amino acid sequences as they naturally occur in the host.
  • non-native is used herein to refer to nucleic acid sequences or amino acid sequences that do not occur naturally in the host.
  • a nucleic acid sequence or amino acid sequence that has been removed from a cell, subjected to laboratory manipulation, and introduced or reintroduced into a host cell is considered “non-native.”
  • Synthetic or partially synthetic genes introduced into a host cell are “non-native.”
  • Non-native genes further include genes endogenous to the host alga operably linked to one or more heterologous regulatory sequences that have been recombined into the host genome.
  • a "recombinant" or “engineered” nucleic acid molecule is a nucleic acid molecule that has been altered through human manipulation.
  • a recombinant nucleic acid molecule includes any nucleic acid molecule that: 1) has been partially or fully synthesized or modified in vitro, for example, using chemical or enzymatic techniques ⁇ e.g., by use of chemical nucleic acid synthesis, or by use of enzymes for the replication, polymerization, digestion (exonucleolytic or endonucleolytic), ligation, reverse transcription, transcription, base modification (including, e.g., methylation), integration or recombination (including homologous and site-specific recombination) of nucleic acid molecules); 2) includes conjoined nucleotide sequences that are not conjoined in nature, 3) has been engineered using molecular cloning techniques such that it lacks one or more nucleotides with respect to the naturally occurring nucleic
  • a cDNA is a recombinant DNA molecule, as is any nucleic acid molecule that has been generated by in vitro polymerase reaction(s), or to which linkers have been attached, or that has been integrated into a vector, such as a cloning vector or expression vector.
  • recombinant protein refers to a protein produced by genetic engineering.
  • the term recombinant, engineered, or genetically engineered refers to organisms that have been manipulated by introduction of a heterologous or exogenous (e.g., non-native) recombinant nucleic acid sequence into the organism, and includes, without limitation, gene knockouts, targeted mutations, and gene replacement, promoter replacement, deletion, or insertion, or transfer of a nucleic acid molecule, e.g., a transgene, synthetic gene, promoter, or other sequence into the organism.
  • Recombinant or genetically engineered organisms can also be organisms into which constructs for gene "knock down" have been introduced.
  • Such constructs include, but are not limited to, one or more guide RNAs, RNAi, microRNA, shRNA, siRNA, antisense, and ribozyme constructs. Also included are organisms whose genomes have been altered by the activity of cas nucleases, meganucleases, or zinc finger nucleases. An exogenous or recombinant nucleic acid molecule can be integrated into the recombinant/genetically engineered organism's genome or in other instances are not integrated into the recombinant/genetically engineered organism's genome.
  • "recombinant alga" or "recombinant host cell” includes progeny or derivatives of the recombinant algae of the disclosure. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny or derivatives may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.
  • heterologous when used in reference to a polynucleotide, a gene, a nucleic acid, a polypeptide, or an enzyme, refers to a polynucleotide, gene, a nucleic acid, polypeptide, or an enzyme that is not derived from the host species.
  • heterologous gene or “heterologous nucleic acid sequence” as used herein, refers to a gene or nucleic acid sequence from a different species than the species of the host organism it is introduced into.
  • heterologous means that the regulatory or auxiliary sequence or sequence encoding a protein domain or localization sequence is from a different source than the gene with which the regulatory or auxiliary nucleic acid sequence or nucleic acid sequence encoding a protein domain or localization sequence is juxtaposed in a genome, chromosome or episome.
  • a promoter operably linked to a gene to which it is not operably linked to in its natural state is referred to herein as a “heterologous promoter,” even though the promoter may be derived from the same species (or, in some cases, the same organism) as the gene to which it is linked.
  • An intron inserted into a gene that it is not associated with in nature is referred to herein as a “heterologous intron,” even though the promoter may be derived from the same species (or, in some cases, the same organism) as the gene into which it is engineered.
  • heterologous means that the localization sequence or protein domain is derived from a protein different from that into which it is incorporated by genetic engineering.
  • hybridization refers generally to the ability of nucleic acid molecules to join via complementary base strand pairing. Such hybridization may occur when nucleic acid molecules are contacted under appropriate conditions and/or circumstances.
  • two nucleic acid molecules are said to be capable of specifically hybridizing to one another if the two molecules are capable of forming an anti-parallel, double-stranded nucleic acid structure.
  • a nucleic acid molecule is said to be the "complement” of another nucleic acid molecule if they exhibit complete complementarity.
  • nucleic acid molecules are said to exhibit "complete complementarity" when every nucleotide of one of the molecules is complementary to its base pairing partner nucleotide of the other.
  • Two molecules are said to be “minimally complementary” if they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under at least conventional "low-stringency” conditions.
  • the molecules are said to be “complementary” if they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under conventional "high-stringency” conditions.
  • Nucleic acid molecules that hybridize to other nucleic acid molecules are said to be "hybridizable cognates" of the other nucleic acid molecules.
  • Conventional stringency conditions are described by Sambrook et al., Molecular Cloning, A Laboratory Handbook, Cold Spring Harbor Laboratory Press, 1989), and by Haymes et al. In: Nucleic Acid Hybridization, A Practical Approach, IRL Press, Washington, D.C. (1985). Departures from complete complementarity are therefore permissible, as long as such departures do not completely preclude the capacity of the molecules to form a double-stranded structure.
  • a nucleic acid molecule or fragment thereof of the present disclosure it needs only be sufficiently complementary in sequence to be able to form a stable double-stranded structure under the particular solvent and salt concentrations employed.
  • Appropriate stringency conditions which promote DNA hybridization include, for example, 6. Ox sodium chloride/sodium citrate (SSC) at about 45oC, followed by a wash of 2.0x SSC at about 50oC.
  • SSC sodium chloride/sodium citrate
  • the temperature in the wash step can be increased from low stringency conditions at room temperature, about 22oC, to high stringency conditions at about 65oC.
  • Both temperature and salt may be varied, or either the temperature or the salt concentration may be held constant while the other variable is changed.
  • low stringency conditions may be used to select nucleic acid sequences with lower sequence identities to a target nucleic acid sequence.
  • High stringency conditions may be used to select for nucleic acid sequences with higher degrees of identity to the disclosed nucleic acid sequences (Sambrook et al., 1989, supra).
  • high stringency conditions involve nucleic acid hybridization in about 2x SSC to about lOx SSC (diluted from a 20x SSC stock solution containing 3 M sodium chloride and 0.3 M sodium citrate, pH 7.0 in distilled water), about 2.5 ⁇ to about 5 ⁇ Denhardt's solution (diluted from a 50x stock solution containing 1% (w/v) bovine serum albumin, 1% (w/v) ficoll, and 1% (w/v) polyvinylpyrrolidone in distilled water), about 10 mg/mL to about 100 mg/mL fish sperm DNA, and about 0.02% (w/v) to about 0.1% (w/v) SDS, with an incubation at about 50oC to about 70oC for several hours to overnight.
  • lOx SSC diluted from a 20x SSC stock solution containing 3 M sodium chloride and 0.3 M sodium citrate, pH 7.0 in distilled water
  • Denhardt's solution diluted from a 50x stock solution containing
  • High stringency conditions are preferably provided by 6x SSC, 5 x Denhardt's solution, 100 mg/mL sheared and denatured salmon sperm DNA, and 0.1%) (w/v) SDS, with incubation at 55 xC for several hours. Hybridization is generally followed by several wash steps.
  • the wash compositions generally comprise 0.5 x SSC to about lOx SSC, and 0.01% (w/v) to about 0.5% (w/v) SDS with a 15-min incubation at about 20°C to about 70°C.
  • the nucleic acid segments remain hybridized after washing at least one time in O. l x SSC at 65°C.
  • very high stringency conditions may be used to select for nucleic acid sequences with much higher degrees of identity to the disclosed nucleic acid sequences.
  • Very high stringency conditions are defined as prehybridization and hybridization at 42°C in 5 x SSPE, 0.3% SDS, 200 ⁇ g/mL sheared and denatured salmon sperm DNA, and 50% formamide and washing three times each for 15 minutes using 2x SSC, 0.2% SDS at 70°C.
  • expression cassette refers to a nucleic acid construct that encodes a protein or functional RNA (e.g. a tRNA, a short hairpin RNA, one or more microRNAs, a ribosomal RNA, etc.) operably linked to expression control elements, such as a promoter, and optionally, any or a combination of other nucleic acid sequences that affect the transcription or translation of the gene, such as, but not limited to, a transcriptional terminator, a ribosome binding site, a splice site or splicing recognition sequence, an intron, an enhancer, a polyadenylation signal, an internal ribosome entry site, etc.
  • a transcriptional terminator e.g. a tRNA, a short hairpin RNA, one or more microRNAs, a ribosomal RNA, etc.
  • regulatory sequence refers to a nucleotide sequence located upstream (5'), within, or downstream (3') of a coding sequence. Transcription of the coding sequence and/or translation of an RNA molecule resulting from transcription of the coding sequence are typically affected by the presence or absence of the regulatory sequence. These regulatory element sequences may comprise promoters, cis- elements, enhancers, terminators, or introns. Regulatory elements may be isolated or identified from UnTranslated Regions (UTRs) from a particular polynucleotide sequence. Any of the regulatory elements described herein may be present in a chimeric or hybrid regulatory expression element. Any of the regulatory elements described herein may be present in a recombinant construct of the present invention.
  • promoter refers to a nucleic acid sequence capable of binding RNA polymerase to initiate transcription of a gene in a 5' to 3' ("downstream") direction.
  • a gene is "under the control of or “regulated by” a promoter when the binding of RNA polymerase to the promoter is the proximate cause of said gene's transcription.
  • the promoter or promoter region typically provides a recognition site for RNA polymerase and other factors necessary for proper initiation of transcription.
  • a promoter may be isolated from the 5' untranslated region (5' UTR) of a genomic copy of a gene. Alternatively, a promoter may be synthetically produced or designed by altering known DNA elements.
  • chimeric promoters that combine sequences of one promoter with sequences of another promoter. Promoters may be defined by their expression pattern based on, for example, metabolic, environmental, or developmental conditions.
  • a promoter can be used as a regulatory element for modulating expression of an operably linked transcribable polynucleotide molecule, e.g., a coding sequence. Promoters may contain, in addition to sequences recognized by RNA polymerase and, preferably, other transcription factors, regulatory sequence elements such as cis- elements or enhancer domains that affect the transcription of operably linked genes.
  • An "algal promoter" is a native or non-native promoter that is functional in algal cells.
  • constitutive promoter refers to a promoter that is active under most environmental and developmental conditions.
  • a constitutive promoter is active regardless of external environment, such as light and culture medium composition.
  • a constitutive promoter is active in the presence and in the absence of a nutrient.
  • a constitutive promoter may be a promoter that is active (mediates transcription of a gene to which it is operably-linked) under conditions of nitrogen depletion as well as under conditions in which nitrogen is not limiting (nitrogen replete conditions).
  • an "inducible" promoter is a promoter that is active in response to particular environmental conditions, such as the presence or absence of a nutrient or regulator, the presence of light, etc.
  • operably linked denotes a configuration in which a control sequence is placed at an appropriate position relative to the coding sequence of a polynucleotide sequence such that the control sequence directs or regulates the expression of the coding sequence of a polypeptide and/or functional RNA).
  • a promoter is in operable linkage with a nucleic acid sequence if it can mediate transcription of the nucleic acid sequence.
  • an expression cassette can result in transcription and/or translation of an encoded RNA or polypeptide under appropriate conditions. Antisense or sense constructs that are not or cannot be translated are not excluded by this definition.
  • RNAi RNA RNA sequence
  • the inserted polynucleotide sequence need not be identical, but may be only substantially identical to a sequence of the gene from which it was derived. As explained herein, these substantially identical variants are specifically covered by reference to a specific nucleic acid sequence.
  • selectable marker or “selectable marker gene” as used herein includes any gene that confers a phenotype on a cell in which it is expressed to facilitate the selection of cells that are transfected or transformed with a nucleic acid construct of the invention.
  • the term may also be used to refer to gene products that effectuate said phenotypes.
  • Nonlimiting examples of selectable markers include: 1) genes conferring resistance to antibiotics such as amikacin (aphA6), ampicillin (ampR), blasticidin (bis, bsr, bsd), bleomicin or phleomycin (ZEOCINTM) (ble), chloramphenicol (cat), emetine (RBS14p or cry 1-1), erythromycin (ermE), G418 (GENETICINTM) (neo), gentamycin (aac3 or aacC4), hygromycin B (aphlV, hph, hpt), kanamycin (nptll), methotrexate (DHFR mtxR), penicillin and other ⁇ -lactams ( ⁇ -lactamases), streptomycin or spectinomycin (aadA, spec/strep), and tetracycline (tetA, tetM, tetQ); 2) genes conferring tolerance
  • a "reporter gene” is a gene encoding a protein that is detectable or has an activity that produces a detectable product.
  • a reporter gene can encode a visual marker or enzyme that produces a detectable signal, such as cat, lacZ, uidA, xylE, an alkaline phosphatase gene, an a- amylase gene, an a-galactosidase gene, a ⁇ -glucuronidase gene, a ⁇ -lactamase gene, a horseradish peroxidase gene, a luciferin/luciferase gene, an R-locus gene, a tyrosinase gene, or a gene encoding a fluorescent protein, including but not limited to a blue, cyan, green, red, or yellow fluorescent protein, a photoconvertible, photoswitchable, or optical highlighter fluorescent protein, or any of variant thereof, including, without limitation, codon-optimized, rapidly folding, monomeric, increased stability,
  • terminal or “terminator sequence” or “transcription terminator” as used herein refers to a regulatory section of genetic sequence that causes RNA polymerase to cease transcription.
  • transformation refers to the introduction of one or more exogenous nucleic acid sequences or polynucleotides into a host cell or organism by using one or more physical, chemical, or biological methods.
  • Physical and chemical methods of transformation i.e., “transfection” include, by way of non-limiting example, electroporation, particle bombardment, and liposome delivery.
  • Biological methods of transformation i.e., "transduction" include transfer of DNA using engineered viruses or microbes (e.g., Agrobacterium).
  • photosynthetic organism is any prokaryotic or eukaryotic organism that can perform photosynthesis.
  • Photosynthetic organisms include higher plants (i.e., vascular plants), bryophytes, algae, and photosynthetic bacteria.
  • algae includes, but is not limited to, a species of Bacillariophyceae (diatoms), Bolidomonas, Chlorophyceae (green algae), Chrysophyceae (golden algae), Carophyceae, Cyanophyceae (cyanobacteria), Eustigmatophyceae (pico-plankton), Glaucocystophytes, Pelagophytes, Phaeophyceae (brown algae), Prasinophyceae (pico-plankton), Raphidophytes, Rhodophyceae (red algae), Synurophyceae and Xanthophyceae (yellow-green algae).
  • algae includes microalgae.
  • microalgae refers to microscopic, single-celled algae species including, but not limited to, eukaryotic single-celled algae of the Bacillariophyceae, Chlorophyceae, Prasinophyceae, Trebouxiophyceae, and Eustigmatophyceae classes.
  • photosynthetic bacteria includes, but is not limited to, cyanobacteria, green sulfur bacteria, purple sulfur bacteria, purple non-sulfur bacteria, and green non-sulfur bacteria.
  • Genes were identified and isolated from a Parachlorella environmental isolate (designated WT-1185) as sources for promoter and terminator sequences that can find use in the expression of genes, such as but not limited to transgenes, in eukaryotic microorganisms.
  • SEQ ID NO: l SEQ ID NO: 4, SEQ ID NO: 8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, and SEQ ID NO:23 were identified to comprise promoters that were demonstrated to mediate expression of transgenes in WT-1185, and SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, and SEQ ID NO:24 were identified as comprising terminators functional in WT-1185.
  • RPS4 Ribosomal Protein S4 p06567 SEQ ID NO: 1 SEQ ID NO: 7
  • ACP Acyl-Carrier Protein
  • FBPase Fructose 1,6-Bisphosphatase
  • isolated and recombinant DNA (nucleic acid) molecules are provided herein that comprise nucleotide sequences having about 80% identity to at least 100 contiguous nucleotides to any one of SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 1 1, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, or SEQ ID NO:24.
  • a nucleotide sequence as provided herein can direct expression in a eukaryotic cell, such as but not limited to an algal, fungal, or chlorophyte cell, of a nucleic acid sequence to which it is operably linked, for example, the nucleotide sequence can direct expression of a nucleic acid sequence of a protein-encoding sequence or a sequence encoding a functional RNA.
  • molecule nucleotide sequence as provided herein can mediate transcriptional termination of a gene to which it is operably linked.
  • an isolated DNA molecule as provided herein can comprise a nucleotide sequence having at least 85% or at least 90% identity to at least 100, at least 200, at least 300, at least 400, or at least 500 contiguous nucleotides of SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 1 1, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO: 7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, or SEQ ID NO:24.
  • an isolated or recombinant DNA molecule as provided herein can include a nucleotide sequence that can have at least 95%, 96%>, 97%, 98%>, or 99% percent identity to at least 100 contiguous nucleotides to any one of SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:7, SEQ ID NO: 9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, or SEQ ID NO:24.
  • an isolated DNA molecule can comprise any of SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:7, SEQ ID NO: 9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, or SEQ ID NO:24.
  • a nucleotide sequence as provided herein can mediate or enhance expression of a heterologous nucleic acid sequence to which it is operably linked.
  • the isolated DNA molecule can have terminator activity, for example, the nucleotide sequence can mediate transcriptional termination of a gene to which it is operably linked.
  • isolated and recombinant DNA (nucleic acid) molecules are provided herein that comprise nucleotide sequences having about 80%> identity to at least 100 contiguous nucleotides from the 3 '-most end of any one of SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, and SEQ ID NO:23.
  • an isolated DNA molecule as provided herein can have at least 85%> or at least 90% identity to at least 100, at least 200, at least 300, at least 400, or at least 500 contiguous nucleotides, extending from the 3' end and extending 5', of SEQ ID NO: l, SEQ ID NO: 4, SEQ ID NO: 8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, and SEQ ID NO:23.
  • a DNA molecule as provided herein can include a nucleotide sequence that can have at least 95%, 96%, 97%), 98%), or 99% percent identity to at least 100 contiguous nucleotides extending from the 3' end of any one of SEQ ID NO: 1, SEQ ID NO: 4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, and SEQ ID NO:23.
  • an isolated DNA molecule can comprise any of SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, and SEQ ID NO:23.
  • a nucleic acid molecule as provided herein can mediate or enhance expression of a heterologous nucleic acid sequence to which it is operably linked.
  • the isolated DNA molecule can find use, for example, as a sequence that when operably linked to a nucleic acid sequence can affect expression of the nucleic acid sequence, which can comprise, for example, a sequence encoding a polypeptide or functional RNA.
  • the isolated DNA molecule can increase or decrease expression of a nucleic acid sequence to which it is operably linked, or may mediate transcription of the operably-linked nucleic acid sequence as a promoter.
  • Methods for assessing the functionality of nucleotide sequences for promoter activity as well as for enhancing or decreasing the activity of proximal promoters are well-known in the art.
  • promoter function can be validated by confirming the ability of the putative promoter or promoter variant or fragment to drive expression of a selectable marker gene to which the putative promoter or promoter fragment or variant is operably linked by detecting and, optionally, analyzing, resistant colonies (and/or their progeny) after plating of cells transformed with the promoter construct on selective media.
  • promoter activity may be assessed by measuring the levels of RNA transcripts produced from a promoter construct, for example, using reverse transcription - polymerase chain reaction (RT-PCR, e.g., Watt et al.
  • promoter activity can be assessed using chloramphenicol acetyltransferase (CAT) assays (where the heterologous sequence operably linked to the isolated nucleic acid molecule that comprises a putative promoter encodes chloramphenicol acetyltransferase, see, for example, Gerrish et al. (2000) J. Biol. Chem.
  • CAT chloramphenicol acetyltransferase
  • luciferase assays where the heterologous nucleic acid is a lux or luc gene, for example (see, for example, Ferrante et al. (2008) PLoS ONE 3 : e3200), or in vivo assays using a fluorescent protein gene to determine the functionality of any of the sequences disclosed herein, including sequences of reduced size or having one or more nucleotide changes with respect to any of SEQ ID NOs 1-8. (see, for example, Akamura et al. (2011) Anal. Biochem. 412: 159-164).
  • promoters comprising a nucleic acid sequence such as any described herein, for example, a nucleic acid sequence having at least 80% identity to at least 100 contiguous nucleotides of SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, or SEQ ID NO:23.
  • a promoter as provided herein may include a nucleotide sequence that has at least 85% or at least 90% sequence identity to at least 100, 200, 300, 400, or 500 contiguous nucleotides of SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, or SEQ ID NO:23.
  • a promoter can include a sequence that has at least 95% sequence identity to at least 100, 200, 300, 400, or 500 contiguous nucleotides of SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, or SEQ ID NO:23.
  • a promoter as provided herein can be selected from the group consisting of an isolated DNA molecule comprising at least 100 contiguous nucleotides of SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, or SEQ ID NO:23.
  • a promoter as provided herein includes a nucleotide sequence that has at least 85%, at least 90%, or at least 95% sequence identity to at least 100, 200, 300, 400, or 500 contiguous nucleotides extending from the 3' end toward the 5' end of any of SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO: 8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, or SEQ ID NO:23.
  • a promoter as provided herein comprises a nucleic acid sequence having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% to at least 100, 200, 300, 400, 500, or 530 contiguous nucleotides extending from the 3' end of SEQ ID NO: l .
  • a promoter as provided herein comprises a nucleic acid sequence having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% to at least 100, 200, 300, 400, 500, 600, 700, 800, 900, or 1000 contiguous nucleotides extending from the 3' end of SEQ ID NO:4.
  • a promoter as provided herein comprises a nucleic acid sequence having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% to at least 100, 200, 300, 400, 500, or 572 contiguous nucleotides extending from the 3' end of SEQ ID NO:8.
  • a promoter as provided herein comprises a nucleic acid sequence having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% to at least 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, or 1044 contiguous nucleotides extending from the 3' end of SEQ ID NO: 11.
  • a promoter as provided herein comprises a nucleic acid sequence having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% to at least 100, 200, 300, 400, 500, 600, 700, 800, or 832 contiguous nucleotides extending from the 3' end of SEQ ID NO: 13.
  • a promoter as provided herein comprises a nucleic acid sequence having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% to at least 100, 200, 300, 400, 500, 600, or 642 contiguous nucleotides extending from the 3' end of SEQ ID NO: 15.
  • a promoter as provided herein comprises a nucleic acid sequence having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% to at least 100, 200, 300, 400, 500, or 588 contiguous nucleotides extending from the 3' end of SEQ ID NO: 17.
  • a promoter as provided herein comprises a nucleic acid sequence having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% to at least 100, 200, 300, 400, 500, 600, 700, or 707 contiguous nucleotides extending from the 3' end of SEQ ID NO: 19.
  • a promoter as provided herein comprises a nucleic acid sequence having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% to at least 100, 200, 300, 400, 500, 600, 700, 800, or 874 contiguous nucleotides extending from the 3' end of SEQ ID NO:21.
  • a promoter as provided herein comprises a nucleic acid sequence having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% to at least 100, 200, 300, 400, 500, 600, 700, 800, or 874 contiguous nucleotides extending from the 3' end of SEQ ID NO:23.
  • a promoter is selected from the group consisting of: SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, and SEQ ID NO:23.
  • a promoter as provided herein can be a constitutive promoter, for example the promoter may be active in culture conditions in which one or more nutrients are deficient as well as in culture conditions in which nutrients are sufficient for proliferation and/or growth of the culture.
  • a promoter as provided herein may direct expression of an operably linked nucleic acid sequence under conditions in which a host cell that includes the promoter construct is limited in nitrogen availability (nitrogen depletion/deficiency) as well as under conditions in which a host cell that includes the promoter construct is not limited in nitrogen availability (nitrogen replete conditions).
  • a promoter as provided herein may direct expression of an operably linked nucleic acid sequence under conditions in which a host cell that includes the promoter construct is exposed to varying or continuous light intensities throughout the day.
  • promoters allow RNA polymerase to attach to the DNA near a gene in order for transcription to take place.
  • Promoters contain specific DNA sequences that provide transcription factors an initial binding site from which they can recruit RNA polymerase binding. These transcription factors have specific protein motifs that enable them to interact with specific corresponding nucleotide sequences to regulate gene expressions.
  • the proximal promoter sequence may be between approximately 50, and 500 bp upstream of the translational start site of the open reading frame of the gene, although these are general guidelines only, and may contain, in addition to sequences for binding RNA polymerase, specific transcription factor binding sites.
  • Some promoters also include distal sequence upstream of the gene that may contain additional regulatory elements, often with a weaker influence than the proximal promoter. Eukaryotic transcriptional complexes can bend the DNA back on itself, thus allowing for potential placement of additional regulatory sequences as far as several kilobases from the TSS. Many eukaryotic promoters contain a TATA box, although this is not a universal feature of eukaryotic promoters. The TATA box binds the TATA binding protein, which assists in the formation of the RNA polymerase transcriptional complex. TATA boxes usually lie within approximately 50 bp of the TSS. A promoter may be constitutive or expressed conditionally.
  • promoters are inducible, and may activate or increase transcription in response to an inducing agent. In contrast, the rate of transcription of a gene under control of a constitutive promoter is not dependent on an inducing agent.
  • a constitutive promoter can be made a conditional or inducible promoter by the addition of sequences that confer responsiveness to particular conditions or to an inducing agent.
  • promoters provided herein may be constitutive or may be inducible or conditional. Further, promoters or portions of promoters may be combined in series to achieve a stronger level of expression or a more complex pattern of regulation.
  • a promoter as provided herein such as but not limited to a promoter that comprises a nucleotide sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or about 100% identity to at least 100, at least 200, at least 300, at least 400, or at least 500 contiguous nucleotides of SEQ ID NCv l, SEQ ID NO:4, SEQ ID NO: 8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, or SEQ ID NO:23, can mediate transcription of an operably linked nucleic acid sequence in a eukaryotic algal cell, such as, but not limited to any species of any of the genera Achnanthes, Amphiprora, Amphora, Ankistrodesmus, Asteromonas, Boekelovia, Bolidomon
  • a promoter as disclosed herein can be functional in a green alga, i.e., an algal member of the Chlorophyte division of the Viridiplantae kingdom, including without limitation, a microalga of any of the classes Chlorophyceae, Chlorodendrophyceae, Pedinophyceae, Pleurastrophyceae, Prasinophyceae, and Trebouxiophyceae.
  • a promoter as provided herein can be operable in a species that is a member of any of the Chlorophyceae, Prasinophyceae, Trebouxiophyceae, or Chlorodendrophyceae classes, such as a species of any of the Asteromonas, Ankistrodesmus, Carteria, Chlamydomonas, Chlorococcum, Chlorogonium, Chrysosphaera, Desmodesmus, Dunaliella, Haematococcus, Monoraphidium, Neochloris, Oedogonium, Pelagomonas, Pleurococcus,, Pyrobotrys, Scenedesmus, Volvox, Micromonas, Ostreococcus Prasinocladus Scherffelia, Tetraselmis, Botryococcus, Chlorella, Eremosphaera, Franceia, Micractinium, Nannochloris, Oocystis, Parachlorella, Picochlor
  • a promoter as provided can direct expression of a gene to which it is operably linked in a species of Trebouxiophyceae, such as but not limited to Botryococcus, Chlorella, Eremosphaera, Franceia, Micractinium, Nannochloris, Oocystis, Parachlorella, Picochlorum, Prototheca, or Pseudochlorella.
  • a species of Trebouxiophyceae such as but not limited to Botryococcus, Chlorella, Eremosphaera, Franceia, Micractinium, Nannochloris, Oocystis, Parachlorella, Picochlorum, Prototheca, or Pseudochlorella.
  • a promoter as provided herein can mediate transcription of an operably linked nucleic acid sequence in a eukaryotic cell, such as but not limited to a eukaryotic algal cell, during culturing of the cell under conditions of nitrogen depletion as well as during culturing of the cell under nitrogen replete conditions.
  • a promoter as described herein can preferably mediate transcription of an operably linked nucleic acid sequence in Parachlorella cells cultured under conditions of nitrogen depletion or cultured under nitrogen replete conditions.
  • a promoter or promoter region can include variants of the promoters disclosed herein derived by deleting sequences, duplicating sequences, or adding sequences from other promoters or as designed, for example, by bioinformatics, or by subjecting the promoter to random or site-directed mutagenesis, etc.
  • promoters can obtained by truncation of any of SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, and SEQ ID NO:23, or truncated versions of variants thereof having at least 80%, at least 85%, at least 90%, or at least 95% identity thereto.
  • Promoters of the present invention can include nucleic acid sequences having at least 80%, at least 85%, at least 90%, at least 95%, or between 95% and 100% identity to the sequences between about 0 bp, 20 bp, 50 bp, 100 bp, 200 bp or 300 bp to about 500 bp, 600 bp, 700 bp, 800 bp, 900 bp, or 1 kb upstream of the trinucleotide ATG sequence at the start site of a protein coding region of a native Parachlorella gene, such as, for example, an RSP4 gene, a photosystem II subunit R (PsbR) gene, an organellar oligopeptidase A gene, a FPBase gene, an Elongation Factor 2 gene, a 30S ribosomal protein S17 gene, a mitochondrial ATP synthase gene, a rubisco small subunit isoform 1 gene, a rubisco small subunit iso
  • the activity or strength of a promoter may be measured in terms of the amounts of RNA it produces, or the amount of protein accumulation in a cell or tissue, which can optionally be measured by an activity of the expressed protein, e.g., fluorescence, luminescence, acyltransferase activity, etc., relative to a control promoter, for example, a promoter whose transcriptional activity has been previously assessed or a promoter from which a truncated and/or sequence modified variant has been derived, relative to a promoterless construct, or relative to non-transformed cells.
  • an activity of the expressed protein e.g., fluorescence, luminescence, acyltransferase activity, etc.
  • a control promoter for example, a promoter whose transcriptional activity has been previously assessed or a promoter from which a truncated and/or sequence modified variant has been derived, relative to a promoterless construct, or relative to non-transformed cells.
  • the activity or strength of a promoter may be measured in terms of the amount of mRNA accumulated that corresponds to a nucleic acid sequence to which it is operably linked in a cell, relative to the total amount of mRNA or protein produced by the cell.
  • the promoter preferably expresses an operably linked nucleic acid sequence at a level greater than 0.01%; preferably in a range of about 0.5% to about 20% (w/w) of the total cellular RNA.
  • the activity can also be measured by quantifying fluorescence, luminescence, or absorbance of the cells or a product made by the cells or an extract thereof, depending on the activity of a reporter protein that may be expressed from the promoter.
  • the activity or strength of a promoter may be expressed relative to a well-characterized promoter (e.g., for which transcriptional activity was previously assessed).
  • a less-characterized promoter may be operably linked to a reporter sequence (e.g., a fluorescent protein) and introduced into a specific cell type.
  • a well-characterized promoter is similarly prepared and introduced into the same cellular context. Transcriptional activity of the unknown promoter is determined by comparing the amount of reporter expression, relative to the well characterized promoter.
  • a promoter described herein can have promoter activity in a eukaryotic cell, preferably in an algal cell or chlorophyte cell.
  • a promoter as provided herein is active in an algal or chlorophyte cell in nutrient replete and nutrient-depleted culture conditions.
  • An algal promoter as provided herein can be used as a 5' regulatory element for modulating expression of an operably linked gene or genes in algal species as well as other organisms, including fungi, chlorophyte algae, and plants.
  • terminators comprise a nucleotide sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or about 100% identity to at least 100 or at least 150 contiguous nucleotides of SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, or SEQ ID NO:24, for example, having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or about 100% identity to at least 100 or at least 150 contiguous nucleotides from the 5' end extending toward the 3' end of SEQ ID NO: 7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO
  • a terminator can have at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or about 100% identity to at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, at least 800, at least 900, or at least 950 contiguous nucleotides of SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, or SEQ ID NO:24.
  • a terminator as provided herein can have at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or about 100% identity to SEQ ID NO: 7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, or SEQ ID NO:24.
  • Terminators are genetic sequences that mark the end of a gene for transcription. Without being bound by theory, the terminators of the present invention may improve expression of the nucleic acid sequence (amount of encoded RNA or protein produced), and may mediate polyadenylation or enhance RNA transcript stability. Most terminator sequences in eukaryotes consist of at least two DNA sequences: (1) a binding site for terminator proteins and (2) an upstream element located among the last twelve nucleotides of the transcript. The protein binding sites are usually orientation-sensitive and essential to termination. Termination usually occurs between twelve and twenty nucleotides upstream of the binding site. The upstream element's functionality usually depends more on its overall base composition (T-rich) than on the specific sequence (Reeder & Lang (1997) Trends Biochem. Sci. 22:473-477, herein incorporated by reference in its entirety). INTRONS
  • intron is used herein to refer to a nucleotide sequence within a gene that is removed from the RNA transcribed from the gene by RNA splicing. (The term intron is used to refer to the RNA sequence as it occurs in RNA molecules prior to splicing as well as to the DNA sequence as it occurs in the gene.)
  • the introns disclosed herein are "spliceosomal introns” that occur naturally in the nuclear genes of eukaryotes and are spliced out by the splicing machinery (spliceosome) of eukaryotic cells.
  • introns derived from naturally-occurring introns e.g., introns at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%. 97%, 98%, or 99% identical to the sequence of a naturally-occurring intron or an internally deleted variant thereof, for example, a variant having from 1 to 1000 bp deleted from within the borders of the intron.
  • chimeric introns that comprise intron sequences of two or more naturally- occurring introns. Introns include a GT (GU in the primary RNA transcript) at the 5' end, a branch site sequence near the 3 ' end of the intron, and an AG acceptor site at the 3 ' end of the intron.
  • the surrounding exon sequence includes a GG at the 5' border with the intron, and a G after the AG at the 3' end of the intron.
  • Such sequences can optionally be engineered into the coding sequences of a gene as provided herein at the site of intron insertion.
  • An intronylated gene as provided herein is engineered to include at least one heterologous intron, that is, at least one intron that does not naturally occur in the gene that encodes the polypeptide encoded by the engineered gene, and an intronylated gene in some embodiments is preferably engineered to include at least three, at least four, or at least five heterologous introns, that is, at least three, at least four, or at least five introns that do not naturally occur in the gene.
  • amino acid-encoding sequences of the engineered gene can encode a polypeptide that is not encoded by the gene from which the heterologous introns are derived.
  • the heterologous introns are inserted into a gene that they do not occur in naturally, for example, using genetic engineering or gene synthesis techniques.
  • the amino acid-encoding sequences of the engineered gene may optionally be altered for example to generate sequences immediately proximal to a heterologous intron to allow for correct splicing of the introduced intron and/or to alter the codon usage (for example, to reflect a codon preference of the host) and/or to introduce a mutation.
  • the at least three heterologous introns are derived from one or more genes other than the gene from which the amino acid-encoding sequences of the engineered gene are derived, for example, the at least three exogenous introns can be derived from naturally-occurring introns.
  • the at least three, at least four, or at least five exogenous introns can be naturally-occurring introns from another gene of the same or different organism from which the amino acid encoding sequences of the engineered gene are derived, or can be derived from naturally-occurring introns from another gene of the same or different organism from which the amino acid encoding sequences of the engineered gene, for example, by one or more sequence modifications or internal deletion of sequences from the naturally-occurring intron(s).
  • the at least three, at least four, or at least five exogenous introns inserted into an engineered gene are all naturally- occurring introns of the same gene, and in some embodiments multiple introns of the same naturally-occurring gene may be introduced into the engineered gene in the same order in which they occur in the naturally-occurring gene from which they are derived.
  • the engineered gene is operably linked to a promoter, and the promoter and exogenous introns can optionally be derived from the same organism.
  • the engineered gene is operably linked to a promoter and a terminator, and the promoter, terminator, and exogenous introns can all be derived from the same organism, and can all be derived from the same gene.
  • the amino acid-encoding sequences of the engineered gene can be codon-optimized, and in some examples can be codon optimized for expression in an organism from which the exogenous introns are derived.
  • An engineered gene as provided herein can include at least three, at least four, at least five, at least six, at least seven, at least eight, or at least nine exogenous introns.
  • an engineered gene includes at least five introns.
  • the introns can range in size from about 15 nucleotides to 2000 nucleotides or more, for example, from 30 nucleotides to 1000 nucleotides or from about 60 nucleotides to about 500 nucleotides in length.
  • the three or more introns are selected from the group consisting of introns having at least 80%, 85%, 90%, 95%, 96%.
  • an engineered gene can include three or more introns selected from the group of introns having at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%.
  • an engineered gene can include three or more introns selected from the group of introns having at least 80%, 85%, 90%, 95%, 96%. 97%, 98%, or 99% identity to SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, and SEQ ID NO:43.
  • the engineered gene that includes at least three or at least four heterologous introns can further include a promoter operably linked to the engineered gene.
  • the promoter can optionally be heterologous with respect to the polypeptide encoded by the gene, and can optionally be derived from a different species that the species from which the encoded polypeptide is derived. In some embodiments the promoter may be derived from the same species as at least one, at least two, at least three, at least four or all of the heterologous introns.
  • the nucleic acid molecule can optionally further include a terminator operably linked to the engineered gene that includes at least three or at least four heterologous introns.
  • the terminator can optionally be heterologous with respect to the polypeptide encoded by the gene, and can optionally be derived from a different species that the species from which the encoded polypeptide is derived. In some embodiments the terminator may be derived from the same species as at least one or all of the at least three or at least four heterologous introns. In some embodiments the terminator may be derived from the same species as a promoter operably linked to the engineered gene.
  • Expression cassettes are also provided in the present invention, in which the expression cassettes comprise one or more regulatory elements as described herein to drive the expression of transgenes.
  • These cassettes comprise isolated nucleic acid molecules that include any one of the promoter sequences described herein or any combination thereof, operably linked to a gene of interest, with the gene of interest positioned downstream of the promoter sequence, and optionally with any one of the terminator sequences described herein or any combination thereof operably linked downstream of the transgene.
  • any of the promoters listed in Table 1, or promoters comprising sequences having at least 80% identity to subsequences of any of the promoter of Table 1 as described herein can be used in combination with any terminator listed in Table 1, or terminators comprising sequences having at least 80% identity to subsequences thereof as described herein, in an expression cassette.
  • the promoters of the invention can be used with any heterologous or homologous gene(s).
  • a heterologous or homologous gene according to the invention may encode a protein or polypeptide.
  • the heterologous or homologous gene can encode a functional RNA, such as a tRNA, rRNA, small nucleolar RNA (snoRNA), ribozyme, antisense RNA (asRNA), micro RNA (miRNA), short hairpin RNA (shRNA), small interfering RNA (siRNA), piwi- interacting RNA (piRNA), transactivator RNA (tracrRNA), or a guide RNA of a CRISPR system (gRNA).
  • a functional RNA such as a tRNA, rRNA, small nucleolar RNA (snoRNA), ribozyme, antisense RNA (asRNA), micro RNA (miRNA), short hairpin RNA (shRNA), small interfering RNA (siRNA), piw
  • a guide RNA can optionally be a chimeric guide RNA that includes a tracr sequences. Any known or later-discovered heterologous or homologous gene which encodes a desired product can be operably linked to a promoter sequence of the invention using known methods.
  • Non-limiting examples of known genes suitable for use with the promoters of the invention include genes encoding proteins associated with lipid biosynthesis; lipases; proteins associated with carbohydrate metabolism; transporter polypeptides; proteins conferring resistance to an antibiotic, herbicide, or toxin; reporter proteins (e.g., fluorescent proteins or enzymes that produce detectable products) polypeptides of the Calvin-Benson cycle; polypeptides that participate in photosynthesis (such as but not limited to, photosynthetic reaction center polypeptides, light-harvesting chlorophyll-binding proteins, oxygen-evolving complex polypeptides, cytochromes, ferredoxins, etc.); dehydrogenases, such as NADPH-forming dehydrogenases; transcription factors; proteins involved in cell signaling (e.g., G proteins or kinases); or functional RNAs.
  • genes suitable for use with the promoters of the invention include genes encoding proteins associated with lipid biosynthesis; lipases; proteins associated with carbohydrate metabolism; transporter
  • an expression cassette can comprise a promoter as described herein (for example, a promoter comprising a nucleotide sequence having at least 80% identity to at least 100 contiguous nucleotides of SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, or SEQ ID NO:23) operably linked to a gene encoding a polypeptide, where the polypeptide can be any polypeptide of interest, and in illustrative and nonlimiting examples, can be a protein associated with lipid biosynthesis, an acetyl-CoA carboxylase, a malonyl type 1 fatty acid synthase, a type 2 fatty acid synthase subunit, a beta ketoacyl-ACP synthase, a malonyl-CoA-malonyl-ACP acyltransfer
  • an expression cassette can comprise a promoter as described herein (for example, a promoter comprising a nucleotide sequence having at least 80% identity to at least 100 contiguous nucleotides SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, or SEQ ID NO:23) operably linked to a gene encoding a functional RNA, optionally wherein the functional RNA is an antisense RNA, a small hairpin RNA, a microRNA, an siRNA, an snoRNA, a piRNA, or a ribozyme.
  • a promoter comprising a nucleotide sequence having at least 80% identity to at least 100 contiguous nucleotides SEQ ID NO: l, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO
  • An expression cassette as provided herein can further include a terminator, such as but not limited to a terminator as disclosed herein.
  • an expression cassette can include a promoter and terminator that are both heterologous with respect to the polypeptide or functional RNA-encoding sequence of the expression cassettte.
  • the promoter and terminator of an expression cassette can optionally be derived from the same genus or species, and can further optionally be derived from the same gene.
  • a promoter and terminator of an expression cassette can be derived from Parachlorella genes and may be derived from the same Parachlorella gene
  • An expression cassette as provided herein can optionally include an intronylated gene that includes one or more heterologous introns, as described above, i.e., introns that are not native to the gene from which the protein or functional RNA-encoding sequences are derived.
  • One or more heterologous introns of an engineered gene of an expression cassette can be derived from the same species as the promoter operably linked to the engineered gene.
  • One or more heterologous introns of an engineered gene of an expression cassette can be derived from the same gene as the promoter operably linked to the engineered gene.
  • the expression cassette can further include a terminator operably linked to the engineered gene that is derived from the same species as the one or more heterologous introns and the promoter operably linked to the engineered gene.
  • the terminator can optionally be derived from the same gene as the one or more heterologous introns and the promoter operably linked to the engineered gene.
  • the operably linked promoter and terminator and all of the introns of an engineered gene are derived from the same gene.
  • the promoter, terminator, and introns of an engineered gene are derived from the same gene of an algal species, which is exemplary embodiments can be the RPS4 gene or FBPase gene of an algal species.
  • the present invention also provides vectors that can comprise the regulatory elements and/or expression cassettes described herein.
  • the vectors can further optionally comprise at least one origin of replication ("ORI") sequence for replication in a cell.
  • the vectors may further optionally comprise one or more selectable markers under the control of one or more eukaryotic promoters, one or more selectable markers under the control of one or more prokaryotic promoters, and/or one or more sequences that mediate recombination of an exogenous nucleic acid sequence into the target cell's genome.
  • An ORI is the sequence in a DNA molecule at which replication begins.
  • the ORI serves as a base of assembly for the pre-replication complex.
  • Such replication can proceed unidirectionally or bidirectionally.
  • An expression vector as provided herein can include an ORI for replication of the expression vector in a cloning host, such as E. coli or Saccharomyces, and/or can include an ORI for replication of the expression vector in a target cell, which can be, for example, an algal or chlorophyte cell.
  • the structural biology of ORIs is widely conserved among prokaryotes, eukaryotes, and viruses. Most ORIs possess simple tri-, tetra-, or higher nucleotide repetition patterns. Most are AT-rich and contain inverted repeats. Those skilled in the art will be familiar with the more common ORIs, such as pi 5 A and the pUC ORI.
  • a vector described herein may also carry a selectable marker.
  • a vector that includes an expression cassette may include, as a selectable marker, a gene conferring resistance to a poison, such as an antibiotic, an herbicide, or some other toxin, so that transformants can be selected by exposing the cells to the poison and selecting those cells which survive the encounter.
  • Non-limiting examples of selectable markers include: 1) genes conferring resistance to antibiotics such as amikacin (aphA6), ampicillin (ampR), blasticidin (bis, bsr, bsd), bleomicin or phleomycin (ZEOCINTM) (ble), nourseothricin (natl), chloramphenicol (cat), emetine (RBS14p or cry 1-1), erythromycin (ermE), G418 (GENETIC INTM) (neo), gentamycin (aac3 or aacC4), hygromycin B (aphlV, hph, hpt), kanamycin (nptll), methotrexate (DHFR mtxR), penicillin and other ⁇ -lactams ( ⁇ -lactamases), streptomycin or spectinomycin (aadA, spec/strep), and tetracycline (tetA, tet
  • the selectable marker gene can be operably linked to and/or under the control of a promoter as provided herein.
  • the promoter regulating expression of the selectable marker may be conditional or inducible but is preferably constitutive, and can be, for example, any promoter disclosed herein or another promoter.
  • the selectable marker may be placed under the control of the expression cassette promoter.
  • the selectable marker and the expression cassette may be operably linked with an internal ribosome entry site ("IRES") element between the expression cassette and the selectable marker (Komar & Hatzoglou (2011) Cell Cycle 10:229-240 and Hellen & Sarnow (2001) Genes & Dev. 15: 1593-1612, incorporated by reference in their entireties) or a "2A" sequence (Kim et al. (2011) PLoS One 6(4):el8556, incorporated by reference in its entirety).
  • IRS internal ribosome entry site
  • a vector for transformation of a eukaryotic cell such as but not limited to a eukaryotic microalgal cell or phytoplankter cell, in which the vector includes a selectable marker gene operably linked to a promoter as provided herein, for example, a promoter that includes a nucleotide sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or about 100% identity to at least 100, at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, or at least 800 contiguous nucleotides of SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, or SEQ ID NO:23, or a promoter that comprises SEQ ID NO: 1, SEQ ID NO:
  • the transformation vector can further include one or more additional genes or constructs for transfer into the host cell, such as a gene encoding a polypeptide such as but not limited to any disclosed hereinabove or a construct encoding a functional RNA, where the gene encoding a polypeptide or functional RNA can optionally be operably linked to a promoter as described herein, or can optionally be operably linked to another promoter.
  • additional genes or constructs for transfer into the host cell such as a gene encoding a polypeptide such as but not limited to any disclosed hereinabove or a construct encoding a functional RNA, where the gene encoding a polypeptide or functional RNA can optionally be operably linked to a promoter as described herein, or can optionally be operably linked to another promoter.
  • the vectors as provided herein may comprise a terminator as provided herein.
  • a vector of the present invention may comprise a nucleotide sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or about 100% identity to at least 100, at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, or at least 800 contiguous nucleotides of SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, or SEQ ID NO:24.
  • a gene of interest or a selectable marker gene on a vector of the present invention may be operably linked to a terminator sequence as provided herein.
  • a gene of interest or a selectable marker may be operably linked to SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, or SEQ ID NO:24.
  • a selectable marker operably linked to a promoter such as a promoter described herein can be provided on a separate construct, where both the gene-of-interest construct and the selectable marker construct are used together in transformation protocols. Selected transformants are then analyzed for co-transformation of the construct that includes the gene-of-interest (see, e.g., Kindle (1990) Proc. Nat 'l. Acad. Sci. USA 87: 1228-1232).
  • transformants may be selected by routine methods familiar to those skilled in the art, such as, by way of a non-limiting example, extracting nucleic acid from the putative transformants and screening by PCR.
  • transformants may be screened by detecting expression of a reporter gene, such as but not limited to a chloramphenicol acyltransferase gene (cat) lacZ, uidA, xylE, an alkaline phosphatase gene, an a-amylase gene, an a-galactosidase gene, a ⁇ -lactamase gene, a ⁇ -glucuronidase gene, a horseradish peroxidase gene, a luciferin/luciferase gene, an R-locus gene, a tyrosinase gene, or a gene encoding a fluorescent protein, such as any of the blue, cyan, green, red, yellow, photoconvertible, or photoswitchable fluorescent proteins or any of their variants, including codon-optimized, rapidly folding, monomelic, increased stability, and enhanced fluorescence variants.
  • a reporter gene such as but not limited to a chloramphenicol acyltransfer
  • a reporter gene used in a vector may optionally be regulated by a promoter as provided herein.
  • a transformation vector may include a gene encoding a reporter, such as, for example, a fluorescent protein, operably linked to a promoter as provided herein.
  • the vector is designed for integration of one or more genes (such as the expression cassette) into the host genome.
  • the expression vectors may include Agrobacterium flanking sequences designed for integrating transgenes into the genome of a target plant cell.
  • vectors can be targeted for integration into a plant or algal chromosome by including flanking sequences that enable homologous recombination into the chromosome or targeted for integration into endogenous host plasmids by including flanking sequences that enable homologous recombination into the endogenous plasmids.
  • the expression vectors can be designed to have regions of sequences flanking the transgene that are homologous to chloroplast sequences to promote homologous recombination and integration of the sequence of interest.
  • a transformation vector can include sequences for site-specific recombination such as but not limited to lox sites on which the ere recombinase acts.
  • the expression vector will contain one or more enhancer elements. Enhancers are short regions of DNA that can bind trans-acting factors to enhance transcription levels. Although enhancers usually act in cis, an enhancer need not be particularly close to its target gene, and may sometimes not be located on the same chromosome. Enhancers can sometimes be located in introns.
  • a gene or genes encoding enzymes that participate in the synthesis of a fatty acid product is cloned into the vector as an expression cassette that includes a promoter as disclosed herein.
  • the expression cassette may optionally include a transit peptide-encoding sequence for directing the expressed enzyme to the chloroplast or endoplasmic reticulum of transformed eukaryotic cells, an intron sequence, a sequence having a poly-adenylation signal, etc.
  • a vector comprising an expression cassette as described herein, wherein the vector further comprises one or more of: a selectable marker gene, an origin of replication, and one or more sequences for promoting integration of the expression cassette into the host genome.
  • a vector is provided comprising an isolated or recombinant nucleic acid molecule as described herein, wherein the isolated nucleic acid molecule is operably linked to a nucleic acid sequence encoding a selectable marker or a reporter protein, such as, for example, any described herein.
  • the vector further comprises one or more of: an origin of replication, one or more sequences for promoting integration of the expression cassette into the host genome, a sequence as reported herein that comprises a terminator, or an additional gene, wherein the additional gene encodes an antisense RNA, a microRNA, an shRNA, a ribozyme, structural protein, an enzyme, a transcription factor, or a transporter.
  • the present invention also provides transformation methods in which a eukaryotic cell is transformed with an expression vector as described herein.
  • the methods comprise introducing an expression vector as provided herein that includes at least one promoter as provided herein and then selecting for a transformant.
  • the expression vector may be introduced by many methods familiar to those skilled in the art including, as non-limiting examples: natural DNA uptake (Chung et al. (1998) FEMS Microbiol. Lett. 164:353-61); conjugation (Wolk et al. (1984) Proc. Nat 'l. Acad. Sci. USA 81, 1561-65); transduction; glass bead transformation (Kindle et al. (1989) J. Cell Biol.
  • Agrobacterium- mediated transformation can also be performed on algal cells, for example after removing or wounding the algal cell wall ⁇ e.g., WO 2000/62601).
  • the eukaryotic cell transformed can be, for example, a fungal, heterokont, chlorophyte, algal, or plant cell.
  • the eukaryotic cell transformed using an expression vector as provided herein can be an algal cell, such as a species of Achnanthes, Amphiprora, Amphora, Ankistrodesmus, Asteromonas, Boekelovia, Bolidomonas, Borodinella, Botrydium, Botryococcus, Bracteococcus, Chaetoceros, Carteria, Chlamydomonas, Chlorococcum, Chlorogonium, Chlorella, Chroomonas, Chrysosphaera, Cricosphaera, Crypthecodinium, Cryptomonas, Cyclotella, Dunaliella, Ellipsoidon, Emiliania, Eremosphaera, Ernodesmius, Euglena, Eustigmatos, Franceia, Fra
  • the eukaryotic cell transformed using the methods provided herein can optionally be a species of Parachlorella, such as non-limiting examples: Parachlorella kessieri, P. hussii, P. beijerinckii, P. sp. CCAP 206/1, or P. sp. pgu003.
  • Parachlorella kessieri P. hussii, P. beijerinckii, P. sp. CCAP 206/1, or P. sp. pgu003.
  • the eukaryotic cell can be an alga of the Chlorophyceae, Trebouxiophyceae, or Prasinphyceae class.
  • the herein disclosed electroporation method is a method of transforming a species of the Trebouxiophyceae class, such as, for example, a species of a genus such as Actinastrum, Amphikorikos, Asterochloris, Auxenochlorella, Botryococcus, Chlorella, Choricystis, Coccomyxa, Coenocystis, Closteriopsis, Diacanthos, Dicloster, Dictyosphaerium, Dictoyochloropsis, Didymogenes, Diplosphaera, Eremosphaera, Franceia, Fusochloruis, Gloeotila, Helicosporidium, Heveochlorella, Koliella, Koriellopsis, Lagerheimia, Lep
  • the herein disclosed method can be used to transform a species belonging to the genus of Botryococcus, Chlorella, Eremosphaera, Franceia, Micractinium, Nannochloris, Oocystis, Parachlorella, Picochlorum, Prototheca, or Pseudochlorella.
  • the transformed cell can be a species belonging to the genus of Auxenochlorella, Chlorella, Heveochlorella, Marinichlorella, Parachlorella, Pseudochlorella or Tetrachlorella.
  • the cell transformed using the methods provided herein can optionally be a species of Parachlorella, such as non-limiting examples: Parachlorella kessieri, P. hussii, P. beijerinckii, P. sp. CCAP 206/1, or P. sp. pgu003.
  • a Parachlorella cell is transformed by electroporation or particle bombardment.
  • the expression vector used to transform the host cell may encode, for a selectable marker, a ble gene conferring bleomycin resistance, a bsd gene conferring blasticidin resistance, a green fluorescent protein gene, a polypeptide, or a functional RNA.
  • a Parachlorella cell is transformed by a biolistic method.
  • the transformation may be achieved by use of a gene gun.
  • the gene gun may employ a rupture disc of 1100 rps, of 1350 rps, of 1550 rps, of 1800 rps, or of 2000 rps.
  • a biolistic transformation of the present invention may involve any amount of DNA coated particles per bombardment; for example, a biolistic transformation method of the present invention may employ 1 ⁇ g, 2 ⁇ g, 3 ⁇ g, 4 ⁇ g, 5 ⁇ g, 6 ⁇ g, 7 ⁇ g, 8 ⁇ g, 9 ⁇ g, 10 ⁇ g, 20 ⁇ g, 30 ⁇ g, 40 ⁇ ⁇ , 50 ⁇ ⁇ , 100 ⁇ ⁇ , 150 ⁇ ⁇ , 200 ⁇ ⁇ , 300 ⁇ ⁇ , 400 ⁇ ⁇ , 500 ⁇ ⁇ , 1 mg, 2 mg, or 3 mg of DNA coated particles per bombardment. Additionally or alternatively, the DNA coated particles may be deployed using at least about 300 psi.
  • the gene gun can be fired using at least 300 psi, at least 350 psi, at least 400 psi, at least 450 psi, at least 500 psi, at least 550 psi, at least 600 psi, at least 650 psi, or at least 700 psi.
  • the distance between a microcarrier and a plate in a biolistic transformation method of the present invention may vary according to the construct being transformed into a cell and the cells; for example the distance may be approximately 3 cm, approximately 4 cm, approximately 5 cm, approximately 6 cm, approximately 7 cm, approximately 8 cm, or even approximately 9 cm.
  • a microcarrier used in a biolistic method of the present invention may be in either the top position or bottom position, depending on the construct being transformed into a cell and the cells.
  • DNA used in a biolistic method of the present invention may be linearized by restriction enzyme digest or may be circular supercoiled plasmid DNA, depending on the construct being transformed into a cell and the cells.
  • Cells grown for use in a biolistic method of the present invention may be grown in a variety of culture conditions; for example, cells destined for bombardment in a biolistic method of the present invention may be grown in PM024 liquid media or on PM024 agar plates for 2 days prior to bombardment.
  • Cells used in a biolistic method of the present invention may be in different phases of a 14: 10 diel cycle; for example, cells may be in either a light phase or dark phase, depending on the construct being transformed into a cell and the cells.
  • a variety of antibiotic concentrations may be used to select transformants after transformation by a biolistic method of the present invention.
  • One skilled in the art will recognize that these parameters are all subject to routine optimization involving only limited experimentation.
  • Transformation of WT-1185 was also achieved by electroporation using a procedure as describe herein.
  • a lOOmL seed culture inoculated to lxlO A 6cells/mL six days before transformation was used to inoculate a 1L culture to lxlO A 6cells/mL two days before transformation.
  • cells were pelleted by centrifugation at 5000xg for 20 minutes, washed three times with O. lum filtered 385mM sorbitol, and resuspended to 5xlO A 9 cells/mL in 385mM sorbitol.
  • Electroporation of lOOuL concentrated cells was performed in 0.2cm cuvettes in a BioRad Gene Pulser XcellTM under varied conditions. Immediately after electroporating pre-chilled cells and cuvettes, lmL cold sorbitol was added and used to transfer cells into lOmL PM074. After overnight recovery, cells were concentrated and spread onto 13cm-diameter PM074 media containing zeocin at 250mg/L and grown under the conditions listed in the biolistics section.
  • a Parachlorella cell is transformed by an electroporation method.
  • the transformation may be achieved by use of an electroporator.
  • the electroporator can be set to output approximately 0.5 kilovolts (kV), 0.6 kV, 0.7 kV, 0.8 kV, 0.9 kV, 1.0 kV, 1.1 kV, 1.2 kV, 1.3 kV, 1.4 kV, 1.5 kV, or 1.6 kV.
  • successful electroporation can be achieved with at least 0.5 kilovolts (kV), at least 0.6 kV, at least 0.7 kV, at least 0.8 kV, at least 0.9 kV, at least 1.0 kV, at least 1.1 kV, at least 1.2 kV, at least 1.3 kV, at least 1.4 kV, at least 1.5 kV, at least 1.6 kV, or with between 0.5 kV and 1.7 kV, or between 0.6 kV and 1.6 kV, or between 0.7 kV and 1.5 kV, or between 0.9 kV and 1.5 kV, or between 1.0 kV and 1.2 kV.
  • kV kilovolts
  • the electroporator can be set to output a field strength of approximately 250 kilovolts per meter (kV/m), 300 kV/m, 350 kV/m, 400 kV/m, 450 kV/m, 500 kV/m, 550 kV/m, 600 kV/m, 650 kV/m, 700 kV/m, 750 kV/m, or 800 kV/m, or for example, with at least 250 kV/m, at least 300 kV/m, at least 350 kV/m, at least 400 kV/m, at least 450 kV/m, at least 500 kV/m, at least 550 kV/m, at least 600 kV/m, at least 650 kV/m, at least 700 kV/m, at least 750 kV/m, or at least 800 kV/m.
  • successful electroporation can be achieved with between 250 kV/m and 850 kV/m, or between 300 kV/m and 800 kV/m, or between 700 kV/m and 750 kV/m, or between 450 kV/m and 750 kV/m, or between 500 kV/m and 600 kV/m. Additionally or alternatively, successful electroporation can be achieved with resistances of approximately 100 Ohms, 200 Ohms, 300 Ohms, or 400 Ohms.
  • successful electroporation can be achieved with a resistance setting of at least 100 Ohms, at least 200 Ohms, or at least 300 Ohm, for example, with a resistance setting between 100 Ohms and 400 Ohms, between 200 Ohms and 400 Ohms, between 200 Ohms and 300 Ohms, or between 300 Ohms and 400 Ohms.
  • successful electroporation can be achieved with a capacitance of approximately 10 ⁇ , 25 ⁇ , 50 ⁇ , at least 10 ⁇ , at least 25 ⁇ , or at least 50 ⁇ .
  • the capacitance can be between 10 ⁇ and 50 ⁇ , or between 25 ⁇ and 50 ⁇ .
  • Successful electroporation can be achieved using approximately 0.5 ⁇ g, ⁇ ⁇ , 2 ⁇ g, 3 ⁇ g, 4 ⁇ g, 5 ⁇ g, 6 ⁇ g, 7 ⁇ g, or 8 ⁇ g of DNA.
  • successful electroporation can be achieved with between 0.5 ⁇ g and 8 ⁇ g of DNA, between 1 ⁇ g and 8 ⁇ g of DNA, or between 2 ⁇ g and 8 ⁇ of DNA.
  • the optimal electroporation conditions are a field strength between 500 kV/m and 600 kV/m, and a resistance between 200 Ohms and 300 Ohms, and a capacitance between 25 ⁇ and 50 ⁇ .
  • the herein disclosed electroporation method is a method of transforming a species of the Trebouxiophyceae class.
  • the herein disclosed method can be used to transform a species belonging to the genus of Botryococcus, Chlorella, Eremosphaera, Franceia, Micractinium, Nannochloris, Oocystis, Parachlorella, Picochlorum, Prototheca, or Pseudochlorella.
  • the transformed cell can be a species belonging to the genus of Chlorella, Parachlorella, or Pseudochlorella.
  • the cell transformed using the methods provided herein can optionally be a species of Parachlorella, such as non- limiting examples: Parachlorella kessieri, P. hussii, P. beijerinckii, P. sp. CCAP 206/1, or P. sp. pgu003.
  • Parachlorella kessieri P. hussii, P. beijerinckii, P. sp. CCAP 206/1, or P. sp. pgu003.
  • Eukaryotic host cells such as any of the cells disclosed hereinabove transformed with the expression vectors are also provided herein.
  • Transformed algal cell cultures can be diluted, plated on agar, and allowed to grow until isolated colonies can be selected for further propagation as clonal strains.
  • a eukaryotic cell comprising an isolated or recombinant nucleic acid molecule as described herein or an expression cassette as described herein, or a vector as described herein.
  • Algae can be cultured phototrophically, in the absence of a fixed carbon source, or mixotrophically, where the cultures are supplied with light for at least part of the day, and also supplied with a reduced carbon source, such as a sugar (e.g., glucose, fructose, galactose, mannose, rhamnose, arabinose, xylose, lactose, sucrose, maltose), an organic acid (e.g., acetate, citrate, succinate), or glycerol.
  • a sugar e.g., glucose, fructose, galactose, mannose, rhamnose, arabinose, xylose, lactose, sucrose, maltose
  • an organic acid e.g., acetate, citrate, succinate
  • glycerol e.g., glycerol
  • a photosynthetic organism can be cultured mixotrophically, in which the organism is grown in the presence of light for at least a part of the day, and also provided with one or more sources of reduced carbon.
  • the photosynthetic organism can be grown mixotrophically for a period of time, followed by a period of phototrophic growth, or vice versa.
  • Media for phototrophic or mixotrophic growth of algae are known in the art, and media can be optimized to enhance growth or production of fatty acid products for a particular species.
  • Artificial light sources can be used as the sole light source or to enhance or extend natural light.
  • Growth of algae can be in open areas, such as, for example, ponds, canals, channels, raceways, or tanks, or can be in bioreactors.
  • Bioreactors are preferred for mixotrophic growth, and can also be used for phototrophic growth.
  • the bioreactors can be of any sizes and form, and can include inlets for providing nutrients, additives, or gases, such as but not limited to air or C0 2 .
  • a bioreactor preferably also has an outlet for sampling of the culture.
  • a bioreactor can be configured such that the algal culture is mixed during the growth period, for example, by stirring, rocking, shaking, inverting, bubbling of gases through the culture, etc.
  • Outdoor ponds, raceways, tanks, canals, etc. can also be designed for mixing of cultures through, for example, paddles, pumps, hoses or jets for circulation of the culture media, or tubes, hoses or inlets for supplying air or CO 2 to the culture.
  • the methods include culturing a recombinant algal mutant in a suitable medium to provide an algal culture and recovering biomass or at least one product from the culture.
  • the algal culture is preferably a photoautotrophic culture, and the culture medium preferably does not include a substantial amount of reduced carbon, that is, the culture does not include reduced carbon in a form or at a level that can be used by the algae for growth.
  • the algae may be cultured in any suitable vessel, including flasks or bioreactors, where the algae may be exposed to artificial or natural light.
  • the culture comprising recombinant mutant algae may be cultured on a light/dark cycle that may be, for example, a natural or programmed light/dark cycle, and as illustrative examples, may provide twelve hours of light to twelve hours of darkness, fourteen hours of light to ten hours of darkness, sixteen hours of light to eight hours of darkness, etc.
  • Culturing refers to the intentional fostering of growth (e.g., increases in cell size, cellular contents, and/or cellular activity) and/or propagation (e.g., increases in cell numbers via mitosis) of one or more cells by use of selected and/or controlled conditions.
  • growth e.g., increases in cell size, cellular contents, and/or cellular activity
  • propagation e.g., increases in cell numbers via mitosis
  • proliferation may be termed proliferation.
  • the recombinant mutants provided herein can achieve higher cell density of the culture over time, for example, over a period of a week or more, with respect to a culture wild type algal cells of the same strain.
  • a recombinant mutant may be cultured for at least five, at least six, at least seven at least eight, at least nine, at least ten, at least eleven at least twelve, at least thirteen, at least fourteen, or at least fifteen days, or at least one, two three, four, five, six, seven, eight, nine, or ten weeks, or longer.
  • Non-limiting examples of selected and/or controlled conditions that can be used for culturing the recombinant alga can include the use of a defined medium (with known characteristics such as pH, ionic strength, and/or carbon source), specified temperature, oxygen tension, carbon dioxide levels, growth in a bioreactor (e.g. a photobioreactor), or the like, or combinations thereof.
  • a defined medium with known characteristics such as pH, ionic strength, and/or carbon source
  • specified temperature e.g. a biobioreactor
  • the alga or host cell can be grown mixotrophically, using both light and a reduced carbon source.
  • the alga or host cell can be cultured phototrophically. When growing phototrophically, the algal strain can advantageously use light as an energy source.
  • An inorganic carbon source such as C02 or bicarbonate can be used for synthesis of biomolecules by the alga.
  • “inorganic carbon” can be in the form of C02 (carbon dioxide), carbonic acid, bicarbonate salts, carbonate salts, hydrogen carbonate salts, or the like, or combinations thereof, which cannot be further oxidized for sustainable energy nor used as a source of reducing power by organisms.
  • Algae grown photoautotrophically can be grown on a culture medium in which inorganic carbon is substantially the sole source of carbon.
  • any organic (reduced) carbon molecule or organic carbon compound that may be provided in the culture medium either cannot be taken up and/or metabolized by the cell for energy and/or is not present in an amount sufficient to provide sustainable energy for the growth and proliferation of the cell culture.
  • Cells grown photoautrophically can be grown under constant light or a diel cycle, for example a diel cycle in which the light period can be, for example, at least four hours, about five hours, about six hours, about seven hours, about eight hours, at least eight hours, about nine hours, about ten hours, about eleven hours, about twelve hours, about thirteen and a half hours, or up to about sixteen hours per day, for example, between about twelve hours and about fourteen hours, or between about fourteen hours and about sixteen hours.
  • the light period can be, for example, at least four hours, about five hours, about six hours, about seven hours, about eight hours, at least eight hours, about nine hours, about ten hours, about eleven hours, about twelve hours, about thirteen and a half hours, or up to about sixteen hours per day, for example, between about twelve hours and about fourteen hours, or between about fourteen hours and about sixteen hours.
  • Algae and host cells that can be useful in accordance with the methods of the present disclosure can be found in various locations and environments throughout the world.
  • the particular growth medium for optimal propagation and generation of lipid and/or other products can vary and may be optimized to promote growth, propagation, or production of a product such as a lipid, protein, pigment, antioxidant, etc.
  • certain strains of algae may be unable to grow in a particular growth medium because of the presence of some inhibitory component or the absence of some essential nutritional requirement of the particular strain of alga or host cell.
  • Solid and liquid growth media are generally available from a wide variety of sources, as are instructions for the preparation of particular media suitable for a wide variety of strains of algas.
  • various fresh water and salt water media can include those described in Barsanti (2005) Algae: Anatomy, Biochemistry & Biotechnology, CRC Press for media and methods for culturing algae.
  • Algal media recipes can also be found at the websites of various algal culture collections, including, as nonlimiting examples, the UTEX Culture Collection of Algae (www.sbs.utexas.edu/utex/media.aspx); Culture Collection of Algae and Protozoa (www.ccap.ac.uk); and Katedra Botaniky (botany.natur.cuni.cz/algo/caup-media.html).
  • the culture methods can optionally include inducing expression of one or more genes for the production of a product, such a but not limited to a protein that participates in the production of a lipid, one or more proteins, antioxidants, or pigments, and/or regulating a metabolic pathway in the alga.
  • Inducing expression can include adding a nutrient or compound to the culture, removing one or more components from the culture medium, increasing or decreasing light and/or temperature, and/or other manipulations that promote expression of the gene of interest. Such manipulations can largely depend on the nature of the (heterologous) promoter operably linked to the gene of interest.
  • the recombinant algae can be cultured in a photobioreactor equipped with an artificial light source, and/or having one or more walls that is transparent enough to light, including sunlight, to enable, facilitate, and/or maintain acceptable alga growth and proliferation.
  • photosynthetic algae or host cells can additionally or alternately be cultured in shake flasks, test tubes, vials, microtiter dishes, petri dishes, or the like, or combinations thereof.
  • recombinant photosynthetic alga or host cells may be grown in ponds, canals, sea-based growth containers, trenches, raceways, channels, or the like, or combinations thereof.
  • the temperature may be unregulated, or various heating or cooling method or devices may be employed.
  • a source of inorganic carbon such as, but not limited to, C02, bicarbonate, carbonate salts, and the like
  • air, C02-enriched air, flue gas, or the like, or combinations thereof can be supplied to the culture.
  • the recombinant algal can include one or more non-native genes encoding a polypeptide for the production of a product, such as, but limited to, a lipid, a colorant or pigment, an antioxidant, a vitamin, a nucleotide, an nucleic acid, an amino acid, a hormone, a cytokine, a peptide, a protein, a polymer, or combinations thereof.
  • a product such as, but limited to, a lipid, a colorant or pigment, an antioxidant, a vitamin, a nucleotide, an nucleic acid, an amino acid, a hormone, a cytokine, a peptide, a protein, a polymer, or combinations thereof.
  • the encoded polypeptide can be an enzyme, metabolic regulator, cofactor, carrier protein, transporter, or combinations thereof.
  • the methods include culturing a recombinant mutant alga that includes at least one non- native gene encoding a polypeptide that participates in the production of a product, to produce biomass or at least one algal product.
  • Products such as lipids and proteins can be recovered from culture by recovery means known to those of ordinary skill in the art, such as by whole culture extraction, for example, using organic solvents.
  • recovery of fatty acid products can be enhanced by homogenization of the cells.
  • lipids such as fatty acids, fatty acid derivatives, and/or triglycerides can be isolated from algae by extraction of the algae with a solvent at elevated temperature and/or pressure, as described in the co-pending, commonly- assigned U.S. patent application publication 2013-0225846A1 entitled "Solvent Extraction of Products from Algae", filed on February 29, 2012, which is incorporated herein by reference in its entirety.
  • Biomass can be harvested, for example, by centrifugation or filtering.
  • the biomass may be dried and/or frozen. Further products may be isolated from biomass, such as, for example, lipids or one or more proteins.
  • an algal biomass comprising biomass of a recombinant algal mutant, such as any disclosed herein, for example, an algal mutant that includes a heterologous gene operably linked to a promoter having at least 80% identity to any of SEQ ID NO: l, SEQ ID NO: 4, SEQ ID NO: 8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, or SEQ ID NO:23; and additionally or alternatively wherein the heterologous gene is followed by a terminator having at least 80% identity to SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, or SEQ ID NO:24.
  • a heterologous gene operably linked to a promoter having at least 80% identity to any of SEQ ID NO: l, SEQ ID NO: 4, SEQ ID NO: 8,
  • Embodiment 1 is an isolated DNA molecule comprising a nucleotide sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to at least 100, at least 200, at least 300, at least 400, or at least 500 contiguous nucleotides of SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21 or SEQ ID NO:23; for example wherein the nucleotide sequence has at least 80%, at least 85%, at least 90%, at least 95%), at least 96%, at least 97%, at least 98%, or at least 99% identity to a sequence selected from the group consisting of: a sequence having at least 100, at least 200, at least 300, at least 400, at least 500, or between 500 and 530 con
  • sequence having at least 100, at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, at least 800, at least 900, or between 900 and 1000 contiguous nucleotides from the 3' end of SEQ ID NO :4;
  • sequence having at least 100, at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, at least 800, at least 900, at least 1000, or between 1000 and 1044 contiguous nucleotides from the 3' end of SEQ ID NO: 11;
  • Embodiment 2 is an isolated or recombinant nucleic acid molecule according to Embodiment 1, wherein at least one of the following are satisfied:
  • the nucleic acid molecule comprises a promoter operable in a eukaryotic cell, preferably an algal cell or chlorophyte cell;
  • nucleic acid molecule is an expression cassette; or
  • the nucleic acid molecule is a vector.
  • Embodiment 3 A promoter comprising a nucleic acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to a sequence selected from the group consisting of a sequence having at least 100, at least 200, at least 300, at least 400, at least 500, or between 500 and 530 contiguous nucleotides from the 3' end of SEQ ID NO: l; a nucleotide sequence having at least 80%>, at least 85%>, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to at least 100, at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, at least 800, at least 900, or between 900 and 1000 contiguous nucleotides from the 3' end of SEQ ID NO:4; a nucleotide sequence having at least 80%>
  • Embodiment 4 A nucleic acid molecule comprising a promoter according to Embodiment 3 operably linked to a heterologous nucleotide sequence, wherein any of the following are satisfied:
  • the heterologous nucleotide sequence encodes a polypeptide, wherein the polypeptide is optionally any of: a protein associated with lipid biosynthesis, a polypeptide having lipolytic activity, a polypeptide that participates in photosynthesis, a protein associated with carbon fixation, a transporter protein, a dehydrogenase, a transcription factor, a transcriptional activator, a metabolic enzyme, a protein involved in protein synthesis, a protein involved in cell signaling, a kinase, or a G protein;
  • the heterologous nucleotide sequence encodes a functional RNA, wherein the functional RNA is optionally any of: an antisense RNA, a small hairpin RNA, a microRNA, an antisense RNA, a siRNA, a piRNA, a gRNA, or a ribozyme;
  • the heterologous nucleotide sequence is further operably linked to a terminator, optionally wherein the terminator comprises a nucleotide sequence having at least 80% at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or about 100% identity to at least 100 contiguous nucleotides of a sequence selected from the group consisting of SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, and SEQ ID NO:24;
  • the heterologous nucleotide sequence encodes a polypeptide and includes at least one intron that is heterologous with respect to the polypeptide-encoding sequence, optionally wherein at least one intron comprises a nucleotide sequence having at least 80% at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or about 100% identity to at least 100 contiguous nucleotides of a sequence selected from the group consisting of SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, and SEQ ID NO:43.
  • Embodiment 5 An isolated DNA molecule comprising a nucleotide sequence having at least 80% at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%), or about 100% identity to at least 100 contiguous nucleotides of a nucleotide sequence selected from the group consisting of SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, and SEQ ID NO:24, optionally wherein the nucleotide sequence is operably linked to a heterologous sequence.
  • Embodiment 6 An isolated or recombinant nucleic acid molecule comprising a gene encoding a polypeptide, wherein the gene comprises at least one, at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, or at least nine introns having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or about 100% identity to SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, or SEQ ID NO:43; optionally wherein:
  • the gene is operably linked to a promoter, optionally a promoter according to Embodiment 3; and/or the gene is operably linked to a terminator, optionally a terminator comprising a nucleotide sequence having at least 80% at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or about 100% identity to at least 100 contiguous nucleotides of a nucleotide sequence selected from the group consisting of SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, and SEQ ID NO:24;
  • the at least one, at least two at least three, at least four, at least five, at least six, at least seven, at least eight, or at least nine introns are derived from same species as the promoter, optionally wherein the at least one, at least two at least three, at least four, at least five, at least six, at least seven, at least eight, or at least nine introns are derived from the same species as the promoter and terminator;
  • the at least one, at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, or at least nine introns are derived from the same gene as the promoter, optionally wherein the at least one, at least two at least three, at least four, at least five, at least six, at least seven, at least eight, or at least nine introns are derived from the same gene as the promoter and terminator.
  • Embodiment 7 An expression cassette comprising: a gene encoding a polypeptide operably linked to a heterologous promoter according to Embodiment 3;
  • the gene includes at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, or at least nine introns are derived from the same species as the promoter, optionally wherein the at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, or at least nine introns are derived from the same gene;
  • a terminator comprising a nucleotide sequence having at least 80% at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or about 100% identity to at least 100 contiguous nucleotides of a nucleotide sequence selected from the group consisting of SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, and SEQ ID NO:24 operably linked to the gene.
  • Embodiment 8 An expression cassette according to Embodiment 7, wherein: the gene encodes a selectable marker protein, a detectable marker protein, a protein associated with lipid biosynthesis, an acetyl-CoA carboxylase, a malonyl type 1 fatty acid synthase, a Type 2 fatty acid synthase subunit, a beta ketoacyl-ACP synthase, a malonyl-CoA-malonyl-ACP acyltransferase, an acyl-ACP thioesterase, an acyl-CoA thioesterase, a 4-hydroxybenzoyl thioesterase, an alcohol forming acyl reductase, a wax synthase, an aldehyde decarbonylase, a fatty acid decarboxylase, a lipase, a glyceraldehyde 3 phosphate dehydrogenase, an acyl-CoA synthet
  • Embodiment 9 A vector comprising an expression cassette according to any of Embodiments 4, 7, or 8, wherein the expression vector further comprises one or more of: an origin of replication; one or more sequences for promoting integration of the expression cassette into the host genome, a selectable marker gene; and a reporter gene; optionally wherein the selectable marker gene is selected from the group consisting of a gene conferring resistance to an antibiotic, a gene conferring resistance to an herbicide, a gene encoding acetyl CoA carboxylase (ACCase), a gene encoding acetohydroxy acid synthase (ahas), a gene encoding acetolactate synthase, a gene encoding aminoglycoside phosphotransferase, a gene encoding anthranilate synthase, a gene encoding bromoxynil nitrilase, a gene encoding cytochrome P450-NADH- cytochrome P450 oxidoreductase,
  • Embodiment 10 A method for transforming a eukaryotic cell comprising: 1) introducing a vector or expression cassette according to any of Embodiments 7-9 into the eukaryotic cell; and 2) selecting for a transformed eukaryotic cell, preferably wherein the eukaryotic host cell is a fungal, algal, heterokont, chlorophyte, or plant host cell; optionally wherein the eukaryotic host cell is an algal cell such as a species of Bacillariophyceae (diatoms), Bolidomonas, Chlorophyceae (green algae), Chrysophyceae (golden algae), Cyanophyceae (cyanobacteria), Eustigmatophyceae (pi co-plankton), Glaucocystophytes, Pelagophyceae, Bolidophyceae, Prasinophyceae (pi co-plankton), Raphidophyceae, Rhodophyceae (red
  • Embodiment 11 A method according to Embodiment 10, wherein the vector is introduced by a biolistic procedure, optionally wherein at least about 300 psi of pressure is used to impel biolistic microcarriers coated with vector DNA into the eukaryotic cell.
  • Embodiment 12 A method according to Embodiment 10, wherein the vector is introduced by an electroporation procedure, optionally wherein the voltage is at least 0.5 kV, the capacitance is at least 10 ⁇ , and the resistance is at least 100 Ohms, and further optionally wherein the voltage is between 1.0-1.2k V (5000-6000V/cm), the resistance is between 200-300 ohms, and the capacitance is between 25-50uF.
  • Embodiment 13 A method for co-transforming a eukaryotic cell comprising: 1) introducing an expression cassette according to Embodiment 7 or Embodiment 8 and a nucleic acid sequence encoding a selectable marker into the eukaryotic cell; and 2) selecting for the presence of the selectable marker in a transformed eukaryotic cell to provide a eukaryotic cell transformed with the expression cassette, optionally wherein the eukaryotic cell is an algal cell, optionally wherein the selectable marker gene is operably linked to a promoter according to Embodiment 3, optionally wherein the selectable marker gene is selected from the group consisting of a gene conferring resistance to an antibiotic, a gene conferring resistance to an herbicide, a gene encoding acetyl CoA carboxylase (ACCase), a gene encoding acetohydroxy acid synthase (ahas), a gene encoding acetolactate synthase, a gene encoding aminoglycoside
  • Embodiment 14 A method according to Embodiment 13, wherein the selectable marker gene is a gene conferring resistance to an antibiotic, optionally wherein the antibiotic is blasticidin, bleomycin (ZeocinTM), or nourseothricin.
  • the antibiotic is blasticidin, bleomycin (ZeocinTM), or nourseothricin.
  • Embodiment 15 A eukaryotic host cell comprising an isolated or recombinant nucleic acid molecule, expression cassette, or vector of any of Embodiments 1-9, wherein the eukaryotic host cell is optionally a microalgal cell, optionally a species of Bacillariophyceae, Bolidomonas, Chlorophyceae, Chrysophyceae, Eustigmatophyceae, Glaucocystophytes, Pelagophyceae, Bolidophyceae, Prasinophyceae, Raphidophyceae, Rhodophyceae (red algae), Synurophyceae or Xanthophyceae (yellow-green algae), or of a genus selected from the group consisting of: Achnanthes, Amphiprora, Amphora, Ankistrodesmus, Asteromonas, Boekelovia, Bolidomonas, Borodinella, Botrydium, Botryococc
  • Parachlorella is a genus of green alga in the Chlorophyte phylum. Along with genera such as but not limited to Botryococcus, Eremosphaera, Franceia, Micractinium, Nannochloris, Oocystis, Picochlorum, Prototheca, Stichococcus, and Viridiella, Parachlorella and related genera Auxenochlorella, Chlorella, Heveochlorella, Marinichlorella, Pseudochlorella, and Tetrachlorella, are members of the Trebouxiophyceae class of the Chlorophyta phylum. A strain of Parachlorella was isolated from the environment and given the designation WT-1185.
  • PM074 is a nitrogen replete ("nitrate-only") medium that is 10X F/2 made by adding 1.3 ml PROLINE® F/2 Algae Feed Part A (Aquatic Eco-Systems) and 1.3 ml PROLINE® F/2 Algae Feed Part B (Aquatic Eco-Systems) to a final volume of 1 liter of a solution of Instant Ocean salts (35 g/L) (Aquatic Eco Systems, Apopka, FL).
  • Proline A and Proline B together include 8.8 mM NaN0 3 , 0.361mM NaH 2 P0 4 .H 2 0, 10X F/2 Trace metals, and 10X F/2 Vitamins (Guillard (1975) Culture of phytoplankton for feeding marine invertebrates, in "Culture of Marine Invertebrate Animals.” (eds: Smith W.L. and Chanley M.H.) Plenum Press, New York, USA. pp 26-60).
  • PM126 is a nitrogen replete ("nitrate-only") medium that is 10X F/2 made by adding 1.3 ml PROLINE® F/2 Algae Feed Part A (Aquatic Eco-Systems) and 1.3 ml PROLINE® F/2 Algae Feed Part B (Aquatic Eco-Systems) to a final volume of 1 liter of a solution of Instant Ocean salts (7 g/L) (Aquatic Eco Systems, Apopka, FL).
  • Proline A and Proline B together include 8.8 mM NaN0 3 , 0.361mM NaH 2 P0 4 .H 2 0, 10X F/2 Trace metals, and 10X F/2 Vitamins (Guillard (1975) Culture of phytoplankton for feeding marine invertebrates, in "Culture of Marine Invertebrate Animals.” (eds: Smith W.L. and Chanley M.H.) Plenum Press, New York, USA. pp 26-60).
  • RPS4 P promoter SEQ ID NO: l
  • BleR bleomycin resistance gene
  • a terminator sourced from Nannochloropsis wild type strain WT-3730 (3730 T4; SEQ ID NO:3).
  • a second green fluorescent protein (GFP) reporter cassette comprised the TurboGFP (Evrogen, Moscow, Russia) coding sequence codon optimized for Parachlorella (SEQ ID NO:5) flanked by the PsbR P promoter (SEQ ID NO:4) at the 5' end of the gene and the Nannochloropsis gaditana wild type (WT-3730) strain T5 terminator also sourced from WT-3730 (SEQ ID NO:6) at the 3' end of the gene; this expression construct was cloned into a minimal E. coli pUC backbone vector.
  • the GFP cassette was oriented on the DNA strand opposite to the strand encoding the BleR (zeocin) selection cassette.
  • a schematic of plasmid pSG6450 is shown in Figure 1. For transformation into Parachlorella cells by particle bombardment, pSG6450 was linearized with Ndel.
  • Transformation of Parachlorella WT-1185 was accomplished using the Helios® Gene Gun System (Bio-Rad, Hercules, CA, USA). The protocol was developed using the manufacturer's instruction manual (available online at bio-rad.com/en-us/product/helios-gene- gun-system). As detailed below, DNA for transformation was precipitated onto gold particles, the gold particles were adhered to the inside of lengths of tubing, and a burst of helium gas fired through the tubing by the Gene Gun projected the DNA-coated gold particles into WT-1185 cells adhered on solid non-selective media. The following day, cells were moved onto selective media for growth of transformed colonies.
  • Preparation of DNA-coated gold bullets was accomplished by either one of two methods: a 40-bullet method and a 10-bullet method.
  • the 40-bullet method which required approximately 40 ⁇ g of DNA, used the BioRad Tubing Prep Station and was performed according to the manufacturer's protocol in the Helios® Gene Gun System manual and as described in U.S. Patent No. 8,883,993, incorporated herein by reference.
  • the 10-bullet prep method was performed as described in co-pending and commonly owned patent U.S. patent application number 15/343,064, filed November 3, 2016.
  • the flexible tubing was disconnected from the manifold drier at the Leur lock and attached to a 10 mL syringe.
  • the DNA/gold suspension was mixed well and drawn into the TefzelTM tubing by application of suction by the syringe.
  • the TefzelTM tubing was laid on a flat surface for five minutes while the gold settles out of solution and adheres to the inside of the tubing. Since the percentage of the gold that adheres to the tubing is influenced by the settling time, the settling time was consistent for every sample.
  • Transformation of the Ndel linearized pSG6450 construct by particle bombardment into WT-1185 resulted in a single transgenic colony generated (referred to as strain 6450-1) which was resistant to Zeocin (250 ⁇ g per ml) on agar plates.
  • the colony was resuspended in 200 ⁇ _, PM074 liquid media and aliquots of the cell suspension were spotted for secondary selection onto PM074/zeo250 media and grown in a Conviron incubator set at 25°C with 1% C0 2 on a 16:8 ligh dark diel cycle.
  • Plasmid pSG6530 was constructed to drive the expression of BleR (zeocin resistance marker) by flanking this coding region with paired promoter and terminator sequences from the RPS4 gene, SEQ ID NO: l and SEQ ID NO:7, respectively ( Figure 3A). Plasmid pSG6531 was constructed to drive the expression of BleR (zeocin resistance marker) by flanking this coding region with paired promoter and terminator sequences from the ACP gene, SEQ ID NO: 8 and SEQ ID NO:9, respectively ( Figure 3B).
  • pSG6530 and pSG6531 constructs (unlike the pSG6450 construct of Examples 2-4) each had a transgene that was not a Parachlorella gene or derived from a Parachlorella gene) operably linked to a Parachlorella promoter and a Parachlorella terminator.
  • the transgene was operably linked to a promoter and terminator from the same Parachlorella gene: the RPS4 gene promoter (SEQ ID NO: l) and terminator (SEQ ID NO:7) flanked the BleR gene in pSG6530 and the ACPI promoter (SEQ ID NO: 8) and terminator (SEQ ID NO: 9) flanked the BleR gene in pSG6531.
  • the RPS4 gene promoter SEQ ID NO: l
  • terminator SEQ ID NO:7 flanked the BleR gene in pSG6530
  • the ACPI promoter SEQ ID NO: 8
  • terminator SEQ ID NO: 9 flanked the BleR gene in pSG6531.
  • Results for particle bombardment transformation of pSG6530 and pSG6531 are summarized in Table 2.
  • Table 2 Results for particle bombardment transformation of pSG6530 and pSG6531 are summarized in Table 2.
  • a marked increase in the number of transformants obtained was observed for these constructs relative to the pSG6450 construct described in Examples 2 and 4.
  • Both the AscI-NotI restriction digested fragments or the constructs linearized in the vector backbone were (separately) transformed into WT-1185 cells, and transformants were obtained from transformations with both fragments and linearized vectors.
  • Colony PCR was performed on a subset of these transgenic lines to detect the transgenic construct from the 5' end of the promoter to the 3' end of the terminator for both plasmids to determine whether the entire transformed construct was present.
  • the construct was similar to that of pSGE6530 ( Figure 3A), except that introns 1-5 of the Parachlorella RPS4 gene (Table 4) were introduced into the codon- optimized sequence of the BleR gene (SEQ ID NO:2).
  • Intron 1 of the RPS4 gene (SEQ ID NO:25) was inserted immediately after nucleotide 67 of SEQ ID NO:2; intron 2 of the RPS4 gene (SEQ ID NO:26) was inserted immediately after nucleotide 168 of SEQ ID NO:2; intron 3 of the RPS4 gene (SEQ ID NO:27) was inserted immediately after nucleotide 208 of SEQ ID NO:2; intron 4 of the RPS4 gene (SEQ ID NO:28) was inserted immediately after nucleotide 260 of SEQ ID NO:2; and intron 5 of the RPS4 gene (SEQ ID NO:29) was inserted immediately after nucleotide 337 of SEQ ID NO:2.
  • promoter and terminator control elements were tested that could be used in constructs to drive the expression of transgenes of interest such as, for example, a reporter gene, RNAi constructs, RNA-guided endonucleases, regulators such as transcription factors, or genes involved in metabolic pathways related to lipid biosynthesis or photosynthetic efficiency.
  • Seven promoter/terminator pairs (see Table 1) from endogenous genes were selected based on covering a wide range of gene structures with respect to open reading frame size, intron number, and expression strength. Genes were only considered if the upstream and downstream genes were present on the same strand of DNA, and if deduced 5' and 3' untranslated regions were available from RNA-Seq cDNA assemblies.
  • FIG. 6A provides an overlay of a histogram of fluorescence of wild type cells (black curve) with the fluorescence histogram of transformed cells (gray curve), where the transformed cells give rise to a single peak that is shifted to the right (higher fluorescence) with respect to the wild type cells.
  • This transformant line - demonstrating in flow cytometry a single peak shifted to the right relative to the fluorescence peak of control cells - is said to gray curve shifted to the right with respect to the black wild type curve) and wild-type cells in which the peak is closer to that of the wild type cells but nonetheless is a single peak, shifted to the right (higher fluorescence) with respect to the background fluorescence of wild type cells.
  • the transformant line whose flow cytometry fluorescence profile is shown in Figure 6A is classified here as having "high” fully penetrant expression of GFP, whereas the transformant line whose flow cytometry fluorescence profile is shown in Figure 6B is classified here as having "low” fully penetrant expression of GFP.
  • Plasmids pSG6633 to pSG6640 also included a unique cloning site, a sequence recognized by Pmel (SEQ ID NO:31), between the BleR and GFP expression cassettes. This site allows for convenient insertion of a transgene to generate an expression cassette for any gene of interest driven by any of the validated promoter/terminator elements described in Table 1 and Table 7.
  • BsdR blasticidin resistance gene
  • SEQ ID NO:32 An intronylated version of the blasticidin resistance gene was also synthesized and used to generate a vector similar to pSGE6640, with the exception that the selectable marker is the BsdR intronylated gene (SEQ ID NO:33) rather than the BleR- intronylated gene.
  • intron 1 of the Parachlorella RPS4 gene (SEQ ID NO:25) was inserted immediately after nucleotide 93 of SEQ ID NO:32; intron 2 of the Parachlorella RPS4 gene (SEQ ID NO:26) was inserted immediately after nucleotide 135 of SEQ ID NO:32; intron 3 of the Parachlorella RPS4 gene (SEQ ID NO:27) was inserted immediately after nucleotide 187 of SEQ ID NO:32; intron 4 of the Parachlorella RPS4 gene (SEQ ID NO:28) was inserted immediately after nucleotide 238 of SEQ ID NO:32; and intron 5 of the Parachlorella RPS4 gene (SEQ ID NO:29) was inserted immediately after nucleotide 315 of SEQ ID NO:32.
  • a gene for nourseothricin acetyltransferase (natl) from Streptomyces noursei was codon optimized for Parachlorella (SEQ ID NO:50) and intronylated with the same five introns of the Parachlorella RPS4 gene as were used in Example 8, above, for intronylation of the BsdR gene.
  • the resulting intronylated natl gene (SEQ ID NO:51) was cloned in a vector flanked by the RSP4 promoter at the 5' end of the natl gene and the RPP4 terminator at the 3' end of the natl gene as shown in Figure 8B.
  • the construct linearized with Pvul and, separately, the intronylated natl gene expression cassette (promoter - intronylated natl gene - terminator) as an isolated Asc/Not fragment were transformed into Parachlorella WT-1185 cells by particle bombardment as disclosed in Example 3. Tranformants were selected on plates that included 300 mg/mL nourseothricin.
  • the non-intronylated, codon-optimized natl gene (SEQ ID NO:50) was also cloned in the same vector, also flanked by the RSP4 promoter at the 5' end of the natl gene and the RPP4 terminator at the 3' end of the natl gene ( Figure 8A) and transformed into Parachlorella WT- 1185 cells by particle bombardment.
  • the construct, linearized with Pvul, and, separately, the promoter- Natl gene - terminator cassette (as an isolated Asc/Not fragment) were transformed into Parachlorella WT-1185 cells as disclosed in Example 3, and tranformants were selected on plates that included 300 mg/mL nourseothricin.
  • Transformation of WT-1185 was also achieved by electroporation using a procedure as describe herein.
  • a 100 mL seed culture inoculated to lxlO 6 cells/mL six days before transformation was used to inoculate a 1L culture to 1x10 cells/mL two days before transformation.
  • cells were pelleted by centrifugation at 5000 x g for 20 minutes, washed three times with 0.1 ⁇ filtered 385 mM sorbitol, and resuspended to 5 x 10 9 cells/mL in 385 mM sorbitol.
  • Electroporation of 100 ⁇ _, concentrated cells was performed in 0.2 cm cuvettes in a BioRad Gene Pulser XcellTM under varied conditions.
  • the DNA used for optimization of electroporation was linearized pSG6640 including ble and TurboGFP expression cassettes.
  • the ble cassette included the RPS4 promoter operably linked to the ble gene (SEQ ID NO:2) and the RPS4 terminator.
  • the TurboGFP cassette included the ACP promoter operably linked to the TurboGFP gene (SEQ ID NO: 5) and the ACP terminator.
  • lmL cold sorbitol was added and used to transfer cells into 10 mL PM074. After overnight recovery, cells were concentrated and spread onto 13 cm-diameter PM074 media containing zeocin at 250 mg/L and grown under the conditions listed in the biolistics section.
  • the Cas9 gene of Streptococcus pyogenes with an N-terminal nuclear localization sequence (NLS) and C terminal FLAG tag was codon-optimized for expression in Parachlorella and four different constructs were designed, each of which had a different number of introns of the FBPase gene inserted into the codon-optmized Cas9 gene.
  • the intronylated Cas9 genes in each case were operably linked to the Parachlorella RPS17 promoter (SEQ ID NO: 17) at the 5' end of the gene and the RPS17 terminator (SEQ ID NO: 18) at the 3' end of the gene.
  • the constructs included the codon-optimized Cas9 gene with the first 1, 2, 5, or 9 introns from the endogenous Parachlorella FBPase gene (Figure 10).
  • the first intron was inserted 330 base pairs into the coding sequence. These plasmids were transformed into WE-1185. Table 8. Introns of the Parachlorella FBPase gene.
  • the plasmids, pSGE6707, pSGE6708, pSGE6709, and pSGE6710 also indued the TurboGFP gene (Evrogen, Moscow, Russia) codon-optimized for Parachlorella (SEQ ID NO: 5) driven by the ACP promoter (SEQ ID NO: 8) and terminated by the ACP terminator (SEQ ID NO:9).
  • Parachlorella chloroplastic SRP54 gene was targeted for knockout (cpSRP54, protein-encoding sequence provided as SEQ ID NO: 10).
  • cpSRP54 protein-encoding sequence provided as SEQ ID NO: 10
  • Parachlorella mutants created by classical mutagenesis with mutations in this gene had a pale green phenotype. Since mutation of this target leads to viable transgenic lines with a phenotype that is easily observable by eye, SRP54 was selected as a target for testing Cas9 editing capability in Parachlorella.
  • a zeocin resistance selection cassette (containing a bleomycin resistance "BleR” gene codon-optimized for Parachlorella that included introns from Parachlorella (SEQ ID NO:30) operably linked to the Parachlorella RPS4 promoter (SEQ ID NO: 1) and terminated by the Parachlorella RPS4 terminator (SEQ ID NO:7) was co-transformed into GE- 15699 as a donor or editing DNA for insertion into the target site with a guide RNAs (gRNA) targeting SRP54 (target sequence, including PAM, provided as SEQ ID NO:45) by electroporation as disclosed herein, and the transformed cells were plated onto media containing zeocin.
  • gRNA guide RNAs
  • colony PCR was also done using primers designed to amplify from the SRP54 gene (AE597; SEQ ID N047:) into the BleR selectable marker (either primer AE405 (SEQ ID NO:48) or primer AE406 (SEQ ID NO:49).
  • primers designed to amplify from the SRP54 gene AE597; SEQ ID N047:
  • primers designed to amplify from the SRP54 gene AE597; SEQ ID N047:
  • primer AE405 SEQ ID NO:48
  • primer AE406 SEQ ID NO:49
  • the primers were designed to produce a 1.2kb band from amplification by either primer pair 405/597 or primer pair 406/597 spanning from within the BleR cassette out to the SRP54 gene ( Figure 13).

Landscapes

  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Biotechnology (AREA)
  • Organic Chemistry (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Microbiology (AREA)
  • Cell Biology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Medicinal Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Nutrition Science (AREA)
  • Oil, Petroleum & Natural Gas (AREA)
  • Virology (AREA)
  • Physiology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Abstract

La présente invention concerne de nouveaux éléments régulateurs comprenant des séquences promotrices provenant de microorganismes. L'invention concerne en outre des assemblages d'ADN contenant ces nouveaux éléments régulateurs, et des microorganismes recombinants comprenant ces éléments régulateurs. L'invention concerne également des procédés de modification, de fabrication et d'utilisation des éléments régulateurs. Les éléments régulateurs et les procédés de transformation selon la présente invention sont particulièrement adaptés pour Parachlorella et d'autres microalgues.
PCT/US2016/064275 2015-11-30 2016-11-30 Compositions et procédés pour l'expression de gènes dans des algues Ceased WO2017095960A1 (fr)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US201562261217P 2015-11-30 2015-11-30
US62/261,217 2015-11-30
US201562261777P 2015-12-01 2015-12-01
US62/261,777 2015-12-01
US201662323480P 2016-04-15 2016-04-15
US62/323,480 2016-04-15

Publications (1)

Publication Number Publication Date
WO2017095960A1 true WO2017095960A1 (fr) 2017-06-08

Family

ID=58778075

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2016/064275 Ceased WO2017095960A1 (fr) 2015-11-30 2016-11-30 Compositions et procédés pour l'expression de gènes dans des algues

Country Status (2)

Country Link
US (1) US10041079B2 (fr)
WO (1) WO2017095960A1 (fr)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2018378833A1 (en) 2017-12-08 2020-07-09 Synthetic Genomics, Inc. Improving algal lipid productivity via genetic modification of a TPR domain containing protein
CN113015801A (zh) * 2018-09-20 2021-06-22 赛诺菲 基于内含子的通用克隆方法和组合物
US11718857B2 (en) * 2019-08-09 2023-08-08 Alliance For Sustainable Energy, Llc Broad host range genetic tools for engineering microalgae
IL293882A (en) * 2019-12-17 2022-08-01 Viridos Inc Recombinant algae with high lipid productivity
KR102354855B1 (ko) * 2020-10-05 2022-01-21 한양대학교 산학협력단 클로렐라 불가리스의 형질전환용 벡터 및 이를 이용한 형질전환 방법

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060281107A1 (en) * 2005-05-04 2006-12-14 Exagen Diagnostics, Inc. Acute myelogenous leukemia biomarkers
US20060292572A1 (en) * 2004-01-09 2006-12-28 Stuart Robert O Cell-type-specific patterns of gene expression
US20140142160A1 (en) * 2010-11-12 2014-05-22 The General Hospital Corporation Polycomb-associated Non-Coding RNAs
US20140186919A1 (en) * 2012-12-12 2014-07-03 Feng Zhang Engineering and optimization of improved systems, methods and enzyme compositions for sequence manipulation
WO2016109840A2 (fr) * 2014-12-31 2016-07-07 Synthetic Genomics, Inc. Compositions et procédés pour une efficacité élevée de l'édition du génome in vivo

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060292572A1 (en) * 2004-01-09 2006-12-28 Stuart Robert O Cell-type-specific patterns of gene expression
US20060281107A1 (en) * 2005-05-04 2006-12-14 Exagen Diagnostics, Inc. Acute myelogenous leukemia biomarkers
US20140142160A1 (en) * 2010-11-12 2014-05-22 The General Hospital Corporation Polycomb-associated Non-Coding RNAs
US20140186919A1 (en) * 2012-12-12 2014-07-03 Feng Zhang Engineering and optimization of improved systems, methods and enzyme compositions for sequence manipulation
WO2016109840A2 (fr) * 2014-12-31 2016-07-07 Synthetic Genomics, Inc. Compositions et procédés pour une efficacité élevée de l'édition du génome in vivo

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
DATABASE GenBank [O] 17 May 2007 (2007-05-17), "EST7545 LK04 Laupala kohalensis cDNA clone 1061021797615 5-, mRNA sequence", XP055388540, Database accession no. EH636437 *

Also Published As

Publication number Publication date
US10041079B2 (en) 2018-08-07
US20170152520A1 (en) 2017-06-01

Similar Documents

Publication Publication Date Title
AU2012396282B2 (en) Tetraselmis promoters and terminators for use in eukaryotic cells
US11162106B2 (en) Inducible expression of genes in algae
US10612034B2 (en) Promoters and terminators for use in eukaryotic cells
US10041079B2 (en) Compositions and methods for expressing genes in algae
US20200157558A1 (en) Algal chloroplastic srp54 mutants
US9309523B2 (en) Nannochloropsis kozak consensus sequence
AU2018378833A1 (en) Improving algal lipid productivity via genetic modification of a TPR domain containing protein
AU2012381062B2 (en) Promoters and terminators for use in eukaryotic cells

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16871448

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16871448

Country of ref document: EP

Kind code of ref document: A1